8:00 |
8:45 |
Registration
|
8:45 |
9:00 |
Opening Remarks
|
9:00 |
10:00 |
Plenary talk
Addressing Challenges in Multi-Criteria Optimization Problems for Healthcare Personnel Scheduling Problems
Amy Ellen Mainville Cohn
abstract
Addressing Challenges in Multi-Criteria Optimization Problems for Healthcare Personnel Scheduling Problems
Amy Ellen Mainville Cohn
Healthcare personnel scheduling problems look on the surface to be clear candidates for combinatorial optimization techniques such as integer programming. A key challenge, however, is the lack of a single, well-defined objective function. Instead, there are many competing metrics of importance including service coverage and patient care, provider well-being and satisfaction, and (in the case of medical trainees) educational considerations. In this talk, I will consider a number of personnel scheduling problems from the University of Michigan health system, focusing not only on the models and algorithms needed to solve these problems, but also on the techniques needed to identify solutions that are desirable and acceptable for practical use.
|
|
10:00 |
10:15 |
Break
|
10:15 |
11:45 |
Parallel technical sessions
Nonlinear and Stochastic Optimization Algorithms (MONRO)
|
Constrained Optimization (LEHIGH)
|
Nonconvex optimization - Part (i) (IACOCCA)
|
Optimization and Healthcare (WHITEHOUSE)
|
A Theoretical and Empirical Comparison of Gradient Approximations Methods in Derivative-Free Optimization
Liyuan Cao
abstract
A Theoretical and Empirical Comparison of Gradient Approximations Methods in Derivative-Free Optimization
Liyuan Cao
In our recently published paper, we analyze several methods for approximating gradients of noisy functions using only function values. These methods include finite differences, linear interpolation, Gaussian smoothing and smoothing on a sphere. The methods differ in the number of functions sampled, the choice of the sample points, and the way in which the gradient approximations are derived. For each method, we derive bounds on the number of samples and the sampling radius which guarantee favorable convergence properties for a line search or fixed step size descent method. We present numerical results evaluating the quality of the gradient approximations as well as their performance in conjunction with a line search derivative-free optimization algorithm.
|
New notions of simultaneous diagonalizability of quadratic forms with applications to QCQPs
Alex Wang
abstract
New notions of simultaneous diagonalizability of quadratic forms with applications to QCQPs
Alex Wang
A set of quadratic forms is simultaneously diagonalizable via congruence (SDC) if there exists a basis under which each of the quadratic forms is diagonal. This property appears naturally when analyzing quadratically constrained quadratic programs (QCQPs) and has important implications in this context. This talk will present a new weaker notion of simultaneous diagonalizability which extends the reach of the SDC property. Specifically, we say that a set of quadratic forms is d-restricted SDC (d-RSDC) if it is the restriction of an SDC set in up to d-many additional dimensions. Surprisingly, we will see that almost every pair of symmetric matrices is 1-RSDC.
We accompany our theoretical results with preliminary numerical experiments applying the RSDC property to QCQPs with a single quadratic constraint and additional linear constraints.
Based on joint work with Rujun Jiang.
|
Proximity in Concave Integer Quadratic Programming
Alberto Del Pia
abstract
Proximity in Concave Integer Quadratic Programming
Alberto Del Pia
A classic result by Cook, Gerards, Schrijver, and Tardos provides an upper bound of n∆ on the proximity of optimal solutions of an Integer Linear Programming problem and its standard linear relaxation. In this bound, n is the number of variables and ∆ denotes the maximum of the absolute values of the subdeterminants of the constraint matrix. Hochbaum and Shanthikumar, and Werman and Magagnosc showed that the same upper bound is valid if a more general convex function is minimized, instead of a linear function. No proximity result of this type is known when the objective function is nonconvex. In fact, if we minimize a concave quadratic, no upper bound can be given as a function of n and ∆. Our key observation is that, in this setting, proximity phenomena still occur, but only if we consider also approximate solutions instead of optimal solutions only. In our main result we provide upper bounds on the distance between approximate (resp., optimal) solutions to a Concave Integer Quadratic Programming problem and optimal (resp., approximate) solutions of its continuous relaxation. Our bounds are functions of n, ∆, and a parameter ε that controls the quality of the approximation. Furthermore, we discuss how far from optimal are our proximity bounds. This is joint work with Mingchen Ma.
|
Distributionally Robust Optimization Approaches for a Mobile Facility Routing and Scheduling Problem
Karmel Shehadeh
abstract
Distributionally Robust Optimization Approaches for a Mobile Facility Routing and Scheduling Problem
Karmel Shehadeh
We study a mobile facility (MF) routing and scheduling problem in which probability distributions of the time-dependent demand for MF services is unknown. To address distributional ambiguity, we propose and analyze two distributionally robust MF routing and scheduling (DMFRS) models that seek to minimize the fixed cost of establishing the MF fleet and maximum expected transportation and unmet demand costs over all possible demand distributions residing within an ambiguity set. In the first model, we use a moment-based ambiguity set. In the second model, we use an ambiguity set that incorporates all distributions within a 1-Wasserstein distance from a reference distribution. To solve DMFRS models, we propose a decomposition-based algorithm and derive lower bound and two–families of symmetry-breaking inequalities to strengthen the master problem and speed up convergence. Finally, we present extensive computational experiments comparing the operational and computational performance of the proposed distributionally robust models and a stochastic programming model and drive insights into DMFRS.
|
Efficient algorithms for some variants of the extended trust-region subproblems
Maziar Salahi
abstract
Efficient algorithms for some variants of the extended trust-region subproblems
Maziar Salahi
In this talk, we discuss variants of the extended trust-region subproblems (eTRS), that are extensions of the well-known TRS. Even when we add one linear inequality constraint to TRS, the celebrated strong duality result may fail for eTRS. Thus several conditions are proposed in the literature implying strong duality and exact SDP relaxation. For example, exact SOCP/SDP relaxation is given for the case when extra linear constraints do not intersect inside the ball. We proposed the first efficient algorithm for eTRS when it has one linear constraint using the generalized eigenvalue problem which was also extended to two linear constraints. Taking advantage of the hard case and local nonglobal minimum of TRS , we were also able to develop an efficient algorithm for the case when linear constraints do not intersect side the ball, overperforming the SOCP/SDP relaxation.
|
Second-Order Conic and Polyhedral Approximations of the Exponential Cone: Application to Mixed-Integer Exponential Conic Programs
Qing Ye
abstract
Second-Order Conic and Polyhedral Approximations of the Exponential Cone: Application to Mixed-Integer Exponential Conic Programs
Qing Ye
Exponents and logarithms exist in many important applications such as logistic regression, maximum likelihood, relative entropy and so on. Since the exponential cone can be viewed as the epigraph of perspective of the natural exponential function or the hypograph of perspective of the natural logarithm function, many mixed-integer convex programs involving exponential or logarithm functions can be recast as mixed-integer exponential conic programs (MIECPs). Recently, solver MOSEK is able to solve large-scale continuous exponential conic programs (ECPs). However, unlike mixed-integer linear programs (MILPs) and mixed-integer second-order conic programs (MISOCPs), MIECPs are far beyond development. To harvest the past efforts on MILPs and MISOCPs, this paper presents second-order conic (SOC) and polyhedral approximation schemes for the exponential cone with application to MIECPs. To do so, we first extend and generalize existing SOC approximation approaches in the extended space, propose new scaling and shifting methods, prove approximation accuracies, and derive lower bounds of approximations. We then study the polyhedral outer approximation of the exponential cones in the original space using gradient inequalities, show its approximation accuracy, and derive a lower bound of the approximation. When implementing SOC approximations, we suggest learning the approximation pattern by testing smaller cases and then applying to the large-scale cases; and for the polyhedral approximation, we suggest using the cutting plane method when solving the continuous ECPs and branch and cut method for MIECPs. Our numerical study shows that the proposed scaling, shifting, and polyhedral outer approximation methods outperform solver MOSEK for both continuous ECPs and MIECPs and can achieve up to 20 times speed-ups compared to solver MOSEK when solving MIECPs.
|
Two-halfspace closure
Amitabh Basu
abstract
Two-halfspace closure
Amitabh Basu
We define a new cutting plane closure for pure integer programs called the two-halfspace closure. It is a natural generalization of the well-known Chvátal-Gomory closure. We prove that the two-halfspace closure is polyhedral. We also study the corresponding 2-halfpsace rank of any valid inequality and show that it is at most the split rank of the inequality. Moreover, while the split rank can be strictly larger than the two-halfspace rank, the split rank is at most twice the two-halfspace rank. A key step of our analysis shows that the split closure of a rational polyhedron can be obtained by considering the split closures of all k-dimensional (rational) projections of the polyhedron, for any fixed k≥2. This result may be of independent interest.
|
Distributionally Robust Home Service Routing and Appointment Scheduling with Random Travel and Service Times
Man Yiu Tsang
abstract
Distributionally Robust Home Service Routing and Appointment Scheduling with Random Travel and Service Times
Man Yiu Tsang
We study an integrated routing and appointment scheduling problem arising from home service practice. Specifically, given a set of customers within a service region that an operator needs to serve, we seek to find the operator's route and time schedule. The travel time between customers and the service time of each customer is random. The probability distributions of these random parameters are unknown. To address distributional ambiguity, we propose and analyze two distributionally robust home service routing and scheduling models that search for optimal routing and scheduling decisions to minimize the worst-case expectation of operational costs over all distributions residing within an ambiguity set. In the first model, we use a moment-based ambiguity set. In the second model, we use a Wasserstein ambiguity set. We derive equivalent mixed-integer linear programming reformulations of both models. In an extensive numerical experiment, we investigate the proposed models computational and operational performance and derive insights into DHRAS.
|
Generalized Cyclic Stochastic Approximation and its Application in Multi-agent Systems
Jiahao Shi
abstract
Generalized Cyclic Stochastic Approximation and its Application in Multi-agent Systems
Jiahao Shi
Stochastic approximation (SA) is a powerful class of iterative algorithms for minimizing a loss function, L(θ), when only noisy observations of L(θ) or its gradient are available. In this talk, we will present a generalized cyclic SA (GCSA) algorithm, a variant of SA procedures, where θ is divided into multiple subvectors that are updated one at a time. The subvector to update may be selected according to a random variable or according to a predetermined pattern. The convergence of GCSA, asymptotic normality of GCSA, and efficiency of GCSA relative to its non-cyclic counterpart are investigated. Finally, we apply the GCSA algorithm to a multi-agent stochastic optimization problem
|
On the rescaling and projection algorithm
Negar Soheili Azad
abstract
On the rescaling and projection algorithm
Negar Soheili Azad
The projection and rescaling algorithm is a recently developed method that combines a basic procedure involving only low-cost operations with a periodic rescaling step. We propose a simple projection and rescaling algorithm that finds the "most interior" solutions to the pair of primal-dual polyhedral feasibility problems, an extension of the original projection and rescaling algorithm that finds a solution to one of these problems when it’s feasible. We also present extensive numerical experiments on synthetic problem instances with varied levels of conditioning for both polyhedral and second-order cone feasibility problems. Our computational experiments provide promising evidence for the effectiveness of the projection and rescaling algorithm.
|
Convexification of the Lennard-Jones Potential
Anatoliy Kuznetsov
abstract
Convexification of the Lennard-Jones Potential
Anatoliy Kuznetsov
The Lennard-Jones potential is a semi-empirical relationship describing the energy between two particles due to van der Waals interactions. It has a wide range of applications in computational chemistry, including modeling nonbonded interactions in force fields used for protein folding calculations. We apply recently developed convexification techniques to derive the convex envelope of the Lennard-Jones potential for several geometries. Compared to convex relaxations generated using standard factorable relaxation techniques, the convex envelope introduces a significantly smaller relaxation gap.
|
A Voice base Nonsmooth Nonconvex optimization model for Parkinson Disease Severity Estimation.
Habib Ghaffari Hadigheh
abstract
A Voice base Nonsmooth Nonconvex optimization model for Parkinson Disease Severity Estimation.
Habib Ghaffari Hadigheh
Parkinson's Disease (PD) is a common progressive neurodegenerative disorder characterized by several motor and non-motor features. It affects 1% of people older than 60, up to 4% of those over 80, and nearly one-half a million people over 50 in the U.S.. The healthcare-related cost due to PD is increasing as the population's longevity increases in developed countries, showing the importance of identifying patients with PD symptoms in the early stages. The most widely used tool in determining the symptoms and severity of PD is the Unified Parkinson's disease scale (UPDRS) which requires clinical expertise and experience. Many conventional ML models were introduced based on non-invasive measurements such as acoustic features using the patient's speech sample and UPDRS; However, most models identify the person as either a patient with PD or healthy controls (HS). There are challenges to generalizing the results of evaluations due to the limited number of data records that were collected from only one clinic in most cases. Another problem is that the UPDRS is scored based on highly qualitative data, which affects the reliability of PD's prediction in its early stages.
Having some experimental data may help one to find a reasonable regression function that relates the acoustic features of the patient's voice to the motor scores that an expert assigns to the case. Uncertainty theory initiated in 2007 is a mathematical framework devised to formalize human reasoning. Here, we use the facts of this theory to find an appropriate regression function to avoid overfitting commonly observed with interpolation methods. The problem of finding such a function is modeled as a non-smooth non-convex problem. This high-dimensional problem is unconstrained but computationally expensive. To avoid this cost, we devise a linearly constrained quadratic convex relaxation of the original model. We will discuss the computational complexity of the two models and the reasons we expect the relaxation to perform well despite the lower computational cost.
|
|
11:45 |
12:00 |
Break
|
12:00 |
13:00 |
Plenary talk
Inexact high-order proximal-point methods with auxiliary search procedure
Yurii Nesterov
abstract
Inexact high-order proximal-point methods with auxiliary search procedure
Yurii Nesterov
In this paper, we present the further development of Bi-Level Unconstrained Minimization by a new pth-order proximal-point method with the convergence rate O(1/k^((1+3p)/2)), where k is the iteration counter. In this method, we replace the auxiliary line-search procedure by a segment search. This allows bounding its complexity by a logarithm of the desired accuracy. Each step in this search needs an approximate computation of a high-order proximal-point operator. Under assumption on the boundedness of the (p+1)th derivative of the objective function, this can be done by one step of the pth order augmented tensor method. In this way, for p = 2, we get a new second-order method with the rate of convergence O(1/k^(7/2)) and logarithmic complexity of the auxiliary search at each iteration. Another possibility is to compute the proximal-point operator by a lower-order minimization method. As an example, for p = 3, we consider the upper-level process convergent as O(1/k^5). Assuming boundedness of the fourth derivative, an appropriate approximation of the proximal-point operator can be computed by a second-order method in a logarithmic number of iterations. This combination gives a second-order scheme with much better complexity than the existing theoretical limits.
|
|
13:00 |
14:00 |
Lunch
|
14:00 |
15:30 |
Parallel technical sessions
Methods for Large-Scale, Nonlinear and Stochastic Optimization (MONRO)
|
Quantum Optimization I (LEHIGH)
|
Nonconvex optimization, Part (ii) (IACOCCA)
|
OR in Healthcare (WHITEHOUSE)
|
SQP for Nonlinear Equality Constrained Stochastic Optimization
Baoyu Zhou
abstract
SQP for Nonlinear Equality Constrained Stochastic Optimization
Baoyu Zhou
Sequential quadratic optimization algorithms are proposed for solving smooth nonlinear optimization problems with equality constraints. The main focus is an algorithm proposed for the case when the constraint functions are deterministic, and constraint function and derivative values can be computed explicitly, but the objective function is stochastic. It is assumed in this setting that it is intractable to compute objective function and derivative values explicitly, although one can compute stochastic function and gradient estimates. We propose an adaptive stepsize selection scheme, since the objective is stochastic, for which it is assumed that line searches would be intractable. Under reasonable assumptions, convergence in expectation from remote starting points is proved for the proposed algorithm. The results of numerical experiments demonstrate the practical performance of our proposed techniques.
|
An Inexact-Feasible Interior Point Method for Linear Optimization with High Adaptability to Quantum Computers
Mohammadhossein Mohammadisiahroudi
abstract
An Inexact-Feasible Interior Point Method for Linear Optimization with High Adaptability to Quantum Computers
Mohammadhossein Mohammadisiahroudi
Quantum computing can speed up Interior Point Methods (IPMs) by using Quantum Linear System Algorithms (QLSAs) to solve the Newton systems. Since QLSAs inherently produce inexact solutions, an Inexact-Feasible IPM (IF-IPM) is proposed for linear optimization problems using a novel system that produces inexact but feasible steps. We also discuss how QLSAs can be used efficiently in an Iterative Refinement scheme to find an exact solution without excessive time of QLSAs. The results show that IF-QIPM has better time complexity than other quantum and classical IPMs w.r.t the dimension. The IF-IPM is implemented with both classical and quantum solvers to investigate its efficiency numerically.
|
Sum-of-squares lower bounds for random combinatorial problems
Dmitriy Kunisky
abstract
Sum-of-squares lower bounds for random combinatorial problems
Dmitriy Kunisky
I will present results showing that sum-of-squares semidefinite programming relaxations cannot certify strong bounds on random combinatorial problems in high dimension, focusing in particular on the Sherrington-Kirkpatrick model of statistical physics, the problem of optimizing a Gaussian quadratic form over the hypercube. I will describe how the proofs of these results are driven by understanding the geometric structure of high-dimensional sum-of-squares feasible points via the Gram matrix factorizations of associated pseudomoment matrices. We will see how these considerations suggest higher-degree analogs of the notions of "vector cuts" and "vector colorings" often offered as interpretations of the simpler Goemans-Williamson and Lovasz theta function relaxations of graph problems. The talk is based on joint work with Afonso Bandeira.
|
Solution Methods for Integrated Surgery Scheduling and Inventory Problem
Amogh Bhosekar
abstract
Solution Methods for Integrated Surgery Scheduling and Inventory Problem
Amogh Bhosekar
Considering the availability and cost of surgical instruments while creating operating rooms (ORs) schedule provides an opportunity to reduce the cost of healthcare. We propose a mixed integer programming (MIP) model for the integrated problem to determine the schedule of surgeries and assignments of instruments to surgeries over a week. The objective of the model is to minimize the cost of opening the ORs, overtime, idle-time, and the cost of using/borrowing instruments to satisfy the demand. To generate solutions in reasonable time, a Lagrangean decomposition-based heuristic is proposed in which the integrated problem is separated into a surgery scheduling problem and an instrument assignment problem. The surgery scheduling model has a special structure that has no resource sharing among surgeries assigned different (day,OR) tuples in the planning horizon. This renders the sequencing decisions for each (day,OR) assignment independent once a patient to (day,OR) schedule is given. A partitioning procedure based on Logic Based Benders Decomposition is used to solve the scheduling problem and results in MIP sequence optimization problem one for each (day,OR). The results of our experiments indicate that integrating decisions lowers the total system costs.
|
Diagonalized Hessian Estimates for Convex and Nonconvex Optimization and Comparison with Natural Gradient Method
Shiqing Sun
abstract
Diagonalized Hessian Estimates for Convex and Nonconvex Optimization and Comparison with Natural Gradient Method
Shiqing Sun
Adaptive gradient algorithms, including AdaGrad, Adam and AMSGrad, are motivated by natural gradient methods, which are designed to accelerate convergence relative to stochastic gradient descent (SGD) by utilizing Fisher information matrices (FIM) to rescale stochastic gradients. However, considering the difference between Hessian matrices and FIM, along with the fact that the loss curvature is indeed characterized by Hessian matrices, there is a need for an algorithm that estimates the Hessian information directly. We describe an algorithm, called diagSG, which approximates the diagonalized Hessian matrix directly via simultaneous perturbation stochastic approximation (SPSA). In this presentation, we discuss the convergence of diagSG in both convex and nonconvex optimization. In addition, we present a theoretical comparison between diagSG and some natural gradient methods, like AdaGrad, relative to their efficiency in both the finite iteration and asymptotic sense. Numeral experiments also reveal its advantage in fast convergence in deep neural network problems with datasets like CIFAR-100.
|
Quantum Interior Point Methods for Semidefinite Optimization
Brandon Augustino
abstract
Quantum Interior Point Methods for Semidefinite Optimization
Brandon Augustino
We present two quantum interior point methods for semidefinite optimization problems, building on recent advances in quantum linear system solvers. The first scheme, more similar to a classical solver, computes an inexact search direction and is not guaranteed to stay feasible; the second scheme uses a nullspace representation of the Newton linear system to ensure feasibility even with inexact search directions. This second scheme would be impractical in the classical world, but it is well-suited for a hybrid quantum-classical setting. We show that both schemes converge to an optimal solution of the semidefinite optimization problem under standard assumptions. By comparing the theoretical performance of classical and quantum interior point methods with respect to various input parameters, we show that our second scheme obtains a speedup over classical algorithms in terms of the size of the problem, but has worse dependence on other numerical parameters.
|
Sparse regression: decompositions, convexifications and algorithms
Andres Gomez
abstract
Sparse regression: decompositions, convexifications and algorithms
Andres Gomez
We study sparse regression problems, this is, regression problems where a small subset of the possible predictor variables should be used. Sparse regression problem can be easily modeled with indicator variables controlling which variables can be non-zero, although natural big-M formulations often result in poor relaxations. We propose novel convexifications for the epigraphs of a special class of quadratic functions with indicators with arbitrary constraints on the indicator variables.
|
Dynamic Tuberculosis Screening for Healthcare Employees
Mahsa Kiani
abstract
Dynamic Tuberculosis Screening for Healthcare Employees
Mahsa Kiani
Regular tuberculosis (TB) screening is required for healthcare employees since they can come into contact with infected patients. TB is a serious, contagious, and potentially deadly disease. Early detection of the disease, even when it is in latent form, prevents the spread of the disease and helps with treatment. Currently, there are two types of TB diagnostic tests on the market: skin test and blood test. The cost of the blood test is much higher than the skin test. However, the possibility of getting a false positive or false negative result in a skin test is higher especially for persons with specific characteristics, which can increase costs. In this study, we categorize healthcare employees into multiple risk groups based on the department they work in, the specific job they do, and their birth country. We create a Markov decision process (MDP) model to decide which TB test should be utilized for each employee group to minimize the total costs related to testing, undetected infections, employees' time lost. Due to the size of the problem, we use approximate dynamic programming (ADP) to obtain a near-optimal solution. By analyzing this solution to the ADP, we specify not only the type of the tests that should be used but also the frequency with which each test should be administered. Based on this analysis, we propose a simple policy that can be used by healthcare facilities since such facilities may not have the expertise or the resources to develop and solve sophisticated optimization models.
|
Complexity of Projected Newton Methods for Bound-constrained Optimization
Yue Xie
abstract
Complexity of Projected Newton Methods for Bound-constrained Optimization
Yue Xie
Deriving complexity guarantees for nonconvex optimization problems are driven by long standing theoretical interests and by their relevance to machine learning and data science. This talk discusses complexity of algorithms for bound-constrained nonconvex optimization. We observe from the past work that pursuit of the state-of-art complexity guarantees can compromise the practicality of an algorithm. Therefore, we propose two practical projected Newton types of methods with complexity guarantees matching the best known. The first method is a scaled variant of Bertsekas' two-metric projection method, which can be shown to output an epsilon approximate first-order point in O(epsilon^{-2}) iterations. The second is a projected Newton-Conjugate Gradient method, which locates an epsilon approximate second-order point with high probability in O(epsilon^{-3/2}) iterations. Preliminary numerical experiments on Nonnegative Matrix Factorization indicate practicality of the latter algorithm.
|
Characterization and Mitigation of Errors in Quantum Computing via Consistent Bayesian
Muqing Zheng
abstract
Characterization and Mitigation of Errors in Quantum Computing via Consistent Bayesian
Muqing Zheng
Various noise models have been developed in quantum computing study to describe the propagation and effect of the noise which is caused by imperfect implementation of hardware. Identifying parameters such as gate and readout error rates are critical to these models. We use a Bayesian inference approach to identity posterior distributions of these parameters, such that they can be characterized more elaborately. By characterising the device errors in this way, we can further improve the accuracy of quantum error mitigation. Experiments conducted on IBM's quantum computing devices suggest that our approach provides better error mitigation performance than existing techniques used by the vendor. Also, our approach outperforms the standard Bayesian inference method in such experiments.
|
Rank Pump: A Feasibility Heuristic For Polynomial Optimization
Chen Chen
abstract
Rank Pump: A Feasibility Heuristic For Polynomial Optimization
Chen Chen
The feasibility pump is a well-known primal heuristic for integer programming that involves two alternating
sequences of projections. The original pump was designed for binary problems, and found such projections using linear programming and simple rounding. Unfortunately, the elegance of the pump may be lost in other settings. For instance, a natural extension of the pump to nonconvex MINLP involves NP-hard projection problems. We present our adaptation of the feasibility pump to polynomial optimization, called the rank pump. The rank pump has polynomial-time iterations, as all its projection problems can be
solved in polynomial time.
|
Optimal Nurse Allocation for the Surgical System: A Tandem Network with Flexible Servers
Tong Zhang
abstract
Optimal Nurse Allocation for the Surgical System: A Tandem Network with Flexible Servers
Tong Zhang
Nursing shortage is a major challenge faced by the US healthcare system. Cross-training of highly skilled nursing staff has the potential to increase the nursing capacity at low cost. However, strategies to effectively allocate cross-trained nurses to tasks should be explored. In this paper, a tandem queueing system with flexible servers is considered to model a surgical system. When a patient needs surgery, they must go through three stages: pre-operative care, surgery, and post-operative care. Patients may move through a pre-operative area, the operating room (OR), and a post-anesthesia care unit (PACU) during these stages, or they may recover in the OR due to lack of PACU beds or staff. We consider different sets of operational rules regarding how patients flow through these locations. Considering holding costs and unit rewards for each patient departure, we model a Markov decision process to optimize the nurse allocation decisions under each set of rules. We investigate policies that maximize long-run average rewards and examine conditions under which nurse cross-training is beneficial.
|
|
15:30 |
15:45 |
Break
|
15:45 |
16:45 |
Plenary talk
Introduction to Quantum Annealing
Catherine McGeoch
abstract
Introduction to Quantum Annealing
Catherine McGeoch
Quantum annealing is a novel computational paradigm that runs on purpose-built computing platforms that exploit quantum effects to heuristically solve (NP-hard) problems in combinatorial optimization. The talk will present an overview of how quantum annealing works, and how to use it. It will survey features of D-Wave’s Advantage processor, which contains 5000+ qubits and was launched in late 2020, and of the software stack above it.
|
|
16:45 |
18:15 |
Parallel technical sessions
Advances in Large-Scale Nonlinear Optimization (MONRO)
|
Quantum Optimization II (LEHIGH)
|
Advances in Nonconvex Optimization (IACOCCA)
|
Optimization with Applications in Healthcare (WHITEHOUSE)
|
Average Curvature FISTA for Nonconvex Smooth Composite Optimization Problems
Jiaming Liang
abstract
Average Curvature FISTA for Nonconvex Smooth Composite Optimization Problems
Jiaming Liang
A previous authors’ paper introduces an accelerated composite gradient (ACG) variant, namely AC-ACG, for solving nonconvex smooth composite optimization (N-SCO) problems. In contrast to other ACG variants, AC-ACG estimates the local upper curvature of the N-SCO problem by using the average of the observed upper-Lipschitz curvatures obtained during the previous iterations, and uses this estimation and two composite resolvent evaluations to compute the next iterate. This paper presents an alternative FISTA-type ACG variant, namely AC-FISTA, which has the following additional features: i) it performs an average of one composite resolvent evaluation per iteration; and ii) it estimates the local upper curvature by using the average of the previously observed upper (instead of upper-Lipschitz) curvatures. These two properties acting together yield a practical AC-FISTA variant which substantially outperforms earlier ACG variants, including the AC-ACG variants discussed in the aforementioned authors’ paper.
|
Quantum-inspired formulations for the max k-cut problem
Ramin Fakhimi
abstract
Quantum-inspired formulations for the max k-cut problem
Ramin Fakhimi
Solving combinatorial optimization problems on quantum computers has attracted many researchers since the emergence of quantum computing. The max k-cut problem is a challenging combinatorial optimization problem with multiple well-known optimization formulations.
However, its mixed-integer linear optimization (MILO) formulations and mixed-integer semidefinite optimization formulation are all time-consuming to be solved. Motivated by recent progress in classic and quantum solvers, we study a binary quadratic optimization (BQO) formulation and two quadratic unconstrained binary optimization formulations. First, we compare the BQO formulation with the MILO formulations. Further, we propose an algorithm that converts any feasible fractional solution of the BQO formulation to a feasible binary solution whose objective value is at least as good as that of the fractional solution. Finally, we find tight penalty coefficients for the proposed quadratic unconstrained binary optimization formulations.
|
A Riemannian Block Coordinate Descent Method for Computing the Projection Robust Wasserstein Distance
Minhui Huang
abstract
A Riemannian Block Coordinate Descent Method for Computing the Projection Robust Wasserstein Distance
Minhui Huang
The Wasserstein distance has become increasingly important in machine learning and deep learning. Despite its popularity, the Wasserstein distance is hard to approximate because of the curse of dimensionality. A recently proposed approach to alleviate the curse of dimensionality is to project the sampled data from the high dimensional probability distribution onto a lower-dimensional subspace, and then compute the Wasserstein distance between the projected data. However, this approach requires to solve a max-min problem over the Stiefel manifold, which is very challenging in practice. The only existing work that solves this problem directly is the RGAS (Riemannian Gradient Ascent with Sinkhorn Iteration) algorithm, which requires to solve an entropy-regularized optimal transport problem in each iteration, and thus can be costly for large-scale problems. In this paper, we propose a Riemannian block coordinate descent (RBCD) method to solve this problem, which is based on a novel reformulation of the regularized max-min problem over the Stiefel manifold. We show that the complexity of arithmetic operations for RBCD to obtain an $\epsilon$-stationary point is $O(\epsilon^{-3})$. This significantly improves the corresponding complexity of RGAS, which is $O(\epsilon^{-12})$. Moreover, our RBCD has very low per-iteration complexity, and hence is suitable for large-scale problems. Numerical results on both synthetic and real datasets demonstrate that our method is more efficient than existing methods, especially when the number of sampled data is very large.
|
Data-Driven Distributionally Robust Surgery Planning in Flexible Operating Rooms Over a Wasserstein Ambiguity
Karmel Shehadeh
abstract
Data-Driven Distributionally Robust Surgery Planning in Flexible Operating Rooms Over a Wasserstein Ambiguity
Karmel Shehadeh
We study elective surgery planning in flexible operating rooms where emergency patients are accommodated in the existing elective surgery schedule. Probability distributions of surgeries durations are unknown, and only a small set of historical realizations is available. To address distributional ambiguity, we first construct an ambiguity set that encompasses all possible distributions of surgery duration within a Wasserstein distance from the empirical distribution. We then define a data-driven distributionally robust surgery assignment (DSA) problem, which seeks to determine optimal elective surgery assigning decisions to available surgical blocks in multiple ORs to minimize the sum of patient-related costs and the expectation of OR overtime and idle time costs over all distributions residing in the ambiguity set. Using DSA structural properties, we derive an equivalent mixed-integer linear programming (MILP) reformulation of the min-max DSA model, that can be solved efficiently using off-the-shelf optimization software. Using real-world surgery data, we conduct extensive numerical experiments comparing the operational and computational performance of our approach with two state-of-the-art approaches.
|
Implicit Regularization of Sub-Gradient Method in Robust Matrix Recovery: Don't be Afraid of Outliers
Jianhao Ma
abstract
Implicit Regularization of Sub-Gradient Method in Robust Matrix Recovery: Don't be Afraid of Outliers
Jianhao Ma
It is well-known that simple short-sighted algorithms, such as gradient descent, generalize well in the over-parameterized learning tasks, due to their implicit regularization. However, it is unknown whether the implicit regularization of these algorithms can be extended to robust learning tasks, where a subset of samples may be grossly corrupted with noise. In this work, we provide a positive answer to this question in the context of robust matrix recovery problem. In particular, we consider the problem of recovering a low-rank matrix from a number of linear measurements, where a subset of measurements are corrupted with large noise. We show that a simple sub-gradient method converges to the true low-rank solution efficiently, when it is applied to the over-parameterized l1-loss function without any explicit regularization or rank constraint. Moreover, by building upon a new notion of restricted isometry property, called sign-RIP, we prove the robustness of the sub-gradient method against outliers in the over-parameterized regime. In particular, we show that, with Gaussian measurements, the sub-gradient method is guaranteed to converge to the true low-rank solution, even if an arbitrary fraction of the measurements are grossly corrupted with noise.
|
Improving QAOA with Warm-Start Initializations and Custom Mixers
Reuben Tate
abstract
Improving QAOA with Warm-Start Initializations and Custom Mixers
Reuben Tate
In this talk, we consider bridging classical optimization techniques with quantum algorithms. We propose using classical "warm-starts" (obtained via solutions to low-rank semidefinite programming relaxations) in order to initialize the starting state of the Quantum Approximate Optimization Algorithm (QAOA) in the context of the MAX-CUT problem. In addition to changing the initial state, we also consider changing the mixing Hamiltonian in a way that allows us analyze QAOA through the lens of quantum adiabatic algorithms. Our experiments suggest that this modified version of QAOA is robust against quantum noise and is able to yield higher quality cuts (compared to standard QAOA or the classical Goemans-Williamson algorithm) even with low-circuit depth and limited training time for most instances. We provide simulation and theoretical results on the performance of the proposed framework. This is based on joint work with Bryan Gard, Swati Gupta and Greg Mohler.
|
A Smoothing Scheme for Nonconvex-Concave Min-Max Problems
Weiwei Kong
abstract
A Smoothing Scheme for Nonconvex-Concave Min-Max Problems
Weiwei Kong
This talk presents a smoothing scheme for obtaining an approximate stationary point of a composite nonconvex-concave min-max problem by applying a well-known algorithm to the composite smooth approximation of the original problem. More specifically, approximate stationary points of the original problem are obtained by applying (to its composite smooth approximation) an accelerated inexact proximal point method presented in a previous paper by the authors. Iteration complexity bounds for the smoothing scheme are also given for two notions of approximate stationarity. Finally, numerical results are given to demonstrate the efficiency of the scheme.
|
Simulation-Based Optimization of Dynamic Appointment Scheduling Problem with Patient Unpunctuality and Provider Lateness
Secil Sozuer
abstract
Simulation-Based Optimization of Dynamic Appointment Scheduling Problem with Patient Unpunctuality and Provider Lateness
Secil Sozuer
Healthcare providers are under growing pressure to improve efficiency due to an aging population and increasing expenditures. This research is designed to address a particular healthcare scheduling problem, dynamic and stochastic appointment scheduling with patient unpunctuality and provider lateness. We consider that the stochasticity is coming from uncertain patient requests, uncertain service duration, patient unpunctuality and provider lateness. The aim is to find the optimal schedule start time for the patients in order to minimize the expected cost incurred from patient waiting time, server idle time, and server overtime. By conducting perturbation analysis for the gradient estimation, a Sample Average Approximation (SAA), a Robust Stochastic Approximation (Robust SA) and adaptive Stochastic Approximation (ad-SA) algorithms are used. The structural properties of the sample path cost function and expected cost function are studied. Numerical experiments show the computational advantages of using perturbation based gradient information over CPLEX and interior point methods for our problem.
|
Inexact Proximal Gradient Methods
Daniel Robinson
abstract
Inexact Proximal Gradient Methods
Daniel Robinson
I will discuss our recent work on developing INEXACT proximal-gradient methods. In particular, we design termination conditions that indicate how accurately each proximal-gradient subproblem must be solved. Such termination conditions are crucial when the proximal-gradient subproblem does not have a closed form solution, as is the case for important regularizers such as the overlapping group regularizer and the latent group regularizer. Unlike previous work in this area, which has focused on developing complexity results by choosing the termination conditions IN ADVANCE, we base our termination conditions on information local to the current iterate. As such, our conditions are more practical and lead to different types of complexity results, which will be discussed.
|
Qubo Reformulations of Combinatorial Optimization Problems
Rodolfo Alexander Quintero Ospina
abstract
Qubo Reformulations of Combinatorial Optimization Problems
Rodolfo Alexander Quintero Ospina
Adiabatic quantum computers have shown to outperform classical computers in solving some particular instances of NP-hard problems, like the Graph partitioning problem. To do this, a Quadratic Unconstrained Binary Optimization (QUBO) formulation is needed. Given that many combinatorial problems, in particular, NP-hard problems, can be formulated as QUBO instances, the interest in getting implementable QUBO formulations of such problems has grown in recent years. In this presentation, we will focus on the QUBO formulations of the independent set and the maximum k-colorable subgraph problems, and some possible limitations to implement them in quantum computers.
|
Smooth nonconvex-nonconcave min-max optimization problems with small inner maximization constraint set
Meisam Razaviyayn
abstract
Smooth nonconvex-nonconcave min-max optimization problems with small inner maximization constraint set
Meisam Razaviyayn
Recent applications that arise in machine learning have surged significant interest in solving min-max optimization problems. This problem has been extensively studied in the convex-concave regime for which a global optimal solution solution can be computed efficiently. However, in the nonconvex-nonconcave (smooth) regime, most problems cannot be solved to any reasonable notion of stationarity. In this work, we identify a class of smooth nonconvex-nonconcave min-max problems that can be solved efficiently up to first-order stationarity of its Moreau envelope. In particular, we propose an efficient algorithm for finding (first-order) stationary solutions to nonconvex-nonconcave min-max problems when the radius of the constraint set for the inner maximization problem is comparable to the desired accuracy in the outer problem. Our results are the first of its kind that find stationary solutions to nonconvex-nonconcave min-max problems without assuming any restriction on the objective function (other than standard smoothness assumptions). We also discuss the validity of our assumptions and evaluate the performance of our algorithm on the problem of training robust neural networks against adversarial attacks.
|
Supply Side Flexibility in Revenue Management: An Application in Medical Wire and Device Manufacturing
Cigdem Gurgur
abstract
Supply Side Flexibility in Revenue Management: An Application in Medical Wire and Device Manufacturing
Cigdem Gurgur
Revenue management is a concept aimed to maximize capacity utilization and through that maximize revenues. While the main developments in revenue management have taken place in the fields of service industries, relatively little research has been done for the manufacturing sector. We consider revenue management for a medical wire and device manufacturing company with make-to-order mode of operation, complex alloy market, broad-product mix and limited inventory capacity. Orders with different profit margins arrive stochastically and the company has to decide which orders to accept and which orders to reject. In the “Make-to-Order,” the allocation of the finite capacity to certain orders is complicated because the processes that are used to make a product are consistent, but vary in the manner in which product goes through the process. Furthermore, the number of iterative loops in which a particular order may go through the same process varies. We model the problem with a Markov decision process and propose a value iteration heuristic. In numerical tests we show the potential benefit of using revenue management instead of a first-come-first-serve policy and assess the performance of the heuristic procedure.
|
|
18:30 |
20:30 |
Student Social
|