
Showing papers in "arXiv: Optimization and Control in 2021"


Posted Content
TL;DR: In this article, a non-autonomous nonlinear deterministic model is proposed to study the control of COVID-19 to unravel the cost and economic health outcomes for the autonomous nonlinear model proposed for the Kingdom of Saudi Arabia.
Abstract: Cost-effectiveness analysis is a method for determining both the cost and the economic health outcomes of one or more control interventions. In this work, we formulate a non-autonomous nonlinear deterministic model to study the control of COVID-19 and to unravel the cost and economic health outcomes for the autonomous nonlinear model proposed for the Kingdom of Saudi Arabia. The optimal control model captures four time-dependent control functions: $u_1$, practising physical or social distancing protocols; $u_2$, practising personal hygiene by cleaning contaminated surfaces with alcohol-based detergents; $u_3$, practising proper safety measures by exposed, asymptomatic and symptomatic infected individuals; and $u_4$, fumigating schools at all levels of education, sports facilities, commercial areas and religious worship centres. We prove the existence of optimal controls for the proposed model. The optimality system associated with the non-autonomous epidemic model is derived using Pontryagin's maximum principle. We perform numerical simulations to carry out an extensive cost-effectiveness analysis of fourteen optimal control strategies. Comparing the control strategies, we find that Strategy 1 (practising physical or social distancing protocols) is the most cost-saving and most effective control intervention in Saudi Arabia in the absence of vaccination. In terms of infections averted, however, Strategies 6, 11, 12 and 14 are just as good at controlling COVID-19.

71 citations


Posted Content
TL;DR: In this paper, the authors discuss connections between sequential system identification and control for linear time-invariant systems, often termed indirect data-driven control, as well as a contemporary direct data-driven control approach seeking an optimal decision compatible with recorded data assembled in a Hankel matrix and robustified through suitable regularizations.
Abstract: We discuss connections between sequential system identification and control for linear time-invariant systems, often termed indirect data-driven control, as well as a contemporary direct data-driven control approach seeking an optimal decision compatible with recorded data assembled in a Hankel matrix and robustified through suitable regularizations. We formulate these two problems in the language of behavioral systems theory and parametric mathematical programs, and we bridge them through a multi-criteria formulation trading off system identification and control objectives. We illustrate our results with two methods from subspace identification and control, namely subspace predictive control and low-rank approximation, which constrain trajectories to be consistent with a non-parametric predictor derived from (respectively, the column span of) a data Hankel matrix. In both cases we conclude that direct and regularized data-driven control can be derived as a convex relaxation of the indirect approach, and the regularizations account for an implicit identification step. Our analysis further reveals a novel regularizer and a plausible hypothesis explaining the remarkable empirical performance of direct methods on nonlinear systems.

49 citations
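
As a rough illustration of the "recorded data assembled in a Hankel matrix" idea in the abstract above, the following Python sketch builds a block Hankel matrix from one recorded input-output trajectory and checks that a fresh trajectory of the same system lies in its column span. The two-state system, the helper names `simulate` and `block_hankel`, and the window length are assumptions chosen for illustration; none of them come from the paper.

```python
import numpy as np

def block_hankel(w, L):
    """Stack every length-L window of the signal w (shape T x q) as a column."""
    T, _ = w.shape
    return np.column_stack([w[i:i + L].reshape(-1) for i in range(T - L + 1)])

def simulate(u, x0):
    """Hypothetical LTI system x+ = A x + B u, y = C x (illustration only)."""
    A = np.array([[0.9, 0.2], [0.0, 0.7]])
    B = np.array([0.0, 1.0])
    C = np.array([1.0, 0.0])
    x, ys = np.asarray(x0, dtype=float), []
    for uk in u:
        ys.append(C @ x)
        x = A @ x + B * uk
    return np.column_stack([u, np.array(ys)])  # row k holds (u_k, y_k)

rng = np.random.default_rng(0)
L = 6                                            # window length (assumed)
w_data = simulate(rng.standard_normal(60), [0.0, 0.0])
H = block_hankel(w_data, L)                      # shape (2 * L, 55)
rank_H = np.linalg.matrix_rank(H, tol=1e-8)

# A new trajectory from a different initial state should be a combination
# of the columns of H, i.e. H @ g reproduces it for some vector g.
w_new = simulate(rng.standard_normal(L), [1.0, -1.0]).reshape(-1)
g, *_ = np.linalg.lstsq(H, w_new, rcond=None)
residual = np.linalg.norm(H @ g - w_new)
```

For this controllable single-input system with two states, the rank of `H` is expected to be `L + 2` rather than the full `2 * L`, which is what makes the column span usable as a non-parametric predictor.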


Journal ArticleDOI
TL;DR: In this article, a generalized modeling framework for co-optimizing energy infrastructure investment and operation across power and transportation sectors and the supply chains of electricity and H$_2$, while accounting for spatio-temporal variations in energy demand and supply was developed.
Abstract: There is growing interest in hydrogen (H$_2$) use for long-duration energy storage in a future electric grid dominated by variable renewable energy (VRE) resources. Modelling the role of H$_2$ as grid-scale energy storage, often referred to as "power-to-gas-to-power (P2G2P)", overlooks the cost-sharing and emission benefits from using the deployed H$_2$ production and storage assets to also supply H$_2$ for decarbonizing other end-use sectors where direct electrification may be challenged. Here, we develop a generalized modelling framework for co-optimizing energy infrastructure investment and operation across power and transportation sectors and the supply chains of electricity and H$_2$, while accounting for spatio-temporal variations in energy demand and supply. Applying this sector-coupling framework to the U.S. Northeast under a range of technology cost and carbon price scenarios, we find a greater value of power-to-H$_2$ (P2G) versus P2G2P routes. P2G provides flexible demand response, while the extra cost and efficiency penalties of P2G2P routes make the solution less attractive for grid balancing. The effects of sector-coupling are significant, boosting VRE generation by 12-55% with both increased capacities and reduced curtailments and reducing the total system cost (or levelized costs of energy) by 6-14% under 96% decarbonization scenarios. Both the cost savings and emission reductions from sector coupling increase with H$_2$ demand for other end-uses, more than doubling for a 96% decarbonization scenario as H$_2$ demand quadruples. Moreover, we find that the deployment of carbon capture and storage is more cost-effective in the H$_2$ sector because of the lower cost and higher utilization rate. These findings highlight the importance of using an integrated multi-sector energy system framework with multiple energy vectors in planning energy system decarbonization pathways.

47 citations


Posted Content
TL;DR: The correspondence between CMKV-MDP and a general lifted MDP on the space of probability measures is proved, and the dynamic programming Bellman fixed point equation satisfied by the value function is established.
Abstract: We develop an exhaustive study of Markov decision processes (MDPs) under mean field interaction both on states and actions in the presence of common noise, when optimization is performed over open-loop controls on an infinite horizon. Such a model, called CMKV-MDP for conditional McKean-Vlasov MDP, arises and is obtained here rigorously, with a rate of convergence, as the asymptotic problem of N cooperative agents controlled by a social planner/influencer that observes the environment noises but not necessarily the individual states of the agents. We highlight the crucial role of relaxed controls and the randomization hypothesis for this class of models with respect to classical MDP theory. We prove the correspondence between CMKV-MDP and a general lifted MDP on the space of probability measures, and establish the dynamic programming Bellman fixed point equation satisfied by the value function, as well as the existence of $\epsilon$-optimal randomized feedback controls. The arguments of proof involve an original measurable optimal coupling for the Wasserstein distance. This provides a procedure for learning strategies in a large population of interacting collaborative agents. MSC Classification: 90C40, 49L20.

43 citations
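
The abstract above states a Bellman fixed point equation on the space of probability measures. As a much simpler finite-state analogue (an assumption made purely for illustration, not the paper's lifted MDP), the sketch below runs value iteration on a small randomly generated discounted MDP and measures how well the result satisfies the fixed-point equation. The names `bellman` and `value_iteration` and all numerical values are hypothetical.

```python
import numpy as np

def bellman(P, r, gamma, V):
    """Apply the Bellman operator: P has shape (A, S, S), r has shape (S, A)."""
    Q = r + gamma * np.einsum("ast,t->sa", P, V)
    return Q.max(axis=1), Q.argmax(axis=1)

def value_iteration(P, r, gamma, tol=1e-10, max_iter=10_000):
    """Iterate V <- T V until successive iterates differ by less than tol."""
    V = np.zeros(r.shape[0])
    for _ in range(max_iter):
        V_new, _policy = bellman(P, r, gamma, V)
        if np.max(np.abs(V_new - V)) < tol:
            return V_new
        V = V_new
    return V

rng = np.random.default_rng(1)
P = rng.random((2, 3, 3))               # 2 actions, 3 states (toy sizes)
P /= P.sum(axis=2, keepdims=True)       # normalise rows into probabilities
r = rng.random((3, 2))
gamma = 0.9

V = value_iteration(P, r, gamma)
TV, policy = bellman(P, r, gamma, V)
fixed_point_gap = np.max(np.abs(TV - V))  # how far V is from solving V = T V
```

Because the discounted Bellman operator is a contraction in the sup norm, the iteration converges from any starting point, and `policy` is a greedy feedback rule with respect to the computed value function.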


BookDOI
TL;DR: In this paper, a unified framework for the study of multilevel mixed integer linear optimization problems and multistage stochastic MILO problems with recourse is introduced, which highlights the common mathematical structure of the two problems and allows for the development of a common algorithmic framework.
Abstract: We introduce a unified framework for the study of multilevel mixed integer linear optimization problems and multistage stochastic mixed integer linear optimization problems with recourse. The framework highlights the common mathematical structure of the two problems and allows for the development of a common algorithmic framework. Focusing on the two-stage case, we investigate, in particular, the nature of the value function of the second-stage problem, highlighting its connection to dual functions and the theory of duality for mixed integer linear optimization problems, and summarize different reformulations. We then present two main solution techniques, one based on a Benders-like decomposition to approximate either the risk function or the value function, and the other one based on cutting plane generation.

41 citations


Journal ArticleDOI
TL;DR: In this article, a comprehensive algorithmic framework based on a generalized branch-and-cut approach is described for solving mixed integer bilevel linear optimization problems (MIBLPs).
Abstract: In this paper, we describe a comprehensive algorithmic framework for solving mixed integer bilevel linear optimization problems (MIBLPs) using a generalized branch-and-cut approach. The framework presented merges features from existing algorithms (for both traditional mixed integer linear optimization and MIBLPs) with new techniques to produce a flexible and robust framework capable of solving a wide range of bilevel optimization problems. The framework has been fully implemented in the open-source solver MibS. The paper describes the algorithmic options offered by MibS and presents computational results evaluating the effectiveness of the various options for the solution of a number of classes of bilevel optimization problems from the literature.

26 citations


Journal ArticleDOI
TL;DR: In this article, the effects of physical distance on SARS-CoV-2 virus transmission are investigated through a fractional mathematical model, with the goal of minimizing the number of susceptible and infected individuals while maximizing the number of recovered.
Abstract: We investigate, through a fractional mathematical model, the effects of physical distance on SARS-CoV-2 virus transmission. Two controls are considered in our model for eradicating the spread of COVID-19: the first is media education directed at the whole population, through campaigns explaining the importance of social distancing, use of face masks, etc.; the second is quarantine (social isolation) of the exposed individuals. A general fractional order optimal control problem, and associated optimality conditions of Pontryagin type, are discussed, with the goal of minimizing the number of susceptible and infected individuals while maximizing the number of recovered. The extremals are then numerically obtained.

23 citations


Posted Content
TL;DR: A model predictive control scheme to control unknown linear time-invariant systems using only measured input-output data and no model knowledge is presented.
Abstract: We present a model predictive control (MPC) scheme to control linear time-invariant systems using only measured input-output data and no model knowledge. The scheme includes a terminal cost and a terminal set constraint on an extended state containing past input-output values. We provide an explicit design procedure for the corresponding terminal ingredients that only uses measured input-output data. Further, we prove that the MPC scheme based on these terminal ingredients exponentially stabilizes the desired setpoint in closed loop. Finally, we illustrate the advantages over existing data-driven MPC approaches with a numerical example.

22 citations


Posted Content
TL;DR: In this article, the authors present a 250 line Matlab code for topology optimization for linearized buckling criteria, which can handle stiffness, volume and load factors either as the objective function or as constraints.
Abstract: We present a 250 line Matlab code for topology optimization for linearized buckling criteria. The code is conceived to handle stiffness, volume and Buckling Load Factors (BLFs) either as the objective function or as constraints. We use the Kreisselmeier-Steinhauser aggregation function in order to reduce multiple objectives (viz. constraints) to a single, differentiable one. Then, the problem is sequentially approximated by using MMA-like expansions and an OC-like scheme is tailored to update the variables. The inspection of the stress stiffness matrix leads to a vectorized implementation for its efficient construction and for the sensitivity analysis of the BLFs. This, coupled with the efficiency improvements already presented by Ferrari and Sigmund (2020), cuts all the computational bottlenecks associated with setting up the buckling analysis and allows buckling topology optimization problems of an interesting size to be solved on a laptop. The efficiency and flexibility of the code are demonstrated over a few structural design examples and some ideas are given for possible extensions.

21 citations
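
The Kreisselmeier-Steinhauser (KS) aggregation mentioned in the abstract above replaces the non-smooth maximum of several quantities with a smooth upper bound. The paper's code is in Matlab and its exact scaling is not given here, so the following Python sketch shows only the generic textbook form, with the function name `ks_aggregate` and the sample values being assumptions.

```python
import numpy as np

def ks_aggregate(g, rho=50.0):
    """Smooth upper bound on max(g): (1 / rho) * log(sum(exp(rho * g))).

    The maximum is subtracted before exponentiating for numerical stability.
    Also returns the gradient with respect to g, which is a softmax weighting.
    """
    g = np.asarray(g, dtype=float)
    g_max = g.max()
    shifted = np.exp(rho * (g - g_max))
    value = g_max + np.log(shifted.sum()) / rho
    weights = shifted / shifted.sum()
    return value, weights

# Hypothetical constraint values; the second entry is the active one.
g = np.array([0.2, 0.9, 0.85, -0.3])
value, weights = ks_aggregate(g, rho=50.0)
```

The aggregated value always lies between `max(g)` and `max(g) + log(len(g)) / rho`, so a larger `rho` tightens the approximation at the price of a less smooth function.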


Posted Content
TL;DR: It is key to view a monolithic system as an interconnection of subsystems, so that dynamical properties at the interconnection level can be inferred from the properties of the subsystems and the characteristics of their interconnection.
Abstract: A central notion in systems theory is dissipativity, which was introduced by Jan Willems with the explicit goal of arriving at a fundamental understanding of the stability properties of feedback interconnections. In robust control, the framework of integral quadratic constraints (IQCs) builds on the seminal contributions of Yakubovich and Zames in the 1960s. It provides a technique for analyzing the stability of an interconnection of some linear system in feedback with a whole class of systems, also referred to as uncertainty. In this paper we survey the key ideas of exploiting dissipativity and integral quadratic constraints for the computational analysis of robust stability and performance properties of uncertain interconnections in terms of linear matrix inequalities. In particular for dynamic supply rates, the paper revolves around the notion of finite-horizon integral quadratic constraints with a terminal cost. We reveal that this provides a seamless link between the general IQC theorem and dissipativity theory that has been established only rather recently.

20 citations


Posted Content
TL;DR: In this paper, the convergence of entropically regularized optimal transport to optimal transport is studied in terms of a large deviations principle; the exact rate function is determined in a general setting and linked to the Kantorovich potential of optimal transport.
Abstract: We study the convergence of entropically regularized optimal transport to optimal transport. The main result is concerned with the convergence of the associated optimizers and takes the form of a large deviations principle quantifying the local exponential convergence rate as the regularization parameter vanishes. The exact rate function is determined in a general setting and linked to the Kantorovich potential of optimal transport. Our arguments are based on the geometry of the optimizers and inspired by the use of $c$-cyclical monotonicity in classical transport theory. The results can also be phrased in terms of Schrödinger bridges.
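
To make "entropically regularized optimal transport" concrete, the sketch below solves a small discrete instance with the standard Sinkhorn iteration for a decreasing sequence of regularization parameters. It only illustrates numerically that the transport cost of the regularized optimizer shrinks as the parameter decreases; it does not reproduce the large deviations result. The marginals, the quadratic cost and the function name `sinkhorn` are assumptions.

```python
import numpy as np

def sinkhorn(mu, nu, C, eps, n_iter=5000):
    """Entropic OT between discrete marginals mu, nu with cost matrix C."""
    K = np.exp(-C / eps)                 # Gibbs kernel
    u = np.ones_like(mu)
    v = np.ones_like(nu)
    for _ in range(n_iter):
        u = mu / (K @ v)                 # match the row marginals
        v = nu / (K.T @ u)               # match the column marginals
    P = u[:, None] * K * v[None, :]
    return P, float(np.sum(P * C))

x = np.linspace(0.0, 1.0, 5)
C = (x[:, None] - x[None, :]) ** 2       # quadratic cost on a 1-D grid
mu = np.full(5, 0.2)
nu = np.array([0.1, 0.2, 0.4, 0.2, 0.1])

costs = []
for eps in (1.0, 0.3, 0.1):
    P, cost = sinkhorn(mu, nu, C, eps)
    costs.append(cost)
```

Very small values of `eps` make `exp(-C / eps)` underflow in this plain implementation; a log-domain variant would be needed to push the regularization parameter much closer to zero.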

Posted Content
TL;DR: In this article, a data-driven MPC approach to control unknown nonlinear systems using only measured input-output data with closed-loop stability guarantees is presented.
Abstract: We present a novel data-driven MPC approach to control unknown nonlinear systems using only measured input-output data with closed-loop stability guarantees. Our scheme relies on the data-driven system parametrization provided by the Fundamental Lemma of Willems et al. We use new input-output measurements online to update the data, exploiting local linear approximations of the underlying system. We prove that our MPC scheme, which only requires solving strictly convex quadratic programs online, ensures that the closed loop (practically) converges to the (unknown) optimal reachable equilibrium that tracks a desired output reference. As intermediate results of independent interest, we extend the Fundamental Lemma to affine systems and we propose a data-driven tracking MPC scheme with guaranteed robustness. The theoretical analysis of this MPC scheme relies on novel robustness bounds w.r.t. noisy data for the open-loop optimal control problem, which are directly transferable to other data-driven MPC schemes in the literature. The applicability of our approach is illustrated with a numerical application to a continuous stirred tank reactor.

Posted Content
TL;DR: This paper develops a framework to deal with average data-rate constraints in a tractable manner that combines ideas from both information and control theories and shows that the proposed class of coding schemes can achieve mean square stability at average data-rates that are, at most, 1.254 bits per sample away from the absolute minimum rate for stability.
Abstract: Theorem 4.1 in the 2011 paper "A Framework for Control System Design Subject to Average Data-Rate Constraints" allows one to lower bound average operational data rates in feedback loops (including the situation in which encoder and decoder have side information). Unfortunately, its proof is invalid. In this note we first state the theorem and explain why its proof is flawed, and then provide a correct proof under weaker assumptions.

Posted Content
TL;DR: In a Hilbert space setting, for convex optimization, the convergence of the iterates to optimal solutions for a class of accelerated first-order algorithms can be interpreted as discrete temporal versions of an inertial dynamic involving both viscous damping and Hessian-driven damping.
Abstract: In a Hilbert space setting, for convex optimization, we show the convergence of the iterates to optimal solutions for a class of accelerated first-order algorithms. They can be interpreted as discrete temporal versions of an inertial dynamic involving both viscous damping and Hessian-driven damping. The asymptotically vanishing viscous damping is linked to the accelerated gradient method of Nesterov, while the Hessian-driven damping makes it possible to significantly attenuate the oscillations. Treating the Hessian-driven damping as the time derivative of the gradient term gives, in discretized form, first-order algorithms. These results complement the authors' previous work, which established fast convergence of the values and fast convergence of the gradients towards zero. Keywords: convergence of iterates; Hessian-driven damping; inertial optimization algorithms; Nesterov accelerated gradient method; time rescaling.

Posted Content
TL;DR: The Single-Timescale stochAstic BiLevEl Optimization (STABLE) algorithm uses a single-timescale update with a fixed batch size to solve the stochastic bilevel problem.
Abstract: Stochastic bilevel optimization generalizes the classic stochastic optimization from the minimization of a single objective to the minimization of an objective function that depends on the solution of another optimization problem. Recently, stochastic bilevel optimization has been regaining popularity in emerging machine learning applications such as hyper-parameter optimization and model-agnostic meta learning. To solve this class of stochastic optimization problems, existing methods require either double-loop or two-timescale updates, which are sometimes less efficient. This paper develops a new optimization method for a class of stochastic bilevel problems that we term the Single-Timescale stochAstic BiLevEl optimization (STABLE) method. STABLE runs in a single-loop fashion, and uses a single-timescale update with a fixed batch size. To achieve an $\epsilon$-stationary point of the bilevel problem, STABLE requires ${\cal O}(\epsilon^{-2})$ samples in total; and to achieve an $\epsilon$-optimal solution in the strongly convex case, STABLE requires ${\cal O}(\epsilon^{-1})$ samples. To the best of our knowledge, this is the first bilevel optimization algorithm achieving the same order of sample complexity as the stochastic gradient descent method for single-level stochastic optimization.

Posted Content
TL;DR: A general framework is developed which allows the characterization of decision formulations which are optimal in a precise sense and shows that under certain mild technical assumptions closely related to the existence of a sufficient statistic satisfying a large deviation principle, the optimal decision enjoys an intuitive separation into an estimation and a subsequent robust optimization step.
Abstract: We propose a statistically optimal approach to construct data-driven decisions for stochastic optimization problems. Fundamentally, a data-driven decision is simply a function that maps the available training data to a feasible action. It can always be expressed as the minimizer of a surrogate optimization model constructed from the data. The quality of a data-driven decision is measured by its out-of-sample risk. An additional quality measure is its out-of-sample disappointment, which we define as the probability that the out-of-sample risk exceeds the optimal value of the surrogate optimization model. An ideal data-driven decision should minimize the out-of-sample risk simultaneously with respect to every conceivable probability measure, as the true measure is unknown. Unfortunately, such ideal data-driven decisions are generally unavailable. This prompts us to seek data-driven decisions that minimize the out-of-sample risk subject to an upper bound on the out-of-sample disappointment. We prove that such Pareto-dominant data-driven decisions exist under conditions that allow for interesting applications: the unknown data-generating probability measure must belong to a parametric ambiguity set, and the corresponding parameters must admit a sufficient statistic that satisfies a large deviation principle. We can further prove that the surrogate optimization model must be a distributionally robust optimization problem constructed from the sufficient statistic and the rate function of its large deviation principle. Hence the optimal method for mapping data to decisions is to solve a distributionally robust optimization model. Perhaps surprisingly, this result holds even when the training data is non-i.i.d. Our analysis reveals how the structural properties of the data-generating stochastic process impact the shape of the ambiguity set underlying the optimal distributionally robust model.

Journal ArticleDOI
TL;DR: CIL provides an extensive modular optimization framework for prototyping reconstruction methods including sparsity and total variation regularization, as well as tools for loading, preprocessing and visualizing tomographic data.
Abstract: We present the Core Imaging Library (CIL), an open-source Python framework for tomographic imaging with particular emphasis on reconstruction of challenging datasets. Conventional filtered back-projection reconstruction tends to be insufficient for highly noisy, incomplete, non-standard or multi-channel data arising for example in dynamic, spectral and in situ tomography. CIL provides an extensive modular optimisation framework for prototyping reconstruction methods including sparsity and total variation regularisation, as well as tools for loading, preprocessing and visualising tomographic data. The capabilities of CIL are demonstrated on a synchrotron example dataset and three challenging cases spanning golden-ratio neutron tomography, cone-beam X-ray laminography and positron emission tomography.

Posted Content
TL;DR: The results provide new insights on how perturbations propagate through the NLP graph and on how the problem formulation influences such propagation, and provide empirical evidence that positive objective curvature and constraint flexibility tend to dampen propagation.
Abstract: We study solution sensitivity for nonlinear programs (NLPs) whose structure is induced by a graph $\mathcal{G}=(\mathcal{V},\mathcal{E})$. These graph-structured NLPs arise in many applications such as dynamic optimization, stochastic optimization, optimization with partial differential equations, and network optimization. We show that the sensitivity of the primal-dual solution at node $i\in \mathcal{V}$ against a data perturbation at node $j\in \mathcal{V}$ is bounded by $\Upsilon \rho^{d_\mathcal{G}(i,j)}$ for constants $\Upsilon>0$ and $\rho\in(0,1)$ and where $d_\mathcal{G}(i,j)$ is the distance between $i$ and $j$ on $\mathcal{G}$. In other words, the sensitivity of the solution decays exponentially with the distance to the perturbation point. This result, which we call exponential decay of sensitivity (EDS), holds under fairly standard assumptions used in classical NLP sensitivity theory: the strong second-order sufficiency condition and the linear independence constraint qualification. We also present conditions under which the constants $(\Upsilon,\rho)$ remain uniformly bounded; this allows us to characterize behavior for NLPs defined over subgraphs of infinite graphs (e.g., as those arising in problems with unbounded domains). Our results provide new insights on how perturbations propagate through the NLP graph and on how the problem formulation influences such propagation. Specifically, we provide empirical evidence that positive objective curvature and constraint flexibility tend to dampen propagation. The developments are illustrated with numerical examples.
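
The exponential decay of sensitivity described in the abstract above can be observed on a toy problem. The sketch below uses an unconstrained quadratic program on a chain graph (so there are no dual variables, a simplification relative to the paper), where the solution sensitivity with respect to the data is simply the inverse of the Hessian. The problem data and variable names are assumptions made for illustration.

```python
import numpy as np

n = 20  # number of nodes on the chain graph (assumed)

# Chain-structured QP: minimize 0.5 * x^T Q x - d^T x, where Q couples only
# neighbouring nodes. The minimizer is x*(d) = Q^{-1} d, so the sensitivity
# of node i to a data perturbation at node j is the (i, j) entry of Q^{-1}.
Q = 3.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
S = np.linalg.inv(Q)

row = np.abs(S[0])               # sensitivity of node 0 to data at node j
ratios = row[1:] / row[:-1]      # decay factor per unit of graph distance
```

Every entry of `ratios` is strictly below one, so the sensitivity of node 0 shrinks geometrically with the graph distance to the perturbed node, which is the qualitative behaviour that the bound $\Upsilon \rho^{d_\mathcal{G}(i,j)}$ captures.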

Posted Content
TL;DR: In this paper, a general framework for large-scale model-based derivative-free optimization based on iterative minimization within random subspaces is introduced, which achieves scalability by constructing local linear interpolation models to approximate the Jacobian.
Abstract: We introduce a general framework for large-scale model-based derivative-free optimization based on iterative minimization within random subspaces. We present a probabilistic worst-case complexity analysis for our method, where in particular we prove high-probability bounds on the number of iterations before a given optimality is achieved. This framework is specialized to nonlinear least-squares problems, with a model-based framework based on the Gauss-Newton method. This method achieves scalability by constructing local linear interpolation models to approximate the Jacobian, and computes new steps at each iteration in a subspace with user-determined dimension. We then describe a practical implementation of this framework, which we call DFBGN. We outline efficient techniques for selecting the interpolation points and search subspace, yielding an implementation that has a low per-iteration linear algebra cost (linear in the problem dimension) while also achieving fast objective decrease as measured by evaluations. Extensive numerical results demonstrate that DFBGN has improved scalability, yielding strong performance on large-scale nonlinear least-squares problems.

Journal ArticleDOI
TL;DR: In this article, the authors establish both iterate and objective convergence of plug-and-play regularization for a special class of linear denoisers, covering non-symmetric denoisers such as nonlocal means and almost any convex data-fidelity, using the convergence theory of averaged operators.
Abstract: A standard model for image reconstruction involves the minimization of a data-fidelity term along with a regularizer, where the optimization is performed using proximal algorithms such as ISTA and ADMM. In plug-and-play (PnP) regularization, the proximal operator (associated with the regularizer) in ISTA and ADMM is replaced by a powerful image denoiser. Although PnP regularization works surprisingly well in practice, its theoretical convergence -- whether convergence of the PnP iterates is guaranteed and if they minimize some objective function -- is not completely understood even for simple linear denoisers such as nonlocal means. In particular, while there are works where either iterate or objective convergence is established separately, a simultaneous guarantee on iterate and objective convergence is not available for any denoiser to our knowledge. In this paper, we establish both forms of convergence for a special class of linear denoisers. Notably, unlike existing works where the focus is on symmetric denoisers, our analysis covers non-symmetric denoisers such as nonlocal means and almost any convex data-fidelity. The novelty in this regard is that we make use of the convergence theory of averaged operators and we work with a special inner product (and norm) derived from the linear denoiser; the latter requires us to appropriately define the gradient and proximal operators associated with the data-fidelity term. We validate our convergence results using image reconstruction experiments.
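
The plug-and-play idea in the abstract above, replacing the proximal operator in ISTA with a denoiser, can be sketched in a few lines for a one-dimensional signal. A simple moving-average matrix stands in for the linear denoiser here (it is row-stochastic and non-symmetric at the boundaries, but it is not nonlocal means), and the data-fidelity term is the plain least-squares distance to the observation. The function names, step size and signal are assumptions.

```python
import numpy as np

def moving_average_denoiser(n, width=1):
    """Row-stochastic smoothing matrix, a stand-in for a linear denoiser."""
    W = np.zeros((n, n))
    for i in range(n):
        lo, hi = max(0, i - width), min(n, i + width + 1)
        W[i, lo:hi] = 1.0 / (hi - lo)
    return W

def pnp_ista(b, W, gamma=0.5, n_iter=200):
    """PnP-ISTA for f(x) = 0.5 * ||x - b||^2 with W replacing the prox step."""
    x = np.zeros_like(b)
    for _ in range(n_iter):
        grad = x - b                     # gradient of the data-fidelity term
        x = W @ (x - gamma * grad)       # gradient step, then denoise
    return x

rng = np.random.default_rng(0)
n = 30
b = np.sin(np.linspace(0.0, np.pi, n)) + 0.1 * rng.standard_normal(n)
W = moving_average_denoiser(n)

x_hat = pnp_ista(b, W)
# For this linear setting the fixed point is also available in closed form.
x_star = np.linalg.solve(np.eye(n) - 0.5 * W, 0.5 * W @ b)
```

In this toy setting the iteration is a contraction in the sup norm, so iterate convergence is immediate; the paper's contribution concerns the much less obvious question of when the iterates also minimize an objective function.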

Posted Content
TL;DR: In this article, a collision avoidance algorithm based on a ship domain whose size varies with ship speed is proposed to include spatial constraints in the optimization, and the effect of wind disturbance is taken into account in the trajectory planning to produce a feasible trajectory.
Abstract: To realize autonomous shipping, autonomous berthing and unberthing are among the technical challenges. Numerous studies have addressed the optimization of trajectory planning for berthing. However, these studies assumed only a simple berth and did not consider obstacles. Optimization of trajectory planning for berthing and unberthing in actual ports must consider the spatial constraints and maintain sufficient distance from obstacles. The main contributions of this study are as follows: (i) a collision avoidance algorithm based on a ship domain whose size varies with ship speed is proposed, to include the spatial constraints in the optimization; (ii) the effect of wind disturbance is taken into account in the trajectory planning, to produce a feasible trajectory based on the capacity limit of the actuators; (iii) the optimization method for berthing is shown to be applicable to unberthing as well, which has been almost neglected; (iv) waypoints are included in the optimization process, to make optimization easier in practical applications. The authors tested the proposed method on two existing ports. The proposed method performed well on both the berthing and the unberthing problem and optimized the control input and the trajectory while avoiding collision with the complex obstacles.

Posted Content
TL;DR: In this article, the authors study the resiliency during a pandemic of On-Demand Multimodal Transit Systems (ODMTS), a new generation of transit systems that combine a network of high-frequency trains and buses with on-demand shuttles to serve the first and last miles and act as feeders to the fixed network.
Abstract: During the COVID-19 pandemic, the collapse of public transit ridership led to significant budget deficits due to dramatic decreases in fare revenues. Additionally, public transit agencies are facing challenges of reduced vehicle capacity due to social distancing requirements, additional costs of cleaning and protective equipment, and increased downtime for vehicle cleaning. Due to these constraints on resources and budgets, many transit agencies have adopted essential service plans with reduced service hours, number of routes, or frequencies. This paper studies the resiliency during a pandemic of On-Demand Multimodal Transit Systems (ODMTS), a new generation of transit systems that combine a network of high-frequency trains and buses with on-demand shuttles to serve the first and last miles and act as feeders to the fixed network. It presents a case study for the city of Atlanta and evaluates ODMTS for multiple scenarios of depressed demand and social distancing representing various stages of the pandemic. The case study relies on real data from the Metropolitan Atlanta Rapid Transit Authority (MARTA), an optimization pipeline for the design of ODMTS, and a detailed simulation of these designs. The case study demonstrates how ODMTS provide a resilient solution in terms of cost, convenience, and accessibility for this wide range of scenarios.

Posted Content
TL;DR: Learning to Optimize (L2O) as discussed by the authors is an emerging approach that leverages machine learning to develop optimization methods, aiming at reducing the laborious iterations of hand engineering.
Abstract: Learning to optimize (L2O) is an emerging approach that leverages machine learning to develop optimization methods, aiming at reducing the laborious iterations of hand engineering. It automates the design of an optimization method based on its performance on a set of training problems. This data-driven procedure generates methods that can efficiently solve problems similar to those in the training set. In sharp contrast, traditional designs of optimization methods are theory-driven, so they obtain performance guarantees over the classes of problems specified by the theory. This difference makes L2O suitable for repeatedly solving a certain type of optimization problem over a specific distribution of data, while it typically fails on out-of-distribution problems. The practicality of L2O depends on the type of target optimization, the chosen architecture of the method to learn, and the training procedure. This new paradigm has motivated a community of researchers to explore L2O and report their findings. This article is poised to be the first comprehensive survey and benchmark of L2O for continuous optimization. We set up taxonomies, categorize existing works and research directions, present insights, and identify open challenges. We also benchmarked many existing L2O approaches on a few but representative optimization problems. For reproducible research and fair benchmarking purposes, we released our software implementation and data in the package Open-L2O at this https URL.

Posted Content
TL;DR: In this paper, the authors investigated how different MPC strategies perform on energy management systems in buildings and energy hubs, and proposed a distributed control based on dual decomposition, which has the advantages of both approaches.
Abstract: Model predictive control (MPC) strategies can be applied to the coordination of energy hubs to reduce their energy consumption. Despite the effectiveness of these techniques, their potential for energy savings is often underutilized because energy demands are assumed to be fixed quantities rather than controlled dynamic variables. The joint optimization of energy hubs and buildings' energy management systems can result in higher energy savings. This paper investigates how different MPC strategies perform on energy management systems in buildings and energy hubs. We first discuss two MPC approaches: centralized and decentralized. While the centralized control strategy offers optimal performance, its implementation is computationally prohibitive and raises privacy concerns. On the other hand, the decentralized control approach, which offers ease of implementation, displays significantly lower performance. We propose a third strategy, distributed control based on dual decomposition, which has the advantages of both approaches. Numerical case studies and comparisons demonstrate that the performance of distributed control is close to the performance of the centralized case, while maintaining a significantly lower computational burden, especially in large-scale scenarios with many agents. Finally, we validate and verify the reliability of the proposed method through an experiment on a full-scale energy hub system in the NEST demonstrator in Dübendorf, Switzerland.
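
To illustrate the general idea of dual decomposition mentioned in the abstract above, here is a minimal sketch on a toy problem. It is not the paper's MPC formulation: the quadratic agent costs, the single coupling constraint (total supply equals a demand `d`), the fixed step size, and all function names are assumptions made purely for illustration. Each agent solves its own small problem given a shared price, and a coordinator adjusts the price by dual ascent until the coupling constraint is met.

```python
# Toy dual decomposition (illustrative assumption, not the paper's model):
# minimize sum_i 0.5 * a_i * (x_i - r_i)^2  subject to  sum_i x_i = d.

def local_response(a_i, r_i, price):
    # Each agent minimizes 0.5*a_i*(x - r_i)^2 + price*x on its own.
    # Setting the derivative to zero gives x = r_i - price / a_i.
    return r_i - price / a_i

def dual_decomposition(a, r, d, step=0.5, iters=200):
    price = 0.0  # Lagrange multiplier of the coupling constraint
    for _ in range(iters):
        x = [local_response(ai, ri, price) for ai, ri in zip(a, r)]
        # Dual ascent: raise the price if total supply exceeds d.
        price += step * (sum(x) - d)
    x = [local_response(ai, ri, price) for ai, ri in zip(a, r)]
    return x, price

a = [1.0, 2.0, 4.0]   # hypothetical curvature of each agent's cost
r = [3.0, 1.0, 2.0]   # hypothetical preferred set-points
x_opt, price_opt = dual_decomposition(a, r, d=4.0)
print(x_opt, price_opt)
```

Only the price and the aggregate mismatch are exchanged in this sketch, so agents never reveal their cost parameters to each other. The step size must be smaller than 2 divided by the sum of 1/a_i for this toy iteration to converge.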

Book Chapter
TL;DR: In this paper, the outer minimization problem is treated as a minimization problem with an inexact oracle, computed via an inexact solution of the inner problem, which is either a minimization or a maximization problem; an inexact variant of Vaidya's cutting-plane method or a variant of the accelerated gradient method is used to solve the outer problem.
Abstract: In this paper, we consider two types of problems that have some similarity in their structure, namely, min-min problems and min-max saddle-point problems. Our approach is based on considering the outer minimization problem as a minimization problem with an inexact oracle. This inexact oracle is calculated via an inexact solution of the inner problem, which is either a minimization or a maximization problem. Our main assumptions are that the problem is smooth and the available oracle is mixed: it is only possible to evaluate the gradient w.r.t. the outer block of variables, which corresponds to the outer minimization problem, whereas for the inner problem only a zeroth-order oracle is available. To solve the inner problem we use an accelerated gradient-free method with a zeroth-order oracle. To solve the outer problem we use either an inexact variant of Vaidya's cutting-plane method or a variant of the accelerated gradient method. As a result, we propose a framework that leads to non-asymptotic complexity bounds for both min-min and min-max problems. Moreover, we estimate separately the number of first- and zeroth-order oracle calls that are sufficient to reach any desired accuracy.

Journal Article
TL;DR: In this article, conditions for ensuring forward invariance of safe sets under sampled-data system dynamics with piecewise-constant controllers and fixed time-steps are presented. The proposed conditions are less conservative than those in earlier studies, and they enable the use of barrier functions that are impossible to implement with the desired time-step using existing methods.
Abstract: This paper presents conditions for ensuring forward invariance of safe sets under sampled-data system dynamics with piecewise-constant controllers and fixed time-steps. First, we introduce two different metrics to compare the conservativeness of sufficient conditions on forward invariance under piecewise-constant controllers. Then, we propose three approaches for guaranteeing forward invariance, two motivated by continuous-time barrier functions, and one motivated by discrete-time barrier functions. All proposed conditions are control affine, and thus can be incorporated into quadratic programs for control synthesis. We show that the proposed conditions are less conservative than those in earlier studies, and show via simulation how this enables the use of barrier functions that are impossible to implement with the desired time-step using existing methods.
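
As a rough illustration of how a barrier-function condition can filter a piecewise-constant input, the sketch below uses a textbook discrete-time condition, h(x_next) >= (1 - gamma) * h(x), on a single integrator. This is not one of the paper's three proposed conditions; the dynamics, the safe set `x <= x_max`, the parameter values, and the function names are all assumptions for illustration. In one dimension the usual quadratic program (stay as close as possible to the desired input subject to the barrier constraint) reduces to a simple clamp.

```python
# Illustrative safety filter for x_next = x + dt * u with safe set h(x) = x_max - x >= 0.
# Assumed condition (generic discrete-time barrier, not the paper's):
#   h(x_next) >= (1 - gamma) * h(x)   which is equivalent to   u <= gamma * h(x) / dt.

def safe_input(x, u_des, x_max, dt, gamma):
    h = x_max - x                 # barrier value at the current sample
    u_bound = gamma * h / dt      # largest input satisfying the condition
    # Closed-form solution of: minimize (u - u_des)^2 subject to u <= u_bound.
    return min(u_des, u_bound)

def simulate(x0=0.0, u_des=5.0, x_max=1.0, dt=0.1, gamma=0.5, steps=50):
    xs = [x0]
    for _ in range(steps):
        # Input is held constant over each sampling interval of length dt.
        u = safe_input(xs[-1], u_des, x_max, dt, gamma)
        xs.append(xs[-1] + dt * u)
    return xs

trajectory = simulate()
print(max(trajectory))
```

With these assumed values the desired input would overshoot the boundary after two steps if left unfiltered, whereas the filtered trajectory approaches `x_max` without crossing it.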

Posted Content
TL;DR: A generic variance-reduced algorithm for minimizing a sum of several smooth functions plus a regularizer, in a sequential or distributed manner, formulated with general stochastic operators, which allow it to cover many existing randomization mechanisms within a unified framework.
Abstract: We propose a generic variance-reduced algorithm, which we call MUltiple RANdomized Algorithm (MURANA), for minimizing a sum of several smooth functions plus a regularizer, in a sequential or distributed manner. Our method is formulated with general stochastic operators, which allow us to model various strategies for reducing the computational complexity. For example, MURANA supports sparse activation of the gradients, and also reduction of the communication load via compression of the update vectors. This versatility allows MURANA to cover many existing randomization mechanisms within a unified framework. However, MURANA also encodes new methods as special cases. We highlight one of them, which we call ELVIRA, and show that it improves upon Loopless SVRG.

Posted Content
TL;DR: In this article, the authors considered the cases of Dirichlet/Neumann/Robin boundary conditions for both the boundary control and the boundary condition, and showed that the control strategy achieves the exponential stabilization of the closed-loop system, provided the dimension of the observer is selected large enough.
Abstract: This paper is concerned with the output feedback boundary stabilization of general 1-D reaction-diffusion PDEs in the presence of an arbitrarily large input delay. We consider the cases of Dirichlet/Neumann/Robin boundary conditions for both the boundary control and the boundary condition. The boundary measurement takes the form of either a Dirichlet or a Neumann trace. The adopted control strategy is composed of a finite-dimensional observer estimating the first modes of the PDE, coupled with a predictor to compensate for the input delay. In this context, we show for any arbitrary value of the input delay that the control strategy achieves the exponential stabilization of the closed-loop system, for system trajectories evaluated in $H^1$ norm (also in $L^2$ norm in the case of a Dirichlet boundary measurement), provided the dimension of the observer is selected large enough. The reported proof of this result requires performing both control design and stability analysis using simultaneously the (non-homogeneous) original version of the PDE and one of its equivalent homogeneous representations.

Posted Content
TL;DR: In this paper, an actor-critic framework inspired by reinforcement learning is proposed for high-dimensional Hamilton-Jacobi-Bellman type elliptic partial differential equations (PDEs), where the authors employ a policy gradient approach to improve the control and derive a variance-reduced least-squares temporal difference method (VR-LSTD) for the value function using stochastic calculus.
Abstract: We propose a novel numerical method for high-dimensional Hamilton-Jacobi-Bellman (HJB) type elliptic partial differential equations (PDEs). The HJB PDEs, reformulated as optimal control problems, are tackled by the actor-critic framework inspired by reinforcement learning, based on neural network parametrization of the value and control functions. Within the actor-critic framework, we employ a policy gradient approach to improve the control, while for the value function, we derive a variance-reduced least-squares temporal difference method (VR-LSTD) using stochastic calculus. To numerically discretize the stochastic control problem, we employ an adaptive stepsize scheme to improve the accuracy near the domain boundary. Numerical examples in up to $20$ spatial dimensions, including linear quadratic regulators, stochastic Van der Pol oscillators, and diffusive Eikonal equations, are presented to validate the effectiveness of our proposed method.

Posted Content
TL;DR: In this paper, the authors give an overview of the consensus-based global optimization algorithm and its recent variants using component-wise independent or common noise, which is useful for high-dimensional problems.
Abstract: In this chapter we give an overview of the consensus-based global optimization algorithm and its recent variants. We recall the formulation and analytical results of the original model, then we discuss variants using component-wise independent or common noise. In combination with mini-batch approaches those variants were tailored for machine learning applications. Moreover, it turns out that the analytical estimates are dimension independent, which is useful for high-dimensional problems. We discuss the relationship of consensus-based optimization with particle swarm optimization, a method widely used in the engineering community. Then we survey a variant of consensus-based optimization that is proposed for global optimization problems constrained to hyper-surfaces. We conclude the chapter with remarks on applications, preprints and open problems.
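
For readers unfamiliar with consensus-based optimization, the following is a minimal sketch of one commonly described discretization with component-wise noise. It is written under assumptions rather than taken from the chapter: the Euler-Maruyama time stepping, the parameter values (`alpha`, `lam`, `sigma`, `dt`), the uniform initialization on [-3, 3], the shifted-sphere test function, and all function names are illustrative choices.

```python
import math
import random

def consensus_point(particles, f, alpha):
    # Weighted average of particles with Gibbs-type weights exp(-alpha * f(x)).
    values = [f(p) for p in particles]
    v_min = min(values)  # shift exponents for numerical stability
    weights = [math.exp(-alpha * (v - v_min)) for v in values]
    total = sum(weights)
    dim = len(particles[0])
    return [sum(w * p[k] for w, p in zip(weights, particles)) / total
            for k in range(dim)]

def cbo_minimize(f, dim, n_particles=200, steps=400, alpha=30.0,
                 lam=1.0, sigma=0.7, dt=0.05, seed=0):
    rng = random.Random(seed)
    particles = [[rng.uniform(-3.0, 3.0) for _ in range(dim)]
                 for _ in range(n_particles)]
    for _ in range(steps):
        v = consensus_point(particles, f, alpha)
        for p in particles:
            for k in range(dim):
                diff = p[k] - v[k]
                # Drift toward the consensus point plus component-wise noise
                # whose size scales with the distance in that coordinate.
                p[k] += (-lam * dt * diff
                         + sigma * math.sqrt(dt) * diff * rng.gauss(0.0, 1.0))
    return consensus_point(particles, f, alpha)

def shifted_sphere(x):
    # Hypothetical test objective with its minimizer at (1, ..., 1).
    return sum((xi - 1.0) ** 2 for xi in x)

best = cbo_minimize(shifted_sphere, dim=2)
print(best, shifted_sphere(best))
```

Because the noise in each coordinate is proportional to that coordinate's distance from the consensus point, particles that have already reached consensus stop moving, which is the mechanism that lets the ensemble collapse onto a single candidate minimizer.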