
Showing papers in "arXiv: Optimization and Control in 2013"


Book Chapter
TL;DR: In this article, a new algorithm, namely, flower pollination algorithm, inspired by the pollination process of flowers, was proposed, which is more efficient than both GA and PSO.
Abstract: Flower pollination is an intriguing process in the natural world. Its evolutionary characteristics can be used to design new optimization algorithms. In this paper, we propose a new algorithm, namely, flower pollination algorithm, inspired by the pollination process of flowers. We first use ten test functions to validate the new algorithm, and compare its performance with genetic algorithms and particle swarm optimization. Our simulation results show the flower algorithm is more efficient than both GA and PSO. We also use the flower algorithm to solve a nonlinear design benchmark, which shows the convergence rate is almost exponential.

1,415 citations


Posted Content
TL;DR: In this paper, the stochastic average gradient (SAG) method was proposed to optimize the sum of a finite number of smooth convex functions, which achieves a faster convergence rate than black-box SG methods.
Abstract: We propose the stochastic average gradient (SAG) method for optimizing the sum of a finite number of smooth convex functions. Like stochastic gradient (SG) methods, the SAG method's iteration cost is independent of the number of terms in the sum. However, by incorporating a memory of previous gradient values the SAG method achieves a faster convergence rate than black-box SG methods. The convergence rate is improved from O(1/k^{1/2}) to O(1/k) in general, and when the sum is strongly-convex the convergence rate is improved from the sub-linear O(1/k) to a linear convergence rate of the form O(p^k) for p < 1. Further, in many cases the convergence rate of the new method is also faster than black-box deterministic gradient methods, in terms of the number of gradient evaluations. Numerical experiments indicate that the new algorithm often dramatically outperforms existing SG and deterministic gradient methods, and that the performance may be further improved through the use of non-uniform sampling strategies.

744 citations
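
The gradient memory described in the abstract above can be sketched in a few lines. This is a minimal illustration rather than the authors' reference implementation: the function name `sag`, the toy least-squares data, and the step size 1/(16 L), with L taken as the largest squared row norm, are all assumptions made for this example.

```python
import random


def sag(grad_i, n, x0, step, iters, seed=0):
    """Stochastic average gradient sketch for min (1/n) * sum_i f_i(x)."""
    rng = random.Random(seed)
    x = list(x0)
    d = len(x)
    memory = [[0.0] * d for _ in range(n)]  # last gradient seen for each term
    total = [0.0] * d                       # running sum of the stored gradients
    for _ in range(iters):
        i = rng.randrange(n)                # sample one term uniformly
        g = grad_i(i, x)                    # a single gradient evaluation
        for j in range(d):
            total[j] += g[j] - memory[i][j]  # swap the old gradient for the new one
            x[j] -= step * total[j] / n      # step along the average stored gradient
        memory[i] = g
    return x


# Hypothetical toy problem: f_i(x) = 0.5 * (a_i . x - b_i)^2 with consistent data.
rows = [(1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (1.0, -1.0)]
x_true = (2.0, -1.0)
rhs = [a[0] * x_true[0] + a[1] * x_true[1] for a in rows]


def grad_i(i, x):
    residual = rows[i][0] * x[0] + rows[i][1] * x[1] - rhs[i]
    return [residual * rows[i][0], residual * rows[i][1]]


lipschitz = max(a[0] ** 2 + a[1] ** 2 for a in rows)  # per-term gradient Lipschitz constant
solution = sag(grad_i, n=len(rows), x0=[0.0, 0.0],
               step=1.0 / (16.0 * lipschitz), iters=5000)
```

Each iteration evaluates only one gradient, yet the step uses the average of all n stored gradients, which is the feature that separates SAG from a plain SG step.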


Posted Content
TL;DR: The impact of low rotational inertia on power system stability and operation is investigated, new analysis insights are contributed and impact mitigation options are offered.
Abstract: Large-scale deployment of RES has led to significant generation shares of variable RES in power systems worldwide. RES units, notably inverter-connected wind turbines and PV that as such do not provide rotational inertia, are effectively displacing conventional generators and their rotating machinery. The traditional assumption that grid inertia is sufficiently high with only small variations over time is thus not valid for power systems with high RES shares. This has implications for frequency dynamics and power system stability and operation. Frequency dynamics are faster in power systems with low rotational inertia, making frequency control and power system operation more challenging. This paper investigates the impact of low rotational inertia on power system stability and operation, contributes new analysis insights and offers mitigation options for low inertia impacts.

508 citations


Posted Content
TL;DR: This paper studies the decentralized gradient descent method, in which each agent $i$ updates its local variable by combining the average of its neighbors' variables with a local negative-gradient step $-\alpha \nabla f_i(x_{(i)})$.
Abstract: Consider the consensus problem of minimizing $f(x)=\sum_{i=1}^n f_i(x)$ where each $f_i$ is only known to one individual agent $i$ out of a connected network of $n$ agents. All the agents shall collaboratively solve this problem and obtain the solution subject to data exchanges restricted to between neighboring agents. Such algorithms avoid the need of a fusion center, offer better network load balance, and improve data privacy. We study the decentralized gradient descent method in which each agent $i$ updates its variable $x_{(i)}$, which is a local approximation of the unknown variable $x$, by combining the average of its neighbors' variables with the negative gradient step $-\alpha \nabla f_i(x_{(i)})$. The iteration is $$x_{(i)}(k+1) \gets \sum_{\text{neighbor } j \text{ of } i} w_{ij} x_{(j)}(k) - \alpha \nabla f_i(x_{(i)}(k)),\quad\text{for each agent } i,$$ where the averaging coefficients form a symmetric doubly stochastic matrix $W=[w_{ij}] \in \mathbb{R}^{n \times n}$. We analyze the convergence of this iteration and derive its convergence rate, assuming that each $f_i$ is proper closed convex and lower bounded, $\nabla f_i$ is Lipschitz continuous with constant $L_{f_i}$, and stepsize $\alpha$ is fixed. Provided that $\alpha < O(1/L_h)$ where $L_h=\max_i\{L_{f_i}\}$, the objective error at the averaged solution, $f(\frac{1}{n}\sum_i x_{(i)}(k))-f^*$, reduces at a speed of $O(1/k)$ until it reaches $O(\alpha)$. If $f_i$ are further (restricted) strongly convex, then both $\frac{1}{n}\sum_i x_{(i)}(k)$ and each $x_{(i)}(k)$ converge to the global minimizer $x^*$ at a linear rate until reaching an $O(\alpha)$-neighborhood of $x^*$. We also develop an iteration for decentralized basis pursuit and establish its linear convergence to an $O(\alpha)$-neighborhood of the true unknown sparse signal.

439 citations
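
The iteration in the abstract above (mix with neighbors through $W$, then take a local gradient step of size $\alpha$) can be sketched as follows. Everything specific here is an assumption for illustration: the function name `dgd`, the four-agent ring network with weights 1/3, and the scalar quadratics $f_i(x) = \frac{1}{2}(x - c_i)^2$, whose sum is minimized at the mean of the $c_i$.

```python
def dgd(W, grads, x0, alpha, iters):
    """Decentralized gradient descent sketch with one scalar variable per agent."""
    x = list(x0)
    n = len(x)
    for _ in range(iters):
        # Each agent averages the variables it receives, weighted by its row of W.
        mixed = [sum(W[i][j] * x[j] for j in range(n)) for i in range(n)]
        # Each agent then applies its own negative-gradient step.
        x = [mixed[i] - alpha * grads[i](x[i]) for i in range(n)]
    return x


# Hypothetical local objectives f_i(x) = 0.5 * (x - c_i)^2, so grad f_i(x) = x - c_i.
targets = [1.0, 2.0, 3.0, 6.0]
grads = [lambda x, c=c: x - c for c in targets]

# Ring of four agents: symmetric, doubly stochastic mixing matrix.
third = 1.0 / 3.0
W = [
    [third, third, 0.0, third],
    [third, third, third, 0.0],
    [0.0, third, third, third],
    [third, 0.0, third, third],
]

local = dgd(W, grads, x0=[0.0] * 4, alpha=0.05, iters=500)
average = sum(local) / len(local)
optimum = sum(targets) / len(targets)  # minimizer of the summed objective
```

With a fixed step size the local variables do not agree exactly. In this toy run they settle in a neighborhood of the minimizer whose size scales with $\alpha$, while their average lands on the minimizer, in line with the behavior the abstract describes.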


Posted Content
TL;DR: This paper extends the recently developed firefly algorithm to solve multiobjective optimization problems, validates the proposed approach using a selected subset of test functions, and then applies it to solve design optimization benchmarks.
Abstract: Design problems in industrial engineering often involve a large number of design variables with multiple objectives, under complex nonlinear constraints. The algorithms for multiobjective problems can be significantly different from the methods for single objective optimization. To find the Pareto front and non-dominated set for a nonlinear multiobjective optimization problem may require significant computing effort, even for seemingly simple problems. Metaheuristic algorithms start to show their advantages in dealing with multiobjective optimization. In this paper, we extend the recently developed firefly algorithm to solve multiobjective optimization problems. We validate the proposed approach using a selected subset of test functions and then apply it to solve design optimization benchmarks. We will discuss our results and provide topics for further research.

358 citations


Posted Content
TL;DR: This work develops a broadcast-based algorithm, termed the subgradient-push, which steers every node to an optimal value under a standard assumption of subgradient boundedness. The algorithm converges at a rate of $O(\ln(t)/\sqrt{t})$, where the constant depends on the initial values at the nodes, the subgradient norms, and, more interestingly, on both the consensus speed and the imbalances of influence among the nodes.
Abstract: We consider distributed optimization by a collection of nodes, each having access to its own convex function, whose collective goal is to minimize the sum of the functions. The communications between nodes are described by a time-varying sequence of directed graphs, which is uniformly strongly connected. For such communications, assuming that every node knows its out-degree, we develop a broadcast-based algorithm, termed the subgradient-push, which steers every node to an optimal value under a standard assumption of subgradient boundedness. The subgradient-push requires no knowledge of either the number of agents or the graph sequence to implement. Our analysis shows that the subgradient-push algorithm converges at a rate of $O(\ln(t)/\sqrt{t})$, where the constant depends on the initial values at the nodes, the subgradient norms, and, more interestingly, on both the consensus speed and the imbalances of influence among the nodes.

293 citations


Posted Content
TL;DR: This paper discusses a variant of the algorithm which consists of applying a post-optimization phase to evaluate a short list of solutions generated by several independent runs of the RSG method, and shows that this modification significantly improves the large-deviation properties of the algorithm.
Abstract: In this paper, we introduce a new stochastic approximation (SA) type algorithm, namely the randomized stochastic gradient (RSG) method, for solving an important class of nonlinear (possibly nonconvex) stochastic programming (SP) problems. We establish the complexity of this method for computing an approximate stationary point of a nonlinear programming problem. We also show that this method possesses a nearly optimal rate of convergence if the problem is convex. We discuss a variant of the algorithm which consists of applying a post-optimization phase to evaluate a short list of solutions generated by several independent runs of the RSG method, and show that this modification significantly improves the large-deviation properties of the algorithm. These methods are then specialized for solving a class of simulation-based optimization problems in which only stochastic zeroth-order information is available.

293 citations


Posted Content
TL;DR: A novel asynchronous ADMM based distributed method is presented for the general formulation of a network of agents that are cooperatively solving a global optimization problem, and the method is shown to converge at the rate O(1/k).
Abstract: We consider a network of agents that are cooperatively solving a global optimization problem, where the objective function is the sum of privately known local objective functions of the agents and the decision variables are coupled via linear constraints. Recent literature focused on special cases of this formulation and studied their distributed solution through either subgradient based methods with O(1/sqrt(k)) rate of convergence (where k is the iteration number) or Alternating Direction Method of Multipliers (ADMM) based methods, which require a synchronous implementation and a globally known order on the agents. In this paper, we present a novel asynchronous ADMM based distributed method for the general formulation and show that it converges at the rate O(1/k).

273 citations


Posted Content
TL;DR: APPROX as discussed by the authors is a stochastic coordinate descent method for minimizing the sum of convex functions each of which depends on a small number of coordinates only, which can be implemented without the need to perform full-dimensional vector operations.
Abstract: We propose a new stochastic coordinate descent method for minimizing the sum of convex functions each of which depends on a small number of coordinates only. Our method (APPROX) is simultaneously Accelerated, Parallel and PROXimal; this is the first time such a method is proposed. In the special case when the number of processors is equal to the number of coordinates, the method converges at the rate $2\bar{\omega}\bar{L} R^2/(k+1)^2 $, where $k$ is the iteration counter, $\bar{\omega}$ is an average degree of separability of the loss function, $\bar{L}$ is the average of Lipschitz constants associated with the coordinates and individual functions in the sum, and $R$ is the distance of the initial point from the minimizer. We show that the method can be implemented without the need to perform full-dimensional vector operations, which is the major bottleneck of existing accelerated coordinate descent methods. The fact that the method depends on the average degree of separability, and not on the maximum degree of separability, can be attributed to the use of new safe large stepsizes, leading to improved expected separable overapproximation (ESO). These are of independent interest and can be utilized in all existing parallel stochastic coordinate descent algorithms based on the concept of ESO.

270 citations


Journal Article
TL;DR: In this article, a distributed convex optimization framework is developed for energy trading between islanded microgrids, where the problem consists of several island-grids that exchange energy flows by means of an arbitrary topology, and a subgradient-based cost minimization algorithm is proposed that converges to the optimal solution in a practical number of iterations.
Abstract: In this paper, a distributed convex optimization framework is developed for energy trading between islanded microgrids. More specifically, the problem consists of several islanded microgrids that exchange energy flows by means of an arbitrary topology. Due to scalability issues and in order to safeguard local information on cost functions, a subgradient-based cost minimization algorithm is proposed that converges to the optimal solution in a practical number of iterations and with a limited communication overhead. Furthermore, this approach allows for a very intuitive economics interpretation that explains the algorithm iterations in terms of "supply--demand model" and "market clearing". Numerical results are given in terms of convergence rate of the algorithm and attained costs for different network topologies.

251 citations


Posted Content
Vincent Duval, Gabriel Peyré
TL;DR: This paper shows that when the signal-to-noise level is large enough, and provided the aforementioned dual certificate is non-degenerate, the solution of the discretized problem is supported on pairs of Diracs which are neighbors of the Diracs of the input measure, as the grid size tends to zero.
Abstract: This paper studies sparse spikes deconvolution over the space of measures. We focus our attention on the recovery properties of the support of the measure, i.e. the location of the Dirac masses. For non-degenerate sums of Diracs, we show that, when the signal-to-noise ratio is large enough, total variation regularization (which is the natural extension of the L1 norm of vectors to the setting of measures) recovers the exact same number of Diracs. We also show that both the locations and the heights of these Diracs converge toward those of the input measure when the noise drops to zero. The exact speed of convergence is governed by a specific dual certificate, which can be computed by solving a linear system. We draw connections between the support of the recovered measure on a continuous domain and on a discretized grid. We show that when the signal-to-noise level is large enough, the solution of the discretized problem is supported on pairs of Diracs which are neighbors of the Diracs of the input measure. This gives a precise description of the convergence of the solution of the discretized problem toward the solution of the continuous grid-free problem, as the grid size tends to zero.

Journal Article
TL;DR: This work formulates a constrained optimization that aims to minimize power losses subject to finite inverter capacity and upper and lower voltage limits at all nodes in the circuit. It explores protocols based on the dual-ascent method and on the alternating direction method of multipliers (ADMM), finding that the ADMM protocol performs significantly better.
Abstract: We formulate the control of reactive power generation by photovoltaic inverters in a power distribution circuit as a constrained optimization that aims to minimize reactive power losses subject to finite inverter capacity and upper and lower voltage limits at all nodes in the circuit. When voltage variations along the circuit are small and losses of both real and reactive powers are small compared to the respective flows, the resulting optimization problem is convex. Moreover, the cost function is separable enabling a distributed, on-line implementation with node-local computations using only local measurements augmented with limited information from the neighboring nodes communicated over cyber channels. Such an approach lies between the fully centralized and local policy approaches previously considered. We explore protocols based on the dual ascent method and on the Alternating Direction Method of Multipliers (ADMM) and find that the ADMM protocol performs significantly better.

Posted Content
TL;DR: A variation of the receding-horizon control law is proposed to deal with the replay attackers who maliciously repeat the messages sent from the operator to the actuator in resilient networked control systems.
Abstract: This paper studies a resilient control problem for discrete-time, linear time-invariant systems subject to state and input constraints. State measurements and control commands are transmitted over a communication network and could be corrupted by adversaries. In particular, we consider the replay attackers who maliciously repeat the messages sent from the operator to the actuator. We propose a variation of the receding-horizon control law to deal with the replay attacks and analyze the resulting system performance degradation. A class of competitive (resp. cooperative) resource allocation problems for resilient networked control systems is also investigated.

Posted Content
TL;DR: A systematic method for determining the active- and reactive-power set points for PV inverters in residential systems is proposed, with the objective of optimizing the operation of the distribution feeder and ensuring voltage regulation.
Abstract: Low-voltage distribution feeders were designed to sustain unidirectional power flows to residential neighborhoods. The increased penetration of roof-top photovoltaic (PV) systems has highlighted pressing needs to address power quality and reliability concerns, especially when PV generation exceeds the household demand. A systematic method for determining the active- and reactive-power set points for PV inverters in residential systems is proposed in this paper, with the objective of optimizing the operation of the distribution feeder and ensuring voltage regulation. Binary PV-inverter selection variables and nonlinear power-flow relations render the novel optimal inverter dispatch problem nonconvex and NP-hard. Nevertheless, sparsity-promoting regularization approaches and semidefinite relaxation techniques are leveraged to obtain a computationally feasible convex reformulation. The merits of the proposed approach are demonstrated using real-world PV-generation and load-profile data for an illustrative low-voltage residential distribution system.

Posted Content
TL;DR: In this article, the problem of finding a small set of variables to affect with an input so that the resulting system is controllable is shown to be NP-hard, and it is shown that even approximating the minimum number of variables that need to be affected within a multiplicative factor of $c \log n$ is NP-hard for some positive $c$.
Abstract: Given a linear system, we consider the problem of finding a small set of variables to affect with an input so that the resulting system is controllable. We show that this problem is NP-hard; indeed, we show that even approximating the minimum number of variables that need to be affected within a multiplicative factor of $c \log n$ is NP-hard for some positive $c$. On the positive side, we show it is possible to find sets of variables matching this inapproximability barrier in polynomial time. This can be done by a simple greedy heuristic which sequentially picks variables to maximize the rank increase of the controllability matrix. Experiments on Erdos-Renyi random graphs demonstrate this heuristic almost always succeeds at finding the minimum number of variables.
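
The greedy heuristic mentioned in the abstract above can be sketched as below. This is an illustrative reading, not the authors' code: it assumes each input acts on exactly one state variable (so the input matrix is built from unit vectors), and the helper names `rank`, `controllability_rank`, and `greedy_inputs`, as well as the two small example systems, are hypothetical. Exact rational arithmetic is used so that the rank test is not affected by rounding.

```python
from fractions import Fraction


def rank(rows):
    """Rank of a matrix given as a list of rows, by exact Gaussian elimination."""
    m = [[Fraction(v) for v in row] for row in rows]
    r = 0
    for c in range(len(m[0])):
        pivot = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, len(m)):
            factor = m[i][c] / m[r][c]
            if factor != 0:
                m[i] = [a - factor * b for a, b in zip(m[i], m[r])]
        r += 1
        if r == len(m):
            break
    return r


def controllability_rank(A, chosen):
    """Rank of [B, AB, ..., A^(n-1) B], where B holds unit vectors for `chosen`."""
    n = len(A)
    vectors = []
    for i in chosen:
        v = [1 if k == i else 0 for k in range(n)]
        for _ in range(n):
            vectors.append(v)  # stored as rows; transposition preserves rank
            v = [sum(A[r][k] * v[k] for k in range(n)) for r in range(n)]
    return rank(vectors) if vectors else 0


def greedy_inputs(A):
    """Repeatedly add the variable that gives the largest controllability rank."""
    n = len(A)
    chosen = []
    while controllability_rank(A, chosen) < n:
        candidates = [i for i in range(n) if i not in chosen]
        best = max(candidates, key=lambda i: controllability_rank(A, chosen + [i]))
        chosen.append(best)
    return chosen


# Chain 0 -> 1 -> 2: driving variable 0 alone reaches every state.
chain = [[0, 0, 0], [1, 0, 0], [0, 1, 0]]
# Two decoupled chains 0 -> 1 and 2 -> 3: one input per chain is needed.
two_chains = [[0, 0, 0, 0], [1, 0, 0, 0], [0, 0, 0, 0], [0, 0, 1, 0]]
chain_choice = greedy_inputs(chain)
two_chain_choice = greedy_inputs(two_chains)
```

The loop always terminates: while the rank is below n, some unit vector lies outside the current controllable subspace, so adding that variable raises the rank by at least one.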

Posted Content
TL;DR: This paper proposes a framework of Markov decision process (MDP) to investigate delay-optimal charging scheduling of the electric vehicles at a charging station with multiple charge points and gives the sets of system states for which charging no demand and charging as many demands as possible are optimal.
Abstract: In the paper, we consider delay-optimal charging scheduling of the electric vehicles (EVs) at a charging station with multiple charge points. The charging station is equipped with renewable energy generation devices and can also buy energy from power grid. The uncertainty of the EV arrival, the intermittence of the renewable energy, and the variation of the grid power price are taken into account and described as independent Markov processes. Meanwhile, the charging energy for each EV is random. The goal is to minimize the mean waiting time of EVs under the long term constraint on the cost. We propose queue mapping to convert the EV queue to the charge demand queue and prove the equivalence between the minimization of the two queues' average length. Then we focus on the minimization for the average length of the charge demand queue under long term cost constraint. We propose a framework of Markov decision process (MDP) to investigate this scheduling problem. The system state includes the charge demand queue length, the charge demand arrival, the energy level in the storage battery of the renewable energy, the renewable energy arrival, and the grid power price. Additionally the number of charging demands and the allocated energy from the storage battery compose the two-dimensional policy. We derive two necessary conditions of the optimal policy. Moreover, we discuss the reduction of the two-dimensional policy to be the number of charging demands only. We give the sets of system states for which charging no demand and charging as many demands as possible are optimal, respectively. Finally we investigate the proposed radical policy and conservative policy numerically.

Journal Article
TL;DR: Both the resource allocation problems can be solved in polynomial time using geometric programming (GP) for arbitrary directed graphs of nonidentical nodes and a wide class of cost functions.
Abstract: We study the problem of containing spreading processes in arbitrary directed networks by distributing protection resources throughout the nodes of the network. We consider two types of protection resources: (i) preventive resources able to defend nodes against the spreading (such as vaccines in a viral infection process), and (ii) corrective resources able to neutralize the spreading after it has reached a node (such as antidotes). We assume that both preventive and corrective resources have an associated cost and study the problem of finding the cost-optimal distribution of resources throughout the nodes of the network. We analyze these questions in the context of viral spreading processes in directed networks. We study the following two problems: (i) given a fixed budget, find the optimal allocation of preventive and corrective resources in the network to achieve the highest level of containment, and (ii) when a budget is not specified, find the minimum budget required to control the spreading process. We show that both resource allocation problems can be solved in polynomial time using Geometric Programming (GP) for arbitrary directed graphs of nonidentical nodes and a wide class of cost functions. Furthermore, our approach allows simultaneous optimization over both preventive and corrective resources, even when the cost functions are node-dependent. We illustrate our approach by designing optimal protection strategies to contain an epidemic outbreak that propagates through an air transportation network.

Posted Content
TL;DR: SOSTOOLS v3.00 is the latest release of the freely available MATLAB toolbox for formulating and solving sum of squares (SOS) optimization problems, with the ability to create polynomial matrices and formulate polynomial matrix inequalities, and compatibility with MuPAD.
Abstract: SOSTOOLS v3.00 is the latest release of the freely available MATLAB toolbox for formulating and solving sum of squares (SOS) optimization problems. Such problems arise naturally in the analysis and control of nonlinear dynamical systems, but also in other areas such as combinatorial optimization. Highlights of the new release include the ability to create polynomial matrices and formulate polynomial matrix inequalities, compatibility with MuPAD, the new MATLAB symbolic engine, as well as the multipoly toolbox v2.01. SOSTOOLS v3.00 can interface with five semidefinite programming solvers, and includes ten demonstration examples.

Posted Content
TL;DR: In this article, the authors employ the sparsity-promoting optimal control to simultaneously identify the optimal control structure and optimize the closed-loop performance of wide-area power systems.
Abstract: Inter-area oscillations in bulk power systems are typically poorly controllable by means of local decentralized control. Recent research efforts have been aimed at developing wide-area control strategies that involve communication of remote signals. In conventional wide-area control, the control structure is fixed a priori, typically based on modal criteria. In contrast, here we employ the recently introduced paradigm of sparsity-promoting optimal control to simultaneously identify the optimal control structure and optimize the closed-loop performance. To induce a sparse control architecture, we regularize the standard quadratic performance index with an l1-penalty on the feedback matrix. The quadratic objective functions are inspired by the classic slow coherency theory and are aimed at imitating homogeneous networks without inter-area oscillations. We use the New England power grid model to demonstrate that the proposed combination of the sparsity-promoting control design with the slow coherency objectives performs almost as well as the optimal centralized control while only making use of a single wide-area communication link. In addition to this nominal performance, we also demonstrate that our control strategy yields favorable robustness margins and that it can be used to identify a sparse control architecture for control design via alternative means.

Posted Content
TL;DR: A new decomposition method is derived, called "proximal center algorithm," which from the viewpoint of efficiency estimates improves the bounds on the number of iterations of the classical dual gradient scheme by an order of magnitude.
Abstract: Dual decomposition is a powerful technique for deriving decomposition schemes for convex optimization problems with separable structure. Although the Augmented Lagrangian is computationally more stable than the ordinary Lagrangian, the prox-term destroys the separability of the given problem. In this paper we use another approach to obtain a smooth Lagrangian, based on a smoothing technique developed by Nesterov, which preserves separability of the problem. With this approach we derive a new decomposition method, called proximal center algorithm, which from the viewpoint of efficiency estimates improves the bounds on the number of iterations of the classical dual gradient scheme by an order of magnitude. This can be achieved with the new decomposition algorithm since the resulting dual function has good smoothness properties and since we make use of the particular structure of the given problem.

Posted Content
Corentin Briat
TL;DR: In this paper, the authors proposed a hybrid framework for stability analysis and control of linear impulsive systems, through the use of continuous-time time-varying discontinuous Lyapunov functions.
Abstract: Stability analysis and control of linear impulsive systems is addressed in a hybrid framework, through the use of continuous-time time-varying discontinuous Lyapunov functions. Necessary and sufficient conditions for stability of impulsive systems with periodic impulses are first provided in order to set up the main ideas. Extensions to stability of aperiodic systems under minimum, maximum and ranged dwell-times are then derived. By exploiting further the particular structure of the stability conditions, the results are non-conservatively extended to quadratic stability analysis of linear uncertain impulsive systems. These stability criteria are, in turn, losslessly extended to stabilization using a particular, yet broad enough, class of state-feedback controllers, providing then a convex solution to the open problem of robust dwell-time stabilization of impulsive systems using hybrid stability criteria. Relying finally on the representability of sampled-data systems as impulsive systems, the problems of robust stability analysis and robust stabilization of periodic and aperiodic uncertain sampled-data systems are straightforwardly solved using the same ideas. Several examples are discussed in order to show the effectiveness and reduced complexity of the proposed approach.

Journal Article
TL;DR: In this article, a unified view of model-free control and intelligent PID controllers is presented, where the new advances are taken into account, and several numerical simulations are presented which demonstrate not only the power of intelligent controllers but also the great simplicity for tuning them.
Abstract: "Model-free control" and the corresponding "intelligent" PID controllers (iPIDs), which already had many successful concrete applications, are presented here for the first time in a unified manner, where the new advances are taken into account. The basics of model-free control now employ some old functional analysis and some elementary differential algebra. The estimation techniques become quite straightforward via a recent online parameter identification approach. The importance of iPIs and especially of iPs is deduced from the presence of friction. The strange industrial ubiquity of classic PIDs and the great difficulty of tuning them in complex situations are deduced, via an elementary sampling, from their connections with iPIDs. Several numerical simulations are presented which include some infinite-dimensional systems. They demonstrate not only the power of our intelligent controllers but also the great simplicity of tuning them.

Posted Content
TL;DR: An efficient stochastic dynamic programming model is introduced to optimally charge an electric vehicle while accounting for the uncertainty inherent to its use and it is shown that the randomness intrinsic to driving needs has a substantial impact on the charging strategy to be implemented.
Abstract: The combination of electric vehicles (EVs) and renewable energy is taking shape as a potential driver for a future free of fossil fuels. However, the efficient management of the EV fleet is not exempt from challenges. It calls for the involvement of all actors directly or indirectly related to the energy and transportation sectors, ranging from governments, automakers and transmission system operators, to the ultimate beneficiary of the change: the end-user. An EV is primarily to be used to satisfy driving needs, and accordingly charging policies must be designed primarily for this purpose. The charging models presented in the technical literature, however, overlook the stochastic nature of driving patterns. Here we introduce an efficient stochastic dynamic programming model to optimally charge an EV while accounting for the uncertainty inherent to its use. With this aim in mind, driving patterns are described by an inhomogeneous Markov model that is fitted using data collected from the utilization of an EV. We show that the randomness intrinsic to driving needs has a substantial impact on the charging strategy to be implemented.

Posted Content
TL;DR: In this article, the authors present necessary and sufficient conditions for the solvability of discrete time, mean field, stochastic linear-quadratic optimal control problems, where the optimal control is a linear state feedback.
Abstract: This paper first presents necessary and sufficient conditions for the solvability of discrete time, mean-field, stochastic linear-quadratic optimal control problems. Then, by introducing several sequences of bounded linear operators, the problem becomes an operator stochastic LQ problem, in which the optimal control is a linear state feedback. Furthermore, from the form of the optimal control, the problem changes to a matrix dynamic optimization problem. Solving this optimization problem, we obtain the optimal feedback gain and thus the optimal control. Finally, by completing the square, the optimality of the above control is validated.

Posted Content
TL;DR: In this paper, the authors generalize the alternating direction method of multipliers (ADMM) to Bregman ADMM (BADMM), which allows the choice of different Bregman divergences to exploit the structure of problems.
Abstract: The mirror descent algorithm (MDA) generalizes gradient descent by using a Bregman divergence to replace squared Euclidean distance. In this paper, we similarly generalize the alternating direction method of multipliers (ADMM) to Bregman ADMM (BADMM), which allows the choice of different Bregman divergences to exploit the structure of problems. BADMM provides a unified framework for ADMM and its variants, including generalized ADMM, inexact ADMM and Bethe ADMM. We establish the global convergence and the $O(1/T)$ iteration complexity for BADMM. In some cases, BADMM can be faster than ADMM by a factor of $O(n/\log(n))$. In solving the linear program of the mass transportation problem, BADMM leads to massive parallelism and can easily run on a GPU. BADMM is several times faster than the highly optimized commercial software Gurobi.
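
To make the abstract's opening remark about mirror descent concrete, the following is a minimal sketch of MDA only (it is not BADMM and is not taken from the paper). It assumes a toy problem: minimizing a linear objective over the probability simplex with the KL divergence as the Bregman divergence. The function name `entropic_mirror_descent`, the step size `eta`, and the cost vector are illustrative choices.

```python
import math

def entropic_mirror_descent(c, eta=0.5, iters=100):
    """Minimize <c, x> over the probability simplex (toy problem, an assumption).

    With the KL divergence as the Bregman divergence, the mirror descent
    step becomes a multiplicative (exponentiated-gradient) update.
    """
    n = len(c)
    x = [1.0 / n] * n  # start at the uniform distribution
    for _ in range(iters):
        # The gradient of <c, x> is c. The KL-based update scales each
        # coordinate by exp(-eta * c_i) and renormalizes onto the simplex.
        w = [xi * math.exp(-eta * ci) for xi, ci in zip(x, c)]
        total = sum(w)
        x = [wi / total for wi in w]
    return x

# The mass concentrates on the coordinate with the smallest cost (index 1 here).
x = entropic_mirror_descent([0.3, 0.1, 0.5])
```

With squared Euclidean distance in place of the KL divergence, the same scheme would reduce to projected gradient descent, which is the sense in which MDA generalizes it.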

Proceedings ArticleDOI
TL;DR: This paper studies the minimum number of node failures needed to cause total blackout, and shows that in the case of unidirectional interdependency between the networks the problem is NP-hard, and develops heuristics to find a near-optimal solution.
Abstract: In this paper, we study the robustness of interdependent networks, in which the state of one network depends on the state of the other network and vice versa. In particular, we focus on the interdependency between the power grid and communication networks, where the grid depends on communications for its control, and the communication network depends on the grid for power. A real-world example is the Italian blackout of 2003, when a small failure in the power grid cascaded between the two networks and led to a massive blackout. In this paper, we study the minimum number of node failures needed to cause total blackout (i.e., all nodes in both networks to fail). In the case of unidirectional interdependency between the networks we show that the problem is NP-hard, and develop heuristics to find a near-optimal solution. On the other hand, we show that in the case of bidirectional interdependency this problem can be solved in polynomial time. We believe that this new interdependency model gives rise to important, yet unexplored, robust network design problems for interdependent networked infrastructures.

Posted Content
TL;DR: This paper formally establishes the theoretical optimality or near-optimality, in the large-scale case, of the CG method and its variants for solving different classes of CP problems, including smooth, nonsmooth and certain saddle-point problems.
Abstract: This paper considers a general class of iterative optimization algorithms, referred to as linear-optimization-based convex programming (LCP) methods, for solving large-scale convex programming (CP) problems. The LCP methods, covering the classic conditional gradient (CG) method (a.k.a. the Frank-Wolfe method) as a special case, can only solve a linear optimization subproblem at each iteration. In this paper, we first establish a series of lower complexity bounds for the LCP methods to solve different classes of CP problems, including smooth, nonsmooth and certain saddle-point problems. We then formally establish the theoretical optimality or near-optimality, in the large-scale case, of the CG method and its variants for solving different classes of CP problems. We also introduce several new optimal LCP methods, obtained by properly modifying Nesterov's accelerated gradient method, and demonstrate their possible advantages over the classic CG for solving certain classes of large-scale CP problems.
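
As an illustration of what "solving a linear optimization subproblem at each iteration" means, here is a minimal sketch of the classic CG (Frank-Wolfe) method. It is a toy example under stated assumptions, not the paper's new LCP methods: the objective $0.5\|x - b\|^2$, the probability simplex as the feasible set, the standard step size $2/(k+2)$, and the helper names `lmo_simplex` and `frank_wolfe` are all illustrative choices.

```python
def lmo_simplex(grad):
    """Linear minimization oracle over the probability simplex.

    Minimizing <grad, s> over the simplex is attained at a vertex e_i,
    where i indexes the smallest gradient entry.
    """
    i = min(range(len(grad)), key=lambda j: grad[j])
    return [1.0 if j == i else 0.0 for j in range(len(grad))]

def frank_wolfe(b, iters=2000):
    """Minimize 0.5 * ||x - b||^2 over the simplex (toy objective, an assumption)."""
    n = len(b)
    x = [1.0 / n] * n  # feasible starting point
    for k in range(iters):
        grad = [xi - bi for xi, bi in zip(x, b)]  # gradient of the quadratic
        s = lmo_simplex(grad)                     # the only subproblem solved
        gamma = 2.0 / (k + 2.0)                   # standard open-loop step size
        # A convex combination of feasible points stays feasible, so no projection is needed.
        x = [(1.0 - gamma) * xi + gamma * si for xi, si in zip(x, s)]
    return x

# b lies in the simplex, so the minimizer is b itself.
x = frank_wolfe([0.2, 0.5, 0.3])
```

Each iterate is a convex combination of simplex vertices, which is why the method never needs a projection step and only ever calls the linear oracle.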

Journal ArticleDOI
TL;DR: Two methods based on blind inpainting and $\ell_0$ minimization are proposed that can simultaneously find the damaged pixels and restore the image by iteratively restoring the image and updating the set of damaged pixels.
Abstract: This article studies the problem of image restoration of observed images corrupted by impulse noise and mixed Gaussian impulse noise. Since the pixels damaged by impulse noise contain no information about the true image, how to find this set correctly is a very important problem. We propose two methods based on blind inpainting and $\ell_0$ minimization that can simultaneously find the damaged pixels and restore the image. By iteratively restoring the image and updating the set of damaged pixels, these methods have better performance than other methods, as shown in the experiments. In addition, we provide convergence analysis for these methods: the algorithms converge to coordinatewise minimum points. With some modifications to the algorithms, they also converge to local minimum points (or do so with probability one).

Posted Content
TL;DR: It is shown that under these assumptions the basic stochastic gradient method with a sufficiently small constant step-size has an $O(1/k)$ convergence rate, and has a linear convergence rate if the objective $f$ is strongly convex.
Abstract: We consider optimizing a smooth convex function $f$ that is the average of a set of differentiable functions $f_i$, under the assumption considered by Solodov [1998] and Tseng [1998] that the norm of each gradient $f_i'$ is bounded by a linear function of the norm of the average gradient $f'$. We show that under these assumptions the basic stochastic gradient method with a sufficiently small constant step-size has an $O(1/k)$ convergence rate, and has a linear convergence rate if $f$ is strongly convex.
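
The gradient-growth assumption can be illustrated with a small sketch. It is a toy example, not an experiment from the paper. It assumes a one-dimensional least-squares problem with $f_i(x) = 0.5\,(a_i x - a_i x^*)^2$, so that $f_i'(x) = a_i^2 (x - x^*)$ and $|f_i'(x)| \le B\,|f'(x)|$ with $B = \max_i a_i^2 / \mathrm{mean}_i\, a_i^2$. The function name `sgd_constant_step`, the data, and the step size are illustrative choices.

```python
import random

def sgd_constant_step(a, x_star, step, iters=200, seed=0):
    """Basic stochastic gradient method with a constant step size.

    Toy setting (an assumption): f_i(x) = 0.5 * (a_i * x - a_i * x_star)^2,
    so every f_i is minimized at x_star and all gradients vanish there.
    """
    rng = random.Random(seed)  # fixed seed for reproducibility
    x = 0.0
    for _ in range(iters):
        i = rng.randrange(len(a))           # sample one term uniformly
        grad_i = a[i] ** 2 * (x - x_star)   # f_i'(x)
        x -= step * grad_i                  # constant step, no decay schedule
    return x

a = [1.0, 2.0, 3.0]
# With step = 1 / max(a_i^2), each update multiplies the error by a factor in [0, 1).
x_hat = sgd_constant_step(a, x_star=2.0, step=1.0 / max(ai ** 2 for ai in a))
```

In this toy case the error shrinks by a factor of at most $8/9$ per iteration whichever term is sampled, which is a linear rate; the average $f$ is strongly convex here because the mean of the $a_i^2$ is positive.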

Posted Content
TL;DR: An equivalent Kalman filter for sensor selection is derived, i.e., a generalized information filter, and it is proved that the multistage look-ahead policy that minimizes either the final or the average estimation error covariances of the next multiple time steps is equivalent to a myopic sensor selection policy that maximizes the trace of the generalized information gain at each time step.
Abstract: In this paper, sensor selection problems for target tracking in large sensor networks with linear equality or inequality constraints are considered. First, we derive an equivalent Kalman filter for sensor selection, i.e., a generalized information filter. Then, under a regularity condition, we prove that the multistage look-ahead policy that minimizes either the final or the average estimation error covariances of the next multiple time steps is equivalent to a myopic sensor selection policy that maximizes the trace of the generalized information gain at each time step. Moreover, when the measurement noises are uncorrelated between sensors, the optimal solution can be obtained analytically for sensor selection when constraints are temporally separable. When constraints are temporally inseparable, sensor selections can be obtained by approximately solving a linear programming problem so that the sensor selection problem for a large sensor network can be dealt with quickly. Although there is no guarantee that the gap between the performance of the chosen subset and the performance bound is always small, numerical examples suggest that the algorithm is near-optimal in many cases. Finally, when the measurement noises are correlated between sensors, the sensor selection problem with temporally inseparable constraints can be relaxed to a Boolean quadratic programming problem which can be efficiently solved by a Gaussian randomization procedure along with solving a semi-definite programming problem. Numerical examples show that the proposed method performs much better than the method that ignores the dependence between noises.