An Online Convex Optimization Approach to Proactive Network Resource Allocation

doi:10.1109/TSP.2017.2750109

Citations

PDF

Open Access

More filters

Posted Content•

Introduction to Multi-Armed Bandits

[...]

Aleksandrs Slivkins¹•Institutions (1)

Microsoft¹

15 Apr 2019-arXiv: Learning

TL;DR: In this article, a more introductory, textbook-like treatment of multi-armed bandits is provided, with a self-contained, teachable technical introduction and a brief review of further developments; many of the chapters conclude with exercises.

...read moreread less

Abstract: Multi-armed bandits a simple but very powerful framework for algorithms that make decisions over time under uncertainty. An enormous body of work has accumulated over the years, covered in several books and surveys. This book provides a more introductory, textbook-like treatment of the subject. Each chapter tackles a particular line of work, providing a self-contained, teachable technical introduction and a brief review of the further developments; many of the chapters conclude with exercises. The book is structured as follows. The first four chapters are on IID rewards, from the basic model to impossibility results to Bayesian priors to Lipschitz rewards. The next three chapters cover adversarial rewards, from the full-feedback version to adversarial bandits to extensions with linear rewards and combinatorially structured actions. Chapter 8 is on contextual bandits, a middle ground between IID and adversarial bandits in which the change in reward distributions is completely explained by observable contexts. The last three chapters cover connections to economics, from learning in repeated games to bandits with supply/budget constraints to exploration in the presence of incentives. The appendix provides sufficient background on concentration and KL-divergence. The chapters on "bandits with similarity information", "bandits with knapsacks" and "bandits and agents" can also be consumed as standalone surveys on the respective topics.

...read moreread less

152 citations

Journal Article•DOI•

Online Primal-Dual Methods With Measurement Feedback for Time-Varying Convex Optimization

[...]

Andrey Bernstein¹, Emiliano Dall'Anese², Andrea Simonetto³•Institutions (3)

National Renewable Energy Laboratory¹, University of Colorado Boulder², IBM³

01 Apr 2019-IEEE Transactions on Signal Processing

TL;DR: This paper addresses the design and analysis of feedback-based online algorithms to control systems or networked systems based on performance objectives and engineering constraints that may evolve over time using the emerging time-varying convex optimization formalism.

...read moreread less

Abstract: This paper addresses the design and analysis of feedback-based online algorithms to control systems or networked systems based on performance objectives and engineering constraints that may evolve over time. The emerging time-varying convex optimization formalism is leveraged to model optimal operational trajectories of the systems, as well as explicit local and network-level operational constraints. Departing from existing batch and feed-forward optimization approaches, the design of the algorithms capitalizes on an online implementation of primal-dual projected-gradient methods; the gradient steps are, however, suitably modified to accommodate feedback from the system in the form of measurements, hence, the term “online optimization with feedback.” By virtue of this approach, the resultant algorithms can cope with model mismatches in the algebraic representation of the system states and outputs, they avoid pervasive measurements of exogenous inputs, and they naturally lend themselves to a distributed implementation. Under suitable assumptions, analytical convergence claims are established in terms of dynamic regret. Furthermore, when the synthesis of the feedback-based online algorithms is based on a regularized Lagrangian function, $\boldsymbol{Q}$ -linear convergence to solutions of the time-varying optimization problem is shown.

...read moreread less

93 citations

Journal Article•DOI•

Bandit Convex Optimization for Scalable and Dynamic IoT Management

[...]

Tianyi Chen¹, Georgios B. Giannakis¹•Institutions (1)

University of Minnesota¹

01 Feb 2019-IEEE Internet of Things Journal

TL;DR: Numerical tests in fog computation offloading tasks corroborate that the proposed BanSaP approach offers competitive performance relative to existing approaches that are based on gradient feedback.

...read moreread less

Abstract: This paper deals with online convex optimization involving both time-varying loss functions, and time-varying constraints. The loss functions are not fully accessible to the learner, and instead only the function values (also known as bandit feedback) are revealed at queried points. The constraints are revealed after making decisions, and can be instantaneously violated, yet they must be satisfied in the long term. This setting fits nicely the emerging online network tasks such as fog computing in the Internet-of-Things, where online decisions must flexibly adapt to the changing user preferences (loss functions), and the temporally unpredictable availability of resources (constraints). Tailored for such human-in-the-loop systems where the loss functions are hard to model, a family of online bandit saddle-point (BanSaP) schemes are developed, which adaptively adjust the online operations based on (possibly multiple) bandit feedback of the loss functions, and the changing environment. Performance here is assessed by: 1) dynamic regret that generalizes the widely used static regret and 2) fit that captures the accumulated amount of constraint violations. Specifically, BanSaP is proved to simultaneously yield sublinear dynamic regret and fit, provided that the best dynamic solutions vary slowly over time. Numerical tests in fog computation offloading tasks corroborate that our proposed BanSaP approach offers competitive performance relative to existing approaches that are based on gradient feedback.

...read moreread less

86 citations

Journal Article•DOI•

Learning and Management for Internet of Things: Accounting for Adaptivity and Scalability

[...]

Tianyi Chen¹, Sergio Barbarossa², Xin Wang, Georgios B. Giannakis¹, Zhi-Li Zhang¹ - Show less +1 more•Institutions (2)

University of Minnesota¹, Sapienza University of Rome²

21 Feb 2019

TL;DR: In this article, a unified framework for online learning and management policies in IoT through joint advances in communication, networking, learning, and optimization is proposed, which enables smart devices to have proximity access to cloud functionalities at the network edge along the cloud-to-things continuum.

...read moreread less

Abstract: Internet of Things (IoT) envisions an intelligent infrastructure of networked smart devices offering task-specific monitoring and control services. The unique features of IoT include extreme heterogeneity, massive number of devices, and unpredictable dynamics partially due to human interaction. These call for foundational innovations in network design and management. Ideally, it should allow efficient adaptation to changing environments, and low-cost implementation scalable to a massive number of devices, subject to stringent latency constraints. To this end, the overarching goal of this paper is to outline a unified framework for online learning and management policies in IoT through joint advances in communication, networking, learning, and optimization. From the network architecture vantage point, the unified framework leverages a promising fog architecture that enables smart devices to have proximity access to cloud functionalities at the network edge, along the cloud-to-things continuum. From the algorithmic perspective, key innovations target online approaches adaptive to different degrees of nonstationarity in IoT dynamics, and their scalable model-free implementation under limited feedback that motivates blind or bandit approaches. The proposed framework aspires to offer a stepping stone that leads to systematic designs and analysis of task-specific learning and management schemes for IoT, along with a host of new research directions to build on.

...read moreread less

76 citations

Journal Article•DOI•

Distributed Online Convex Optimization With Time-Varying Coupled Inequality Constraints

[...]

Xinlei Yi¹, Xiuxian Li², Lihua Xie², Karl Henrik Johansson¹•Institutions (2)

Royal Institute of Technology¹, Nanyang Technological University²

06 Jan 2020-IEEE Transactions on Signal Processing

TL;DR: This paper proves that the distributed online primal-dual dynamic mirror descent algorithm achieves sublinear dynamic regret and constraint violation if the accumulated dynamic variation of the optimal sequence also grows sublinearly, and achieves smaller bounds on the constraint violation.

...read moreread less

Abstract: This paper considers distributed online optimization with time-varying coupled inequality constraints. The global objective function is composed of local convex cost and regularization functions and the coupled constraint function is the sum of local convex functions. A distributed online primal-dual dynamic mirror descent algorithm is proposed to solve this problem, where the local cost, regularization, and constraint functions are held privately and revealed only after each time slot. Without assuming Slater's condition, we first derive regret and constraint violation bounds for the algorithm and show how they depend on the stepsize sequences, the accumulated dynamic variation of the comparator sequence, the number of agents, and the network connectivity. As a result, under some natural decreasing stepsize sequences, we prove that the algorithm achieves sublinear dynamic regret and constraint violation if the accumulated dynamic variation of the optimal sequence also grows sublinearly. We also prove that the algorithm achieves sublinear static regret and constraint violation under mild conditions. Assuming Slater's condition, we show that the algorithm achieves smaller bounds on the constraint violation. In addition, smaller bounds on the static regret are achieved when the objective function is strongly convex. Finally, numerical simulations are provided to illustrate the effectiveness of the theoretical results.

...read moreread less

76 citations

Collapse

An Online Convex Optimization Approach to Proactive Network Resource Allocation

Citations

References

Related Papers (5)