Adaptive Online Gradient Descent

Open AccessProceedings Article

Adaptive Online Gradient Descent

- Vol. 20, pp 65-72

TLDR

An algorithm is provided, Adaptive Online Gradient Descent, which interpolates between the results of Zinkevich for linear functions and of Hazan et al for strongly convex functions, achieving intermediate rates between √T and log T and shows strong optimality of the algorithm.

Abstract:

We study the rates of growth of the regret in online convex optimization. First, we show that a simple extension of the algorithm of Hazan et al eliminates the need for a priori knowledge of the lower bound on the second derivatives of the observed functions. We then provide an algorithm, Adaptive Online Gradient Descent, which interpolates between the results of Zinkevich for linear functions and of Hazan et al for strongly convex functions, achieving intermediate rates between √T and log T. Furthermore, we show strong optimality of the algorithm. Finally, we provide an extension of our results to general norms.

Citations

PDF

Open Access

More filters

Proceedings Article

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization.

John C. Duchi, +2 more

TL;DR: Adaptive subgradient methods as discussed by the authors dynamically incorporate knowledge of the geometry of the data observed in earlier iterations to perform more informative gradient-based learning, which allows us to find needles in haystacks in the form of very predictive but rarely seen features.

...read moreread less

Journal Article

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization

John C. Duchi, +2 more

- 01 Feb 2011 -

Journal of Machine Learning Research

TL;DR: This work describes and analyze an apparatus for adaptively modifying the proximal function, which significantly simplifies setting a learning rate and results in regret guarantees that are provably as good as the best proximal functions that can be chosen in hindsight.

...read moreread less

Proceedings Article

On optimization methods for deep learning

Jiquan Ngiam, +5 more

TL;DR: It is shown that more sophisticated off-the-shelf optimization methods such as Limited memory BFGS (L-BFGS) and Conjugate gradient (CG) with line search can significantly simplify and speed up the process of pretraining deep algorithms.

...read moreread less

Proceedings Article

Dual Averaging Method for Regularized Stochastic Learning and Online Optimization

Lin Xiao

TL;DR: A new online algorithm is developed, the regularized dual averaging (RDA) method, that can explicitly exploit the regularization structure in an online setting and can be very effective for sparse online learning with l1-regularization.

...read moreread less

Posted Content

Slow Learners are Fast

John Langford, +2 more

- 03 Nov 2009 -

arXiv: Optimization and Control

TL;DR: This paper proves that online learning with delayed updates converges well, thereby facilitating parallel online learning.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Prediction, learning, and games

Nicolò Cesa-Bianchi, +1 more

TL;DR: In this paper, the authors provide a comprehensive treatment of the problem of predicting individual sequences using expert advice, a general framework within which many related problems can be cast and discussed, such as repeated game playing, adaptive data compression, sequential investment in the stock market, sequential pattern analysis, and several other problems.

...read moreread less

Proceedings Article

Online convex programming and generalized infinitesimal gradient ascent

Martin Zinkevich

TL;DR: An algorithm for convex programming is introduced, and it is shown that it is really a generalization of infinitesimal gradient ascent, and the results here imply that generalized inf initesimalgradient ascent (GIGA) is universally consistent.

...read moreread less

Journal ArticleDOI

Logarithmic regret algorithms for online convex optimization

Elad Hazan, +2 more

- 01 Dec 2007 -

Machine Learning

TL;DR: Several algorithms achieving logarithmic regret are proposed, which besides being more general are also much more efficient to implement, and give rise to an efficient algorithm based on the Newton method for optimization, a new tool in the field.

...read moreread less

Book ChapterDOI

Logarithmic regret algorithms for online convex optimization

Elad Hazan, +3 more

TL;DR: This paper proposes several algorithms achieving logarithmic regret, which besides being more general are also much more efficient to implement, and gives an efficient algorithm based on the Newton method for optimization, a new tool in the field.

...read moreread less

Proceedings Article

Convex Repeated Games and Fenchel Duality

Shai Shalev-Shwartz, +1 more

TL;DR: It is shown that various online learning and boosting algorithms can be all derived as special cases of the algorithmic framework described, which stems from a connection that is built between the notions of regret in game theory and weak duality in convex optimization.

...read moreread less

Adaptive Online Gradient Descent

Citations

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization.

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization

On optimization methods for deep learning

Dual Averaging Method for Regularized Stochastic Learning and Online Optimization

Slow Learners are Fast

References

Prediction, learning, and games

Online convex programming and generalized infinitesimal gradient ascent

Logarithmic regret algorithms for online convex optimization

Logarithmic regret algorithms for online convex optimization

Convex Repeated Games and Fenchel Duality

Related Papers (5)

Online convex programming and generalized infinitesimal gradient ascent

Logarithmic regret algorithms for online convex optimization

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization

Prediction, learning, and games

Online Learning and Online Convex Optimization