Journal Article
A trust region algorithm with a worst-case iteration complexity of $\mathcal{O}(\epsilon^{-3/2})$ for nonconvex optimization
TL;DR: The proposed trust region algorithm, entitled TRACE, follows a trust region framework but employs modified step acceptance criteria and a novel trust region update mechanism, which allow it to achieve a worst-case global complexity bound of $\mathcal{O}(\epsilon^{-3/2})$.
Abstract:
We propose a trust region algorithm for solving nonconvex smooth optimization problems. For any $\overline{\epsilon} \in (0,\infty)$, the algorithm requires at most $\mathcal{O}(\epsilon^{-3/2})$ iterations, function evaluations, and derivative evaluations to drive the norm of the gradient of the objective function below any $\epsilon \in (0,\overline{\epsilon}]$. This improves upon the $\mathcal{O}(\epsilon^{-2})$ bound known to hold for some other trust region algorithms and matches the $\mathcal{O}(\epsilon^{-3/2})$ bound for the recently proposed Adaptive Regularisation framework using Cubics, also known as the ARC algorithm. Our algorithm, entitled TRACE, follows a trust region framework, but employs modified step acceptance criteria and a novel trust region update mechanism that allow the algorithm to achieve such a worst-case global complexity bound. Importantly, we prove that our algorithm also attains global and fast local convergence guarantees under similar assumptions as for other trust region algorithms. We also prove a worst-case upper bound on the number of iterations, function evaluations, and derivative evaluations that the algorithm requires to obtain an approximate second-order stationary point.
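The abstract describes TRACE as a trust region method with modified acceptance and update rules. As background, the generic trust region iteration it builds on can be sketched as follows; this is a minimal illustrative sketch of a *classical* trust region loop (with a simple Cauchy-point subproblem solver and standard ratio-test radius update), not TRACE's actual acceptance criteria or update mechanism, which are given in the paper.

```python
import numpy as np

def solve_subproblem(g, H, delta):
    """Cauchy-point step: minimize the quadratic model along -g
    subject to the trust region radius ||s|| <= delta."""
    gHg = g @ H @ g
    gn = np.linalg.norm(g)
    tau = 1.0 if gHg <= 0 else min(1.0, gn**3 / (delta * gHg))
    return -(tau * delta / gn) * g

def trust_region_minimize(f, grad, hess, x0, delta0=1.0, eps=1e-6, max_iter=200):
    """Classical trust region loop (illustrative; not the TRACE rules)."""
    x, delta = np.asarray(x0, dtype=float), delta0
    for _ in range(max_iter):
        g, H = grad(x), hess(x)
        if np.linalg.norm(g) < eps:
            break
        s = solve_subproblem(g, H, delta)
        pred = -(g @ s + 0.5 * s @ H @ s)          # model (predicted) decrease
        ared = f(x) - f(x + s)                      # actual decrease
        rho = ared / pred if pred > 0 else -np.inf  # agreement ratio
        if rho > 0.1:                               # accept the step
            x = x + s
        if rho > 0.75 and np.isclose(np.linalg.norm(s), delta):
            delta *= 2.0                            # model trusted: expand
        elif rho < 0.1:
            delta *= 0.5                            # model poor: contract
    return x
```

Standard analyses of loops like this give the $\mathcal{O}(\epsilon^{-2})$ bound mentioned above; the paper's contribution is the altered acceptance/update logic that improves this to $\mathcal{O}(\epsilon^{-3/2})$.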
Citations
Journal Article
Theoretical Insights Into the Optimization Landscape of Over-Parameterized Shallow Neural Networks
TL;DR: In this paper, the problem of learning a shallow neural network that best fits a training data set was studied in the over-parameterized regime, where the number of observations is smaller than the number of parameters in the model.
Journal Article
Nonconvex Optimization Meets Low-Rank Matrix Factorization: An Overview
Yuejie Chi, Yue Lu, Yuxin Chen, et al.
TL;DR: This tutorial-style overview highlights the important role of statistical models in enabling efficient nonconvex optimization with performance guarantees and reviews two contrasting approaches: two-stage algorithms, which consist of a tailored initialization step followed by successive refinement; and global landscape analysis and initialization-free algorithms.
Proceedings Article
How to escape saddle points efficiently
TL;DR: In this article, the authors show that perturbed gradient descent can escape saddle points almost for free, in a number of iterations which depends only poly-logarithmically on dimension.
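The perturbation idea summarized above can be sketched concretely: when the gradient is small (so the iterate may be near a strict saddle point), inject a small random perturbation so that gradient descent can slide off the saddle. This is an assumed, simplified rendering of the idea, not the exact algorithm or parameter schedule analyzed in the cited work.

```python
import numpy as np

def perturbed_gd(grad, x0, eta=0.01, g_thresh=1e-3, radius=1e-2,
                 t_noise=10, max_iter=5000, seed=0):
    """Gradient descent with occasional random perturbations
    (simplified sketch of the escape-saddle-points idea)."""
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float)
    last_perturb = -t_noise
    for t in range(max_iter):
        g = grad(x)
        # Small gradient: possibly at a saddle, so kick the iterate.
        if np.linalg.norm(g) < g_thresh and t - last_perturb >= t_noise:
            x = x + rng.uniform(-radius, radius, size=x.shape)
            last_perturb = t
        x = x - eta * grad(x)
    return x
```

On a function with a strict saddle, e.g. $f(x) = \tfrac{1}{4}x_1^4 - \tfrac{1}{2}x_1^2 + \tfrac{1}{2}x_2^2$ (saddle at the origin, minima at $(\pm 1, 0)$), plain gradient descent started at the origin never moves, while the perturbed variant escapes toward a minimizer.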
Journal Article
Global rates of convergence for nonconvex optimization on manifolds
TL;DR: Presents the first deterministic results on global rates of convergence to approximate first- and second-order Karush-Kuhn-Tucker points on manifolds; the results apply in particular to optimization constrained to compact submanifolds of $\mathbb{R}^n$, under simpler assumptions.
Journal Article
Newton-type methods for non-convex optimization under inexact Hessian information
TL;DR: In this article, the authors consider variants of trust-region and adaptive cubic regularization methods for non-convex optimization in which the Hessian matrix is approximated, and provide iteration complexity bounds for achieving $\varepsilon$-approximate second-order optimality, which have been shown to be tight.
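Both the ARC algorithm mentioned in the abstract and the cubic regularization methods in the entry above rely on (approximately) minimizing a cubic-regularized model $m(s) = g^\top s + \tfrac{1}{2}s^\top H s + \tfrac{\sigma}{3}\|s\|^3$ at each iteration. A minimal sketch of one way to compute this step for small dense problems, assuming exact derivatives: the global minimizer satisfies $(H + \lambda I)s = -g$ with $\lambda = \sigma\|s\|$ and $H + \lambda I \succeq 0$, so one can bisect on $\lambda$. (This ignores the degenerate "hard case" and is illustrative, not any paper's implementation.)

```python
import numpy as np

def cubic_reg_step(g, H, sigma):
    """Minimize g's + 1/2 s'Hs + (sigma/3)||s||^3 by bisection on lambda,
    where the optimal step satisfies (H + lambda*I)s = -g, lambda = sigma*||s||."""
    n = len(g)
    lam_min = max(0.0, -np.linalg.eigvalsh(H)[0])  # keep H + lam*I PSD

    def step_norm(lam):
        return np.linalg.norm(np.linalg.solve(H + lam * np.eye(n), -g))

    # phi(lam) = sigma*||s(lam)|| - lam is decreasing; bracket its root.
    lo, hi = lam_min + 1e-12, lam_min + 1.0
    while sigma * step_norm(hi) > hi:
        hi *= 2.0
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if sigma * step_norm(mid) > mid:
            lo = mid
        else:
            hi = mid
    lam = 0.5 * (lo + hi)
    return np.linalg.solve(H + lam * np.eye(n), -g)
```

In ARC-type methods the regularization weight $\sigma$ plays the role the radius $\delta$ plays in trust region methods: it is adapted between iterations based on how well the model predicted the actual decrease.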
References
Book
Numerical Optimization
Jorge Nocedal, Stephen J. Wright
TL;DR: Numerical Optimization presents a comprehensive and up-to-date description of the most effective methods in continuous optimization, responding to the growing interest in optimization in engineering, science, and business by focusing on the methods that are best suited to practical problems.
Book
Numerical Methods for Unconstrained Optimization and Nonlinear Equations (Classics in Applied Mathematics, 16)
TL;DR: In this book, Dennis and Schnabel present a modular system of algorithms for unconstrained minimization and systems of nonlinear equations, built around Newton's method.
Book
Nonlinear Programming: Theory and Algorithms
TL;DR: The book is a solid reference for professionals as well as a useful text for students in the fields of operations research, management science, industrial engineering, applied mathematics, and also in engineering disciplines that deal with analytical optimization techniques.
Book
Numerical methods for unconstrained optimization and nonlinear equations
TL;DR: Covers Newton's method for nonlinear equations and unconstrained minimization, and methods for solving nonlinear least-squares problems with special structure.
Related Papers (5)
Cubic regularization of Newton method and its global performance
Yurii Nesterov, Boris T. Polyak