Open Access Posted Content

Symmetry, Saddle Points, and Global Geometry of Nonconvex Matrix Factorization

TLDR
A general theory for studying the geometry of nonconvex objective functions with underlying symmetric structures is proposed, and the locations of stationary points and the null space of the associated Hessian matrices are characterized through the lens of invariant groups.
Abstract
We propose a general theory for studying the geometry of nonconvex objective functions with underlying symmetric structures. Specifically, we characterize the locations of stationary points and the null space of the associated Hessian matrices through the lens of invariant groups. As a major motivating example, we apply the proposed general theory to characterize the global geometry of the low-rank matrix factorization problem. In particular, we illustrate how the rotational symmetry group gives rise to infinitely many non-isolated strict saddle points and equivalent global minima of the objective function. By explicitly identifying all stationary points, we divide the entire parameter space into three regions: ($\mathcal{R}_1$) the region containing the neighborhoods of all strict saddle points, where the objective has negative curvature; ($\mathcal{R}_2$) the region containing the neighborhoods of all global minima, where the objective enjoys strong convexity along certain directions; and ($\mathcal{R}_3$) the complement of the above regions, where the gradient has sufficiently large magnitude. We further extend our results to the matrix sensing problem. This allows us to establish strong global convergence guarantees for popular iterative algorithms with arbitrary initialization.
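To make the rotational symmetry concrete, here is a minimal numerical sketch (our illustration, not code from the paper). It assumes the standard symmetric factorization objective f(U) = (1/4)||UU^T - M||_F^2 with M = XX^T of rank r (the names n, r, X, M are ours), and checks (i) the invariance f(UR) = f(U) for orthogonal R, (ii) that the global minima form a non-isolated orbit {XR}, and (iii) that U = 0 is a stationary point with negative curvature, i.e., a strict saddle.

```python
# Minimal sketch (illustrative, not the paper's code): the rotation-invariant
# factorization objective f(U) = 1/4 * ||U U^T - M||_F^2 with M = X X^T.
import numpy as np

rng = np.random.default_rng(0)
n, r = 6, 2
X = rng.standard_normal((n, r))
M = X @ X.T  # ground-truth rank-r positive semidefinite matrix


def f(U):
    """f(U) = 1/4 ||UU^T - M||_F^2."""
    return 0.25 * np.linalg.norm(U @ U.T - M, "fro") ** 2


# (i) Invariance under the rotational symmetry group O(r):
# (UR)(UR)^T = U R R^T U^T = U U^T, so f(UR) = f(U) for any orthogonal R.
R, _ = np.linalg.qr(rng.standard_normal((r, r)))  # R is a random orthogonal matrix
U = rng.standard_normal((n, r))
print(f(U), f(U @ R))  # equal up to floating-point error

# (ii) Non-isolated global minima: every point of the orbit {X R} attains f = 0.
print(f(X), f(X @ R))  # both ~ 0

# (iii) U = 0 is stationary (grad f(U) = (UU^T - M)U vanishes there) but is a
# strict saddle: moving along the top eigenvector of M decreases f.
w, V = np.linalg.eigh(M)
D = np.hstack([V[:, [-1]], np.zeros((n, r - 1))])  # rank-1 descent direction
print([f(t * D) for t in (0.0, 0.1, 0.2)])  # strictly decreasing near t = 0
```

The orbit in (ii) is exactly why the Hessian at any global minimum has a nontrivial null space: directions tangent to the orbit leave the objective unchanged, which is the non-isolated structure the paper characterizes via invariant groups.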


Citations
Journal Article

Nonconvex Optimization Meets Low-Rank Matrix Factorization: An Overview

TL;DR: This tutorial-style overview highlights the important role of statistical models in enabling efficient nonconvex optimization with performance guarantees and reviews two contrasting approaches: two-stage algorithms, which consist of a tailored initialization step followed by successive refinement; and global landscape analysis and initialization-free algorithms.
Journal Article

Implicit Regularization in Nonconvex Statistical Estimation: Gradient Descent Converges Linearly for Phase Retrieval, Matrix Completion, and Blind Deconvolution

TL;DR: In this paper, the authors show that, for phase retrieval, low-rank matrix completion, and blind deconvolution, gradient descent achieves near-optimal statistical and computational guarantees without explicit regularization.
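As a hedged illustration of the "no explicit regularization" message (a toy analogue we constructed, not one of the estimation problems analyzed in the cited paper), vanilla gradient descent on the fully observed factorization objective above converges from a small random initialization with no regularizer, projection, or truncation:

```python
# Toy sketch (our construction): unregularized gradient descent on
# f(U) = 1/4 ||UU^T - M||_F^2 recovers M from a small random start.
import numpy as np

rng = np.random.default_rng(1)
n, r = 6, 2
X = rng.standard_normal((n, r))
M = X @ X.T

U = 0.01 * rng.standard_normal((n, r))   # small random initialization
eta = 0.1 / np.linalg.norm(M, 2)         # step size scaled by the spectral norm of M
for _ in range(5000):
    U -= eta * (U @ U.T - M) @ U         # gradient step; grad f = (UU^T - M)U
print(np.linalg.norm(U @ U.T - M, "fro"))  # residual should be near zero
```

The cited paper establishes analogous behavior, with sharp rates, in harder partially observed settings, where the analysis shows the iterates remain well-behaved automatically rather than by enforced regularization.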
Journal Article

Harnessing Structures in Big Data via Guaranteed Low-Rank Matrix Estimation: Recent Theory and Fast Algorithms via Convex and Nonconvex Optimization

TL;DR: A unified overview of recent advances in low-rank matrix estimation from incomplete measurements is provided, with attention paid to rigorous characterization of the performance of these algorithms and to problems where the low-rank matrix has additional structural properties that require new algorithmic designs and theoretical analysis.
Journal Article

Global optimality in low-rank matrix optimization

TL;DR: In this paper, the authors consider the minimization of a general objective function over a set of rectangular matrices that have rank at most r. Despite the resulting nonconvexity, recent studies in matrix completion and sensing have shown that the factored problem has no spurious local minima and obeys the strict saddle property.
Proceedings Article

Deep Hyperspherical Learning

TL;DR: SphereNet as mentioned in this paper adopts SphereConv as its basic convolution operator and is supervised by a generalized angular softmax loss, which is a natural loss formulation under SphereConv.