Author

Xiaodong Li

Bio: Xiaodong Li is an academic researcher from the University of California, Davis. The author has contributed to research in topics including Convex optimization and Matrix completion, has an h-index of 26, and has co-authored 42 publications receiving 10,704 citations. Previous affiliations of Xiaodong Li include the University of Pennsylvania and Stanford University.

Papers
Journal ArticleDOI
TL;DR: In this paper, the authors prove that under some suitable assumptions, it is possible to recover both the low-rank and the sparse components exactly by solving a very convenient convex program called Principal Component Pursuit; among all feasible decompositions, simply minimize a weighted combination of the nuclear norm and of the ℓ1 norm.
Abstract: This article is about a curious phenomenon. Suppose we have a data matrix, which is the superposition of a low-rank component and a sparse component. Can we recover each component individually? We prove that under some suitable assumptions, it is possible to recover both the low-rank and the sparse components exactly by solving a very convenient convex program called Principal Component Pursuit; among all feasible decompositions, simply minimize a weighted combination of the nuclear norm and of the ℓ1 norm. This suggests the possibility of a principled approach to robust principal component analysis, since our methodology and results assert that one can recover the principal components of a data matrix even though a positive fraction of its entries are arbitrarily corrupted. This extends to the situation where a fraction of the entries are missing as well. We discuss an algorithm for solving this optimization problem, and present applications in the area of video surveillance, where our methodology allows for the detection of objects in a cluttered background, and in the area of face recognition, where it offers a principled way of removing shadows and specularities in images of faces.
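For concreteness, a standard statement of the Principal Component Pursuit program described above is sketched below, with M the observed data matrix, L the low-rank component, S the sparse component, and λ the weighting parameter; the notation is illustrative rather than a verbatim reproduction of the paper's.

```latex
% Principal Component Pursuit (PCP): decompose M into low-rank L plus sparse S.
% The paper's analysis uses the fixed weight \lambda = 1/\sqrt{\max(n_1, n_2)}.
\begin{aligned}
\underset{L,\,S}{\text{minimize}} \quad & \|L\|_{*} + \lambda \|S\|_{1} \\
\text{subject to} \quad & L + S = M
\end{aligned}
```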

6,783 citations

Journal ArticleDOI
TL;DR: In this article, a nonconvex formulation of the phase retrieval problem is proposed together with a concrete solution algorithm; the main contribution is to show rigorously that this algorithm allows exact retrieval of the phase information from a nearly minimal number of random measurements.
Abstract: We study the problem of recovering the phase from magnitude measurements; specifically, we wish to reconstruct a complex-valued signal $\boldsymbol{x} \in \mathbb{C}^{n}$ about which we have phaseless samples of the form $y_{r} = |\langle \boldsymbol{a}_{r}, \boldsymbol{x} \rangle|^{2}$, $r = 1, \ldots, m$ (knowledge of the phase of these samples would yield a linear system). This paper develops a nonconvex formulation of the phase retrieval problem as well as a concrete solution algorithm. In a nutshell, this algorithm starts with a careful initialization obtained by means of a spectral method, and then refines this initial estimate by iteratively applying novel update rules, which have low computational complexity, much like in a gradient descent scheme. The main contribution is that this algorithm is shown to rigorously allow the exact retrieval of phase information from a nearly minimal number of random measurements. Indeed, the sequence of successive iterates provably converges to the solution at a geometric rate, so that the proposed scheme is efficient both in terms of computational and data resources. In theory, a variation on this scheme leads to a near-linear time algorithm for a physically realizable model based on coded diffraction patterns. We illustrate the effectiveness of our methods with various experiments on image data. Underlying our analysis are insights for the analysis of nonconvex optimization schemes that may have implications for computational problems beyond phase retrieval.
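The two-stage procedure described in the abstract (spectral initialization followed by low-cost gradient-style updates) can be sketched roughly as follows. This is a hedged, minimal Wirtinger-flow-style illustration rather than the paper's exact implementation; the function name `wirtinger_flow`, the step-size schedule constants, and the scaling of the initializer are assumptions made for the example.

```python
import numpy as np

def wirtinger_flow(A, y, n_iter=500, mu_max=0.2, tau0=330.0):
    """Hedged sketch of a Wirtinger-flow-style phase retrieval solver.

    A : (m, n) complex array whose r-th row acts as a_r^*, so A @ x gives <a_r, x>.
    y : (m,) array of phaseless measurements y_r = |<a_r, x>|^2.
    Returns an estimate of x, determined up to a global phase.
    """
    m, n = A.shape

    # Spectral initialization: leading eigenvector of (1/m) * sum_r y_r a_r a_r^*.
    Y = (A.conj().T * y) @ A / m
    _, eigvecs = np.linalg.eigh(Y)
    z = eigvecs[:, -1] * np.sqrt(y.mean())   # rough scaling to match ||x||

    norm_z0_sq = np.linalg.norm(z) ** 2
    for t in range(n_iter):
        Az = A @ z
        # Wirtinger-type gradient of the squared-error loss on the magnitudes.
        grad = A.conj().T @ ((np.abs(Az) ** 2 - y) * Az) / m
        mu = min(1.0 - np.exp(-(t + 1) / tau0), mu_max)  # slowly increasing step size
        z = z - (mu / norm_z0_sq) * grad
    return z
```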

1,096 citations

Posted Content
TL;DR: The proposed convex program is shown to recover the low-rank matrix even when a positive fraction of its entries are arbitrarily corrupted, with an error bound proportional to the noise level; this is the first result showing that classical Principal Component Analysis, which is optimal for small i.i.d. noise, can be made robust to gross sparse errors.
Abstract: In this paper, we study the problem of recovering a low-rank matrix (the principal components) from a high-dimensional data matrix despite both small entry-wise noise and gross sparse errors. Recently, it has been shown that a convex program, named Principal Component Pursuit (PCP), can recover the low-rank matrix when the data matrix is corrupted by gross sparse errors. We further prove that the solution to a related convex program (a relaxed PCP) gives an estimate of the low-rank matrix that is simultaneously stable to small entrywise noise and robust to gross sparse errors. More precisely, our result shows that the proposed convex program recovers the low-rank matrix even though a positive fraction of its entries are arbitrarily corrupted, with an error bound proportional to the noise level. We present simulation results to support our result and demonstrate that the new convex program accurately recovers the principal components (the low-rank matrix) under quite broad conditions. To our knowledge, this is the first result that shows the classical Principal Component Analysis (PCA), optimal for small i.i.d. noise, can be made robust to gross sparse errors; or the first that shows the newly proposed PCP can be made stable to small entry-wise perturbations.
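A common way to write the relaxed PCP described above is sketched below, with M the noisy data matrix and δ a bound on the Frobenius norm of the entry-wise noise; this is a standard formulation offered for illustration rather than a verbatim reproduction of the paper's program.

```latex
% Relaxed (stable) Principal Component Pursuit: the equality constraint of PCP
% is loosened to tolerate small entry-wise noise of Frobenius norm at most \delta.
\begin{aligned}
\underset{L,\,S}{\text{minimize}} \quad & \|L\|_{*} + \lambda \|S\|_{1} \\
\text{subject to} \quad & \|M - L - S\|_{F} \le \delta
\end{aligned}
```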

470 citations

Proceedings ArticleDOI
13 Jun 2010
TL;DR: In this article, a relaxed version of the convex program Principal Component Pursuit (PCP) is shown to recover the low-rank matrix from a high-dimensional data matrix despite both small entry-wise noise and gross sparse errors.
Abstract: In this paper, we study the problem of recovering a low-rank matrix (the principal components) from a high-dimensional data matrix despite both small entry-wise noise and gross sparse errors. Recently, it has been shown that a convex program, named Principal Component Pursuit (PCP), can recover the low-rank matrix when the data matrix is corrupted by gross sparse errors. We further prove that the solution to a related convex program (a relaxed PCP) gives an estimate of the low-rank matrix that is simultaneously stable to small entry-wise noise and robust to gross sparse errors. More precisely, our result shows that the proposed convex program recovers the low-rank matrix even though a positive fraction of its entries are arbitrarily corrupted, with an error bound proportional to the noise level. We present simulation results to support our result and demonstrate that the new convex program accurately recovers the principal components (the low-rank matrix) under quite broad conditions. To our knowledge, this is the first result that shows the classical Principal Component Analysis (PCA), optimal for small i.i.d. noise, can be made robust to gross sparse errors; or the first that shows the newly proposed PCP can be made stable to small entry-wise perturbations.

454 citations

Journal ArticleDOI
TL;DR: It is shown that any complex vector can be recovered exactly from on the order of n quadratic equations of the form $|\langle \boldsymbol{a}_i, \boldsymbol{x}_0 \rangle|^2 = b_i$, $i = 1, \ldots, m$, by using a semidefinite program known as PhaseLift, improving upon earlier bounds.
Abstract: This note shows that we can recover any complex vector $\boldsymbol{x}_0 \in \mathbb{C}^{n}$ exactly from on the order of n quadratic equations of the form $|\langle \boldsymbol{a}_i, \boldsymbol{x}_0 \rangle|^2 = b_i$, $i = 1, \ldots, m$, by using a semidefinite program known as PhaseLift. This improves upon earlier bounds in Candès et al. (Commun. Pure Appl. Math. 66:1241-1274, 2013), which required the number of equations to be at least on the order of $n \log n$. Further, we show that exact recovery holds for all input vectors simultaneously, and we also demonstrate optimal recovery results from noisy quadratic measurements; these results are much sharper than previously known results.
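For reference, the PhaseLift relaxation lifts the unknown vector $\boldsymbol{x}_0$ to the rank-one matrix $X = \boldsymbol{x}_0\boldsymbol{x}_0^{*}$, so that each quadratic equation becomes a linear constraint on $X$. A common trace-minimization form is sketched below; the notation is illustrative rather than a verbatim reproduction of the paper's program.

```latex
% PhaseLift: semidefinite relaxation of the phaseless quadratic equations.
% Each measurement b_i = |<a_i, x_0>|^2 becomes a linear constraint on X = x_0 x_0^*.
\begin{aligned}
\underset{X \in \mathbb{C}^{n \times n}}{\text{minimize}} \quad & \operatorname{Tr}(X) \\
\text{subject to} \quad & \boldsymbol{a}_i^{*} X \boldsymbol{a}_i = b_i, \quad i = 1, \ldots, m, \\
& X \succeq 0
\end{aligned}
```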

366 citations


Cited by
Journal ArticleDOI
TL;DR: The basic ideas of PCA are introduced, with a discussion of what it can and cannot do, followed by a description of variants of the technique that have been developed and tailored to different data types and structures.
Abstract: Large datasets are increasingly common and are often difficult to interpret. Principal component analysis (PCA) is a technique for reducing the dimensionality of such datasets, increasing interpretability but at the same time minimizing information loss. It does so by creating new uncorrelated variables that successively maximize variance. Finding such new variables, the principal components, reduces to solving an eigenvalue/eigenvector problem, and the new variables are defined by the dataset at hand, not a priori, hence making PCA an adaptive data analysis technique. It is adaptive in another sense too, since variants of the technique have been developed that are tailored to various different data types and structures. This article will begin by introducing the basic ideas of PCA, discussing what it can and cannot do. It will then describe some variants of PCA and their application.
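As a concrete illustration of the eigenvalue/eigenvector computation mentioned above, a minimal PCA sketch in NumPy follows; the function name `pca` and the toy data are assumptions made for the example, and an SVD of the centered data is used, which is equivalent to the covariance eigenproblem.

```python
import numpy as np

def pca(X, k):
    """Minimal PCA sketch: rows of X are observations, columns are variables.

    Returns the top-k principal directions, the corresponding component scores,
    and the fraction of variance each component explains.
    """
    Xc = X - X.mean(axis=0)                      # center each variable
    # SVD of the centered data solves the eigenproblem of the covariance matrix.
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    directions = Vt[:k]                          # principal axes (rows)
    scores = Xc @ directions.T                   # new, uncorrelated variables
    explained = (s[:k] ** 2) / np.sum(s ** 2)    # variance fractions
    return directions, scores, explained

# Example: project 200 noisy 10-dimensional points onto their first 2 components.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3)) @ rng.normal(size=(3, 10)) + 0.1 * rng.normal(size=(200, 10))
axes, scores, var_explained = pca(X, k=2)
```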

4,289 citations

Book
27 Nov 2013
TL;DR: The many different interpretations of proximal operators and algorithms are discussed, their connections to many other topics in optimization and applied mathematics are described, some popular algorithms are surveyed, and a large number of examples of proximal operators that commonly arise in practice are provided.
Abstract: This monograph is about a class of optimization algorithms called proximal algorithms. Much like Newton's method is a standard tool for solving unconstrained smooth optimization problems of modest size, proximal algorithms can be viewed as an analogous tool for nonsmooth, constrained, large-scale, or distributed versions of these problems. They are very generally applicable, but are especially well-suited to problems of substantial recent interest involving large or high-dimensional datasets. Proximal methods sit at a higher level of abstraction than classical algorithms like Newton's method: the base operation is evaluating the proximal operator of a function, which itself involves solving a small convex optimization problem. These subproblems, which generalize the problem of projecting a point onto a convex set, often admit closed-form solutions or can be solved very quickly with standard or simple specialized methods. Here, we discuss the many different interpretations of proximal operators and algorithms, describe their connections to many other topics in optimization and applied mathematics, survey some popular algorithms, and provide a large number of examples of proximal operators that commonly arise in practice.
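As an illustration of the base operation described above, the sketch below evaluates one of the simplest proximal operators, that of the scaled ℓ1 norm (elementwise soft-thresholding), and uses it inside a basic proximal gradient iteration for a lasso-type problem; the function names and parameters are assumptions made for the example.

```python
import numpy as np

def prox_l1(v, t):
    """Proximal operator of t * ||.||_1: elementwise soft-thresholding.

    Solves argmin_x  t * ||x||_1 + 0.5 * ||x - v||_2^2  in closed form.
    """
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def lasso_proximal_gradient(A, b, lam, step, n_iter=500):
    """Proximal gradient method (ISTA) for 0.5 * ||Ax - b||^2 + lam * ||x||_1."""
    x = np.zeros(A.shape[1])
    for _ in range(n_iter):
        grad = A.T @ (A @ x - b)                   # gradient of the smooth part
        x = prox_l1(x - step * grad, step * lam)   # proximal step on the nonsmooth part
    return x
```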

3,627 citations

Journal ArticleDOI
TL;DR: It is shown that the convex program associated with LRR solves the subspace clustering problem in the following sense: When the data is clean, LRR exactly recovers the true subspace structures; when the data are contaminated by outliers, it is proved that under certain conditions LRR can exactly recover the row space of the original data.
Abstract: In this paper, we address the subspace clustering problem. Given a set of data samples (vectors) approximately drawn from a union of multiple subspaces, our goal is to cluster the samples into their respective subspaces and remove possible outliers as well. To this end, we propose a novel objective function named Low-Rank Representation (LRR), which seeks the lowest rank representation among all the candidates that can represent the data samples as linear combinations of the bases in a given dictionary. It is shown that the convex program associated with LRR solves the subspace clustering problem in the following sense: When the data is clean, we prove that LRR exactly recovers the true subspace structures; when the data are contaminated by outliers, we prove that under certain conditions LRR can exactly recover the row space of the original data and detect the outlier as well; for data corrupted by arbitrary sparse errors, LRR can also approximately recover the row space with theoretical guarantees. Since the subspace membership is provably determined by the row space, these further imply that LRR can perform robust subspace clustering and error correction in an efficient and effective way.
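A common statement of the Low-Rank Representation program described above is sketched below, with X the data matrix, A a given dictionary (often taken to be X itself), Z the representation coefficients, and E the error term measured in the ℓ2,1 norm; the notation is illustrative rather than a verbatim reproduction of the paper's.

```latex
% Low-Rank Representation (LRR): seek the lowest-rank representation Z of the
% data X over the dictionary A, with column-sparse errors E penalized in l_{2,1}.
\begin{aligned}
\underset{Z,\,E}{\text{minimize}} \quad & \|Z\|_{*} + \lambda \|E\|_{2,1} \\
\text{subject to} \quad & X = A Z + E
\end{aligned}
```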

3,085 citations

Journal ArticleDOI
TL;DR: Using the nuclear-norm-based collaborative alignment method presented here, the genetic identity of each virus particle present in a mixture can be assigned based solely on structural information derived from single envelope glycoproteins displayed on the virus surface.

2,410 citations

Journal ArticleDOI
TL;DR: In this paper, a convex program that finds the matrix of minimum nuclear norm consistent with the observed entries is shown to recover all of the missing entries of a low-rank matrix from most sufficiently large subsets of observations.
Abstract: Suppose that one observes an incomplete subset of entries selected from a low-rank matrix. When is it possible to complete the matrix and recover the entries that have not been seen? We demonstrate that in very general settings, one can perfectly recover all of the missing entries from most sufficiently large subsets by solving a convex programming problem that finds the matrix with the minimum nuclear norm agreeing with the observed entries. The techniques used in this analysis draw upon parallels in the field of compressed sensing, demonstrating that objects other than signals and images can be perfectly reconstructed from very limited information.
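A standard statement of the convex program described above is sketched below, with Ω the set of observed entries of the low-rank matrix M; the notation is illustrative.

```latex
% Nuclear-norm matrix completion: among all matrices agreeing with the observed
% entries indexed by Omega, pick the one of minimum nuclear norm.
\begin{aligned}
\underset{X}{\text{minimize}} \quad & \|X\|_{*} \\
\text{subject to} \quad & X_{ij} = M_{ij}, \quad (i, j) \in \Omega
\end{aligned}
```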

2,327 citations