
Showing papers on "Matrix (mathematics) published in 2009"


Journal ArticleDOI
TL;DR: It is proved that one can perfectly recover most low-rank matrices from what appears to be an incomplete set of entries, and that objects other than signals and images can be perfectly reconstructed from very limited information.
Abstract: We consider a problem of considerable practical interest: the recovery of a data matrix from a sampling of its entries. Suppose that we observe m entries selected uniformly at random from a matrix M. Can we complete the matrix and recover the entries that we have not seen? We show that one can perfectly recover most low-rank matrices from what appears to be an incomplete set of entries. We prove that if the number m of sampled entries obeys $$m\ge C\,n^{1.2}r\log n$$ for some positive numerical constant C, then with very high probability, most n×n matrices of rank r can be perfectly recovered by solving a simple convex optimization program. This program finds the matrix with minimum nuclear norm that fits the data. The condition above assumes that the rank is not too large. However, if one replaces the 1.2 exponent with 1.25, then the result holds for all values of the rank. Similar results hold for arbitrary rectangular matrices as well. Our results are connected with the recent literature on compressed sensing, and show that objects other than signals and images can be perfectly reconstructed from very limited information.
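A minimal numerical sketch of the convex program described above (nuclear-norm minimization subject to agreement with the sampled entries), written with cvxpy on a small synthetic instance; the sizes, sampling rate, and solver are illustrative assumptions, not from the paper:

```python
import numpy as np
import cvxpy as cp  # assumes cvxpy with a conic solver such as SCS is installed

# Small synthetic instance: a rank-2, 30x30 matrix with ~60% of entries observed.
rng = np.random.default_rng(0)
n, r = 30, 2
M = rng.standard_normal((n, r)) @ rng.standard_normal((r, n))
mask = (rng.random((n, n)) < 0.6).astype(float)   # observed-entry indicator

X = cp.Variable((n, n))
# Find the minimum-nuclear-norm matrix that agrees with the observed entries.
constraints = [cp.multiply(mask, X - M) == 0]
cp.Problem(cp.Minimize(cp.normNuc(X)), constraints).solve()

print("relative recovery error:",
      np.linalg.norm(X.value - M) / np.linalg.norm(M))
```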

5,274 citations


Posted Content
TL;DR: In this article, a modular framework for constructing randomized algorithms that compute partial matrix decompositions is presented. The framework uses random sampling to identify a subspace that captures most of the action of a matrix; the input matrix is then compressed to this subspace, and the reduced matrix is manipulated deterministically to obtain the desired low-rank factorization.
Abstract: Low-rank matrix approximations, such as the truncated singular value decomposition and the rank-revealing QR decomposition, play a central role in data analysis and scientific computing. This work surveys and extends recent research which demonstrates that randomization offers a powerful tool for performing low-rank matrix approximation. These techniques exploit modern computational architectures more fully than classical methods and open the possibility of dealing with truly massive data sets. This paper presents a modular framework for constructing randomized algorithms that compute partial matrix decompositions. These methods use random sampling to identify a subspace that captures most of the action of a matrix. The input matrix is then compressed---either explicitly or implicitly---to this subspace, and the reduced matrix is manipulated deterministically to obtain the desired low-rank factorization. In many cases, this approach beats its classical competitors in terms of accuracy, speed, and robustness. These claims are supported by extensive numerical experiments and a detailed error analysis.
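A compact numpy sketch of the two-stage idea described above (randomized range finding followed by deterministic factorization of the compressed matrix); the oversampling amount and the test matrix are illustrative assumptions:

```python
import numpy as np

def randomized_svd(A, k, oversample=10, seed=0):
    """Stage A: random sampling captures the action of A; Stage B: factor the compression."""
    rng = np.random.default_rng(seed)
    m, n = A.shape
    Omega = rng.standard_normal((n, k + oversample))   # random test matrix
    Q, _ = np.linalg.qr(A @ Omega)                     # orthonormal basis for the sampled range
    B = Q.T @ A                                        # compress A to that subspace
    U_small, s, Vt = np.linalg.svd(B, full_matrices=False)
    return (Q @ U_small)[:, :k], s[:k], Vt[:k]         # approximate truncated SVD

# quick check on a synthetic low-rank matrix
rng = np.random.default_rng(1)
A = rng.standard_normal((500, 40)) @ rng.standard_normal((40, 300))
U, s, Vt = randomized_svd(A, k=40)
print("relative error:", np.linalg.norm(A - (U * s) @ Vt) / np.linalg.norm(A))
```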

2,356 citations


Proceedings Article
07 Dec 2009
TL;DR: It is proved that most matrices A can be efficiently and exactly recovered from most error sign-and-support patterns by solving a simple convex program, for which it is given a fast and provably convergent algorithm.
Abstract: Principal component analysis is a fundamental operation in computational data analysis, with myriad applications ranging from web search to bioinformatics to computer vision and image analysis. However, its performance and applicability in real scenarios are limited by a lack of robustness to outlying or corrupted observations. This paper considers the idealized "robust principal component analysis" problem of recovering a low-rank matrix A from corrupted observations D = A + E. Here, the corrupted entries E are unknown and the errors can be arbitrarily large (modeling grossly corrupted observations common in visual and bioinformatic data), but are assumed to be sparse. We prove that most matrices A can be efficiently and exactly recovered from most error sign-and-support patterns by solving a simple convex program, for which we give a fast and provably convergent algorithm. Our result holds even when the rank of A grows nearly proportionally (up to a logarithmic factor) to the dimensionality of the observation space and the number of errors E grows in proportion to the total number of entries in the matrix. A by-product of our analysis is the first set of proportional growth results for the related problem of completing a low-rank matrix from a small fraction of its entries. Simulations and real-data examples corroborate the theoretical results, and suggest potential applications in computer vision.
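A hedged cvxpy sketch of the convex program referred to above, min ||A||_* + λ||E||_1 subject to A + E = D; the weighting λ = 1/√n and the toy data are illustrative assumptions rather than the paper's own (faster, iterative) algorithm:

```python
import numpy as np
import cvxpy as cp  # assumes an installed conic solver such as SCS

rng = np.random.default_rng(0)
n, r = 40, 2
L0 = rng.standard_normal((n, r)) @ rng.standard_normal((r, n))          # low-rank part
E0 = (rng.random((n, n)) < 0.05) * (10 * rng.standard_normal((n, n)))   # sparse gross errors
D = L0 + E0

A = cp.Variable((n, n))
E = cp.Variable((n, n))
lam = 1.0 / np.sqrt(n)   # common choice of trade-off weight; an assumption here
cp.Problem(cp.Minimize(cp.normNuc(A) + lam * cp.sum(cp.abs(E))),
           [A + E == D]).solve()

print("low-rank recovery error:",
      np.linalg.norm(A.value - L0) / np.linalg.norm(L0))
```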

1,479 citations


Posted Content
TL;DR: In this article, the authors show that matrix completion is provably accurate even when the few observed entries are corrupted with a small amount of noise, and they also present numerical results which show that, in practice, nuclear norm minimization accurately fills in the many missing entries of large low-rank matrices from just a few noisy samples.
Abstract: On the heels of compressed sensing, a remarkable new field has very recently emerged. This field addresses a broad range of problems of significant practical interest, namely, the recovery of a data matrix from what appears to be incomplete, and perhaps even corrupted, information. In its simplest form, the problem is to recover a matrix from a small sample of its entries, and comes up in many areas of science and engineering including collaborative filtering, machine learning, control, remote sensing, and computer vision to name a few. This paper surveys the novel literature on matrix completion, which shows that under some suitable conditions, one can recover an unknown low-rank matrix from a nearly minimal set of entries by solving a simple convex optimization problem, namely, nuclear-norm minimization subject to data constraints. Further, this paper introduces novel results showing that matrix completion is provably accurate even when the few observed entries are corrupted with a small amount of noise. A typical result is that one can recover an unknown n x n matrix of low rank r from just about nr log^2 n noisy samples with an error which is proportional to the noise level. We present numerical results which complement our quantitative analysis and show that, in practice, nuclear norm minimization accurately fills in the many missing entries of large low-rank matrices from just a few noisy samples. Some analogies between matrix completion and compressed sensing are discussed throughout.

1,292 citations


Book
06 Jul 2009

1,088 citations


Book
19 Feb 2009
TL;DR: In this paper, a new framework based on matrix theory is proposed to analyze and design cooperative controls for a group of individual dynamical systems whose outputs are sensed by or communicated to others in an intermittent, dynamically changing, and local manner.
Abstract: In this paper, a new framework based on matrix theory is proposed to analyze and design cooperative controls for a group of individual dynamical systems whose outputs are sensed by or communicated to others in an intermittent, dynamically changing, and local manner. In the framework, sensing/communication is described mathematically by a time-varying matrix whose dimension is equal to the number of dynamical systems in the group and whose elements assume piecewise-constant and binary values. Dynamical systems are generally heterogeneous and can be transformed into a canonical form of different, arbitrary, but finite relative degrees. Utilizing a set of new results on augmentation of irreducible matrices and on lower triangulation of reducible matrices, the framework allows a designer to study how a general local-and-output-feedback cooperative control can determine group behaviors of the dynamical systems and to see how changes of sensing/communication would impact the group behaviors over time. A necessary and sufficient condition on convergence of a multiplicative sequence of reducible row-stochastic (diagonally positive) matrices is explicitly derived, and through simple choices of a gain matrix in the cooperative control law, the overall closed-loop system is shown to exhibit cooperative behaviors (such as single group behavior, multiple group behaviors, adaptive cooperative behavior for the group, and cooperative formation including individual behaviors). Examples, including formation control of nonholonomic systems in the chained form, are used to illustrate the proposed framework.
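A toy numerical illustration (not the paper's general framework) of the key matrix-theoretic ingredient: repeated multiplication by time-varying, diagonally positive row-stochastic matrices drives the agents' outputs to a common value when the sensing/communication pattern is jointly connected; the specific matrices below are assumptions chosen for illustration:

```python
import numpy as np

x = np.array([1.0, 5.0, -2.0])          # initial outputs of three simple agents

# Two alternating sensing patterns, each encoded as a row-stochastic,
# diagonally positive matrix (rows sum to 1, diagonal entries positive).
D1 = np.array([[0.5, 0.5, 0.0],
               [0.0, 1.0, 0.0],
               [0.0, 0.5, 0.5]])
D2 = np.array([[1.0, 0.0, 0.0],
               [0.5, 0.5, 0.0],
               [0.0, 0.0, 1.0]])

for k in range(60):
    D = D1 if k % 2 == 0 else D2        # intermittent, dynamically changing sensing
    x = D @ x                           # local, output-feedback style update
print(x)                                # all entries approach a common consensus value
```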

937 citations


Journal ArticleDOI
TL;DR: In this article, a condition on the matrix of an underdetermined linear system is presented which guarantees that the solution of the system with minimal ℓq-quasinorm is also the sparsest one.

788 citations


Posted Content
TL;DR: In this article, the authors provided the best bounds on the number of randomly sampled entries required to reconstruct an unknown low-rank matrix by minimizing the nuclear norm of the hidden matrix subject to agreement with the provided entries.
Abstract: This paper provides the best bounds to date on the number of randomly sampled entries required to reconstruct an unknown low rank matrix. These results improve on prior work by Candes and Recht, Candes and Tao, and Keshavan, Montanari, and Oh. The reconstruction is accomplished by minimizing the nuclear norm, or sum of the singular values, of the hidden matrix subject to agreement with the provided entries. If the underlying matrix satisfies a certain incoherence condition, then the number of entries required is equal to a quadratic logarithmic factor times the number of parameters in the singular value decomposition. The proof of this assertion is short, self contained, and uses very elementary analysis. The novel techniques herein are based on recent work in quantum information theory.

710 citations


Journal ArticleDOI
TL;DR: A framework for the joint design and optimization, from a set of training images, of the nonparametric dictionary and the sensing matrix is introduced and it is shown that this joint optimization outperforms both the use of random sensing matrices and those matrices that are optimized independently of the learning of the dictionary.
Abstract: Sparse signal representation, analysis, and sensing have received a lot of attention in recent years from the signal processing, optimization, and learning communities. On one hand, learning overcomplete dictionaries that facilitate a sparse representation of the data as a linear combination of a few atoms from such a dictionary leads to state-of-the-art results in image and video restoration and classification. On the other hand, the framework of compressed sensing (CS) has shown that sparse signals can be recovered from far fewer samples than those required by the classical Shannon-Nyquist Theorem. The samples used in CS correspond to linear projections obtained by a sensing projection matrix. It has been shown that, for example, a nonadaptive random sampling matrix satisfies the fundamental theoretical requirements of CS, enjoying the additional benefit of universality. However, a projection sensing matrix that is optimally designed for a certain class of signals can further improve the reconstruction accuracy or further reduce the necessary number of samples. In this paper, we introduce a framework for the joint design and optimization, from a set of training images, of the nonparametric dictionary and the sensing matrix. We show that this joint optimization outperforms both the use of random sensing matrices and those matrices that are optimized independently of the learning of the dictionary. Particular cases of the proposed framework include the optimization of the sensing matrix for a given dictionary as well as the optimization of the dictionary for a predefined sensing environment. The presentation of the framework and its efficient numerical optimization is complemented with numerous examples on classical image datasets.
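A small hedged sketch of the quantity such a design targets: the mutual coherence of the effective dictionary ΦD (sensing matrix times dictionary). The sizes and the purely random Φ below are illustrative assumptions; the paper's joint learning procedure itself is not reproduced here:

```python
import numpy as np

rng = np.random.default_rng(0)
n, m, k = 64, 20, 128                 # signal dim, measurements, dictionary atoms (assumed sizes)
D = rng.standard_normal((n, k))
D /= np.linalg.norm(D, axis=0)        # unit-norm atoms
Phi = rng.standard_normal((m, n)) / np.sqrt(m)   # nonadaptive random sensing matrix

def mutual_coherence(E):
    """Largest absolute normalized inner product between distinct columns of E."""
    E = E / np.linalg.norm(E, axis=0)
    G = np.abs(E.T @ E)
    np.fill_diagonal(G, 0.0)
    return G.max()

print("coherence of Phi @ D with random Phi:", mutual_coherence(Phi @ D))
# A sensing matrix optimized for D would aim to drive this value down.
```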

605 citations


Proceedings ArticleDOI
14 Jun 2009
TL;DR: This paper exploits the special structure of the trace norm, based on which an extended gradient algorithm that converges as O(1/k) is proposed, along with an accelerated gradient algorithm that achieves the optimal convergence rate of O(1/k^2) for smooth problems.
Abstract: We consider the minimization of a smooth loss function regularized by the trace norm of the matrix variable. Such formulation finds applications in many machine learning tasks including multi-task learning, matrix classification, and matrix completion. The standard semidefinite programming formulation for this problem is computationally expensive. In addition, due to the non-smooth nature of the trace norm, the optimal first-order black-box method for solving such a class of problems converges as O(1/√k), where k is the iteration counter. In this paper, we exploit the special structure of the trace norm, based on which we propose an extended gradient algorithm that converges as O(1/k). We further propose an accelerated gradient algorithm, which achieves the optimal convergence rate of O(1/k^2) for smooth problems. Experiments on multi-task learning problems demonstrate the efficiency of the proposed algorithms.
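A minimal numpy sketch of the basic O(1/k) scheme described above: a gradient step on the smooth loss followed by the proximal map of the trace norm, which soft-thresholds singular values. The toy multi-task regression problem and step size are assumptions; the accelerated O(1/k^2) variant (which adds a momentum/extrapolation step) is not shown:

```python
import numpy as np

def trace_norm_prox(M, tau):
    """Proximal operator of tau * ||.||_* : soft-threshold the singular values."""
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    return (U * np.maximum(s - tau, 0.0)) @ Vt

rng = np.random.default_rng(0)
X = rng.standard_normal((100, 30))
W_true = rng.standard_normal((30, 3)) @ rng.standard_normal((3, 20))   # low-rank coefficient matrix
Y = X @ W_true + 0.01 * rng.standard_normal((100, 20))

lam = 1.0
step = 1.0 / np.linalg.norm(X, 2) ** 2     # 1/L for the smooth part 0.5*||XW - Y||_F^2
W = np.zeros((30, 20))
for _ in range(300):                       # extended (unaccelerated) gradient iterations
    W = trace_norm_prox(W - step * (X.T @ (X @ W - Y)), step * lam)

print("rank of the estimate:", np.linalg.matrix_rank(W, tol=1e-6))
```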

600 citations


Journal ArticleDOI
TL;DR: A novel approach for analysing time series using complex network theory is proposed and the potential of these complex network measures for the detection of dynamical transitions is illustrated by using the logistic map.

Journal ArticleDOI
TL;DR: In this paper, the authors studied the sparsistency and rates of convergence for estimating sparse covariance and precision matrices based on penalized likelihood with nonconvex penalty functions.
Abstract: This paper studies the sparsistency and rates of convergence for estimating sparse covariance and precision matrices based on penalized likelihood with nonconvex penalty functions. Here, sparsistency refers to the property that all parameters that are zero are actually estimated as zero with probability tending to one. Depending on the application, sparsity may occur a priori on the covariance matrix, its inverse or its Cholesky decomposition. We study these three sparsity exploration problems under a unified framework with a general penalty function. We show that the rates of convergence for these problems under the Frobenius norm are of order (s_n log p_n / n)^{1/2}, where s_n is the number of nonzero elements, p_n is the size of the covariance matrix and n is the sample size. This explicitly spells out that the contribution of high dimensionality is merely a logarithmic factor. The conditions on the rate with which the tuning parameter λ_n goes to 0 have been made explicit and compared under different penalties. As a result, for the L1-penalty, to guarantee the sparsistency and optimal rate of convergence, the number of nonzero elements should be small: s_n' = O(p_n) at most, among O(p_n^2) parameters, for estimating a sparse covariance or correlation matrix, a sparse precision or inverse correlation matrix or a sparse Cholesky factor, where s_n' is the number of nonzero elements on the off-diagonal entries. On the other hand, using the SCAD or hard-thresholding penalty functions, there is no such restriction.
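For a concrete (convex, L1-penalized) instance of penalized-likelihood precision estimation, a short scikit-learn sketch; the SCAD and hard-thresholding penalties analyzed in the paper are not available in scikit-learn, so this only illustrates the L1 case, and the data and penalty level are assumptions:

```python
import numpy as np
from sklearn.covariance import GraphicalLasso

rng = np.random.default_rng(0)
p = 10
# Tridiagonal (hence sparse) true precision matrix; its inverse is a dense covariance.
Theta = np.eye(p) + np.diag(0.4 * np.ones(p - 1), 1) + np.diag(0.4 * np.ones(p - 1), -1)
Sigma = np.linalg.inv(Theta)
X = rng.multivariate_normal(np.zeros(p), Sigma, size=500)

model = GraphicalLasso(alpha=0.1).fit(X)   # L1-penalized likelihood estimate
nonzeros = int((np.abs(model.precision_) > 1e-3).sum())
print("nonzero entries in the estimated precision matrix:", nonzeros)
```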

Journal ArticleDOI
TL;DR: The proposed methodology may allow the upgrading of an existing evaluation to incorporate the genomic information when the information attributable to genomics can be expressed as modifications to the numerator relationship matrix.

Posted Content
TL;DR: To guarantee the sparsistency and optimal rate of convergence, the number of nonzero elements should be small: s_n' = O(p_n) at most, among O(p_n^2) parameters, for estimating a sparse covariance or correlation matrix, a sparse precision or inverse correlation matrix or a sparse Cholesky factor.
Abstract: This paper studies the sparsistency and rates of convergence for estimating sparse covariance and precision matrices based on penalized likelihood with nonconvex penalty functions. Here, sparsistency refers to the property that all parameters that are zero are actually estimated as zero with probability tending to one. Depending on the application, sparsity may occur a priori on the covariance matrix, its inverse or its Cholesky decomposition. We study these three sparsity exploration problems under a unified framework with a general penalty function. We show that the rates of convergence for these problems under the Frobenius norm are of order (s_n log p_n / n)^{1/2}, where s_n is the number of nonzero elements, p_n is the size of the covariance matrix and n is the sample size. This explicitly spells out that the contribution of high dimensionality is merely a logarithmic factor. The conditions on the rate with which the tuning parameter λ_n goes to 0 have been made explicit and compared under different penalties. As a result, for the L1-penalty, to guarantee the sparsistency and optimal rate of convergence, the number of nonzero elements should be small: s_n' = O(p_n) at most, among O(p_n^2) parameters, for estimating a sparse covariance or correlation matrix, a sparse precision or inverse correlation matrix or a sparse Cholesky factor, where s_n' is the number of nonzero elements on the off-diagonal entries. On the other hand, using the SCAD or hard-thresholding penalty functions, there is no such restriction.

Journal ArticleDOI
TL;DR: The relaxation given in (*) can be solved in polynomial time using semi-definite programming.
Abstract: Let A be an M by N matrix (M < N) [...] 1 - 1/d, and d = Ω(log(1/ε)/ε^3). The relaxation given in (*) can be solved in polynomial time using semi-definite programming.

Proceedings ArticleDOI
31 May 2009
TL;DR: Near-optimal space bounds are given in the streaming model for linear algebra problems that include estimation of matrix products, linear regression, low-rank approximation, and approximation of matrix rank; results for turnstile updates are proved.
Abstract: We give near-optimal space bounds in the streaming model for linear algebra problems that include estimation of matrix products, linear regression, low-rank approximation, and approximation of matrix rank. In the streaming model, sketches of input matrices are maintained under updates of matrix entries; we prove results for turnstile updates, given in an arbitrary order. We give the first lower bounds known for the space needed by the sketches, for a given estimation error ε. We sharpen prior upper bounds, with respect to combinations of space, failure probability, and number of passes. The sketch we use for matrix A is simply S^T A, where S is a sign matrix. Our results include the following upper and lower bounds on the bits of space needed for 1-pass algorithms. Here A is an n x d matrix, B is an n x d' matrix, and c := d + d'. These results are given for fixed failure probability; for failure probability δ > 0, the upper bounds require a factor of log(1/δ) more space. We assume the inputs have integer entries specified by O(log(nc)) bits, or O(log(nd)) bits. (Matrix Product) Output matrix C with ||A^T B - C||_F ≤ ε ||A||_F ||B||_F. We show that Θ(c ε^{-2} log(nc)) space is needed. (Linear Regression) For d' = 1, so that B is a vector b, find x so that ||Ax - b|| ≤ (1 + ε) min_{x' ∈ R^d} ||Ax' - b||. We show that Θ(d^2 ε^{-1} log(nd)) space is needed. (Rank-k Approximation) Find a matrix Ã_k of rank no more than k, so that ||A - Ã_k||_F ≤ (1 + ε) ||A - A_k||_F, where A_k is the best rank-k approximation to A. Our lower bound is Ω(k ε^{-1} (n + d) log(nd)) space, and we give a one-pass algorithm matching this when A is given row-wise or column-wise. For general updates, we give a one-pass algorithm needing O(k ε^{-2} (n + d/ε^2) log(nd)) space. We also give upper and lower bounds for algorithms using multiple passes, and a sketching analog of the CUR decomposition.
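A tiny numpy illustration of the sketch named above, S^T A with S a (scaled) sign matrix, used here to approximate the matrix product A^T B; the dimensions and sketch size are illustrative assumptions, and no streaming/turnstile machinery is shown:

```python
import numpy as np

rng = np.random.default_rng(0)
n, d, dp, m = 2000, 30, 20, 400                   # m = sketch size (rows kept)
A = rng.standard_normal((n, d))
B = rng.standard_normal((n, dp))

S = rng.choice([-1.0, 1.0], size=(n, m)) / np.sqrt(m)   # scaled sign matrix
SA, SB = S.T @ A, S.T @ B                                # the only state the sketch keeps
C = SA.T @ SB                                            # estimate of A^T B

err = np.linalg.norm(A.T @ B - C, 'fro') / (np.linalg.norm(A, 'fro') * np.linalg.norm(B, 'fro'))
print("relative Frobenius error of the product estimate:", err)
```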

Journal ArticleDOI
TL;DR: The relation between spectral bound and reproduction number extends to models with infinite-dimensional state space and then holds between the spectral bound of a resolvent-positive closed linear operator and the spectral radius of a positive bounded linear operator.
Abstract: Spectral bounds of quasi-positive matrices are crucial mathematical threshold parameters in population models that are formulated as systems of ordinary differential equations: the sign of the spectral bound of the variational matrix at 0 decides whether, at low density, the population becomes extinct or grows. Another important threshold parameter is the reproduction number $\mathcal{R}$, which is the spectral radius of a related positive matrix. As is well known, the spectral bound and $\mathcal{R}-1$ have the same sign provided that the matrices have a particular form. The relation between spectral bound and reproduction number extends to models with infinite-dimensional state space and then holds between the spectral bound of a resolvent-positive closed linear operator and the spectral radius of a positive bounded linear operator. We also extend an analogous relation between the spectral radii of two positive linear operators which is relevant for discrete-time models. We illustrate the general theory...
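A two-by-two numerical check of the sign relation stated above for a quasi-positive matrix written as B = F - V: the spectral bound s(B) and R - 1, with R the spectral radius of F V^{-1}, have the same sign. The matrices below are illustrative assumptions, not from the paper:

```python
import numpy as np

F = np.array([[0.0, 2.0],
              [1.0, 0.0]])                 # nonnegative "birth/infection" part
V = np.eye(2)                              # transition part (here simply the identity)
B = F - V                                  # quasi-positive variational matrix

s_bound = max(np.linalg.eigvals(B).real)                  # spectral bound of B
R = max(abs(np.linalg.eigvals(F @ np.linalg.inv(V))))     # reproduction number
print(s_bound, R - 1)                      # both equal sqrt(2) - 1 > 0: same sign
```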

Journal ArticleDOI
TL;DR: In this article, the authors describe an efficient algorithm for low-rank approximation of matrices that produces accuracy that is very close to the best possible accuracy, for matrices of arbitrary sizes.
Abstract: Principal component analysis (PCA) requires the computation of a low-rank approximation to a matrix containing the data being analyzed. In many applications of PCA, the best possible accuracy of any rank-deficient approximation is at most a few digits (measured in the spectral norm, relative to the spectral norm of the matrix being approximated). In such circumstances, efficient algorithms have not come with guarantees of good accuracy, unless one or both dimensions of the matrix being approximated are small. We describe an efficient algorithm for the low-rank approximation of matrices that produces accuracy that is very close to the best possible accuracy, for matrices of arbitrary sizes. We illustrate our theoretical results via several numerical examples.

Journal ArticleDOI
TL;DR: In this article, the smallest singular value of a random sub-Gaussian matrix with independent and identically distributed entries was shown to be at least of the order √N − √(n−1) with high probability.
Abstract: We prove an optimal estimate of the smallest singular value of a random sub-Gaussian matrix, valid for all dimensions. For an N × n matrix A with independent and identically distributed sub-Gaussian entries, the smallest singular value of A is at least of the order √N − √(n−1) with high probability. A sharp estimate on the probability is also obtained.
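A quick numerical check of the √N − √(n−1) scale, using Gaussian entries as a convenient sub-Gaussian example; the dimensions are arbitrary assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
N, n = 2000, 500
A = rng.standard_normal((N, n))
s_min = np.linalg.svd(A, compute_uv=False).min()
print(s_min, np.sqrt(N) - np.sqrt(n - 1))   # the two numbers are of the same order
```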

Journal ArticleDOI
TL;DR: A new method that enables easy and convenient discretization of partial differential equations with derivatives of arbitrary real order (so-called fractional derivatives) and delays is presented and illustrated on numerical solution of various types of fractional diffusion equation.

Journal ArticleDOI
TL;DR: An extensive computational experiment and formal inferential analysis is conducted to test the hypothesis that phase transitions occurring in modern high-dimensional data analysis and signal processing are universal across a range of underlying matrix ensembles, and shows that finite-sample universality can be rejected.
Abstract: We review connections between phase transitions in high-dimensional combinatorial geometry and phase transitions occurring in modern high-dimensional data analysis and signal processing. In data analysis, such transitions arise as abrupt breakdown of linear model selection, robust data fitting or compressed sensing reconstructions, when the complexity of the model or the number of outliers increases beyond a threshold. In combinatorial geometry these transitions appear as abrupt changes in the properties of face counts of convex polytopes when the dimensions are varied. The thresholds in these very different problems appear in the same critical locations after appropriate calibration of variables. These thresholds are important in each subject area: for linear modelling, they place hard limits on the degree to which the now-ubiquitous high-throughput data analysis can be successful; for robustness, they place hard limits on the degree to which standard robust fitting methods can tolerate outliers before breaking down; for compressed sensing, they define the sharp boundary of the undersampling/sparsity tradeoff in undersampling theorems. Existing derivations of phase transitions in combinatorial geometry assume the underlying matrices have independent and identically distributed (iid) Gaussian elements. In applications, however, it often seems that Gaussianity is not required. We conducted an extensive computational experiment and formal inferential analysis to test the hypothesis that these phase transitions are universal across a range of underlying matrix ensembles. The experimental results are consistent with an asymptotic large-n universality across matrix ensembles; finite-sample universality can be rejected.

Journal ArticleDOI
TL;DR: In this paper, the authors review connections between phase transitions in highdimensional combinatorial geometry and phase transitions occurring in modern high-dimensional data analysis and signal processing, and show that the empirical phase transitions do not depend on the ensemble, and they agree extremely well with the asymptotic theory assuming Gaussianity.
Abstract: We review connections between phase transitions in high-dimensional combinatorial geometry and phase transitions occurring in modern high-dimensional data analysis and signal processing. In data analysis, such transitions arise as abrupt breakdown of linear model selection, robust data fitting or compressed sensing reconstructions, when the complexity of the model or the number of outliers increases beyond a threshold. In combinatorial geometry, these transitions appear as abrupt changes in the properties of face counts of convex polytopes when the dimensions are varied. The thresholds in these very different problems appear in the same critical locations after appropriate calibration of variables. These thresholds are important in each subject area: for linear modelling, they place hard limits on the degree to which the now ubiquitous high-throughput data analysis can be successful; for robustness, they place hard limits on the degree to which standard robust fitting methods can tolerate outliers before breaking down; for compressed sensing, they define the sharp boundary of the undersampling/sparsity trade-off curve in undersampling theorems. Existing derivations of phase transitions in combinatorial geometry assume that the underlying matrices have independent and identically distributed Gaussian elements. In applications, however, it often seems that Gaussianity is not required. We conducted an extensive computational experiment and formal inferential analysis to test the hypothesis that these phase transitions are universal across a range of underlying matrix ensembles. We ran millions of linear programs using random matrices spanning several matrix ensembles and problem sizes; visually, the empirical phase transitions do not depend on the ensemble, and they agree extremely well with the asymptotic theory assuming Gaussianity. Careful statistical analysis reveals discrepancies that can be explained as transient terms, decaying with problem size. The experimental results are thus consistent with an asymptotic large-n universality across matrix ensembles; finite-sample universality can be rejected.

Journal ArticleDOI
TL;DR: In this article, a mode-dependent full-order filter is constructed and the bounded real lemma for the resulting filtering error system is derived via LMI formulation, and an improved version of the BRL is further given by introducing additional slack matrix variables to eliminate the cross coupling between system matrices and Lyapunov matrices among different operation modes.

Proceedings ArticleDOI
16 Aug 2009
TL;DR: This work develops a novel spatio-temporal compressive sensing framework with two key components: a new technique called Sparsity Regularized Matrix Factorization (SRMF) that leverages the sparse or low-rank nature of real-world traffic matrices and their spatio-temporal properties, and a mechanism for combining low-rank approximations with local interpolation procedures.
Abstract: Many basic network engineering tasks (e.g., traffic engineering, capacity planning, anomaly detection) rely heavily on the availability and accuracy of traffic matrices. However, in practice it is challenging to reliably measure traffic matrices. Missing values are common. This observation brings us into the realm of compressive sensing, a generic technique for dealing with missing values that exploits the presence of structure and redundancy in many real-world systems. Despite much recent progress made in compressive sensing, existing compressive-sensing solutions often perform poorly for traffic matrix interpolation, because real traffic matrices rarely satisfy the technical conditions required for these solutions.To address this problem, we develop a novel spatio-temporal compressive sensing framework with two key components: (i) a new technique called Sparsity Regularized Matrix Factorization (SRMF) that leverages the sparse or low-rank nature of real-world traffic matrices and their spatio-temporal properties, and (ii) a mechanism for combining low-rank approximations with local interpolation procedures. We illustrate our new framework and demonstrate its superior performance in problems involving interpolation with real traffic matrices where we can successfully replace up to 98% of the values. Evaluation in applications such as network tomography, traffic prediction, and anomaly detection confirms the flexibility and effectiveness of our approach.
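A bare-bones stand-in for the low-rank interpolation idea (alternating least squares on the observed entries only); it omits the sparsity regularization, spatio-temporal constraint matrices, and local interpolation that SRMF combines, and all sizes and rates below are assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
n, m, r = 50, 40, 3
T = rng.random((n, r)) @ rng.random((r, m))      # synthetic "traffic matrix"
mask = rng.random((n, m)) < 0.3                  # only 30% of entries were measured

L = rng.standard_normal((n, r))
R = rng.standard_normal((m, r))
lam = 1e-2                                       # ridge term keeps the solves well-posed
for _ in range(50):
    for i in range(n):                           # update row factors from observed entries
        idx = mask[i]
        L[i] = np.linalg.solve(R[idx].T @ R[idx] + lam * np.eye(r), R[idx].T @ T[i, idx])
    for j in range(m):                           # update column factors
        idx = mask[:, j]
        R[j] = np.linalg.solve(L[idx].T @ L[idx] + lam * np.eye(r), L[idx].T @ T[idx, j])

est = L @ R.T
print("relative error on the missing entries:",
      np.linalg.norm((est - T)[~mask]) / np.linalg.norm(T[~mask]))
```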

Proceedings ArticleDOI
11 Aug 2009
TL;DR: In this article, a storage format for sparse matrices, called compressed sparse blocks (CSB), is introduced, which allows both Ax and A^T x to be computed efficiently in parallel, where A is an n×n sparse matrix with nnz nonzeros and x is a dense n-vector.
Abstract: This paper introduces a storage format for sparse matrices, called compressed sparse blocks (CSB), which allows both Ax and A^T x to be computed efficiently in parallel, where A is an n×n sparse matrix with nnz nonzeros and x is a dense n-vector. Our algorithms use Θ(nnz) work (serial running time) and Θ(√n lg n) span (critical-path length), yielding a parallelism of Θ(nnz/(√n lg n)), which is amply high for virtually any large matrix. The storage requirement for CSB is the same as that for the more-standard compressed-sparse-rows (CSR) format, for which computing Ax in parallel is easy but A^T x is difficult. Benchmark results indicate that on one processor, the CSB algorithms for Ax and A^T x run just as fast as the CSR algorithm for Ax, but the CSB algorithms also scale up linearly with processors until limited by off-chip memory bandwidth.
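CSB itself has no standard Python implementation that we know of; as a point of reference, a scipy sketch of the two products discussed above using the more common CSR format (the baseline the paper compares against):

```python
import numpy as np
from scipy import sparse

rng = np.random.default_rng(0)
A = sparse.random(10000, 10000, density=1e-3, format='csr', random_state=0)
x = rng.standard_normal(10000)

y1 = A @ x        # Ax: natural row-wise traversal of CSR
y2 = A.T @ x      # A^T x: scipy treats the transpose as a CSC view
print(y1[:3], y2[:3])
```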

Proceedings ArticleDOI
04 Jan 2009
TL;DR: In this paper, a two-stage algorithm that runs in O(min{mn^2, m^2n}) time and returns as output an m x k matrix C consisting of exactly k columns of A is presented.
Abstract: We consider the problem of selecting the "best" subset of exactly k columns from an m x n matrix A. In particular, we present and analyze a novel two-stage algorithm that runs in O(min{mn^2, m^2n}) time and returns as output an m x k matrix C consisting of exactly k columns of A. In the first stage (the randomized stage), the algorithm randomly selects O(k log k) columns according to a judiciously-chosen probability distribution that depends on information in the top-k right singular subspace of A. In the second stage (the deterministic stage), the algorithm applies a deterministic column-selection procedure to select and return exactly k columns from the set of columns selected in the first stage. Let C be the m x k matrix containing those k columns, let P_C denote the projection matrix onto the span of those columns, and let A_k denote the "best" rank-k approximation to the matrix A as computed with the singular value decomposition. Then, we prove that [EQUATION] with probability at least 0.7. This spectral norm bound improves upon the best previously-existing result (of Gu and Eisenstat [21]) for the spectral norm version of this Column Subset Selection Problem. We also prove that [EQUATION] with the same probability. This Frobenius norm bound is only a factor of √k log k worse than the best previously existing existential result and is roughly O(√k!) better than the best previous algorithmic result (both of Deshpande et al. [11]) for the Frobenius norm version of this Column Subset Selection Problem.
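A hedged numpy/scipy sketch of the two-stage pattern described above: a randomized stage that samples O(k log k) columns using leverage scores from the top-k right singular subspace, then a deterministic stage (pivoted QR) that keeps exactly k of them. Sampling details (e.g., with vs. without replacement, the exact candidate count) differ from the paper and are assumptions:

```python
import numpy as np
from scipy.linalg import qr

def two_stage_css(A, k, seed=0):
    rng = np.random.default_rng(seed)
    m, n = A.shape
    c = min(n, int(np.ceil(4 * k * np.log(k + 1))))        # O(k log k) candidates
    _, _, Vt = np.linalg.svd(A, full_matrices=False)
    lev = np.sum(Vt[:k] ** 2, axis=0) / k                  # leverage scores (sum to 1)
    cand = rng.choice(n, size=c, replace=False, p=lev)     # randomized stage
    _, _, piv = qr(A[:, cand], pivoting=True, mode='economic')   # deterministic stage
    return cand[piv[:k]]                                   # indices of the k chosen columns

A = np.random.default_rng(3).standard_normal((60, 200))
cols = two_stage_css(A, k=5)
print(sorted(cols.tolist()))
```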

Journal ArticleDOI
27 Jul 2009
TL;DR: A Mobius Voting algorithm that iteratively produces "votes" for predicted correspondences between the mutually closest points with magnitude representing their estimated deviation from isometry, which is converted to a permutation matrix with simple matrix operations and output as a discrete set of point correspondences with confidence values.
Abstract: The goal of our work is to develop an efficient, automatic algorithm for discovering point correspondences between surfaces that are approximately and/or partially isometric. Our approach is based on three observations. First, isometries are a subset of the Mobius group, which has low dimensionality -- six degrees of freedom for topological spheres, and three for topological discs. Second, the Mobius transformation that interpolates any three points can be computed in closed form after a mid-edge flattening to the complex plane. Third, deviations from isometry can be modeled by a transportation-type distance between corresponding points in that plane. Motivated by these observations, we have developed a Mobius Voting algorithm that iteratively: 1) samples a triplet of three random points from each of two point sets, 2) uses the Mobius transformations defined by those triplets to map both point sets into a canonical coordinate frame on the complex plane, and 3) produces "votes" for predicted correspondences between the mutually closest points with magnitude representing their estimated deviation from isometry. The result of this process is a fuzzy correspondence matrix, which is converted to a permutation matrix with simple matrix operations and output as a discrete set of point correspondences with confidence values. The main advantage of this algorithm is that it can find intrinsic point correspondences in cases of extreme deformation. During experiments with a variety of data sets, we find that it is able to find dozens of point correspondences between different object types in different poses fully automatically.
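The closed-form step mentioned above (a Mobius transformation determined by three points) is short enough to write down directly; the sample points are arbitrary assumptions:

```python
def mobius_to_canonical(z1, z2, z3):
    """Mobius map sending z1 -> 0, z2 -> 1, z3 -> infinity."""
    def f(z):
        return ((z - z1) * (z2 - z3)) / ((z - z3) * (z2 - z1))
    return f

# Aligning two triplets amounts to composing one such map with the inverse of another.
f = mobius_to_canonical(1 + 1j, 2 - 1j, -0.5 + 0.3j)
print(f(1 + 1j), f(2 - 1j))   # ~0 and ~1; the third point is sent to infinity
```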

Journal ArticleDOI
TL;DR: A new matrix learning scheme to extend relevance learning vector quantization (RLVQ), an efficient prototype-based classification algorithm, toward a general adaptive metric by introducing a full matrix of relevance factors in the distance measure.
Abstract: We propose a new matrix learning scheme to extend relevance learning vector quantization (RLVQ), an efficient prototype-based classification algorithm, toward a general adaptive metric. By introducing a full matrix of relevance factors in the distance measure, correlations between different features and their importance for the classification scheme can be taken into account and automated, and general metric adaptation takes place during training. In comparison to the weighted Euclidean metric used in RLVQ and its variations, a full matrix is more powerful to represent the internal structure of the data appropriately. Large margin generalization bounds can be transferred to this case, leading to bounds that are independent of the input dimensionality. This also holds for local metrics attached to each prototype, which corresponds to piecewise quadratic decision boundaries. The algorithm is tested in comparison to alternative learning vector quantization schemes using an artificial data set, a benchmark multiclass problem from the UCI repository, and a problem from bioinformatics, the recognition of splice sites for C. elegans.
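A minimal sketch of the adaptive quadratic distance at the heart of the scheme, d(x, w) = (x - w)^T Λ (x - w) with Λ = Ω^T Ω; here Ω and the prototypes are random placeholders rather than learned quantities, and the training updates are omitted:

```python
import numpy as np

rng = np.random.default_rng(0)
dim, n_protos = 4, 3
Omega = rng.standard_normal((dim, dim)) / dim    # in matrix LVQ this matrix is learned
Lam = Omega.T @ Omega                            # positive semidefinite relevance matrix
prototypes = rng.standard_normal((n_protos, dim))
labels = np.array([0, 1, 2])

def classify(x):
    diff = prototypes - x                              # (n_protos, dim)
    d = np.einsum('pd,de,pe->p', diff, Lam, diff)      # (x - w)^T Lam (x - w) per prototype
    return labels[np.argmin(d)]                        # nearest prototype wins

print(classify(rng.standard_normal(dim)))
```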

Posted Content
TL;DR: In this paper, the authors consider the topological string partition function, including the Nekrasov deformation, for type IIB geometries with an A n-1 singularity over a Riemann surface.
Abstract: We consider the topological string partition function, including the Nekrasov deformation, for type IIB geometries with an A_{n-1} singularity over a Riemann surface. These models realize the N=2 SU(n) superconformal gauge systems recently studied by Gaiotto and collaborators. Employing large N dualities we show why the partition function of topological strings in these backgrounds is captured by the chiral blocks of A_{n-1} Toda systems and derive the dictionary recently proposed by Alday, Gaiotto and Tachikawa. For the case of genus zero Riemann surfaces, we show how these systems can also be realized by Penner-like matrix models with logarithmic potentials. The Seiberg-Witten curve can be understood as the spectral curve of these matrix models which arises holographically at large N. In this context the Nekrasov deformation maps to the beta-ensemble of generalized matrix models, that in turn maps to the Toda system with general background charge. We also point out the notion of a double holography for this system, when both n and N are large.

Posted Content
TL;DR: In this article, a thorough discussion and development of the calculus of real-valued functions of complex-valued vectors is given using the framework of the Wirtinger Calculus, which is suitable for exposition in an introductory Electrical Engineering graduate level course on the use of complex gradients and complex Hessian matrices, and has been successfully used in teaching at UC San Diego.
Abstract: A thorough discussion and development of the calculus of real-valued functions of complex-valued vectors is given using the framework of the Wirtinger Calculus. The presented material is suitable for exposition in an introductory Electrical Engineering graduate level course on the use of complex gradients and complex Hessian matrices, and has been successfully used in teaching at UC San Diego. Going beyond the commonly encountered treatments of the first-order complex vector calculus, second-order considerations are examined in some detail filling a gap in the pedagogic literature.
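A small numerical sanity check of a standard Wirtinger-calculus identity (not taken from the report's own examples): for f(z) = z^H A z with A Hermitian, the cogradient with respect to the conjugate variable equals Az, so a first-order expansion predicts df ≈ 2 Re[(Az)^H dz]:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5
B = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
A = (B + B.conj().T) / 2                          # Hermitian matrix
z = rng.standard_normal(n) + 1j * rng.standard_normal(n)
dz = 1e-6 * (rng.standard_normal(n) + 1j * rng.standard_normal(n))

f = lambda v: (v.conj() @ A @ v).real             # real-valued function of a complex vector
predicted = 2 * np.real((A @ z).conj() @ dz)      # first-order change from the Wirtinger derivative
print(f(z + dz) - f(z), predicted)                # the two agree to first order
```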