Open Access Proceedings Article

Adaptive Forward-Backward Greedy Algorithm for Sparse Learning with Linear Models

Tong Zhang
Vol. 21, pp. 1921-1928
TL;DR
This work proposes a novel combination that is based on the forward greedy algorithm but takes backward steps adaptively whenever beneficial, and proves strong theoretical results showing that this procedure is effective in learning sparse representations.
Abstract
Consider linear prediction models where the target function is a sparse linear combination of a set of basis functions. We are interested in the problem of identifying those basis functions with non-zero coefficients and reconstructing the target function from noisy observations. Two heuristics that are widely used in practice are forward and backward greedy algorithms. First, we show that neither idea is adequate. Second, we propose a novel combination that is based on the forward greedy algorithm but takes backward steps adaptively whenever beneficial. We prove strong theoretical results showing that this procedure is effective in learning sparse representations. Experimental results support our theory.
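To make the procedure concrete, here is a minimal sketch of the forward-backward idea described in the abstract, for a least-squares objective. The stopping threshold `epsilon`, the `backward_ratio` criterion, and the iteration cap are illustrative assumptions, not the paper's exact conditions or constants.

```python
import numpy as np

def least_squares_error(X, y, support):
    """Squared error of the least-squares fit restricted to `support`."""
    if not support:
        return float(y @ y)
    Xs = X[:, sorted(support)]
    coef, *_ = np.linalg.lstsq(Xs, y, rcond=None)
    resid = y - Xs @ coef
    return float(resid @ resid)

def foba(X, y, epsilon=1e-3, backward_ratio=0.5, max_iter=100):
    """Forward-backward greedy selection (illustrative sketch).

    Forward: add the feature giving the largest error reduction; stop when
    the reduction falls below `epsilon`.  Backward: after each forward step,
    repeatedly drop the selected feature whose removal increases the error
    by less than `backward_ratio` times the last forward gain.
    """
    n, d = X.shape
    support = set()
    err = least_squares_error(X, y, support)
    for _ in range(max_iter):
        # Forward step: try every unselected feature, keep the best one.
        best_j, best_err = None, err
        for j in range(d):
            if j not in support:
                e = least_squares_error(X, y, support | {j})
                if e < best_err:
                    best_j, best_err = j, e
        gain = err - best_err
        if best_j is None or gain < epsilon:
            break
        support.add(best_j)
        err = best_err
        # Adaptive backward steps: remove features that have become nearly useless.
        while len(support) > 1:
            drop_j, drop_err = None, np.inf
            for j in support:
                e = least_squares_error(X, y, support - {j})
                if e < drop_err:
                    drop_j, drop_err = j, e
            if drop_err - err < backward_ratio * gain:
                support.remove(drop_j)
                err = drop_err
            else:
                break
    return sorted(support)
```

The returned index set can then be refit by ordinary least squares on the selected columns to obtain the sparse coefficient estimate.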



Citations
Book

Machine Learning: A Probabilistic Perspective

TL;DR: This textbook offers a comprehensive and self-contained introduction to the field of machine learning, based on a unified, probabilistic approach, and is suitable for upper-level undergraduates with an introductory-level college math background and beginning graduate students.
Journal Article

Data-driven discovery of partial differential equations.

TL;DR: In this paper, the authors propose a sparse regression method for discovering the governing partial differential equation(s) of a given system from time series measurements in the spatial domain. The method relies on sparsity-promoting techniques to select the nonlinear and partial derivative terms that most accurately represent the data, bypassing a combinatorially large search through all possible candidate models.
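As a rough illustration of the sparse-regression idea summarized above (a sequential thresholded least-squares stand-in, not the paper's exact procedure), the sketch below fits u_t = Theta @ xi and prunes small coefficients. The candidate library `Theta` (columns such as u, u_x, u_xx, u*u_x evaluated on the data) and the threshold value are assumed inputs for illustration.

```python
import numpy as np

def sparse_pde_fit(Theta, ut, threshold=0.1, n_iter=10):
    """Sequential thresholded least squares: fit u_t = Theta @ xi, then
    repeatedly zero out coefficients smaller than `threshold` and refit on
    the surviving candidate terms.  Nonzero entries of xi identify the
    terms of the recovered PDE."""
    xi, *_ = np.linalg.lstsq(Theta, ut, rcond=None)
    for _ in range(n_iter):
        keep = np.abs(xi) >= threshold
        if not keep.any():
            return np.zeros_like(xi)
        xi = np.zeros_like(xi)
        xi[keep], *_ = np.linalg.lstsq(Theta[:, keep], ut, rcond=None)
    return xi
```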
Journal Article

Analysis of Multi-stage Convex Relaxation for Sparse Regularization

TL;DR: A multi-stage convex relaxation scheme is presented for solving problems with non-convex objective functions arising from sparse regularization, and it is shown that the local solution obtained by this procedure is superior to the global solution of the standard L1 convex relaxation for learning sparse targets.
Posted Content

Multi-Label Prediction via Compressed Sensing

TL;DR: In this paper, the authors develop a general theory for a variant of the error-correcting output code scheme that uses ideas from compressed sensing to exploit output sparsity; the approach can be regarded as a simple reduction from multi-label regression problems to binary regression problems.
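A rough sketch of that reduction, assuming a Gaussian sensing matrix and ridge regression as the base learner (both illustrative choices rather than the paper's prescriptions): labels are compressed to a few random measurements at training time and recovered with a sparse solver at prediction time.

```python
import numpy as np
from sklearn.linear_model import Ridge, OrthogonalMatchingPursuit

def train_cs_multilabel(X, Y, m, seed=0):
    """Compress d-dimensional sparse label vectors to m << d random
    measurements and fit one (multi-output) regressor to the compressed
    targets."""
    rng = np.random.default_rng(seed)
    A = rng.standard_normal((m, Y.shape[1])) / np.sqrt(m)  # sensing matrix
    Z = Y @ A.T                                            # compressed labels
    reg = Ridge(alpha=1.0).fit(X, Z)
    return A, reg

def predict_cs_multilabel(A, reg, x, sparsity):
    """Predict the compressed measurements for one example, then recover a
    sparse label vector with orthogonal matching pursuit."""
    z_hat = reg.predict(x.reshape(1, -1)).ravel()
    omp = OrthogonalMatchingPursuit(n_nonzero_coefs=sparsity,
                                    fit_intercept=False)
    omp.fit(A, z_hat)
    return omp.coef_  # nonzero entries mark the predicted labels
```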
Posted Content

Submodular meets Spectral: Greedy Algorithms for Subset Selection, Sparse Approximation and Dictionary Selection

TL;DR: The submodularity ratio is introduced as a key quantity to help understand why greedy algorithms perform well even when the variables are highly correlated, and it is shown to be a stronger predictor of the performance of greedy algorithms than other spectral parameters.
References
Journal Article

Greed is good: algorithmic results for sparse approximation

TL;DR: This article presents new results on using a greedy algorithm, orthogonal matching pursuit (OMP), to solve the sparse approximation problem over redundant dictionaries and develops a sufficient condition under which OMP can identify atoms from an optimal approximation of a nonsparse signal.
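For reference, a bare-bones version of the OMP loop described here (unit-norm dictionary columns and a target sparsity k >= 1 are assumed):

```python
import numpy as np

def omp(D, y, k):
    """Orthogonal matching pursuit: greedily pick the dictionary atom most
    correlated with the current residual, then re-solve least squares on
    all selected atoms and update the residual."""
    support, coef = [], np.zeros(D.shape[1])
    residual = y.copy()
    for _ in range(k):
        j = int(np.argmax(np.abs(D.T @ residual)))  # best-matching atom
        if j not in support:
            support.append(j)
        sol, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ sol
    coef[support] = sol
    return coef
```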
Journal Article

On Model Selection Consistency of Lasso

TL;DR: It is proved that a single condition, which is called the Irrepresentable Condition, is almost necessary and sufficient for Lasso to select the true model both in the classical fixed p setting and in the large p setting as the sample size n gets large.
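For concreteness, the condition can be checked numerically for a given design matrix and an assumed true support. The sketch below evaluates the standard quantity max_j |C[S^c, S] @ inv(C[S, S]) @ sign(beta_S)|, which should stay below 1; this is the usual formulation, not the paper's exact constants.

```python
import numpy as np

def irrepresentable_value(X, support, sign_beta):
    """Return max_j |C[S^c, S] @ inv(C[S, S]) @ sign(beta_S)| with
    C = X.T @ X / n.  A value strictly below 1 indicates the (strong)
    irrepresentable condition holds for this design and sign pattern."""
    n, p = X.shape
    S = np.asarray(support)
    Sc = np.setdiff1d(np.arange(p), S)
    C = X.T @ X / n
    v = C[np.ix_(Sc, S)] @ np.linalg.solve(C[np.ix_(S, S)], sign_beta)
    return float(np.max(np.abs(v)))
```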
Journal Article

Simultaneous analysis of Lasso and Dantzig selector

TL;DR: In this article, the authors show that the Lasso estimator and the Dantzig selector exhibit similar behavior under a sparsity scenario, and they derive, in parallel, oracle inequalities for the prediction risk in the general nonparametric regression model, as well as bounds on the ℓ_p estimation loss for 1 ≤ p ≤ 2.