Proximal Methods for Hierarchical Sparse Coding

Open Access, Journal Article

Rodolphe Jenatton, +3 more

Journal of Machine Learning Research, Vol. 12, Iss. 67, pp. 2297-2334 (01 Feb 2011)

Abstract:

Sparse coding consists in representing signals as sparse linear combinations of atoms selected from a dictionary. We consider an extension of this framework where the atoms are further assumed to be embedded in a tree. This is achieved using a recently introduced tree-structured sparse regularization norm, which has proven useful in several applications. This norm leads to regularized problems that are difficult to optimize, and in this paper, we propose efficient algorithms for solving them. More precisely, we show that the proximal operator associated with this norm is computable exactly via a dual approach that can be viewed as the composition of elementary proximal operators. Our procedure has a complexity linear, or close to linear, in the number of atoms, and allows the use of accelerated gradient techniques to solve the tree-structured sparse approximation problem at the same computational cost as traditional ones using the $\ell_1$-norm. Our method is efficient and scales gracefully to millions of variables, which we illustrate in two types of applications: first, we consider fixed hierarchical dictionaries of wavelets to denoise natural images. Then, we apply our optimization tools in the context of dictionary learning, where learned dictionary elements naturally self-organize in a prespecified arborescent structure, leading to better performance in reconstruction of natural image patches. When applied to text documents, our method learns hierarchies of topics, thus providing a competitive alternative to probabilistic topic models.
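
The "composition of elementary proximal operators" mentioned in the abstract can be illustrated with a minimal Python sketch. It rests on several assumptions that are not stated on this page: each group consists of a tree node together with all of its descendants, each group is penalized by its Euclidean norm, and the tree is given as a `parent` array in which every node has a larger index than its parent (roots have parent `-1`). The names `tree_prox`, `group_soft_threshold`, `parent`, `weights`, and `lam` are hypothetical and chosen for illustration only; this is not the authors' implementation.

```python
import numpy as np


def group_soft_threshold(v, idx, threshold):
    """Shrink the sub-vector v[idx] toward zero in Euclidean norm (in place)."""
    norm = np.linalg.norm(v[idx])
    if norm <= threshold:
        v[idx] = 0.0
    else:
        v[idx] *= 1.0 - threshold / norm


def tree_prox(u, parent, weights, lam):
    """Sketch of the proximal operator of lam * sum_g weights[g] * ||v_g||_2.

    Assumption: group g contains node g and all of its descendants, and
    parent[i] < i for every non-root node i (roots have parent -1).
    """
    v = np.asarray(u, dtype=float).copy()
    n = v.size

    # Collect each node's descendants by walking from the deepest indices
    # up to the roots; groups[i] is complete by the time node i is visited.
    groups = [[i] for i in range(n)]
    for i in range(n - 1, -1, -1):
        if parent[i] >= 0:
            groups[parent[i]].extend(groups[i])

    # Apply one elementary group-thresholding step per group, visiting
    # every group before any larger group that contains it (leaves first).
    for i in range(n - 1, -1, -1):
        group_soft_threshold(v, groups[i], lam * weights[i])
    return v


if __name__ == "__main__":
    # Two-node chain: node 0 is the root, node 1 is its child.
    print(tree_prox([5.0, 0.5], parent=[-1, 0], weights=[1.0, 1.0], lam=1.0))
```

In this sketch a small child coefficient is zeroed first and the surviving root is then shrunk, which reflects the hierarchical pattern the norm encourages: a node can be nonzero only if its ancestors are. Building the explicit `groups` lists is quadratic in the worst case for deep trees; it is kept here for readability rather than to match the near-linear complexity stated in the abstract.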
