Open Access · Journal Article · DOI

The Fast Johnson-Lindenstrauss Transform and Approximate Nearest Neighbors

Nir Ailon, +1 more
- 01 May 2009 - 
- Vol. 39, Iss: 1, pp 302-322
TLDR
A new low-distortion embedding of $\ell_2^d$ into $\ell_p^{O(\log n)}$ ($p=1,2$) called the fast Johnson-Lindenstrauss transform (FJLT) is introduced, based upon the preconditioning of a sparse projection matrix with a randomized Fourier transform.
Abstract
We introduce a new low-distortion embedding of $\ell_2^d$ into $\ell_p^{O(\log n)}$ ($p=1,2$) called the fast Johnson-Lindenstrauss transform (FJLT). The FJLT is faster than standard random projections and just as easy to implement. It is based upon the preconditioning of a sparse projection matrix with a randomized Fourier transform. Sparse random projections are unsuitable for low-distortion embeddings. We overcome this handicap by exploiting the “Heisenberg principle” of the Fourier transform, i.e., its local-global duality. The FJLT can be used to speed up search algorithms based on low-distortion embeddings in $\ell_1$ and $\ell_2$. We consider the case of approximate nearest neighbors in $\ell_2^d$. We provide a faster algorithm using classical projections, which we then speed up further by plugging in the FJLT. We also give a faster algorithm for searching over the hypercube.
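The construction described in the abstract can be sketched directly: the FJLT applies a random sign flip D, a normalized Walsh-Hadamard transform H (the "randomized Fourier transform" that spreads out any sparse input), and then a sparse random projection P. Below is a minimal NumPy illustration of the map Φ = PHD, assuming d is a power of two. The function names, the 1/√k output scaling, and the fixed sparsity parameter q are illustrative choices, not the paper's exact parameterization (the paper tunes q as a function of n, d, and the target distortion ε).

```python
import numpy as np

def fwht(a):
    # Iterative fast Walsh-Hadamard transform, O(d log d); len(a) must be a power of 2.
    a = a.astype(float).copy()
    h, n = 1, len(a)
    while h < n:
        for i in range(0, n, 2 * h):
            for j in range(i, i + h):
                a[j], a[j + h] = a[j] + a[j + h], a[j] - a[j + h]
        h *= 2
    return a

def fjlt(x, k, q, rng):
    """Illustrative sketch of the FJLT map Phi = P H D applied to x in R^d.

    D: random +/-1 diagonal; H: Walsh-Hadamard transform, normalized by 1/sqrt(d);
    P: k x d sparse projection whose entries are N(0, 1/q) with probability q, else 0.
    The 1/sqrt(k) scaling makes E ||Phi x||_2^2 = ||x||_2^2 (a convention chosen here).
    """
    d = len(x)
    signs = rng.choice([-1.0, 1.0], size=d)          # diagonal of D
    u = fwht(signs * x) / np.sqrt(d)                  # H D x, norm-preserving
    mask = rng.random((k, d)) < q                     # sparsity pattern of P
    P = np.where(mask, rng.normal(0.0, 1.0 / np.sqrt(q), size=(k, d)), 0.0)
    return (P @ u) / np.sqrt(k)
```

The preconditioning step is what rescues the sparse projection: HD maps any fixed vector to one whose mass is spread nearly uniformly across coordinates (the "local-global duality" above), so a projection matrix with only a qd expected nonzeros per row still concentrates.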


Citations
Journal Article · DOI

Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions

TL;DR: An algorithm for the c-approximate nearest neighbor problem in a d-dimensional Euclidean space is presented, achieving query time O(dn^(1/c^2 + o(1))) and space O(dn + n^(1 + 1/c^2 + o(1))), which almost matches the lower bound for hashing-based algorithms recently obtained.
Journal Article · DOI

User-Friendly Tail Bounds for Sums of Random Matrices

TL;DR: This paper presents new probability inequalities for sums of independent, random, self-adjoint matrices and provides noncommutative generalizations of the classical bounds associated with the names Azuma, Bennett, Bernstein, Chernoff, Hoeffding, and McDiarmid.
Journal Article · DOI

Approximate Nearest Neighbor: Towards Removing the Curse of Dimensionality

TL;DR: Two algorithms for the approximate nearest neighbor problem in high-dimensional spaces are presented, for data sets of size n living in R^d, achieving query times that are sub-linear in n and polynomial in d.
Journal Article · DOI

Compressed sensing with coherent and redundant dictionaries

TL;DR: A condition on the measurement/sensing matrix is introduced, which is a natural generalization of the now well-known restricted isometry property, and which guarantees accurate recovery of signals that are nearly sparse in (possibly) highly overcomplete and coherent dictionaries.
References
Journal Article · DOI

Monte Carlo Sampling Methods Using Markov Chains and Their Applications

TL;DR: A generalization of the sampling method introduced by Metropolis et al. is presented, along with an exposition of the relevant theory, techniques of application, and methods and difficulties of assessing the error in Monte Carlo estimates.
Book

The Probabilistic Method

Joel Spencer
TL;DR: A particular set of problems - all dealing with “good” colorings of an underlying set of points relative to a given family of sets - is explored.
Proceedings Article · DOI

Approximate nearest neighbors: towards removing the curse of dimensionality

TL;DR: In this paper, the authors present two algorithms for the approximate nearest neighbor problem in high-dimensional spaces, for data sets of size n living in R^d, which require space that is only polynomial in n and d.
Journal Article · DOI

An optimal algorithm for approximate nearest neighbor searching in fixed dimensions

TL;DR: In this paper, it was shown that given an integer k ≥ 1, a (1 + ϵ)-approximation to the k nearest neighbors of q can be computed in additional O(kd log n) time.