Data structures and algorithms for nearest neighbor search in general metric spaces

doi:10.5555/313559.313789

Open AccessProceedings ArticleDOI

Data structures and algorithms for nearest neighbor search in general metric spaces

Peter N. Yianilos

- pp 311-321

Chats0

TLDR

The up-tree (vantage point tree) is introduced in several forms, together‘ with &&ciated algorithms, as an improved method for these difficult search problems in general metric spaces.

Abstract:

We consider the computational problem of finding nearest neighbors in general metric spaces. Of particular interest are spaces that may not be conveniently embedded or approximated in Euclidian space, or where the dimensionality of a Euclidian representation 1s very high. Also relevant are high-dimensional Euclidian settings in which the distribution of data is in some sense of lower dimension and embedded in the space. The up-tree (vantage point tree) is introduced in several forms, together‘ with &&ciated algorithms, as an improved method for these difficult search nroblems. Tree construcI tion executes in O(nlog(n i ) time, and search is under certain circumstances and in the imit, O(log(n)) expected time. The theoretical basis for this approach is developed and the results of several experiments are reported. In Euclidian cases, kd-tree performance is compared.

Citations

PDF

Open Access

More filters

Book ChapterDOI

Slim-Trees: High Performance Metric Trees Minimizing Overlap Between Nodes

Caetano Traina, +3 more

TL;DR: The Slim-tree is the first metric structure explicitly designed to reduce the degree of overlap, and new algorithms for inserting objects and splitting nodes are presented, generally without sacrificing search performance.

...read moreread less

Journal ArticleDOI

Efficient Algorithms for t-distributed Stochastic Neighborhood Embedding

George C. Linderman, +4 more

- 25 Dec 2017 -

arXiv: Learning

TL;DR: FIt-S NE, a sped-up version of t-SNE, enables visualization of rare cell types in large datasets by obviating the need for downsampling.

...read moreread less

Proceedings ArticleDOI

Searching in metric spaces by spatial approximation

Gonzalo Navarro

TL;DR: This work proposes a new data structure, called sa-tree (“spatial approximation tree”), which is based on approaching the searched objects spatially, that is, getting closer and closer to them, rather than the classic divide-and-conquer approach of other data structures.

...read moreread less

Journal ArticleDOI

The Extreme Value Machine

Ethan M. Rudd, +3 more

- 01 Mar 2018 -

IEEE Transactions on Pattern Analysis an...

TL;DR: The Extreme Value Machine (EVM) is a novel, theoretically sound classifier that has a well-grounded interpretation derived from statistical Extreme Value Theory (EVT), and is the first classifier to be able to perform nonlinear kernel-free variable bandwidth incremental learning.

...read moreread less

Proceedings ArticleDOI

discovRE: Efficient Cross-Architecture Identification of Bugs in Binary Code.

Sebastian Eschweiler, +2 more

TL;DR: A new approach to efficiently search for similar functions in binary code, called discovRE, that supports four instruction set architectures (x86, x64, ARM, MIPS) and is four orders of magnitude faster than the state-of-the-art academic approach for cross-architecture bug search in binaries.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Introduction to Statistical Pattern Recognition

Keinosuke Fukunaga

TL;DR: This completely revised second edition presents an introduction to statistical pattern recognition, which is appropriate as a text for introductory courses in pattern recognition and as a reference book for workers in the field.

...read moreread less

Journal ArticleDOI

Voronoi diagrams—a survey of a fundamental geometric data structure

Franz Aurenhammer

- 01 Sep 1991 -

ACM Computing Surveys

TL;DR: The Voronoi diagram as discussed by the authors divides the plane according to the nearest-neighbor points in the plane, and then divides the vertices of the plane into vertices, where vertices correspond to vertices in a plane.

...read moreread less

Journal ArticleDOI

An Algorithm for Finding Best Matches in Logarithmic Expected Time

Jerome H. Friedman, +2 more

- 01 Sep 1977 -

ACM Transactions on Mathematical Softwar...

TL;DR: An algorithm and data structure are presented for searching a file containing N records, each described by k real valued keys, for the m closest matches or nearest neighbors to a given query record.

...read moreread less

Book

Nearest neighbor (NN) norms: NN pattern classification techniques

Belur V. Dasarathy

Journal ArticleDOI

A Branch and Bound Algorithm for Computing k-Nearest Neighbors

Keinosuke Fukunaga, +1 more

- 01 Jul 1975 -

IEEE Transactions on Computers

TL;DR: The method of branch and bound is implemented in the present algorithm to facilitate rapid calculation of the k-nearest neighbors, by eliminating the necesssity of calculating many distances.

...read moreread less