scispace - formally typeset
Open AccessProceedings ArticleDOI

Data structures and algorithms for nearest neighbor search in general metric spaces

Reads0
Chats0
TLDR
The up-tree (vantage point tree) is introduced in several forms, together‘ with &&ciated algorithms, as an improved method for these difficult search problems in general metric spaces.
Abstract
We consider the computational problem of finding nearest neighbors in general metric spaces. Of particular interest are spaces that may not be conveniently embedded or approximated in Euclidian space, or where the dimensionality of a Euclidian representation 1s very high. Also relevant are high-dimensional Euclidian settings in which the distribution of data is in some sense of lower dimension and embedded in the space. The up-tree (vantage point tree) is introduced in several forms, together‘ with &&ciated algorithms, as an improved method for these difficult search nroblems. Tree construcI tion executes in O(nlog(n i ) time, and search is under certain circumstances and in the imit, O(log(n)) expected time. The theoretical basis for this approach is developed and the results of several experiments are reported. In Euclidian cases, kd-tree performance is compared.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

An 86 mW 98GOPS ANN-Searching Processor for Full-HD 30 fps Video Object Recognition With Zeroless Locality-Sensitive Hashing

TL;DR: A high throughput ANN-searching processor is proposed for high-resolution (full-HD) and real-time (30 fps) video object recognition and adopts an interframe cache architecture as a hardware-oriented approach and a zeroless locality-sensitive-hashing (zeroless-LSH) algorithm as a software- oriented approach to reduce the external memory bandwidth required in nearest neighbor searching.
Journal ArticleDOI

Lower Bounds on Performance of Metric Tree Indexing Schemes for Exact Similarity Search in High Dimensions

TL;DR: In this paper, the authors analyzed the curse of dimensionality for deterministic exact similarity search in the context of popular indexing schemes, such as metric trees, and deduced the Ω(n 1/4) lower bound on the expected average case performance of hierarchical metric tree based indexing.
Proceedings ArticleDOI

Beyond Near Duplicates: Learning Hash Codes for Efficient Similar-Image Retrieval

TL;DR: This paper presents a two-tier similar-image retrieval system with the efficiency characteristics found in simpler systems designed to recognize near-duplicates and compares the efficiency of lookups based on random projections and learned hashes to 100-times-more-frequent exemplar sampling.
Journal ArticleDOI

DBM-Tree: trading height-balancing for performance in metric access methods

TL;DR: A new dynamic MAM called the DBM-tree (Density-Based Metric tree), which can minimize the overlap between high-density nodes by relaxing the height-balancing of the structure, and an optimization algorithm called Shrink is presented, which improves the performance of an already builtDBM-tree by reorganizing the elements among their nodes.
Patent

Image searching by approximate κ-NN graph

TL;DR: In this paper, the approximate k-NN graph is constructed from data points partitioned into subsets to further identify nearest-neighboring data points for each data point.
References
More filters
Book

Introduction to Statistical Pattern Recognition

TL;DR: This completely revised second edition presents an introduction to statistical pattern recognition, which is appropriate as a text for introductory courses in pattern recognition and as a reference book for workers in the field.
Journal ArticleDOI

Voronoi diagrams—a survey of a fundamental geometric data structure

TL;DR: The Voronoi diagram as discussed by the authors divides the plane according to the nearest-neighbor points in the plane, and then divides the vertices of the plane into vertices, where vertices correspond to vertices in a plane.
Journal ArticleDOI

An Algorithm for Finding Best Matches in Logarithmic Expected Time

TL;DR: An algorithm and data structure are presented for searching a file containing N records, each described by k real valued keys, for the m closest matches or nearest neighbors to a given query record.
Journal ArticleDOI

A Branch and Bound Algorithm for Computing k-Nearest Neighbors

TL;DR: The method of branch and bound is implemented in the present algorithm to facilitate rapid calculation of the k-nearest neighbors, by eliminating the necesssity of calculating many distances.
Related Papers (5)