scispace - formally typeset
Open AccessProceedings ArticleDOI

Data structures and algorithms for nearest neighbor search in general metric spaces

Reads0
Chats0
TLDR
The up-tree (vantage point tree) is introduced in several forms, together‘ with &&ciated algorithms, as an improved method for these difficult search problems in general metric spaces.
Abstract
We consider the computational problem of finding nearest neighbors in general metric spaces. Of particular interest are spaces that may not be conveniently embedded or approximated in Euclidian space, or where the dimensionality of a Euclidian representation 1s very high. Also relevant are high-dimensional Euclidian settings in which the distribution of data is in some sense of lower dimension and embedded in the space. The up-tree (vantage point tree) is introduced in several forms, together‘ with &&ciated algorithms, as an improved method for these difficult search nroblems. Tree construcI tion executes in O(nlog(n i ) time, and search is under certain circumstances and in the imit, O(log(n)) expected time. The theoretical basis for this approach is developed and the results of several experiments are reported. In Euclidian cases, kd-tree performance is compared.

read more

Content maybe subject to copyright    Report

Citations
More filters

Approximate Best Bin First k-d Tree All Nearest Neighbor Search with Incremental Updates

TL;DR: An approximate algorithm to find all nearest neighbors (NN) for a set of points in moderate to high-dimensional spaces is described and it is observed that the new method is superior for exact search and high number of points as well as for approximate search in small to moderate dimensions or when a fast approximation is needed.
Proceedings ArticleDOI

Treelogy: A benchmark suite for tree traversals

TL;DR: This work presents a suite of tree traversal kernels, drawn from diverse domains, called Treelogy, to explore the connection betweenTree traversal algorithms and state-of-the-art optimizations, and characterize these algorithms by developing an ontology based on their structural properties.
Book ChapterDOI

Searching High-Dimensional Neighbours: CPU-Based Tailored Data-Structures Versus GPU-Based Brute-Force Method

TL;DR: It is shown that the proposed implementation of the brute-force algorithm using GPU (Graphics Processing Units) programming is up to 150 times faster than the classical approaches on synthetic data, and up to 75 times faster on real image processing algorithms (finding similar patches in images and texture synthesis).
Book ChapterDOI

Simple space-time trade-offs for AESA

TL;DR: This work considers indexing and range searching in metric spaces and extends AESA to use b distance bounds, requiring Θ(n2 log2(b)) bits of storage, and proposes several improvements, achieving e.g. O(n1+α) construction cost for some 0 < α < 1, and a variant using even less space.
Patent

Image compression using exemplar dictionary based on hierarchical clustering

TL;DR: In this paper, an exemplar dictionary is built from example image blocks for determining predictor blocks for encoding and decoding images, which comprises a hierarchical organization of image blocks, and performance of clustering is improved by transforming feature vectors representing the image blocks to fewer dimensions.
References
More filters
Book

Introduction to Statistical Pattern Recognition

TL;DR: This completely revised second edition presents an introduction to statistical pattern recognition, which is appropriate as a text for introductory courses in pattern recognition and as a reference book for workers in the field.
Journal ArticleDOI

Voronoi diagrams—a survey of a fundamental geometric data structure

TL;DR: The Voronoi diagram as discussed by the authors divides the plane according to the nearest-neighbor points in the plane, and then divides the vertices of the plane into vertices, where vertices correspond to vertices in a plane.
Journal ArticleDOI

An Algorithm for Finding Best Matches in Logarithmic Expected Time

TL;DR: An algorithm and data structure are presented for searching a file containing N records, each described by k real valued keys, for the m closest matches or nearest neighbors to a given query record.
Journal ArticleDOI

A Branch and Bound Algorithm for Computing k-Nearest Neighbors

TL;DR: The method of branch and bound is implemented in the present algorithm to facilitate rapid calculation of the k-nearest neighbors, by eliminating the necesssity of calculating many distances.
Related Papers (5)