scispace - formally typeset
Proceedings ArticleDOI

Data-oriented locality sensitive hashing

Reads0
Chats0
TLDR
Data-Oriented LSH is proposed to reduce memory consumption when data are non-uniformly distributed and focused on the hash table construction, and thus the query-directed methods can be applied to the index to improve further.
Abstract
Locality Sensitive Hashing (LSH) has been proposed as a scalable and high-dimensional index for approximate similarity search. Euclidean LSH is a variation of LSH and has been successfully used in many multimedia applications. However, hash functions of the basic Euclidean LSH project data points over randomly selected directions, which reduces accuracy when data are non-uniformly distributed. So more hash tables are needed to guarantee the accuracy, and thus more memory is consumed. Since heavy memory cost is a significant drawback of Euclidean LSH, we propose Data-Oriented LSH to reduce memory consumption when data are non-uniformly distributed. Most of existing methods are query-directed, such as multi-probe and query expansion methods. We focused on the hash table construction, and thus the query-directed methods can be applied to our index to improve further. The experiment shows that to achieve the same accuracy, our method uses less time and less memory compared with original Euclidean LSH.

read more

Citations
More filters
Journal ArticleDOI

Content based image retrieval using deep learning process

TL;DR: The deep belief network (DBN) method of deep learning is used to extract the features and classification and is an emerging research area, because of the generation of large volume of data.
Journal ArticleDOI

A Privacy-Preserving Framework for Large-Scale Content-Based Information Retrieval

TL;DR: The concept oftunable privacy is introduced, where the privacy protection level can be adjusted according to a policy, through hash-based piecewise inverted indexing, which shows that the privacy enhancement slightly improves the retrieval performance.
Proceedings ArticleDOI

Query by example in large-scale code repositories

TL;DR: This paper proposes a solution for the query by example problem using Abstract Syntax Tree (AST) structural similarity match and shows that the algorithm can achieve high precision and recall and scale to large code repositories without compromising quality.
Journal ArticleDOI

Compact Image Fingerprint Via Multiple Kernel Hashing

TL;DR: To enable fast fingerprints searching over a very large database, a new kernelized multiple feature hashing method is proposed to convert the real- value fingerprints into compact binary-value fingerprints.
Journal ArticleDOI

Nonnegative sparse coding induced hashing for image copy detection

TL;DR: This work combines the constrained nonnegative sparse coding and the Support Vector Machine to propose a new hashing method, called non negative sparse coding induced hashing (NSCIH), which is superior to the state-of-the-art hashing methods.
References
More filters
Proceedings ArticleDOI

Scalable Recognition with a Vocabulary Tree

TL;DR: A recognition scheme that scales efficiently to a large number of objects and allows a larger and more discriminatory vocabulary to be used efficiently is presented, which it is shown experimentally leads to a dramatic improvement in retrieval quality.
Journal ArticleDOI

A Comparison of Affine Region Detectors

TL;DR: A snapshot of the state of the art in affine covariant region detectors, and compares their performance on a set of test images under varying imaging conditions to establish a reference test set of images and performance software so that future detectors can be evaluated in the same framework.
Proceedings ArticleDOI

Locality-sensitive hashing scheme based on p-stable distributions

TL;DR: A novel Locality-Sensitive Hashing scheme for the Approximate Nearest Neighbor Problem under lp norm, based on p-stable distributions that improves the running time of the earlier algorithm and yields the first known provably efficient approximate NN algorithm for the case p<1.
Journal ArticleDOI

Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions

TL;DR: An algorithm for the c-approximate nearest neighbor problem in a d-dimensional Euclidean space, achieving query time of O(dn 1c2/+o(1)) and space O(DN + n1+1c2 + o(1) + 1/c2), which almost matches the lower bound for hashing-based algorithm recently obtained.
Book

Similarity Search: The Metric Space Approach

TL;DR: Similarity Search focuses on the state of the art in developing index structures for searching the metric space, and provides an extensive survey of specific techniques for a large range of applications.
Related Papers (5)