Topic

Locality-sensitive hashing

About: Locality-sensitive hashing is a research topic. Over the lifetime, 1894 publications have been published within this topic receiving 69362 citations.

...read moreread less

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•

Fast likelihood computation for continuous-mixture densities using a tree-based nearest neighbor search.

[...]

Frank Seide

01 Jan 1995

14 citations

Journal Article•DOI•

Manifold-ranking embedded order preserving hashing for image semantic retrieval

[...]

Lei Ma¹, Hongliang Li¹, Fanman Meng¹, Qingbo Wu¹, Linfeng Xu¹ - Show less +1 more•Institutions (1)

University of Electronic Science and Technology of China¹

01 Apr 2017-Journal of Visual Communication and Image Representation

TL;DR: A novel unsupervised hashing approach, namely Manifold-Ranking Embedded Order Preserving Hashing (MREOPH), which introduces a manifold ranking loss and an order preserving loss to solve the issue of global topological structure preserving.

...read moreread less

14 citations

Journal Article•DOI•

Image retrieval with query-adaptive hashing

[...]

Dong Liu¹, Shuicheng Yan², Rongrong Ji¹, Xian-Sheng Hua³, Hong-Jiang Zhang⁴ - Show less +1 more•Institutions (4)

Harbin Institute of Technology¹, National University of Singapore², Microsoft³, Advanced Technology Center⁴

19 Feb 2013-ACM Transactions on Multimedia Computing, Communications, and Applications

TL;DR: Extensive experiments over three benchmark image datasets well demonstrate the superiority of the proposed query-adaptive hashing method over the state-of-the-art ones in terms of retrieval accuracy.

...read moreread less

Abstract: Hashing-based approximate nearest-neighbor search may well realize scalable content-based image retrieval. The existing semantic-preserving hashing methods leverage the labeled data to learn a fixed set of semantic-aware hash functions. However, a fixed hash function set is unable to well encode all semantic information simultaneously, and ignores the specific user's search intention conveyed by the query. In this article, we propose a query-adaptive hashing method which is able to generate the most appropriate binary codes for different queries. Specifically, a set of semantic-biased discriminant projection matrices are first learnt for each of the semantic concepts, through which a semantic-adaptable hash function set is learnt via a joint sparsity variable selection model. At query time, we further use the sparsity representation procedure to select the most appropriate hash function subset that is informative to the semantic information conveyed by the query. Extensive experiments over three benchmark image datasets well demonstrate the superiority of our proposed query-adaptive hashing method over the state-of-the-art ones in terms of retrieval accuracy.

...read moreread less

14 citations

Journal Article•

Learning Approximate Sequential Patterns for Classification

[...]

Zeeshan Syed, Piotr Indyk¹, John V. Guttag¹•Institutions (1)

Massachusetts Institute of Technology¹

01 Dec 2009-Journal of Machine Learning Research

TL;DR: The pattern discovery approach identified approximately conserved sequences of morphology variations that were predictive of future death in a test population and improved the running time of the search algorithm by an order of magnitude without any noticeable effect on accuracy.

...read moreread less

Abstract: In this paper, we present an automated approach to discover patterns that can distinguish between sequences belonging to different labeled groups. Our method searches for approximately conserved motifs that occur with varying statistical properties in positive and negative training examples. We propose a two-step process to discover such patterns. Using locality sensitive hashing (LSH), we first estimate the frequency of all subsequences and their approximate matches within a given Hamming radius in labeled examples. The discriminative ability of each pattern is then assessed from the estimated frequencies by concordance and rank sum testing. The use of LSH to identify approximate matches for each candidate pattern helps reduce the runtime of our method. Space requirements are reduced by decomposing the search problem into an iterative method that uses a single LSH table in memory. We propose two further optimizations to the search for discriminative patterns. Clustering with redundancy based on a 2-approximate solution of the k-center problem decreases the number of overlapping approximate groups while providing exhaustive coverage of the search space. Sequential statistical methods allow the search process to use data from only as many training examples as are needed to assess significance. We evaluated our algorithm on data sets from different applications to discover sequential patterns for classification. On nucleotide sequences from the Drosophila genome compared with random background sequences, our method was able to discover approximate binding sites that were preserved upstream of genes. We observed a similar result in experiments on ChIP-on-chip data. For cardiovascular data from patients admitted with acute coronary syndromes, our pattern discovery approach identified approximately conserved sequences of morphology variations that were predictive of future death in a test population. Our data showed that the use of LSH, clustering, and sequential statistics improved the running time of the search algorithm by an order of magnitude without any noticeable effect on accuracy. These results suggest that our methods may allow for an unsupervised approach to efficiently learn interesting dissimilarities between positive and negative examples that may have a functional role.

...read moreread less

14 citations

Patent•

Method and system for entropy-based semantic hashing

[...]

Ruei-Sung Lin¹, David A. Ross¹, Jay Yagnik¹•Institutions (1)

Google¹

04 Jun 2010

TL;DR: In this paper, the authors describe methods, systems and articles of manufacture for identifying semantic nearest neighbors in a feature space, which includes generating an affinity matrix for objects in a given feature space and training a multi-bit hash function using a greedy algorithm.

...read moreread less

Abstract: Methods, systems and articles of manufacture for identifying semantic nearest neighbors in a feature space are described herein. A method embodiment includes generating an affinity matrix for objects in a given feature space, wherein the affinity matrix identifies the semantic similarity between each pair of objects in the feature space, training a multi-bit hash function using a greedy algorithm that increases the Hamming distance between dissimilar objects in the feature space while minimizing the Hamming distance between similar objects, and identifying semantic nearest neighbors for an object in a second feature space using the multi-bit hash function. A system embodiment includes a hash generator configured to generate the affinity matrix and train the multi-bit hash function, and a similarity determiner configured to identify semantic nearest neighbors for an object in a second feature space using the multi-bit hash function.

...read moreread less

14 citations

Collapse

Network Information

Performance

Metrics

2,048

Papers

77,891

Citations

No. of papers in the topic in previous years
Year	Papers
2023	43
2022	108
2021	88
2020	110
2019	104
2018	139

Locality-sensitive hashing

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics