Topic

Locality-sensitive hashing

About: Locality-sensitive hashing is a research topic. Over its lifetime, 1,894 publications have appeared on this topic, receiving 69,362 citations.


Papers
Proceedings ArticleDOI
01 Oct 2013
TL;DR: It is pointed out that the approach via the cavity method extends quite naturally to the analysis of double hashing and allows computing the corresponding threshold; the paper shows that the graph induced by the double hashing scheme has the same local weak limit as the one obtained with full randomness.
Abstract: A lot of interest has recently arisen in the analysis of multiple-choice “cuckoo hashing” schemes. In this context, a main performance criterion is the load threshold under which the hashing scheme is able to build a valid hashtable with high probability in the limit of large systems; various techniques have successfully been used to answer this question (differential equations, combinatorics, cavity method) for increasing levels of generality of the model. However, the hashing scheme analysed so far is quite utopian in that it requires generating a lot of independent, fully random choices. Schemes with reduced randomness exist, such as “double hashing”, which is expected to provide asymptotic results similar to those of the ideal scheme, yet they have been more resistant to analysis so far. In this paper, we point out that the approach via the cavity method extends quite naturally to the analysis of double hashing and allows us to compute the corresponding threshold. The path followed is to show that the graph induced by the double hashing scheme has the same local weak limit as the one obtained with full randomness.
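
To make the reduced-randomness scheme concrete, here is a minimal Python sketch of how double hashing derives the k candidate buckets used in multiple-choice cuckoo hashing. The two salted hash functions are illustrative stand-ins, not the paper's construction; in practice one would use independent hash functions and a prime table size.

```python
# A minimal sketch of double hashing for multiple-choice cuckoo hashing:
# instead of k independent, fully random choices, only two hash values
# are generated, and choice i is (f(key) + i * g(key)) mod m.

def double_hash_choices(key, k, m):
    """Return the k candidate buckets for `key` in a table of size m."""
    f = hash(("f-salt", key)) % m        # illustrative stand-in hash
    g = hash(("g-salt", key)) % m or 1   # keep the stride non-zero
    return [(f + i * g) % m for i in range(k)]

# Toy usage: 3 candidate buckets in a table of prime size 101.
print(double_hash_choices("alice", k=3, m=101))
```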

13 citations

Proceedings ArticleDOI
01 May 2018
TL;DR: Experimental results show that the proposed model is more accurate and robust than traditional models and is well suited for speaker recognition.
Abstract: Mel-Frequency Cepstral Coefficients (MFCC) features can be used in speaker recognition. Extracting features from the speech signal as MFCC feature vectors yields an acoustic representation of the signal. Locality Sensitive Hashing (LSH) is frequently used as a classifier for Big Data problems. In this research, we propose a new model that integrates MFCC and LSH into a speaker recognition model. The main benefits of the newly proposed model are robust, effective, and accurate results in comparison with the MFCC+GMM, LPCC+GMM, and MFCC+PNN models; the model also contributes to the Big Data literature. In this model, we first extract the MFCC features from the wave file, then apply the LSH classifier to the extracted features to transform them into a hash table. Finally, the hash tables of the training and test wave files are compared, obtaining 92.66% speaker recognition accuracy. We compared the accuracy of the proposed model with the traditional MFCC+GMM, MFCC+PNN, and LPCC+GMM models; experimental results show that the proposed model is more accurate and robust than the traditional models and is well suited for speaker recognition.
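
As a rough illustration of the described pipeline (not the authors' implementation), the following Python sketch hashes per-frame feature vectors with a random-hyperplane LSH and compares two recordings by bucket overlap. The MFCC frames here are random stand-ins; in practice they could come from a library such as librosa (librosa.feature.mfcc(y=signal, sr=sr).T yields a frames-by-coefficients matrix).

```python
# Sketch: random-hyperplane LSH over MFCC frames, with recordings
# compared by the Jaccard overlap of their occupied hash buckets.
import numpy as np

rng = np.random.default_rng(0)

def lsh_codes(frames, planes):
    """Hash each frame to a tuple of sign bits (one hash table)."""
    bits = frames @ planes.T > 0          # (n_frames, n_bits) booleans
    return {tuple(row) for row in bits}

def similarity(train_frames, test_frames, n_bits=16, dim=13):
    planes = rng.standard_normal((n_bits, dim))   # shared hyperplanes
    a = lsh_codes(train_frames, planes)
    b = lsh_codes(test_frames, planes)
    return len(a & b) / max(len(a | b), 1)        # bucket overlap

# Toy usage with random stand-ins for 13-dimensional MFCC frames:
train = rng.standard_normal((200, 13))
test = train + 0.05 * rng.standard_normal((200, 13))  # similar "speaker"
print(similarity(train, test))
```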

13 citations

Book ChapterDOI
05 Jan 2011
TL;DR: This paper proposes a new high-dimensional NN search method, called Randomly Projected kd-Trees (RP-kd-Trees), which projects data points into a lower-dimensional space so as to exploit the advantage of multiple kd-trees on low-dimensional data.
Abstract: Efficient nearest neighbor (NN) search techniques for high-dimensional data are crucial to content-based image retrieval (CBIR). Traditional data structures (e.g., the kd-tree) are usually efficient only for low-dimensional data, and often perform no better than a simple exhaustive linear search when the number of dimensions grows large. Recently, approximate NN search techniques have been proposed for high-dimensional search, such as Locality-Sensitive Hashing (LSH), which adopts a random projection approach. Motivated by a similar idea, in this paper we propose a new high-dimensional NN search method, called Randomly Projected kd-Trees (RP-kd-Trees), which projects data points into a lower-dimensional space so as to exploit the advantage of multiple kd-trees over low-dimensional data. Based on the proposed framework, we present an enhanced RP-kd-Trees scheme by applying distance metric learning techniques. We conducted extensive empirical studies on CBIR, which showed that our technique achieved faster search performance with better retrieval quality than regular LSH algorithms.
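
The following Python sketch illustrates the stated idea under assumed parameters (n_trees, proj_dim, and the candidate count are illustrative choices, not the paper's settings): project the data with random Gaussian matrices, build one kd-tree per projection (scipy.spatial.cKDTree here), gather candidates from all trees, and re-rank them in the original space.

```python
# Sketch of the RP-kd-Trees idea: multiple kd-trees over random
# low-dimensional projections, with exact re-ranking in full dimension.
import numpy as np
from scipy.spatial import cKDTree

rng = np.random.default_rng(0)

def build_rp_kdtrees(data, n_trees=4, proj_dim=8):
    trees = []
    for _ in range(n_trees):
        P = rng.standard_normal((data.shape[1], proj_dim))  # random projection
        trees.append((P, cKDTree(data @ P)))
    return trees

def query(trees, data, q, k=5, candidates_per_tree=20):
    cand = set()
    for P, tree in trees:
        _, idx = tree.query(q @ P, k=candidates_per_tree)   # per-tree candidates
        cand.update(idx.tolist())
    cand = np.fromiter(cand, dtype=int)
    d = np.linalg.norm(data[cand] - q, axis=1)              # re-rank in full space
    return cand[np.argsort(d)[:k]]

data = rng.standard_normal((1000, 128))
trees = build_rp_kdtrees(data)
print(query(trees, data, data[0]))
```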

13 citations

Proceedings ArticleDOI
14 Jul 2014
TL;DR: A novel data-driven hashing method called forest hashing, which uses multiple tree structures to hash data; by leveraging the index structure of trees, it generates balanced hash buckets and thereby significantly improves hashing efficacy.
Abstract: Indexing images and videos using binary hash bits has shown promising results for fast similarity search. Existing data-driven hashing methods learn compact hash codes from the data, but usually at the cost of generating unbalanced hash buckets, thus affecting search efficiency. We propose a novel data-driven hashing method called forest hashing, which utilizes multiple tree structures to perform data hashing. By leveraging the index structure of trees, we can significantly improve the hashing efficacy by generating balanced hash buckets. Moreover, forest hashing naturally supports scalable coding, where more trees improve the coding quality with a longer code. Last but not least, our forest hashing can be easily extended for semantic search by integrating semi-supervised label information. Experiments on two benchmark datasets show favorable results compared with state-of-the-art hashing methods.
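
A minimal Python sketch of the balanced-bucket idea, assuming median splits on randomly chosen features (an illustrative reconstruction, not the paper's exact forest hashing algorithm): each tree's root-to-leaf path bits form a hash code, and splitting every node at its median keeps all buckets roughly the same size.

```python
# Sketch: tree-based hashing with median splits, so the resulting hash
# buckets (leaves) are near-balanced by construction.
import numpy as np

rng = np.random.default_rng(0)

def tree_hash(data, depth=4):
    """Return a `depth`-bit code per point from one median-split tree."""
    codes = np.zeros(len(data), dtype=int)
    groups = [np.arange(len(data))]
    for _ in range(depth):
        next_groups = []
        for idx in groups:                      # split each node at its median
            f = rng.integers(data.shape[1])     # random feature per node
            thr = np.median(data[idx, f])
            right = data[idx, f] > thr
            codes[idx] = (codes[idx] << 1) | right
            next_groups += [idx[~right], idx[right]]
        groups = next_groups
    return codes

data = rng.standard_normal((64, 32))
codes = [tree_hash(data) for _ in range(3)]     # a small "forest" of 3 trees
print(np.bincount(codes[0], minlength=16))      # bucket sizes are near-equal
```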

13 citations


Network Information
Related Topics (5)

Deep learning: 79.8K papers, 2.1M citations, 84% related
Feature extraction: 111.8K papers, 2.1M citations, 83% related
Convolutional neural network: 74.7K papers, 2M citations, 83% related
Feature (computer vision): 128.2K papers, 1.7M citations, 82% related
Support vector machine: 73.6K papers, 1.7M citations, 82% related
Performance
Metrics
No. of papers in the topic in previous years
Year    Papers
2023    43
2022    108
2021    88
2020    110
2019    104
2018    139