Home
/
Topics
/
Locality-sensitive hashing

Topic

Locality-sensitive hashing

About: Locality-sensitive hashing is a research topic. Over the lifetime, 1894 publications have been published within this topic receiving 69362 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1988
1987
1985
1983
1982
1981
1980
1979
1978
1977
1976
1975
1970

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Shape Descriptor Based Document Image Indexing and Symbol Recognition

[...]

Ehtesham Hassan, Santanu Chaudhury, M. Gopal

26 Jul 2009

TL;DR: A novel shape descriptor based on shape context, which in combination with hierarchical distance based hashing is used for word and graphical pattern based document image indexing and retrieval and the applicability is demonstrated for classification of characters and symbols.

...read moreread less

Abstract: In this paper we present a novel shape descriptor based on shape context, which in combination with hierarchical distance based hashing is used for word and graphical pattern based document image indexing and retrieval. The shape descriptor represents the relative arrangement of points sampled on the boundary of the shape of object. We also demonstrate the applicability of the novel shape descriptor for classification of characters and symbols. For indexing, we provide anew formulation for distance based hierarchical locality sensitive hashing. Experiments have yielded promising results.

...read moreread less

16 citations

Proceedings Article•

Efficient Clustering of Metagenomic Sequences using Locality Sensitive Hashing.

[...]

Zeehasham Rasheed¹, Huzefa Rangwala¹, Daniel Barbará¹•Institutions (1)

George Mason University¹

01 Jan 2012

TL;DR: An efficient and accurate metagenome clustering approach that uses the locality sensitive hashing (LSH) technique to approximate the computational complexity associated with comparing sequences, and introduces the use of fixed-length, gapless subsequences for improving the sensitivity of the LSH-based similarity function.

...read moreread less

Abstract: The new generation of genomic technologies have allowed researchers to determine the collective DNA of organisms (e.g., microbes) co-existing as communities across the ecosystem (e.g., within the human host). There is a need for the computational approaches to analyze and annotate the large volumes of available sequence data from such microbial communities (metagenomes). In this paper, we developed an efficient and accurate metagenome clustering approach that uses the locality sensitive hashing (LSH) technique to approximate the computational complexity associated with comparing sequences. We introduce the use of fixed-length, gapless subsequences for improving the sensitivity of the LSH-based similarity function. We evaluate the performance of our algorithm on two metagenome datasets associated with microbes existing across different human skin locations. Our empirical results show the strength of the developed approach in comparison to three state-of-the-art sequence clustering algorithms with regards to computational efficiency and clustering quality. We also demonstrate practical significance for the developed clustering algorithm, to compare bacterial diversity and structure across different skin locations.

...read moreread less

16 citations

Proceedings Article•DOI•

Searching with expectations

[...]

Harsimrat Sandhawalia¹, Hervé Jégou¹•Institutions (1)

French Institute for Research in Computer Science and Automation¹

14 Mar 2010

TL;DR: The problem of generating compact signatures as a rate-distortion problem is formulated and the aim is at minimizing the reconstruction error on the squared distances with a constraint on the memory usage.

...read moreread less

Abstract: Handling large amounts of data, such as large image databases, requires the use of approximate nearest neighbor search techniques. Recently, Hamming embedding methods such as spectral hashing have addressed the problem of obtaining compact binary codes optimizing the trade-off between the memory usage and the probability of retrieving the true nearest neighbors. In this paper, we formulate the problem of generating compact signatures as a rate-distortion problem. In the spirit of source coding algorithms, we aim at minimizing the reconstruction error on the squared distances with a constraint on the memory usage. The vectors are ranked based on the distance estimates to the query vector. Experiments on image descriptors show a significant improvement over spectral hashing.

...read moreread less

16 citations

Journal Article•DOI•

Forecasting model for wind power integrating least squares support vector machine, singular spectrum analysis, deep belief network, and locality‐sensitive hashing

[...]

Arslan Habib¹, Rabeh Abbassi², Rabeh Abbassi³, Andrés Julián Aristizábal⁴, Abdelkader Abbassi² - Show less +1 more•Institutions (4)

University of Strathclyde¹, University of Kairouan², Tunis University³, Universidad de Bogotá Jorge Tadeo Lozano⁴

01 Feb 2020-Wind Energy

16 citations

Proceedings Article•DOI•

The ANN-tree: an index for efficient approximate nearest neighbor search

[...]

King-Ip Lin¹, Congjun Yang¹•Institutions (1)

University of Memphis¹

18 Apr 2001

TL;DR: This work proposes an index structure, the ANN-tree (approximate nearest neighbor tree), which is demonstrably more efficient than existing structures like the R*-tree and is a preferable index structure for both exact and approximate nearest neighbor searches.

...read moreread less

Abstract: We explore the problem of approximate nearest neighbor searches. We propose an index structure, the ANN-tree (approximate nearest neighbor tree) to solve this problem. The ANN-tree supports high accuracy nearest neighbor search. The actual nearest neighbor of a query point can usually be found in the first leaf page accessed. The accuracy increases to near 100% if a second page is accessed. This is not achievable via traditional indexes. Even if an exact nearest neighbor query is desired, the ANN-tree is demonstrably more efficient than existing structures like the R*-tree. This makes the ANN-tree a preferable index structure for both exact and approximate nearest neighbor searches. We present the index in detail and provide experimental results on both real and synthetic data sets.

...read moreread less

16 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
…
120
121
122
123
124
125
126
…
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

2,048

Papers

77,891

Citations

No. of papers in the topic in previous years
Year	Papers
2023	43
2022	108
2021	88
2020	110
2019	104
2018	139

Locality-sensitive hashing

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics