Home
/
Topics
/
Approximate string matching

Topic

Approximate string matching

About: Approximate string matching is a research topic. Over the lifetime, 1903 publications have been published within this topic receiving 62352 citations. The topic is also known as: fuzzy string-searching algorithm & fuzzy string-matching algorithm.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1987
1986
1985
1984
1983
1982
1981
1980
1979
1978
1977
1976
1975
1974
1973

Papers

PDF

Open Access

More filters

Patent•

Pattern string matching apparatus and pattern string matching method

[...]

Akagi Takuma¹•Institutions (1)

Toshiba¹

31 Jul 2000

TL;DR: In this article, the authors compare each character of a first character string with each characters of a second character string, vote for a matrix having two sides corresponding to the characters of the first character strings and the characters from the second character strings, and calculate values of the voting result for respective components arranged in an oblique direction of the matrix.

...read moreread less

Abstract: This invention is to compare each character of a first character string with each character of a second character string, vote for a matrix having two sides corresponding to the characters of the first character string and the characters of the second character string and calculate values of the voting result for respective components arranged in an oblique direction of the matrix The matching result is determined based on the calculated values of the voting result As a result, a high-speed and highly precise matching process which is noise-resistant and takes the character arrangement into consideration can be attained

...read moreread less

14 citations

Journal Article•DOI•

Towards optimal packed string matching

[...]

Oren Ben-Kiki¹, Philip Bille², Dany Breslauer³, Leszek Gasieniec⁴, Roberto Grossi⁵, Oren Weimann³ - Show less +2 more•Institutions (5)

Intel¹, Technical University of Denmark², University of Haifa³, University of Liverpool⁴, University of Pisa⁵

01 Mar 2014-Theoretical Computer Science

TL;DR: The Crochemore-Perrin constant-space O(n)-time string-matching algorithm is extended to run in optimal O( n/@a) time and even in real-time, achieving a factor @a speedup over traditional algorithms that examine each character individually.

...read moreread less

14 citations

Proceedings Article•DOI•

Extended approximate string matching algorithms to detect name aliases

[...]

Muniba Shaikh¹, Nasrullah Memon¹, Uffe Kock Wiil¹•Institutions (1)

University of Southern Denmark¹

10 Jul 2011

TL;DR: An extension to widely used ASM algorithms is proposed to detect the name aliases that generate as a result of transliteration and the experimental evaluation shows that proposed extension increases the accuracy of the basic algorithms to a considerable level.

...read moreread less

Abstract: This paper focuses on the problem of alias detection based on orthographic variations of Arabic names. Alias detection is the process to identify different variants of the same name. To detect aliases based on orthographic variations, the approximate string matching (ASM) algorithms are widely used that measure the similarities between two strings (i.e., the name and alias). ASM algorithms work well to detect various type of orthographic variations but still there is a need to develop techniques to detect correct aliases of Arabic names that occur due to the translation of Arabic names into English. An extension to widely used ASM algorithms is proposed to detect the name aliases that generate as a result of transliteration. This paper aims to improve the accuracy of the basic ASM algorithms in order to detect correct aliases. The experimental evaluation shows that proposed extension increases the accuracy of the basic algorithms to a considerable level.

...read moreread less

14 citations

Book•

Computer algorithms : string pattern matching strategies

[...]

順一青江

01 Jan 1994

14 citations

Journal Article•DOI•

Harry: a tool for measuring string similarity

[...]

Konrad Rieck¹, Christian Wressnegger¹•Institutions (1)

University of Göttingen¹

01 Jan 2016-Journal of Machine Learning Research

TL;DR: Harry is a small tool specifically designed for measuring the similarity of strings and implements over 20 similarity measures, including common string distances and string kernels, such as the Levenshtein distance and the Subsequence kernel.

...read moreread less

Abstract: Comparing strings and assessing their similarity is a basic operation in many application domains of machine learning, such as in information retrieval, natural language processing and bioinformatics. The practitioner can choose from a large variety of available similarity measures for this task, each emphasizing different aspects of the string data. In this article, we present Harry, a small tool specifically designed for measuring the similarity of strings. Harry implements over 20 similarity measures, including common string distances and string kernels, such as the Levenshtein distance and the Subsequence kernel. The tool has been designed with efficiency in mind and allows for multi-threaded as well as distributed computing, enabling the analysis of large data sets of strings. Harry supports common data formats and thus can interface with analysis environments, such as Matlab, Pylab and Weka.

...read moreread less

14 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
…
119
120
121
122
123
124
125
…
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

1,942

Papers

64,998

Citations

No. of papers in the topic in previous years
Year	Papers
2023	8
2022	30
2021	32
2020	30
2019	48
2018	39

Approximate string matching

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics