Home
/
Topics
/
Edit distance

Topic

Edit distance

About: Edit distance is a research topic. Over the lifetime, 2887 publications have been published within this topic receiving 71491 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1987
1986
1985
1984
1983
1981
1980
1976
1975
1974

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Identifying Changed Source Code Lines from Version Repositories

[...]

Gerardo Canfora¹, Luigi Cerulo¹, M. Di Penta¹•Institutions (1)

University of Sannio¹

20 May 2007

TL;DR: This paper shows how the evolution of changes at source code line level can be inferred from CVS repositories, by combining information retrieval techniques and the Levenshtein edit distance.

...read moreread less

Abstract: Observing the evolution of software systems at different levels of granularity has been a key issue for a number of studies, aiming at predicting defects or at studying certain phenomena, such as the presence of clones or of crosscutting concerns. Versioning systems such as CVS and SVN, however, only provide information about lines added or deleted by a contributor: any change is shown as a sequence of additions and deletions. This provides an erroneous estimate of the amount of code changed. This paper shows how the evolution of changes at source code line level can be inferred from CVS repositories, by combining information retrieval techniques and the Levenshtein edit distance. The application of the proposed approach to the ArgoUML case study indicates a high precision and recall.

...read moreread less

102 citations

Patent•

Spelling correction system and method for misspelled input

[...]

Hee-Jun Song¹, Young-Hee Park¹, Hyun Sik Shim¹, Ham Jong Gyu¹, Harksoo Kim¹, Jooho Lee¹, Se Hee Lee¹ - Show less +3 more•Institutions (1)

Samsung¹

07 Apr 2009

TL;DR: In this article, a spelling correction system and method automatically recognizes and corrects misspelled inputs in an electronic device with relatively lower computing power in a learning process, a misspelling correction dictionary is constructed on the basis of a corpus of accepted words, and context-sensitive strings are selected from among all the strings registered in the dictionary Context information about the context sensitive strings is acquired.

...read moreread less

Abstract: A spelling correction system and method automatically recognizes and corrects misspelled inputs in an electronic device with relatively lower computing power In a learning process, a misspelling correction dictionary is constructed on the basis of a corpus of accepted words, and context-sensitive strings are selected from among all the strings registered in the dictionary Context information about the context-sensitive strings is acquired In an applying process, at least one target string is selected from among all the strings in a user's input sentence through the dictionary If the target string is one of the context-sensitive strings, the target string is corrected by use of the context information

...read moreread less

102 citations

Evaluating machine translation output with automatic sentence segmentation.

[...]

Evgeny Matusov¹, Gregor Leusch¹, Oliver Bender¹, Hermann Ney¹•Institutions (1)

RWTH Aachen University¹

01 Jan 2005

TL;DR: A novel automatic sentence segmentation method for evaluating machine translation output with possibly erroneous sentence boundaries that efficiently produces an optimal automatic segmentation of the hypotheses and thus allows application of existing well-established evaluation measures.

...read moreread less

Abstract: This paper presents a novel automatic sentence segmentation method for evaluating machine translation output with possibly erroneous sentence boundaries. The algorithm can process translation hypotheses with segment boundaries which do not correspond to the reference segment boundaries, or a completely unsegmented text stream. Thus, the method is especially useful for evaluating translations of spoken language. The evaluation procedure takes advantage of the edit distance algorithm and is able to handle multiple reference translations. It efficiently produces an optimal automatic segmentation of the hypotheses and thus allows application of existing well-established evaluation measures. Experiments show that the evaluation measures based on the automatically produced segmentation correlate with the human judgement at least as well as the evaluation measures which are based on manual sentence boundaries.

...read moreread less

98 citations

Book Chapter•DOI•

Discovering context: classifying tweets through a semantic transform based on wikipedia

[...]

Yegin Genc¹, Yasuaki Sakamoto¹, Jeffrey V. Nickerson¹•Institutions (1)

Stevens Institute of Technology¹

09 Jul 2011

TL;DR: By mapping messages into a large context, the authors can compute the distances between them, and then classify them, which yields more accurate classification of a set of Twitter messages than alternative techniques using string edit distance and latent semantic analysis.

...read moreread less

Abstract: By mapping messages into a large context, we can compute the distances between them, and then classify them. We test this conjecture on Twitter messages: Messages are mapped onto their most similar Wikipedia pages, and the distances between pages are used as a proxy for the distances between messages. This technique yields more accurate classification of a set of Twitter messages than alternative techniques using string edit distance and latent semantic analysis.

...read moreread less

97 citations

Book Chapter•DOI•

Exact and Approximation Algorithms for the Inversion Distance Between Two Chromosomes

[...]

John Kececioglu¹, David Sankoff²•Institutions (2)

University of California, Davis¹, Université de Montréal²

02 Jun 1993

TL;DR: This work considers the problem of computing the shortest series of reversals that transform one permutation to another, and takes an arbitrary substring of elements and reverses their order.

...read moreread less

Abstract: Motivated by the problem in computational biology of reconstructing the series of chromosome inversions by which one organism evolved from another, we consider the problem of computing the shortest series of reversals that transform one permutation to another. The permutations describe the order of genes on corresponding chromosomes, and a reversal takes an arbitrary substring of elements and reverses their order.

...read moreread less

96 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
…
25
26
27
28
29
30
31
…
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

3,030

Papers

78,281

Citations

No. of papers in the topic in previous years
Year	Papers
2023	39
2022	96
2021	111
2020	149
2019	145
2018	139

Edit distance

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics