Search or ask a question

Showing papers on "Edit distance published in 1990"

PDF

Open Access

Journal Article•DOI•

An O(NP) sequence comparison algorithm

[...]

Sun Wu¹, Udi Manber¹, Gene Myers¹, Webb Miller²•Institutions (2)

University of Arizona¹, Pennsylvania State University²

01 Sep 1990-Information Processing Letters

TL;DR: This work presents an algorithm for finding a shortest edit distance of A and B whose worst-case running time is O( NP ) and whose expected running time was O( N + PD ).

...read moreread less

124 citations

Journal Article•DOI•

Minimum message length encoding and the comparison of macromolecules

[...]

Lloyd Allison¹, C. N. Yee¹•Institutions (1)

Monash University¹

01 Jan 1990-Bulletin of Mathematical Biology

TL;DR: A comparison of inductive inference known as minimum message length encoding is applied to string comparison in molecular biology and the posterior odds-ratio of two string alignments or of two models of string mutation to be computed.

...read moreread less

33 citations

Journal Article•DOI•

Efficient systolic string matching

[...]

Graham M. Megson¹•Institutions (1)

Universities UK¹

22 Nov 1990-Electronics Letters

TL;DR: Two new string matching heuristics are presented which reduce the hardware requirement and improve the computation speed of the systolic string matcher due to Lipton and Lopresti.

...read moreread less

Abstract: Two new string matching heuristics are presented which reduce the hardware requirement and improve the computation speed of the systolic string matcher due to Lipton and Lopresti (see 1st International Workshop on Systolic Arrays, Oxford, p.181-91, Adam-Hilger, 1987). The new array requires A=m/2+n/2-1 basic cells, T=m/2+n/2-1+max (m,n) steps to match strings of size n and m, respectively, and has efficiency e=1 (100%). A measure of the heuristic effectiveness compared with the minimum edit distance is also given.

...read moreread less

9 citations

Journal Article•DOI•

Response markup with an edit distance algorithm: a technique for providing learners with feedback on misspellings

[...]

John C. Nesbit¹, Kazuhiko Nakayama¹•Institutions (1)

University of Tsukuba¹

01 May 1990-Computer Education

TL;DR: The instructional value of text markup is considered and how to extract markup information from the matrix normally generated in the calculation of edit distance is shown.

...read moreread less

Abstract: Instructional systems that accept text responses entered by the learner must be capable of dealing with the inevitable occurrence of misspellings. A widely known procedure which finds the edit distance between two strings, and has been found effective in recognizing misspellings, can be easily extended to also annotate the response to give the learner feedback on the precise nature of the error. This paper considers the instructional value of text markup and shows how to extract markup information from the matrix normally generated in the calculation of edit distance. Included is a listing of a short Pascal program that illustrates the main concepts discussed.

...read moreread less

5 citations

Journal Article•DOI•

Sequence matching with binary codes

[...]

J. H. Bradford¹•Institutions (1)

Brock University¹

24 Apr 1990-Information Processing Letters

TL;DR: An algorithm is introduced that encodes pairs of strings as binary numbers such that the Hamming distance between the binary codewords is equal to the Levenshtein Distance between the original strings.

...read moreread less

4 citations

Journal Article•DOI•

Sequence Comparison Applied to Correction and Markup of Multi-Word Responses

[...]

John C. Nesbit, Kazuhiko Nakayama

01 Jan 1990-the CALICO Journal

TL;DR: An extension is proposed which attends to word boundaries and thereby recommends corrections that appear more reasonable to users, and a much faster but nonadmissible version of other applications is presented, thereby bringing the technique within range of current microcomputers.

...read moreread less

Abstract: In some instructional situations, such as foreign language dictation, the degree of correctness of a student's text response can be determined without reference to grammar and semantics by comparison with a target string provided by a course author. The standard sequence comparison procedure, which assesses the distance between two strings in terms of edit costs, makes demands on machine time proportional to the product of the string lengths. This characteristic renders it impractical for real-time correction of multi-word responses on current instructional computer systems. We present a much faster but nonadmissible version of other applications, thereby bringing the technique within range of current microcomputers. The usual method for generating markup for single word responses does not generalize well to multi-word responses because it fails to recognize word boundaries, and will sometimes suggest edits that seem unnatural to users. We propose an extension which attends to word boundaries and thereby recommends corrections that appear more reasonable.

...read moreread less

4 citations