scispace - formally typeset
Search or ask a question
Topic

Approximate string matching

About: Approximate string matching is a research topic. Over the lifetime, 1903 publications have been published within this topic receiving 62352 citations. The topic is also known as: fuzzy string-searching algorithm & fuzzy string-matching algorithm.


Papers
More filters
Book ChapterDOI
21 Oct 2012
TL;DR: It is shown there is a linear number of maximal-exponent repeats in an overlap-free string and the algorithm can locate all of them in linear time.
Abstract: The exponent of a string is the quotient of the string's length over the string's smallest period. The exponent and the period of a string can be computed in time proportional to the string's length. We design an algorithm to compute the maximal exponent of factors of an overlap-free string. Our algorithm runs in linear-time on a fixed-size alphabet, while a naive solution of the question would run in cubic time. The solution for non overlap-free strings derives from algorithms to compute all maximal repetitions, also called runs, occurring in the string. We show there is a linear number of maximal-exponent repeats in an overlap-free string. The algorithm can locate all of them in linear time.

24 citations

Journal ArticleDOI
01 Nov 1991
TL;DR: The novel approach formalizes an entropy weights activation of prototypes for fuzzy partial matching and brings another dimension to the fuzzy matching criterion that provides a measure of uncertainty in the partial matching to each prototype.
Abstract: A characteristic approach of approximate reasoning is the partial matching of observations to prototypes. This analysis is cast in the framework of fuzzy set theory and brings another dimension to the fuzzy matching criterion; this dimension is the measure of uncertainty through the concept of subjective entropy. While a similarity measure, in the matching process, activates relevant prototypes, the entropy formalism derived provides a measure of uncertainty in the partial matching to each prototype. The novel approach formalizes an entropy weights activation of prototypes for fuzzy partial matching. A methodology is developed for matching of observation to a set of prototypes making use of a suitable aggregation done with a framework of fuzzy integrals. A method of dealing with compound hypothesis is also developed. >

24 citations

Proceedings ArticleDOI
20 Oct 1993
TL;DR: A new algorithm for string edit distance computation that needs time that is only linear in the length of one of the two strings to be matched, provided that the other string has undergone some preprocessing in an off-line phase is proposed.
Abstract: A new algorithm for string edit distance computation is proposed. It needs time that is only linear in the length of one of the two strings to be matched, provided that the other string has undergone some preprocessing in an off-line phase. The algorithm can be extended to matching a word against a dictionary of any size. In this case the time complexity is independent of the length of the dictionary words, and the number of entries in the dictionary. >

24 citations

01 Jan 1992
TL;DR: This work considers several problems from a theoretical perspective and provides efficient algorithms and lower bounds for these problems in sequential and parallel models of computation for the string matching problem.
Abstract: Problems involving strings arise in many areas of computer science and have numerous practical applications. We consider several problems from a theoretical perspective and provide efficient algorithms and lower bounds for these problems in sequential and parallel models of computation. In the sequential setting, we present new algorithms for the string matching problem improving the previous bounds on the number of comparisons performed by such algorithms. In parallel computation, we present tight algorithms and lower bounds for the string matching problem, for finding the periods of a string, for detecting squares and for finding initial palindromes.

24 citations

Journal ArticleDOI
TL;DR: This paper proposed a bit-parallel multiple approximate string match algorithm, and developed a GPU implementation which achieved speedups about 28 relative to a single-thread CPU code.

24 citations


Network Information
Related Topics (5)
Server
79.5K papers, 1.4M citations
81% related
Cluster analysis
146.5K papers, 2.9M citations
80% related
Scheduling (computing)
78.6K papers, 1.3M citations
79% related
Network packet
159.7K papers, 2.2M citations
78% related
Optimization problem
96.4K papers, 2.1M citations
78% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20238
202230
202132
202030
201948
201839