A Survey of Text Similarity Approaches
Citations
11 citations
Cites methods from "A Survey of Text Similarity Approac..."
...For this reason, Needleman-Wunsch [11] [12]–[14] was applied in order to avoid delays in the computational time....
[...]
11 citations
10 citations
10 citations
Cites background from "A Survey of Text Similarity Approac..."
...OpenRefine [3] does not learn syntactic profiles, but it allows clustering of strings using character-based similarity measures [17]....
[...]
...We observed that character-based measures [17] show poor AUC, and are not indicative of syntactic similarity....
[...]
...However, existing similarity measures [17] do not capture the desired syntactic dissimilarity over strings....
[...]
10 citations
Cites methods from "A Survey of Text Similarity Approac..."
...Among them, corpus-based similarity calculations can be divided into three methods based on different models: bag-of-words (BOW) models, neural networks and search engines (Gomaa & Fahmy, 2013; Pradhan et al., 2015)....
[...]
References
13,049 citations
11,844 citations
10,500 citations
"A Survey of Text Similarity Approac..." refers background in this paper
...Dice’s coefficient is defined as twice the number of common terms in the compared strings divided by the total number of terms in both strings [11]....
[...]
10,262 citations
"A Survey of Text Similarity Approac..." refers background in this paper
...It is useful for dissimilar sequences that are suspected to contain regions of similarity or similar sequence motifs within their larger sequence context [8]....
[...]
6,014 citations
"A Survey of Text Similarity Approac..." refers methods in this paper
...The GLSA approach can combine any kind of similarity measure on the space of terms with any suitable method of dimensionality reduction....
[...]
...LSA assumes that words that are close in meaning will occur in similar pieces of text....
[...]
...Latent Semantic Analysis (LSA) [15] is the most popular technique of Corpus-Based similarity....
[...]
...Generalized Latent Semantic Analysis (GLSA) [16] is a framework for computing semantically motivated term and document vectors....
[...]
...Mining the web for synonyms: PMIIR versus LSA on TOEFL....
[...]