A Survey of Text Similarity Approaches
Citations
144 citations
137 citations
Cites background or methods from "A Survey of Text Similarity Approac..."
...Cosine Similarity is a measure of similarity between two vectors obtained from the cosine angle multiplication value of two vectors being compared [3]....
[...]
...Some approach to determine similarity level applied such as cosine similarity [3]....
[...]
116 citations
Cites methods from "A Survey of Text Similarity Approac..."
..., path, lch, wup, jcn (Gomaa and Fahmy, 2013)) were used to calculate the similarity between two words....
[...]
...method (Bos and Markert, 2005) where automatic reasoning tools are used to check the logical representations derived from sentences and (2) machine learning method (Zhao et al., 2013; Gomaa and Fahmy, 2013) where a supervised model is built...
[...]
...Existing work on STS can be divided into 4 categories according to the similarity measures used (Gomaa and Fahmy, 2013): (1) string-based method (Bär et al....
[...]
111 citations
References
226 citations
131 citations
"A Survey of Text Similarity Approac..." refers methods in this paper
...Nine algorithms were explained; HAL, LSA, GLSA, ESA, CL-ESA, PMI-IR, SCO-PMI, NGD and DISCO....
[...]
...Second-order co-occurrence pointwise mutual information (SCO-PMI) [20,21] is a semantic similarity measure using pointwise mutual information to sort lists of important neighbor words of the two target words from a large corpus....
[...]
86 citations
"A Survey of Text Similarity Approac..." refers methods in this paper
...computed by dividing the number of similar n-grams by maximal number of n-grams [9]....
[...]
77 citations
"A Survey of Text Similarity Approac..." refers methods in this paper
...DISCO is a method that computes distributional similarity between words by using a simple context window of size ±3 words for counting co-occurrences....
[...]
...Extracting DIStributionally similar words using COoccurrences (DISCO) [23, 24] Distributional similarity between words assumes that words with similar meaning occur in similar context....
[...]
...DISCO has two main similarity measures DISCO1 and DISCO2; DISCO1 computes the first order similarity between two input words based on their collocation sets....
[...]
...If the most distributionally similar word is required; DISCO returns the second order word vector for the given word....
[...]
...DISCO2 computes the second order similarity between two input words based on their sets of distributionally similar words....
[...]