A Survey of Text Similarity Approaches
Citations
144 citations
137 citations
Cites background or methods from "A Survey of Text Similarity Approac..."
...Cosine Similarity is a measure of similarity between two vectors obtained from the cosine angle multiplication value of two vectors being compared [3]....
[...]
...Some approach to determine similarity level applied such as cosine similarity [3]....
[...]
116 citations
Cites methods from "A Survey of Text Similarity Approac..."
..., path, lch, wup, jcn (Gomaa and Fahmy, 2013)) were used to calculate the similarity between two words....
[...]
...method (Bos and Markert, 2005) where automatic reasoning tools are used to check the logical representations derived from sentences and (2) machine learning method (Zhao et al., 2013; Gomaa and Fahmy, 2013) where a supervised model is built...
[...]
...Existing work on STS can be divided into 4 categories according to the similarity measures used (Gomaa and Fahmy, 2013): (1) string-based method (Bär et al....
[...]
111 citations
References
2,285 citations
"A Survey of Text Similarity Approac..." refers methods in this paper
...CL-ESA exploits a document-aligned multilingual reference collection such as Wikipedia to represent a document as a languageindependent concept vector....
[...]
...Nine algorithms were explained; HAL, LSA, GLSA, ESA, CL-ESA, PMI-IR, SCO-PMI, NGD and DISCO....
[...]
...Explicit Semantic Analysis (ESA) [17] is a measure used to compute the semantic relatedness between two arbitrary texts....
[...]
...The cross-language explicit semantic analysis (CLESA) [18] is a multilingual generalization of ESA....
[...]
2,253 citations
"A Survey of Text Similarity Approac..." refers methods in this paper
...There are six measures of semantic similarity; three of them are based on information content: Resnik (res) [29], Lin (lin) [25] and Jiang & Conrath (jcn) [30]....
[...]
2,148 citations
1,784 citations
"A Survey of Text Similarity Approac..." refers methods in this paper
...If both terms always occur together, their NGD is zero, or equivalent to the coefficient between x squared and y squared....
[...]
...Nine algorithms were explained; HAL, LSA, GLSA, ESA, CL-ESA, PMI-IR, SCO-PMI, NGD and DISCO....
[...]
...Normalized Google Distance (NGD) [22] is a semantic similarity measure derived from the number of hits returned by the Google search engine for a given set of keywords....
[...]
1,717 citations