# An information-theoretic perspective of tf—idf measures

##### Citations

1,366 citations

### Cites background from "An information-theoretic perspectiv..."

...All these are closely related (see e.g. Aizawa, 2003)....

[...]

...Aizawa In this paper (Aizawa, 2003), the author juggles with three different event spaces: the space of documents, the space of query terms and the space of terms in document texts....

[...]

...The interpretation of IDF as a function of a probability suggests associating it with (one of) the probabilistic approaches to information retrieval....

[...]

1,171 citations

752 citations

### Cites background or result from "An information-theoretic perspectiv..."

...IDF is still a subject of current research [Joachims 1997; Amati and van Rijsbergen 1998; Hiemstra 1998; Papineni 2001; Aizawa 2003; Roelleke 2003; Lee 2007] where Robertson [2004] and Sp¨ arck Jones [2004] responded to recent developments on interpreting IDF....

[...]

...a subject of current research [Joachims 1997; Amati and van Rijsbergen 1998; Hiemstra 1998; Papineni 2001; Aizawa 2003; Roelleke 2003; Lee 2007] where Robertson [2004] and Spärck Jones [2004] responded to recent developments on interpreting IDF....

[...]

472 citations

422 citations

##### References

45,034 citations

### "An information-theoretic perspectiv..." refers background or methods in this paper

...On the other hand, idf can be interpreted as ‘the amount of information’ in conventional information theory (Brookes, 1972; Wong & Yao, 1992), given as the log of the inverse probability (Cover & Thomas, 1991)....

[...]

...We first introduce some of the basic formulae of information theory (Cover & Thomas, 1991) that we use in our theoretical development....

[...]

12,059 citations

### "An information-theoretic perspectiv..." refers background in this paper

...Measures of representation, on the other hand, are generally associated with the vector-space retrieval model in information retrieval (Salton & McGill, 1983)....

[...]

...PII: S0306-4573(02)00021-3 (Dennis, 1964; Salton & McGill, 1983), idf (Sparck-Jones, 1972), relevance weighting methods (Robertson & Sparck-Jones, 1976) and tf–idf and its variations (Salton & Buckley, 1988)....

[...]

...Examples of such measures include pairwise mutual information (Church & Hanks, 1990), signalto-noise ratio (Dennis, 1964; Salton & McGill, 1983) and idf (Sparck-Jones, 1972)....

[...]

9,923 citations

9,460 citations

9,295 citations