Latent dirichlet allocation
Citations
161 citations
161 citations
161 citations
161 citations
Cites background or methods from "Latent dirichlet allocation"
...We cannot possibly study all topic modeling approaches, so we select a few that are representative: the well-known Mixture of Unigrams (MU) model [1]; Latent Dirichlet Allocation (LDA) [2], a more complicated and computationally expensive topic model; and Pachinko Allocation Model (PAM) [3], a recently proposed new topic model which not only models the relations between words and identifies topics but also models the organization and co-occurrences of the topics themselves....
[...]
...Latent Dirichlet Allocation (LDA) [2] is a widely-used topic model which also assumes that there are multiple topics in the corpus but that a document can have multiple topics....
[...]
...We can automatically infer a set of topics either by simple clustering[1] or methods popularized by the machine learning community [2,3,4]....
[...]
...A natural question is whether these topics are useful to help retrieve documents on the same topic as a query – intuitively relevant documents have topic distributions that are likely to have generated the set of words associated with the query[2,5]....
[...]
161 citations
Cites methods from "Latent dirichlet allocation"
...Therefore, recently, a more widely used model, latent dirichlet allocation (LDA)([80]), was proposed to overcome this issue by allowing multiple latent topics with a priori Dirichlet distribution, a conjugate prior of multinomial distribution, assigned to each single document....
[...]
...Comparatively, LDA is more widely used for tag recommendation....
[...]
...Xi et al.[84] employed LDA for eliciting topics from the words in documents, as well as the co-occurrence tags, where words and tags form independent vocabulary spaces, and then recommended tags for target documents....
[...]
...Furthermore, Li et al.[90] combined LDA and GN community detection algorithm[91-92] to observe the topic distributions of communities, as well as community evolving over time in social tagging systems....
[...]
...Bundschus et al.[87] integrated both user information and tag information into LDA algorithm....
[...]
References
17,608 citations
16,079 citations
"Latent dirichlet allocation" refers background in this paper
...Finally, Griffiths and Steyvers (2002) have presented a Markov chain Monte Carlo algorithm for LDA....
[...]
...Structures similar to that shown in Figure 1 are often studied in Bayesian statistical modeling, where they are referred to ashierarchical models(Gelman et al., 1995), or more precisely asconditionally independent hierarchical models(Kass and Steffey, 1989)....
[...]
...Structures similar to that shown in Figure 1 are often studied in Bayesian statistical modeling, where they are referred to as hierarchical models (Gelman et al., 1995), or more precisely as conditionally independent hierarchical models (Kass and Steffey, 1989)....
[...]
12,443 citations
"Latent dirichlet allocation" refers methods in this paper
...To address these shortcomings, IR researchers have proposed several other dimensionality reduction techniques, most notably latent semantic indexing (LSI) (Deerwester et al., 1990)....
[...]
...To address these shortcomings, IR researchers have proposed several other dimensionality reduction techniques, most notablylatent semantic indexing (LSI)(Deerwester et al., 1990)....
[...]
12,059 citations
"Latent dirichlet allocation" refers background or methods in this paper
...In the populartf-idf scheme (Salton and McGill, 1983), a basic vocabulary of “words” or “terms” is chosen, and, for each document in the corpus, a count is formed of the number of occurrences of each word....
[...]
...We report results in document modeling, text classification, and collaborative filtering, comparing to a mixture of unigrams model and the probabilistic LSI model....
[...]
7,086 citations