Latent dirichlet allocation
Citations
159 citations
159 citations
159 citations
Cites methods from "Latent dirichlet allocation"
...This weighted frequency is inspired by tf-idf - a numerical statistic widely used in information retrieval and text mining [3] to measure the importance of a term/word t to a document in a corpus....
[...]
158 citations
Cites background or methods or result from "Latent dirichlet allocation"
...Therefore, a number of algorithms are available to get approximate estimates of model parameters ranging from variational EM (Blei et al., 2003) to expectation propagation (Minka & Lafferty, 2002) and Gibbs sampling....
[...]
...We also tried two metrics suggested by (Cao et al., 2009) and (Arun et al., 2010) and compared them with perplexity or held-out likelihood (Blei et al., 2003)....
[...]
..., 2010) and compared them with perplexity or held-out likelihood (Blei et al., 2003)....
[...]
...The results showed that topic models outperform typical representations in a supervised setting when the proportion of training data is very small (Lu et al., 2011) (Blei et al., 2003)....
[...]
...As PLSA is based on the maximum likelihood estimation for given documents and is, therefore, susceptible to overfitting, Latent Dirichlet Allocation (LDA) was proposed as an improved topic model, which introduces Dirichlet prior and provides a200 fully generative model (Blei et al., 2003)....
[...]
158 citations
Cites methods from "Latent dirichlet allocation"
...We attempt to alleviate this problem by using Latent Dirichlet Allocation (LDA), a generative probabilistic model mostly used for topic modelling [Blei et al. 2003], built upon Latent Semantic Indexing (LSI) and probabilistic LSI....
[...]
References
17,608 citations
16,079 citations
"Latent dirichlet allocation" refers background in this paper
...Finally, Griffiths and Steyvers (2002) have presented a Markov chain Monte Carlo algorithm for LDA....
[...]
...Structures similar to that shown in Figure 1 are often studied in Bayesian statistical modeling, where they are referred to ashierarchical models(Gelman et al., 1995), or more precisely asconditionally independent hierarchical models(Kass and Steffey, 1989)....
[...]
...Structures similar to that shown in Figure 1 are often studied in Bayesian statistical modeling, where they are referred to as hierarchical models (Gelman et al., 1995), or more precisely as conditionally independent hierarchical models (Kass and Steffey, 1989)....
[...]
12,443 citations
"Latent dirichlet allocation" refers methods in this paper
...To address these shortcomings, IR researchers have proposed several other dimensionality reduction techniques, most notably latent semantic indexing (LSI) (Deerwester et al., 1990)....
[...]
...To address these shortcomings, IR researchers have proposed several other dimensionality reduction techniques, most notablylatent semantic indexing (LSI)(Deerwester et al., 1990)....
[...]
12,059 citations
"Latent dirichlet allocation" refers background or methods in this paper
...In the populartf-idf scheme (Salton and McGill, 1983), a basic vocabulary of “words” or “terms” is chosen, and, for each document in the corpus, a count is formed of the number of occurrences of each word....
[...]
...We report results in document modeling, text classification, and collaborative filtering, comparing to a mixture of unigrams model and the probabilistic LSI model....
[...]
7,086 citations