Latent dirichlet allocation
Citations
178 citations
178 citations
Cites methods from "Latent dirichlet allocation"
...Since it allows to control the number of clusters produced and since it has been successfully applied to several tasks in the past, we decided to employ Latent Dirichlet Allocation [4] for the clustering....
[...]
...Since it allows to control the number of clusters produced and since it has been successfully applied to several tasks in the past, we decided to employ Latent Dirichlet Allocation [4] for the clustering....
[...]
178 citations
Cites background or methods from "Latent dirichlet allocation"
...(2) A number of approxiate inference algorithms for mixed membership models have appeared in recent years, including mean-field variational methods (Blei et al., 2003; Teh et al., 2007), expectation propagation (Minka and Lafferty, 2002), and Monte Carlo Markov chain sampling (MCMC) (Erosheva and…...
[...]
...Unfortunately, a closed form solution for the approximate maximum likelihood estimate of ~α does not exist Blei et al. (2003)....
[...]
...Mixed membership models, such as latent Dirichlet allocation [1], have emerged in recent years as a flexible modeling tool for data where the single group assumption is violated by the heterogeneity within a unit of analysis—e....
[...]
...Mixed membership models, such as latent Dirichlet allocation (Blei et al., 2003), have emerged in recent years as a flexible modeling tool for data where the single cluster assumption is violated by the heterogeneity within of a data point....
[...]
...They have been successfully applied in many domains, such as document analysis (Minka and Lafferty, 2002; Blei et al., 2003; Buntine and Jakulin, 2006), surveys (Berkman et al., 1989; Erosheva, 2002), image processing (Li and Perona, 2005), transcriptional regulation (Airoldi et al., 2006b), and…...
[...]
178 citations
Cites background or methods from "Latent dirichlet allocation"
...In this paper, we extend Latent Dirichlet Allocation (LDA) [4] to model the relation among a bug report and its corresponding buggy source files....
[...]
...In LDA, a document is considered to be generated by a “machine” which is driven via parameters by the hidden factors called topics, and its words are taken from some vocabulary [4]....
[...]
...Let us describe the B-component in our BugScout model, which is extended from LDA [4]....
[...]
...S-component in BugScout is adopted from LDA [4]....
[...]
...5: Illustration of LDA [4]...
[...]
178 citations
Cites background or methods from "Latent dirichlet allocation"
...Finally, for all LDA algorithms we used α = 0.1, π = 1/K. 2We actually set these values using a fixed but somewhat elaborate scheme which is the reason they ended up different for each dataset....
[...]
...We found that the CV-HDP performs significantly better than the CV-LDA on both test-set likelihood and the variational bound....
[...]
...Our approach is an extension of the collapsed VB approximation for LDA (CV-LDA) presented in [7], and represents the first VB approximation to the HDP1....
[...]
...The number of topics used in CV-HDP was truncated at 40, 80, and 120 topics, corresponding to the number of topics used in the LDA algorithms....
[...]
...For LDA and its cousins, there are alternatives based on variational Bayesian (VB) approximations [3] and on expectation propagation (EP) [5]....
[...]
References
17,608 citations
16,079 citations
"Latent dirichlet allocation" refers background in this paper
...Finally, Griffiths and Steyvers (2002) have presented a Markov chain Monte Carlo algorithm for LDA....
[...]
...Structures similar to that shown in Figure 1 are often studied in Bayesian statistical modeling, where they are referred to ashierarchical models(Gelman et al., 1995), or more precisely asconditionally independent hierarchical models(Kass and Steffey, 1989)....
[...]
...Structures similar to that shown in Figure 1 are often studied in Bayesian statistical modeling, where they are referred to as hierarchical models (Gelman et al., 1995), or more precisely as conditionally independent hierarchical models (Kass and Steffey, 1989)....
[...]
12,443 citations
"Latent dirichlet allocation" refers methods in this paper
...To address these shortcomings, IR researchers have proposed several other dimensionality reduction techniques, most notably latent semantic indexing (LSI) (Deerwester et al., 1990)....
[...]
...To address these shortcomings, IR researchers have proposed several other dimensionality reduction techniques, most notablylatent semantic indexing (LSI)(Deerwester et al., 1990)....
[...]
12,059 citations
"Latent dirichlet allocation" refers background or methods in this paper
...In the populartf-idf scheme (Salton and McGill, 1983), a basic vocabulary of “words” or “terms” is chosen, and, for each document in the corpus, a count is formed of the number of occurrences of each word....
[...]
...We report results in document modeling, text classification, and collaborative filtering, comparing to a mixture of unigrams model and the probabilistic LSI model....
[...]
7,086 citations