Latent dirichlet allocation
Citations
1,471 citations
Cites methods from "Latent dirichlet allocation"
...Thus far, variational methods have mainly been explored in the parametric setting, in particular within the formalism of the exponential family (Attias, 2000; Ghahramani and Beal, 2001; Blei et al., 2003 )....
[...]
1,440 citations
Cites background or methods from "Latent dirichlet allocation"
...A variational inference approach has been proposed in (Blei et al. 2003)....
[...]
...This representation 1Alternatively, some researchers refer to this representation as “bag of keypoints”, see for example (Dance et al. 2004). is a heritage from the text analysis domain, for which the latent topic models were first developed (Hofmann 1999; Blei et al. 2003)....
[...]
...This figure is reproduced from (Blei et al. 2003) Suppose we have a set of M(j = 1, . . . ,M) video sequences containing spatial-temporal words from a vocabulary of size V (i = 1, . . . , V )....
[...]
...LDA (Blei et al. 2003) addresses these weaknesses....
[...]
...6 (a) Latent Dirichlet Allocation (LDA) graphical model (Blei et al. 2003)....
[...]
1,435 citations
Cites background or methods from "Latent dirichlet allocation"
...The LDA generative model assumes that documents (i.e. Facebook messages) contain a combination of topics, and that topics are a distribution of words; since the words in a document are known, the latent variable of topics can be estimated through Gibbs sampling [74]....
[...]
...To use topics as features, we find the probability of a subject’s use of each topic: p(topic j subject)~ X word[topic p(topic j word) p(word j subject) where p(word j subject) is the normalized word use by that subject and p(topic j word) is the probability of the topic given the word (a value provided from the LDA procedure)....
[...]
...We use an implementation of the LDA algorithm provided by the Mallet package [75], adjusting one parameter (alpha~0:30) to favor fewer topics per document, since individual Facebook status updates tend to contain fewer topics than the typical documents (newspaper or encyclopedia articles) to which LDA is applied....
[...]
...The second type of linguistic feature, topics, consists of word clusters created using Latent Dirichlet Allocation (LDA) [72,73]....
[...]
...Language use features include: (a) words and phrases: a sequence of 1 to 3 words found using an emoticon-aware tokenizer and a collocation filter (24,530 features) (b) topics: automatically derived groups of words for a single topic found using the Latent Dirichlet Allocation technique [72,75] (500 features)....
[...]
1,429 citations
1,405 citations
Cites background or methods from "Latent dirichlet allocation"
...Allocation (LDA) models (Blei et al., 2003; Griffiths et al., 2007), where parameters are set to optimize the joint probability distribution of words and documents....
[...]
...…better inputs in a phrase similarity task, whereas the two representations are comparable in a paraphrase classification experiment.3 Allocation (LDA) models (Blei et al., 2003; Griffiths et al., 2007), where parameters are set to optimize the joint probability distribution of words and documents....
[...]
References
17,608 citations
16,079 citations
"Latent dirichlet allocation" refers background in this paper
...Finally, Griffiths and Steyvers (2002) have presented a Markov chain Monte Carlo algorithm for LDA....
[...]
...Structures similar to that shown in Figure 1 are often studied in Bayesian statistical modeling, where they are referred to ashierarchical models(Gelman et al., 1995), or more precisely asconditionally independent hierarchical models(Kass and Steffey, 1989)....
[...]
...Structures similar to that shown in Figure 1 are often studied in Bayesian statistical modeling, where they are referred to as hierarchical models (Gelman et al., 1995), or more precisely as conditionally independent hierarchical models (Kass and Steffey, 1989)....
[...]
12,443 citations
"Latent dirichlet allocation" refers methods in this paper
...To address these shortcomings, IR researchers have proposed several other dimensionality reduction techniques, most notably latent semantic indexing (LSI) (Deerwester et al., 1990)....
[...]
...To address these shortcomings, IR researchers have proposed several other dimensionality reduction techniques, most notablylatent semantic indexing (LSI)(Deerwester et al., 1990)....
[...]
12,059 citations
"Latent dirichlet allocation" refers background or methods in this paper
...In the populartf-idf scheme (Salton and McGill, 1983), a basic vocabulary of “words” or “terms” is chosen, and, for each document in the corpus, a count is formed of the number of occurrences of each word....
[...]
...We report results in document modeling, text classification, and collaborative filtering, comparing to a mixture of unigrams model and the probabilistic LSI model....
[...]
7,086 citations