Latent dirichlet allocation
Citations
257 citations
257 citations
Additional excerpts
...VI discusses future work....
[...]
256 citations
Cites methods from "Latent dirichlet allocation"
...models from this tweet set via latent dirichlet allocation (LDA) (Blei et al., 2003), a well-known generative topic modeling algorithm....
[...]
...With a set of training tweets as obtained in Algorithm 1, we adopt the latent Dirichlet allocation (LDA) (Blei et al., 2003), a renowned generative probabilistic model for topic discovery, to build the composite topical features....
[...]
256 citations
Cites background from "Latent dirichlet allocation"
...Related work on document content characterization [1, 7, 11, 21] introduces a set of probabilistic models to simulate the generation of a document....
[...]
...3(a) models documents as generated by a mixture of topics [1]....
[...]
...topic [7, 1]), are modeled as variables in the generative Bayesian network and have been shown to work well for document content characterization....
[...]
255 citations
Cites background or methods from "Latent dirichlet allocation"
...Topic modelling is a popular statistical method for (soft) clustering documents (Blei et al., 2003; Deerwester et al., 1990; Hofmann, 1999)....
[...]
...Latent Dirichlet Allocation (LDA) (Blei et al., 2003), one type of topic model, has been widely used in NLP and applied to a range of tasks including word sense disambiguation (Boyd-Graber et al....
[...]
...A total of 300 topics are 1http://dumps.wikimedia.org/enwiki/20120104/ 2We also experimented with different lengths of context windows 3The data set can be downloaded from http://staffwww.dcs.shef.ac.uk/people/N.Aletras/ resources/TopicCoherence300.tar.gz generated by running LDA over three different document collections: • NYT: 47,229 New York Times news articles published between May and December 2010 from the GigaWord corpus....
[...]
...Latent Dirichlet Allocation (LDA) (Blei et al., 2003), one type of topic model, has been widely used in NLP and applied to a range of tasks including word sense disambiguation (Boyd-Graber et al., 2007), multi-document summarisation (Haghighi and Vanderwende, 2009) and generation of comparable…...
[...]
...Latent Dirichlet Allocation (LDA) (Blei et al., 2003), one type of topic model, has been widely used in NLP and applied to a range of tasks including word sense disambiguation (Boyd-Graber et al., 2007), multi-document summarisation (Haghighi and Vanderwende, 2009) and generation of comparable corpora (Preiss, 2012)....
[...]
References
17,608 citations
16,079 citations
"Latent dirichlet allocation" refers background in this paper
...Finally, Griffiths and Steyvers (2002) have presented a Markov chain Monte Carlo algorithm for LDA....
[...]
...Structures similar to that shown in Figure 1 are often studied in Bayesian statistical modeling, where they are referred to ashierarchical models(Gelman et al., 1995), or more precisely asconditionally independent hierarchical models(Kass and Steffey, 1989)....
[...]
...Structures similar to that shown in Figure 1 are often studied in Bayesian statistical modeling, where they are referred to as hierarchical models (Gelman et al., 1995), or more precisely as conditionally independent hierarchical models (Kass and Steffey, 1989)....
[...]
12,443 citations
"Latent dirichlet allocation" refers methods in this paper
...To address these shortcomings, IR researchers have proposed several other dimensionality reduction techniques, most notably latent semantic indexing (LSI) (Deerwester et al., 1990)....
[...]
...To address these shortcomings, IR researchers have proposed several other dimensionality reduction techniques, most notablylatent semantic indexing (LSI)(Deerwester et al., 1990)....
[...]
12,059 citations
"Latent dirichlet allocation" refers background or methods in this paper
...In the populartf-idf scheme (Salton and McGill, 1983), a basic vocabulary of “words” or “terms” is chosen, and, for each document in the corpus, a count is formed of the number of occurrences of each word....
[...]
...We report results in document modeling, text classification, and collaborative filtering, comparing to a mixture of unigrams model and the probabilistic LSI model....
[...]
7,086 citations