Latent dirichlet allocation
Citations
329 citations
329 citations
Cites background or methods from "Latent dirichlet allocation"
...LDA is shown using a white dot....
[...]
...Assuming there are K topics Φ = φ1:K , each of which is a categorical distribution over a fixed set of vocabulary, LDA treats each document as a mixture of these topics where the topic proportion xi is inferred from the data....
[...]
...We compare to collaborative topic regression (CTR), a state-of-the-art method for recommending scientific papers [30] combining both LDA and WMF.9 We did not compare with the more recent and scalable collaborative topic Poisson factorization (CTPF) [7] since the resulting performance differences may have been the result of CTPF Poisson likelihood (versus Gaussian likelihood for both ExpoMF and WMF)....
[...]
...One can understand LDA as representing documents in a low-dimensional “topic” space with the topic proportion xi being their coordinates....
[...]
..., word embeddings [17], latent Dirichlet allocation [2]), or the position of venue i obtained by first clustering all the venues in the data set then finding the expected assignment to L clusters for each venue....
[...]
329 citations
Cites background from "Latent dirichlet allocation"
...For text representation, we first obtain the feature vector based on 500 tokens (with stop words removed) and then the LDA model is used to compute the probability of each document under 100 topics....
[...]
...Specifically, many text feature extraction techniques, such as tf-idf and latent Dirichlet allocation (LDA) [2], can be employed to extract the input text features for TextNet....
[...]
...cation (LDA) [2], can be employed to extract the input text features for TextNet....
[...]
...For text features, we first extract the feature vector based on the 300 most frequent tokens (with stop words removed) and then utilize the LDA to compute the probability of each document under 100 topics....
[...]
...For text representation, we first obtain the feature vector based on 25 000 most frequent tokens (with stop words removed) and then use the LDA to compute the probability of each document under 1000 topics....
[...]
329 citations
329 citations
Cites methods from "Latent dirichlet allocation"
...Analytical Approach To empirically examine these data in relationship to the research questions above regarding the influence of corporate funding on textual content, the study employs a combination of social network analysis and a form of large-scale computational text analysis called latent dirichlet allocation (LDA) (30) topic modeling....
[...]
...To empirically examine these data in relationship to the research questions above regarding the influence of corporate funding on textual content, the study employs a combination of social network analysis and a form of large-scale computational text analysis called latent dirichlet allocation (LDA) (30) topic modeling....
[...]
...Building on the standard LDA model (30), Roberts et al....
[...]
...Building on the standard LDA model (30), Roberts et al. (31) explain, importantly, that: As in LDA, each document arises as a mixture over K topics....
[...]
...Thus, there are three critical differences in the STM as compared to the LDA model. . .(1) topics can be correlated; (2) each document has its own prior distribution over topics, defined by covariate X rather than sharing a global mean; and (3) word use within a topic can vary by covariate U....
[...]
References
17,608 citations
16,079 citations
"Latent dirichlet allocation" refers background in this paper
...Finally, Griffiths and Steyvers (2002) have presented a Markov chain Monte Carlo algorithm for LDA....
[...]
...Structures similar to that shown in Figure 1 are often studied in Bayesian statistical modeling, where they are referred to ashierarchical models(Gelman et al., 1995), or more precisely asconditionally independent hierarchical models(Kass and Steffey, 1989)....
[...]
...Structures similar to that shown in Figure 1 are often studied in Bayesian statistical modeling, where they are referred to as hierarchical models (Gelman et al., 1995), or more precisely as conditionally independent hierarchical models (Kass and Steffey, 1989)....
[...]
12,443 citations
"Latent dirichlet allocation" refers methods in this paper
...To address these shortcomings, IR researchers have proposed several other dimensionality reduction techniques, most notably latent semantic indexing (LSI) (Deerwester et al., 1990)....
[...]
...To address these shortcomings, IR researchers have proposed several other dimensionality reduction techniques, most notablylatent semantic indexing (LSI)(Deerwester et al., 1990)....
[...]
12,059 citations
"Latent dirichlet allocation" refers background or methods in this paper
...In the populartf-idf scheme (Salton and McGill, 1983), a basic vocabulary of “words” or “terms” is chosen, and, for each document in the corpus, a count is formed of the number of occurrences of each word....
[...]
...We report results in document modeling, text classification, and collaborative filtering, comparing to a mixture of unigrams model and the probabilistic LSI model....
[...]
7,086 citations