Topic

Latent Dirichlet allocation

About: Latent Dirichlet allocation is a research topic. Over the lifetime, 5,351 publications have been published within this topic, receiving 212,555 citations. The topic is also known as: LDA.


Papers
Proceedings Article
01 Jan 2013
TL;DR: This paper uses Latent Dirichlet Allocation (LDA) to generate a topic distribution over each event and user, and draws on linked data sources to collect contextual information related to events and users and build enhanced profiles for them.
Abstract: In recent years, social networking services have gained phenomenal popularity. They allow us to explore the world and share our findings in a convenient way. Events are a critical component of social networks: a user can create, share, or join different events in their social circle. In this paper, we investigate the problem of event recommendation. We propose recommendation methods based on the similarity between an event’s content and a user’s interests in terms of topics. Specifically, we use Latent Dirichlet Allocation (LDA) to generate a topic distribution over each event and user. We also consider friend relationships and attendance history to increase recommendation accuracy. Moreover, we use linked data as our data source to collect contextual information related to events and users and to build enhanced profiles for them. As a reliable resource, linked data is used to find structured knowledge and linkages among different knowledge sources. Finally, we conduct comprehensive experiments on various datasets from both the academic community and popular social networking services.
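The core recommendation step lends itself to a short illustration. Below is a minimal sketch, not the authors' implementation: hypothetical event texts and a user profile are vectorized, scikit-learn's LatentDirichletAllocation infers per-document topic distributions, and events are ranked by cosine similarity to the user's distribution. Friend relationships, attendance history, and the linked-data profile enrichment are omitted.

```python
# Minimal sketch of topic-based event recommendation (illustrative data only).
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.metrics.pairwise import cosine_similarity

events = [
    "open source hackathon on machine learning and data mining",
    "jazz concert downtown featuring local bands",
    "workshop on Bayesian inference and topic models",
]
# In the paper's setting, the user profile would aggregate attended events
# and linked-data context; here it is a single hypothetical text.
user_profile = "talk on probabilistic topic models and text mining"

vectorizer = CountVectorizer(stop_words="english")
X = vectorizer.fit_transform(events + [user_profile])

lda = LatentDirichletAllocation(n_components=2, random_state=0)
theta = lda.fit_transform(X)                # per-document topic distributions

event_topics, user_topics = theta[:-1], theta[-1:]
scores = cosine_similarity(event_topics, user_topics).ravel()
for rank, i in enumerate(scores.argsort()[::-1], start=1):
    print(rank, round(float(scores[i]), 3), events[i])
```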

30 citations

Journal ArticleDOI
TL;DR: A non-parametric hierarchical Bayesian framework is developed for designing a classifier based on a mixture of simple (linear) classifiers; the model is extended to allow simultaneous design of classifiers on multiple data sets, termed multi-task learning.
Abstract: A non-parametric hierarchical Bayesian framework is developed for designing a classifier, based on a mixture of simple (linear) classifiers. Each simple classifier is termed a local "expert", and the number of experts and their construction are manifested via a Dirichlet process formulation. The simple form of the "experts" allows analytical handling of incomplete data. The model is extended to allow simultaneous design of classifiers on multiple data sets, termed multi-task learning, with this also performed non-parametrically via the Dirichlet process. Fast inference is performed using variational Bayesian (VB) analysis, and example results are presented for several data sets. We also perform inference via Gibbs sampling, to which we compare the VB results.
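The paper's mixture-of-linear-experts model with Dirichlet process priors is not available off the shelf, but the non-parametric ingredient can be illustrated. The sketch below uses scikit-learn's BayesianGaussianMixture with a (truncated) Dirichlet-process prior, fit by variational Bayes, to show how the number of active components is inferred from data; it is a loose analogue, not the paper's classifier.

```python
# Loose analogue of the paper's non-parametric idea: a Dirichlet-process
# mixture fit by variational Bayes, where unused components get weight ~0.
import numpy as np
from sklearn.mixture import BayesianGaussianMixture

rng = np.random.default_rng(0)
# Two well-separated clusters; the truncation level (10) exceeds the truth.
X = np.vstack([rng.normal(-3, 1, (100, 2)), rng.normal(3, 1, (100, 2))])

dpmm = BayesianGaussianMixture(
    n_components=10,                                    # truncation level
    weight_concentration_prior_type="dirichlet_process",
    random_state=0,
).fit(X)

# Only a few components keep non-negligible weight: the inferred "experts".
print(np.round(dpmm.weights_, 3))
```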

30 citations

Journal ArticleDOI
TL;DR: A linear-time algorithm is proposed that defines a distributed predictive model for finite-state symbolic sequences representing the traces of the activity of a number of individuals within a group.
Abstract: To provide a parsimonious generative representation of the sequential activity of a number of individuals within a population, there is a necessary tradeoff between individual-specific and global representations. A linear-time algorithm is proposed that defines a distributed predictive model for finite-state symbolic sequences which represent the traces of the activity of a number of individuals within a group. The algorithm is based on a straightforward generalization of latent Dirichlet allocation to time-invariant Markov chains of arbitrary order. The modelling assumption made is that the possibly heterogeneous behavior of individuals may be represented by a relatively small number of simple and common behavioral traits which may interleave randomly according to an individual-specific distribution. The results of an empirical study on three different application domains indicate that this modelling approach provides an efficient, low-complexity, and intuitively interpretable representation scheme, which is reflected in improved prediction performance over comparable models.
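A toy generative sketch of the modelling assumption may help: shared behavioral "traits" are first-order Markov transition matrices, and each individual interleaves them according to an individual-specific, Dirichlet-distributed mixture. All parameters below are made up; the paper's inference procedure is not shown.

```python
# Toy generative sketch: sequences from a mixture of shared Markov "traits".
import numpy as np

rng = np.random.default_rng(0)
n_symbols, n_traits, seq_len = 4, 2, 20

# Shared traits: one row-stochastic transition matrix per trait.
traits = rng.dirichlet(np.ones(n_symbols), size=(n_traits, n_symbols))

def sample_sequence(alpha):
    """Sample one individual's symbol sequence from the trait mixture."""
    mix = rng.dirichlet(alpha)             # individual-specific trait weights
    seq = [int(rng.integers(n_symbols))]
    for _ in range(seq_len - 1):
        z = rng.choice(n_traits, p=mix)    # trait interleaved at this step
        seq.append(int(rng.choice(n_symbols, p=traits[z, seq[-1]])))
    return seq

print(sample_sequence(alpha=np.ones(n_traits)))
```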

30 citations

Journal ArticleDOI
TL;DR: A framework for document-level multi-topic sentiment classification of Email data is developed, and both latent Dirichlet allocation topic modeling and semantic text segmentation are applied to post-process Email documents.
Abstract: Email data has unique characteristics, involving multiple topics, lengthy replies, formal language, high variance in length, high duplication, anomalies, and indirect relationships, that distinguish it from other social media data. In order to better model Email documents and to capture complex sentiment structures in the content, we develop a framework for document-level multi-topic sentiment classification of Email data. Note that large volumes of labeled Email data are rarely publicly available. We introduce an optional data augmentation process to increase the size of datasets with synthetically labeled data, reducing the probability of overfitting and underfitting during the training process. To generate segments with topic embeddings and topic weighting vectors as inputs for our proposed model, we apply both latent Dirichlet allocation topic modeling and semantic text segmentation to post-process Email documents. Empirical results obtained with multiple sets of experiments, including performance comparison against various state-of-the-art algorithms with and without data augmentation and diverse parameter settings, are analyzed to demonstrate the effectiveness of our proposed framework.
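The segment-plus-topic-weights preprocessing can be sketched briefly. In the hypothetical snippet below, paragraph splits stand in for the paper's semantic text segmentation, and scikit-learn's LDA assigns each segment a topic-weight vector; the sentiment classifier, data augmentation, and topic embeddings are omitted.

```python
# Rough sketch of the preprocessing: segment an email, attach topic weights.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation

email = (
    "Thanks for the quarterly report, the revenue numbers look great.\n\n"
    "On another note, last night's server outage was frustrating and the "
    "incident response was far too slow."
)
# Paragraph breaks as a crude stand-in for semantic text segmentation.
segments = [s for s in email.split("\n\n") if s.strip()]

vectorizer = CountVectorizer(stop_words="english")
X = vectorizer.fit_transform(segments)

lda = LatentDirichletAllocation(n_components=2, random_state=0)
topic_weights = lda.fit_transform(X)     # one topic-weight vector per segment

for seg, w in zip(segments, topic_weights):
    print(w.round(2), seg[:50])
```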

30 citations

Proceedings ArticleDOI
01 Dec 2008
TL;DR: A new latent Dirichlet language model (LDLM) is presented for modeling word sequences: a new Bayesian framework is introduced by merging Dirichlet priors to characterize the uncertainty of the latent topics of n-gram events.
Abstract: Latent Dirichlet allocation (LDA) has been successfully applied to document modeling and classification. LDA calculates the document probability under a bag-of-words scheme without considering the order of words, discovering topic structure at the document level, which differs from the concern of word prediction in speech recognition. In this paper, we present a new latent Dirichlet language model (LDLM) for modeling word sequences. A new Bayesian framework is introduced by merging Dirichlet priors to characterize the uncertainty of the latent topics of n-gram events, and a robust topic-based language model is established accordingly. In the experiments, we implement LDLM for continuous speech recognition and obtain better performance than a probabilistic latent semantic analysis (PLSA) based language model.
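The key departure from bag-of-words LDA is that next-word probabilities are conditioned on history while marginalizing latent topics, roughly p(w_t | w_{t-1}) = sum_k p(w_t | w_{t-1}, k) p(k). The toy numbers below only illustrate that marginalization; the paper's Dirichlet priors and Bayesian inference are not reproduced.

```python
# Toy illustration of topic-mixed bigram prediction (made-up probabilities).
import numpy as np

vocab = ["stock", "market", "guitar", "solo"]
# One row-stochastic bigram matrix per latent topic (rows: previous word).
topic_bigrams = np.array([
    [[0.10, 0.70, 0.10, 0.10],        # topic 0: finance-flavored
     [0.60, 0.20, 0.10, 0.10],
     [0.25, 0.25, 0.25, 0.25],
     [0.25, 0.25, 0.25, 0.25]],
    [[0.25, 0.25, 0.25, 0.25],        # topic 1: music-flavored
     [0.25, 0.25, 0.25, 0.25],
     [0.10, 0.10, 0.20, 0.60],
     [0.10, 0.10, 0.70, 0.10]],
])
topic_weights = np.array([0.8, 0.2])  # topic mixture inferred for the text

prev = vocab.index("stock")
p_next = topic_weights @ topic_bigrams[:, prev, :]   # marginalize over topics
print(dict(zip(vocab, p_next.round(3))))
```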

30 citations


Network Information
Related Topics (5)
Cluster analysis: 146.5K papers, 2.9M citations (86% related)
Support vector machine: 73.6K papers, 1.7M citations (86% related)
Deep learning: 79.8K papers, 2.1M citations (85% related)
Feature extraction: 111.8K papers, 2.1M citations (84% related)
Convolutional neural network: 74.7K papers, 2M citations (83% related)
Performance Metrics
No. of papers in the topic in previous years

Year    Papers
2023    323
2022    842
2021    418
2020    429
2019    473
2018    446