Topic

Latent Dirichlet allocation

About: Latent Dirichlet allocation is a research topic. Over the lifetime, 5351 publications have been published within this topic receiving 212555 citations. The topic is also known as: LDA.

...read moreread less

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Patent•

Information Relation Generation

[...]

Dingcheng Li¹, Swapna Somasundaran¹, Amit Chakraborty¹•Institutions (1)

Siemens¹

28 Sep 2011

TL;DR: In this paper, Latent Dirichlet Allocation (LDA) is used to determine the relationship between named entities, which is performed on text associated with the name entities rather than on an entire document.

...read moreread less

Abstract: For generating a word space, manual thresholding of word scores is used. Rather than requiring the user to select the threshold arbitrarily or review each word, the user is iteratively requested to indicate the relevance of a given word. Words with greater or lesser scores are labeled in the same way depending upon the response. For determining the relationship between named entities, Latent Dirichlet Allocation (LDA) is performed on text associated with the name entities rather than on an entire document. LDA for relationship mining may include context information and/or supervised learning.

...read moreread less

39 citations

Posted Content•

Catching Fire via "Likes": Inferring Topic Preferences of Trump Followers on Twitter

[...]

Yu Wang¹, Jiebo Luo¹, Richard G. Niemi¹, Yuncheng Li¹, Tianran Hu¹ - Show less +1 more•Institutions (1)

University of Rochester¹

09 Mar 2016-arXiv: Social and Information Networks

TL;DR: This paper proposed a framework to infer the topic preferences of Donald Trump's followers on Twitter by using latent Dirichlet allocation (LDA) to derive the weighted mixture of topics for each Trump tweet and then used negative binomial regression to model the "likes" with the weights of each topic serving as explanatory variables.

...read moreread less

Abstract: In this paper, we propose a framework to infer the topic preferences of Donald Trump's followers on Twitter. We first use latent Dirichlet allocation (LDA) to derive the weighted mixture of topics for each Trump tweet. Then we use negative binomial regression to model the "likes," with the weights of each topic serving as explanatory variables. Our study shows that attacking Democrats such as President Obama and former Secretary of State Hillary Clinton earns Trump the most "likes." Our framework of inference is generalizable to the study of other politicians.

...read moreread less

38 citations

Journal Article•DOI•

Inferring Concept Prerequisite Relations from Online Educational Resources

[...]

Sudeshna Roy, Meghana Madhyastha¹, Sheril Lawrence¹, Vaibhav Rajan²•Institutions (2)

International Institute of Information Technology, Bangalore¹, National University of Singapore²

17 Jul 2019

TL;DR: PREREQ is a new supervised learning method for inferring concept prerequisite relations using latent representations of concepts obtained from the Pairwise Latent Dirichlet Allocation model and a neural network based on the Siamese network architecture that can learn unknown concept prerequisites from course prerequisites and labeled concept prerequisite data.

...read moreread less

Abstract: The Internet has rich and rapidly increasing sources of high quality educational content. Inferring prerequisite relations between educational concepts is required for modern large-scale online educational technology applications such as personalized recommendations and automatic curriculum creation. We present PREREQ, a new supervised learning method for inferring concept prerequisite relations. PREREQ is designed using latent representations of concepts obtained from the Pairwise Latent Dirichlet Allocation model, and a neural network based on the Siamese network architecture. PREREQ can learn unknown concept prerequisites from course prerequisites and labeled concept prerequisite data. It outperforms state-of-the-art approaches on benchmark datasets and can effectively learn from very less training data. PREREQ can also use unlabeled video playlists, a steadily growing source of training data, to learn concept prerequisites, thus obviating the need for manual annotation of course prerequisites.

...read moreread less

38 citations

Journal Article•DOI•

Understanding relationship quality in hospitality services: A study based on text analytics and partial least squares

[...]

Manuel J. Sánchez-Franco, Gabriel Cepeda-Carrión, José L. Roldán

03 Jun 2019-Internet Research

TL;DR: LDA and PLS produce relevant informative summaries of corpora, and confirm and address more specifically the results of the previous literature concerning relationship quality.

...read moreread less

Abstract: The purpose of this paper is to analyze the occurrence of terms to identify the relevant topics and then to investigate the area (based on topics) of hospitality services that is highly associated with relationship quality. This research represents an opportunity to fill the gap in the current literature, and clarify the understanding of guests’ affective states by evaluating all aspects of their relationship with a hotel.,This research focuses on natural opinions upon which machine-learning algorithms can be executed: text summarization, sentiment analysis and latent Dirichlet allocation (LDA). Our data set contains 47,172 reviews of 33 hotels located in Las Vegas, and registered with Yelp. A component-based structural equation modeling (partial least squares (PLS)) is applied, with a dual – exploratory and predictive – purpose.,To maintain a truly loyal relationship and to achieve competitive success, hospitality managers must take into account both tangible and intangible features when allocating their marketing efforts to satisfaction-, trust- and commitment-based cues. On the other hand, the application of the PLS predict algorithm demonstrates the predictive performance (out-of-sample prediction) of our model that supports its ability to predict new and accurate values for individual cases when further samples are added.,LDA and PLS produce relevant informative summaries of corpora, and confirm and address more specifically the results of the previous literature concerning relationship quality. Our results are more reliable and accurate (providing insights not indicated in guests’ ratings into how hotels can improve their services) than prior statistical results based on limited sample data and on numerical satisfaction ratings alone.

...read moreread less

38 citations

Proceedings Article•DOI•

Discovering topics from dark websites

[...]

Li Yang¹, Feiqiong Liu¹, Joseph Migga Kizza¹, Raimund K. Ege²•Institutions (2)

University of Tennessee at Chattanooga¹, Northern Illinois University²

15 May 2009

TL;DR: A framework to discover latent topics from web sites of terrorists or extremists via analyzing contents of dark websites is proposed and LDA-based analysis assigns a probability to a document and captures exchangeability of both words and documents.

...read moreread less

Abstract: Analysis of dark websites is important for developing effective combating strategies against terrorism or extremists when more and more scattered terrorist cells use the ubiquity of the Internet to form communities in virtual space with fairly low costs. Terrorists or extremists anonymously set up various web sites embedded in the public Internet, exchanging ideology, spreading propaganda, and recruiting new members. In this paper, we propose a framework to discover latent topics via analyzing contents of dark websites. The content and data from dark websites are gathered and extracted by crawlers and exported to documents. Latent Dirichlet Allocation (LDA) algorithm is used to analyze the extracted documents so as to discover latent topics from web sites of terrorists or extremists. In contrast to the traditional Information Retrieval (IR) schemes, LDA-based analysis assigns a probability to a document and captures exchangeability of both words and documents. Our work helps to gain insights into the structure and communities of terrorists and extremists.

...read moreread less

38 citations

Collapse

Network Information

Performance

Metrics

6,513

Papers

245,225

Citations

No. of papers in the topic in previous years
Year	Papers
2023	323
2022	842
2021	418
2020	429
2019	473
2018	446

Latent Dirichlet allocation

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics