Home
/
Topics
/
Latent Dirichlet allocation

Topic

Latent Dirichlet allocation

About: Latent Dirichlet allocation is a research topic. Over the lifetime, 5351 publications have been published within this topic receiving 212555 citations. The topic is also known as: LDA.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1992
1990
1989
1988
1985
1979
1976
1969
1965

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Deep learning and network analysis: Classifying and visualizing accident narratives in construction

[...]

Botao Zhong¹, Xing Pan¹, Peter E.D. Love², Lieyun Ding¹, Weili Fang¹ - Show less +1 more•Institutions (2)

Huazhong University of Science and Technology¹, Curtin University²

01 May 2020-Automation in Construction

TL;DR: The proposed automated classification model and LDA-based network analysis method provide a useful approach to enable machine-assisted interpretation of texts-based accident narratives and can provide managers with much-needed information and knowledge to improve safety on-site.

...read moreread less

80 citations

Journal Article•DOI•

The Effect of Calorie Posting Regulation on Consumer Opinion: A Flexible Latent Dirichlet Allocation Model with Informative Priors

[...]

Dinesh Puranam¹, Vishal Narayan², Vrinda Kadiyali¹•Institutions (2)

Cornell University¹, National University of Singapore²

21 Aug 2017-Marketing Science

TL;DR: A scalable Bayesian topic model is proposed to measure and understand changes in consumer opinion about health (and other topics) and calibrate the model on 761,962 online reviews of restaurants posted over eight years.

...read moreread less

Abstract: In 2008, New York City mandated that all chain restaurants post calorie information on their menus. For managers of chain and standalone restaurants, as well as for policy makers, a pertinent goal might be to monitor the impact of this regulation on consumer conversations. We propose a scalable Bayesian topic model to measure and understand changes in consumer opinion about health (and other topics). We calibrate the model on 761,962 online reviews of restaurants posted over eight years. Our model allows managers to specify prior topics of interest such as “health” for a calorie posting regulation. It also allows the distribution of topic proportions within a review to be affected by its length, valence, and the experience level of its author. Using a difference-in-differences estimation approach, we isolate the potentially causal effect of the regulation on consumer opinion. Following the regulation, there was a statistically small but significant increase in the proportion of discussion of the health to...

...read moreread less

79 citations

Journal Article•DOI•

Extracting Features of Entertainment Products: A Guided Latent Dirichlet Allocation Approach Informed by the Psychology of Media Consumption:

[...]

Olivier Toubia, Garud Iyengar, Renée Bunnell, Alain Lemaire

01 Feb 2019-Journal of Marketing Research

TL;DR: In this article, a quantitative approach for describing entertainment products, in a way that allows for improving the predictive performance of consumer choice models for these products, has been proposed to improve the prediction performance of these models.

...read moreread less

Abstract: The authors propose a quantitative approach for describing entertainment products, in a way that allows for improving the predictive performance of consumer choice models for these products. Their ...

...read moreread less

79 citations

Journal Article•DOI•

Fuzzy Approach Topic Discovery in Health and Medical Corpora

[...]

Amir Karami¹, Aryya Gangopadhyay², Bin Zhou², Hadi Kharrazi³•Institutions (3)

University of South Carolina¹, University of Maryland, Baltimore County², Johns Hopkins University³

01 Apr 2018-International Journal of Fuzzy Systems

TL;DR: F fuzzy latent semantic analysis (FLSA) is described, a novel approach in topic modeling using fuzzy perspective that can handle health and medical corpora redundancy issue and provides a new method to estimate the number of topics.

...read moreread less

Abstract: The majority of medical documents and electronic health records are in text format that poses a challenge for data processing and finding relevant documents. Looking for ways to automatically retrieve the enormous amount of health and medical knowledge has always been an intriguing topic. Powerful methods have been developed in recent years to make the text processing automatic. One of the popular approaches to retrieve information based on discovering the themes in health and medical corpora is topic modeling; however, this approach still needs new perspectives. In this research, we describe fuzzy latent semantic analysis (FLSA), a novel approach in topic modeling using fuzzy perspective. FLSA can handle health and medical corpora redundancy issue and provides a new method to estimate the number of topics. The quantitative evaluations show that FLSA produces superior performance and features to latent Dirichlet allocation, the most popular topic model.

...read moreread less

79 citations

Journal Article•

Comparison of Dimension Reduction Methods for Automated Essay Grading

[...]

Tuomo Kakkonen, Niko Myller, Erkki Sutinen, Jari Timonen

01 Jul 2008-Educational Technology & Society

TL;DR: The results show that the use of learning materials as training data for the grading model outperforms the k-NN-based grading methods and the division of the learning materials in the training data is crucial.

...read moreread less

Abstract: Automatic Essay Assessor (AEA) is a system that utilizes information retrieval techniques such as Latent Semantic Analysis (LSA), Probabilistic Latent Semantic Analysis (PLSA), and Latent Dirichlet Allocation (LDA) for automatic essay grading. The system uses learning materials and relatively few teacher-graded essays for calibrating the scoring mechanism before grading. We performed a series of experiments using LSA, PLSA and LDA for document comparisons in AEA. In addition to comparing the methods on a theoretical level, we compared the applicability of LSA, PLSA, and LDA to essay grading with empirical data. The results show that the use of learning materials as training data for the grading model outperforms the k-NN-based grading methods. In addition to this, we found that using LSA yielded slightly more accurate grading than PLSA and LDA. We also found that the division of the learning materials in the training data is crucial. It is better to divide learning materials into sentences than paragraphs.

...read moreread less

79 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
…
75
76
77
78
79
80
81
…
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

6,513

Papers

245,225

Citations

No. of papers in the topic in previous years
Year	Papers
2023	323
2022	842
2021	418
2020	429
2019	473
2018	446

Latent Dirichlet allocation

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics