Topic

Probabilistic latent semantic analysis

About: Probabilistic latent semantic analysis is a research topic. Over its lifetime, 2,884 publications have been published within this topic, receiving 198,341 citations. The topic is also known as: PLSA.
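As background for the papers below: PLSA models each document as a mixture of latent topics and is typically fit by expectation-maximization. The following is a minimal illustrative sketch, not taken from any paper listed here; the toy count matrix, the number of topics, and the iteration count are all assumptions chosen for demonstration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy term-document count matrix: rows = documents, columns = words.
N = np.array([[4, 2, 0, 0],
              [3, 3, 1, 0],
              [0, 1, 4, 3],
              [0, 0, 2, 5]], dtype=float)
D, W = N.shape
K = 2  # number of latent topics z (illustrative choice)

# Random initialisation of P(z|d) and P(w|z), each row normalised.
p_z_d = rng.random((D, K)); p_z_d /= p_z_d.sum(1, keepdims=True)
p_w_z = rng.random((K, W)); p_w_z /= p_w_z.sum(1, keepdims=True)

for _ in range(50):
    # E-step: P(z|d,w) proportional to P(z|d) * P(w|z), normalised over z.
    joint = p_z_d[:, :, None] * p_w_z[None, :, :]   # shape (D, K, W)
    p_z_dw = joint / joint.sum(1, keepdims=True)
    # M-step: re-estimate parameters from expected counts n(d,w) * P(z|d,w).
    counts = N[:, None, :] * p_z_dw                 # shape (D, K, W)
    p_w_z = counts.sum(0); p_w_z /= p_w_z.sum(1, keepdims=True)
    p_z_d = counts.sum(2); p_z_d /= p_z_d.sum(1, keepdims=True)
```

After fitting, `p_z_d` gives each document's topic mixture and `p_w_z` gives each topic's word distribution; on this toy matrix the two word groups {0, 1} and {2, 3} separate into distinct topics.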


Papers
Proceedings ArticleDOI
24 Jul 2011
TL;DR: This paper introduces the Interdependent Latent Dirichlet Allocation (ILDA) model, a probabilistic graphical model that extracts aspects and corresponding ratings of products from online reviews, and evaluates it on a real-life dataset from Epinions.com.
Abstract: Today, more and more product reviews become available on the Internet, e.g., product review forums, discussion groups, and blogs. However, it is almost impossible for a customer to read all of the different and possibly even contradictory opinions and make an informed decision. Therefore, mining online reviews (opinion mining) has emerged as an interesting new research direction. Extracting aspects and the corresponding ratings is an important challenge in opinion mining. An aspect is an attribute or component of a product, e.g., 'screen' for a digital camera. It is common that reviewers use different words to describe an aspect (e.g., 'LCD', 'display', 'screen'). A rating is an intended interpretation of the user satisfaction in terms of numerical values. Reviewers usually express the rating of an aspect by a set of sentiments, e.g., 'blurry screen'. In this paper, we present three probabilistic graphical models which aim to extract aspects and corresponding ratings of products from online reviews. The first two models extend standard PLSI and LDA to generate a rated aspect summary of product reviews. As our main contribution, we introduce the Interdependent Latent Dirichlet Allocation (ILDA) model. This model is more natural for our task since the underlying probabilistic assumptions (interdependency between aspects and ratings) are appropriate for our problem domain. We conduct experiments on a real-life dataset from Epinions.com, demonstrating the improved effectiveness of the ILDA model in terms of the likelihood of a held-out test set and the accuracy of aspects and aspect ratings.
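The ILDA model itself is the paper's contribution and is not reproduced here. As background for the standard LDA that the paper's first two models extend, the following is a minimal collapsed Gibbs sampler for plain LDA on a toy corpus; the corpus, hyperparameters, and topic count are illustrative assumptions, not the paper's setup.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy corpus: each document is a list of word ids from a vocabulary of size V.
docs = [[0, 0, 1, 2], [1, 1, 2, 0], [3, 4, 4, 5], [4, 5, 5, 3]]
V, K = 6, 2                # vocabulary size, number of topics
alpha, beta = 0.5, 0.1     # symmetric Dirichlet hyperparameters (assumed)

# Count tables and random initial topic assignments.
ndk = np.zeros((len(docs), K))   # document-topic counts
nkw = np.zeros((K, V))           # topic-word counts
nk = np.zeros(K)                 # per-topic totals
z = [[int(rng.integers(K)) for _ in d] for d in docs]
for d, doc in enumerate(docs):
    for i, w in enumerate(doc):
        t = z[d][i]
        ndk[d, t] += 1; nkw[t, w] += 1; nk[t] += 1

for _ in range(200):             # collapsed Gibbs sweeps
    for d, doc in enumerate(docs):
        for i, w in enumerate(doc):
            t = z[d][i]          # remove the current assignment
            ndk[d, t] -= 1; nkw[t, w] -= 1; nk[t] -= 1
            # p(z=k | rest) ~ (ndk + alpha) * (nkw + beta) / (nk + V*beta)
            p = (ndk[d] + alpha) * (nkw[:, w] + beta) / (nk + V * beta)
            t = int(rng.choice(K, p=p / p.sum()))
            z[d][i] = t          # resample and restore counts
            ndk[d, t] += 1; nkw[t, w] += 1; nk[t] += 1

theta = (ndk + alpha) / (ndk + alpha).sum(1, keepdims=True)  # P(topic|doc)
phi = (nkw + beta) / (nkw + beta).sum(1, keepdims=True)      # P(word|topic)
```

`theta` and `phi` are the smoothed posterior point estimates; the ILDA extension additionally couples aspect and rating variables, which this plain-LDA sketch does not model.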

198 citations

Proceedings ArticleDOI
01 Aug 1994
TL;DR: This paper applies LSI to the routing task, in which a sample of relevant and non-relevant documents is available for constructing the query, and finds that while LSI alone yields only a slight improvement, using LSI in conjunction with statistical classification produces a dramatic improvement in performance.
Abstract: Latent Semantic Indexing (LSI) is a novel approach to information retrieval that attempts to model the underlying structure of term associations by transforming the traditional representation of documents as vectors of weighted term frequencies to a new coordinate space where both documents and terms are represented as linear combinations of underlying semantic factors. In previous research, LSI has produced a small improvement in retrieval performance. In this paper, we apply LSI to the routing task, which operates under the assumption that a sample of relevant and non-relevant documents is available to use in constructing the query. Once again, LSI slightly improves performance. However, when LSI is used in conjunction with statistical classification, there is a dramatic improvement in performance.
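The coordinate transformation the abstract describes is a truncated singular value decomposition of the term-document matrix. A minimal sketch with a made-up matrix and query follows; nothing here comes from the paper's routing experiments, and the fold-in formula is the standard LSI one.

```python
import numpy as np

# Toy term-document matrix A (terms x documents) of weighted term frequencies.
A = np.array([[2, 0, 1, 0],
              [1, 1, 0, 0],
              [0, 0, 1, 2],
              [0, 1, 2, 1]], dtype=float)

# Truncated SVD: A is approximated by U_k @ diag(s_k) @ Vt_k with k factors.
U, s, Vt = np.linalg.svd(A, full_matrices=False)
k = 2
U_k, s_k, Vt_k = U[:, :k], s[:k], Vt[:k, :]

# Documents as k-dimensional vectors in the latent semantic space.
doc_vecs = (np.diag(s_k) @ Vt_k).T                    # shape (n_docs, k)

# Fold a query (raw term vector) into the same space: q_hat = q U_k s_k^-1.
q = np.array([1, 1, 0, 0], dtype=float)
q_hat = q @ U_k @ np.diag(1.0 / s_k)

# Rank documents by cosine similarity to the folded-in query.
sims = doc_vecs @ q_hat / (np.linalg.norm(doc_vecs, axis=1)
                           * np.linalg.norm(q_hat))
ranking = np.argsort(-sims)
```

Because terms and documents share the factor space, the same fold-in works for new documents in the routing setting, where the "query" is built from known relevant and non-relevant samples.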

197 citations

Proceedings Article
01 Jan 1997
TL;DR: It is shown that, using semantic information, mixture LMs perform better than a conventional single LM with only a slight increase in computational cost; manual and automatic clustering are compared, building on previous work in the field of information retrieval.
Abstract: In this paper, an approach for constructing mixture language models (LMs) based on some notion of semantics is discussed. To this end, a technique known as latent semantic analysis (LSA) is used. The approach encapsulates corpus-derived semantic information and is able to model the varying style of the text. Using such information, the corpus texts are clustered in an unsupervised manner and mixture LMs are automatically created. This work builds on previous work in the field of information retrieval which was recently applied by Bellegarda et al. to the problem of clustering words by semantic categories. The principal contribution of this work is to characterize the document space resulting from the LSA modeling and to demonstrate the approach for mixture LM application. Comparison is made between manual and automatic clustering in order to elucidate how the semantic information is expressed in the space. It is shown that, using semantic information, mixture LMs perform better than a conventional single LM with only a slight increase in computational cost.
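The pipeline the abstract describes (LSA projection, unsupervised clustering, one LM per cluster, then a mixture) can be sketched as follows. The random corpus, cluster count, plain k-means, unigram LMs, and add-one smoothing are all illustrative assumptions, not the paper's actual choices.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy document-term count matrix; in practice this comes from a large corpus.
X = rng.poisson(1.0, size=(20, 30)).astype(float)

# 1) LSA: project documents onto the top-k singular directions.
U, s, Vt = np.linalg.svd(X, full_matrices=False)
k = 3
doc_vecs = U[:, :k] * s[:k]              # (n_docs, k) LSA representation

# 2) Unsupervised clustering (plain k-means) of documents in LSA space.
C = 4
centers = doc_vecs[rng.choice(len(doc_vecs), C, replace=False)]
for _ in range(25):
    dists = ((doc_vecs[:, None] - centers[None]) ** 2).sum(-1)
    labels = np.argmin(dists, axis=1)
    for c in range(C):
        if (labels == c).any():
            centers[c] = doc_vecs[labels == c].mean(0)

# 3) One add-one-smoothed unigram LM per cluster; the mixture interpolates
#    them with weights proportional to cluster size.
lms = np.stack([(X[labels == c].sum(0) + 1)
                / (X[labels == c].sum() + X.shape[1]) for c in range(C)])
weights = np.bincount(labels, minlength=C) / len(labels)
mixture = weights @ lms                  # mixture unigram probabilities
```

Each per-cluster LM and the final mixture are proper distributions; in the paper the component LMs are full n-gram models and the mixture weights can be adapted to the current text rather than fixed by cluster size.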

192 citations

01 Jan 1985
TL;DR: A semantic data model describes the concepts that are important to an organization, along with their meanings, their relationships to other important concepts, and how the data relate to the real world.

191 citations

Book
27 Apr 2013
TL;DR: This book covers latent trait theory and latent class theory, comparative views of the two, and application studies, including a latent class covariate model with applications to criterion-referenced testing.
Abstract: and Overview.
I. Latent Trait Theory: 1. Measurement Models for Ordered Response Categories. 2. Testing a Latent Trait Model. 3. Latent Trait Models with Indicators of Mixed Measurement Level.
II. Latent Class Theory: 4. New Developments in Latent Class Theory. 5. Log-Linear Modeling, Latent Class Analysis, or Correspondence Analysis: Which Method Should Be Used for the Analysis of Categorical Data? 6. A Latent Class Covariate Model with Applications to Criterion-Referenced Testing.
III. Comparative Views of Latent Traits and Latent Classes: 7. Test Theory with Qualitative and Quantitative Latent Variables. 8. Latent Class Models for Measuring. 9. Comparison of Latent Structure Models.
IV. Application Studies: 10. Latent Variable Techniques for Measuring Development. 11. Item Bias and Test Multidimensionality. 12. On a Rasch-Model-Based Test for Noncomputerized Adaptive Testing. 13. Systematizing the Item Content in Test Design.

190 citations


Network Information
Related Topics (5)
Feature extraction: 111.8K papers, 2.1M citations, 84% related
Feature (computer vision): 128.2K papers, 1.7M citations, 84% related
Support vector machine: 73.6K papers, 1.7M citations, 84% related
Deep learning: 79.8K papers, 2.1M citations, 83% related
Object detection: 46.1K papers, 1.3M citations, 82% related
Performance
Metrics
No. of papers in the topic in previous years
Year    Papers
2023    19
2022    77
2021    14
2020    36
2019    27
2018    58