Home
/
Topics
/
Probabilistic latent semantic analysis

Topic

Probabilistic latent semantic analysis

About: Probabilistic latent semantic analysis is a research topic. Over the lifetime, 2884 publications have been published within this topic receiving 198341 citations. The topic is also known as: PLSA.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1985
1984
1983
1982
1981
1980
1979
1978
1974
1973
1971
1970
1969
1965
1963
1960
1958
1956
1954

1 / 2

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Chinese Spoken Document Summarization Using Probabilistic Latent Topical Information

[...]

Berlin Chen¹, Yao-Ming Yeh¹, Yao-Min Huang¹, Yi-Ting Chen¹•Institutions (1)

National Taiwan Normal University¹

14 May 2006

TL;DR: The use of probabilistic latent topical information for extractive summarization of spoken documents is proposed and the summarization capabilities were verified by comparison with the conventional vector space model and latent semantic indexing model, as well as the HMM model.

...read moreread less

Abstract: The purpose of extractive summarization is to automatically select a number of indicative sentences, passages, or paragraphs from the original document according to a target summarization ratio and then sequence them to form a concise summary. In the paper, we proposed the use of probabilistic latent topical information for extractive summarization of spoken documents. Various kinds of modeling structures and learning approaches were extensively investigated. In addition, the summarization capabilities were verified by comparison with the conventional vector space model and latent semantic indexing model, as well as the HMM model. The experiments were performed on the Chinese broadcast news collected in Taiwan. Noticeable performance gains were obtained.

...read moreread less

27 citations

Patent•

Convolutional latent semantic models and their applications

[...]

Xiaodong He¹, Jianfeng Gao¹, Li Deng¹, Qiang Lou¹, Yunhong Zhou¹, Guowei Liu¹, Gregory Buehrer¹, Jianchang Mao¹, Yelong Shen¹, Ruofei Zhang¹ - Show less +6 more•Institutions (1)

Microsoft¹

01 Apr 2014

TL;DR: In this paper, a deep learning model, such as a convolutional latent semantic model, is designed to capture both the local and global linguistic contexts of the linguistic items, and the similarity measure expresses the closeness between the first and second linguistic items in a high-level semantic space.

...read moreread less

Abstract: Functionality is described herein for transforming first and second symbolic linguistic items into respective first and second continuous-valued concept vectors, using a deep learning model, such as a convolutional latent semantic model. The model is designed to capture both the local and global linguistic contexts of the linguistic items. The functionality then compares the first concept vector with the second concept vector to produce a similarity measure. More specifically, the similarity measure expresses the closeness between the first and second linguistic items in a high-level semantic space. In one case, the first linguistic item corresponds to a query, and the second linguistic item may correspond to a phrase, or a document, or a keyword, or an ad, etc. In one implementation, the convolutional latent semantic model is produced in a training phase based on click-through data.

...read moreread less

27 citations

Proceedings Article•DOI•

Uniqueness of Non-Negative Matrix Factorization

[...]

Hans Laurberg¹•Institutions (1)

Aalborg University¹

26 Aug 2007

TL;DR: A strong uniqueness theorem on non-negative matrix factorizations (NMF) is introduced and it is described how the theorem can be applied to two of the common application areas of NMF, namely music analysis and probabilistic latent semantic analysis.

...read moreread less

Abstract: In this paper, two new properties of stochastic vectors are introduced and a strong uniqueness theorem on non-negative matrix factorizations (NMF) is introduced. It is described how the theorem can be applied to two of the common application areas of NMF, namely music analysis and probabilistic latent semantic analysis. Additionally, the theorem can be used for selecting the model order and the sparsity parameter in sparse NMFs.

...read moreread less

27 citations

Proceedings Article•DOI•

Automated essay scoring using Generalized Latent Semantic Analysis

[...]

Md. Monjurul Islam¹, A. S. M. Latiful Hoque¹•Institutions (1)

Bangladesh University of Engineering and Technology¹

01 Dec 2010

TL;DR: This work has developed an AEG system using Generalized Latent Semantic Analysis (GLSA) which makes n-gram by document matrix instead of word by documents matrix and outperforms the existing system.

...read moreread less

Abstract: Automated Essay Grading (AEG) is a very important research area in educational technology. Latent Semantic Analysis (LSA) is an information retrieval technique used for automated essay grading. LSA forms a word by document matrix and then the matrix is decomposed using Singular Value Decomposition (SVD) technique. Existing AEG systems based on LSA cannot achieve higher level of performance to be a replica of human grader. We have developed an AEG system using Generalized Latent Semantic Analysis (GLSA) which makes n-gram by document matrix instead of word by document matrix. We have evaluated this system using details representation and showed the performance of the system. Experimental results show that our system outperforms the existing system.

...read moreread less

27 citations

Book Chapter•DOI•

Query expansion using a collection dependent probabilistic latent semantic thesaurus

[...]

Laurence A. F. Park¹, Kotagiri Ramamohanarao¹•Institutions (1)

University of Melbourne¹

22 May 2007

TL;DR: This investigation examines the novel automatic query expansion method using the latent semantic thesaurus, which is based on probabilistic latent semantic analysis, and shows how to construct it by mining text documents for probabilism term relationships.

...read moreread less

Abstract: Many queries on collections of text documents are too short to produce informative results. Automatic query expansion is a method of adding terms to the query without interaction from the user in order to obtain more refined results. In this investigation, we examine our novel automatic query expansion method using the probabilistic latent semantic thesaurus, which is based on probabilistic latent semantic analysis. We show how to construct the thesaurus by mining text documents for probabilistic term relationships, and we show that by using the latent semantic thesaurus, we can overcome many of the problems associated to latent semantic analysis on large document sets which were previously identified. Experiments using TREC document sets show that our term expansion method out performs the popular probabilistic pseudorelevance feedback method by 7.3%.

...read moreread less

27 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
…
132
133
134
135
136
137
138
…
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

2,984

Papers

212,744

Citations

No. of papers in the topic in previous years
Year	Papers
2023	19
2022	77
2021	14
2020	36
2019	27
2018	58

Probabilistic latent semantic analysis

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics