Probabilistic latent semantic analysis

About: Probabilistic latent semantic analysis, also known as PLSA, is a research topic. Over its lifetime, 2884 publications have appeared within this topic, receiving 198341 citations.

Papers published on a yearly basis

[Chart: number of papers per year, 1954 to 2023]

Papers

Patent•

Clickthrough-based latent semantic model

Jianfeng Gao¹, Kristina Toutanova¹, Wen-tau Yih¹•Institutions (1)

Microsoft¹

19 Dec 2011

TL;DR: A computer-implemented method and system for ranking documents is presented, which includes identifying a number of query-document pairs based on clickthrough data for a number of documents.

Abstract: There is provided a computer-implemented method and system for ranking documents. The method includes identifying a number of query-document pairs based on clickthrough data for a number of documents. The method also includes building a latent semantic model based on the query-document pairs and ranking the documents for a search based on the latent semantic model.

21 citations

Journal Article•DOI•

An HMM-Based Algorithm for Content Ranking and Coherence-Feature Extraction

Chien-Liang Liu¹, Wen-Hoar Hsaio¹, Chia-Hoang Lee¹, Hsiao-Cheng Chi•Institutions (1)

National Chiao Tung University¹

09 Jan 2013

TL;DR: This paper proposes an algorithm called coherence hidden Markov model (HMM) to extract coherence features and rank content, along with an intelligent assisted blog writing system based on the coherence-HMM ranking model.

Abstract: In this paper, we propose an algorithm called coherence hidden Markov model (HMM) to extract coherence features and rank content. Coherence HMM is a variant of HMM and is used to model the stochastic process of essay writing and identify topics as hidden states, given sequenced clauses as observations. This study uses probabilistic latent semantic analysis for parameter estimation of coherence HMM. In coherence-feature extraction, support vector regression (SVR) with surface features and coherence features is used for essay grading. The experimental results indicate that SVR can benefit from coherence features. The adjacent agreement rate and the exact agreement rate are 95.24% and 59.80%, respectively. Moreover, this study submits high-scoring essays to the same experiment and finds that the adjacent agreement rate and exact agreement rate are 98.33% and 64.50%, respectively. In content ranking, we design and implement an intelligent assisted blog writing system based on the coherence-HMM ranking model. Several corpora are employed to help users efficiently compose blog articles. When users finish composing a clause or sentence, the system provides candidate texts for their reference based on current clause or sentence content. The experimental results demonstrate that all participants can benefit from the system and save considerable time on writing articles.

21 citations
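
The abstract above reports an exact agreement rate and an adjacent agreement rate for essay grading but does not define them. The sketch below shows one common reading, under the assumption that "exact" means the predicted score equals the human score and "adjacent" means it falls within one score point. The function name `agreement_rates` and the `tolerance` parameter are hypothetical, and the scores in the usage lines are made up for illustration, not taken from the paper.

```python
def agreement_rates(human, predicted, tolerance=1):
    """Return (exact_rate, adjacent_rate) for two equal-length score lists.

    Assumption: "adjacent" counts predictions within `tolerance` points of
    the human score; the paper may use a different definition.
    """
    if len(human) != len(predicted) or len(human) == 0:
        raise ValueError("score lists must be non-empty and of equal length")
    n = len(human)
    # Exact agreement: predicted score identical to the human score.
    exact = sum(h == p for h, p in zip(human, predicted)) / n
    # Adjacent agreement: predicted score within `tolerance` of the human score.
    adjacent = sum(abs(h - p) <= tolerance for h, p in zip(human, predicted)) / n
    return exact, adjacent


# Illustrative scores only (not data from the paper).
human_scores = [4, 3, 5, 2]
model_scores = [4, 4, 3, 2]
exact_rate, adjacent_rate = agreement_rates(human_scores, model_scores)
```

Under this reading, the adjacent rate is never lower than the exact rate, which matches the pattern of the figures quoted in the abstract.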

Proceedings Article•DOI•

An effective semantic search technique using ontology

Jihyun Lee¹, Jun-Ki Min², Chin-Wan Chung¹•Institutions (2)

KAIST¹, Korea University of Technology and Education²

20 Apr 2009

TL;DR: This paper proposes a novel ranking model that provides more accurate semantic search results compared to existing ranking models and considers the number of meaningful semantic relationships between a resource and keywords, the coverage of keywords, and the distinguishability of keywords.

Abstract: In this paper, we present a semantic search technique considering the type of desired Web resources and the semantic relationships between the resources and the query keywords in the ontology. In order to effectively retrieve the most relevant top-k resources, we propose a novel ranking model. To do this, we devise a measure to determine the weight of the semantic relationship. In addition, we consider the number of meaningful semantic relationships between a resource and keywords, the coverage of keywords, and the distinguishability of keywords. Through experiments using real datasets, we observe that our ranking model provides more accurate semantic search results compared to existing ranking models.

20 citations

Proceedings Article•DOI•

Unsupervised modeling and recognition of object categories with combination of visual contents and geometric similarity links

Gunhee Kim¹, Christos Faloutsos¹, Martial Hebert¹•Institutions (1)

Carnegie Mellon University¹

30 Oct 2008

TL;DR: A probabilistic approach for unsupervised modeling and recognition of object categories which combines two types of complementary visual evidence, visual contents and inter-connected links between the images is proposed.

Abstract: This paper proposes a probabilistic approach for unsupervised modeling and recognition of object categories which combines two types of complementary visual evidence, visual contents and inter-connected links between the images. By doing so, our approach not only increases modeling and recognition performance but also provides possible solutions to several problems including modeling of geometric information, computational complexity, and the inherent ambiguity of visual words. Our approach can be incorporated in any generative models, but here we consider two popular models, pLSA and LDA. Experimental results show that the topic models updated by adding link analysis terms significantly improve the standard pLSA and LDA models. Furthermore, we presented competitive performances on unsupervised modeling, ranking of training images, classification of unseen images, and localization tasks with MSRC and PASCAL2005 datasets.

20 citations

Book Chapter•DOI•

PLSI: The True Fisher Kernel and beyond

Jean-Cédric Chappelier¹, Emmanuel Eckard¹•Institutions (1)

École Polytechnique Fédérale de Lausanne¹

30 Aug 2009

TL;DR: This paper proposes a novel and theoretically sound document similarity, which avoids the problem of "folding in" unknown documents in PLSI, and experimental results are provided on several information retrieval evaluation sets.

Abstract: The Probabilistic Latent Semantic Indexing model, introduced by T. Hofmann (1999), has engendered applications in numerous fields, notably document classification and information retrieval. In this context, the Fisher kernel was found to be an appropriate document similarity measure. However, the kernels published so far contain unjustified features, some of which hinder their performances. Furthermore, PLSI is not generative for unknown documents, a shortcoming usually remedied by "folding them in" the PLSI parameter space. This paper contributes on both points by (1) introducing a new, rigorous development of the Fisher kernel for PLSI, addressing the role of the Fisher Information Matrix, and uncovering its relation to the kernels proposed so far; and (2) proposing a novel and theoretically sound document similarity, which avoids the problem of "folding in" unknown documents. For both aspects, experimental results are provided on several information retrieval evaluation sets.

20 citations
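
For orientation, the PLSI model named in the abstract above factorizes word probabilities as P(w|d) = Σ_z P(w|z) P(z|d) and is usually fitted with EM. The sketch below is a minimal NumPy version of that standard procedure, plus the "folding in" step the abstract mentions, in which P(w|z) is held fixed and only P(z|d) is re-estimated for an unseen document. It is not the Fisher kernel or the similarity measure proposed in the paper. The function names (`em_step`, `fit_plsa`, `fold_in`), the random initialization, and the toy count matrix are assumptions made for illustration.

```python
import numpy as np


def em_step(counts, p_w_z, p_z_d, update_topics):
    """One EM iteration for PLSA on a (n_docs, n_words) count matrix."""
    # E-step: posterior P(z | d, w), shape (n_docs, n_topics, n_words).
    joint = p_z_d[:, :, None] * p_w_z[None, :, :]
    post = joint / np.maximum(joint.sum(axis=1, keepdims=True), 1e-12)
    # Expected topic counts n(d, w) * P(z | d, w).
    weighted = counts[:, None, :] * post
    # M-step for P(z | d): normalize over topics within each document.
    new_p_z_d = weighted.sum(axis=2)
    new_p_z_d /= np.maximum(new_p_z_d.sum(axis=1, keepdims=True), 1e-12)
    if not update_topics:
        return p_w_z, new_p_z_d
    # M-step for P(w | z): normalize over words within each topic.
    new_p_w_z = weighted.sum(axis=0)
    new_p_w_z /= np.maximum(new_p_w_z.sum(axis=1, keepdims=True), 1e-12)
    return new_p_w_z, new_p_z_d


def _random_rows(rng, shape):
    """Random non-negative matrix whose rows each sum to 1."""
    m = rng.random(shape)
    return m / m.sum(axis=1, keepdims=True)


def fit_plsa(counts, n_topics, n_iter=50, seed=0):
    """Return P(w|z) of shape (n_topics, n_words) and P(z|d) of shape (n_docs, n_topics)."""
    rng = np.random.default_rng(seed)
    n_docs, n_words = counts.shape
    p_w_z = _random_rows(rng, (n_topics, n_words))
    p_z_d = _random_rows(rng, (n_docs, n_topics))
    for _ in range(n_iter):
        p_w_z, p_z_d = em_step(counts, p_w_z, p_z_d, update_topics=True)
    return p_w_z, p_z_d


def fold_in(new_counts, p_w_z, n_iter=50, seed=0):
    """Estimate P(z|d) for unseen documents while keeping P(w|z) fixed."""
    rng = np.random.default_rng(seed)
    p_z_d = _random_rows(rng, (new_counts.shape[0], p_w_z.shape[0]))
    for _ in range(n_iter):
        _, p_z_d = em_step(new_counts, p_w_z, p_z_d, update_topics=False)
    return p_z_d


# Toy corpus (made up): two documents use words 0-1, two use words 2-3.
toy_counts = np.array(
    [[5, 4, 0, 0], [4, 6, 0, 0], [0, 0, 7, 3], [0, 0, 2, 8]], dtype=float
)
topic_word, doc_topic = fit_plsa(toy_counts, n_topics=2, n_iter=100)
unseen_doc_topic = fold_in(np.array([[3.0, 2.0, 0.0, 0.0]]), topic_word)
```

The E-step here materializes a dense (documents × topics × words) array, which keeps the sketch short but is only practical for small vocabularies; a real implementation would iterate over the non-zero counts instead.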

Network Information

Performance

Metrics

2,984 papers

212,744 citations

No. of papers in the topic in previous years
Year	Papers
2023	19
2022	77
2021	14
2020	36
2019	27
2018	58
