Home
/
Topics
/
Probabilistic latent semantic analysis

Topic

Probabilistic latent semantic analysis

About: Probabilistic latent semantic analysis is a research topic. Over the lifetime, 2884 publications have been published within this topic receiving 198341 citations. The topic is also known as: PLSA.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1985
1984
1983
1982
1981
1980
1979
1978
1974
1973
1971
1970
1969
1965
1963
1960
1958
1956
1954

1 / 2

Papers

PDF

Open Access

More filters

Journal Article•DOI•

An exploration of improving prediction accuracy by constructing a multi-type clustering based recommendation framework

[...]

Xiao Ma¹, Hongwei Lu¹, Zaobin Gan¹, Qian Zhao¹•Institutions (1)

Huazhong University of Science and Technology¹

26 May 2016-Neurocomputing

TL;DR: A multi-type clustering based recommendation framework which systematically considers the trust-based user clustering, similarity-based users clustering and similarity- based item clustering to further improve the recommendation accuracy is proposed.

...read moreread less

27 citations

Journal Article•DOI•

Modeling continuous visual features for semantic image annotation and retrieval

[...]

Zhixin Li¹, Zhiping Shi¹, Xi Liu¹, Zhongzhi Shi¹•Institutions (1)

Chinese Academy of Sciences¹

01 Feb 2011-Pattern Recognition Letters

TL;DR: A semantic annotation model is presented which employs continuous PLSA and standard PLSA to model visual features and textual words respectively and can predict semantic annotation precisely for unseen images.

...read moreread less

27 citations

Cross-Language Information Retrieval Using Latent Semantic Indexing

[...]

Paul G. Young

01 Oct 1994

TL;DR: Using the proposed merge strategies, LSI is shown to be able to retrieve relevant documents from either language (Greek or English) without requiring any translation of a user's query.

...read moreread less

Abstract: In this thesis, a method for indexing cross-language databases for conceptual querymatching is presented. Two languages (Greek and English) are combined by appending a small portion of documents from one language to the identical documents in the other language. The proposed merging strategy duplicates less than 7% of the entire database (made up of di erent translations of the Gospels). Previous strategies duplicated up to 34% of the initial database in order to perform the merger. The proposed method retrieves a larger number of relevant documents for both languages with higher cosine rankings when Latent Semantic Indexing (LSI) is employed. Using the proposed merge strategies, LSI is shown to be e ective in retrieving documents from either language (Greek or English) without requiring any translation of a user's query. An e ective Bible search product needs to allow the use of natural language for searching (queries). LSI enables the user to form queries with using natural expressions in the user's own native language. The merging strategy proposed in this study enables LSI to retrieve relevant documents e ectively while duplicating a minimum of the entire database. iv

...read moreread less

27 citations

Proceedings Article•DOI•

Medical image retrieval using bag of meaningful visual words: unsupervised visual vocabulary pruning with PLSA

[...]

Antonio Foncubierta-Rodríguez¹, Alba García Seco de Herrera¹, Henning Müller¹•Institutions (1)

University of Applied Sciences Western Switzerland¹

22 Oct 2013

TL;DR: A visual vocabulary pruning technique is presented that enormously reduces the amount of required words to describe a medical image dataset with no significant effect on the accuracy.

...read moreread less

Abstract: Content--based medical image retrieval has been proposed as a technique that allows not only for easy access to images from the relevant literature and electronic health records but also for training physicians, for research and clinical decision support The bag-of-visual-words approach is a widely used technique that tries to shorten the semantic gap by learning meaningful features from the dataset and describing documents and images in terms of the histogram of these features Visual vocabularies are often redundant, over--complete and noisy Larger than required vocabularies lead to high--dimensional feature spaces, which present important disadvantages with the curse of dimensionality and computational cost being the most obvious ones In this work a visual vocabulary pruning technique is presented It enormously reduces the amount of required words to describe a medical image dataset with no significant effect on the accuracy Results show that a reduction of up to 90% can be achieved without impact on the system performance Obtaining a more compact representation of a document enables multimodal description as well as using classifiers requiring low--dimensional representations

...read moreread less

27 citations

Proceedings Article•DOI•

[...]

Guoli Song¹, Shuhui Wang, Qingming Huang¹, Qi Tian²•Institutions (2)

Chinese Academy of Sciences¹, University of Texas at San Antonio²

01 Dec 2015

TL;DR: This work builds the work based on Gaussian process latent variable model (GPLVM) to learn the non-linear non-parametric mapping functions and transform heterogeneous data into a shared latent space and proposes multi-modal Similarity Gaussian Process latent Variable model (m-SimGP), which learns the nonlinear mapping functions between the intra- modal similarities and latent representation.

...read moreread less

Abstract: Data from real applications involve multiple modalities representing content with the same semantics and deliver rich information from complementary aspects. However, relations among heterogeneous modalities are simply treated as observation-to-fit by existing work, and the parameterized cross-modal mapping functions lack flexibility in directly adapting to the content divergence and semantic complicacy of multi-modal data. In this paper, we build our work based on Gaussian process latent variable model (GPLVM) to learn the non-linear non-parametric mapping functions and transform heterogeneous data into a shared latent space. We propose multi-modal Similarity Gaussian Process latent variable model (m-SimGP), which learns the nonlinear mapping functions between the intra-modal similarities and latent representation. We further propose multi-modal regularized similarity GPLVM (m-RSimGP) by encouraging similar/dissimilar points to be similar/dissimilar in the output space. The overall objective functions are solved by simple and scalable gradient decent techniques. The proposed models are robust to content divergence and high-dimensionality in multi-modal representation. They can be applied to various tasks to discover the non-linear correlations and obtain the comparable low-dimensional representation for heterogeneous modalities. On two widely used real-world datasets, we outperform previous approaches for cross-modal content retrieval and cross-modal classification.

...read moreread less

27 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
…
135
136
137
138
139
140
141
…
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

2,984

Papers

212,744

Citations

No. of papers in the topic in previous years
Year	Papers
2023	19
2022	77
2021	14
2020	36
2019	27
2018	58

Probabilistic latent semantic analysis

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics