Topic

Probabilistic latent semantic analysis

About: Probabilistic latent semantic analysis (PLSA) is a research topic. Over its lifetime, 2,884 publications have appeared within this topic, receiving 198,341 citations. The topic is also known as: PLSA.
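PLSA models each document-word co-occurrence as a mixture over latent topics, P(w|d) = Σ_z P(z|d) P(w|z), with parameters fitted by EM. The following is a minimal NumPy sketch of that EM loop; function and variable names are illustrative, not taken from any paper listed below.

```python
import numpy as np

def plsa(counts, n_topics, n_iter=100, seed=0):
    """Fit PLSA by EM on a document-word count matrix (docs x words).

    Returns P(z|d) with shape (docs, topics) and P(w|z) with
    shape (topics, words).
    """
    rng = np.random.default_rng(seed)
    n_docs, n_words = counts.shape

    # Random initialisation of P(z|d) and P(w|z), each row normalised.
    p_z_d = rng.random((n_docs, n_topics))
    p_z_d /= p_z_d.sum(axis=1, keepdims=True)
    p_w_z = rng.random((n_topics, n_words))
    p_w_z /= p_w_z.sum(axis=1, keepdims=True)

    for _ in range(n_iter):
        # E-step: responsibilities P(z|d,w) ∝ P(z|d) P(w|z),
        # stored as a (docs, topics, words) tensor.
        joint = p_z_d[:, :, None] * p_w_z[None, :, :]
        joint /= joint.sum(axis=1, keepdims=True) + 1e-12

        # M-step: reweight by the observed counts n(d,w).
        weighted = counts[:, None, :] * joint
        p_w_z = weighted.sum(axis=0)                     # sum over docs
        p_w_z /= p_w_z.sum(axis=1, keepdims=True) + 1e-12
        p_z_d = weighted.sum(axis=2)                     # sum over words
        p_z_d /= p_z_d.sum(axis=1, keepdims=True) + 1e-12

    return p_z_d, p_w_z
```

On a small count matrix with two disjoint word blocks, the recovered topics concentrate on one block each; real corpora would use sparse matrices rather than the dense (docs, topics, words) tensor above.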


Papers
Journal ArticleDOI
TL;DR: Experimental results show that the proposed latent feature-based transfer learning (TL) strategy has a significant advantage in gear fault diagnosis, especially under varying working conditions.
Abstract: Gears are often operated under various working conditions, which may cause the training and testing data to have different but related distributions when conducting gear fault diagnosis. To address this issue, a latent feature-based transfer learning (TL) strategy is proposed in this paper. First, the bag-of-fault-words (BOFW) model combined with the continuous wavelet transform (CWT) method is developed to extract and represent every fault feature parameter as a histogram. Before identifying the gear fault, the latent feature-based TL strategy is carried out, which adopts the joint dual-probabilistic latent semantic analysis (JD-PLSA) to model the shared and domain-specific latent features. After that, a mapping matrix between two domains can be constructed by using Pearson’s correlation coefficients (PCCs) to effectively transfer shared and mapped domain-specific latent knowledge and to reduce the gap between the two domains. Then, a Fisher kernel-based support vector machine (FSVM) is used to identify the gear fault types. To verify the effectiveness of the proposed approach, gear data sets gathered from Spectra Quest’s drivetrain dynamics simulator (DDS) are analyzed. Experimental results show that the proposed approach has a significant advantage in gear fault diagnosis, especially under varying working conditions.

17 citations

Journal Article
TL;DR: In this paper, distribution-based functions for the errors in the estimation of the latent variables were derived for both the maximum likelihood and the Bayes methods, and the asymptotic behavior of both the methods was analyzed.
Abstract: Hierarchical statistical models are widely employed in information science and data engineering. The models consist of two types of variables: observable variables that represent the given data and latent variables for the unobservable labels. An asymptotic analysis of the models plays an important role in evaluating the learning process; the result of the analysis is applied not only to theoretical but also to practical situations, such as optimal model selection and active learning. There are many studies of generalization errors, which measure the prediction accuracy of the observable variables. However, the accuracy of estimating the latent variables has not yet been elucidated. For a quantitative evaluation of this, the present paper formulates distribution-based functions for the errors in the estimation of the latent variables. The asymptotic behavior is analyzed for both the maximum likelihood and the Bayes methods.

17 citations

Journal ArticleDOI
TL;DR: This paper presents a novel semantic smoothing method named Higher-Order Smoothing (HOS) for the Naive Bayes algorithm, built on a similar graph based data representation of the HONB which allows semantics in higher-order paths to be exploited.
Abstract: It is known that latent semantic indexing (LSI) takes advantage of implicit higher-order (or latent) structure in the association of terms and documents. Higher-order relations in LSI capture “latent semantics”. These findings have inspired Higher-Order Naive Bayes (HONB), a previously introduced Bayesian classification framework that can explicitly make use of these higher-order relations. In this paper, we present a novel semantic smoothing method named Higher-Order Smoothing (HOS) for the Naive Bayes algorithm. HOS is built on a graph-based data representation similar to that of HONB, which allows semantics in higher-order paths to be exploited. We take the concept one step further in HOS and exploit the relationships between instances of different classes. As a result, we move beyond not only instance boundaries but also class boundaries to exploit the latent information in higher-order paths. This approach improves parameter estimation when dealing with insufficient labeled data. Results of our extensive experiments demonstrate the value of HOS on several benchmark datasets.

17 citations
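For context, HOS replaces the simple Laplace (add-alpha) smoothing used when estimating multinomial Naive Bayes parameters from sparse labeled data. A minimal sketch of that baseline estimator follows; it is the standard method HOS improves upon, not an implementation of HOS itself, and all names are illustrative.

```python
import numpy as np

def train_multinomial_nb(X, y, alpha=1.0):
    """Multinomial Naive Bayes with Laplace (add-alpha) smoothing.

    X: (docs, words) term-count matrix; y: class labels.
    Returns the class list, log priors, and per-class log likelihoods.
    """
    classes = np.unique(y)
    log_prior = np.log(np.array([(y == c).mean() for c in classes]))
    # Per-class word counts, smoothed so unseen words keep nonzero mass.
    word_counts = np.array([X[y == c].sum(axis=0) for c in classes])
    smoothed = word_counts + alpha
    log_lik = np.log(smoothed / smoothed.sum(axis=1, keepdims=True))
    return classes, log_prior, log_lik

def predict_nb(X, classes, log_prior, log_lik):
    """Pick the class maximising log P(c) + sum_w n(d,w) log P(w|c)."""
    scores = log_prior + X @ log_lik.T
    return classes[np.argmax(scores, axis=1)]
```

With little labeled data, the add-alpha term dominates the estimates of P(w|c); HOS instead propagates evidence along higher-order paths across instances (and classes) to get better estimates.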

Proceedings Article
01 Jan 2010
TL;DR: This paper proposes a method that encourages sparsity, by adding regularization constraints on the searched distributions, which can be used with most topic models and lead to a simple modified version of the EM standard optimization procedure.
Abstract: We address the mining of sequential activity patterns from document logs given as word-time occurrences. We achieve this using topics that model both the co-occurrence and the temporal order in which words occur within a temporal window. Discovering such topics, which is particularly hard when multiple activities can occur simultaneously, is conducted through the joint inference of the temporal topics and of their starting times, allowing the implicit alignment of occurrences of the same activity in the document. A current issue is that while we would like topic starting times to be represented by sparse distributions, this is not achieved in practice. Thus, in this paper, we propose a method that encourages sparsity by adding regularization constraints on the searched distributions. The constraints can be used with most topic models (e.g., PLSA, LDA) and lead to a simple modified version of the standard EM optimization procedure. The effect of the sparsity constraint on our activity model and the robustness improvement in the presence of different types of noise have been validated on synthetic data. Its effectiveness is also illustrated in video activity analysis, where the discovered topics capture frequent patterns that implicitly represent typical trajectories of scene objects.

17 citations
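One common way such a sparsity regularizer enters EM is as an entropic-prior-style modification of the M-step: the expected counts for a distribution are sharpened before renormalisation, pushing mass onto a few entries. The sketch below illustrates that generic mechanism only; the exact regularizer and update used in the paper above may differ.

```python
import numpy as np

def sparse_m_step(expected_counts, temperature=0.5):
    """M-step update that sharpens a distribution toward sparsity.

    Raising the expected counts to the power 1/temperature (> 1 when
    temperature < 1) before renormalising concentrates probability mass
    on the largest entries, approximating MAP estimation under an
    entropy-penalising (sparsity) prior. temperature=1 recovers the
    plain maximum-likelihood M-step.
    """
    sharpened = np.asarray(expected_counts, dtype=float) ** (1.0 / temperature)
    return sharpened / sharpened.sum()
```

For expected counts [4, 1], the plain M-step gives [0.8, 0.2], while temperature 0.5 gives [16/17, 1/17]: the same ordering, but a visibly sparser distribution over topic starting times.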


Network Information
Related Topics (5)
Feature extraction: 111.8K papers, 2.1M citations, 84% related
Feature (computer vision): 128.2K papers, 1.7M citations, 84% related
Support vector machine: 73.6K papers, 1.7M citations, 84% related
Deep learning: 79.8K papers, 2.1M citations, 83% related
Object detection: 46.1K papers, 1.3M citations, 82% related
Performance Metrics
No. of papers in the topic in previous years:
Year: Papers
2023: 19
2022: 77
2021: 14
2020: 36
2019: 27
2018: 58