Topic

Probabilistic latent semantic analysis

About: Probabilistic latent semantic analysis is a research topic. Over its lifetime, 2,884 publications have been published on this topic, receiving 198,341 citations in total. The topic is also known as PLSA.


Papers
Proceedings ArticleDOI
03 Jul 2014
TL;DR: A novel Latent Semantic Sparse Hashing (LSSH) method is proposed for cross-modal similarity search, employing Sparse Coding to capture the salient structures of images and Matrix Factorization to learn latent concepts from text.
Abstract: Similarity search methods based on hashing, which enable effective and efficient cross-modal retrieval on large-scale multimedia databases with massive text and images, have attracted considerable attention. The core problem of cross-modal hashing is how to effectively construct correlations between multi-modal representations, which are intrinsically heterogeneous, during hash function learning. Analogous to Canonical Correlation Analysis (CCA), most existing cross-modal hashing methods embed the heterogeneous data into a joint abstraction space by linear projections. However, these methods fail to effectively bridge the semantic gap or to capture the high-level latent semantic information that has been shown to improve image retrieval performance. To address these challenges, we propose a novel Latent Semantic Sparse Hashing (LSSH) method that performs cross-modal similarity search using Sparse Coding and Matrix Factorization. In particular, LSSH uses Sparse Coding to capture the salient structures of images and Matrix Factorization to learn latent concepts from text. The learned latent semantic features are then mapped into a joint abstraction space. Moreover, an iterative strategy is applied to derive optimal solutions efficiently, which helps LSSH explore the correlation between multi-modal representations efficiently and automatically. Finally, unified hash codes are generated from the high-level abstraction space by quantization. Extensive experiments on three different datasets highlight the advantage of our method in cross-modal scenarios and show that LSSH significantly outperforms several state-of-the-art methods.
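
The pipeline described above (sparse codes for images, factorized concepts for text, a joint space, then quantization) can be sketched in a few lines. This is a minimal, illustrative reconstruction, not the authors' implementation: the random dictionary D, the random projection P, the single ISTA step, and the sign-based binarization are all simplifying assumptions; the paper learns these components jointly with an iterative strategy.

```python
# Minimal sketch of the LSSH-style pipeline, using numpy only.
import numpy as np

rng = np.random.default_rng(0)
n, d_img, d_txt, k = 200, 64, 32, 16      # samples, feature dims, code length

X_img = rng.standard_normal((n, d_img))   # image features (e.g. bag-of-visual-words)
X_txt = rng.random((n, d_txt))            # non-negative text features (e.g. tf-idf)

# 1) Sparse Coding for images: one ISTA step against a dictionary D
#    (illustrative; the paper optimizes the dictionary as well).
D = rng.standard_normal((d_img, k))
lam, step = 0.1, 1.0 / np.linalg.norm(D, 2) ** 2
A = step * (X_img @ D)
S_img = np.sign(A) * np.maximum(np.abs(A) - lam * step, 0)  # soft threshold

# 2) Matrix Factorization for text: multiplicative NMF updates, X_txt ~= W @ H.
W, H = rng.random((n, k)), rng.random((k, d_txt))
for _ in range(50):
    H *= (W.T @ X_txt) / (W.T @ W @ H + 1e-9)
    W *= (X_txt @ H.T) / (W @ H @ H.T + 1e-9)

# 3) Map both latent representations into a joint space and quantize to bits.
P = rng.standard_normal((k, k))           # stands in for the learned mapping
codes_img = (S_img @ P > 0).astype(np.uint8)
codes_txt = (W @ P > 0).astype(np.uint8)

# Cross-modal retrieval: rank text items by Hamming distance to an image query.
dist = (codes_txt != codes_img[0]).sum(axis=1)
print("closest text items to image 0:", np.argsort(dist)[:5])
```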

384 citations

Proceedings Article
05 Dec 2005
TL;DR: In this paper, the authors generalize a successful static model of relationships into a dynamic model that accounts for friendships drifting over time, and show how to make it tractable to learn such models from data, even as the number of entities n gets large.
Abstract: This paper explores two aspects of social network modeling. First, we generalize a successful static model of relationships into a dynamic model that accounts for friendships drifting over time. Second, we show how to make it tractable to learn such models from data, even as the number of entities n gets large. The generalized model associates each entity with a point in p-dimensional Euclidean latent space. The points can move as time progresses, but large moves in latent space are improbable. Observed links between entities are more likely if the entities are close in latent space. We show how to make such a model tractable (sub-quadratic in the number of entities) by the use of appropriate kernel functions for similarity in latent space; the use of low-dimensional kd-trees; a new efficient dynamic adaptation of multidimensional scaling for a first pass of approximate projection of entities into latent space; and an efficient conjugate gradient update rule for non-linear local optimization in which amortized time per entity during an update is O(log n). We use both synthetic and real-world data on up to 11,000 entities, which indicate linear scaling in computation time and improved performance over four alternative approaches. We also illustrate the system operating on twelve years of NIPS co-publication data. We present a detailed version of this work in [1].
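
The core of the model (points in latent space, links more likely between nearby points, a penalty on large moves between time steps) can be sketched with plain gradient descent. This is a toy reconstruction under stated assumptions: a squared-error surrogate replaces the paper's likelihood, and the kd-tree and multidimensional-scaling machinery that yields sub-quadratic scaling is omitted.

```python
# Toy sketch of a dynamic latent-space network model: entities live in a
# p-dimensional space, nearby points link more often, and positions are
# penalized for drifting far between adjacent time steps.
import numpy as np

rng = np.random.default_rng(1)
n, p, T = 50, 2, 3                         # entities, latent dims, time steps

# Random symmetric 0/1 link observations per time step (illustrative data).
links = rng.random((T, n, n)) < 0.1
links = links | links.transpose(0, 2, 1)

Z = rng.standard_normal((T, n, p)) * 0.1   # latent positions per time step
lr, drift_penalty = 0.05, 1.0

def link_prob(Z_t):
    d2 = ((Z_t[:, None, :] - Z_t[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2)                     # closer points -> higher probability

for _ in range(200):
    for t in range(T):
        P = link_prob(Z[t])
        resid = (P - links[t]) * P         # from d/dZ of the squared-error loss
        diff = Z[t][:, None, :] - Z[t][None, :, :]
        grad = -4 * (resid[:, :, None] * diff).sum(axis=1)
        # Penalize large moves relative to neighboring time steps.
        if t > 0:
            grad += drift_penalty * (Z[t] - Z[t - 1])
        if t < T - 1:
            grad += drift_penalty * (Z[t] - Z[t + 1])
        Z[t] -= lr * grad

err = sum(((link_prob(Z[t]) - links[t]) ** 2).sum() for t in range(T))
print("final link reconstruction error:", round(float(err), 2))
```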

364 citations

Proceedings ArticleDOI
21 Jul 2017
TL;DR: A novel Latent Multi-view Subspace Clustering (LMSC) method is proposed that clusters data points in a learned latent representation while exploiting complementary information from multiple views, making the subspace representation more accurate and robust.
Abstract: In this paper, we propose a novel Latent Multi-view Subspace Clustering (LMSC) method, which clusters data points with a latent representation and simultaneously explores the underlying complementary information from multiple views. Unlike most existing single-view subspace clustering methods, which reconstruct data points using original features, our method seeks the underlying latent representation and simultaneously performs data reconstruction based on the learned latent representation. Thanks to the complementarity of multiple views, the latent representation can depict the data more comprehensively than any single view individually, which in turn makes the subspace representation more accurate and robust. The proposed method is intuitive and can be optimized efficiently using the Augmented Lagrangian Multiplier with Alternating Direction Minimization (ALM-ADM) algorithm. Extensive experiments on benchmark datasets have validated the effectiveness of our proposed method.
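
The two-stage structure (recover a shared latent representation from all views, then cluster via self-expression) can be illustrated with a few alternating least-squares updates. This is a simplified stand-in for the paper's ALM-ADM optimization; the ridge-regularized self-expression and all variable names are illustrative assumptions.

```python
# Sketch of the LMSC idea: a shared latent representation H explains every
# view through a per-view projection, and H is then self-expressed to build
# an affinity matrix for spectral clustering.
import numpy as np

rng = np.random.default_rng(2)
n, k = 60, 5                              # samples, latent dimension
views = [rng.standard_normal((20, n)),    # view 1: 20-dim features
         rng.standard_normal((35, n))]    # view 2: 35-dim features

H = rng.standard_normal((k, n))           # shared latent representation
for _ in range(30):
    # Update per-view projections P_v for X_v ~= P_v @ H (least squares).
    Ps = [X @ H.T @ np.linalg.inv(H @ H.T + 1e-6 * np.eye(k)) for X in views]
    # Update H jointly from all views (ridge least squares).
    A = sum(P.T @ P for P in Ps) + 1e-6 * np.eye(k)
    B = sum(P.T @ X for P, X in zip(Ps, views))
    H = np.linalg.solve(A, B)

# Self-expressive coefficients Z with H ~= H @ Z (ridge in place of the
# paper's low-rank constraint), then a symmetric affinity matrix.
G = H.T @ H
Z = np.linalg.solve(G + 0.1 * np.eye(n), G)
affinity = np.abs(Z) + np.abs(Z).T        # input to spectral clustering
print("affinity matrix:", affinity.shape)
```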

357 citations

Proceedings Article
31 Mar 2010
TL;DR: In this article, a variational inference framework for training the Gaussian process latent variable model and thus performing Bayesian nonlinear dimensionality reduction is introduced, which can automatically select the dimensionality of the nonlinear latent space.
Abstract: We introduce a variational inference framework for training the Gaussian process latent variable model and thus performing Bayesian nonlinear dimensionality reduction. This method allows us to variationally integrate out the input variables of the Gaussian process and compute a lower bound on the exact marginal likelihood of the nonlinear latent variable model. The maximization of the variational lower bound provides a Bayesian training procedure that is robust to overfitting and can automatically select the dimensionality of the nonlinear latent space. We demonstrate our method on real world datasets. The focus in this paper is on dimensionality reduction problems, but the methodology is more general. For example, our algorithm is immediately applicable for training Gaussian process models in the presence of missing or uncertain inputs.
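
In practice this method is available off the shelf; the sketch below uses the GPy library's BayesianGPLVM (choosing GPy is an assumption on our part; any variational GPLVM implementation would serve). The automatic dimensionality selection described above shows up in the ARD kernel: latent dimensions the variational bound switches off end up with very large lengthscales.

```python
# Hedged sketch: fit a Bayesian GPLVM with variational inference via GPy,
# then inspect the ARD lengthscales to see which latent dimensions survive.
# The data here is random noise purely for illustration.
import numpy as np
import GPy

Y = np.random.default_rng(3).standard_normal((100, 12))  # observed data (N x D)
Q = 5                                                    # latent dims to start with

kernel = GPy.kern.RBF(Q, ARD=True)        # one lengthscale per latent dimension
model = GPy.models.BayesianGPLVM(Y, Q, kernel=kernel, num_inducing=20)
model.optimize(messages=False, max_iters=500)

# Dimensions with very large lengthscales are effectively pruned by the bound.
print(kernel.lengthscale)
```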

338 citations

Patent
14 Apr 2004
TL;DR: In this paper, a trainable semantic vector (TSV) is constructed to represent the significance of the information relative to each of the predetermined categories, and various types of manipulation and analysis such as searching, classification, and clustering can subsequently be performed on a semantic level.
Abstract: An apparatus and method are disclosed for producing a semantic representation of information in a semantic space. The information is first represented in a table that stores values which indicate a relationship with predetermined categories. The categories correspond to dimensions in the semantic space. The significance of the information with respect to the predetermined categories is then determined. A trainable semantic vector (TSV) is constructed to provide a semantic representation of the information. The TSV has dimensions equal to the number of predetermined categories and represents the significance of the information relative to each of the predetermined categories. Various types of manipulation and analysis, such as searching, classification, and clustering, can subsequently be performed on a semantic level.
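
As a rough illustration of the idea (not the patented method itself), the sketch below builds a small table of documents against predetermined categories, turns each row into a significance-weighted vector whose dimensions correspond to the categories, and performs a search by vector similarity. The tf-idf-style weighting is an assumption; the patent does not prescribe this particular formula.

```python
# Illustrative sketch of a trainable-semantic-vector (TSV) style representation.
import numpy as np

categories = ["sports", "finance", "science"]
# Raw table: rows are documents, columns are counts of category-indicative
# terms; the semantic space has one dimension per predetermined category.
counts = np.array([[8, 1, 0],
                   [0, 6, 2],
                   [1, 0, 9]], dtype=float)

# Significance of each document w.r.t. each category: row-normalize, then
# down-weight categories that appear in many documents (tf-idf style).
doc_freq = (counts > 0).sum(axis=0)
tsv = (counts / counts.sum(axis=1, keepdims=True)) * np.log(len(counts) / doc_freq)

# Semantic-level search: cosine similarity between a query TSV and the table.
query = tsv[0]
sims = tsv @ query / (np.linalg.norm(tsv, axis=1) * np.linalg.norm(query))
print("ranking:", np.argsort(-sims))
```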

326 citations


Network Information
Related Topics (5)
Feature extraction: 111.8K papers, 2.1M citations, 84% related
Feature (computer vision): 128.2K papers, 1.7M citations, 84% related
Support vector machine: 73.6K papers, 1.7M citations, 84% related
Deep learning: 79.8K papers, 2.1M citations, 83% related
Object detection: 46.1K papers, 1.3M citations, 82% related
Performance Metrics
No. of papers in the topic in previous years:

Year    Papers
2023    19
2022    77
2021    14
2020    36
2019    27
2018    58