Topic

Probabilistic latent semantic analysis

About: Probabilistic latent semantic analysis is a research topic. Over its lifetime, 2884 publications have appeared within this topic, receiving 198341 citations. The topic is also known as PLSA.

Papers

Proceedings Article

Latent Embeddings for Zero-Shot Classification

Yongqin Xian¹, Zeynep Akata¹, Gaurav Sharma¹, Quynh C. Nguyen², Matthias Hein², Bernt Schiele¹

Max Planck Society¹, Saarland University²

27 Jun 2016

TL;DR: This paper proposes a latent embedding model for learning a compatibility function between image and class embeddings in the context of zero-shot classification. The model augments the state-of-the-art bilinear compatibility model by incorporating latent variables.

Abstract: We present a novel latent embedding model for learning a compatibility function between image and class embeddings, in the context of zero-shot classification. The proposed method augments the state-of-the-art bilinear compatibility model by incorporating latent variables. Instead of learning a single bilinear map, it learns a collection of maps with the selection, of which map to use, being a latent variable for the current image-class pair. We train the model with a ranking based objective function which penalizes incorrect rankings of the true class for a given image. We empirically demonstrate that our model improves the state-of-the-art for various class embeddings consistently on three challenging publicly available datasets for the zero-shot setting. Moreover, our method leads to visually highly interpretable results with clear clusters of different fine-grained object properties that correspond to different latent variable maps.
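The scoring idea in this abstract can be illustrated with a minimal sketch. It assumes the latent map is chosen by taking the maximum bilinear score over the collection, which is one natural reading of "the selection of which map to use being a latent variable"; the function names and array shapes are hypothetical, and the ranking-based training objective is not shown.

```python
import numpy as np

def latent_compatibility(x, y, maps):
    """Score an image/class pair with a collection of bilinear maps.

    x:    image embedding, shape (d_x,)
    y:    class embedding, shape (d_y,)
    maps: K bilinear maps, shape (K, d_x, d_y)

    Assumption: the latent choice is the map with the highest score.
    """
    # scores[k] = x^T maps[k] y for each of the K maps
    scores = np.einsum("i,kij,j->k", x, maps, y)
    best = int(np.argmax(scores))
    return float(scores[best]), best

def predict(x, class_embeddings, maps):
    """Return the index of the class embedding with the highest compatibility."""
    scores = [latent_compatibility(x, y, maps)[0] for y in class_embeddings]
    return int(np.argmax(scores))
```

At test time, unseen classes only need an embedding `y`; no image of that class is required to score it.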

571 citations

Journal Article

Posterior Regularization for Structured Latent Variable Models

Kuzman Ganchev¹, João Graça², Jennifer Gillenwater², Ben Taskar¹

University of Pennsylvania¹, INESC-ID²

01 Mar 2010, Journal of Machine Learning Research

TL;DR: This work presents an efficient algorithm for learning with posterior regularization and illustrates its versatility on a diverse set of structural constraints such as bijectivity, symmetry and group sparsity in several large scale experiments, including multi-view learning, cross-lingual dependency grammar induction, unsupervised part-of-speech induction, and bitext word alignment.

Abstract: We present posterior regularization, a probabilistic framework for structured, weakly supervised learning. Our framework efficiently incorporates indirect supervision via constraints on posterior distributions of probabilistic models with latent variables. Posterior regularization separates model complexity from the complexity of structural constraints it is desired to satisfy. By directly imposing decomposable regularization on the posterior moments of latent variables during learning, we retain the computational efficiency of the unconstrained model while ensuring desired constraints hold in expectation. We present an efficient algorithm for learning with posterior regularization and illustrate its versatility on a diverse set of structural constraints such as bijectivity, symmetry and group sparsity in several large scale experiments, including multi-view learning, cross-lingual dependency grammar induction, unsupervised part-of-speech induction, and bitext word alignment.

570 citations

Journal Article

Exploiting latent semantic information in statistical language modeling

J.R. Bellegarda¹

Apple Inc.¹

01 Aug 2000

TL;DR: This paper focuses on the use of latent semantic analysis, a paradigm that automatically uncovers the salient semantic relationships between words and documents in a given corpus, and proposes an integrative formulation in which the latent semantic information is used to adjust the standard n-gram probability.

Abstract: Statistical language models used in large-vocabulary speech recognition must properly encapsulate the various constraints, both local and global, present in the language. While local constraints are readily captured through n-gram modeling, global constraints, such as long-term semantic dependencies, have been more difficult to handle within a data-driven formalism. This paper focuses on the use of latent semantic analysis, a paradigm that automatically uncovers the salient semantic relationships between words and documents in a given corpus. In this approach, (discrete) words and documents are mapped onto a (continuous) semantic vector space, in which familiar clustering techniques can be applied. This leads to the specification of a powerful framework for automatic semantic classification, as well as the derivation of several language model families with various smoothing properties. Because of their large-span nature, these language models are well suited to complement conventional n-grams. An integrative formulation is proposed for harnessing this synergy, in which the latent semantic information is used to adjust the standard n-gram probability. Such hybrid language modeling compares favorably with the corresponding n-gram baseline: experiments conducted on the Wall Street Journal domain show a reduction in average word error rate of over 20%. This paper concludes with a discussion of intrinsic tradeoffs, such as the influence of training data selection on the resulting performance.
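The mapping of words and documents onto a continuous semantic space can be sketched as follows. This is a minimal illustration using a truncated singular value decomposition of a word-by-document count matrix, the standard construction for latent semantic analysis; the helper names are hypothetical, and the weighting, smoothing, and n-gram integration described in the abstract are not reproduced.

```python
import numpy as np

def lsa_embed(counts, rank):
    """Embed words and documents in a shared rank-dimensional space.

    counts: word-by-document count matrix, shape (n_words, n_docs)
    rank:   number of latent dimensions to keep
    """
    u, s, vt = np.linalg.svd(counts, full_matrices=False)
    # Scale the leading singular vectors by their singular values.
    word_vecs = u[:, :rank] * s[:rank]
    doc_vecs = vt[:rank, :].T * s[:rank]
    return word_vecs, doc_vecs

def cosine(a, b):
    """Cosine similarity between two nonzero vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))
```

Words that occur in similar documents end up close together under `cosine`, which is what allows clustering techniques to be applied in the reduced space.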

565 citations

Proceedings Article

A recurrent latent variable model for sequential data

Junyoung Chung¹, Kyle Kastner¹, Laurent Dinh¹, Kratarth Goel¹, Aaron Courville¹, Yoshua Bengio¹

Université de Montréal¹

07 Dec 2015

TL;DR: It is argued that through the use of high-level latent random variables, the variational RNN (VRNN) can model the kind of variability observed in highly structured sequential data such as natural speech.

Abstract: In this paper, we explore the inclusion of latent random variables into the hidden state of a recurrent neural network (RNN) by combining the elements of the variational autoencoder. We argue that through the use of high-level latent random variables, the variational RNN (VRNN) can model the kind of variability observed in highly structured sequential data such as natural speech. We empirically evaluate the proposed model against other related sequential models on four speech datasets and one handwriting dataset. Our results show the important roles that latent random variables can play in the RNN dynamics.

539 citations

Patent

Computer information retrieval using latent semantic structure

Scott Craig Deerwester¹, Susan T. Dumais¹, George W. Furnas¹, Richard Allan Harshman¹, Thomas K. Landauer¹, Karen E. Lochbaum¹, Lynn A. Streeter¹

Telcordia Technologies¹

15 Sep 1988

TL;DR: This patent discloses a methodology for retrieving textual data objects, in which the information is treated in the statistical domain by presuming that there is an underlying, latent semantic structure in the usage of words in the data objects.

Abstract: A methodology for retrieving textual data objects is disclosed. The information is treated in the statistical domain by presuming that there is an underlying, latent semantic structure in the usage of words in the data objects. Estimates to this latent structure are utilized to represent and retrieve objects. A user query is recouched in the new statistical domain and then processed in the computer system to extract the underlying meaning to respond to the query.

536 citations

Network Information

Performance metrics

Papers: 2,984
Citations: 212,744

No. of papers in the topic in previous years
Year	Papers
2023	19
2022	77
2021	14
2020	36
2019	27
2018	58
