Open Access Proceedings Article

A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining

TLDR
This paper proposes a method that encourages sparsity by adding regularization constraints on the searched distributions; the constraints can be used with most topic models and lead to a simple modified version of the standard EM optimization procedure.
Abstract
We address the mining of sequential activity patterns from document logs given as word-time occurrences. We achieve this using topics that model both the co-occurrence and the temporal order in which words occur within a temporal window. Discovering such topics, which is particularly hard when multiple activities can occur simultaneously, is conducted through the joint inference of the temporal topics and of their starting times, allowing the implicit alignment of occurrences of the same activity within the document. A current issue is that while we would like topic starting times to be represented by sparse distributions, this is not achieved in practice. Thus, in this paper, we propose a method that encourages sparsity by adding regularization constraints on the searched distributions. The constraints can be used with most topic models (e.g. PLSA, LDA) and lead to a simple modified version of the standard EM optimization procedure. The effect of the sparsity constraint on our activity model and the robustness improvement in the presence of different types of noise have been validated on synthetic data. Its effectiveness is also illustrated in video activity analysis, where the discovered topics capture frequent patterns that implicitly represent typical trajectories of scene objects.
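To make the idea concrete, here is a minimal sketch of how a sparsity-encouraging constraint can be folded into an EM procedure. It uses plain PLSA on a document-word count matrix rather than the paper's temporal motif model, and it sharpens the expected counts with an exponent before renormalizing in the M-step. The exponent `sharpness`, the choice of regularizing p(z|d), and all names below are illustrative assumptions, not the paper's exact regularizer (which targets the topic starting-time distributions).

import numpy as np

def plsa_sparse(counts, n_topics, n_iters=50, sharpness=1.3, seed=0):
    """PLSA fit by EM, with a sparsity-encouraging M-step for p(z|d).

    counts: (n_docs, n_words) word-count matrix.
    sharpness > 1 lowers the entropy of p(z|d); 1.0 recovers plain EM.
    """
    rng = np.random.default_rng(seed)
    n_docs, n_words = counts.shape
    p_w_z = rng.random((n_topics, n_words))
    p_w_z /= p_w_z.sum(axis=1, keepdims=True)   # p(w|z)
    p_z_d = rng.random((n_docs, n_topics))
    p_z_d /= p_z_d.sum(axis=1, keepdims=True)   # p(z|d)
    for _ in range(n_iters):
        # E-step: posterior responsibilities p(z|d,w), shape (docs, words, topics)
        joint = p_z_d[:, None, :] * p_w_z.T[None, :, :]
        post = joint / joint.sum(axis=-1, keepdims=True)
        expected = counts[:, :, None] * post    # expected topic counts
        # M-step for p(w|z): standard maximum-likelihood update
        p_w_z = expected.sum(axis=0).T
        p_w_z /= p_w_z.sum(axis=1, keepdims=True)
        # Regularized M-step for p(z|d): raise expected counts to a power > 1
        # and renormalize, which concentrates mass on few topics per document.
        m = expected.sum(axis=1) ** sharpness
        p_z_d = m / m.sum(axis=1, keepdims=True)
    return p_w_z, p_z_d

On a toy matrix, e.g. plsa_sparse(np.array([[5, 0, 1], [0, 4, 2]]), n_topics=2), increasing sharpness drives each row of p(z|d) toward a single dominant topic; in the paper's setting, the analogous modified update would instead act on the distribution over topic starting times.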



Citations
Proceedings Article

Optimization for Machine Learning

TL;DR: This book captures the state of the art of the interaction between optimization and machine learning in a way that is accessible to researchers in both fields, enriching the ongoing cross-fertilization between the machine learning community and the broader optimization community.
Journal ArticleDOI

Additive regularization of topic models

TL;DR: This paper introduces an alternative semi-probabilistic approach, called additive regularization of topic models (ARTM), which regularizes the ill-posed problem of stochastic matrix factorization by maximizing a weighted sum of the log-likelihood and additional criteria.
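Up to notation, the weighted-sum criterion mentioned above takes the form (n_{dw} are word counts, \phi_{wt} = p(w \mid t), \theta_{td} = p(t \mid d), and each R_i is a regularizer with nonnegative weight \tau_i):

\max_{\Phi,\Theta}\; \sum_{d \in D}\sum_{w \in W} n_{dw}\,\ln\!\sum_{t \in T}\phi_{wt}\,\theta_{td} \;+\; \sum_{i}\tau_i\,R_i(\Phi,\Theta)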
Journal ArticleDOI

A Sequential Topic Model for Mining Recurrent Activities from Long Term Video Logs

TL;DR: This paper introduces a novel probabilistic activity modeling approach that mines recurrent sequential patterns, called motifs, from documents given as word-time count matrices, and proposes a general method that favors the recovery of sparse distributions by adding simple regularization constraints on the searched distributions to the data-likelihood optimization criterion.
Book ChapterDOI

Tutorial on Probabilistic Topic Modeling: Additive Regularization for Stochastic Matrix Factorization

TL;DR: Additive Regularization of Topic Models (ARTM) is a non-Bayesian approach that is free of redundant probabilistic assumptions and provides simple inference for many combined and multi-objective topic models.
References
Journal ArticleDOI

Latent Dirichlet Allocation

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Proceedings Article

Latent Dirichlet Allocation

TL;DR: This paper proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models, including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model, also known as probabilistic latent semantic indexing (pLSI).
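For reference, the generative model these two papers introduce factorizes, for a document of N words with topic proportions \theta drawn from a Dirichlet prior:

p(\theta, \mathbf{z}, \mathbf{w} \mid \alpha, \beta) \;=\; p(\theta \mid \alpha)\,\prod_{n=1}^{N} p(z_n \mid \theta)\,p(w_n \mid z_n, \beta)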
Journal ArticleDOI

Unsupervised Learning by Probabilistic Latent Semantic Analysis

TL;DR: This paper proposes to make use of a temperature-controlled version of the Expectation-Maximization algorithm for model fitting, which has shown excellent performance in practice and results in a more principled approach with a solid foundation in statistical inference.
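The temperature-controlled (tempered) E-step referred to here raises the likelihood terms to a power \beta \le 1 before normalizing, so that \beta = 1 recovers standard EM (a sketch, up to notation):

P(z \mid d, w) \;=\; \frac{P(z)\,\big[P(d \mid z)\,P(w \mid z)\big]^{\beta}}{\sum_{z'} P(z')\,\big[P(d \mid z')\,P(w \mid z')\big]^{\beta}}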
Proceedings ArticleDOI

Dynamic topic models

TL;DR: A family of probabilistic time series models is developed to analyze the time evolution of topics in large document collections; these dynamic topic models provide a qualitative window into the contents of such a collection.
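Concretely, in the dynamic topic model the natural parameters \beta_{t,k} of topic k drift across time slices with Gaussian noise and are mapped to word probabilities through a softmax (a sketch of the state-space component, up to notation):

\beta_{t,k} \mid \beta_{t-1,k} \sim \mathcal{N}\!\big(\beta_{t-1,k},\, \sigma^2 I\big), \qquad p(w \mid \beta_{t,k}) \;=\; \frac{\exp(\beta_{t,k,w})}{\sum_{w'} \exp(\beta_{t,k,w'})}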
Proceedings ArticleDOI

Topics over time: a non-Markov continuous-time model of topical trends

TL;DR: An LDA-style topic model is presented that captures not only the low-dimensional structure of data, but also how the structure changes over time, showing improved topics, better timestamp prediction, and interpretable trends.
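In topics over time, each topic z additionally owns a Beta distribution over normalized timestamps, so a word's timestamp is generated jointly with the word (a sketch, up to notation):

z_{di} \sim \mathrm{Mult}(\theta_d), \qquad w_{di} \sim \mathrm{Mult}(\phi_{z_{di}}), \qquad t_{di} \sim \mathrm{Beta}(\psi_{z_{di}})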