Topics over time: a non-Markov continuous-time model of topical trends

doi:10.1145/1150402.1150450

Proceedings Article

Xuerui Wang, +1 more

- pp 424-433

TL;DR

An LDA-style topic model is presented that captures not only the low-dimensional structure of data, but also how the structure changes over time, showing improved topics, better timestamp prediction, and interpretable trends.

Abstract:

This paper presents an LDA-style topic model that captures not only the low-dimensional structure of data, but also how the structure changes over time. Unlike other recent work that relies on Markov assumptions or discretization of time, here each topic is associated with a continuous distribution over timestamps, and for each generated document, the mixture distribution over topics is influenced by both word co-occurrences and the document's timestamp. Thus, the meaning of a particular topic can be relied upon as constant, but the topics' occurrence and correlations change significantly over time. We present results on nine months of personal email, 17 years of NIPS research papers and over 200 years of presidential state-of-the-union addresses, showing improved topics, better timestamp prediction, and interpretable trends.
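The generative story described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes timestamps normalized to [0, 1], a Beta density as the per-topic continuous distribution over time, and one timestamp drawn per word token. The function names and the toy parameters are hypothetical.

```python
import random


def sample_dirichlet(alpha, rng):
    """Draw a probability vector from Dirichlet(alpha) via normalized gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]


def sample_categorical(probs, rng):
    """Draw an index i with probability probs[i]."""
    r = rng.random()
    cumulative = 0.0
    for i, p in enumerate(probs):
        cumulative += p
        if r < cumulative:
            return i
    return len(probs) - 1  # guard against floating-point round-off


def generate_document(n_words, alpha, topic_word, topic_time, rng):
    """Generate (topic, word, timestamp) triples for one document.

    topic_word[z] is topic z's distribution over the vocabulary.
    topic_time[z] is an assumed (a, b) Beta parameter pair for topic z,
    standing in for the "continuous distribution over timestamps".
    """
    theta = sample_dirichlet(alpha, rng)  # document's mixture over topics
    tokens = []
    for _ in range(n_words):
        z = sample_categorical(theta, rng)          # topic for this token
        w = sample_categorical(topic_word[z], rng)  # word from that topic
        a, b = topic_time[z]
        t = rng.betavariate(a, b)                   # timestamp from that topic
        tokens.append((z, w, t))
    return theta, tokens


# Toy setup: 2 topics, 3-word vocabulary.
rng = random.Random(0)
alpha = [0.5, 0.5]
topic_word = [[0.8, 0.1, 0.1], [0.1, 0.1, 0.8]]
topic_time = [(2.0, 8.0), (8.0, 2.0)]  # topic 0 peaks early, topic 1 late
theta, tokens = generate_document(20, alpha, topic_word, topic_time, rng)
```

Because each topic's word distribution is fixed while its time density is not uniform, the topic's meaning stays constant and only its prevalence varies with time, which matches the distinction the abstract draws.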

Citations

Proceedings Article

Group formation in large social networks: membership, growth, and evolution

Lars Backstrom, +3 more

TL;DR: It is found that the propensity of individuals to join communities, and of communities to grow rapidly, depends in subtle ways on the underlying network structure, and decision-tree techniques are used to identify the most significant structural determinants of these properties.

Proceedings Article

Meme-tracking and the dynamics of the news cycle

Jure Leskovec, +2 more

TL;DR: This work develops a framework for tracking short, distinctive phrases that travel relatively intact through on-line text, develops scalable algorithms for clustering textual variants of such phrases, and identifies a broad class of memes that exhibit wide spread and rich variation on a daily basis.

Journal Article

Probabilistic Topic Models

David M. Blei, +2 more

- 18 Oct 2010 -

IEEE Signal Processing Magazine

TL;DR: This paper reviews probabilistic topic models, which can be used to summarize a large collection of documents with a smaller number of distributions over words.

Proceedings Article

A biterm topic model for short texts

Xiaohui Yan, +3 more

TL;DR: The approach discovers more prominent and coherent topics and significantly outperforms baseline methods on several evaluation metrics. BTM is also found to outperform LDA even on normal texts, showing the potential generality and wider usage of the new topic model.

Proceedings Article

Recurrent Recommender Networks

Chao-Yuan Wu, +4 more

TL;DR: Recurrent Recommender Networks (RRN) are proposed that are able to predict future behavioral trajectories by endowing both users and movies with a Long Short-Term Memory (LSTM) autoregressive model that captures dynamics, in addition to a more traditional low-rank factorization.

References

Journal Article

Latent Dirichlet allocation

David M. Blei, +2 more

- 01 Mar 2003 -

Journal of Machine Learning Research

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.

Proceedings Article

Latent Dirichlet Allocation

David M. Blei, +2 more

TL;DR: This paper proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models, including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model, also known as probabilistic latent semantic indexing (pLSI).

Journal Article

Finding scientific topics

Thomas L. Griffiths, +1 more

- 06 Apr 2004 -

Proceedings of the National Academy of S...

TL;DR: A generative model for documents is described, introduced by Blei, Ng, and Jordan, and a Markov chain Monte Carlo algorithm is presented for inference in this model, which is used to analyze abstracts from PNAS by using Bayesian model selection to establish the number of topics.

Journal Article

Hierarchical Dirichlet Processes

Yee Whye Teh, +3 more

- 01 Dec 2006 -

Journal of the American Statistical Asso...

TL;DR: This work considers problems involving groups of data where each observation within a group is a draw from a mixture model and where it is desirable to share mixture components between groups. It considers a hierarchical model, specifically one in which the base measure for the child Dirichlet processes is itself distributed according to a Dirichlet process.

Journal Article

An introduction to MCMC for machine learning

Christophe Andrieu, +3 more

- 01 Jan 2003 -

Machine Learning

TL;DR: The purpose of this introductory paper is to introduce the Monte Carlo method with emphasis on probabilistic machine learning and to review the main building blocks of modern Markov chain Monte Carlo simulation.

Related Papers (5)

Latent Dirichlet allocation

Dynamic topic models

Finding scientific topics

Probabilistic latent semantic indexing

The author-topic model for authors and documents