Open Access Proceedings Article

Sparse Additive Generative Models of Text

TL;DR
This approach has two key advantages: it can enforce sparsity to prevent overfitting, and it can combine generative facets through simple addition in log space, avoiding the need for latent switching variables.
Abstract
Generative models of text typically associate a multinomial with every class label or topic. Even in simple models this requires the estimation of thousands of parameters; in multi-faceted latent variable models, standard approaches require additional latent "switching" variables for every token, complicating inference. In this paper, we propose an alternative generative model for text. The central idea is that each class label or latent topic is endowed with a model of the deviation in log-frequency from a constant background distribution. This approach has two key advantages: we can enforce sparsity to prevent overfitting, and we can combine generative facets through simple addition in log space, avoiding the need for latent switching variables. We demonstrate the applicability of this idea to a range of scenarios: classification, topic modeling, and more complex multifaceted generative models.
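To make the central idea concrete, here is a minimal numerical sketch (the function name and toy numbers are invented for illustration; this is not the paper's implementation): each class label or topic contributes a sparse deviation vector that is simply added to the background log-frequencies, and the result is exponentiated and normalized.

```python
import numpy as np

def sage_word_distribution(log_background, *deviations):
    """Combine background log-frequencies with sparse per-facet
    deviations by simple addition in log space, then normalize."""
    logits = log_background + sum(deviations)
    logits -= logits.max()            # for numerical stability
    weights = np.exp(logits)
    return weights / weights.sum()

# Toy vocabulary of five words. A topic facet boosts word 2 and a
# label facet suppresses word 4; all other deviations are exactly
# zero, which is the sparsity the model encourages.
m = np.log(np.array([0.4, 0.3, 0.1, 0.1, 0.1]))   # background
eta_topic = np.array([0.0, 0.0, 1.5, 0.0, 0.0])   # topic deviation
eta_label = np.array([0.0, 0.0, 0.0, 0.0, -2.0])  # label deviation
print(sage_word_distribution(m, eta_topic, eta_label))
```

Because facets combine by addition in log space, no per-token switching variable is needed to decide which facet generated each word.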



Citations
Journal Article

Structural topic models for open-ended survey responses

TL;DR: The structural topic model makes analyzing open-ended responses easier and more revealing, and allows treatment effects to be estimated; the approach is illustrated with analyses of text from surveys and experiments.
Journal Article

stm: An R Package for Structural Topic Models

TL;DR: This paper demonstrates how to use the R package stm for structural topic modeling, which allows researchers to flexibly estimate a topic model that includes document-level metadata.
Journal Article

A model of text for experimentation in the social sciences

TL;DR: A hierarchical mixed membership model for analyzing the topical content of documents, in which mixing weights are parameterized by observed covariates, is posited; this enables researchers to introduce elements of the experimental design that informed document collection into the model, within a generally applicable framework.
Proceedings Article

Discovering geographical topics in the twitter stream

TL;DR: An algorithm is presented that models diversity in tweets along topical and geographical dimensions together with each user's interest distribution; sparse factorial coding of the attributes allows it to handle a large and diverse set of covariates efficiently.
Proceedings Article

RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models

TL;DR: Pretrained LMs are found to degenerate into toxic text even from seemingly innocuous prompts; an empirical assessment of several controllable generation methods finds that, while data- or compute-intensive methods are more effective at steering away from toxicity than simpler solutions, no current method is failsafe against neural toxic degeneration.
References
Journal Article

Regression Shrinkage and Selection via the Lasso

TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute values of the coefficients being less than a constant, is proposed.
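For readers who want to try this, the penalized form of the lasso (equivalent to the constrained form described above) is available in scikit-learn; a minimal sketch on synthetic data (all data here is invented):

```python
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 20))
true_coef = np.zeros(20)
true_coef[:3] = [2.0, -1.5, 1.0]      # only 3 informative features
y = X @ true_coef + 0.1 * rng.normal(size=100)

# alpha scales the L1 penalty; larger values zero out more coefficients
model = Lasso(alpha=0.1).fit(X, y)
print(np.flatnonzero(model.coef_))    # indices of surviving features
```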
Journal Article

Latent Dirichlet allocation

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Proceedings Article

Latent Dirichlet Allocation

TL;DR: This paper proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models, including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model, also known as probabilistic latent semantic indexing (pLSI).
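As a usage illustration (a sketch, not the paper's code; it relies on scikit-learn's variational implementation and a made-up four-document corpus):

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation

docs = [
    "the cat sat on the mat",
    "dogs and cats are friendly pets",
    "stocks fell as the markets closed",
    "investors sold shares amid market fears",
]
counts = CountVectorizer().fit_transform(docs)
lda = LatentDirichletAllocation(n_components=2, random_state=0).fit(counts)
# components_ holds per-topic word weights; transform gives per-document
# topic proportions, which should separate pet talk from market talk.
print(lda.transform(counts).round(2))
```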
Journal Article

Sparse Bayesian learning and the relevance vector machine

TL;DR: It is demonstrated that by exploiting a probabilistic Bayesian learning framework, the 'relevance vector machine' (RVM) can derive accurate prediction models which typically utilise dramatically fewer basis functions than a comparable SVM while offering a number of additional advantages.
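scikit-learn does not include an RVM, but its ARDRegression implements the closely related automatic relevance determination prior, which likewise prunes irrelevant coefficients; a minimal sketch on synthetic data (invented for illustration):

```python
import numpy as np
from sklearn.linear_model import ARDRegression

rng = np.random.default_rng(1)
X = rng.normal(size=(80, 10))
w = np.zeros(10)
w[[0, 4]] = [1.0, -2.0]               # sparse ground truth
y = X @ w + 0.05 * rng.normal(size=80)

# Per-coefficient precision hyperpriors drive irrelevant weights
# toward zero, mirroring the RVM's pruning of basis functions.
ard = ARDRegression().fit(X, y)
print(np.round(ard.coef_, 2))
```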
Posted Content

Supervised Topic Models

TL;DR: This article proposes supervised latent Dirichlet allocation (sLDA), a statistical model of labeled documents that accommodates a variety of response types, and derives an approximate maximum-likelihood procedure for parameter estimation that relies on variational methods to handle intractable posterior expectations.