scispace - formally typeset
D

David M. Blei

Researcher at Columbia University

Publications -  399
Citations -  122384

David M. Blei is an academic researcher from Columbia University. The author has contributed to research in topics: Inference & Topic model. The author has an hindex of 98, co-authored 378 publications receiving 111547 citations. Previous affiliations of David M. Blei include Columbia University Medical Center & Hewlett-Packard.

Papers
More filters
Proceedings Article

Nonparametric Density Estimation for Stochastic Optimization with an Observable State Variable

TL;DR: This paper uses nonparametric density estimation to take observations from the joint state-outcome distribution and use them to infer the optimal decision for a given query state s, and proposes two solution methods that depend on the problem characteristics: function-based and gradient-based optimization.
Posted Content

Dynamic Bernoulli Embeddings for Language Evolution

TL;DR: This work develops dynamic embeddings, building on exponential family embedDings to capture how the meanings of words change over time, and uses them to analyze three large collections of historical texts.
Proceedings Article

Learning with scope, with application to information extraction and classification

TL;DR: The authors proposed a hierarchical probabilistic model that uses both local/scope-limited features, such as word formatting, and global features, like word content, to capture and exploit the new regularities encountered in previously unseen data.
Proceedings Article

Efficient Online Inference for Bayesian Nonparametric Relational Models

TL;DR: A new model for large social networks, the hierarchical Dirichlet process relational model, which allows nodes to have mixed membership in an unbounded set of communities is introduced, and an online stochastic variational inference algorithm is derived to allow scalable learning.
Proceedings ArticleDOI

Exponential family embeddings

TL;DR: The authors proposed exponential family embeddings, a class of methods that extends the idea of word embedding to other types of high-dimensional data, such as neural activity of zebrafish, users' shopping behavior, and movie ratings.