Mixed Membership Stochastic Blockmodels

doi:10.5555/1390681.1442798

Open AccessJournal ArticleDOI

Mixed Membership Stochastic Blockmodels

Edoardo M. Airoldi, +3 more

- 01 Jun 2008 -

Journal of Machine Learning Research

- Vol. 9, Iss: 65, pp 1981-2014

TLDR

In this article, the authors introduce a class of variance allocation models for pairwise measurements, called mixed membership stochastic blockmodels, which combine global parameters that instantiate dense patches of connectivity (blockmodel) with local parameters (mixed membership), and develop a general variational inference algorithm for fast approximate posterior inference.

Abstract:

Consider data consisting of pairwise measurements, such as presence or absence of links between pairs of objects. These data arise, for instance, in the analysis of protein interactions and gene regulatory networks, collections of author-recipient email, and social networks. Analyzing pairwise measurements with probabilistic models requires special assumptions, since the usual independence or exchangeability assumptions no longer hold. Here we introduce a class of variance allocation models for pairwise measurements: mixed membership stochastic blockmodels. These models combine global parameters that instantiate dense patches of connectivity (blockmodel) with local parameters that instantiate node-specific variability in the connections (mixed membership). We develop a general variational inference algorithm for fast approximate posterior inference. We demonstrate the advantages of mixed membership stochastic blockmodels with applications to social networks and protein interaction networks.

Citations

PDF

Open Access

More filters

Latent Dirichlet Allocation in R

Martin Ponweiser

TL;DR: This thesis proves the suitability of the R environment for text mining with LDA, and replication of the data analyses from the 2004 LDA paper ``Finding scientific topics'' by Thomas Griffiths and Mark Steyvers within the framework ofThe R statistical programming language and the R~package topicmodels.

...read moreread less

Proceedings Article

Nonparametric estimation and testing of exchangeable graph models

Justin Yang, +2 more

TL;DR: A specific estimator is built using the proposed 3-step procedure, which combines probability matrix estimation by Universal Singular Value Thresholding (USVT) and empirical degree sorting of the observed adjacency matrix, and it is proved that this estimation is consistent.

...read moreread less

Proceedings ArticleDOI

CoBaFi: collaborative bayesian filtering

Alex Beutel, +3 more

TL;DR: A unified Bayesian approach to Collaborative Filtering that models the discrete structure of ratings and is flexible to the often non-Gaussian shape of the distribution, and finds a co-clustering of users and items, which improves the model's accuracy and makes the model robust to fraud.

...read moreread less

Proceedings ArticleDOI

Community Level Diffusion Extraction

Zhiting Hu, +3 more

TL;DR: A new approach, i.e., COmmunity Level Diffusion (COLD), to uncover and explore temporal diffusion, model topics and communities in a unified latent framework, and extract inter-community influence dynamics.

...read moreread less

Posted Content

Network Data

Bryan S. Graham

- 13 Dec 2019 -

arXiv: Econometrics

TL;DR: This chapter describes econometric methods for analyzing networks, emphasizing dyadic regression analysis incorporating unobserved agent-specific heterogeneity and supporting causal inference, and empirical models of strategic network formation admitting interdependencies in preferences.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Maximum likelihood from incomplete data via the EM algorithm

Arthur P. Dempster, +2 more

- 01 Sep 1977 -

Journal of the royal statistical society...

Journal ArticleDOI

Latent dirichlet allocation

David M. Blei, +2 more

- 01 Mar 2003 -

Journal of Machine Learning Research

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.

...read moreread less

Journal ArticleDOI

Finding scientific topics

Thomas L. Griffiths, +1 more

- 06 Apr 2004 -

Proceedings of the National Academy of S...

TL;DR: A generative model for documents is described, introduced by Blei, Ng, and Jordan, and a Markov chain Monte Carlo algorithm is presented for inference in this model, which is used to analyze abstracts from PNAS by using Bayesian model selection to establish the number of topics.

...read moreread less

Journal ArticleDOI

Functional organization of the yeast proteome by systematic analysis of protein complexes

Anne-Claude Gavin, +37 more

- 10 Jan 2002 -

Nature

TL;DR: The analysis provides an outline of the eukaryotic proteome as a network of protein complexes at a level of organization beyond binary interactions, which contains fundamental biological information and offers the context for a more reasoned and informed approach to drug discovery.

...read moreread less