scispace - formally typeset
Open AccessJournal ArticleDOI

Mixed Membership Stochastic Blockmodels

TLDR
In this article, the authors introduce a class of variance allocation models for pairwise measurements, called mixed membership stochastic blockmodels, which combine global parameters that instantiate dense patches of connectivity (blockmodel) with local parameters (mixed membership), and develop a general variational inference algorithm for fast approximate posterior inference.
Abstract
Consider data consisting of pairwise measurements, such as presence or absence of links between pairs of objects. These data arise, for instance, in the analysis of protein interactions and gene regulatory networks, collections of author-recipient email, and social networks. Analyzing pairwise measurements with probabilistic models requires special assumptions, since the usual independence or exchangeability assumptions no longer hold. Here we introduce a class of variance allocation models for pairwise measurements: mixed membership stochastic blockmodels. These models combine global parameters that instantiate dense patches of connectivity (blockmodel) with local parameters that instantiate node-specific variability in the connections (mixed membership). We develop a general variational inference algorithm for fast approximate posterior inference. We demonstrate the advantages of mixed membership stochastic blockmodels with applications to social networks and protein interaction networks.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Document hierarchies from text and links

TL;DR: The underlying mechanism is a Bayesian generative model in which a latent hierarchical structure explains the observed data, finding hierarchical groups of documents with similar word distributions and dense network connections, enabling application to networks with more than ten thousand nodes.
Proceedings ArticleDOI

Asymptotic mutual information for the binary stochastic block model

TL;DR: An information-theoretic view of the stochastic block model, a popular statistical model for the large-scale structure of complex networks, is developed, which establishes an explicit `single-letter' characterization of the per-vertex mutual information between the vertex labels and the graph, when the graph average degree diverges.
Journal ArticleDOI

Overlapping stochastic block models with application to the French political blogosphere

TL;DR: The Overlapping Stochastic Block Model (OSMBM) as mentioned in this paper allows the vertices to belong to multiple clusters, and, to some extent, generalizes the well-known stochastic block model [Nowicki and Snijders (2001].
Journal ArticleDOI

Community detection in multi-relational data with restricted multi-layer stochastic blockmodel

TL;DR: In this paper, the authors consider two random graph models, the multi-layer stochastic block model (MLSBM) and a model with a restricted parameter space, and derive consistency results for community assignments of the maximum likelihood estimators (MLEs) in both models.
Journal ArticleDOI

Detecting overlapping protein complexes based on a generative model with functional and topological properties

TL;DR: A Generative Model with Functional and Topological Properties (GMFTP) is developed to describe the generative processes of the PPI network and the functional profile and shows that GMFTP has a competitive performance over the state-of-the-art approaches.
References
More filters
Journal ArticleDOI

Gene Ontology: tool for the unification of biology

TL;DR: The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing.
Journal ArticleDOI

Latent dirichlet allocation

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Journal ArticleDOI

Finding scientific topics

TL;DR: A generative model for documents is described, introduced by Blei, Ng, and Jordan, and a Markov chain Monte Carlo algorithm is presented for inference in this model, which is used to analyze abstracts from PNAS by using Bayesian model selection to establish the number of topics.
Related Papers (5)