scispace - formally typeset
Open AccessJournal ArticleDOI

Mixed Membership Stochastic Blockmodels

TLDR
In this article, the authors introduce a class of variance allocation models for pairwise measurements, called mixed membership stochastic blockmodels, which combine global parameters that instantiate dense patches of connectivity (blockmodel) with local parameters (mixed membership), and develop a general variational inference algorithm for fast approximate posterior inference.
Abstract
Consider data consisting of pairwise measurements, such as presence or absence of links between pairs of objects. These data arise, for instance, in the analysis of protein interactions and gene regulatory networks, collections of author-recipient email, and social networks. Analyzing pairwise measurements with probabilistic models requires special assumptions, since the usual independence or exchangeability assumptions no longer hold. Here we introduce a class of variance allocation models for pairwise measurements: mixed membership stochastic blockmodels. These models combine global parameters that instantiate dense patches of connectivity (blockmodel) with local parameters that instantiate node-specific variability in the connections (mixed membership). We develop a general variational inference algorithm for fast approximate posterior inference. We demonstrate the advantages of mixed membership stochastic blockmodels with applications to social networks and protein interaction networks.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Dynamic Community Detection with Temporal Dirichlet Process

TL;DR: The Dynamic Stochastic Block model with Temporal Dirichlet Process is proposed, which is able to detect communities and track their evolution simultaneously from a network stream and exhibits a rich-gets-richer effect and other appealing properties.
Proceedings ArticleDOI

Random Spatial Network Models for Core-Periphery Structure

TL;DR: In this paper, a generative, random network model with core-periphery structure is proposed, which jointly accounts for topological and spatial information by "core scores" of vertices.
Journal ArticleDOI

The dynamic stochastic topic block model for dynamic networks with textual edges

TL;DR: A probabilistic model to cluster the nodes of a dynamic graph, accounting for the content of textual edges as well as their frequency is developed, and an application to the Enron dataset is illustrated.
Journal ArticleDOI

Confidence sets for network structure

TL;DR: This article proposes conservative confidence sets that hold with respect to these underlying Bernoulli parameters as a function of any given partition of network nodes, enabling us to assess estimates of residual network structure, that is, structure that cannot be explained by known covariates and thus cannot be easily verified by manual inspection.
Proceedings ArticleDOI

Hierarchical Community-Level Information Diffusion Modeling in Social Networks

TL;DR: The notion of users' topic popularity is introduced as to enable theHCID model to depict the information diffusion process which is both topic-aware and source-aware, and uni-directional from the higher levels to the lower ones.
References
More filters
Journal ArticleDOI

Gene Ontology: tool for the unification of biology

TL;DR: The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing.
Journal ArticleDOI

Latent dirichlet allocation

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Journal ArticleDOI

Finding scientific topics

TL;DR: A generative model for documents is described, introduced by Blei, Ng, and Jordan, and a Markov chain Monte Carlo algorithm is presented for inference in this model, which is used to analyze abstracts from PNAS by using Bayesian model selection to establish the number of topics.
Related Papers (5)