scispace - formally typeset
Open AccessJournal ArticleDOI

Mixed Membership Stochastic Blockmodels

TLDR
In this article, the authors introduce a class of variance allocation models for pairwise measurements, called mixed membership stochastic blockmodels, which combine global parameters that instantiate dense patches of connectivity (blockmodel) with local parameters (mixed membership), and develop a general variational inference algorithm for fast approximate posterior inference.
Abstract
Consider data consisting of pairwise measurements, such as presence or absence of links between pairs of objects. These data arise, for instance, in the analysis of protein interactions and gene regulatory networks, collections of author-recipient email, and social networks. Analyzing pairwise measurements with probabilistic models requires special assumptions, since the usual independence or exchangeability assumptions no longer hold. Here we introduce a class of variance allocation models for pairwise measurements: mixed membership stochastic blockmodels. These models combine global parameters that instantiate dense patches of connectivity (blockmodel) with local parameters that instantiate node-specific variability in the connections (mixed membership). We develop a general variational inference algorithm for fast approximate posterior inference. We demonstrate the advantages of mixed membership stochastic blockmodels with applications to social networks and protein interaction networks.

read more

Content maybe subject to copyright    Report

Citations
More filters
Posted Content

Community Detection for Hypergraph Networks via Regularized Tensor Power Iteration

TL;DR: This work proposes a new method for community detection that operates directly on the hypergraph, and introduces a degree-corrected block model for hypergraphs (hDCBM), and shows that Tensor-SCORE yields consistent community detection for a wide range of network sparsity and degree heterogeneity.
Journal ArticleDOI

Online tensor methods for learning latent variable models

TL;DR: An online tensor decomposition based approach for two latent variable modeling problems namely, community detection and topic modeling, in which the latent communities that the social actors in social networks belong to are learned.
Posted Content

Network cross-validation by edge sampling

TL;DR: This paper proposes a new network resampling strategy, based on splitting node pairs rather than nodes, that is applicable to cross-validation for a wide range of network model selection tasks.
Journal ArticleDOI

The analysis of social network data: an exciting frontier for statisticians

TL;DR: Some of the key statistical methods used in social network analysis are introduced and where those used by Christakis and Fowler (CF) fit into the general framework are indicated to help understand the challenges of research involving social networks.
Proceedings Article

Latent Multi-group Membership Graph Model

TL;DR: The Latent Multi-group Membership Graph model is developed, a model of networks with rich node feature structure that can be used to summarize the network structure, to predict links between the nodes, and to predict missing features of a node.
References
More filters
Journal ArticleDOI

Gene Ontology: tool for the unification of biology

TL;DR: The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing.
Journal ArticleDOI

Latent dirichlet allocation

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Journal ArticleDOI

Finding scientific topics

TL;DR: A generative model for documents is described, introduced by Blei, Ng, and Jordan, and a Markov chain Monte Carlo algorithm is presented for inference in this model, which is used to analyze abstracts from PNAS by using Bayesian model selection to establish the number of topics.
Related Papers (5)