scispace - formally typeset
Open AccessJournal ArticleDOI

Mixed Membership Stochastic Blockmodels

TLDR
In this article, the authors introduce a class of variance allocation models for pairwise measurements, called mixed membership stochastic blockmodels, which combine global parameters that instantiate dense patches of connectivity (blockmodel) with local parameters (mixed membership), and develop a general variational inference algorithm for fast approximate posterior inference.
Abstract
Consider data consisting of pairwise measurements, such as presence or absence of links between pairs of objects. These data arise, for instance, in the analysis of protein interactions and gene regulatory networks, collections of author-recipient email, and social networks. Analyzing pairwise measurements with probabilistic models requires special assumptions, since the usual independence or exchangeability assumptions no longer hold. Here we introduce a class of variance allocation models for pairwise measurements: mixed membership stochastic blockmodels. These models combine global parameters that instantiate dense patches of connectivity (blockmodel) with local parameters that instantiate node-specific variability in the connections (mixed membership). We develop a general variational inference algorithm for fast approximate posterior inference. We demonstrate the advantages of mixed membership stochastic blockmodels with applications to social networks and protein interaction networks.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings Article

Topological feature based classification

TL;DR: This work proposes a method, based on blockmodelling, for leveraging communities and other topological features for use in a predictive classification task and is shown to outperform graph-based semi-supervised methods on directed and approximately bipartite networks.

Learning with large-scale social media networks

TL;DR: This research provides novel concepts and efficient algorithms to harness the power of social media networks, enables the integration of data in heterogeneous format and information from networks of multiple modes or dimensions, and offers a learning-based solution to social computing.
Journal ArticleDOI

Overlapping community detection based on conductance optimization in large-scale networks

TL;DR: An overlapping community detection algorithm for large-scale networks based on local expansion is proposed, and a novel seeding method is presented, in which the approach is superior to the others in the state of the art.
Journal ArticleDOI

Detecting local network motifs

TL;DR: In this article, the authors propose to define motifs through a local over-representation in the network and develop a statistic to detect them without relying on simulations, and illustrate the performance of their procedure on simulated and real data, recovering already known biologically relevant motifs.
Journal ArticleDOI

Multi-way blockmodels for analyzing coordinated high-dimensional responses.

TL;DR: A family of multi-way stochastic blockmodels suited for temporal coordination between multiple high-dimensional responses is introduced, which avoids pre-processing steps such as binning and thresholding commonly adopted for this type of problems, in biology.
References
More filters
Journal ArticleDOI

Gene Ontology: tool for the unification of biology

TL;DR: The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing.
Journal ArticleDOI

Latent dirichlet allocation

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Journal ArticleDOI

Finding scientific topics

TL;DR: A generative model for documents is described, introduced by Blei, Ng, and Jordan, and a Markov chain Monte Carlo algorithm is presented for inference in this model, which is used to analyze abstracts from PNAS by using Bayesian model selection to establish the number of topics.
Related Papers (5)