scispace - formally typeset
Open AccessJournal ArticleDOI

Mixed Membership Stochastic Blockmodels

TLDR
In this article, the authors introduce a class of variance allocation models for pairwise measurements, called mixed membership stochastic blockmodels, which combine global parameters that instantiate dense patches of connectivity (blockmodel) with local parameters (mixed membership), and develop a general variational inference algorithm for fast approximate posterior inference.
Abstract
Consider data consisting of pairwise measurements, such as presence or absence of links between pairs of objects. These data arise, for instance, in the analysis of protein interactions and gene regulatory networks, collections of author-recipient email, and social networks. Analyzing pairwise measurements with probabilistic models requires special assumptions, since the usual independence or exchangeability assumptions no longer hold. Here we introduce a class of variance allocation models for pairwise measurements: mixed membership stochastic blockmodels. These models combine global parameters that instantiate dense patches of connectivity (blockmodel) with local parameters that instantiate node-specific variability in the connections (mixed membership). We develop a general variational inference algorithm for fast approximate posterior inference. We demonstrate the advantages of mixed membership stochastic blockmodels with applications to social networks and protein interaction networks.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Clusterwise p * models for social network analysis

TL;DR: Clusterwise p* models are developed to detect differentially functioning network models as a function of the subset of observations being considered, allowing for local estimation of network regions and avoiding some of the common degeneracy problems that are rampant in p* (e.g., exponential random graph) models.
Posted Content

Graphlet decomposition of a weighted network

TL;DR: The graphlet decomposition of a weighted network, which encodes a notion of social information based on social structure, is introduced and a scalable inference algorithm, which combines EM with Bron-Kerbosch in a novel fashion, is developed.
Journal ArticleDOI

Who delays climate action? Interest groups and coalitions in state legislative struggles in the United States

TL;DR: In this article, the authors systematically characterize the legislative interests of nine main coalitions engaged in Massachusetts energy politics and find that climate and clean energy advocates have few reliable allies and face four coalitions of opponents from the utilities, real estate, power generators, and fossil fuel and chemical industries.
Proceedings Article

Metadata Dependent Mondrian Processes

TL;DR: This paper proposes a metadata dependent Mondrian process (MDMP) to incorporate meta information into the stochastic partition process in the product space and the entity allocation process on the resulting block structure and demonstrates that MDMP is more effective than the baseline models in prediction accuracy and easier to converge in posterior inference.
Journal ArticleDOI

Heterogeneous Networks and Their Applications: Scientometrics, Name Disambiguation, and Topic Modeling

TL;DR: A unified ACL Anthology network is built, tying together the citation, author collaboration, and term-cooccurence networks with affiliation and venue relations, and proves to be convenient and allows problems such as name disambiguation, topic modeling, and the measurement of scientific impact to be easily solved.
References
More filters
Journal ArticleDOI

Gene Ontology: tool for the unification of biology

TL;DR: The goal of the Gene Ontology Consortium is to produce a dynamic, controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing.
Journal ArticleDOI

Latent dirichlet allocation

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Journal ArticleDOI

Finding scientific topics

TL;DR: A generative model for documents is described, introduced by Blei, Ng, and Jordan, and a Markov chain Monte Carlo algorithm is presented for inference in this model, which is used to analyze abstracts from PNAS by using Bayesian model selection to establish the number of topics.
Related Papers (5)