scispace - formally typeset
Open AccessJournal ArticleDOI

Likelihood-based model selection for stochastic block models

Y. X. Rachel Wang, +1 more
- 01 Apr 2017 - 
- Vol. 45, Iss: 2, pp 500-528
Reads0
Chats0
TLDR
In this paper, an approach based on the log likelihood ratio statistic and analysis of its asymptotic properties under model misspecification is presented. But the authors focus on estimating the latent node labels and the model parameters rather than the issue of choosing the number of blocks.
Abstract
The stochastic block model (SBM) provides a popular framework for modeling community structures in networks. However, more attention has been devoted to problems concerning estimating the latent node labels and the model parameters than the issue of choosing the number of blocks. We consider an approach based on the log likelihood ratio statistic and analyze its asymptotic properties under model misspecification. We show the limiting distribution of the statistic in the case of underfitting is normal and obtain its convergence rate in the case of overfitting. These conclusions remain valid when the average degree grows at a polylog rate. The results enable us to derive the correct order of the penalty term for model complexity and arrive at a likelihood-based model selection criterion that is asymptotically consistent. Our analysis can also be extended to a degree-corrected block model (DCSBM). In practice, the likelihood function can be estimated using more computationally efficient variational methods or consistent label estimation algorithms, allowing the criterion to be applied to large networks.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Network Cross-Validation for Determining the Number of Communities in Network Data

TL;DR: In this paper, the authors developed an efficient network cross-validation (NCV) approach to determine the number of communities, as well as to choose between the regular stochastic block model and the degree corrected block model (DCBM).
Journal ArticleDOI

Achieving Optimal Misclassification Proportion in Stochastic Block Models

TL;DR: A computationally feasible two-stage method that achieves optimal statistical performance in misclassification proportion for stochastic block model under weak regularity conditions and is demonstrated by competitive numerical results.
Journal ArticleDOI

Global spectral clustering in dynamic networks.

TL;DR: Applying PisCES to medial prefrontal cortex in monkey rhesus brains from near conception to adulthood reveals dense communities that persist, merge, and diverge over time and others that are loosely organized and short lived, illustrating how dynamic community detection can yield interesting insights into processes such as brain development.
Journal ArticleDOI

Network cross-validation by edge sampling

TL;DR: In this article, the authors propose a new network resampling strategy based on splitting node pairs rather than nodes, which is applicable to cross-validation for a wide range of network model selection tasks.
Journal ArticleDOI

A review of stochastic block models and extensions for graph clustering

TL;DR: Different approaches and extensions proposed for different aspects in model-based clustering of graphs, such as the type of the graph, the clustering approach, the inference approach, and whether the number of groups is selected or estimated are reviewed.
References
More filters
Journal ArticleDOI

Modularity and community structure in networks

TL;DR: In this article, the modularity of a network is expressed in terms of the eigenvectors of a characteristic matrix for the network, which is then used for community detection.
Journal ArticleDOI

Finding community structure in networks using the eigenvectors of matrices

TL;DR: A modularity matrix plays a role in community detection similar to that played by the graph Laplacian in graph partitioning calculations, and a spectral measure of bipartite structure in networks and a centrality measure that identifies vertices that occupy central positions within the communities to which they belong are proposed.
Journal ArticleDOI

Stochastic blockmodels and community structure in networks

TL;DR: This work demonstrates how the generalization of blockmodels to incorporate this missing element leads to an improved objective function for community detection in complex networks and proposes a heuristic algorithm forcommunity detection using this objective function or its non-degree-corrected counterpart.
Journal ArticleDOI

Mixed Membership Stochastic Blockmodels

TL;DR: In this article, the authors introduce a class of variance allocation models for pairwise measurements, called mixed membership stochastic blockmodels, which combine global parameters that instantiate dense patches of connectivity (blockmodel) with local parameters (mixed membership), and develop a general variational inference algorithm for fast approximate posterior inference.
Proceedings Article

Learning to Discover Social Circles in Ego Networks

TL;DR: A novel machine learning task of identifying users' social circles is defined as a node clustering problem on a user's ego-network, a network of connections between her friends, and a model for detecting circles is developed that combines network structure as well as user profile information.
Related Papers (5)