scispace - formally typeset
Open AccessJournal ArticleDOI

Spectral redemption in clustering sparse networks

Reads0
Chats0
TLDR
A way of encoding sparse data using a “nonbacktracking” matrix, and it is shown that the corresponding spectral algorithm performs optimally for some popular generative models, including the stochastic block model.
Abstract
Spectral algorithms are classic approaches to clustering and community detection in networks. However, for sparse networks the standard versions of these algorithms are suboptimal, in some cases completely failing to detect communities even when other algorithms such as belief propagation can do so. Here, we present a class of spectral algorithms based on a nonbacktracking walk on the directed edges of the graph. The spectrum of this operator is much better-behaved than that of the adjacency matrix or other commonly used matrices, maintaining a strong separation between the bulk eigenvalues and the eigenvalues relevant to community structure even in the sparse case. We show that our algorithm is optimal for graphs generated by the stochastic block model, detecting communities all of the way down to the theoretical limit. We also show the spectrum of the nonbacktracking operator for some real-world networks, illustrating its advantages over traditional spectral clustering.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Machine learning and the physical sciences

TL;DR: This article reviews in a selective way the recent research on the interface between machine learning and the physical sciences, including conceptual developments in ML motivated by physical insights, applications of machine learning techniques to several domains in physics, and cross fertilization between the two fields.
Journal ArticleDOI

Community detection in networks: A user guide

TL;DR: In this paper, the authors present a guided tour of the main aspects of community detection in networks and point out strengths and weaknesses of popular methods, and give directions to their use.
Journal ArticleDOI

Higher-order organization of complex networks

TL;DR: A generalized framework for clustering networks on the basis of higher-order connectivity patterns provides mathematical guarantees on the optimality of obtained clusters and scales to networks with billions of edges.
Journal ArticleDOI

Influence maximization in complex networks through optimal percolation

TL;DR: This work maps the problem onto optimal percolation in random networks to identify the minimal set of influencers, which arises by minimizing the energy of a many-body system, where the form of the interactions is fixed by the non-backtracking matrix of the network.
Journal ArticleDOI

Vital nodes identification in complex networks

TL;DR: In this paper, the state-of-the-art algorithms for vital node identification in real networks are reviewed and compared, and extensive empirical analyses are provided to compare well-known methods on disparate real networks.
References
More filters
Journal ArticleDOI

Community structure in social and biological networks

TL;DR: This article proposes a method for detecting communities, built around the idea of using centrality indices to find community boundaries, and tests it on computer-generated and real-world graphs whose community structure is already known and finds that the method detects this known structure with high sensitivity and reliability.
Journal ArticleDOI

A tutorial on spectral clustering

TL;DR: In this article, the authors present the most common spectral clustering algorithms, and derive those algorithms from scratch by several different approaches, and discuss the advantages and disadvantages of these algorithms.
Journal ArticleDOI

Finding community structure in networks using the eigenvectors of matrices

TL;DR: A modularity matrix plays a role in community detection similar to that played by the graph Laplacian in graph partitioning calculations, and a spectral measure of bipartite structure in networks and a centrality measure that identifies vertices that occupy central positions within the communities to which they belong are proposed.
Journal ArticleDOI

An Information Flow Model for Conflict and Fission in Small Groups

TL;DR: In this paper, the authors used data from a voluntary association to construct a new formal model for a traditional anthropological problem, fission in small groups, where the process leading to fission is viewed as an unequal flow of sentiments and information across the ties in a social network.
Proceedings ArticleDOI

The political blogosphere and the 2004 U.S. election: divided they blog

TL;DR: Differences in the behavior of liberal and conservative blogs are found, with conservative blogs linking to each other more frequently and in a denser pattern.