Spectral Redemption in Clustering Sparse Networks

Loading...
Thumbnail Image
Penn collection
Statistics Papers
Degree type
Discipline
Subject
Physics
Statistical, Nonlinear, and Soft Matter Physics
Statistics and Probability
Funder
Grant number
License
Copyright date
Distributor
Related resources
Author
Krzakala, Florent
Moore, Cristopher
Mossel, Elchanan
Neeman, Joe
Sly, Allan
Zdeborová, Lenka
Zhang, Pan
Contributor
Abstract

Spectral algorithms are classic approaches to clustering and community detection in networks. However, for sparse networks the standard versions of these algorithms are suboptimal, in some cases completely failing to detect communities even when other algorithms such as belief propagation can do so. Here, we present a class of spectral algorithms based on a nonbacktracking walk on the directed edges of the graph. The spectrum of this operator is much better-behaved than that of the adjacency matrix or other commonly used matrices, maintaining a strong separation between the bulk eigenvalues and the eigenvalues relevant to community structure even in the sparse case. We show that our algorithm is optimal for graphs generated by the stochastic block model, detecting communities all of the way down to the theoretical limit. We also show the spectrum of the nonbacktracking operator for some real-world networks, illustrating its advantages over traditional spectral clustering.

Advisor
Date Range for Data Collection (Start Date)
Date Range for Data Collection (End Date)
Digital Object Identifier
Series name and number
Publication date
2013-12-24
Journal title
PNAS
Volume number
Issue number
Publisher
Publisher DOI
Journal Issue
Comments
At the time of publication, author Elchanan Mossel was affiliated with University of California, Berkeley. Currently, he is a faculty member at the Statistics Department at the University of Pennsylvania.
Recommended citation
Collection