corner
corner
Access provided through the subscription of Staats- und Universitaetsbibliothek Bremen Go Mobile!

Phys. Rev. E 80, 016109 (2009) [18 pages]

Multiresolution community detection for megascale networks by information-based replica correlations

Download: PDF (1,425 kB) Export: BibTeX or EndNote (RIS)

Peter Ronhovde and Zohar Nussinov
Department of Physics, Washington University in St. Louis, Campus Box 1105, 1 Brookings Drive, St. Louis, Missouri 63130, USA

Received 22 December 2008; revised 28 April 2009; published 14 July 2009

We use a Potts model community detection algorithm to accurately and quantitatively evaluate the hierarchical or multiresolution structure of a graph. Our multiresolution algorithm calculates correlations among multiple copies (“replicas”) of the same graph over a range of resolutions. Significant multiresolution structures are identified by strongly correlated replicas. The average normalized mutual information, the variation in information, and other measures, in principle, give a quantitative estimate of the “best” resolutions and indicate the relative strength of the structures in the graph. Because the method is based on information comparisons, it can, in principle, be used with any community detection model that can examine multiple resolutions. Our approach may be extended to other optimization problems. As a local measure, our Potts model avoids the “resolution limit” that affects other popular models. With this model, our community detection algorithm has an accuracy that ranks among the best of currently available methods. Using it, we can examine graphs over 40×106 nodes and more than 1×109 edges. We further report that the multiresolution variant of our algorithm can solve systems of at least 200 000 nodes and 10×106 edges on a single processor with exceptionally high accuracy. For typical cases, we find a superlinear scaling O(L1.3) for community detection and O(L1.3 log N) for the multiresolution algorithm, where L is the number of edges and N is the number of nodes in the system.

© 2009 The American Physical Society

URL:
http://link.aps.org/doi/10.1103/PhysRevE.80.016109
DOI:
10.1103/PhysRevE.80.016109
PACS:
89.75.Fb, 64.60.Cn, 89.65.−s