We gratefully acknowledge support from
the Simons Foundation
and Allianz der deutschen Wissenschaftsorganisationen, koordiniert durch TIB, MPG und HGF
Full-text links:

Download:

Current browse context:

cs.SI

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo Facebook logo LinkedIn logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Social and Information Networks

Title: Pseudo-likelihood methods for community detection in large sparse networks

Abstract: Many algorithms have been proposed for fitting network models with communities, but most of them do not scale well to large networks, and often fail on sparse networks. Here we propose a new fast pseudo-likelihood method for fitting the stochastic block model for networks, as well as a variant that allows for an arbitrary degree distribution by conditioning on degrees. We show that the algorithms perform well under a range of settings, including on very sparse networks, and illustrate on the example of a network of political blogs. We also propose spectral clustering with perturbations, a method of independent interest, which works well on sparse networks where regular spectral clustering fails, and use it to provide an initial value for pseudo-likelihood. We prove that pseudo-likelihood provides consistent estimates of the communities under a mild condition on the starting value, for the case of a block model with two communities.
Comments: Published in at this http URL the Annals of Statistics (this http URL) by the Institute of Mathematical Statistics (this http URL)
Subjects: Social and Information Networks (cs.SI); Learning (cs.LG); Statistics Theory (math.ST); Physics and Society (physics.soc-ph); Machine Learning (stat.ML)
Journal reference: Annals of Statistics 2013, Vol. 41, No. 4, 2097-2122
DOI: 10.1214/13-AOS1138
Report number: IMS-AOS-AOS1138
Cite as: arXiv:1207.2340 [cs.SI]
  (or arXiv:1207.2340v3 [cs.SI] for this version)

Submission history

From: Arash A. Amini [view email]
[v1] Tue, 10 Jul 2012 13:28:32 GMT (822kb)
[v2] Thu, 21 Feb 2013 18:52:23 GMT (655kb)
[v3] Tue, 5 Nov 2013 15:49:54 GMT (769kb)