We gratefully acknowledge support from
the Simons Foundation
and Allianz der deutschen Wissenschaftsorganisationen, koordiniert durch TIB, MPG und HGF
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo Facebook logo LinkedIn logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Machine Learning

Title: Role of Normalization in Spectral Clustering for Stochastic Blockmodels

Abstract: Spectral Clustering clusters elements using the top few eigenvectors of their (possibly normalized) similarity matrix. The quality of Spectral Clustering is closely tied to the convergence properties of these principal eigenvectors. This rate of convergence has been shown to be identical for both the normalized and unnormalized variants ([17]). However normalization for Spectral Clustering is the common practice ([16], [19]). Indeed, our experiments also show that normalization improves prediction accuracy. In this paper, for the popular Stochastic Blockmodel, we theoretically show that under spectral embedding, normalization shrinks the variance of points in a class by a constant fraction. As a byproduct of our work, we also obtain sharp deviation bounds of empirical principal eigenvalues of graphs generated from a Stochastic Blockmodel.
Subjects: Machine Learning (stat.ML)
Cite as: arXiv:1310.1495 [stat.ML]
  (or arXiv:1310.1495v1 [stat.ML] for this version)

Submission history

From: Purnamrita Sarkar [view email]
[v1] Sat, 5 Oct 2013 17:07:28 GMT (163kb)