Community detection: effective evaluation on large social networks

  1. Pádraig Cunningham

Author Affiliations

  Clique Research Cluster, University College Dublin, 8 Belfield Office Park, Clonskeagh, Dublin 4, Ireland
  Corresponding author. Email: conradlee@gmail.com
  Edited by: Aaron Clauset

  • Received February 4, 2013.
  • Accepted August 6, 2013.

Abstract

While many recently proposed methods aim to detect network communities in large datasets, such as those generated by social media and telecommunications services, most evaluation (i.e. benchmarking) of this research is based on small, hand-curated datasets. We argue that these two types of networks differ so significantly that, by evaluating algorithms solely on the smaller networks, we know little about how well they perform on the larger datasets. Recent work addresses this problem by introducing social network datasets annotated with meta-data that is believed to approximately indicate a ‘ground truth’ set of network communities. While such efforts are a step in the right direction, we find this meta-data problematic for two reasons. First, in practice, the groups contained in such meta-data may only be a subset of a network's communities. Second, while it is often reasonable to assume that meta-data is related to network communities in some way, we must be cautious about assuming that these groups correspond closely to network communities. Here, we consider these difficulties and propose an evaluation scheme based on a classification task that is tailored to deal with them.

This Article

  Journal of Complex Networks 2 (1): 19-37. doi: 10.1093/comnet/cnt012
