Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs > arXiv:2302.11640

Help | Advanced Search

Computer Science > Machine Learning

(cs)
[Submitted on 22 Feb 2023 (v1), last revised 2 Mar 2024 (this version, v2)]

Title:A critical look at the evaluation of GNNs under heterophily: Are we really making progress?

Authors:Oleg Platonov, Denis Kuznedelev, Michael Diskin, Artem Babenko, Liudmila Prokhorenkova
Download a PDF of the paper titled A critical look at the evaluation of GNNs under heterophily: Are we really making progress?, by Oleg Platonov and 4 other authors
Download PDF HTML (experimental)
Abstract:Node classification is a classical graph machine learning task on which Graph Neural Networks (GNNs) have recently achieved strong results. However, it is often believed that standard GNNs only work well for homophilous graphs, i.e., graphs where edges tend to connect nodes of the same class. Graphs without this property are called heterophilous, and it is typically assumed that specialized methods are required to achieve strong performance on such graphs. In this work, we challenge this assumption. First, we show that the standard datasets used for evaluating heterophily-specific models have serious drawbacks, making results obtained by using them unreliable. The most significant of these drawbacks is the presence of a large number of duplicate nodes in the datasets Squirrel and Chameleon, which leads to train-test data leakage. We show that removing duplicate nodes strongly affects GNN performance on these datasets. Then, we propose a set of heterophilous graphs of varying properties that we believe can serve as a better benchmark for evaluating the performance of GNNs under heterophily. We show that standard GNNs achieve strong results on these heterophilous graphs, almost always outperforming specialized models. Our datasets and the code for reproducing our experiments are available at this https URL
Subjects: Machine Learning (cs.LG)
Cite as: arXiv:2302.11640 [cs.LG]
  (or arXiv:2302.11640v2 [cs.LG] for this version)
  https://doi.org/10.48550/arXiv.2302.11640
arXiv-issued DOI via DataCite

Submission history

From: Liudmila Ostroumova Prokhorenkova [view email]
[v1] Wed, 22 Feb 2023 20:32:59 UTC (51 KB)
[v2] Sat, 2 Mar 2024 21:17:13 UTC (51 KB)
Full-text links:

Access Paper:

    Download a PDF of the paper titled A critical look at the evaluation of GNNs under heterophily: Are we really making progress?, by Oleg Platonov and 4 other authors
  • Download PDF
  • HTML (experimental)
  • TeX Source
  • Other Formats
view license
Current browse context:
cs.LG
< prev   |   next >
new | recent | 2302
Change to browse by:
cs

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar
a export BibTeX citation Loading...

Bookmark

BibSonomy logo Reddit logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack