Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs > arXiv:2310.04987

Help | Advanced Search

Computer Science > Machine Learning

(cs)
[Submitted on 8 Oct 2023 (v1), last revised 30 Jan 2024 (this version, v2)]

Title:Data-centric Graph Learning: A Survey

Authors:Yuxin Guo, Deyu Bo, Cheng Yang, Zhiyuan Lu, Zhongjian Zhang, Jixi Liu, Yufei Peng, Chuan Shi
Download a PDF of the paper titled Data-centric Graph Learning: A Survey, by Yuxin Guo and 7 other authors
Download PDF
Abstract:The history of artificial intelligence (AI) has witnessed the significant impact of high-quality data on various deep learning models, such as ImageNet for AlexNet and ResNet. Recently, instead of designing more complex neural architectures as model-centric approaches, the attention of AI community has shifted to data-centric ones, which focuses on better processing data to strengthen the ability of neural models. Graph learning, which operates on ubiquitous topological data, also plays an important role in the era of deep learning. In this survey, we comprehensively review graph learning approaches from the data-centric perspective, and aim to answer three crucial questions: (1) when to modify graph data, (2) what part of the graph data needs modification to unlock the potential of various graph models, and (3) how to safeguard graph models from problematic data influence. Accordingly, we propose a novel taxonomy based on the stages in the graph learning pipeline, and highlight the processing methods for different data structures in the graph data, i.e., topology, feature and label. Furthermore, we analyze some potential problems embedded in graph data and discuss how to solve them in a data-centric manner. Finally, we provide some promising future directions for data-centric graph learning.
Comments: 20 pages
Subjects: Machine Learning (cs.LG); Social and Information Networks (cs.SI)
Cite as: arXiv:2310.04987 [cs.LG]
  (or arXiv:2310.04987v2 [cs.LG] for this version)
  https://doi.org/10.48550/arXiv.2310.04987
arXiv-issued DOI via DataCite

Submission history

From: Cheng Yang [view email]
[v1] Sun, 8 Oct 2023 03:17:22 UTC (259 KB)
[v2] Tue, 30 Jan 2024 10:05:11 UTC (152 KB)
Full-text links:

Access Paper:

    Download a PDF of the paper titled Data-centric Graph Learning: A Survey, by Yuxin Guo and 7 other authors
  • Download PDF
  • TeX Source
  • Other Formats
view license
Current browse context:
cs.LG
< prev   |   next >
new | recent | 2310
Change to browse by:
cs
cs.SI

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar
a export BibTeX citation Loading...

Bookmark

BibSonomy logo Reddit logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack