Computer Science > Social and Information Networks
[Submitted on 6 Jun 2023 (v1), last revised 5 Mar 2024 (this version, v2)]
Title: Optimal Inference in Contextual Stochastic Block Models
Abstract: The contextual stochastic block model (cSBM) was proposed for unsupervised community detection on attributed graphs where both the graph and the high-dimensional node information correlate with node labels. In the context of machine learning on graphs, the cSBM has been widely used as a synthetic dataset for evaluating the performance of graph neural networks (GNNs) for semi-supervised node classification. We consider a probabilistic Bayes-optimal formulation of the inference problem and we derive a belief-propagation-based algorithm for the semi-supervised cSBM; we conjecture it is optimal in the considered setting and we provide its implementation. We show that there can be a considerable gap between the accuracy reached by this algorithm and the performance of the GNN architectures proposed in the literature. This suggests that the cSBM, along with the comparison to the performance of the optimal algorithm, readily accessible via our implementation, can be instrumental in the development of more performant GNN architectures.
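For readers unfamiliar with the model, the following is a minimal sketch of how a two-community cSBM instance might be sampled. It is not the authors' implementation, and the parameterization is an assumption not stated in the abstract: binary labels u_i in {-1, +1}, edge probability c_in/n within a community and c_out/n across communities, and node features B = sqrt(mu/n) u v^T + Z with Gaussian v and Z. The function name `sample_csbm` and all parameter names are hypothetical.

```python
import numpy as np

def sample_csbm(n, p, c_in, c_out, mu, seed=0):
    """Sample a two-community cSBM (assumed parameterization, see lead-in).

    Returns the adjacency matrix A (n x n), the features B (n x p),
    and the hidden labels u (length n, entries in {-1, +1}).
    """
    rng = np.random.default_rng(seed)

    # Hidden community labels, drawn uniformly from {-1, +1}.
    u = rng.choice([-1, 1], size=n)

    # Edge probability depends only on whether two nodes share a label.
    same = u[:, None] == u[None, :]
    probs = np.where(same, c_in / n, c_out / n)

    # Sample the strict upper triangle, then symmetrize (no self-loops).
    upper = np.triu(rng.random((n, n)) < probs, k=1)
    A = (upper | upper.T).astype(np.int8)

    # Features: rank-one signal aligned with the labels plus Gaussian noise.
    v = rng.standard_normal(p)
    B = np.sqrt(mu / n) * np.outer(u, v) + rng.standard_normal((n, p))
    return A, B, u

if __name__ == "__main__":
    A, B, u = sample_csbm(n=1000, p=500, c_in=8.0, c_out=2.0, mu=2.0)
    print(A.shape, B.shape, A.sum() / len(u))  # last value: empirical mean degree
```

In a semi-supervised experiment one would reveal a fraction of the entries of `u` and ask a classifier to predict the rest from `A` and `B`; how that split is done is left out of this sketch.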
Submission history
From: O Duranthon
[v1] Tue, 6 Jun 2023 10:02:57 UTC (108 KB)
[v2] Tue, 5 Mar 2024 16:09:44 UTC (154 KB)