• Editors' Suggestion
  • Go Mobile »
  • Access by Staats- und Universitaetsbibliothek Bremen

Structural Reducibility of Hypergraphs

Alec Kirkley1,2,3,*, Helcio Felippe4, and Federico Battiston4,†

  • *Contact author: alec.w.kirkley@gmail.com
  • Contact author: battistonf@ceu.edu

Phys. Rev. Lett. 135, 247401 – Published 12 December, 2025

DOI: https://doi.org/10.1103/xrn7-cz8v

Abstract

Higher-order interactions provide a nuanced understanding of the relational structure of complex systems beyond traditional pairwise interactions. However, higher-order network analyses also incur more cumbersome interpretations and greater computational demands than their pairwise counterparts. Here, we present an information-theoretic framework for determining the extent to which a hypergraph representation of a networked system is structurally redundant and for identifying its most critical higher orders of interaction that allow us to remove these redundancies while preserving essential higher-order structure.

Physics Subject Headings (PhySH)

Article Text

A wide variety of complex systems and relational data are characterized by higher-order, nondyadic interactions . Such systems can be conveniently represented as hypergraphs, collections of nodes representing fundamental units of a system that are connected by hyperedges encoding interactions among an arbitrary number of nodes . To investigate the higher-order architecture of networked systems, new mathematical and computational frameworks have been proposed , revealing previously unknown organizational principles and new emergent behaviors in collective phenomena ranging from contagions and diffusion to synchronization and evolutionary dynamics . Nevertheless, due to the high dimensionality of many real-world hypergraphs, higher-order network analyses are typically more computationally demanding and complex than pairwise network analyses. Hence, it is important to identify and exploit redundancies—which have been observed in real-world systems —to construct more compressed representations that retain the key structural heterogeneity present in a system’s original higher-order structure.

Inspired by related work in the context of multilayer networks , here we provide a simple and principled information-theoretic solution to identify the structural reducibility of a hypergraph—the extent to which a hypergraph provides redundant information about a system’s relational structure—and remove these redundancies to create a reduced representation that retains its critical higher-order structure. Our method is interpretable and computationally efficient and can be generalized to capture the reducibility of hypergraphs when viewed at different scales. We test our framework on a variety of synthetic network models, showcasing its wide applicability and robustness to different sources of statistical noise. Finally, we apply the framework to a corpus of real-world higher-order systems from various application domains, finding that many of these systems can be substantially structurally reduced.

Hypergraph reducibility—

Let be a hypergraph with unique (but not necessarily consecutive) layers indexed by , each layer containing all hyperedges of size from . Let denote the set of unique layer indices. For example, a hypergraph with only layers and would have and . We consider as a set of undirected, sorted tuples of size with no repeated entries, and let be the number of hyperedges in (in other words, is a simple, undirected hypergraph). There are possible undirected, sorted tuples of size so . We also let be the projection of layer onto the order of , which extracts all unique -tuples nested within the -tuples in . For example, if , we would have . We define , similarly to the unprojected layers, and use the convention .

The structural reducibility of a hypergraph can be defined based on the overlap among its pairs of layers , where overlap is defined based on the projection of each layer to the lower order of the two. This convention is required because higher-order hyperedges have unique projections onto lower-order interactions (as defined above), but one cannot conversely determine higher-order structure uniquely from lower-order structure alone . Higher overlap among the layers indicates higher structural redundancy among different orders of hyperedges, suggesting a higher structural reducibility for the hypergraph . Formally, we can define the overlap of the layers indexed by as , which is the number of hyperedges the two layers share when they are projected to the lower order of the two layers.

Our proposed reducibility measure reflects the extent to which a hypergraph can be compressed in an information-theoretic sense when we exploit the structural redundancy among its hyperedge layers of different orders (i.e., their layer overlaps). To formalize this concept mathematically, we consider transmitting the hypergraph to a receiver using two different schemes. In the first (naïve) scheme, we transmit each of ’s layers individually. In the second scheme, we first transmit a set of “representative” layers which capture most of the heterogeneity in the hyperedge structure in , and then we transmit each remaining layer as a noisy copy of a representative layer . A similar concept has been employed to compress multilayer network structures and sets of network partitions using intermediate representative structures.

We assume that the receiver knows the orders of the layers and the number of hyperedges for each —specifying these counts incurs a comparatively negligible information cost anyway. Since each layer has possible configurations of its hyperedges when is known, we need to send a bit string of length approximately equal to bits to fully specify which configuration corresponds to . The naïve transmission of as individual layers, therefore, requires an information content of

(1)

bits, using the convention for brevity.

A better way to transmit is to exploit the overlaps among layers of hyperedges of different sizes to save information. To do this, we first transmit a representative subset of layers indexed by , which incurs a cost of bits. Then we transmit each remaining layer by (a) transmitting a representative layer of a higher order, costing us as there are layers to choose from; (b) transmitting the overlap among the layer and its representative , costing us as ; and (c) transmitting layer given the constraints imposed by the overlap value . If of the possible edges in are present in , and of the possible edges absent from are present in , then we require bits to specify given knowledge of and . Steps (a) and (b) incur negligible information costs compared to step (c) and can be ignored. The total information content of this scheme is then

(2)

bits. For a minimal information cost, the representative layer for each layer can be assigned as

(3)

The information cost of this transmission scheme depends on which layers are selected as representatives—the better the layers capture the heterogeneity in the hyperedge structure in , the lower the information cost . Thus, to maximize compression from layer overlap, we must find the optimal set of representative layers according to

(4)

We will describe shortly how to solve this optimization problem.

The optimal information cost of this transmission scheme is bounded in the interval , where

(5)

The lower bound follows from always minimally needing to transmit the top layer of as a representative layer, at a cost , and the upper bound follows from being in the solution space over which we minimize to find . Therefore, exploiting layer overlap always provides compression relative to the naïve transmission of layers independently. The reducibility of the hypergraph can then be computed based on the extent to which can be compressed relative to the baseline cost of .

Using these bounds, we can construct a properly normalized structural reducibility measure for a hypergraph as

(6)

which satisfies . If is maximally compressible—i.e., is a nested hypergraph where all layers with are given by —then we have and and, thus, . On the other hand, we have an information cost when is highly incompressible (i.e., has little to no structural overlap among its layers), as there is little shared information that can be exploited to improve on the naïve information cost of ; thus, .

In Supplemental Material , we discuss extending our reducibility concept to understand the structural redundancy of multiscale coarse-grainings of hypergraphs (Sec. I), as well as individual hypergraph layers (Sec. V) and individual hyperedges (Sec. VI).

By solving Eq.  to maximize compression of , we can also obtain a compressed hypergraph representation for given by , which captures the critical higher-order structure of while removing structurally redundant layers. In Fig. , we show an example hypergraph with layers and its corresponding reduced representation of layers , giving a reducibility value of .

FIG. 1.

Structural reducibility. (a) Hypergraph containing layers of size , which is reducible to an optimal representative layer set with reducibility [Eq. ]. (b) Reducibility of a noisy nested hypergraph, with noise parameter determining the fraction of randomized hyperedges, for various hypergraph dimensions .

Optimizing reducibility—

To identify the optimal representative layers , when —a value satisfied by most real hypergraph datasets —we can use a simple brute-force search. Letting be the largest hyperedge size in , we must have , since there is no higher layer from which to transmit . Therefore, we have possible subsets of representatives that we must search through. In our method we must first compute , , and for all pairs of layers with . For layer pairs with small values , we can directly compute and using the projection and the lower layer . For larger values, however, we cannot compute the projection directly. In this case, we can compute by iterating through the edges in a fixed order, for each edge checking its overlaps with all previously checked edges. Then, we can compute the number of new projected tuples that contributes to as , where is the number of unique subtuples of size within the set of overlapping tuples , which can be computed recursively using the same approach. When computing the projection is infeasible, we also must compute in a more efficient way. We do this by iterating over hyperedges and incrementing for each edge that fully overlaps with , removing from afterward.

We can then compute a matrix such that

(7)

for layer pairs . The computation of certain conditional entropies is often the computational bottleneck in practice and takes roughly run-time for large where projection is intractable and run-time when using direct projection. We also compute a vector with entries

(8)

storing the individual layer information costs for all layers . Then, for each valid subset , we can compute

(9)

where . We then select as the representative subset that achieves the minimum value of . There are subsets to check, and each takes in the worst case operations to compute for each layer . The iteration over thus has a time complexity of roughly , which in practice is tractable for most real-world datasets with (see Table ).

For systems with many layers , optimizing over subsets through direct enumeration is unfeasible. In such cases, we use an approximate greedy method for identifying to compute the structural reducibility. Starting with , we can iteratively add the best layer to following the rule until . Then, we choose among the explored solution candidates to find the representative layer set giving the lowest information cost . In practice, this approximation always obtains accurate results (see Sec. IV in Supplemental Material ), significantly speeding up the computation and making our method available to otherwise intractable datasets.

Reducibility of synthetic hypergraphs—

To validate our approach, we investigate our method on synthetic hypergraphs with tunable structure. First, we consider nested hypergraphs, where all lower-order interactions are fully encapsulated into those of higher order. Noisy nested hypergraphs are nested hypergraph where a noise parameter determines the fraction of its hyperedges to be rewired, replacing each selected hyperedge with a hyperedge of the same order drawn uniformly at random. In Fig. , we plot the results of this test, averaged over ten realizations of the randomness for each value of . Standard errors (vanishingly small) are shown as error bars. We can observe that indicates complete reducibility when for each fully nested hypergraph and that decreases smoothly as the hypergraph becomes noisier, eventually bottoming out at for purely random hypergraphs ( ). As the system gets larger ( increases), we observe greater reducibility values, since a larger fraction of the layers are structurally redundant due to being nested within the top layer .

In a followup experiment, we examine the reducibility of more general synthetic hypergraphs with nested structure. We start by generating three planted representative layers , , and on nodes with , , , 1050, and 50 hyperedges, respectively, drawn uniformly at random without replacement. We then generate the layers , , and as noisy versions of the projected layers , , and , respectively, by selecting a fraction of the hyperedges in each projected layer randomly and replacing these hyperedges with those of the same size drawn uniformly at random. In Fig. , we plot layer-layer similarity matrices showing the network normalized mutual information (over arbitrary tuple sizes) for pairs of layers in the generated hypergraphs, illustrating the effect of on the nested structure. As increases, we see a smooth decrease in the structural reducibility , with our method able to infer the planted set of representative layers . We compare our results with dynamical reducibility , which reduces structure based on the collective behavior supported by the hypergraph rather than topological overlap and which remains close to zero without detecting any change in this simple but nuanced hypergraph structure. All results are averaged over 20 realizations of the randomness at each . Section II in Supplemental Material further investigates the reducibility of hypergraphs with tunable nested interactions.

FIG. 2.

Comparison of structural and dynamical reducibility measures. (a) Pairwise layer similarity matrices of block-nested hypergraphs at increasing levels of noise . Hypergraph layers , 4, and 6 are fully nested within , 5, and 7, respectively, and the similarity is lost as increases. (b) Structural and dynamical reducibility measures against all values. The dynamical reducibility does not detect any compressibility between layers. The structural reducibility uncovers both the structural redundancies and the planted, optimal representative layers .

We also examine our proposed multiscale reducibility measure in a similar experimental setting. For an node system, we synthesize each random hypergraph using a planted community partition by generating each of the hyperedges in layer as follows: (i) choose a random node to start a hyperedge ; (ii) add nodes to , drawing each node (without replacement) from the same community as with probability and from a different community with probability . The result is a hypergraph that is more tightly clustered under the planted node partition as and uncorrelated with as . All results are averaged over ten realizations of the randomness at each , and separate experiments are run for equally sized communities in . We observe that the standard reducibility of Eq.  cannot detect any changes in the mesoscale nested structure, while the multiscale reducibility of Eq. (S3) in Supplemental Material exhibits a smooth descent as increases. For larger , we see that the communities in become smaller and the hypergraph becomes less reducible under the coarse-graining , approaching the standard structural reducibility value.

Reducibility of real networks—

Finally, we apply our reducibility method to a range of real higher-order networks , as shown in Table . We find a great variety in the reducibility of these systems, with many systems most parsimoniously represented by only a small subset of their layers. In Supplemental Material , we further investigate the nested organization of real-world systems in Sec. III, while in Sec. IV we compare run times of the exact and greedy optimization methods for finding , showing that the greedy scheme is considerably faster especially for larger hypergraphs. Finally, we explore the structural and dynamical properties of all reduced empirical hypergraphs in Sec. VII. We find that the reduced hypergraph representations consistently preserve the global, mesoscale, and local connectivity of the complete empirical hypergraphs—as quantified by the effective number of connected components, community structure, and degree ordering, respectively. We also find that these reduced systems preserve the consensus times in higher-order voter model dynamics . In general, we observe that such properties are better preserved as the reducibility increases, due to improved compressibility of the original hypergraph structure.

TABLE I.

Structural reducibility of empirical datasets. Daggers denote the usage of greedy minimization for obtaining and . The greedy scheme produced identical results to the exact scheme for all networks with .

Dataset
Coauth-mag-geology_1980 1674 903 18 {3, 5, 9, 18} 0.03
Coauth-mag-geology_1981 1075 547 29 {4, 5, 8, 29} 0.01
Coauth-mag-geology_1982 1878 987 26 {4, 6, 26} 0.02
Coauth-mag-geology_1983 1734 883 36 {4, 36}
Kaggle-whats-cooking 6714 39224 65 {6,8,9,11,65}
Contact-high-school 327 7818 5 {3, 5} 0.13
Contact-primary-school 242 12704 5 {4, 5} 0.09
Hospital-lyon 75 1824 5 {4, 5} 0.11
Hypertext-conference 113 2434 6 {3, 5, 6} 0.06
Invs13 92 787 4 {3, 4} 0.05
Invs15 217 4909 4 {3, 4} 0.10
Science-gallery 410 3350 5 {3, 5} 0.16
Sfhh-conference 403 10541 9 {4,9} 0.10
Malawi-village 84 431 4 {3, 4} 0.19
Dawn 2290 138742 16 {6,7,13,16} 0.15
Ndc-classes 628 796 39 {4, 9, 10, 13, 14,23,27,39}
Ndc-substances 3414 6471 187 {5, 40, 187}
Email-enron 143 1459 37 {4,6,11,12,37}
Email-eu 986 24520 40 {5,7,8,9,10,11, 12,13,27,39,40}
Tags-ask-ubuntu 3021 145053 5 {5} 0.17
Tags-math-sx 1627 169259 5 {5} 0.26

Conclusion—

Reducing the dimensionality of higher-order systems allows for more efficient analyses, with simpler interpretations and visualization. Here, we have developed a principled, efficient, and interpretable information-theoretic framework for assessing the structural reducibility of hypergraphs and removing structural redundancies to construct compressed hypergraph representations retaining the critical higher-order structure of complex networked systems. There are a number of ways in which this framework can be extended in future work to directed, weighted, temporal, or multilayer hypergraphs. This would allow the method to be applied to representations that capture additional nuances of the relational structure in a wider variety of systems. Our Letter sheds new light on the organizational principles of higher-order networks, distinguishing the extent to which lower-order information is redundant in the presence of higher-order information.

A. K. acknowledges support from the National Science Foundation of China (NSFC) through Young Scientist Fund Project No. 12405044. F. B. acknowledges support from the Austrian Science Fund (FWF) through Projects No. 10.55776/PAT1052824 and No. 10.55776/PAT1652425.

Data availability—

The data that support the findings of this Letter are openly available . Code is available as part of the library Hypergraphx .

Supplemental Material

Section S1: Detailed derivation of structural reducibility at different scales.
Section S2: Extensive testing of structural reducibility on synthetic hypergraphs with prescribed nestedness.
Section S3: Detailed analysis of nestedness for 21 empirical datasets.
Section S4: Extensive testing of the accuracy and efficiency of an approximate greedy algorithm for structural reducibility.
Section S5: Individual layer reducibility.
Section S6: Reducibility through representative hyperedges.
Section S7: Structural and dynamical properties of reduced hypergraphs.

References (42)

  1. F. Battiston, G. Cencetti, I. Iacopini, V. Latora, M. Lucas, A. Patania, J.-G. Young, and G. Petri, Phys. Rep. 874, 1 (2020).
  2. F. Battiston, E. Amico, A. Barrat, G. Bianconi, G. Ferraz de Arruda, B. Franceschiello, I. Iacopini, S. Kéfi, V. Latora, Y. Moreno et al., Nat. Phys. 17, 1093 (2021).
  3. G. Bianconi, Higher-Order Networks (Cambridge University Press, Cambridge, England, 2021).
  4. C. Bick, E. Gross, H. A. Harrington, and M. T. Schaub, SIAM Rev. 65, 686 (2023).
  5. S. Majhi, M. Perc, and D. Ghosh, J. R. Soc. Interface 19, 20220043 (2022).
  6. C. Berge, Hypergraphs: Combinatorics of Finite Sets (Elsevier, New York, 1984), Vol. 45.
  7. A. R. Benson, R. Abebe, M. T. Schaub, A. Jadbabaie, and J. Kleinberg, Proc. Natl. Acad. Sci. U.S.A. 115, E11221 (2018).
  8. G. Petri and A. Barrat, Phys. Rev. Lett. 121, 228301 (2018).
  9. M. Contisciani, F. Battiston, and C. De Bacco, Nat. Commun. 13, 7229 (2022).
  10. L. Di Gaetano, F. Battiston, and M. Starnini, Phys. Rev. Lett. 132, 037401 (2024).
  11. I. Iacopini, G. Petri, A. Barrat, and V. Latora, Nat. Commun. 10, 2485 (2019).
  12. G. Burgio, S. Gómez, and A. Arenas, Phys. Rev. Lett. 132, 077401 (2024).
  13. G. Ferraz de Arruda, A. Aleta, and Y. Moreno, Nat. Rev. Phys. 6, 468 (2024).
  14. L. Di Gaetano, G. Carugno, F. Battiston, and F. Coghi, Phys. Rev. Lett. 133, 107401 (2024).
  15. P. S. Skardal and A. Arenas, Phys. Rev. Lett. 122, 248301 (2019).
  16. A. P. Millán, J. J. Torres, and G. Bianconi, Phys. Rev. Lett. 124, 218301 (2020).
  17. Y. Zhang, M. Lucas, and F. Battiston, Nat. Commun. 14, 1605 (2023).
  18. M. S. Anwar, G. K. Sar, M. Perc, and D. Ghosh, Communications in Physics 7, 59 (2024).
  19. S. Majhi, S. Ghosh, P. K. Pal, S. Pal, T. K. Pal, D. Ghosh, J. Završnik, and M. Perc, Phys. Life Rev. 52, 144 (2025).
  20. U. Alvarez-Rodriguez, F. Battiston, G. F. de Arruda, Y. Moreno, M. Perc, and V. Latora, Nat. Hum. Behav. 5, 586 (2021).
  21. A. Civilini, O. Sadekar, F. Battiston, J. Gómez-Gardeñes, and V. Latora, Phys. Rev. Lett. 132, 167401 (2024).
  22. A. Kumar, S. Chowdhary, V. Capraro, and M. Perc, Phys. Rev. E 104, 054308 (2021).
  23. Q. F. Lotito, F. Musciotto, A. Montresor, and F. Battiston, Commun. Phys. 5, 79 (2022).
  24. T. LaRock and R. Lambiotte, J. Phys. Complex. 4, 045007 (2023).
  25. N. W. Landry, J.-G. Young, and N. Eikmeier, EPJ Data Sci. 13, 17 (2024).
  26. L. Gallo, L. Lacasa, V. Latora, and F. Battiston, Nat. Commun. 15, 4754 (2024).
  27. M. De Domenico, V. Nicosia, A. Arenas, and V. Latora, Nat. Commun. 6, 6864 (2015).
  28. M. De Domenico and J. Biamonte, Phys. Rev. X 6, 041062 (2016).
  29. A. Santoro and V. Nicosia, Phys. Rev. X 10, 021069 (2020).
  30. A. Kirkley, A. Rojas, M. Rosvall, and J.-G. Young, Commun. Phys. 6, 148 (2023).
  31. A. Kirkley and M. Newman, Commun. Phys. 5, 40 (2022).
  32. M. Lucas, L. Gallo, A. Ghavasieh, F. Battiston, and M. De Domenico, arXiv:2404.08547.
  33. H. Felippe, F. Battiston, and A. Kirkley, Commun. Phys. 7, 335 (2024).
  34. N. W. Landry, M. Lucas, I. Iacopini, G. Petri, A. Schwarze, A. Patania, and L. Torres, J. Open Source Software 8, 5162 (2023).
  35. See Supplemental Material at http://link.aps.org/supplemental/10.1103/xrn7-cz8v including Refs. [36–41].
  36. J. Kim, D.-S. Lee, B. Min, M. A. Porter, M. San Miguel, and K.-I. Goh, Phys. Rev. E 111, L052301 (2025).
  37. M. A. Riolo, G. T. Cantwell, G. Reinert, and M. E. J. Newman, Phys. Rev. E 96, 032310 (2017).
  38. S. Morel-Balbi and A. Kirkley, Phys. Rev. Res. 6, 033307 (2024).
  39. N. X. Vinh, J. Epps, and J. Bailey, Proceedings of the 26th Annual International Conference on Machine Learning (2009), pp. 1073–1080, 10.1145/1553374.1553511.
  40. A. Eriksson, D. Edler, A. Rojas, M. de Domenico, and M. Rosvall, Commun. Phys. 4, 133 (2021).
  41. V. Sood, T. Antal, and S. Redner, Phys. Rev. E 77, 041121 (2008).
  42. Q. F. Lotito, M. Contisciani, C. De Bacco, L. Di Gaetano, L. Gallo, A. Montresor, F. Musciotto, N. Ruggeri, and F. Battiston, J. Complex Networks 11, cnad019 (2023).