• Mobile Active!
  • Access by Staats- und Universitaetsbibliothek Bremen

Inverse Ising Inference Using All the Data

Erik Aurell and Magnus Ekeberg
Phys. Rev. Lett. 108, 090201 – Published 1 March 2012

Abstract

We show that a method based on logistic regression, using all the data, solves the inverse Ising problem far better than mean-field calculations relying only on sample pairwise correlation functions, while still computationally feasible for hundreds of nodes. The largest improvement in reconstruction occurs for strong interactions. Using two examples, a diluted Sherrington-Kirkpatrick model and a two-dimensional lattice, we also show that interaction topologies can be recovered from few samples with good accuracy and that the use of l1 regularization is beneficial in this process, pushing inference abilities further into low-temperature regimes.

  • Figure
  • Figure
  • Received 29 September 2011

DOI:https://doi.org/10.1103/PhysRevLett.108.090201

© 2012 American Physical Society

Authors & Affiliations

Erik Aurell*

  • ACCESS Linnaeus Centre, KTH, Stockholm, Sweden and Department Computational Biology, AlbaNova University Centre, 106 91 Stockholm, Sweden

Magnus Ekeberg

  • Engineering Physics Program, KTH Royal Institute of Technology, 100 77 Stockholm, Sweden

  • *Also at Aalto University School of Science, Helsinki, Finland. eaurell@kth.se
  • ekeb@kth.se

Article Text

Click to Expand

References

Click to Expand
Issue

Vol. 108, Iss. 9 — 2 March 2012

Reuse & Permissions
Author publication services for translation and copyediting assistance advertisement
  • (a) and (b) show reconstruction errors of PLM, nMF, and SCE versus temperature for (a) fully () and (b) sparsely () connected SK systems of size . The number of MC samples used are (dotted lines), (dashed lines) and (continuous lines). (c) Reconstruction errors of PLM as functions of external field strength for a SK system of size using samples for two different temperatures. The dashed curve is obtained for by excluding parameter sets where one or more empirical . (d) Comparison of parameter estimates between Boltzmann learning () and PLM () using data generated from a distribution with Hamiltonian . The system parameters used are , , , and all interaction parameters are drawn from a distribution.
  • (a) Edge agreement versus sample size in a binary SK model of size and sparsity for and . for all data points. (b) Probability of 100% edge agreement versus inverse temperature for and using on nearest-neighbor grids () with 30% dilution.

Sign up to receive regular email alerts from Physical Review Letters