Efficient training of energy-based models via spin-glass control

Alejandro Pozas-Kerstjens,Gorka Muñoz-Gil,Eloy Piñol,Miguel Ángel García-March,Antonio Acín,Maciej Lewenstein,Przemysław R. Grzybowski
DOI: https://doi.org/10.1088/2632-2153/abe807
2021-04-15
Abstract:We introduce a new family of energy-based probabilistic graphical models for efficient unsupervised learning. Its definition is motivated by the control of the spin-glass properties of the Ising model described by the weights of Boltzmann machines. We use it to learn the Bars and Stripes dataset of various sizes and the MNIST dataset, and show how they quickly achieve the performance offered by standard methods for unsupervised learning. Our results indicate that the standard initialization of Boltzmann machines with random weights equivalent to spin-glass models is an unnecessary bottleneck in the process of training. Furthermore, this new family allows for very easy access to low-energy configurations, which points to new, efficient training algorithms. The simplest variant of such algorithms approximates the negative phase of the log-likelihood gradient with no Markov chain Monte Carlo sampling costs at all, and with an accuracy sufficient to achieve good learning and generalization.
Statistical Mechanics,Disordered Systems and Neural Networks,Machine Learning
What problem does this paper attempt to address?