Machine Learning and Neural Networks for Field Theory
Sam Foreman,Xiao-Yong Jin,J. Osborn
2020-01-01
Abstract:Perturbative acceleration [1–3]. The ability to efficiently sample from high-dimensional distributions remains a widely-pursued goal across scientific disciplines, with some noteable examples including particle physics, molecular dynamics, protein folding, population genetics, neuroscience, epidemiology, economics, ecology, climate science, and astrophysics & cosmology [4]. Recently, there has been a growing interest in developing probabilistic models that are parameterized by neural networks, and while much progress has been made in this direction [5–10], mitigating the critical slowing down (CSD) effect for lattice QCD remains a long-term goal of the community. Since all lattice QCD simulations are performed at finite lattice spacing a, an extrapolation to the continuum limit is required in order to accurately compute physical quantities of interest. More reliable extrapolations can be done by simulating the theory at increasingly smaller lattice spacings while keeping the physics constant. Unfortunately, this causes the correlation times of these quantities to diverge, indicating that the continuum limit is a critical point of the theory. Markov Chain Monte Carlo (MCMC) algorithms are known to encounter difficulties when simulating theories near a critical point, an issue known as the critical slowing down of the algorithm [11]. This effect can easily be seen in the topological charge Q ∈ Z, whose auto-correlation time increases dramatically with smaller lattice spacing as configurations tend to get ‘stuck’ in distinct topological sectors, preventing an efficient exploration of the phase space. As a result, developing new sampling techniques that are able to offer improvements in efficiency through a reduction of statistical autocorrelations are highly desired. In this LOI, we describe some recent work in this area, and provide suggestions for possible future directions. Generally speaking, MCMC methods are a class of algorithms that use Markov Chains to sample from a target distribution p(x) that is often too complicated to sample from directly. Currently, the Hamiltonian Monte Carlo (HMC) algorithm is the most widely used technique for generating gauge configurations in lattice gauge theory and lattice QCD. We include below a brief overview of the approach, but refer the interested reader to [12, 13] for more details and limitations. We begin by introducing an auxiliary momentum variable v (normally distributed, independent of the position x) in order to lift the target distribution onto a joint probability distribution p(x, v) in phase space. The Markov Chain is then obtained by simulating a physical system governed by a Hamiltonian comprised of kinetic and potential energy functions, i.e. H(x, v) = U(x) + T (v). In particular, HMC operates by sampling from the canonical distribution p(x, v) = exp(H(x, v)) = p(x)p(v) by solving the equations of motion (ẋi = ∂H ∂vi , v̇i = − ∂H ∂xi ) for a fixed period of time using a volume-preserving integrator. In practice the integration is done in discrete steps introducing some numerical error. This then requires a Metropolis accept/reject step to correct for the error. As the lattice spacing decreases, sectors of different topology become separated by large potential barriers, and simply moving along trajectories from the standard EOM become inefficient at moving between