The quality and complexity of pairwise maximum entropy models for large cortical populations
Valdemar Kargård Olsen,Jonathan R. Whitlock,Yasser Roudi
DOI: https://doi.org/10.1371/journal.pcbi.1012074
2024-05-03
PLoS Computational Biology
Abstract: We investigate the ability of the pairwise maximum entropy (PME) model to describe the spiking activity of large populations of neurons recorded from the visual, auditory, motor, and somatosensory cortices. To quantify this performance, we use (1) Kullback-Leibler (KL) divergences, (2) the extent to which the pairwise model predicts third-order correlations, and (3) its ability to predict the probability that multiple neurons are simultaneously active. We compare these with the performance of a model with independent neurons and study the relationship between the different performance measures, while varying the population size, the mean firing rate of the chosen population, and the bin size used for binarizing the data. We confirm the previously reported excellent performance of the PME model for small population sizes N < 20. But we also find that larger mean firing rates and bin sizes generally decrease performance. The performance for larger populations was generally not as good. For large populations, pairwise models may be good at predicting third-order correlations and the probability of multiple neurons being active, yet still perform significantly worse than for small populations in terms of their improvement over the independent model in KL divergence. We show that these results are independent of the cortical area and of whether approximate methods or Boltzmann learning are used for inferring the pairwise couplings. We compare the scaling of the inferred couplings with N and find it to be well explained by the Sherrington-Kirkpatrick (SK) model, whose strong coupling regime shows a complex phase with many metastable states. We find that, up to the maximum population size studied here, the fitted PME model remains outside its complex phase. However, the standard deviation of the couplings relative to their mean increases, and the model gets closer to the boundary of the complex phase as the population size grows.
With recent major advances in recording technology, much of computational neuroscience has effectively turned into describing patterns in large amounts of data as succinctly as possible. One way to do this is to construct simple parametric models of the probability distribution over patterns of neuronal activity, such as the pairwise maximum entropy model. Intuitively, the pairwise model makes the distribution over all patterns as flat or uniform as possible, while keeping all firing rates and pairwise correlations the same as in the data. This model has been shown to capture the observed distribution of activity patterns well for small populations (∼10 neurons), but it has not been systematically studied for large populations. Here, we study the performance of the pairwise model using a Neuropixel dataset recorded from the visual, auditory, somatosensory, and motor cortices of freely moving rats exposed to different stimuli. Consistent with previous findings, we find good performance for small populations, but performance falls sharply as the population size increases (>∼25). However, we also find that this decrease in performance reveals interesting differences between the correlation structure of the data recorded under different sensory conditions.
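The idea of matching firing rates and pairwise correlations while otherwise keeping the distribution as flat as possible can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the toy data are synthetic, neurons are coded as 0/1, the model is fitted by exact Boltzmann learning (gradient ascent on the likelihood with all 2^N patterns enumerated, which is only feasible for very small N), and the quality measure G = 1 - D_KL(data || pairwise) / D_KL(data || independent) is one commonly used way to express the improvement over the independent model. All function names are hypothetical.

```python
import itertools
import numpy as np

def all_states(n):
    """All 2**n binary patterns as rows of a (2**n, n) array."""
    return np.array(list(itertools.product([0, 1], repeat=n)), dtype=float)

def pairwise_probs(h, J, states):
    """p(s) proportional to exp(h.s + 0.5 s^T J s), J symmetric with zero diagonal."""
    log_w = states @ h + 0.5 * np.einsum("ki,ij,kj->k", states, J, states)
    w = np.exp(log_w - log_w.max())  # subtract the max for numerical stability
    return w / w.sum()

def fit_pairwise(data, n_iter=5000, lr=0.2):
    """Exact Boltzmann learning: move model moments toward data moments."""
    n = data.shape[1]
    states = all_states(n)
    mean_data = data.mean(axis=0)
    corr_data = data.T @ data / len(data)
    h, J = np.zeros(n), np.zeros((n, n))
    for _ in range(n_iter):
        p = pairwise_probs(h, J, states)
        mean_model = p @ states
        corr_model = states.T @ (states * p[:, None])
        h += lr * (mean_data - mean_model)
        dJ = lr * (corr_data - corr_model)
        np.fill_diagonal(dJ, 0.0)  # no self-couplings
        J += dJ
    return h, J

def independent_probs(rates, states):
    """Product distribution that matches only the firing rates."""
    return np.prod(np.where(states == 1, rates, 1 - rates), axis=1)

def empirical_probs(data):
    """Observed frequency of each pattern, ordered like all_states."""
    n = data.shape[1]
    idx = data.astype(int) @ (2 ** np.arange(n - 1, -1, -1))
    return np.bincount(idx, minlength=2 ** n) / len(data)

def kl(p, q):
    """Kullback-Leibler divergence D(p || q) in nats."""
    mask = p > 0
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))

# Synthetic toy data (an assumption): 4 neurons driven by a shared input.
rng = np.random.default_rng(0)
T, n = 5000, 4
common = rng.random(T) < 0.3
data = (rng.random((T, n)) < np.where(common, 0.5, 0.2)[:, None]).astype(float)

states = all_states(n)
h, J = fit_pairwise(data)
p_data = empirical_probs(data)
p_pair = pairwise_probs(h, J, states)
p_ind = independent_probs(data.mean(axis=0), states)

# Fraction of the independent model's KL divergence removed by the pairwise model.
G = 1.0 - kl(p_data, p_pair) / kl(p_data, p_ind)
print(f"G = {G:.3f}")
```

Exact enumeration is used here only because it keeps the sketch short and verifiable. For the population sizes discussed above it is infeasible, which is why sampling-based Boltzmann learning or approximate inference methods are needed in practice.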
biochemical research methods,mathematical & computational biology