Optimal errors and phase transitions in high-dimensional generalized linear models

Jean Barbier,Florent Krzakala,Nicolas Macris,Léo Miolane,Lenka Zdeborová
DOI: https://doi.org/10.1073/pnas.1802705116
IF: 11.1
2019-03-01
Proceedings of the National Academy of Sciences
Abstract:Significance High-dimensional generalized linear models are basic building blocks of current data analysis tools including multilayers neural networks. They arise in signal processing, statistical inference, machine learning, communication theory, and other fields. We establish rigorously the intrinsic information-theoretic limitations of inference and learning for a class of randomly generated instances of generalized linear models, thus closing several decades-old conjectures. Moreover, we delimit regions of parameters for which the optimal error rates are efficiently achievable with currently known algorithms. Our proof technique is able to deal with the output nonlinearity and is hence of independent interest, opening ways to establish similar results for models of neural networks where nonlinearities are essential but in general difficult to account for.
What problem does this paper attempt to address?