Deep Learning Is Singular, and That’s Good
Susan Wei,Daniel Murfet,Mingming Gong,Hui Li,Jesse Gell-Redman,Thomas Quella
DOI: https://doi.org/10.1109/tnnls.2022.3167409
IF: 14.255
2022-01-01
IEEE Transactions on Neural Networks and Learning Systems
Abstract:In singular models, the optimal set of parameters forms an analytic set with singularities, and a classical statistical inference cannot be applied to such models. This is significant for deep learning as neural networks are singular, and thus", dividing" by the determinant of the Hessian or employing the Laplace approximation is not appropriate. Despite its potential for addressing fundamental issues in deep learning, a singular learning theory appears to have made little inroads into the developing canon of a deep learning theory. Via a mix of theory and experiment, we present an invitation to the singular learning theory as a vehicle for understanding deep learning and suggest an important future work to make the singular learning theory directly applicable to how deep learning is performed in practice.
computer science, artificial intelligence, theory & methods,engineering, electrical & electronic, hardware & architecture