A mathematical theory of semantic development in deep neural networks

Andrew M. Saxe,James L. McClelland,Surya Ganguli
DOI: https://doi.org/10.1073/pnas.1820226116
IF: 11.1
2019-05-17
Proceedings of the National Academy of Sciences
Abstract:Significance Over the course of development, humans learn myriad facts about items in the world, and naturally group these items into useful categories and structures. This semantic knowledge is essential for diverse behaviors and inferences in adulthood. How is this richly structured semantic knowledge acquired, organized, deployed, and represented by neuronal networks in the brain? We address this question by studying how the nonlinear learning dynamics of deep linear networks acquires information about complex environmental structures. Our results show that this deep learning dynamics can self-organize emergent hidden representations in a manner that recapitulates many empirical phenomena in human semantic development. Such deep networks thus provide a mathematically tractable window into the development of internal neural representations through experience.
What problem does this paper attempt to address?