Emergent Structures and Lifetime Structure Evolution in Artificial Neural Networks
Siavash Golkar
DOI: https://doi.org/10.48550/arXiv.1911.11691
2019-11-27
Abstract: Motivated by the flexibility of biological neural networks whose connectivity structure changes significantly during their lifetime, we introduce the Unstructured Recursive Network (URN) and demonstrate that it can exhibit similar flexibility during training via gradient descent. We show empirically that many of the different neural network structures commonly used in practice today (including fully connected, locally connected and residual networks of different depths and widths) can emerge dynamically from the same URN. These different structures can be derived using gradient descent on a single general loss function where the structure of the data and the relative strengths of various regulator terms determine the structure of the emergent network. We show that this loss function and the regulators arise naturally when considering the symmetries of the network as well as the geometric properties of the input data.
Machine Learning, Neural and Evolutionary Computing, Neurons and Cognition