Toward Deeper Understanding of Neural Networks: The Power of Initialization and a Dual View on Expressivity

Amit Daniely,Roy Frostig,Yoram Singer
DOI: https://doi.org/10.48550/arXiv.1602.05897
2017-05-20
Abstract:We develop a general duality between neural networks and compositional kernels, striving towards a better understanding of deep learning. We show that initial representations generated by common random initializations are sufficiently rich to express all functions in the dual kernel space. Hence, though the training objective is hard to optimize in the worst case, the initial weights form a good starting point for optimization. Our dual view also reveals a pragmatic and aesthetic perspective of neural networks and underscores their expressive power.
Machine Learning,Artificial Intelligence,Computational Complexity,Data Structures and Algorithms
What problem does this paper attempt to address?