Learning Compact Neural Networks Using Ordinary Differential Equations as Activation Functions

MohamadAli Torkamani,Phillip Wallis,Shiv Shankar,Amirmohammad Rooshenas
DOI: https://doi.org/10.48550/arXiv.1905.07685
2019-05-19
Abstract:Most deep neural networks use simple, fixed activation functions, such as sigmoids or rectified linear units, regardless of domain or network structure. We introduce differential equation units (DEUs), an improvement to modern neural networks, which enables each neuron to learn a particular nonlinear activation function from a family of solutions to an ordinary differential equation. Specifically, each neuron may change its functional form during training based on the behavior of the other parts of the network. We show that using neurons with DEU activation functions results in a more compact network capable of achieving comparable, if not superior, performance when is compared to much larger networks.
Machine Learning,Neural and Evolutionary Computing
What problem does this paper attempt to address?