A simple and efficient architecture for trainable activation functions

Andrea Apicella, Francesco Isgrò, Roberto Prevete
DOI: https://doi.org/10.1016/j.neucom.2019.08.065
IF: 6
2019-12-01
Neurocomputing
Abstract: Automatically learning the best activation function for a given task is an active topic in neural network research. Despite promising results, it remains challenging to find a method for learning an activation function that is both theoretically simple and easy to implement. Moreover, most of the methods proposed so far introduce new parameters or adopt different learning techniques. In this work, we propose a simple method to obtain a trained activation function by adding local sub-networks with a small number of neurons to the neural network. Experiments show that this approach could lead to better results than using a pre-defined activation function, without introducing the need to learn a large number of additional parameters.
computer science, artificial intelligence
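
The abstract's central idea, replacing a fixed activation with a small local sub-network, can be illustrated with a minimal Python sketch. The abstract does not specify the sub-network's layout, so everything here is an assumption made for illustration: the class name `SubNetworkActivation`, the 1 -> k -> 1 shape, the ReLU hidden units, and the parameter names `alpha`, `alpha_bias`, `beta`, and `beta_bias` are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import List


def relu(x: float) -> float:
    """Fixed, pre-defined activation used inside the sub-network (an assumption)."""
    return max(x, 0.0)


@dataclass
class SubNetworkActivation:
    """Hypothetical trainable activation: a tiny 1 -> k -> 1 sub-network.

    It maps one scalar pre-activation `a` to one scalar output. In a full
    network, these weights would be learned together with all other weights.
    """
    alpha: List[float]        # input-to-hidden weights, one per hidden neuron
    alpha_bias: List[float]   # hidden biases, one per hidden neuron
    beta: List[float]         # hidden-to-output weights
    beta_bias: float = 0.0    # output bias

    def __call__(self, a: float) -> float:
        # Hidden layer: k ReLU units that all read the same scalar input.
        hidden = [relu(w * a + b) for w, b in zip(self.alpha, self.alpha_bias)]
        # Output layer: weighted sum of the hidden units plus a bias.
        return sum(v * h for v, h in zip(self.beta, hidden)) + self.beta_bias

    def num_parameters(self) -> int:
        # 3k + 1 parameters for k hidden neurons.
        return len(self.alpha) + len(self.alpha_bias) + len(self.beta) + 1


# With these weights the sub-network reproduces plain ReLU.
relu_like = SubNetworkActivation(
    alpha=[1.0, 0.0, 0.0], alpha_bias=[0.0, 0.0, 0.0], beta=[1.0, 0.0, 0.0]
)

# With these weights it behaves like a leaky ReLU with negative slope 0.1.
leaky_like = SubNetworkActivation(
    alpha=[1.0, -1.0], alpha_bias=[0.0, 0.0], beta=[1.0, -0.1]
)
```

In this sketch a sub-network with k hidden neurons adds 3k + 1 parameters, which is one way to read the abstract's claim that the approach avoids a large number of additional parameters. Different weight settings recover different familiar shapes, so training can in principle move between them.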