S-Shape Activation Function with Adaptive Saturation Rate for Neural Networks

Nannan Ji,Jiangshe Zhang,Chunxia Zhang,Jianghong Ma,Lijuan Yang
DOI: https://doi.org/10.2139/ssrn.4331062
2023-01-01
Abstract:Activation function plays a vital role in the ability of artificial neural networks to learn complex functional mapping from data. Currently, the most successful and widely-used activation functions are designed by mimicking the models of biological neurons or proposed from the perspective of machine learning. In this work, motivated by two testimonies in neuroscience that the sigmoidal function is a common stimulus-response function and the response of a neuron varies from trial to trial even when the same sensory stimulus is repeatedly delivered, we propose an s-shape activation function with adaptive saturation rate, referred to as AFAS, through adding a trainable parameter α into the traditional sigmoid function. In this approach, α controls the saturation rate of activation function to characterize the variation in the neuron’s response. Meanwhile, we limit the minimum value of α to alleviate the vanishing gradients problem that may occur in deep neural networks. The effectiveness of the proposed AFAS is comprehensively substantiated by five different tasks with various types of network architectures.
What problem does this paper attempt to address?