Abstract: Effective activation functions introduce non-linear transformations, giving neural networks stronger fitting capabilities and helping them adapt to real data distributions. Huawei Noah's Lab argues that dynamic activation functions are better suited than static ones for enhancing the non-linear capabilities of neural networks, and related research from Tsinghua University also suggests using dynamically adjusted activation functions. Building on these ideas of fine-tuned activation functions from Tsinghua University and Huawei Noah's Lab, we propose a series-based learnable activation function called LSLU (Learnable Series Linear Units). The method simplifies deep learning networks while improving accuracy. It introduces learnable parameters $\theta$ and $\omega$ that control the activation function, adapting it to the current layer's training stage and improving the model's generalization. The principle is to increase the non-linearity of each activation layer, which boosts the network's overall non-linearity. We evaluate LSLU on CIFAR10, CIFAR100, and task-specific datasets (e.g., Silkworm), validating its effectiveness. We also analyze the convergence behavior of the learnable parameters $\theta$ and $\omega$ and their effects on generalization. Our empirical results show that LSLU enhances the generalization ability of the original model across various tasks while speeding up training. In VanillaNet training, $\theta$ initially decreases, then increases before stabilizing, while $\omega$ shows the opposite trend. Ultimately, LSLU achieves a 3.17% accuracy improvement on CIFAR100 for VanillaNet (Table 3). Code is available at https://github.com/vontran2021/Learnable-series-linear-units-LSLU.
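The abstract does not state the LSLU formula, so the following is only a minimal sketch of the general idea it describes: an activation whose shape is controlled by parameters trained alongside the model. The form f(x) = theta * relu(x) + omega, the class name `LearnableActivation`, and the helper `fit` are assumptions made for illustration and are not the authors' definition of LSLU.

```python
# Illustrative only: a generic learnable activation, NOT the paper's LSLU formula.
# Assumed form: f(x) = theta * relu(x) + omega, with theta and omega fitted
# by plain gradient descent on a mean squared error.

def relu(x):
    return x if x > 0.0 else 0.0


class LearnableActivation:
    """Hypothetical activation with a learnable scale (theta) and shift (omega)."""

    def __init__(self, theta=1.0, omega=0.0):
        self.theta = theta
        self.omega = omega

    def forward(self, x):
        return self.theta * relu(x) + self.omega

    def grads(self, x, upstream):
        # df/dtheta = relu(x), df/domega = 1, each scaled by the upstream gradient.
        return upstream * relu(x), upstream


def fit(act, xs, ys, lr=0.1, epochs=1000):
    """Fit theta and omega so that act.forward(x) approximates y."""
    n = len(xs)
    for _ in range(epochs):
        g_theta = 0.0
        g_omega = 0.0
        for x, y in zip(xs, ys):
            err = act.forward(x) - y  # gradient of 0.5 * err**2 w.r.t. the output
            gt, go = act.grads(x, err)
            g_theta += gt / n
            g_omega += go / n
        act.theta -= lr * g_theta
        act.omega -= lr * g_omega
    return act


# Toy target generated with theta = 2.0 and omega = -0.5.
xs = [-2.0, -1.0, 0.0, 1.0, 2.0, 3.0]
ys = [2.0 * relu(x) - 0.5 for x in xs]
act = fit(LearnableActivation(), xs, ys)
print(round(act.theta, 3), round(act.omega, 3))
```

In a real network these parameters would typically be per-layer and updated by the same optimizer as the weights; tracking their values across epochs is how trends such as the ones reported for $\theta$ and $\omega$ could be observed.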

Adaptive Parametric Activation

PATS: A New Neural Network Activation Function with Parameter

Normalized Activation Function: Toward Better Convergence

APALU: A Trainable, Adaptive Activation Function for Deep Learning Networks

Activate or Not: Learning Customized Activation

Activation Adaptation in Neural Networks

Adaptive Blending Units: Trainable Activation Functions for Deep Neural Networks

A novel activation function for multilayer feed-forward neural networks

ANAct: Adaptive Normalization for Activation Functions

S-Shape Activation Function with Adaptive Saturation Rate for Neural Networks

Exploring the Relationship: Transformative Adaptive Activation Functions in Comparison to Other Activation Functions

Trainable Highly-expressive Activation Functions

Activated Gradients for Deep Neural Networks

An Efficient Asymmetric Nonlinear Activation Function for Deep Neural Networks

Learn-Able Parameter Guided Activation Functions

A generic shift-norm-activation approach for deep learning

Activation Functions: Dive into an optimal activation function

Simple yet effective adaptive activation functions for physics-informed neural networks

Activation function optimization method: Learnable series linear units (LSLUs)

A Method on Searching Better Activation Functions

Learning Specialized Activation Functions for Physics-informed Neural Networks