fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions

Alireza Afzal Aghaei
2024-06-12
Abstract: Recent advancements in neural network design have given rise to the development of Kolmogorov-Arnold Networks (KANs), which enhance speed, interpretability, and precision. This paper presents the Fractional Kolmogorov-Arnold Network (fKAN), a novel neural network architecture that incorporates the distinctive attributes of KANs with a trainable adaptive fractional-orthogonal Jacobi function as its basis function. By leveraging the unique mathematical properties of fractional Jacobi functions, including simple derivative formulas, non-polynomial behavior, and activity for both positive and negative input values, this approach ensures efficient learning and enhanced accuracy. The proposed architecture is evaluated across a range of tasks in deep learning and physics-informed deep learning. Precision is tested on synthetic regression data, image classification, image denoising, and sentiment analysis. Additionally, the performance is measured on various differential equations, including ordinary, partial, and fractional delay differential equations. The results demonstrate that integrating fractional Jacobi functions into KANs significantly improves training speed and performance across diverse fields and applications.
Machine Learning, Numerical Analysis
What problem does this paper attempt to address?
The paper addresses the choice of activation (basis) functions in neural networks and proposes a new architecture, the Fractional Kolmogorov-Arnold Network (fKAN). Specifically, it aims to improve on existing neural network models in the following ways:

1. **Introduction of Fractional Jacobi Functions**: The paper proposes using trainable fractional Jacobi functions as the basis for activation functions. Their mathematical properties (simple derivative formulas, non-polynomial behavior, and activity for both positive and negative input values) are intended to improve learning efficiency and accuracy.
2. **Enhanced Training Speed and Performance**: The authors report that integrating fractional Jacobi functions into KANs significantly improves training speed and performance on synthetic regression data, image classification, image denoising, and sentiment analysis.
3. **Application in Physics-Informed Deep Learning**: fKAN is also evaluated on various differential equations, including ordinary, partial, and fractional delay differential equations.
4. **Addressing Limitations of Existing Activation Functions**: The paper discusses problems with commonly used activation functions (Sigmoid, Tanh, ReLU, and their variants), such as the vanishing gradient problem, and presents fractional Jacobi functions as a way to overcome them.

In summary, the paper is primarily concerned with designing more efficient and accurate neural network models through fractional Jacobi functions, with particular attention to physics-informed deep learning and the solution of complex differential equations.
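To make the idea of a fractional Jacobi basis function more concrete, the following is a minimal sketch and not the paper's implementation. It evaluates the classical Jacobi polynomial with the standard three-term recurrence and then applies a fractional change of variable. The sigmoid squashing step, the map `2 * z**gamma - 1`, and the names `jacobi`, `sigmoid`, and `fractional_jacobi` are all assumptions introduced for illustration. In a network, `alpha`, `beta`, and `gamma` would be trainable; here they are plain floats.

```python
import math


def jacobi(n, alpha, beta, t):
    """Jacobi polynomial P_n^(alpha, beta)(t) via the standard three-term
    recurrence. Requires alpha > -1 and beta > -1."""
    if n == 0:
        return 1.0
    p_prev = 1.0
    p_curr = (alpha + 1.0) + (alpha + beta + 2.0) * (t - 1.0) / 2.0
    for k in range(2, n + 1):
        c = 2.0 * k + alpha + beta
        lhs = 2.0 * k * (k + alpha + beta) * (c - 2.0)
        a = (c - 1.0) * (c * (c - 2.0) * t + alpha**2 - beta**2)
        b = 2.0 * (k + alpha - 1.0) * (k + beta - 1.0) * c
        p_prev, p_curr = p_curr, (a * p_curr - b * p_prev) / lhs
    return p_curr


def sigmoid(x):
    """Numerically stable logistic function, output in (0, 1)."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def fractional_jacobi(x, n, alpha, beta, gamma):
    """Hypothetical fractional Jacobi activation (assumed form).

    1. Squash any real input into (0, 1) so z**gamma is well defined
       for positive and negative x alike (assumption).
    2. Map z**gamma onto (-1, 1), the natural Jacobi domain (assumption).
    3. Evaluate the degree-n Jacobi polynomial there.
    With gamma != 1 the result is no longer a polynomial in z.
    """
    z = sigmoid(x)
    t = 2.0 * z**gamma - 1.0
    return jacobi(n, alpha, beta, t)


if __name__ == "__main__":
    for x in (-2.0, 0.0, 2.0):
        print(x, fractional_jacobi(x, n=3, alpha=0.5, beta=0.5, gamma=0.7))
```

Setting `alpha = beta = 0` and `gamma = 1` reduces the sketch to a Legendre polynomial of a sigmoid-squashed input, which is a convenient sanity check for the recurrence.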