Abstract: Recent advancements in neural network design have given rise to the development of Kolmogorov-Arnold Networks (KANs), which enhance speed, interpretability, and precision. This paper presents the Fractional Kolmogorov-Arnold Network (fKAN), a novel neural network architecture that incorporates the distinctive attributes of KANs with a trainable adaptive fractional-orthogonal Jacobi function as its basis function. By leveraging the unique mathematical properties of fractional Jacobi functions, including simple derivative formulas, non-polynomial behavior, and activity for both positive and negative input values, this approach ensures efficient learning and enhanced accuracy. The proposed architecture is evaluated across a range of tasks in deep learning and physics-informed deep learning. Precision is tested on synthetic regression data, image classification, image denoising, and sentiment analysis. Additionally, the performance is measured on various differential equations, including ordinary, partial, and fractional delay differential equations. The results demonstrate that integrating fractional Jacobi functions into KANs significantly improves training speed and performance across diverse fields and applications.

What problem does this paper attempt to address?

The paper addresses the choice of activation functions in neural networks and proposes a new architecture, Fractional Kolmogorov-Arnold Networks (fKAN). Specifically, it attempts to improve existing neural network models in the following ways:

1. **Introduction of Fractional Jacobi Functions**: The paper proposes fractional Jacobi functions as the basis for activation functions. It relies on their mathematical properties (simple derivative formulas, non-polynomial behavior, and well-defined output for both positive and negative inputs) to improve learning efficiency and accuracy.
2. **Enhanced Training Speed and Performance**: The paper reports that integrating fractional Jacobi functions into KANs improves training speed and task performance on synthetic regression data, image classification, image denoising, and sentiment analysis.
3. **Application in Physics-Informed Deep Learning**: fKAN is also evaluated on solving differential equations, including ordinary differential equations, partial differential equations, and fractional delay differential equations.
4. **Addressing Limitations of Existing Activation Functions**: The paper discusses problems with commonly used activation functions (Sigmoid, Tanh, ReLU, and their variants), such as the vanishing gradient problem, and proposes fractional Jacobi functions as a way to overcome them.

In summary, the paper is primarily concerned with designing more efficient and accurate neural network models using fractional Jacobi functions, with particular attention to physics-informed deep learning and the solution of complex differential equations.
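To make the idea of a fractional Jacobi basis function concrete, here is a minimal sketch in plain Python. It is not the authors' implementation. It assumes the fractional Jacobi function is the classical Jacobi polynomial evaluated at the mapped argument 2x^γ − 1 for x in [0, 1], and that an arbitrary real input is first squashed into [0, 1] with a sigmoid. Both the mapping and the squashing step are assumptions made for illustration, and the names `jacobi`, `fractional_jacobi`, and `fkan_activation` are hypothetical. In a real network α, β, and γ would be trainable parameters; here they are ordinary arguments.

```python
import math


def jacobi(n, alpha, beta, t):
    """Evaluate the Jacobi polynomial P_n^(alpha, beta)(t) with the
    standard three-term recurrence (requires alpha, beta > -1)."""
    if n == 0:
        return 1.0
    p_prev = 1.0
    p_curr = 0.5 * ((alpha + beta + 2.0) * t + (alpha - beta))
    for k in range(2, n + 1):
        c = 2.0 * k + alpha + beta
        lhs = 2.0 * k * (k + alpha + beta) * (c - 2.0)
        a = (c - 1.0) * (c * (c - 2.0) * t + alpha ** 2 - beta ** 2)
        b = 2.0 * (k + alpha - 1.0) * (k + beta - 1.0) * c
        p_prev, p_curr = p_curr, (a * p_curr - b * p_prev) / lhs
    return p_curr


def fractional_jacobi(x, n, alpha, beta, gamma):
    """Assumed definition: P_n^(alpha, beta)(2 * x**gamma - 1) on [0, 1].
    A non-integer gamma makes the result non-polynomial in x."""
    if not 0.0 <= x <= 1.0:
        raise ValueError("x must lie in [0, 1]")
    if gamma <= 0.0:
        raise ValueError("gamma must be positive")
    return jacobi(n, alpha, beta, 2.0 * x ** gamma - 1.0)


def fkan_activation(z, n, alpha, beta, gamma):
    """Hypothetical activation: squash a real input into (0, 1) with a
    sigmoid (an illustrative choice), then apply the fractional basis."""
    x = 1.0 / (1.0 + math.exp(-z))
    return fractional_jacobi(x, n, alpha, beta, gamma)


if __name__ == "__main__":
    # With alpha = beta = 0 and gamma = 1 this reduces to a shifted
    # Legendre polynomial; the input may be negative or positive.
    for z in (-2.0, 0.0, 2.0):
        print(z, fkan_activation(z, n=2, alpha=0.0, beta=0.0, gamma=1.0))
```

Setting γ = 1 recovers the ordinary shifted Jacobi polynomial, which gives a simple sanity check against known Legendre values when α = β = 0. The sigmoid step is what lets the function accept both positive and negative inputs in this sketch.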

fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions

rKAN: Rational Kolmogorov-Arnold Networks

FC-KAN: Function Combinations in Kolmogorov-Arnold Networks

KAN: Kolmogorov-Arnold Networks

F-KANs: Federated Kolmogorov-Arnold Networks

Finite basis Kolmogorov-Arnold networks: domain decomposition for data-driven and physics-informed problems

BSRBF-KAN: A combination of B-splines and Radial Basis Functions in Kolmogorov-Arnold Networks

Evaluating Federated Kolmogorov-Arnold Networks on Non-IID Data

GKAN: Graph Kolmogorov-Arnold Networks

A Survey on Kolmogorov-Arnold Network

QKAN: Quantum Kolmogorov-Arnold Networks

Convolutional Kolmogorov-Arnold Networks

Kolmogorov-Arnold Networks (KAN) for Time Series Classification and Robust Analysis

On the expressiveness and spectral bias of KANs

Smooth Kolmogorov Arnold networks enabling structural knowledge representation

EKAN: Equivariant Kolmogorov-Arnold Networks

Kolmogorov-Arnold Networks (KANs) for Time Series Analysis

Bayesian Kolmogorov Arnold Networks (Bayesian_KANs): A Probabilistic Approach to Enhance Accuracy and Interpretability

Kolmogorov Arnold Informed neural network: A physics-informed deep learning framework for solving forward and inverse problems based on Kolmogorov Arnold Networks

Kolmogorov-Arnold Network Autoencoders

On Training of Kolmogorov-Arnold Networks