Abstract:Despite the strong performance in many computer vision tasks, Convolutional Neural Networks (CNNs) can sometimes struggle to efficiently capture long-range, complex non-linear dependencies in deeper layers of the network. We address this limitation by introducing Residual KAN, which incorporates the Kolmogorov-Arnold Network (KAN) within the CNN framework as a residual component. Our approach uses Chebyshev polynomials as the basis for KAN convolutions that enables more expressive and adaptive feature representations while maintaining computational efficiency. The proposed RKAN blocks, when integrated into established architectures such as ResNet and DenseNet, offer consistent improvements over the baseline models on various well-known benchmarks. Our results demonstrate the potential of RKAN to enhance the capabilities of deep CNNs in visual data.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the limitation of convolutional neural networks (CNNs) in efficiently capturing long - distance and complex non - linear dependencies in deep networks. Specifically, although CNNs perform well in many computer vision tasks, in the deep layers of the network, they sometimes have difficulty effectively capturing these complex dependencies. To this end, the author introduced the Residual Kolmogorov - Arnold network (Residual KAN, RKAN). By adding the Kolmogorov - Arnold network (KAN) based on Chebyshev polynomials as a residual component in the CNN framework, the expressive power and adaptability of CNN are enhanced while maintaining computational efficiency. ### Main Contributions 1. **Introduction of KAN**: Using the Kolmogorov - Arnold representation theorem, KAN can represent any multivariate continuous function, thus providing a more flexible and powerful neural network architecture. 2. **Chebyshev Polynomials**: Using Chebyshev polynomials as the basis for KAN convolution enhances the expressive power and adaptability of feature representation. 3. **Residual Component**: Integrating KAN as a residual component into existing CNN architectures, such as ResNet and DenseNet, to improve model performance. 4. **Experimental Verification**: Experiments were carried out on multiple well - known benchmark datasets (such as CIFAR - 100, Food - 101, Tiny ImageNet and ILSVRC - 2012) to verify the effectiveness of RKAN. ### Key Technologies - **KAN Convolution**: By extracting 3×3 patches and using Chebyshev polynomials for transformation, non - linearity is introduced. - **Residual Connection**: Combining the outputs of the main path and the residual path through element - wise addition enhances the gradient flow and facilitates the training of deeper networks. - **Normalization**: Using different normalization methods (such as tanh normalization, standardization and min - max scaling) to process input data to ensure that it conforms to the input range of Chebyshev polynomials. ### Experimental Results - **Performance Improvement**: On multiple datasets, the RKAN model shows a significant performance improvement compared to the baseline model, especially in deeper and wider models. - **Computational Efficiency**: Although adding the RKAN module will increase a certain amount of computational overhead, it still maintains a relatively high computational efficiency overall. - **Robustness**: On different datasets and network architectures, the RKAN module shows good robustness, especially in preventing over - fitting. ### Conclusion By integrating KAN as a residual component into CNN, RKAN effectively solves the problem of capturing complex non - linear dependencies in deep networks, improves the expressive power and performance of the model, and at the same time maintains computational efficiency. This method shows significant advantages in multiple computer vision tasks.

Residual Kolmogorov-Arnold Network for Enhanced Deep Learning

Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision

Demonstrating the Efficacy of Kolmogorov-Arnold Networks in Vision Tasks

Convolutional Kolmogorov-Arnold Networks

A Revolutionary Neural Network Architecture with Interpretability and Flexibility Based on Kolmogorov–Arnold for Solar Radiation and Temperature Forecasting

Kolmogorov-Arnold Network Autoencoders

Kolmogorov-Arnold Convolutions: Design Principles and Empirical Studies

ReLU-KAN: New Kolmogorov-Arnold Networks that Only Need Matrix Addition, Dot Multiplication, and ReLU

Reimagining Linear Probing: Kolmogorov-Arnold Networks in Transfer Learning

A Survey on Kolmogorov-Arnold Network

KKANs: Kurkova-Kolmogorov-Arnold Networks and Their Learning Dynamics

A preliminary study on continual learning in computer vision using Kolmogorov-Arnold Networks

Kolmogorov-Arnold Network for Satellite Image Classification in Remote Sensing

A Comprehensive Survey on Kolmogorov Arnold Networks (KAN)

GKAN: Graph Kolmogorov-Arnold Networks

KAN: Kolmogorov-Arnold Networks

KANICE: Kolmogorov-Arnold Networks with Interactive Convolutional Elements

Wav-KAN: Wavelet Kolmogorov-Arnold Networks

Kolmogorov-Arnold Networks for Genomic Tasks

Generalization Bounds and Model Complexity for Kolmogorov-Arnold Networks