Abstract:We introduce KANICE (Kolmogorov-Arnold Networks with Interactive Convolutional Elements), a novel neural architecture that combines Convolutional Neural Networks (CNNs) with Kolmogorov-Arnold Network (KAN) principles. KANICE integrates Interactive Convolutional Blocks (ICBs) and KAN linear layers into a CNN framework. This leverages KANs' universal approximation capabilities and ICBs' adaptive feature learning. KANICE captures complex, non-linear data relationships while enabling dynamic, context-dependent feature extraction based on the Kolmogorov-Arnold representation theorem. We evaluated KANICE on four datasets: MNIST, Fashion-MNIST, EMNIST, and SVHN, comparing it against standard CNNs, CNN-KAN hybrids, and ICB variants. KANICE consistently outperformed baseline models, achieving 99.35% accuracy on MNIST and 90.05% on the SVHN dataset. Furthermore, we introduce KANICE-mini, a compact variant designed for efficiency. A comprehensive ablation study demonstrates that KANICE-mini achieves comparable performance to KANICE with significantly fewer parameters. KANICE-mini reached 90.00% accuracy on SVHN with 2,337,828 parameters, compared to KANICE's 25,432,000. This study highlights the potential of KAN-based architectures in balancing performance and computational efficiency in image classification tasks. Our work contributes to research in adaptive neural networks, integrates mathematical theorems into deep learning architectures, and explores the trade-offs between model complexity and performance, advancing computer vision and pattern recognition. The source code for this paper is publicly accessible through our GitHub repository (<a class="link-external link-https" href="https://github.com/m-ferdaus/kanice" rel="external noopener nofollow">this https URL</a>).

What problem does this paper attempt to address?

### What problems does this paper attempt to solve? The paper "KANICE: Kolmogorov - Arnold Networks with Interactive Convolutional Elements" aims to solve the following problems: 1. **Limitations of standard convolutional neural networks (CNNs)**: - **Long - range dependencies**: Traditional CNNs are insufficient in handling long - range dependencies, especially in complex image classification tasks. - **Adaptability**: Traditional CNNs have limited adaptability to different input distributions and it is difficult to dynamically adjust the feature extraction method. - **Expressive power**: The expressive power of standard linear layers is limited and it is difficult to capture complex non - linear relationships. 2. **Combining the advantages of Kolmogorov - Arnold networks (KAN) and interactive convolutional blocks (ICB)**: - **Universal approximation ability of KAN**: Based on the Kolmogorov - Arnold representation theorem, KAN can approximate any continuous multivariable function through the combination of univariate functions, thereby enhancing the expressive power and generalization ability of the model. - **Adaptive feature learning of ICB**: ICB realizes dynamic, context - related feature extraction through multi - scale convolution and element - level multiplication, improving the model's adaptability to complex image data. 3. **Balancing performance and computational efficiency**: - **KANICE - mini**: A compact variant KANICE - mini is proposed, aiming to maintain high performance while reducing the number of parameters and improving the computational efficiency of the model. ### Specific objectives - **Improve the accuracy of image classification**: By combining KAN and ICB, KANICE shows higher accuracy on multiple image classification datasets such as MNIST, Fashion - MNIST, EMNIST and SVHN. - **Enhance the generalization ability of the model**: KANICE better adapts to different data distributions through improved feature extraction and expressive power, and improves the performance of the model on unseen data. - **Explore the trade - off between model complexity and performance**: By introducing KANICE - mini, the impact of reduced model complexity on performance is studied, providing more efficient solutions for practical applications. ### Experimental results - **Performance evaluation**: KANICE performs well on multiple datasets, especially achieving an accuracy of 99.35% on the MNIST dataset and 90.05% on the SVHN dataset. - **Ablation study**: KANICE - mini can still achieve performance comparable to that of KANICE with a significant reduction in the number of parameters, demonstrating its efficiency. ### Conclusion This paper proposes a new neural network architecture KANICE by combining KAN and ICB, which effectively solves the limitations of traditional CNNs in complex image classification tasks. At the same time, by introducing KANICE - mini, the performance and computational efficiency of the model are further balanced, providing more choices for practical applications.

KANICE: Kolmogorov-Arnold Networks with Interactive Convolutional Elements

Kolmogorov-Arnold Convolutions: Design Principles and Empirical Studies

Demonstrating the Efficacy of Kolmogorov-Arnold Networks in Vision Tasks

Convolutional Kolmogorov-Arnold Networks

Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision

KANs for Computer Vision: An Experimental Study

Kolmogorov-Arnold Network Autoencoders

Residual Kolmogorov-Arnold Network for Enhanced Deep Learning

How to Learn More? Exploring Kolmogorov-Arnold Networks for Hyperspectral Image Classification

A preliminary study on continual learning in computer vision using Kolmogorov-Arnold Networks

KAN: Kolmogorov-Arnold Networks

A Survey on Kolmogorov-Arnold Network

Activation Space Selectable Kolmogorov-Arnold Networks

KKANs: Kurkova-Kolmogorov-Arnold Networks and Their Learning Dynamics

Suitability of KANs for Computer Vision: A preliminary investigation

Exploring Kolmogorov-Arnold Networks for Interpretable Time Series Classification

GKAN: Graph Kolmogorov-Arnold Networks

KAN 2.0: Kolmogorov-Arnold Networks Meet Science

How to Learn More? Exploring Kolmogorov–Arnold Networks for Hyperspectral Image Classification

Kolmogorov-Arnold Networks for Genomic Tasks

A Revolutionary Neural Network Architecture with Interpretability and Flexibility Based on Kolmogorov–Arnold for Solar Radiation and Temperature Forecasting