Wav-KAN: Wavelet Kolmogorov-Arnold Networks

Zavareh Bozorgasl,Hao Chen
2024-05-27
Abstract:In this paper, we introduce Wav-KAN, an innovative neural network architecture that leverages the Wavelet Kolmogorov-Arnold Networks (Wav-KAN) framework to enhance interpretability and performance. Traditional multilayer perceptrons (MLPs) and even recent advancements like Spl-KAN face challenges related to interpretability, training speed, robustness, computational efficiency, and performance. Wav-KAN addresses these limitations by incorporating wavelet functions into the Kolmogorov-Arnold network structure, enabling the network to capture both high-frequency and low-frequency components of the input data efficiently. Wavelet-based approximations employ orthogonal or semi-orthogonal basis and maintain a balance between accurately representing the underlying data structure and avoiding overfitting to the noise. While continuous wavelet transform (CWT) has a lot of potentials, we also employed discrete wavelet transform (DWT) for multiresolution analysis, which obviated the need for recalculation of the previous steps in finding the details. Analogous to how water conforms to the shape of its container, Wav-KAN adapts to the data structure, resulting in enhanced accuracy, faster training speeds, and increased robustness compared to Spl-KAN and MLPs. Our results highlight the potential of Wav-KAN as a powerful tool for developing interpretable and high-performance neural networks, with applications spanning various fields. This work sets the stage for further exploration and implementation of Wav-KAN in frameworks such as PyTorch and TensorFlow, aiming to make wavelets in KAN as widespread as activation functions like ReLU and sigmoid in universal approximation theory (UAT). The codes to replicate the simulations are available at this https URL.
Machine Learning,Artificial Intelligence,Signal Processing
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper primarily addresses the following issues: 1. **Interpretability of Neural Networks**: - Traditional Multilayer Perceptrons (MLPs) and some recent advancements (such as Spl-KAN) lack interpretability. Wav-KAN enhances the interpretability and performance of the network by introducing wavelet functions. 2. **Training Speed, Robustness, and Computational Efficiency**: - Traditional neural networks face challenges in training speed, robustness, and computational efficiency. Wav-KAN addresses these issues by combining wavelet transforms with the Kolmogorov-Arnold network structure. 3. **Capturing High-Frequency and Low-Frequency Components**: - Wav-KAN efficiently captures high-frequency and low-frequency components in the input data, thereby improving the overall performance of the network. 4. **Avoiding Overfitting**: - Wavelet transforms avoid overfitting to noise while accurately representing the data structure, thus enhancing the model's generalization ability. Through these improvements, Wav-KAN not only enhances the performance of neural networks but also increases their interpretability, making them suitable for various fields such as drug discovery and genomics. Additionally, the paper demonstrates the advantages of Wav-KAN over Spl-KAN and traditional MLPs in terms of training speed and accuracy, providing experimental results to support these conclusions.