NTK-Guided Few-Shot Class Incremental Learning

Jingren Liu,Zhong Ji,Yanwei Pang,YunLong Yu
2024-09-24
Abstract:The proliferation of Few-Shot Class Incremental Learning (FSCIL) methodologies has highlighted the critical challenge of maintaining robust anti-amnesia capabilities in FSCIL learners. In this paper, we present a novel conceptualization of anti-amnesia in terms of mathematical generalization, leveraging the Neural Tangent Kernel (NTK) perspective. Our method focuses on two key aspects: ensuring optimal NTK convergence and minimizing NTK-related generalization loss, which serve as the theoretical foundation for cross-task generalization. To achieve global NTK convergence, we introduce a principled meta-learning mechanism that guides optimization within an expanded network architecture. Concurrently, to reduce the NTK-related generalization loss, we systematically optimize its constituent factors. Specifically, we initiate self-supervised pre-training on the base session to enhance NTK-related generalization potential. These self-supervised weights are then carefully refined through curricular alignment, followed by the application of dual NTK regularization tailored specifically for both convolutional and linear layers. Through the combined effects of these measures, our network acquires robust NTK properties, ensuring optimal convergence and stability of the NTK matrix and minimizing the NTK-related generalization loss, significantly enhancing its theoretical generalization. On popular FSCIL benchmark datasets, our NTK-FSCIL surpasses contemporary state-of-the-art approaches, elevating end-session accuracy by 2.9\% to 9.3\%.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
The paper aims to address the key challenge in Few-Shot Class-Incremental Learning (FSCIL), which is how to maintain strong anti-forgetting ability while continuously learning incremental classes with limited data samples. Specifically, the paper proposes applying Neural Tangent Kernel (NTK) theory to FSCIL to enhance the model's generalization ability. ### Main Problems Addressed by the Paper: 1. **Anti-Forgetting and Model Generalization**: Existing FSCIL methods mainly focus on preserving existing knowledge in each session and addressing the problem of catastrophic forgetting. However, these methods are insufficient in terms of model generalization. This paper introduces NTK theory to understand the generalization characteristics of neural networks in FSCIL. 2. **NTK Convergence and Generalization Loss**: The paper proposes a new conceptual approach that transforms anti-forgetting into a mathematical generalization problem, using the NTK perspective to ensure NTK convergence and minimize NTK-related generalization loss. 3. **Optimization Strategies**: To achieve global NTK convergence, the authors introduce a meta-learning mechanism based on an extended network architecture. Additionally, to reduce NTK-related generalization loss, the components are systematically optimized. Specific measures include self-supervised pre-training, curriculum alignment, and dual NTK regularization. ### Summary: The core objective of the paper is to improve the model's generalization ability and anti-forgetting capability within the FSCIL framework through NTK theory. Through theoretical analysis and experimental validation, the authors demonstrate that this approach significantly improves the accuracy of the final session on popular FSCIL benchmark datasets, thereby surpassing current state-of-the-art methods.