NTK-Guided Few-Shot Class Incremental Learning

Jingren Liu, Zhong Ji, Yanwei Pang, YunLong Yu

2024-09-24

Abstract: The proliferation of Few-Shot Class Incremental Learning (FSCIL) methodologies has highlighted the critical challenge of maintaining robust anti-amnesia capabilities in FSCIL learners. In this paper, we present a novel conceptualization of anti-amnesia in terms of mathematical generalization, leveraging the Neural Tangent Kernel (NTK) perspective. Our method focuses on two key aspects: ensuring optimal NTK convergence and minimizing NTK-related generalization loss, which serve as the theoretical foundation for cross-task generalization. To achieve global NTK convergence, we introduce a principled meta-learning mechanism that guides optimization within an expanded network architecture. Concurrently, to reduce the NTK-related generalization loss, we systematically optimize its constituent factors. Specifically, we initiate self-supervised pre-training on the base session to enhance NTK-related generalization potential. These self-supervised weights are then carefully refined through curricular alignment, followed by the application of dual NTK regularization tailored specifically for both convolutional and linear layers. Through the combined effects of these measures, our network acquires robust NTK properties, ensuring optimal convergence and stability of the NTK matrix and minimizing the NTK-related generalization loss, significantly enhancing its theoretical generalization. On popular FSCIL benchmark datasets, our NTK-FSCIL surpasses contemporary state-of-the-art approaches, elevating end-session accuracy by 2.9% to 9.3%.

Machine Learning, Artificial Intelligence

What problem does this paper attempt to address?

The paper addresses a key challenge in Few-Shot Class-Incremental Learning (FSCIL): how to retain strong anti-forgetting ability while continuously learning new classes from only a few samples each. Specifically, it applies Neural Tangent Kernel (NTK) theory to FSCIL to improve the model's generalization ability.

### Main Problems Addressed by the Paper:

1. **Anti-Forgetting and Model Generalization**: Existing FSCIL methods mainly focus on preserving previously learned knowledge in each session to counter catastrophic forgetting, but they pay little attention to model generalization. This paper introduces NTK theory to analyze the generalization characteristics of neural networks in FSCIL.
2. **NTK Convergence and Generalization Loss**: The paper recasts anti-forgetting as a mathematical generalization problem and uses the NTK perspective to pursue two goals: ensuring NTK convergence and minimizing the NTK-related generalization loss.
3. **Optimization Strategies**: To achieve global NTK convergence, the authors introduce a meta-learning mechanism that guides optimization within an expanded network architecture. To reduce the NTK-related generalization loss, they systematically optimize its constituent factors through self-supervised pre-training on the base session, curricular alignment, and dual NTK regularization for convolutional and linear layers.

### Summary:

The core objective of the paper is to improve both generalization and anti-forgetting capability within the FSCIL framework through NTK theory. Through theoretical analysis and experimental validation, the authors report that the approach raises end-session accuracy on popular FSCIL benchmark datasets by 2.9% to 9.3% over contemporary state-of-the-art methods.
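To make the NTK perspective more concrete, the sketch below computes an empirical NTK matrix for a toy network. It relies only on the standard definition of the empirical kernel, $\Theta(x, x') = \nabla_\theta f(x)^\top \nabla_\theta f(x')$, and is not the paper's implementation. The two-layer tanh network, the $1/\sqrt{m}$ scaling, and the helper names `init_params`, `param_gradient`, and `empirical_ntk` are all assumptions chosen for illustration.

```python
import math
import random


def init_params(d_in, width, seed=0):
    """Randomly initialise a toy two-layer network (illustrative assumption)."""
    rng = random.Random(seed)
    W = [[rng.gauss(0.0, 1.0) for _ in range(d_in)] for _ in range(width)]
    a = [rng.gauss(0.0, 1.0) for _ in range(width)]
    return W, a


def param_gradient(x, W, a):
    """Flattened gradient of f(x) = (1/sqrt(m)) * sum_k a_k * tanh(w_k . x)
    with respect to all parameters (every w_k and every a_k)."""
    m = len(a)
    scale = 1.0 / math.sqrt(m)
    grad = []
    for w_k, a_k in zip(W, a):
        h = math.tanh(sum(wi * xi for wi, xi in zip(w_k, x)))
        # df/dw_k = scale * a_k * (1 - tanh^2) * x
        grad.extend(scale * a_k * (1.0 - h * h) * xi for xi in x)
        # df/da_k = scale * tanh(w_k . x)
        grad.append(scale * h)
    return grad


def empirical_ntk(xs, W, a):
    """Gram matrix of parameter gradients: K[i][j] = <grad f(x_i), grad f(x_j)>."""
    grads = [param_gradient(x, W, a) for x in xs]
    return [[sum(gi * gj for gi, gj in zip(g1, g2)) for g2 in grads]
            for g1 in grads]


# Usage: a 3x3 kernel matrix over three toy inputs.
W, a = init_params(d_in=2, width=8, seed=0)
xs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
K = empirical_ntk(xs, W, a)
for row in K:
    print([round(v, 4) for v in row])
```

Because the matrix is a Gram matrix of gradients, it is symmetric and positive semi-definite by construction. Loosely speaking, a kernel that stays stable as training proceeds is the kind of property the NTK convergence and regularization goals above are concerned with; how the paper measures or enforces this is not shown here.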

A Cognition-Driven Framework for Few-Shot Class-Incremental Learning

Analogical Learning-Based Few-Shot Class-Incremental Learning

Few-Shot Incremental Learning with Continually Evolved Classifiers

Few-shot class-incremental learning based on representation enhancement

Pseudo Initialization Based Few-Shot Class Incremental Learning

Few Shot Class Incremental Learning via Efficient Prototype Replay and Calibration

Bias Mitigating Few-Shot Class-Incremental Learning

GKEAL: Gaussian Kernel Embedded Analytic Learning for Few-Shot Class Incremental Task

Rethinking Few-shot Class-incremental Learning: Learning from Yourself

Self-supervised Contrastive Feature Refinement for Few-Shot Class-Incremental Learning

FSCIL-EACA: Few-Shot Class-Incremental Learning Network Based on Embedding Augmentation and Classifier Adaptation for Image Classification

MetaFSCIL: A Meta-Learning Approach for Few-Shot Class Incremental Learning

Neural Collapse Inspired Feature-Classifier Alignment for Few-Shot Class Incremental Learning

Multimodal Parameter-Efficient Few-Shot Class Incremental Learning

Few-Shot Class-Incremental Learning Via Class-Aware Bilateral Distillation

Enhanced Few-Shot Class-Incremental Learning via Ensemble Models

Model Attention Expansion for Few-Shot Class-Incremental Learning

Dynamic Support Network for Few-Shot Class Incremental Learning

Knowledge Transfer-Driven Few-Shot Class-Incremental Learning