Sharpness-aware gradient guidance for few-shot class-incremental learning

Runhang Chen,Xiao-Yuan Jing,Fei Wu,Haowen Chen
DOI: https://doi.org/10.1016/j.knosys.2024.112030
IF: 8.139
2024-06-02
Knowledge-Based Systems
Abstract:Few-shot class-incremental learning (FSCIL) is a challenge that requires a model to learn new classes from limited examples without forgetting the learned knowledge. A common solution freezes the parameters trained on the base task and only fine-tunes the classifier for new incremental tasks. However, this solution may not guarantee the model's generalization ability to unseen classes, resulting in model limitations in adapting to new classes. To address this issue, we propose a novel approach called sharpness-aware gradient guidance (SAGG). Specifically, the SAGG objective improves the model's generalization ability by finding parameters within flat regions of the loss landscape. We first measure the sharpness of the loss function around the current parameters by adding perturbations. Then, we jointly optimize the training loss and the perturbation loss by explicitly guiding the gradient directions. Moreover, a prototype calibration strategy is proposed to help the model adapt to new classes with limited training instances by adjusting the classifier weights. We evaluate our approach on three benchmark datasets: CIFAR100, miniImageNet, and CUB200. The empirical results show that our method significantly surpasses other competing methods in terms of average accuracy.
computer science, artificial intelligence
What problem does this paper attempt to address?