NeCa: Network Calibration for Class Incremental Learning.

Zhenyao Zhang,Lijun Zhang
DOI: https://doi.org/10.1007/978-3-031-47634-1_29
2023-01-01
Abstract:Class incremental learning (CIL) aims to continually learn unseen classes in new tasks without forgetting the previous ones. However, deep neural networks are prone to make a biased prediction towards classes in the most recently learned task, dubbed task-recency bias. Most recent studies make a post-training adjustment on the last fully connected layer to alleviate this problem but ignore the feature extractor. This work proposes a novel training framework termed network calibration (NeCa) that simultaneously adjusts the last fully connected layer and the feature extractor. Specifically, we combine the post-training adjustment process with the training process into a balanced learning module, whose loss function is corrected based on the prior probabilities of classes. In this module, the parameters of the whole network are well-calibrated via backpropagation. Additional knowledge transmission and decaying regularization modules further mitigate catastrophic forgetting in CIL. Experiment results manifest that NeCa outperforms the state-of-the-art methods on three mainstream datasets, including MNIST, CIFAR-100, and ImageNet-100, which validates the effectiveness of our framework. Furthermore, we conduct experiments with the prevalent vision transformer backbone, and the consistently excellent performance demonstrates that NeCa is also competently suited for attention-based models.
What problem does this paper attempt to address?