Contrastive Supervised Distillation for Continual Representation Learning

Tommaso Barletti,Niccolo' Biondi,Federico Pernici,Matteo Bruni,Alberto Del Bimbo
DOI: https://doi.org/10.48550/arXiv.2205.05476
2022-05-11
Computer Vision and Pattern Recognition
Abstract:In this paper, we propose a novel training procedure for the continual representation learning problem in which a neural network model is sequentially learned to alleviate catastrophic forgetting in visual search tasks. Our method, called Contrastive Supervised Distillation (CSD), reduces feature forgetting while learning discriminative features. This is achieved by leveraging labels information in a distillation setting in which the student model is contrastively learned from the teacher model. Extensive experiments show that CSD performs favorably in mitigating catastrophic forgetting by outperforming current state-of-the-art methods. Our results also provide further evidence that feature forgetting evaluated in visual retrieval tasks is not as catastrophic as in classification tasks. Code at: https://github.com/NiccoBiondi/ContrastiveSupervisedDistillation.
What problem does this paper attempt to address?