Abstract:For artificial learning systems, continual learning over time from a stream of data is essential. The burgeoning studies on supervised continual learning have achieved great progress, while the study of catastrophic forgetting in unsupervised learning is still blank. Among unsupervised learning methods, self-supervise learning method shows tremendous potential on visual representation without any labeled data at scale. To improve the visual representation of self-supervised learning, larger and more varied data is needed. In the real world, unlabeled data is generated at all times. This circumstance provides a huge advantage for the learning of the self-supervised method. However, in the current paradigm, packing previous data and current data together and training it again is a waste of time and resources. Thus, a continual self-supervised learning method is badly needed. In this paper, we make the first attempt to implement the continual contrastive self-supervised learning by proposing a rehearsal method, which keeps a few exemplars from the previous data. Instead of directly combining saved exemplars with the current data set for training, we leverage self-supervised knowledge distillation to transfer contrastive information among previous data to the current network by mimicking similarity score distribution inferred by the old network over a set of saved exemplars. Moreover, we build an extra sample queue to assist the network to distinguish between previous and current data and prevent mutual interference while learning their own feature representation. Experimental results show that our method performs well on CIFAR100 and ImageNet-Sub. Compared with self-supervised baselines, which learning tasks one by one without taking any technique, we improve the image classification top-1 accuracy by 1.60% on CIFAR100 and 2.86% on ImageNet-Sub under 10 incremental steps setting.

Contrastive Supervised Distillation for Continual Representation Learning

Continual Contrastive Self-supervised Learning for Image Classification.

Contrastive Continual Learning with Importance Sampling and Prototype-Instance Relation Distillation

Uncertainty-Aware Contrastive Distillation for Incremental Semantic Segmentation

An Attention-based Representation Distillation Baseline for Multi-Label Continual Learning

Continual Contrastive Learning for Image Classification

DCD: Discriminative and Consistent Representation Distillation

Representational Continuity for Unsupervised Continual Learning

Class Incremental Learning with Deep Contrastive Learning and Attention Distillation

Look-Ahead Selective Plasticity for Continual Learning of Visual Tasks

Learning Contrastive Self-Distillation for Ultra-Fine-Grained Visual Categorization Targeting Limited Samples

Rethinking the Representational Continuity: Towards Unsupervised Continual Learning

R epresentational c ontinuity for u nsupervised c ontinual l earning

DCCD: Reducing Neural Network Redundancy Via Distillation

Prototype-Sample Relation Distillation: Towards Replay-Free Continual Learning

Online Distillation with Continual Learning for Cyclic Domain Shifts

Centroid Distance Distillation for Effective Rehearsal in Continual Learning

Brain-Inspired Continual Learning-Robust Feature Distillation and Re-Consolidation for Class Incremental Learning

Complementary Calibration: Boosting General Continual Learning With Collaborative Distillation and Self-Supervision

Provable Contrastive Continual Learning

Adaptive Similarity Bootstrapping for Self-Distillation based Representation Learning