Abstract:The performance of a lifelong learning (L3) model degrades when it is trained on a series of tasks, as the geometrical formation of the embedding space changes while learning novel concepts sequentially. The majority of existing L3 approaches operate on a fixed-curvature (e.g., zero-curvature Euclidean) space that is not necessarily suitable for modeling the complex geometric structure of data. Furthermore, the distillation strategies apply constraints directly on low-dimensional embeddings, discouraging the L3 model from learning new concepts by making the model highly stable. To address the problem, we propose a distillation strategy named L3DMC that operates on mixed-curvature spaces to preserve the already-learned knowledge by modeling and maintaining complex geometrical structures. We propose to embed the projected low dimensional embedding of fixed-curvature spaces (Euclidean and hyperbolic) to higher-dimensional Reproducing Kernel Hilbert Space (RKHS) using a positive-definite kernel function to attain rich representation. Afterward, we optimize the L3 model by minimizing the discrepancies between the new sample representation and the subspace constructed using the old representation in RKHS. L3DMC is capable of adapting new knowledge better without forgetting old knowledge as it combines the representation power of multiple fixed-curvature spaces and is performed on higher-dimensional RKHS. Thorough experiments on three benchmarks demonstrate the effectiveness of our proposed distillation strategy for medical image classification in L3 settings. Our code implementation is publicly available at <a class="link-external link-https" href="https://github.com/csiro-robotics/L3DMC" rel="external noopener nofollow">this https URL</a>.

M2Distill: Multi-Modal Distillation for Lifelong Imitation Learning

Lifelong Learning Via Progressive Distillation And Retrospection

Strategy Discovery and Mixture in Lifelong Learning from Heterogeneous Demonstration

L3DMC: Lifelong Learning using Distillation via Mixed-Curvature Space

Experience Consistency Distillation Continual Reinforcement Learning for Robotic Manipulation Tasks

I$^2$MD: 3D Action Representation Learning with Inter- and Intra-modal Mutual Distillation

Multi-Modal Imitation Learning in Partially Observable Environments

Online Multi-modal Imitation Learning Via Lifelong Intention Encoding.

PolyTask: Learning Unified Policies through Behavior Distillation

M2KD: Multi-model and Multi-level Knowledge Distillation for Incremental Learning

Learning to Discern: Imitating Heterogeneous Human Demonstrations with Preference and Representation Learning

Lifelong Infinite Mixture Model Based on Knowledge-Driven Dirichlet Process

Distilled Meta-learning for Multi-Class Incremental Learning

MIND: Multi-Task Incremental Network Distillation

Reinforcement Learning via Auxiliary Task Distillation

Online Distillation with Continual Learning for Cyclic Domain Shifts

Dual Policy Distillation

DDIL: Improved Diffusion Distillation With Imitation Learning

Continual Deep Reinforcement Learning with Task-Agnostic Policy Distillation

Module-wise Adaptive Distillation for Multimodality Foundation Models

One-Shot Robust Imitation Learning for Long-Horizon Visuomotor Tasks from Unsegmented Demonstrations