Abstract: Lifelong person re-identification (LReID) is an important but challenging task that suffers from catastrophic forgetting due to significant domain gaps between training steps. Existing LReID approaches typically rely on data replay and knowledge distillation to mitigate this issue. However, data replay methods compromise data privacy by storing historical exemplars, while knowledge distillation methods suffer from limited performance due to the cumulative forgetting of undistilled knowledge. To overcome these challenges, we propose a novel paradigm that models and rehearses the distribution of the old domains to enhance knowledge consolidation while learning new data, possessing a strong anti-forgetting capacity without storing any exemplars. Specifically, we introduce an exemplar-free LReID method called Distribution Rehearsing via Adaptive Style Kernel Learning (DASK). DASK includes a Distribution Rehearser Learning (DRL) mechanism that learns to transform arbitrary distribution data into the current data style at each learning step. To enhance the style transfer capacity of DRL, an Adaptive Kernel Prediction network (AKPNet) is explored to achieve an instance-specific distribution adjustment. Additionally, we design a Distribution Rehearsing-driven LReID Training module, which rehearses the old distribution based on the new data via the old AKPNet model, achieving effective new-old knowledge accumulation under a joint knowledge consolidation scheme. Experimental results show our DASK outperforms the existing methods by 3.6%-6.8% and 4.5%-6.5% on anti-forgetting and generalization capacity, respectively. Our code is available at https://github.com/zhoujiahuan1991/AAAI2025-DASK

What problem does this paper attempt to address?

This paper addresses how to alleviate catastrophic forgetting in Lifelong Person Re-identification (LReID) while protecting data privacy. Specifically, it proposes a method named "Distribution Rehearsing via Adaptive Style Kernel Learning" (DASK), which strengthens knowledge consolidation by rehearsing the data distribution of the old domains without storing any historical samples, thereby improving the model's anti-forgetting ability while it learns new data.

### Background and Challenges of the Paper

1. **Lifelong Person Re-identification (LReID)**:
   - **Objective**: Train the model step by step on continuously changing datasets so that it can adapt to data from different domains.
   - **Main Challenge**: Catastrophic forgetting, that is, after the model learns new data, its performance on previously learned data drops sharply.
2. **Limitations of Existing Methods**:
   - **Data Replay Methods**: Store historical samples and replay them to maintain old knowledge. This raises data privacy issues and adds a storage burden.
   - **Knowledge Distillation Methods**: Retain historical knowledge by maintaining feature consistency between the new and old models. Because of the significant domain gap between new and old data, these methods have limited anti-forgetting ability.

### Solutions Proposed in the Paper

1. **DASK Method**:
   - **Core Idea**: Generate data in the old-domain style to rehearse the distribution of the old domain, thereby achieving knowledge consolidation without storing historical samples.
   - **Main Components**:
     - **Distribution Rehearser Learning (DRL)**: At each learning step, train the Adaptive Kernel Prediction Network (AKPNet) to generate instance-specific distribution-shift kernels that transform input data into the current domain's style. The AKPNet kept from the previous step is then used to transform new data into the old-domain style.
     - **Distribution Rehearsing-driven LReID Training**: Combine real new data with the generated old-domain-style data, and accumulate new and old knowledge through a joint knowledge consolidation loss.
2. **Technical Details**:
   - **Distribution-Shift Kernel Generation**:
     - For each input image \(x_i\), AKPNet generates a distribution-shift kernel \(k_i\) and transforms \(x_i\) into an old-domain-style image \(x_i^*\) through a convolution operation:
       \[ x_i^* = k_i \oplus x_i \]
   - **Joint Knowledge Consolidation**:
     - Calculate the cross-instance similarity matrices \(S_{t-1}\) and \(S_t\) of the old and new models, and apply the knowledge distillation loss \(L_{SKD}\) to keep the predicted similarities of the new model consistent with those of the old model:
       \[ L_{SKD} = \frac{1}{B}\sum_{i=1}^{B} KL\left(\frac{(S_{t-1})_i}{\|(S_{t-1})_i\|} \Bigg\| \frac{(S_t)_i}{\|(S_t)_i\|}\right) \]
     - Use the classic ReID loss \(L_{ReID}\) to learn new knowledge, which consists of the triplet loss \(L_{Tri}\) and the cross-entropy loss \(L_{CE}\):
       \[ L_{ReID} = L_{Tri} + L_{CE} \]

### Experimental Results

1. **Datasets**:
   - **Training Subsets**: Market-1501, DukeMTMC-reID, CUHK-SYSU, MSMT17, CUHK03.
   - **Testing Subsets**: CUHK01, CUHK02, VIPeR, PRID, i-LIDS, GRID, SenseReID.
2. **Evaluation Metrics**:
   - **Mean Average Precision (mAP)** and **Rank@1 Accuracy (R@1)**.
3. **Experimental Results**:
   - According to the abstract, DASK outperforms existing methods by 3.6%-6.8% in anti-forgetting capacity and by 4.5%-6.5% in generalization capacity.
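
The similarity distillation loss \(L_{SKD}\) described in the summary can be illustrated with a small sketch. This is not the authors' implementation: the summary does not say how the similarity matrices are built or which norm is used, so the code below assumes exponentiated cosine similarities (to keep every entry positive) and L1 row normalization (so each row is a valid distribution for the KL term). The function names `affinity_matrix`, `l1_normalise`, `kl_divergence`, and `skd_loss` are hypothetical.

```python
import math


def affinity_matrix(features):
    """Build a B x B cross-instance similarity matrix from B feature vectors.

    Assumption: similarity is exp(cosine similarity), which keeps all
    entries strictly positive. The source does not specify this choice.
    """
    norms = [math.sqrt(sum(v * v for v in f)) for f in features]
    return [
        [
            math.exp(sum(a * b for a, b in zip(fi, fj)) / (ni * nj))
            for fj, nj in zip(features, norms)
        ]
        for fi, ni in zip(features, norms)
    ]


def l1_normalise(row):
    """Divide a row by its L1 norm (assumed norm) so that it sums to 1."""
    total = sum(abs(v) for v in row)
    return [v / total for v in row]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def skd_loss(s_old, s_new):
    """Average row-wise KL between normalised old and new similarity rows.

    s_old plays the role of S_{t-1} (old model) and s_new the role of
    S_t (new model); both are B x B matrices with positive entries.
    """
    batch_size = len(s_old)
    total = 0.0
    for row_old, row_new in zip(s_old, s_new):
        total += kl_divergence(l1_normalise(row_old), l1_normalise(row_new))
    return total / batch_size


# Toy features for a batch of three images (made-up values).
old_feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
new_feats = [[1.0, 0.2], [0.1, 1.0], [1.0, 0.9]]

loss_same = skd_loss(affinity_matrix(old_feats), affinity_matrix(old_feats))
loss_diff = skd_loss(affinity_matrix(old_feats), affinity_matrix(new_feats))
```

When the new model reproduces the old model's cross-instance similarities exactly, the loss is zero; any drift in the similarity structure makes it positive, which is the signal that pulls the new model back toward the old one's behavior.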

DASK: Distribution Rehearsing via Adaptive Style Kernel Learning for Exemplar-Free Lifelong Person Re-Identification

Distribution-Aware Knowledge Prototyping for Non-Exemplar Lifelong Person Re-Identification

Exemplar-Free Lifelong Person Re-identification via Prompt-Guided Adaptive Knowledge Consolidation

LSTKC: Long Short-Term Knowledge Consolidation for Lifelong Person Re-identification

Lifelong Person Re-Identification via Adaptive Knowledge Accumulation

Lifelong Person Re-identification by Pseudo Task Knowledge Preservation

Distribution Aligned Semantics Adaption for Lifelong Person Re-Identification

Auto-selected Knowledge Adapters for Lifelong Person Re-identification

Lifelong Person Re-Identification via Knowledge Refreshing and Consolidation

Online Pseudo Label Generation by Hierarchical Cluster Dynamics for Adaptive Person Re-identification

Lifelong Unsupervised Domain Adaptive Person Re-identification with Coordinated Anti-forgetting and Adaptation

Domain Consistency Representation Learning for Lifelong Person Re-Identification

Patch-based Knowledge Distillation for Lifelong Person Re-Identification

Semi-Supervised Domain Generalizable Person Re-Identification

Diverse Representation Embedding for Lifelong Person Re-Identification

Dual Knowledge Distillation on Multiview Pseudo Labels for Unsupervised Person Re-Identification

Learning consistent region features for lifelong person re-identification

A Memorizing and Generalizing Framework for Lifelong Person Re-Identification

Unsupervised Lifelong Person Re-identification via Contrastive Rehearsal

Dynamic Textual Prompt For Rehearsal-free Lifelong Person Re-identification

Deep Reinforcement Active Learning for Human-in-the-Loop Person Re-Identification