DASK: Distribution Rehearsing via Adaptive Style Kernel Learning for Exemplar-Free Lifelong Person Re-Identification

Kunlun Xu,Chenghao Jiang,Peixi Xiong,Yuxin Peng,Jiahuan Zhou
2024-12-12
Abstract:Lifelong person re-identification (LReID) is an important but challenging task that suffers from catastrophic forgetting due to significant domain gaps between training steps. Existing LReID approaches typically rely on data replay and knowledge distillation to mitigate this issue. However, data replay methods compromise data privacy by storing historical exemplars, while knowledge distillation methods suffer from limited performance due to the cumulative forgetting of undistilled knowledge. To overcome these challenges, we propose a novel paradigm that models and rehearses the distribution of the old domains to enhance knowledge consolidation during the new data learning, possessing a strong anti-forgetting capacity without storing any exemplars. Specifically, we introduce an exemplar-free LReID method called Distribution Rehearsing via Adaptive Style Kernel Learning (DASK). DASK includes a Distribution Rehearser Learning mechanism that learns to transform arbitrary distribution data into the current data style at each learning step. To enhance the style transfer capacity of DRL, an Adaptive Kernel Prediction network is explored to achieve an instance-specific distribution adjustment. Additionally, we design a Distribution Rehearsing-driven LReID Training module, which rehearses old distribution based on the new data via the old AKPNet model, achieving effective new-old knowledge accumulation under a joint knowledge consolidation scheme. Experimental results show our DASK outperforms the existing methods by 3.6%-6.8% and 4.5%-6.5% on anti-forgetting and generalization capacity, respectively. Our code is available at <a class="link-external link-https" href="https://github.com/zhoujiahuan1991/AAAI2025-DASK" rel="external noopener nofollow">this https URL</a>
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to effectively alleviate the catastrophic forgetting phenomenon while protecting data privacy in the Lifelong Person Re - identification (LReID) task. Specifically, the paper proposes a new method named "Distribution - Aware Style Kernel Learning for Rehearsal" (DASK), which aims to enhance knowledge consolidation by simulating the data distribution of the old domain without storing any historical samples, thereby improving the model's anti - forgetting ability during the learning process of new data. ### Background and Challenges of the Paper 1. **Lifelong Person Re - identification (LReID)**: - **Objective**: Train the model gradually on continuously changing datasets so that it can adapt to data from different domains. - **Main Challenge**: Catastrophic Forgetting, that is, after the model learns new data, its performance on previously learned data drops sharply. 2. **Limitations of Existing Methods**: - **Data Replay Methods**: Replay by storing historical samples to maintain old knowledge. However, this method has data privacy issues and storage burdens. - **Knowledge Distillation Methods**: Retain historical knowledge by maintaining feature consistency between new and old models. However, due to the significant domain gap between new and old data, these methods have limited performance in anti - forgetting ability. ### Solutions Proposed in the Paper 1. **DASK Method**: - **Core Idea**: Generate data in the old - domain style to simulate the distribution of the old domain, thereby achieving knowledge consolidation without storing historical samples. - **Main Components**: - **Distribution Rehearsal Module (DRM)**: Use the Adaptive Kernel Prediction Network (AKPNet) to generate distribution - shift kernels and transform new data into the old - domain style. - **Joint Knowledge Consolidation Module (JKCM)**: Combine real new data and generated old - domain - style data, and achieve the accumulation of new and old knowledge through a joint - learning loss function. 2. **Technical Details**: - **Distribution - Shift Kernel Generation**: - For each input image \(x_i\), AKPNet generates a distribution - shift kernel \(k_i\) and transforms \(x_i\) into an old - domain - style image \(x_i^*\) through a convolution operation: \[ x_i^*=k_i\oplus x_i \] - **Joint Knowledge Consolidation**: - Calculate the cross - instance similarity matrices \(S_{t - 1}\) and \(S_t\) between new and old models, and adopt the knowledge distillation loss \(L_{SKD}\) to maintain the prediction similarity between new and old models: \[ L_{SKD}=\frac{1}{B}\sum_{i = 1}^{B}KL\left(\frac{(S_{t - 1})_i}{\|(S_{t - 1})_i\|}\Bigg\|\frac{(S_t)_i}{\|(S_t)_i\|}\right) \] - Use the classic ReID loss \(L_{ReID}\) to learn new knowledge, which consists of the triplet loss \(L_{Tri}\) and the cross - entropy loss \(L_{CE}\): \[ L_{ReID}=L_{Tri}+L_{CE} \] ### Experimental Results 1. **Datasets**: - **Training Subsets**: Market - 1501, DukeMTMC - reID, CUHK - SYSU, MSMT17, CUHK03. - **Testing Subsets**: CUHK01, CUHK02, VIPeR, PRID, i - LIDS, GRID, SenseReID. 2. **Evaluation Metrics**: - **Mean Average Precision (mAP)** and **Rank@1 Accuracy (R@1)**. 3. **Experimental Results**: