Learning to Balance: Diverse Normalization for Cloth-Changing Person Re-Identification

Hongjun Wang,Jiyuan Chen,Zhengwei Yin,Xuan Song,Yinqiang Zheng
2024-10-14
Abstract:Cloth-Changing Person Re-Identification (CC-ReID) involves recognizing individuals in images regardless of clothing status. In this paper, we empirically and experimentally demonstrate that completely eliminating or fully retaining clothing features is detrimental to the task. Existing work, either relying on clothing labels, silhouettes, or other auxiliary data, fundamentally aim to balance the learning of clothing and identity features. However, we practically find that achieving this balance is challenging and nuanced. In this study, we introduce a novel module called Diverse Norm, which expands personal features into orthogonal spaces and employs channel attention to separate clothing and identity features. A sample re-weighting optimization strategy is also introduced to guarantee the opposite optimization direction. Diverse Norm presents a simple yet effective approach that does not require additional data. Furthermore, Diverse Norm can be seamlessly integrated ResNet50 and significantly outperforms the state-of-the-art methods.
Computer Vision and Pattern Recognition,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
### The Problem Addressed by the Paper The paper primarily focuses on a core issue in the task of **Cloth-Changing Person Re-Identification (CC-ReID)**: how to identify the same person wearing different clothes in images. Specifically, the authors point out that existing methods either completely eliminate clothing features or fully retain them, both of which adversely affect the task. Completely eliminating clothing features leads to performance degradation when clothing remains unchanged, while retaining too many clothing features results in poor performance when clothing changes. ### Background and Motivation 1. **Early Work**: Early research on person re-identification (ReID) assumed that people would not change clothes in a short period, so most methods relied on clothing information to identify individuals. These methods performed well on short-term datasets but showed significant performance drops on long-term datasets where people frequently change clothes. 2. **Existing Solutions**: To overcome this limitation, researchers have gradually shifted their attention to considering clothing changes during model training and testing. Mainstream solutions include: - Combining data from other modalities (such as silhouette sketches, body shape, hairstyle, etc.) to encourage the model to learn from these data. - Using manually annotated clothing labels to force the model to reduce its focus on clothing appearance. 3. **Balance Issue**: Regardless of the method, all these works essentially attempt to find an optimal balance to encode identity features and clothing features into the model's high-level representations. However, this balance is fragile, and the optimization problem itself is ill-posed, easily falling into local minima. ### Proposed Method To address the above issues, the authors introduce a new module called **Diverse Norm**, with the following main features: 1. **Feature Expansion and Orthogonalization**: Expanding personal features into an orthogonal space through orthogonalization operations to separate clothing features and identity features. 2. **Channel Attention Mechanism**: Using a channel attention mechanism to select clothing features and identity features. 3. **Sample Reweighting Strategy**: Introducing a sample reweighting optimization strategy to ensure different branches focus on specific input samples, thereby achieving concept selection. ### Experiments and Results 1. **Datasets**: The experiments used two key CC-ReID datasets—PRCC and LTCC, as well as other large-scale datasets LaST and DeepChange. 2. **Performance Evaluation**: The model performance was evaluated using mean Average Precision (mAP) and Cumulative Matching Characteristics (CMC) at Rank1 and Rank5 matching accuracy. 3. **Comparison Experiments**: The method was compared with various existing methods, including standard ReID models and ReID models specifically designed for clothing change scenarios. The results show that Diverse Norm significantly outperforms existing methods in all scenarios. ### Conclusion The authors first pointed out the challenges of balancing clothing features and identity features in CC-ReID baseline methods and proposed a new method—Diverse Norm, which does not require additional data, multi-modal input, or clothing labels. This method effectively separates clothing features and identity features through simple orthogonalization and channel attention mechanisms, providing a simple and efficient solution. Experimental results show that Diverse Norm surpasses the current state-of-the-art methods on multiple datasets.