Abstract:Unsupervised visible-infrared person re-identification (USL-VI-ReID) is of great research and practical significance yet remains challenging due to the absence of annotations. Existing approaches aim to learn modality-invariant representations in an unsupervised setting. However, these methods often encounter label noise within and across modalities due to suboptimal clustering results and considerable modality discrepancies, which impedes effective training. To address these challenges, we propose a straightforward yet effective solution for USL-VI-ReID by mitigating universal label noise using neighbor information. Specifically, we introduce the Neighbor-guided Universal Label Calibration (N-ULC) module, which replaces explicit hard pseudo labels in both homogeneous and heterogeneous spaces with soft labels derived from neighboring samples to reduce label noise. Additionally, we present the Neighbor-guided Dynamic Weighting (N-DW) module to enhance training stability by minimizing the influence of unreliable samples. Extensive experiments on the RegDB and SYSU-MM01 datasets demonstrate that our method outperforms existing USL-VI-ReID approaches, despite its simplicity. The source code is available at: <a class="link-external link-https" href="https://github.com/tengxiao14/Neighbor-guided-USL-VI-ReID" rel="external noopener nofollow">this https URL</a>.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the common label noise problem in **Unsupervised Visible - Infrared Cross - Modal Person Re - identification (USL - VI - ReID)**. Specifically, existing unsupervised methods will encounter label noise during the learning process due to sub - optimal clustering results and significant modal differences, which hinders effective training. ### Problem Background Unsupervised Visible - Infrared Cross - Modal Person Re - identification (USL - VI - ReID) aims to use unlabeled datasets to learn shared feature representations, thereby achieving cross - modal identity recognition. However, existing methods usually generate pseudo - labels through clustering. Due to the unsatisfactory clustering results and the significant differences between modalities, these pseudo - labels often contain noise, which affects the training effect of the model. ### Method Proposed in the Paper To solve the above problems, the paper proposes a framework for alleviating common label noise based on neighbor information. Specifically, it includes two key modules: 1. **Neighborhood - Guided Universal Label Calibration Module (N - ULC)**: - This module reduces label noise by introducing soft labels to replace hard pseudo - labels. - For each sample, calculate its correlation with neighbor samples and generate more accurate soft labels based on these correlations. - The formulas are as follows: \[ \tilde{P}_{q_v}^{\text{intra}}=\left[\frac{|N(q_v, U_v, k)\cap C_v^l|}{|N(q_v, U_v, k)\cup C_v^l|}\right] \] \[ \tilde{I}_{q_v}^{\text{intra}}=\mu I_{q_v}^{\text{intra}}+(1 - \mu)P_{q_v}^{\text{intra}} \] 2. **Neighborhood - Guided Dynamic Weighting Module (N - DW)**: - This module reduces the influence of unreliable samples by using the consistency of neighbors and enhances the stability of training. - According to the correlation between the sample and its neighbors, assign weights to each sample, thereby reducing the influence of unreliable samples on training. - The formula is as follows: \[ \omega_{q_v}^{\text{intra}}=\exp\left( - w\cdot(1 - [P_{q_v}^{\text{intra}}]_{l'})^2\right) \] ### Experimental Results The paper conducted extensive experiments on two public datasets (RegDB and SYSU - MM01). The results show that the proposed method outperforms existing methods in multiple settings. Especially in dealing with label noise, this method performs excellently and effectively improves the performance of the model. ### Summary The main contributions of the paper are: - Proposing a simple and effective framework that uses neighbor information to alleviate the common label noise in unsupervised Visible - Infrared Cross - Modal Person Re - identification. - Introducing the Neighborhood - Guided Universal Label Calibration Module (N - ULC) and the Neighborhood - Guided Dynamic Weighting Module (N - DW), which are respectively used to improve the accuracy of identity representation and enhance the stability of training. - The experimental results verify the effectiveness and superiority of this method. Through these improvements, the paper solves the common label noise problem in existing methods and provides a new solution for unsupervised Visible - Infrared Cross - Modal Person Re - identification.

Relieving Universal Label Noise for Unsupervised Visible-Infrared Person Re-Identification by Inferring from Neighbors

Modality-transfer Generative Adversarial Network and Dual-Level Unified Latent Representation for Visible Thermal Person Re-Identification

Unsupervised Visible-Infrared Person ReID by Collaborative Learning with Neighbor-Guided Label Refinement

Robust Pseudo-label Learning with Neighbor Relation for Unsupervised Visible-Infrared Person Re-Identification

Inter-Intra Modality Knowledge Learning and Clustering Noise Alleviation for Unsupervised Visible-Infrared Person Re-Identification

Shallow-Deep Collaborative Learning for Unsupervised Visible-Infrared Person Re-Identification

Dual Pseudo-Labels Interactive Self-Training for Semi-Supervised Visible-Infrared Person Re-Identification

Multi-Memory Matching for Unsupervised Visible-Infrared Person Re-Identification

Modality Blur and Batch Alignment Learning for Twin Noisy Labels-based Visible–infrared Person Re-identification

Exploring Homogeneous and Heterogeneous Consistent Label Associations for Unsupervised Visible-Infrared Person ReID

Unsupervised Visible-Infrared ReID via Pseudo-label Correction and Modality-level Alignment

Refining Noisy Labels with Label Reliability Perception for Person Re-identification

Progressive Contrastive Learning with Multi-Prototype for Unsupervised Visible-Infrared Person Re-identification

Efficient Bilateral Cross-Modality Cluster Matching for Unsupervised Visible-Infrared Person ReID

Mutual Information Guided Optimal Transport for Unsupervised Visible-Infrared Person Re-identification

Adaptive Middle Modality Alignment Learning for Visible-Infrared Person Re-identification

Dynamic Modality-Camera Invariant Clustering for Unsupervised Visible-Infrared Person Re-identification

Unbiased Feature Learning with Causal Intervention for Visible-Infrared Person Re-identification

High-Order Structure Based Middle-Feature Learning for Visible-Infrared Person Re-identification

Joint Color-irrelevant Consistency Learning and Identity-aware Modality Adaptation for Visible-infrared Cross Modality Person Re-identification.

Semi-Supervised Learning With Heterogeneous Distribution Consistency for Visible Infrared Person Re-Identification