Multi-Scale Transformer-Based Matching Network for Generalizable Person Re-Identification

Jinhua Jiang,Wenfeng Zhang,Ruisheng Ran,Wei Hu,Jiangyan Dai
DOI: https://doi.org/10.1109/lsp.2023.3313088
2023-01-01
IEEE Signal Processing Letters
Abstract:Recently some researches have focused on the Domain-Generalization (DG) Re-ID problem that training and testing are not in the same domain distribution. To fit the unseen complex scenes, recently deep feature matching-based methods for DG Re-ID have been developed and achieved the state-of-the-arts. However, they ignored some cases in which the accuracy of key region matching is unstable at a single scale, and the bad impact of style variations for feature representations. To address the issues, we propose a novel deep image matching model named Multi-scale Transformer-based Matching Network (MTMN) for DG Re-ID problem. MTMN matches two images with multi-scale local respondence instead of fixed representations. Specifically, the Transformer is carefully modified to formulate efficient local interactions between query and gallery images in multiple scales. Moreover, the style normalization is introduced to filter out identity-irrelated features to promote the matching results. Comprehensive experiments on several DG Re-ID tasks demonstrate the superiority of the proposed method compared with the state-of-the-arts, e.g., 5.4 $\%$ and 2.6 $\%$ gains in Rank-1 and mAP on Market-1501 $\rightarrow$ MSMT17(V1) task.
What problem does this paper attempt to address?