Abstract:Cross-Resolution Person Re-Identification (re-ID) aims to match images with disparate resolutions arising from variations in camera hardware and shooting distances. Most conventional works utilize Super-Resolution (SR) models to recover Low Resolution (LR) images to High Resolution (HR) images. However, because the SR models cannot completely compensate for the missing information in the LR images, there is still a large gap between the HR image recovered from the LR images and the real HR images. To tackle this challenge, we propose a novel Multi-Scale Image- and Feature-Level Alignment (MSIFLA) framework to align the images on multiple resolution scales at both the image and feature level. Specifically, (i) we design a Cascaded Multi-Scale Resolution Reconstruction (CMSR2) module, which is composed of three cascaded Image Reconstruction (IR) networks, and can continuously reconstruct multiple variables of different resolution scales from low to high for each image, regardless of image resolution. The reconstructed images with specific resolution scales are of similar distribution; therefore, the images are aligned on multiple resolution scales at the image level. (ii) We propose a Multi-Resolution Representation Learning (MR2L) module which consists of three-person re-ID networks to encourage the IR models to preserve the ID-discriminative information during training separately. Each re-ID network focuses on mining discriminative information from a specific scale without the disturbance from various resolutions. By matching the extracted features on three resolution scales, the images with different resolutions are also aligned at the feature-level. We conduct extensive experiments on multiple public cross-resolution person re-ID datasets to demonstrate the superiority of the proposed method. In addition, the generalization of MSIFLA in handling cross-resolution retrieval tasks is verified on the UAV vehicle dataset.

Deep Cross-Modality Alignment for Multi-Shot Person Re-IDentification

Joining Features by Global Guidance with Bi-Relevance Trihard Loss for Person Re-Identification

Deep Siamese Network with Multi-level Similarity Perception for Person Re-identification

Dual Attention Matching Network for Context-Aware Feature Sequence based Person Re-Identification

Adaptive multi-task learning for cross domain and modal person re-identification

Cross-Modality Person Re-identification with Memory-Based Contrastive Embedding

Bridging the Gap: Multi-level Cross-modality Joint Alignment for Visible-infrared Person Re-identification

A Local-Global Self-attention Interaction Network for RGB-D Cross-Modal Person Re-identification.

Multi-Scale Image- and Feature-Level Alignment for Cross-Resolution Person Re-Identification

Efficient Bilateral Cross-Modality Cluster Matching for Unsupervised Visible-Infrared Person ReID

Discover Cross-Modality Nuances for Visible-Infrared Person Re-Identification

RGB-Infrared Cross-Modality Person Re-Identification via Joint Pixel and Feature Alignment

Deep High-Resolution Representation Learning for Cross-Resolution Person Re-Identification

Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments

EANet: Enhancing Alignment for Cross-Domain Person Re-identification

Joint Cross-Consistency Learning and Multi-Feature Fusion for Person Re-Identification

Deep-Person: Learning discriminative deep features for person Re-Identification

Cross-modality person re-identification via modality-synergy alignment learning

Person Re-Identification with Joint Verification and Identification of Identity-Attribute Labels

Cross-modality paired-images generation and augmentation for RGB-infrared person re-identification

Dual adaptive alignment and partitioning network for visible and infrared cross-modality person re-identification