Abstract:In view of the strong generalizability and self-learning capabilities of deep learning models, many scholars have studies how to apply deep learning theory in the pedestrian re-identification field. However, a number of problems persist in practically applying deep learning in this field, including determining how to make full use of the features of the sequence information in the salient region of an image and addressing the data gap between data-driven deep learning models and pedestrian re-identification tasks. In view of these problems, in this paper, a re-identification method is proposed based on a visual common attention mechanism. Initially, the method focuses on the local area of the image at the location specified by the given coordinates. Next, under the constraint of pedestrian image pairing tags, it focuses on the sequence of salient regions of image pairs based on deep learning techniques. Then, the global features and the local attention features are cascaded into joint features for use in pedestrian re-identification. To address the data gap between deep learning models and pedestrian re-identification, a new strategy for generating difficult positive samples is proposed primarily through a positive sample that mainly involves a positive sample generation network, a difficult positive sample conversion network, and a dual-stream twin network. We using this network, a large number of positive samples can be obtained to train the data-driven neural network and solve the re-identification task. The above ideas are combined to propose a pedestrian re-identification algorithm based on a visual attention-positive sample generation network deep learning model. The experimental results show that the method proposed in this paper not only achieves better recognition results than other deep learning methods, but also adapts well to a variety of databases. In addition, the method proposed in this paper is more robust to occluded pedestrian images than other deep learning methods. In addition, the method proposed in this paper is more robust than other deep learning methods for occluding pedestrian images.

A Lightweight Approach to Optimizing Computational Efficiency in Multi-source Domain Adaptation for Pedestrian Re-identification

A Novel Two-Stream Saliency Image Fusion CNN Architecture for Person Re-Identification

DIMGNet: A Transformer-based Network for Pedestrian Reidentification with Multi-granularity Information Mutual Gain

Transformer fusion and histogram layer multispectral pedestrian detection network

Cross-domain unsupervised pedestrian re-identification based on multi-view decomposition

Weight Determination In Multi-Feature Fusion For Pedestrian Re-Identification

Unsupervised Domain Adaptation for Multispectral Pedestrian Detection

MSNet: a lightweight multi-scale deep learning network for pedestrian re-identification

Pedestrian Re-identification Algorithm Based on Visual Attention-positive Sample Generation Network Deep Learning Model

A Novel Attention-Driven Framework for Unsupervised Pedestrian Re-identification with Clustering Optimization

MHDNet: A Multi-Scale Hybrid Deep Learning Model for Person Re-Identification

Lightweight Cross-Modal Multispectral Pedestrian Detection Based on Spatial Reweighted Attention Mechanism

Occluded pedestrian re-identification via Res-ViT double-branch hybrid network

Self-Training With Progressive Representation Enhancement for Unsupervised Cross-Domain Person Re-Identification

Pedestrian Detection: Domain Generalization, CNNs, Transformers and Beyond

1St Place Solution to VisDA-2020: Bias Elimination for Domain Adaptive Pedestrian Re-identification

Pedestrian Re-Identification Based on Fine-Grained Feature Learning and Fusion

Multidirection and Multiscale Pyramid in Transformer for Video-Based Pedestrian Retrieval

Pedestrian 3D Shape Understanding for Person Re-Identification via Multi-View Learning

Reparameterized dilated architecture: A wider field of view for pedestrian detection