A Lightweight Approach to Optimizing Computational Efficiency in Multi-source Domain Adaptation for Pedestrian Re-identification

Xiaofeng Zhang,Jia He,Tong Xu,Mingchao Zhu,Kejun Wang,Bo Jiang,Xia Liu
DOI: https://doi.org/10.1145/3690407.3690550
2024-01-01
Abstract:Pedestrian re-identification is a technique to locate the same individual in data from different cameras. Currently, the method of multi-source domain adaptation for pedestrian re-identification is becoming popular. In this approach, the core challenge lies in the issue of distribution differences between domains. In addition, many scholars are also dedicated to solving the noise problem existing in pseudo labels. This paper takes a computational cost perspective and aims to solve the domain adaptation challenges of large model sizes and slow image processing speeds. We have performed a lightening process on the base model ViT. Firstly, a hybrid architecture, MixNet, is proposed, introducing the CNN basic module into ViT, extracting local features via shallow CNN, and capturing long-distance dependency patterns using deep Transformers, thereby reducing the parameters and computations. Additionally, an optimization is further conducted on the MixNet model based on the pruning algorithm. By using hierarchical pruning to compress multi-head attention and joint pruning for the mapping matrix in self-attention, the computational costs of the model have been effectively optimized. Relevant experiments carried out on several different tasks strongly verify that the proposed method has achieved significant optimization in computational costs.
What problem does this paper attempt to address?