Refining Video-Based Person Re-Identification: an Integrated Framework with Facial and Body Cues

Yichen Li,Yufei Yin,Wengang Zhou,Houqiang Li
DOI: https://doi.org/10.1145/3643490.3661806
2024-01-01
Abstract:In Person Re-Identification (Re-ID), the use of facial cues has often been overlooked due to the focus on low-quality image datasets in past research. However, these cues are essential biometric markers, particularly valuable in video person re-identification scenarios where abundant facial information is available. This paper introduces the Dual-Cue Graph Network (DCGN), a graph convolutional network-based method for re-ranking that integrates facial and body cues. Our approach begins with a facial feature fusion module that prioritizes face quality to improve the extraction of facial features from videos. Unlike traditional Re-ID networks, our method focuses on facial cues for person retrieval, producing preliminary candidate results. We then implement a confidence-weighted fusion module to combine body and facial cues and re-rank these initial results, thereby enhancing the overall person retrieval process. Our experiments on real-world video datasets confirm the effectiveness of this method, demonstrating that facial cues are a critical source of information in video-based scenarios and significantly boost the performance of video person re-identification.
What problem does this paper attempt to address?