Temporal-Contextual Attention Network for Video-Based Person Re-identification

Di Chen,Zheng-Jun Zha,Jiawei Liu,Hongtao Xie,Yongdong Zhang
DOI: https://doi.org/10.1007/978-3-030-00776-8_14
2018-01-01
Abstract:Video-based person re-identification aims to identify a specific person in surveillance videos from different cameras. This paper presents a new Temporal-Contextual Attention Network (TCA-Net) for person re-identification in videos. The TCA-Net exploits temporally local context among consecutive frames to concentrate selectively on crucial frames within a video sequence. Specifically, the network consists of a Convolutional Neural Network (CNN) module and a temporal-contextual attention block. The CNN module embeds each video frame into a convolutional representation, and the temporal-contextual attention block learns the importance of a video frame for re-identification by exploiting the local context among the frame and its neighboring frames. The feature of a video sequence is then obtained by aggregating frame-level features weighted by frame importance. We evaluate the proposed TCA-Net on a challenging dataset MARS. The experimental results have demonstrated the effectiveness of the proposed approach.
What problem does this paper attempt to address?