Convolutional LSTM networks for video-based person re-identification

Lin Wu, Chunhua Shen, Anton van den Hengel
2016-01-01
Abstract: In this paper, we study the problem of video-based person re-identification. This is more challenging, and of greater practical interest, than conventional image-based person re-identification. To address this problem, we propose the use of convolutional Long Short-Term Memory (LSTM) based networks to learn a video-based representation for person re-identification. To this end, we propose to jointly leverage deep Convolutional Neural Networks (CNNs) and LSTM networks. Given sequential video frames of a person, the spatial information encoded in the frames is first extracted by a set of CNNs. An encoder-decoder framework derived from LSTMs is then employed to encode the resulting temporal sequence of CNN outputs. This approach leads to a refined feature representation that explicitly models the video as an ordered sequence while preserving the spatial information. Comparative experiments demonstrate that our approach achieves state-of-the-art performance for video-based person re-identification on iLIDS-VID and PRID 2011, the two primary public datasets for this purpose.
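The pipeline described in the abstract (per-frame spatial features followed by a recurrent encoder over the frame sequence) can be illustrated with a minimal NumPy sketch. This is not the authors' network: the single random convolution layer standing in for the CNNs, the plain LSTM encoder without a decoder, and all names and sizes (`frame_features`, `lstm_encode`, `encode_video`, `feat_dim`, `hidden`) are assumptions made only for illustration, and the weights are random rather than trained.

```python
import numpy as np

rng = np.random.default_rng(0)

def frame_features(frame, kernels):
    """Hypothetical stand-in for a CNN: one convolution layer (valid
    cross-correlation), ReLU, then global average pooling per kernel."""
    n, kh, kw = kernels.shape
    out_h, out_w = frame.shape[0] - kh + 1, frame.shape[1] - kw + 1
    responses = np.zeros((n, out_h, out_w))
    for i in range(kh):
        for j in range(kw):
            patch = frame[i:i + out_h, j:j + out_w]
            responses += kernels[:, i, j][:, None, None] * patch
    return np.maximum(responses, 0.0).mean(axis=(1, 2))

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_encode(features, W, U, b):
    """Run a standard LSTM over the ordered frame features and return the
    final hidden state as the sequence-level representation."""
    hidden = U.shape[1]
    h = np.zeros(hidden)
    c = np.zeros(hidden)
    for x in features:
        gates = W @ x + U @ h + b            # shape (4 * hidden,)
        i, f, o, g = np.split(gates, 4)      # input, forget, output, candidate
        c = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
        h = sigmoid(o) * np.tanh(c)
    return h

# Illustrative sizes only; real re-identification inputs are far larger.
num_frames, height, width = 6, 16, 8
feat_dim, hidden = 4, 5

video = rng.standard_normal((num_frames, height, width))
kernels = rng.standard_normal((feat_dim, 3, 3))
W = rng.standard_normal((4 * hidden, feat_dim)) * 0.5
U = rng.standard_normal((4 * hidden, hidden)) * 0.5
b = np.zeros(4 * hidden)

def encode_video(frames):
    feats = np.stack([frame_features(f, kernels) for f in frames])
    return lstm_encode(feats, W, U, b)

rep = encode_video(video)
rep_reversed = encode_video(video[::-1])

print("representation shape:", rep.shape)
print("order-sensitive:", not np.allclose(rep, rep_reversed, atol=1e-6))
```

Because the recurrent state is updated frame by frame, feeding the same frames in reverse order generally yields a different representation, which is the sense in which such an encoder treats the video as an ordered sequence. Averaging the per-frame features instead would discard that ordering.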