An Enhanced Deep Feature Representation for Person Re-identification

Shangxuan Wu,Ying-Cong Chen,Xiang Li,An-Cong Wu,Jin-Jie You,Wei-Shi Zheng
DOI: https://doi.org/10.1109/WACV.2016.7477681
2016-04-28
Abstract:Feature representation and metric learning are two critical components in person re-identification models. In this paper, we focus on the feature representation and claim that hand-crafted histogram features can be complementary to Convolutional Neural Network (CNN) features. We propose a novel feature extraction model called Feature Fusion Net (FFN) for pedestrian image representation. In FFN, back propagation makes CNN features constrained by the handcrafted features. Utilizing color histogram features (RGB, HSV, YCbCr, Lab and YIQ) and texture features (multi-scale and multi-orientation Gabor features), we get a new deep feature representation that is more discriminative and compact. Experiments on three challenging datasets (VIPeR, CUHK01, PRID450s) validates the effectiveness of our proposal.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the feature representation problem in person re - identification from different perspectives. Specifically, the author focuses on how to improve the performance of person re - identification by fusing hand - crafted features and features extracted by convolutional neural networks (CNN). The paper points out that traditional person re - identification methods mainly rely on cross - view invariant features or robust metrics, and in recent years, deep - learning methods (such as CNN) have also made significant progress in this field. However, in practical applications, due to the influence of factors such as viewing angles, illumination, cluttered backgrounds and occlusions, the appearance of pedestrians will change significantly, which makes it difficult for a single feature representation method to meet these challenges. For this reason, the paper proposes a new feature extraction model - Feature Fusion Net (FFN). This model aims to use hand - crafted features (such as color histogram features and texture features) to constrain the feature extraction process of CNN, thereby generating more discriminative and compact deep - feature representations. Experimental results show that this model is effective on three challenging person re - identification datasets (VIPeR, CUHK01, PRID450s), especially in terms of the Rank - 1 matching rate, which has been significantly improved compared with existing methods.