Large-scale Multi-modal Person Identification in Real Unconstrained Environments

Jiajie Ye,Yisheng Guan,Junfa Liu,Xinghong Huang,Hong Zhang
DOI: https://doi.org/10.48550/arXiv.1912.12134
2019-12-17
Abstract:Person identification (P-ID) under real unconstrained noisy environments is a huge challenge. In multiple-feature learning with Deep Convolutional Neural Networks (DCNNs) or Machine Learning method for large-scale person identification in the wild, the key is to design an appropriate strategy for decision layer fusion or feature layer fusion which can enhance discriminative power. It is necessary to extract different types of valid features and establish a reasonable framework to fuse different types of information. In traditional methods, different persons are identified based on single modal features to identify, such as face feature, audio feature, and head feature. These traditional methods cannot realize a highly accurate level of person identification in real unconstrained environments. The study aims to propose a fusion module to fuse multi-modal features for person identification in real unconstrained environments.
Computer Vision and Pattern Recognition,Signal Processing
What problem does this paper attempt to address?