Large-Scale Video-Based Gesture Recognition Using 3D CNN Model

Qiguang MIAO, Yunan LI, Xin XU
DOI: https://doi.org/10.3969/j.issn.1009-6868.2017.04.002
2017-01-01
Abstract: In this paper, an effective 3D convolutional neural network (CNN)-based method for large-scale gesture recognition is proposed. To obtain compact and uniform data for training and feature extraction, the inputs are unified into 32-frame videos. To describe gesture features from different aspects, optical flow data are generated from the red, green, blue (RGB) videos. The spatiotemporal features of the RGB and optical flow data are then extracted separately with the C3D model (a 3D CNN model) and blended in the next stage to boost performance. Finally, the classes are predicted with a linear support vector machine (SVM) classifier. The proposed method achieves 46.70% accuracy on the validation set of the ChalearnLAP Isolated Gesture Dataset (IsoGD).
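Two steps of the pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not say how clips are unified to 32 frames or how the two feature streams are blended, so the uniform index sampling and the normalize-then-concatenate fusion below are assumptions made for illustration, and the function names (sample_frame_indices, l2_normalize, blend_features) are hypothetical rather than taken from the paper.

```python
import math


def sample_frame_indices(num_frames, target_len=32):
    """Pick target_len frame indices spread evenly over a clip.

    Assumption: uniform sampling. Clips shorter than target_len repeat
    frames; longer clips skip frames.
    """
    if num_frames <= 0:
        raise ValueError("clip must contain at least one frame")
    if target_len == 1:
        return [0]
    # Spacing between sampled positions, from the first frame to the last.
    step = (num_frames - 1) / (target_len - 1)
    return [round(i * step) for i in range(target_len)]


def l2_normalize(vec):
    """Scale a feature vector to unit Euclidean length (zero vectors pass through)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def blend_features(rgb_feat, flow_feat):
    """Fuse two feature vectors.

    Assumption: each stream is L2-normalized and the results are
    concatenated; the paper's actual blending rule may differ.
    """
    return l2_normalize(rgb_feat) + l2_normalize(flow_feat)


if __name__ == "__main__":
    indices = sample_frame_indices(100)
    print(len(indices), indices[0], indices[-1])
    print(len(blend_features([3.0, 4.0], [0.0, 2.0])))
```

In a full pipeline, the sampled frames would be passed through the C3D model once per modality, and the blended vectors would then be used to train a linear SVM classifier.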