Multi-Cue Information Fusion For Two-Layer Activity Recognition

Yanli Ji,Jiaming Li,Hong Cheng,Xing Xu,Jingkuan Song
DOI: https://doi.org/10.1007/978-3-319-54526-4_21
2016-01-01
Abstract:Human activities involve complex multi-cue information. We propose a multi-cue information fusion based two-layer recognition approach for visual activity recognition. On the bottom layer, we learn features of body motion, interactive objects and scenes related with activities using deep networks. On the top layer of recognition, we fuse multi-cue information for activity recognition. In our experiments, we evaluate the performance of each single-cue information and various combinations of multi-cue information in activity recognition. We evaluate the effectiveness of two fusion methods, a linear support vector machine (SVM) classifier and a fully connected network. Experimental results illustrate that scene and body motion provide larger contributions for activity recognition, and recognition by fusing multi-cue information achieves 3%-12% higher MAcc than using single-cue information in the CCV database. Compared with state-of-the-art works, our approach achieves high level results both in CCV and UCF-101 databases.
What problem does this paper attempt to address?