Locating and Recognizing Multiple Human Actions by Searching for Maximum Score Subsequences

Hong-Bo Zhang, Shao-Zi Li, Shu-Yuan Chen, Song-Zhi Su, Xian-Ming Lin, Dong-Lin Cao
DOI: https://doi.org/10.1007/s11760-013-0501-y
IF: 1.583
2013-01-01
Signal Image and Video Processing
Abstract: Despite the numerous methods for recognizing human actions in video, few are designed for videos that contain more than one action over a given time period. Moreover, existing multiple-action recognition methods adopt a windowed sequence search strategy, which requires exhaustive trials over window lengths and is therefore computationally intensive. This work presents a frame-based strategy that searches for maximum score subsequences corresponding to actions. The start and end times of all actions are thereby located, and their action categories are identified as well. In addition, contrast mutual information is proposed as a new score function to increase recognition accuracy. Experimental results indicate that the proposed method locates and recognizes multiple actions in a video accurately, and that it remains accurate on the conventional single-action classification problem.
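To make the idea of a frame-based search for maximum score subsequences concrete, the following is a minimal sketch, not the authors' implementation. It assumes that each frame already carries a signed score for one action class (positive when the frame supports the action, negative otherwise). The function names, the `min_score` threshold, and the example scores are hypothetical, and the paper's contrast mutual information score is not computed here. The sketch finds the best-scoring contiguous run of frames with Kadane's algorithm and then repeats the search on the frames to its left and right, so no window length has to be tried.

```python
from typing import List, Sequence, Tuple

Segment = Tuple[int, int, float]  # (start frame, end frame exclusive, total score)


def max_score_segment(scores: Sequence[float], lo: int, hi: int) -> Segment:
    """Kadane's algorithm on scores[lo:hi]: the contiguous run with the largest sum."""
    best_sum, best_start, best_end = float("-inf"), lo, lo
    run_sum, run_start = 0.0, lo
    for i in range(lo, hi):
        if run_sum <= 0:
            # A non-positive running sum cannot help; restart the run at frame i.
            run_sum, run_start = scores[i], i
        else:
            run_sum += scores[i]
        if run_sum > best_sum:
            best_sum, best_start, best_end = run_sum, run_start, i + 1
    return best_start, best_end, best_sum


def all_max_score_segments(
    scores: Sequence[float], lo: int = 0, hi: int = None, min_score: float = 0.0
) -> List[Segment]:
    """Return disjoint high-scoring segments in temporal order.

    Takes the best segment in [lo, hi), then searches the frames on each side.
    min_score is a hypothetical threshold below which a segment is discarded.
    """
    if hi is None:
        hi = len(scores)
    if lo >= hi:
        return []
    start, end, total = max_score_segment(scores, lo, hi)
    if total <= min_score:
        return []
    left = all_max_score_segments(scores, lo, start, min_score)
    right = all_max_score_segments(scores, end, hi, min_score)
    return left + [(start, end, total)] + right


# Hypothetical per-frame scores for a single action class.
frame_scores = [-1, 2, 3, -4, -1, 5, 1, -2]
segments = all_max_score_segments(frame_scores)
print(segments)
```

On these example scores the sketch reports two segments, frames 1 to 2 and frames 5 to 6, each with its start and end position. Running the same search once per action class and comparing segment scores is one plausible way to attach a category to each segment, though the abstract does not specify how the paper does this. The recursive split is quadratic in the worst case and recurses once per segment found; a linear-time method for all maximal scoring subsequences would be preferable for long videos.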