Research on Local Spatio-Temporal Features for Action Recognition

LEI Qing,LI Shao-zi
DOI: https://doi.org/10.3778/j.issn.1002-8331.2010.34.003
2010-01-01
Abstract:Local spatio-temporal features have become a popular video representation for action recognition in recent years. Several methods for feature detection and description have been proposed in the literature and promising recognition results are demonstrated for a number of action classes.This paper employs the motion representation based on space-time interest points and implements action recognition method based on spatio-temporal codebook and words.Firstly,accurate interests points detectes from videos taking advantage of Gabor and Gaussian mixture filtering,then three kinds of local features:histo- gram of gradient,histogram of flow and histogram of space-time gradient are extracted as 3DSIFT to describe interest points. K-means cluster algorithm performs on features and learns the spatial-tempoal codebook.Finally a standard bag-of-features SVM approach is used for action recognition.The performance is investigated on a total of 16 action classes distributed over two datasets with varying difficulty.Experiment results demonstrate that features combined spatial with temporal information can well adapted to complex environment such as camera movement,illumination changes and different clothing in realistic settings and achieve better recognition performance.
What problem does this paper attempt to address?