Selecting Effective and Discriminative Spatio-Temporal Interest Points for Recognizing Human Action.

Hongbo Zhang, Shaozi Li, Songzhi Su, Shu-Yuan Chen
DOI: https://doi.org/10.1587/transinf.e96.d.1783
2013-01-01
IEICE Transactions on Information and Systems
Abstract: Many successful methods for recognizing human action are based on spatio-temporal interest points (STIPs). In a matching-based method that uses a voting mechanism, each STIP in a test video sequence casts a vote for each action class based on its mutual information with respect to that class, which is measured in terms of the class likelihood probability. Two issues must therefore be addressed to improve the accuracy of action recognition. First, effective STIPs in the training set must be selected as references for accurately estimating the probability. Second, discriminative STIPs in the test set must be selected for voting. This work uses epsilon-nearest neighbors as the effective STIPs for estimating the class probability and a variance filter for selecting discriminative STIPs. Experimental results verify that the proposed method is more accurate than existing action recognition methods.
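The voting scheme summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the estimator or the filter definition, so the Gaussian kernel, the log-likelihood-ratio vote, the choice to take the variance over a STIP's per-class votes, and all names and parameters (`class_likelihoods`, `stip_votes`, `classify`, `eps`, `sigma`, `var_thresh`) are assumptions made for illustration. It also assumes at least two action classes.

```python
import numpy as np

def class_likelihoods(d, train_desc, train_labels, n_classes, eps, sigma=1.0):
    """Estimate p(d | c) from training STIPs within distance eps of d.

    Assumption: a Gaussian kernel over the epsilon-nearest neighbors,
    normalized by the number of training STIPs in each class.
    """
    dist2 = np.sum((train_desc - d) ** 2, axis=1)
    near = dist2 <= eps ** 2                      # epsilon-nearest neighbors
    weights = np.exp(-dist2 / (2.0 * sigma ** 2)) * near
    lik = np.zeros(n_classes)
    for c in range(n_classes):
        in_c = train_labels == c
        lik[c] = weights[in_c].sum() / max(in_c.sum(), 1)
    return lik

def stip_votes(lik, floor=1e-12):
    """Vote for class c: log ratio of p(d | c) to the mean likelihood
    under the other classes (an assumed stand-in for the paper's
    mutual-information vote)."""
    others = (lik.sum() - lik) / (lik.size - 1)
    return np.log(lik + floor) - np.log(others + floor)

def classify(test_desc, train_desc, train_labels, n_classes, eps, var_thresh):
    """Sum the votes of test STIPs that pass the variance filter."""
    total = np.zeros(n_classes)
    for d in test_desc:
        lik = class_likelihoods(d, train_desc, train_labels, n_classes, eps)
        votes = stip_votes(lik)
        # Assumed variance filter: a STIP whose votes barely differ
        # across classes is treated as non-discriminative and skipped.
        if votes.var() >= var_thresh:
            total += votes
    return int(np.argmax(total)), total

# Toy data: two well-separated classes of 2-D descriptors.
train_desc = np.array([[0.0, 0.0], [0.2, 0.0], [0.0, 0.2],
                       [5.0, 5.0], [5.2, 5.0], [5.0, 5.2]])
train_labels = np.array([0, 0, 0, 1, 1, 1])
test_desc = np.array([[5.1, 5.1], [4.9, 5.0], [2.6, 2.6]])
pred, total = classify(test_desc, train_desc, train_labels,
                       n_classes=2, eps=1.0, var_thresh=1e-6)
print(pred, total)
```

In this toy example the third test descriptor has no training neighbor within `eps`, so its votes are identical across classes and the variance filter discards it. The remaining two descriptors lie near the class 1 cluster and determine the prediction.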