Abstract:To exploit the trajectories from different areas of a video in an effective way to represent action, this paper proposes to extract the trajectories of action-related areas, the trajectories of action-related motion boundaries and the dense trajectories independently, and then concatenate the representations of them to obtain the final representation of the video. The key to extract the former two sets of trajectories is to detect the action-related areas in each frame at first. We fulfill this task by applying sparse representation to the motion of the subvideo centered at current frame on patch level. To this end, we spatially divide the subvideo into patches. For each patch, we learn a weighted sparse representation of its motion vector using the dictionary constructed by the motion vectors of all the rest patches, and then use the reconstruction error to measure patch saliency. Based on the saliency of all patches in a frame, a saliency map is obtained to indicate the action-related areas, which on one hand is incorporated into dense tracking to extract the trajectories of action-related areas, and on the other hand is used as a mask to filter out the background motion boundaries so that the action-related motion boundary trajectories are derived. The experiments on four benchmark datasets, namely, Hollywood2, YouTube, HMDB51 and UCF101, demonstrate the effectiveness of our method.

Action Recognition Using Edge Trajectories and Motion Acceleration Descriptor

Action Recognition Using Trajectories of Joints

A Study of Relative Motion Point Trajectories for Action Recognition

Action Recognition Based on Object Tracking and Dense Trajectories

Learning Deep Trajectory Descriptor for Action Recognition in Videos Using Deep Neural Networks.

Action Recognition Based on Dense Trajectories and Human Detection

Action recognition via restricted dense trajectories and spatio-temporal co-occurrence feature

Exploring the Influence of Motion Boundary Sampling to Improved Dense Trajectories for Action Recognition

Action Recognition Based on Depth Image Sequence

Action Recognition Using Multi-Scale Histograms of Oriented Gradients Based Depth Motion Trail Images

Compressed Video Action Recognition Using Motion Vector Representation.

Activity Recognition Using Dense Long-Duration Trajectories

An Improved Method Using Kinematic Features for Action Recognition

Combined Trajectories for Action Recognition Based on Saliency Detection and Motion Boundary.

Weighted Feature Trajectories and Concatenated Bag-of-features for Action Recognition

Human Action Recognition Based on Point Context Tensor Shape Descriptor

Action Recognition by Jointly Using Video Proposal and Trajectory

Action recognition and detection by combining motion and appearance features

Human Action Recognition by Fast Dense Trajectories.

Action Recognition Using Form and Motion Modalities

3D Action Recognition Using Multi-Temporal Depth Motion Maps and Fisher Vector