Abstract:To exploit the trajectories from different areas of a video in an effective way to represent action, this paper proposes to extract the trajectories of action-related areas, the trajectories of action-related motion boundaries and the dense trajectories independently, and then concatenate the representations of them to obtain the final representation of the video. The key to extract the former two sets of trajectories is to detect the action-related areas in each frame at first. We fulfill this task by applying sparse representation to the motion of the subvideo centered at current frame on patch level. To this end, we spatially divide the subvideo into patches. For each patch, we learn a weighted sparse representation of its motion vector using the dictionary constructed by the motion vectors of all the rest patches, and then use the reconstruction error to measure patch saliency. Based on the saliency of all patches in a frame, a saliency map is obtained to indicate the action-related areas, which on one hand is incorporated into dense tracking to extract the trajectories of action-related areas, and on the other hand is used as a mask to filter out the background motion boundaries so that the action-related motion boundary trajectories are derived. The experiments on four benchmark datasets, namely, Hollywood2, YouTube, HMDB51 and UCF101, demonstrate the effectiveness of our method.

Action Recognition by Jointly Using Video Proposal and Trajectory

A Method of Simultaneously Action Recognition and Video Segmentation of Video Streams.

Action Recognition Based on Joint Trajectory Maps Using Convolutional Neural Networks

Action Recognition Using Trajectories of Joints

Human Action Recognition Based on Action Relevance Weighted Encoding

Action Recognition Based on Dense Trajectories and Human Detection

Trajectory-Based Modeling Of Human Actions With Motion Reference Points

Action recognition via restricted dense trajectories and spatio-temporal co-occurrence feature

Action Recognition Based on Object Tracking and Dense Trajectories

A Study of Relative Motion Point Trajectories for Action Recognition

Combined Trajectories for Action Recognition Based on Saliency Detection and Motion Boundary.

Human action recognition in videos based on dense trajectory selection

Action Proposals Using Hierarchical Clustering Of Super-Trajectories

Action Recognition Based on Joint Trajectory Maps with Convolutional Neural Networks

Compressed Video Action Recognition Using Motion Vector Representation.

Action Recognition Using Edge Trajectories and Motion Acceleration Descriptor

Action Recognition by Joint Learning

Human action recognition with salient trajectories and multiple kernel learning

Joint Action Recognition And Pose Estimation From Video

Trajectories-based Motion Neighborhood Feature for Human Action Recognition

Features extraction approach based on dense salient trajectories in videos