Abstract:Action recognition is an enabling technology for many real world applications, such as human-computer interaction, surveillance, video retrieval, retirement home monitoring, and robotics. In the past decade, it has attracted a great amount of interest in the research community. Recently, the commoditization of depth sensors has generated much excitement in action recognition from depth sensors. New depth sensor technology has enabled many applications that were not feasible before. On one hand, action recognition becomes far easier with depth sensors. On the other hand, the drive to recognize more complex actions presents new challenges. One crucial aspect of action recognition is to extract discriminative features. The depth maps have completely different characteristics from the RGB images. Directly applying features designed for RGB images does not work. Complex actions usually involve complicated temporal structures, human-object interactions, and person-person contacts. New machine learning algorithms need to be developed to learn these complex structures. This work enables the reader to quickly familiarize themselves with the latest research in depth-sensor based action recognition, and to gain a deeper understanding of recently developed techniques. It will be of great use for both researchers and practitioners who are interested in human action recognition with depth sensors. The text focuses on feature representation and machine learning algorithms for action recognition from depth sensors. After presenting a comprehensive overview of the state of the art in action recognition from depth data, the authors then provide in-depth descriptions of their recently developed feature representations and machine learning techniques, including lower-level depth and skeleton features, higher-level representations to model the temporal structure and human-object interactions, and feature selection techniques for occlusion handling.

Human behaviour recognition with mid-level representations for crowd understanding and analysis

Human Action Recognition Using Deep Learning Methods.

Based on cluster tree human action recognition algorithm for monocular video

Hierarchical Complex Activity Representation and Recognition Using Topic Model and Classifier Level Fusion.

Toward Accurate Person-level Action Recognition in Videos of Crowed Scenes

Online Robust Action Recognition Based on a Hierarchical Model

Human Action Recognition From Digital Videos Based on Deep Learning.

Human Action Recognition with Contextual Constraints Using a RGB-D Sensor

Human Action Recognition Based on Three-Stream Network with Frame Sequence Features

Behaviour recognition based on the integration of multigranular motion features in the Internet of Things

Zero-Shot Crowd Behavior Recognition

Human Action Recognition Based on Hierarchical Multi-Scale Adaptive Conv-Long Short-Term Memory Network

Human Behavior Recognition from Multiview Videos

Deep Learning-Based Human Action Recognition in Videos

Human crowd behaviour analysis based on video segmentation and classification using expectation–maximization with deep learning architectures

Combining Sparse And Dense Descriptors With Temporal Semantic Structures For Robust Human Action Recognition

Human Action Recognition with Depth Cameras

A Systematic Survey on Human Behavior Recognition Methods

Action Recognition by Exploring Data Distribution and Feature Correlation

A Novel Hierarchical Framework for Human Action Recognition

Human Behavior Recognition Based on Multiscale Convolutional Neural Network.