Abstract:Human activity recognition (HAR) using body-worn sensors is an active research area in human-computer interaction and human activity analysis. The traditional methods use hand-crafted features to classify multiple activities, which is both heavily dependent on human domain knowledge and results in shallow feature extraction. Rapid developments in deep learning have caused most researchers to switch to deep learning methods, which extract features from raw data automatically. Most of the existing works on human activity recognition tasks involve multimodal sensor data, and these networks mainly focus on the top representation extracted from bottom-up feedforward process without reusing other features from bottom layers. In this paper, we present a novel hybrid deep learning network for human activity recognition that also employs multimodal sensor data; however, our proposed model is a ConvLSTM pipeline that makes full use of the information in each layer extracted along the temporal domain. Thus, we propose a dense connection module (DCM) to ensure maximum information flow between the network layers. Furthermore, we employ a multilayer feature aggregation module (MFAM) to extract features along the spatial domain, and we aggregate the features obtained from every convolutional layer according to the importance of features in different spatial locations. The output of the MFAM is input into two LSTM layers to further model the temporal dependencies. Finally, a fully connected layer and a softmax function are used to compute the probability of each class. We demonstrate the effectiveness of our proposed model on two benchmark datasets: Opportunity and UniMiB-SHAR. The results illustrate that our designed network outperforms the state-of-the-art models. We also conduct experiments on efficiency, multimodal fusion and different hyperparameters to analyze our proposed network. Finally, we carry out ablation and visualization experiments to reveal the effectiveness of the two proposed modules.

Deep Dilation on Multimodality Time Series for Human Activity Recognition.

A Deep Dilated Convolutional Self-attention Model for Multimodal Human Activity Recognition

A Hybrid Network Based on Dense Connection and Weighted Feature Aggregation for Human Activity Recognition

A Multi-Task Deep Learning Approach for Sensor-based Human Activity Recognition and Segmentation

A Multidimensional Parallel Convolutional Connected Network Based on Multisource and Multimodal Sensor Data for Human Activity Recognition

Learning Dynamic Spatio-Temporal Relations for Human Activity Recognition.

A Multi-dimensional Parallel Convolutional Connected Network Based on Multi-source and Multi-modal Sensor Data for Human Activity Recognition

Deep Residual Bidir-LSTM for Human Activity Recognition Using Wearable Sensors

Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition

Sequential Human Activity Recognition Based on Deep Convolutional Network and Extreme Learning Machine Using Wearable Sensors

A Semisupervised Recurrent Convolutional Attention Model for Human Activity Recognition

Human Activity Recognition based on Dynamic Spatio-Temporal Relations

A Multitask Deep Learning Approach for Sensor-Based Human Activity Recognition and Segmentation

3D Human Activity Recognition with Reconfigurable Convolutional Neural Networks

An Improved Deep Convolutional LSTM for Human Activity Recognition Using Wearable Sensors

CIR-DFENet: Incorporating Cross-Modal Image Representation and Dual-Stream Feature Enhanced Network for Activity Recognition

1-DCNN with Stacked LSTM Architecture for Human Activity Recognition Using Wearable Sensing Data

Dual-Branch Interactive Networks on Multichannel Time Series for Human Activity Recognition

A Deep Structured Model with Radius–Margin Bound for 3D Human Activity Recognition

Human Activity Recognition Based on Deep-Temporal Learning Using Convolution Neural Networks Features and Bidirectional Gated Recurrent Unit With Features Selection

Multi-channel Time Series Decomposition Network For Generalizable Sensor-Based Activity Recognition