A Multidimensional Parallel Convolutional Connected Network Based on Multisource and Multimodal Sensor Data for Human Activity Recognition

Yuhao Wang,Hongji Xu,Lina Zheng,Guozhen Zhao,Zhi Liu,Shuang Zhou,Mengmeng Wang,Jie Xu
DOI: https://doi.org/10.1109/jiot.2023.3265937
IF: 10.6
2023-01-01
IEEE Internet of Things Journal
Abstract:Human activity recognition (HAR) technology based on wearables has received increasing attention in recent years. The traditional methods have used hand-crafted features to recognize human activities, resulting in shallow feature extraction. With the development of deep learning, an increasing number of researchers have focused on studying deep learning methods. To achieve higher recognition accuracy, the majority of the current HAR research involves multisource and multimodal sensors (MMSs) data. However, due to the limitations in the receptive fields of single-dimensional convolutional kernels, these networks are still infeasible for extracting spatiotemporal features. In this study, a multidimensional parallel convolutional connected (MPCC) deep learning network based on MMS data for HAR is proposed that fully utilizes the advantages of multidimensional convolutional kernels. Moreover, multiscale residual convolutional squeeze-and-excitation (MRCSE) modules are proposed to enrich the diversity of feature information by combining squeeze-and-excitation (SE) blocks. A daily home activity (DHA) data set is constructed based on the requirements for HAR in certain scenarios, such as smart home, and we conduct experiments on the optimal combination of sensor locations on the DHA data set according to a weighted $\text{F}1~({\mathrm{ F}}_{\mathrm{ W}})$ -score. Both tenfold and leave-one-subject-out (LOSO) cross-validations (CVs) are used to evaluate the performance of the proposed network. The MPCC-MRCSE network achieves ${\mathrm{ F}}_{\mathrm{ W}}$ -scores of 98.33% and 95.42% on the physical activity monitoring for aging people (PAMAP2) and OPPORTUNITY data sets using tenfold CVs, respectively, and achieves ${\mathrm{ F}}_{\mathrm{ W}}$ -scores of 81.47% on the PAMAP2 when applying an LOSO CV.
What problem does this paper attempt to address?