Decoding Human Activities: Analyzing Wearable Accelerometer and Gyroscope Data for Activity Recognition

Utsab Saha, Sawradip Saha, Tahmid Kabir, Shaikh Anowarul Fattah, Mohammad Saquib
2024-07-09
Abstract: A person's movement or relative positioning can be effectively captured by different types of sensors, and the corresponding sensor output can be utilized in various manipulative techniques for the classification of different human activities. This letter proposes an effective scheme for human activity recognition, which introduces two unique approaches within a multi-structural architecture, named FusionActNet. The first approach aims to capture the static and dynamic behavior of a particular action by using two dedicated residual networks, and the second approach facilitates the final decision-making process by introducing a guidance module. A two-stage training process is designed where, at the first stage, residual networks are pre-trained separately by using static (where the human body is immobile) and dynamic (involving movement of the human body) data. In the next stage, the guidance module along with the pre-trained static or dynamic models are trained on the given sensor data. Here the guidance module learns to emphasize the most relevant prediction vector obtained from the static or dynamic models, which helps to effectively classify different human activities. The proposed scheme is evaluated using two benchmark datasets and compared with state-of-the-art methods. The results clearly demonstrate that our method outperforms existing approaches in terms of accuracy, precision, recall, and F1 score, achieving 97.35% and 95.35% accuracy on the UCI HAR and Motion-Sense datasets, respectively, which highlights both the effectiveness and stability of the proposed scheme.
Computer Vision and Pattern Recognition, Machine Learning
What problem does this paper attempt to address?
This paper aims to address the problem of human activity recognition using data from wearable sensors such as accelerometers and gyroscopes. Specifically, the paper proposes an effective approach named FusionActNet, which introduces two unique network structures to capture static and dynamic behaviors, combined with a guidance module to optimize the final decision process.

### Main Issues

1. **High-Precision Classification Challenge**: Existing shallow networks struggle to extract meaningful patterns from raw 1D time-domain data from accelerometers and gyroscopes, especially when there is significant data overlap between similar activities such as walking, lying down, and sitting.
2. **Multi-Stage Training**: How to design an effective multi-stage training process to improve the accuracy and stability of the model.

### Solutions

1. **Dual-Model Structure**:
   - **Static Model**: Specifically designed to capture features of static activities such as sitting, lying, and standing.
   - **Dynamic Model**: Specifically designed to capture features of dynamic activities such as walking, going upstairs, and going downstairs.
2. **Guidance Module**:
   - In the first stage, the static and dynamic models are pre-trained using static and dynamic data, respectively.
   - In the second stage, the guidance module combines the pre-trained static and dynamic models to perform weighted fusion on the input data to generate the final activity label prediction.

### Advantages

- **Complementary Information**: By capturing different types of activity features through two dedicated models, the robustness of the model is improved.
- **Weighted Fusion**: The guidance module optimizes the final prediction results by performing weighted fusion of the outputs from the static and dynamic models.

### Experimental Results

- **UCI HAR Dataset**: Achieved an accuracy of 97.35%, outperforming existing methods.
- **Motion-Sense Dataset**: Achieved an accuracy of 95.35%, also outperforming existing methods.
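
### Illustrative Sketch: Weighted Fusion

The summary says the guidance module weights the prediction vectors from the static and dynamic models, but it does not give the exact formulation. The NumPy sketch below shows one plausible reading, not the authors' implementation. It assumes each branch outputs a probability vector over its own three activities and that the guidance module emits two logits per sample, turned into weights by a softmax. The names `fuse`, `softmax`, `STATIC`, `DYNAMIC`, and `CLASSES`, and all input values, are hypothetical.

```python
import numpy as np

# Assumed label groups, following the activities named in the summary.
STATIC = ["sitting", "standing", "lying"]
DYNAMIC = ["walking", "walking_upstairs", "walking_downstairs"]
CLASSES = STATIC + DYNAMIC


def softmax(z, axis=-1):
    """Numerically stable softmax along the given axis."""
    z = z - np.max(z, axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)


def fuse(static_probs, dynamic_probs, guidance_logits):
    """Blend the two branch predictions with per-sample guidance weights.

    static_probs:    (batch, 3) probabilities over STATIC
    dynamic_probs:   (batch, 3) probabilities over DYNAMIC
    guidance_logits: (batch, 2) hypothetical guidance-module output
    Returns a (batch, 6) distribution over CLASSES.
    """
    w = softmax(guidance_logits)  # (batch, 2); each row sums to 1
    return np.concatenate(
        [w[:, :1] * static_probs, w[:, 1:] * dynamic_probs], axis=1
    )


# Made-up numbers for a single window of sensor data.
static_probs = np.array([[0.7, 0.2, 0.1]])
dynamic_probs = np.array([[0.1, 0.8, 0.1]])
guidance_logits = np.array([[2.0, 0.0]])  # guidance favours the static branch

fused = fuse(static_probs, dynamic_probs, guidance_logits)
predicted = CLASSES[int(np.argmax(fused, axis=1)[0])]
print(predicted)
```

Because each branch output sums to one and the two guidance weights sum to one, the fused vector is itself a valid distribution over all six activities. In this toy case the guidance weights favour the static branch, so its top class determines the prediction.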
### Conclusion

The proposed FusionActNet approach performs excellently in handling static and dynamic activity recognition, achieving high accuracy and stability on two benchmark datasets, demonstrating significant progress in the field of activity recognition technology. Future research can further explore applying this method to other types of data to enhance its generalizability.