Pedestrian Navigation Activity Recognition Based on Segmentation Transformer

Qu Wang, Zhi Tao, Jiahui Ning, Zhuqing Jiang, Liangliang Guo, Haiyong Luo, Haiying Wang, Aidong Men, Xiaofei Cheng, Zhang Zhang
DOI: https://doi.org/10.1109/jiot.2024.3394050
IF: 10.6
2024-07-27
IEEE Internet of Things Journal
Abstract: In the context of the Internet of Things, using the inertial sensors built into smartphones for human activity recognition (HAR) has attracted considerable attention owing to its wide-ranging applications. However, prevailing HAR approaches treat activity identification as a single-label classification task, recognizing either pedestrian motion modes or device usage modes while disregarding the relationship between the two. In addition, HAR methods based on sliding windows suffer from the multiclass window problem, in which the labels of some samples differ from the label assigned to the window that contains them. To address these issues, this article presents a novel approach that simultaneously recognizes pedestrian motion and device usage modes using the segmentation Transformer. The proposed joint recognition framework annotates sensor data at each timestamp and achieves dense prediction of time-series data through the encoding and decoding of the annotated data. To make better use of the information extracted from each Transformer layer, a global up-sampling decoder based on the pyramid attention module is introduced, enabling dense decoding of the features obtained from each Transformer layer. We performed experiments on two publicly available data sets to assess the effectiveness of the proposed method. The results show that our approach achieves an accuracy of 99.79% and a weighted F-score of 99.77%, surpassing existing state-of-the-art methods. Furthermore, we constructed heterogeneous data sets to validate the robustness of our method. The experimental findings indicate that the joint recognition framework uncovers the inherent correlations between pedestrian motion and device usage modes, improving recognition accuracy and addressing the multiclass window problem.
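The multiclass window problem mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' method: the function name `window_labels`, the label sequence, and the window size and stride are all hypothetical, chosen only to show how a single majority label per sliding window mislabels samples near an activity transition, which is the situation that per-timestamp (dense) prediction is meant to avoid.

```python
from collections import Counter


def window_labels(labels, size, stride):
    """Give each sliding window the majority label of its samples.

    Returns a list of (start_index, window_label, mismatched_samples),
    where mismatched_samples counts the samples whose own label differs
    from the label assigned to the whole window.
    """
    out = []
    for start in range(0, len(labels) - size + 1, stride):
        window = labels[start:start + size]
        majority, count = Counter(window).most_common(1)[0]
        out.append((start, majority, size - count))
    return out


# Hypothetical per-timestamp labels with one transition (walk -> stairs).
labels = ["walk"] * 5 + ["stairs"] * 5

result = window_labels(labels, size=4, stride=2)
for start, majority, mismatched in result:
    print(start, majority, mismatched)
```

In this toy sequence, the two windows that straddle the transition each contain one sample whose label disagrees with the window label, while a per-timestamp labeling would keep all ten labels intact.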
computer science, information systems; telecommunications; engineering, electrical & electronic
What problem does this paper attempt to address?