Enhanced federated recognition mechanism based on spatial-temporal model with split learning for multi-view human activity classification in edge intelligent network

Nguyen Anh Tuan, Atif Rizwan, Sa Jim Soe Moe, DoHyeun Kim
DOI: https://doi.org/10.1007/s11042-024-20354-0
IF: 2.577
2024-10-25
Multimedia Tools and Applications
Abstract: Multi-view human activity recognition (HAR) is a significant research domain focused on understanding human actions from diverse sensor perspectives. With the rapid growth of the IoT, the demand for real-time processing and data privacy in this domain has led to the deployment of deep learning-based HAR models on IoT devices at the edge. However, the constrained computational capabilities of edge devices present challenges, including performance degradation and slower inference speeds in real-time HAR systems. To address these challenges, we propose a novel recognition mechanism called Spatial-Temporal Split Learning (STSL). By leveraging both spatial and temporal features from different views, STSL enhances the accuracy of the activity classification model. In addition, STSL reduces the processing demands on edge devices by dividing the model into two parts: the first part operates on edge devices, extracting spatial features from frames captured by multiple cameras, while the second part runs on the edge server, where it aggregates spatial features, extracts temporal features, and uses them to make predictions. Moreover, our approach supports continuous learning, enabling the model to adapt and improve over time with streaming data. We deploy STSL with EdgeX microservices to optimize data management and improve the connection between the edge server and devices. We evaluated the proposed method on the public IXMAS dataset, focusing on two key metrics: accuracy and processing speed. Our approach achieved 95.25% accuracy on the IXMAS dataset and ran at 7 FPS when deployed on a Raspberry Pi 4. Compared to existing state-of-the-art methods, the proposed method offers competitive accuracy and efficient real-time processing in edge environments.
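The two-part split described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the networks, feature sizes, or transport, so every function name below is hypothetical, the "spatial" and "temporal" extractors are simple arithmetic stand-ins for learned models, and the labels "moving" and "still" are placeholders rather than IXMAS classes. Only the division of work follows the abstract: per-frame spatial features on each edge device, then view aggregation, temporal features, and prediction on the edge server.

```python
from statistics import fmean

# All names and the decision rule are hypothetical stand-ins for illustration.

def edge_spatial_features(frame):
    """Edge-device half: reduce one frame (rows of pixel values) to a
    small feature vector. Stand-in for a learned spatial extractor:
    the mean of each pixel row."""
    return [fmean(row) for row in frame]

def aggregate_views(per_view):
    """Server half, step 1: fuse the feature vectors that all cameras
    sent for one time step (element-wise mean across views)."""
    return [fmean(values) for values in zip(*per_view)]

def temporal_features(sequence):
    """Server half, step 2: summarize change over time. Stand-in for a
    learned temporal model: mean absolute step-to-step difference."""
    if len(sequence) < 2:
        raise ValueError("need at least two time steps")
    diffs = [
        [abs(cur - prev) for prev, cur in zip(a, b)]
        for a, b in zip(sequence, sequence[1:])
    ]
    return [fmean(column) for column in zip(*diffs)]

def classify(features, threshold=0.1):
    """Server half, step 3: toy decision rule with placeholder labels."""
    return "moving" if fmean(features) > threshold else "still"

def server_predict(clip_features):
    """clip_features[t][v] is the vector sent by view v at time step t."""
    fused = [aggregate_views(views) for views in clip_features]
    return classify(temporal_features(fused))

if __name__ == "__main__":
    # Two cameras, three time steps, 2x2 frames whose brightness grows with t.
    clip = [
        [edge_spatial_features([[float(t)] * 2] * 2) for _view in range(2)]
        for t in range(3)
    ]
    print(server_predict(clip))
```

In this arrangement only the compact feature vectors, not the raw frames, cross from the devices to the server, which is the property the abstract relies on to reduce the load on edge devices.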
computer science, information systems, theory & methods, engineering, electrical & electronic, software engineering