Accurate and Efficient Federated-Learning-Based Edge Intelligence for Effective Video Analysis.

Liang Xu,Haoyun Sun,Hongwei Zhao,Weishan Zhang,Huansheng Ning,Hongqing Guan
DOI: https://doi.org/10.1109/jiot.2023.3241039
IF: 10.6
2023-01-01
IEEE Internet of Things Journal
Abstract:Video data is the biggest IoT data which is challenging for effective analysis with good performance. Object misdetection is usually inevitable in edge-based distributed cross-scene video analysis. Traditional centralized model training can potentially result in edge data leakage. Even though joint model can be trained with federated learning while maintaining data privacy, the size of gradient data transmitted is large for computer vision models used. To address these problems, this article proposed an accurate and efficient federated learning-based edge intelligence for effective video analysis method called EIEVA-AEFL. In EIEVA-AEFL, a federation misdetection reinforcement network (FMRN) is designed to alleviate the misdetection problem. FMRN contains a vanilla object detection network and a misdetection reinforcement branch, which finetunes object detection via feature re-extraction to reduce object misdetection. To reduce the communication cost in training, an efficient federated learning strategy is designed. In this strategy, an oscillation suppression loss function is proposed to suppress the loss fluctuation resulting from data on edge clients. Average accuracy and recall increase 0.5 and 0.7 with FMRN on the Microsoft common objects in context (MS COCO) data set, respectively, and with improvements of 4.5 and 5.5 with FMRN on our self-made mis-detection data set, respectively. EIEVA-AEFL can reduce the training speed on the premise of ensuring the accuracy of the model. The model parameters, data amount, transmission delay, and convergence epochs on EIEVA-AEFL model training are reduced by 78%, 89%, 84%, and 36%, respectively.
What problem does this paper attempt to address?