Lightweight PM-YOLO Network Model for Moving Object Recognition on the Distribution Network Side

FU Huitong,WANG Peng,LI Xiaoyan,Lü Zhigang,DI Ruohai
DOI: https://doi.org/10.1109/acctcs53867.2022.00109
2022-01-01
Abstract:For the problems of large size and high computation of deep convolutional neural network (CNN) models, which make it difficult to achieve real-time detection on resource-limited mobile or embedded devices, and the existing lightweight models are not sufficient for detection of small targets on the distribution side, a model compression algorithm that combines multi-scale detection and lightweight target recognition PM-YOLO (Prune-MobileNetv3-YOLOv5s) is proposed to achieve efficient detection. Based on the YOLOv5s model, feature information detection is performed at different scales by adding a network processing layer for small targets to improve model recognition accuracy. Target recognition backbone network is reconfigured based on improved MobileNetv3, which reduces model parameters and convolutional operations and improves target recognition rate. Based on iterative sparsity training, the pruning method eliminates redundant parameters, thus compressing model volume and computation volume to the limit. Fine-tuning techniques and related optimization means are used to ensure that the accuracy error of the model before and after compression is within an acceptable range. The experimental results show that under the same test set and test environment, in compared with the YOLOv5s target recognition algorithm, the PM-YOLO algorithm compresses the YOLOv5s model volume by 85.4%, reduces the floating-point type computation by 84.6%, increases the prediction speed by 3.7 times, and achieves 90.6% accuracy of the compressed model recognition with only 1.5% accuracy loss. The proposed method has high detection accuracy and real-time detection speed while reducing the platform volume power consumption storage and computing power requirements, which has obvious advantages over existing lightweight models and is more conducive to the deployment of the model in resource-limited mobile, thus, making the supervision function of moving targets more accessible.
What problem does this paper attempt to address?