Real-time power line segmentation detection based on multi-attention with strong semantic feature extractor
Qian Zhao,Tangyu Ji,Shuang Liang,WenTao Yu,Chao Yan,Liang, Shuang
DOI: https://doi.org/10.1007/s11554-023-01367-8
IF: 2.293
2023-10-21
Journal of Real-Time Image Processing
Abstract:Power line is an important part of the transmission line, is the only carrier of power transmission, so the detection of power lines is to ensure the stable operation of the power system is an important means. Therefore, to improve the efficiency of power line detection, this paper proposes a real-time power line segmentation method based on multi-attention mechanism and strong semantic feature extraction. The method is improved based on the DeepLab V3+ codec model. In the encoder part, the Convolutional Block Attention Module (CBAM) is firstly introduced in MobileNetV2 network, which strengthens the ability of contextual information interaction; the ASPPAttention fast feature fusion structure is proposed, which achieves the fast extraction of multi-dimensional effective information by designing the depth-separated convolution of different perceptual fields and strengthens the pixel-level feature encoding ability through the coordinate attention (CA) mechanism; in the decoder part, this paper proposes a real-time power line segmentation method based on multiple attention mechanisms and strong semantic feature extraction. In the decoder part, this paper proposes a real-time power line segmentation method based on multi-attention mechanism and strong semantic feature extraction. In the decoder part, a lightweight inverted convolutional decoder structure is proposed, which improves the feature extraction capability of the model by introducing an inverted bottleneck convolution structure in two quadruple downsampling layers with fewer parameters, and avoids heterogeneous splitting through the introduction of the CA attention mechanism; during the training process, the model convergence is accelerated through the migration of the VOC's training weights, and the model convergence is avoided through the introduction of the Dice Loss, the effect of the number of samples on the model to accelerate the model convergence speed. The loss avoids the effect of sample number on model generalisation. The experimental results show that the mean intersection over union (mIoU) of this paper can reach 48.5%, the accuracy can reach 97.5%, and the detection speed of the model can reach 40.8 frames per second (fps), which is better than HRNet, PSPNet, DeepLab V3+ and other network models in the balance of speed and accuracy.
computer science, artificial intelligence,engineering, electrical & electronic,imaging science & photographic technology