Research on lightweight algorithm for gangue detection based on improved Yolov5

Xinpeng Yuan, Zhibo Fu, Bowen Zhang, Zhengkun Xie, Rui Gan
DOI: https://doi.org/10.1038/s41598-024-57259-9
IF: 4.6
2024-03-21
Scientific Reports
Abstract: To address the slow detection speed, large parameter count, and high computational cost of deep-learning-based gangue detection methods, we propose an improved gangue detection algorithm based on YOLOv5s. First, the lightweight network EfficientViT is used as the backbone to increase detection speed. Second, C3_Faster replaces the C3 part in the Head module, which reduces model complexity. Third, the 20 × 20 feature map branch in the Neck region is deleted, which reduces model complexity further. Fourth, the CIoU loss function is replaced by the MPDIoU loss function. Finally, the SE attention mechanism is introduced so that the model pays more attention to critical features, improving detection performance. Experimental results show that, for the improved coal gangue detection algorithm, the model size is compressed by 77.8%, the number of parameters is reduced by 78.3%, the computational cost is reduced by 77.8%, and the number of frames is reduced by 30.6%. The method can serve as a reference for intelligent coal gangue classification.
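To make the SE (squeeze-and-excitation) attention named in the abstract concrete, the following is a minimal pure-Python sketch of the general mechanism: global average pooling per channel (squeeze), a two-layer bottleneck with ReLU and sigmoid (excitation), and per-channel rescaling. It is not the authors' implementation. The function name `se_block`, the weight matrices `w1` and `w2`, and the omission of bias terms are assumptions made for illustration, and the abstract does not say where in the network the block is placed.

```python
import math


def se_block(x, w1, w2):
    """Illustrative squeeze-and-excitation over a C x H x W nested list.

    x  : list of C channels, each an H x W list of floats
    w1 : (C // r) x C weights of the reduction layer (hypothetical values)
    w2 : C x (C // r) weights of the expansion layer (hypothetical values)
    Bias terms are omitted to keep the sketch short.
    """
    # Squeeze: one global-average value per channel.
    z = [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in x]

    # Excitation: fully connected -> ReLU -> fully connected -> sigmoid.
    hidden = [max(0.0, sum(w * v for w, v in zip(row, z))) for row in w1]
    gates = [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
        for row in w2
    ]

    # Scale: multiply every value in a channel by that channel's gate.
    out = [[[g * v for v in row] for row in ch] for ch, g in zip(x, gates)]
    return out, gates


# Toy input: 2 channels of size 2 x 2, reduction ratio r = 2.
x = [[[1.0, 1.0], [1.0, 1.0]], [[3.0, 3.0], [3.0, 3.0]]]
w1 = [[1.0, 1.0]]
w2 = [[1.0], [-1.0]]
scaled, gates = se_block(x, w1, w2)
print(gates)
```

With these toy weights the first channel receives a gate close to 1 and the second a gate close to 0, which is the sense in which the block lets a model emphasize some channels over others. In a trained network the weights are learned rather than fixed by hand.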
multidisciplinary sciences
What problem does this paper attempt to address?
The paper targets three weaknesses of deep-learning-based gangue detection methods: slow detection speed, a large number of parameters, and high computational cost. To address them, the researchers propose an improved algorithm based on YOLOv5s. The specific improvements are:
1. **Lightweight Network**: Using the EfficientViT network as the backbone to improve detection speed.
2. **Module Replacement**: Replacing the original C3 part with C3_Faster in the Head module to reduce model complexity.
3. **Feature Map Branch Removal**: Removing the 20 × 20 feature map branch in the Neck region to further reduce model complexity.
4. **Loss Function Improvement**: Replacing the CIoU loss function with the MPDIoU loss function to achieve faster convergence and more accurate regression results.
5. **Attention Mechanism Introduction**: Introducing the SE attention mechanism so that the model focuses more on key features, thereby improving detection performance.
Experimental results show that the improved model compresses model size by 77.8%, reduces the number of parameters by 78.3%, and reduces computational cost by 77.8%, while the frame rate decreases by 30.6%. The authors present it as a reference for intelligent coal gangue classification.
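As an illustration of the loss-function change, here is a minimal sketch of MPDIoU for a single pair of boxes. It follows the commonly cited definition: IoU minus the squared distances between the two top-left corners and between the two bottom-right corners, each normalized by the squared diagonal of the input image. The function name `mpdiou_loss`, the `(x1, y1, x2, y2)` box format, and the `eps` stabilizer are assumptions for this sketch and are not taken from the paper's code.

```python
def mpdiou_loss(pred, target, img_w, img_h, eps=1e-9):
    """Illustrative MPDIoU loss for two boxes given as (x1, y1, x2, y2)."""
    px1, py1, px2, py2 = pred
    tx1, ty1, tx2, ty2 = target

    # Plain IoU: intersection area over union area.
    inter_w = max(0.0, min(px2, tx2) - max(px1, tx1))
    inter_h = max(0.0, min(py2, ty2) - max(py1, ty1))
    inter = inter_w * inter_h
    union = (px2 - px1) * (py2 - py1) + (tx2 - tx1) * (ty2 - ty1) - inter
    iou = inter / (union + eps)

    # Squared distances between matching corners.
    d1_sq = (px1 - tx1) ** 2 + (py1 - ty1) ** 2  # top-left corners
    d2_sq = (px2 - tx2) ** 2 + (py2 - ty2) ** 2  # bottom-right corners

    # Normalize by the squared diagonal of the input image.
    norm = img_w ** 2 + img_h ** 2
    mpdiou = iou - d1_sq / norm - d2_sq / norm
    return 1.0 - mpdiou


# Two non-overlapping predictions for the same target on a 640 x 640 image.
near = mpdiou_loss((0, 0, 10, 10), (20, 20, 30, 30), 640, 640)
far = mpdiou_loss((0, 0, 10, 10), (100, 100, 110, 110), 640, 640)
print(near, far)
```

Unlike plain IoU, which is zero for every pair of non-overlapping boxes, the corner-distance terms keep the loss sensitive to how far apart the boxes are. This is one plausible reason such a loss could converge faster, though the summary above does not spell out the mechanism.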