LDSI-YOLOv8: Real-Time Detection Method for Multiple Targets in Coal Mine Excavation Scenes

Hongwei Wang, Zhihao Zhang, Lei Tao, Chao Li, Jin Li, Linhu Yao
DOI: https://doi.org/10.1109/ACCESS.2024.3450582
IF: 3.9
IEEE Access
Abstract: Multi-target detection in coal mine excavation scenes suffers from missed detections and low recognition rates caused by low illumination, heavy dust and fog, multi-target occlusion, and large variation in target scale. To address these challenges, we propose a multi-target detection method based on LDSI-YOLOv8, designed specifically for such environments. First, the method improves image clarity through restricted histogram equalization, which makes effective feature extraction easier. Second, it uses a depth-wise convolutional layer with large kernels and a dilation-wise residual module with a two-step method to enhance the extraction and fusion of multi-scale target features. Third, it exploits the relationship between feature maps to recall occluded features, which improves the detection of occluded targets. Finally, it fuses auxiliary borders and focuses on the shape and scale of the bounding box, which makes bounding box regression faster and more accurate and further improves multi-target detection. The proposed network is trained and evaluated on a dataset of excavation scenes derived from real mining production videos. The experimental results demonstrate that the proposed LDSI-YOLOv8 detection algorithm copes with harsh environments and improves the detection accuracy of occluded and multi-scale targets. It achieves an average detection accuracy of 91.4%, which is 4.3% higher than the original YOLOv8 algorithm, while reducing the number of parameters by 12.2%. It also reaches a detection speed of 88.2 FPS, meeting the requirements of real-time detection.
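The abstract does not spell out the exact form of the restricted histogram equalization step. As a rough illustration only, the sketch below assumes it resembles clip-limited histogram equalization: each histogram bin is capped before the grey-level mapping is computed, so that dark, dusty frames are brightened without over-amplifying noise. The function name `clipped_hist_equalize` and the `clip_limit` parameter are hypothetical and are not taken from the paper, and the sketch works on the whole image rather than on tiles.

```python
def clipped_hist_equalize(pixels, clip_limit=4.0, levels=256):
    """Global histogram equalization with a clipped histogram.

    Illustrative sketch, not the paper's implementation.
    pixels: flat list of integer grey levels in [0, levels - 1].
    clip_limit: hypothetical parameter; the maximum bin height expressed
        as a multiple of the mean bin height.
    """
    n = len(pixels)
    if n == 0:
        return []

    # Build the grey-level histogram.
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1

    # Cap each bin and collect the excess counts.
    cap = max(1.0, clip_limit * n / levels)
    excess = 0.0
    clipped = []
    for h in hist:
        if h > cap:
            excess += h - cap
            clipped.append(cap)
        else:
            clipped.append(float(h))

    # Redistribute the excess uniformly over all bins (single pass).
    bonus = excess / levels
    clipped = [c + bonus for c in clipped]

    # Turn the cumulative distribution into a lookup table.
    total = sum(clipped)
    lut = []
    running = 0.0
    for c in clipped:
        running += c
        lut.append(min(levels - 1, round((levels - 1) * running / total)))

    return [lut[p] for p in pixels]


# A dark synthetic "frame": all grey levels crowded near black.
dark = [10] * 50 + [12] * 30 + [15] * 20
restricted = clipped_hist_equalize(dark, clip_limit=4.0)
unrestricted = clipped_hist_equalize(dark, clip_limit=1e9)
```

With a very large `clip_limit` the function reduces to ordinary histogram equalization, which stretches the three grey levels across the full range. A small `clip_limit` yields a much gentler mapping, which is the trade-off a restricted variant is meant to control.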
Computer Science, Engineering, Environmental Science