Yolov5s-MSD: a multi-scale ship detector for visible video image

Yan-Tong Chen,Yan-Yan Zhang,Jia-Liang Wang,Yang Liu
DOI: https://doi.org/10.1007/s00530-023-01196-6
IF: 3.9
2024-01-14
Multimedia Systems
Abstract:As the ship multi-scale phenomenon is very common in visible video, it is an important factor affecting the performance of visible video ship detection. Based on YOLOv5s, this paper proposes a real-time multi-scale ship detection algorithm. First, reparameterized convolution is adopted to increase the network width, thereby enhancing the network's ability to express multi-scale ship features. Second, the depth of SPPF (Spatial Pyramid Pooling Fast) is adjusted to enhance the scale invariance of ship features extracted from the network. Third, the attention module is combined with feature pyramid network to enhance the network's ability to focus on multi-scale ship features. Finally, confidence propagation cluster is used for post-processing to make the network generation more confident and closer to the boundary box of the real box. The experiment shows that our method can achieve state-of-the-art visible video ship detection performance on multiple evaluation indicators, such as mAP-IOU@0.5, mAP-IOU@[0.5:0.95], APS, APM, APL and so on. And it can meet the requirements of real-time detection.
computer science, information systems, theory & methods
What problem does this paper attempt to address?