Personnel Detection in Dark Aquatic Environments Based on Infrared Thermal Imaging Technology and an Improved YOLOv5s Model

Liang Cheng,Yunze He,Yankai Mao,Zhenkang Liu,Xiangzhao Dang,Yilong Dong,Liangliang Wu
DOI: https://doi.org/10.3390/s24113321
IF: 3.9
2024-05-24
Sensors
Abstract:This study presents a novel method for the nighttime detection of waterborne individuals using an enhanced YOLOv5s algorithm tailored for infrared thermal imaging. To address the unique challenges of nighttime water rescue operations, we have constructed a specialized dataset comprising 5736 thermal images collected from diverse aquatic environments. This dataset was further expanded through synthetic image generation using CycleGAN and a newly developed color gamut transformation technique, which significantly improves the data variance and model training effectiveness. Furthermore, we integrated the Convolutional Block Attention Module (CBAM) at the end of the last encoder's feedforward network. This integration maximizes the utilization of channel and spatial information to capture more intricate details in the feature maps. To decrease the computational demands of the network while maintaining model accuracy, Ghost convolution was employed, thereby boosting the inference speed as much as possible. Additionally, we applied hyperparameter evolution to refine the training parameters. The improved algorithm achieved an average detection accuracy of 85.49% on our proprietary dataset, significantly outperforming its predecessor, with a prediction speed of 23.51 FPS. The experimental outcomes demonstrate the proposed solution's high recognition capabilities and robustness, fulfilling the demands of intelligent lifesaving missions.
engineering, electrical & electronic,chemistry, analytical,instruments & instrumentation
What problem does this paper attempt to address?
The paper aims to address the problem of effectively detecting individuals in dark water environments, such as night-time water rescue operations. Current methods heavily rely on infrared thermal imaging technology, but its application in water scenarios is limited. The paper proposes an improved YOLOv5s model specifically optimized for infrared thermal imaging. The research team constructed a dedicated dataset consisting of 5736 thermal images from different water environments and increased data diversity through CycleGAN and a new color space transformation technique. They also integrated Convolutional Block Attention Module (CBAM) into the model to enhance the capture capability of feature map details and utilized Ghost Convolution to reduce computational requirements and improve inference speed. Additionally, training parameters were optimized through hyperparameter evolution. The improved algorithm achieved an average detection accuracy of 85.49% and a prediction speed of 23.51 FPS on the proprietary dataset, significantly outperforming the original model. The paper emphasizes the challenges in real water environments, such as small object occlusion, the impact of seasonal and climate changes on thermal imaging, low contrast when temperatures are close, and hardware limitations for model deployment. To address these issues, they employed image enhancement techniques, created a thermal image dataset covering various scenarios, developed thermal imaging color space transformation data augmentation techniques, and designed a lightweight thermal imaging water surface object detection network (IWT-YOLO). This network is capable of accurately identifying boats, intelligent rescue robots, humans, and humans in different states such as drowning and swimming. In conclusion, the proposed solution in this paper improves the ability of target detection in dark water environments, contributing to intelligent rescue missions, especially for water surface rescue robots like "Dolphin I".