Infrared multi‐target detection and tracking in dense urban traffic scenes
Chaoneng Zha,Suyun Luo,Xinhao Xu
DOI: https://doi.org/10.1049/ipr2.13053
IF: 2.3
2024-02-27
IET Image Processing
Abstract:This study presents YOLO‐IR and YOLO‐DeepOC‐IR, innovative methods for infrared object detection and tracking in dense urban traffic, emphasizing image processing techniques. The work employs infrared image enhancement techniques (local contrast multi‐scale enhancement, non‐local means, and contrast limited adaptive histogram equalization), a MobileVITv3 backbone, and a multi‐layer feature extraction module with canny edge detection, Gabor filtering, and open operation layers for improved detection in infrared imagery. The tracker's feature processing is further optimized using the learned arrangements of three patch codes descriptor and locality‐sensitive hashing, showcasing the pivotal role of image processing in enhanced traffic surveillance in infrared scenes. Infrared object detection and tracking in dense urban traffic remain a challenge due to factors such as low contrast, small intra‐class differences, and frequent false positives and negatives. To overcome these, the authors introduce YOLO‐IR, an algorithm based on the enhanced YOLOv8s, and YOLO‐DeepOC‐IR, a comprehensive infrared multi‐object tracking method for urban traffic, integrating both detection and tracking. During preprocessing, three infrared image enhancement techniques, local contrast multi‐scale enhancement, non‐local means, and contrast limited adaptive histogram equalization, are applied for better reliability in dense scenes. To further improve the performance, the original YOLOv8s backbone is replaced with MobileVITv3 to enhance detection accuracy and robustness. This infrared feature extraction module, incorporated into the detector, combines canny edge detection, Gabor filtering, and open operation layers, significantly boosting object detection in infrared imagery. The tracker's feature processing capabilities are improved using the learned arrangements of three patch codes descriptor and locality‐sensitive hashing for feature extraction and matching. Experimental results on FLIR ADAS v2 and InfiRay datasets indicate superior performance of this method, achieving 78.6% mAP and 151.1 FPS in detection, and up to 80.8% moving object tracking accuracy, 78.6% identification F1 score, and 62.1% higher order tracking accuracy in multi‐object tracking.
computer science, artificial intelligence,engineering, electrical & electronic,imaging science & photographic technology