Ulit-BiDet: an Ultralightweight Object Detector for SAR Images Based on Binary Neural Networks

Han Pu,Zhengwen Zhu,Qi Hu,Dong Wang
DOI: https://doi.org/10.1109/tgrs.2024.3373488
IF: 8.2
2024-01-01
IEEE Transactions on Geoscience and Remote Sensing
Abstract:Synthetic aperture radar (SAR) target detection has extensively utilized convolutional neural networks (CNNs). Nonetheless, CNN-based methods often achieve favorable detection accuracy at the cost of high model complexity, hindering the deployment of the algorithm in real-time application scenarios, such as maritime rescue and military decision-making. To deal with this problem, we are the first to propose an ultralightweight object detector named Ulit-BiDet for SAR images, which incorporates a binary neural network with very low storage and computation costs. First, we creatively design a performance and cost-scalable binary backbone that adapts to the diverse resources and computational capacities of practical devices. Then, the backbone structure is optimized with a new nonlocal module to enhance semantic contextual information, thereby alleviating false detections caused by the interference from land clutter and sea clutter. Third, considering the SAR imaging mechanism, the interference near the ship boundary with similar scattering power probably affects the localization accuracy due to the interfered object-related contour information. To tackle the localization issue, we uniquely propose to utilize valuable and extra object-related contour semantics to guide representation learning of ship targets. The scheme compels the model to generate features that highlight object contour, thereby promoting accurate boundary localization in ship target detection. We validated the robustness of the proposed network in three mostly cited publicly available datasets. Experimental results demonstrate that our model achieves 97.2%, 95.2%, and 77.3% detection accuracy with only 1.27M parameters and 0.22G operations (Ops) on SSDD, SAR-Ship, and AIR-SARShip2.0 ship detection datasets, respectively.
What problem does this paper attempt to address?