ISTD-diff: Infrared Small Target Detection via Conditional Diffusion Models

Nini Du,Xuemei Gong,Ye Liu
DOI: https://doi.org/10.1109/lgrs.2024.3401838
IF: 5.343
2024-05-30
IEEE Geoscience and Remote Sensing Letters
Abstract:Infrared small-target detection (IRSTD), which is to extract tiny and dim targets that are hidden in noisy and messy backgrounds, is a challenging task in computer vision. Inspired by the recently powerful deep generative models, we formulate the IRSTD as a generative task and design a conditional denoising (DE) model termed ISTD-diff to iteratively generate the target mask from the noisy one. In addition, ISTD-diff employs a two-pathway architecture, consisting of a conditional prior (CP) stream for encoding the input infrared image prior and a DE stream for cleaning up the noisy masks. Both streams are equipped with several cascaded innovative channel-dimension transformer (CDT) layers, which capture the global correlations efficiently and reduce computational demands effectively. Moreover, to strengthen the DE learning process, we proposed a simple, but powerful method named attention injection module (AIM), which provides detailed control over the DE stream. Extensive experiments finely demonstrate the superior performance of our ISTD-diff beyond the current representative segmentation-based state-of-the-art (SOTA) algorithms.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics
What problem does this paper attempt to address?