Abstract: Speckle and geometric distortions pose a serious challenge to the visual interpretation of synthetic aperture radar (SAR) images. SAR-to-optical (S2O) image translation provides a feasible solution and has attracted increasing attention. Because of the substantial gaps between optical and SAR images, current S2O translation methods suffer from geometric distortions, missing targets, and low-fidelity outputs, which limits subsequent cross-modal applications. In this article, we propose an augmented conditional denoising diffusion probabilistic model with spatial-frequency refinement (SFDiff) for high-fidelity S2O image translation. SFDiff progressively narrows the gap between synthesized and real images in both the spatial and frequency domains, showing notable performance in terms of quality and consistency. Specifically, to incorporate the rich spatial content priors provided by SAR images, we design an SAR context prior extractor (SCPE) with denoising enhancement to extract multiscale conditional representations, thereby aiding SFDiff in capturing more descriptive cues for S2O translation. In addition, a spatial-frequency complementary learning (SFCL) module is designed to learn spatial semantics while simultaneously enhancing informative frequency components and global dependencies. Furthermore, SFDiff is optimized with a joint spatial-frequency refinement loss, which facilitates iterative refinement in both the spatial and frequency domains to enhance content consistency and fidelity in the synthesized images. Experiments on the UNICORN and SEN12 datasets show that SFDiff maintains a high level of content and structural consistency, yielding visually appealing translation results that surpass state-of-the-art (SOTA) methods. In particular, SFDiff exhibits excellent performance in preserving small targets and details, which is crucial in cross-modal detection applications.
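The abstract does not give the form of the joint spatial-frequency refinement loss, so the following is only an illustrative sketch of the general idea: penalize the difference between a synthesized and a real image both pixel by pixel and between their 2D Fourier spectra. The L1 distance in each domain, the function names, and the `freq_weight` parameter are all assumptions made here for illustration, not the authors' formulation. A naive DFT is used so the example has no dependencies; a real implementation would use an FFT library.

```python
import cmath


def dft2(img):
    """Naive 2D discrete Fourier transform of a small list-of-lists image."""
    h, w = len(img), len(img[0])
    out = [[0j] * w for _ in range(h)]
    for u in range(h):
        for v in range(w):
            total = 0j
            for y in range(h):
                for x in range(w):
                    angle = -2 * cmath.pi * (u * y / h + v * x / w)
                    total += img[y][x] * cmath.exp(1j * angle)
            out[u][v] = total
    return out


def spatial_l1(pred, target):
    """Mean absolute pixel difference (spatial-domain term, assumed L1)."""
    h, w = len(pred), len(pred[0])
    diff = sum(abs(pred[y][x] - target[y][x]) for y in range(h) for x in range(w))
    return diff / (h * w)


def frequency_l1(pred, target):
    """Mean absolute difference between complex spectra (frequency-domain term, assumed L1)."""
    fp, ft = dft2(pred), dft2(target)
    h, w = len(pred), len(pred[0])
    diff = sum(abs(fp[u][v] - ft[u][v]) for u in range(h) for v in range(w))
    return diff / (h * w)


def joint_loss(pred, target, freq_weight=0.1):
    """Hypothetical joint loss: spatial term plus a weighted frequency term."""
    return spatial_l1(pred, target) + freq_weight * frequency_l1(pred, target)
```

In this sketch, a single wrong pixel contributes to every frequency bin, which is one intuition for why a frequency term can complement a purely pixelwise loss; how SFDiff actually weights and combines the two terms is not stated in the abstract.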

Interpretable Matching of Optical-SAR Image Via Dynamically Conditioned Diffusion Models

Conditional Diffusion for SAR to Optical Image Translation

Conditional Diffusion Model With Spatial-Frequency Refinement for SAR-to-Optical Image Translation

SAR to Optical Image Translation with Color Supervised Diffusion Model

A brain-inspired approach for SAR-to-optical image translation based on diffusion models

Accelerating Diffusion for SAR-to-Optical Image Translation via Adversarial Consistency Distillation

Robust Optical and SAR Image Matching Using Attention-Enhanced Structural Features

Translating SAR to Optical Images for Assisted Interpretation

A Dual-Generator Translation Network Fusing Texture and Structure Features for SAR and Optical Image Matching

Reciprocal translation between SAR and optical remote sensing images with cascaded-residual adversarial networks

Learning to Find the Optimal Correspondence Between SAR and Optical Image Patches

A SAR-to-Optical Image Translation Method Based on Conditional Generation Adversarial Network (cGAN)

SAR-to-Optical Image Translation via an Interpretable Network

Detector-Free Feature Matching for Optical and SAR Images Based on a Two-Step Strategy

Shared contents alignment across multiple granularities for robust SAR-optical image matching

Robust Matching for SAR and Optical Images Using Multiscale Convolutional Gradient Features

Optical and SAR Image Matching Using Pixelwise Deep Dense Features

SAR-to-Optical Image Translation via Thermodynamics-inspired Network

A Bridge Neural Network-Based Optical-SAR Image Joint Intelligent Interpretation Framework

Conditional Brownian Bridge Diffusion Model for VHR SAR to Optical Image Translation

Explore Better Network Framework for High-Resolution Optical and SAR Image Matching