Causal Adversarial Autoencoder for Disentangled SAR Image Representation and Few-Shot Target Recognition
Qian Guo,Huilin Xu,Feng Xu
DOI: https://doi.org/10.1109/tgrs.2023.3330478
IF: 8.2
2023-01-01
IEEE Transactions on Geoscience and Remote Sensing
Abstract:Lack of interpretability and weak generalization ability have become the major challenges with data-driven intelligent synthetic aperture radar-automatic target recognition (SAR-ATR) technology, especially in practical applications with azimuth-sparse training samples. A novel insight into SAR image representation with neural networks from a causal perspective is presented in this article. First, a causal model of SAR image representation conditioned on disentangled semantic factors is proposed. A set of SAR images is considered as a low-dimensional manifold, which is controlled by three semantic factors, namely, intrinsics, diversity, and randomness. A causal adversarial autoencoder (CAE) for SAR-ATR is then proposed to embody this disentangled representation, which incorporates a number of novel built-in network features. A physically reasonable cyclic high-frequency information-based embedding (CHE) method is proposed for azimuth encoding, which ensures the uniformity, continuity, periodicity, and distinctiveness of angle. A symmetrically conditional encoding (SCE) module is established to constrain the semantic consistency of low-dimensional features. Besides, a hybrid loss function is designed, which is composed of latent adversarial loss, reconstruction loss, and task-oriented losses. Both representation and generalization abilities are thoroughly evaluated through qualitative visualization and quantitative comparison experiments on the moving and stationary target acquisition and recognition (MSTAR) and FUSAR-Ship datasets. Experimental results demonstrate superior representation ability for the disentangled properties via angle interpolation and target transformation of SAR images. Using only 12 samples per class, the proposed CAE can achieve an accuracy of 93.1% for the ten-target SAR-ATR classification task.
What problem does this paper attempt to address?