Using Squeeze-and-Excitation Vision Transformer with Local Feature Fusion for Ship Classification in SAR Images

Yuhang Qi,Lu Wang,Chunhui Zhao,Ning Wang,Jikang Chen
DOI: https://doi.org/10.1109/igarss52108.2023.10283157
2023-01-01
Abstract:The categorization of synthetic aperture radar (SAR) ships primarily focuses on large ships with distinct features, but accurately identifying SAR ships remains challenging due to limited samples in certain ship categories. In this study, we propose a compressed and excited Vision Transformer model based on local feature fusion. This model leverages local feature fusion and channel modeling through the squeezing-and-excitation (SE) mechanism to effectively balance the contributions of each feature. By incorporating better local information, we are able to extract deeper features even from small datasets. To evaluate the efficacy of our model, we trained it on the three-category OpenSARShip 2.0 dataset and conducted experiments. The results demonstrate that our proposed model achieves superior classification accuracy compared to existing methods.
What problem does this paper attempt to address?