Dual-branch PolSAR Image Classification Based on GraphMAE and Local Feature Extraction

Yuchen Wang,Ziyi Guo,Haixia Bi,Danfeng Hong,Chen Xu
2024-08-08
Abstract:The annotation of polarimetric synthetic aperture radar (PolSAR) images is a labor-intensive and time-consuming process. Therefore, classifying PolSAR images with limited labels is a challenging task in remote sensing domain. In recent years, self-supervised learning approaches have proven effective in PolSAR image classification with sparse labels. However, we observe a lack of research on generative selfsupervised learning in the studied task. Motivated by this, we propose a dual-branch classification model based on generative self-supervised learning in this paper. The first branch is a superpixel-branch, which learns superpixel-level polarimetric representations using a generative self-supervised graph masked autoencoder. To acquire finer classification results, a convolutional neural networks-based pixel-branch is further incorporated to learn pixel-level features. Classification with fused dual-branch features is finally performed to obtain the predictions. Experimental results on the benchmark Flevoland dataset demonstrate that our approach yields promising classification results.
Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to achieve effective classification in the polarimetric synthetic aperture radar (PolSAR) image classification task when the labeled data is limited. Specifically, the annotation of PolSAR images is a labor - intensive and time - consuming process, so classification in the case of scarce labeled data is a challenging task. Existing self - supervised learning methods mainly focus on contrastive learning, while the application research of generative self - supervised learning in PolSAR image classification is relatively scarce. To solve this problem, the author proposes a two - branch classification model based on generative self - supervised learning. ### Main Contributions 1. **Generative Graph Auto - encoder**: Utilize the generative graph auto - encoder (GraphMAE) to learn polarimetric representations at the super - pixel level, and learn features through an auxiliary graph reconstruction task without manual annotation. 2. **Convolutional Neural Network Branch**: Introduce a pixel - level feature learning branch based on a convolutional neural network (CNN) to capture more fine - grained features. 3. **Two - Branch Fusion**: Balance the model performance and computational cost by fusing the features of the super - pixel branch and the pixel branch, and verify the effectiveness of the method on the Flevoland benchmark dataset. ### Method Overview 1. **Super - pixel Branch**: - Use the SLIC algorithm to perform super - pixel segmentation on the Pauli RGB image to generate a mask matrix. - Construct an undirected graph \( G=\langle V, E, X\rangle \), where \( V \) is the set of nodes, \( E \) is the set of edges, and \( X \) is the node feature. - Use the graph attention network (GAT) as an encoder and a decoder to recover the masked node features through partially known nodes and edges. - The loss function is defined as: \[ L_{\text{SCE}}=\frac{1}{|e_V|}\sum_{v_i\in e_V}\left(1 - \frac{x_i^Tz_i}{\|x_i\|\cdot\|z_i\|}\right)^\gamma \] where \( x_i \) and \( z_i \) represent the original feature and the reconstructed feature respectively, and \( \gamma \) is a scaling parameter. 2. **Pixel Branch**: - Use the \( n\times n \) patch of the central pixel as input. - Utilize a CNN architecture containing four convolutional layers, with each convolutional layer followed by a ReLU activation function, and the second and fourth layers followed by a max - pooling layer. - The numbers of output channels are 128, 256, 512 and 512 respectively, and finally output features through a fully - connected layer. 3. **Feature Fusion and Classification**: - Fuse the features of the super - pixel branch and the pixel branch with weights \( \alpha \) and \( 1-\alpha \) respectively: \[ F = \alpha F_S+(1 - \alpha)F_P \] - Use a fully - connected layer and a softmax layer as a classifier, and the optimization objective is the cross - entropy loss function. ### Experimental Results The experiment was carried out on the Flevoland benchmark dataset with a size of 1024×750 and containing 15 categories. The results show that the proposed DB - GC model outperforms the single - branch models (GNN and CNN) in terms of overall accuracy (OA) and average accuracy (AA), with improvements of 9.25% and 12.46% respectively. In particular, for the accuracy of each category, the DB - GC model exceeds 94%, demonstrating its strong feature extraction ability on a limited dataset. ### Conclusion This paper proposes a two - branch PolSAR image classification model based on generative self - supervised learning. By combining a graph auto - encoder at the super - pixel level and a convolutional neural network at the pixel level, it effectively solves the problem of scarce labeled data and achieves remarkable classification results on the Flevoland dataset. Future work will consider designing PolSAR image classification methods based on complex - valued deep architectures to fully utilize the complex - valued characteristics of PolSAR data.