CrCD: Multi-Direction-MLP-based Cross-Contrastive Disambiguation for Hyperspectral Image Partial Label Learning

Xiaoyu Tian,Fulin Luo,Xiuwen Gong,Tan Guo,Bo Du,Xinbo Gao
DOI: https://doi.org/10.1109/tgrs.2024.3483989
IF: 8.2
2024-01-01
IEEE Transactions on Geoscience and Remote Sensing
Abstract:Due to the intricate nature of the hyperspectral image (HSI) and the constraints imposed by annotators’ limited prior knowledge, the collection of HSI data poses significant challenges. Although there has been growing attention towards the issue of transfer learning and few-shot learning, partial label learning (PLL) has received scant attention in HSI classification. PLL refers to a scenario where training samples are associated with a set of candidate labels, among which only one is the correct label. Consequently, PLL holds significant practical importance for HSI classification, as it can alleviate the costs associated with HSI labeling. In this paper, a Cross-Contrastive Disambiguation (CrCD) method is proposed based on Multi-Direction Multi-Layer Perceptrons (MLP) for HSI PLL, which includes two main components: representation learning for feature extraction and label disambiguation strategy for PLL. First, we design a heterogeneous network framework composed of 2D-Encoder and 3D-Encoder, and introduce a Multi-Direction-MLP and a Multi-Scale-Attention for long-range spatial-spectral information. Secondly, apart from enforcing consistency in feature representation, we pioneer the establishment of a consistency constraint on semantic prediction probabilities with contrastive learning. Furthermore, the cross label disambiguation strategy is introduced to provide reliable guidance for network training. Extensive experiments demonstrate that CrCD outperforms several current state-of-the-art approaches in HSI PLL and achieves results comparable to fully supervised learning. Code: https://github.com/Nemo96yu/CrCD.
What problem does this paper attempt to address?