Diversity-Guided Distillation with Modality-Center Regularization for Robust Multimodal Remote Sensing Image Classification.

Shicai Wei,Yang Luo,Chunbo Luo
DOI: https://doi.org/10.1109/tgrs.2023.3336297
IF: 8.2
2023-01-01
IEEE Transactions on Geoscience and Remote Sensing
Abstract:Multimodal learning has shown great potential in remote sensing image classification and attracted increasing interest in the community. Although it is preferable to collect multiple modalities for training, not all of them are available in practical scenarios. To this end, we propose a general diversity-guided distillation network (DGDNet) with modality-center regularization (MCR) to facilitate accurate model inference when modalities are missing. Compared with existing modality reconstruction methods, DGDNet does not need prior knowledge of the missing modality and can handle various missing modalities via only one model. Specifically, DGDNet consists of two components: the deployment network extracting the modality-invariant representation for robust inference and the teacher network transferring comprehensive multimodal information to the deployment network. This enables the deployment network to learn the modality invariant and specific information simultaneously while maintaining robustness for incomplete modality input. In particular, we design a novel diversity-guided distillation (DGD) method that transfers knowledge by matching the feature diversity. This helps overcome the representation heterogeneity when encouraging the deployment network to learn modality-specific information. Besides, an MCR strategy is proposed to address the unbalanced training of teacher and deployment networks by constraining the intra-class inter-modality variations. This helps alleviate the underfitting for weak modality, improving the model performance. Finally, extensive experiments demonstrate that the proposed DGDNet can address the problem of missing modalities effectively and achieves state-of-the-art performance. The code is available at https://github.com/shicaiwei123/TGRS-DGDNet .
What problem does this paper attempt to address?