Deep Multiscale Fine-Grained Hashing for Remote Sensing Cross-Modal Retrieval

Jiaxiang Huang,Yong Feng,Mingliang Zhou,Xiancai Xiong,Yongheng Wang,Baohua Qiang
DOI: https://doi.org/10.1109/lgrs.2024.3351368
IF: 5.343
2024-02-02
IEEE Geoscience and Remote Sensing Letters
Abstract:Hashing retrieval is a widely used technique in high spatial resolution remote sensing (RS) images due to its efficient retrieval speed and low memory overhead. However, existing hashing retrieval methods primarily focus on matching multilabel RS images, neglecting the extensive fine-grained semantic information in cross-modal RS data. Moreover, RS images exhibit notable object size differences and contain redundant features that lack effective multiscale feature extraction methods. To address these issues, we propose a novel deep multiscale fine-grained hashing (DMFH) method for cross-modal hashing retrieval of RS data. The DMFH method comprises two modules: the feature extraction module and hashing retrieval module. In the feature extraction module, we introduce a multiscale feature representation method to extract both low-level and high-level features from RS images while using a redundant optimizer to remove duplicate features. In addition, we used embedding vectors to extract fine-grained semantic information from description texts. The hashing retrieval module uses contrastive loss and triplet loss to guide the hash function toward learning and generating hash codes from extracted features. Our proposed DMFH method achieves state-of-the-art performance in two public RS image–text datasets (RSICD and RSITMD) through extensive experiments and ablation studies.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics
What problem does this paper attempt to address?