Few-Shot Object Detection With Multilevel Information Interaction for Optical Remote Sensing Images
Lefan Wang,Shaohui Mei,Yi Wang,Jiawei Lian,Zonghao Han,Xiaoning Chen
DOI: https://doi.org/10.1109/tgrs.2024.3410308
IF: 8.2
2024-06-21
IEEE Transactions on Geoscience and Remote Sensing
Abstract:Metalearning has been widely applied to solve the few-shot object detection (FSOD) problem in natural scenes, which performs similarity measurement and information aggregation of the support set and the query set. However, regarding remote sensing images (RSIs), many difficulties caused by their disparities need to be further addressed, such as inconsistencies in imaging scale, direction, and background between support and query images. These result in feature misalignment and attention bias, interfering with model performance. In this article, a multilevel information interaction (MLII) strategy is proposed for FSOD to alleviate feature misalignment and attention bias. Information interactions are conducted within multiple scales of features and highlight similar regions of query and support features. A semantic enhancement module (SEM) is proposed to assist MLII in extracting key information and achieving more discriminative feature representation. Moreover, a feature cross-aggregation module (FCM) with separate classification losses is designed to train the detector to identify objects that coexist in query and support images. Extensive experiments demonstrate that the proposed method outperforms several state-of-the-art few-shot object detectors over commonly used benchmark datasets, i.e., DIOR and NWPU-10.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics