TCANet: Triple Context-Aware Network for Weakly Supervised Object Detection in Remote Sensing Images

Xiaoxu Feng,Junwei Han,Xiwen Yao,Gong Cheng
DOI: https://doi.org/10.1109/TGRS.2020.3030990
IF: 8.2
2021-01-01
IEEE Transactions on Geoscience and Remote Sensing
Abstract:Weakly supervised object detection (WSOD) in remote sensing images (RSI) plays an essential role in RSI understanding applications. Currently, predominant works are inclined to first activate the most discriminative region and then pursue the whole object by analyzing the context information of the activated region. However, the most discriminative region usually only covers a small crucial part. Besides, many same-class instances often appear in adjacent locations. In such a case, treating proposals of large spatial overlap as the same-class instances not only introduces potential ambiguities but also misleads the detection model to recognize multiple adjacent instances as one object instance. To address these challenges, a novel triple context-aware network (TCANet) is proposed to learn complementary and discriminative visual patterns for WSOD in RSIs. Specifically, a global context-aware enhancement (CCAE) module is first designed to activate the features of the whole object by capturing the global visual scene context. Then, a dual local context residual (DLCR) module is further developed to capture the instance-level discriminative cues by leveraging the semantic discrepancy of the local context. Furthermore, an effective adaptive-weighted refinement loss is integrated into the DLCR module to reduce the ambiguities in the label propagating process. The collaboration of CCAE and DLCR formulates a unique TCANet that can be learned in an end-to-end manner. Comprehensive experiments are carried out on the challenging NWPU VHR-10.v2 and DIOR data sets. We achieve a 58.8% mAP and a 25.8% mAP on the NWPU VHR-1O.v2 and DIOR data sets, respectively, which both significantly outperform the state of the arts.
What problem does this paper attempt to address?