Abstract:Oriented object detection is a basic task in the interpretation of high-resolution remote sensing images.Compared with general detectors,oriented detectors can locate instances with oriented bounding boxes,which are consistent with arbitrary-oriented ground truths in remote sensing images.Currently,oriented object detection has greatly progressed with the development of the convolutional neural network.However,this task is still challenging because of the extreme variation in object scales and arbitrary orientations.Most oriented detectors are evolved from horizontal detectors.They first generate horizontal proposals using the Region Proposal Network(RPN).Then,they classify these proposals into different categories and transform them into oriented bounding boxes.Despite their success,these detectors exploit only the annotations at the end of the network and do not fully utilize the angle and semantic information. This work proposes an Angle-based Region Proposal Network(ARPN),which learns the angle of objects and generates oriented proposals.The structure of ARPN is the same as that of RPN.However,for each proposal,instead of outputting four parameters for regression,ARPN generates five parameters,which are the center(x,y),shape(w,h),and angle(t).In the training,we first assign anchors with ground truths by the Intersection of Unions.Then,we directly supervise the ARPN with the shape and angle information of ground truths.We also propose a semantic branch to output image semantic results for utilizing the advantage of the semantic information.The semantic branch consists of two convolutional layers and is parallel with the detection head.We first assign objects to different scale levels according to their areas.Then,we create semantic labels in each scale and use them to supervise the semantic branch.With the semantic information supervision,the model will learn translation-variant features and improve accuracy.Moreover,the outputs of the semantic branch indicate the objectness in each place,which can filter out false positives of final predictions. We conduct comprehensive experiments on the DOTA dataset to validate the effectiveness of the proposed methods.In the data preparation,we first crop original images into 1024x1024 patches with the stride of 824.Compared with the baseline,the ARPN achieves a 2.2％increase in mAP,while the semantic branch contributes an additional 0.8％improvement in mAP.Finally,we combine both methods and achieve a 74.64％mAP,which is competitive with those obtained by other oriented object detectors.We visualize some results on the DOTA dataset.The results show that our method is highly effective for small objects and densely packed objects. We proposed ARPN and the semantic branch to utilize the multi-information in remote sensing images.The ARPN can directly generate oriented proposals,which can lead to better recall of oriented objects.The semantic branch increases the translation-variant property of the features.Experiments demonstrate the effectiveness of our method,which achieves a 74.64％mAP on the DOTA dataset.In the future works,we will focus on the model efficiency and the inference speed.

Dynamic Pyramid Attention Networks for Multi-Orientation Object Detection

EBiDA-FPN: Enhanced Bi-Directional Attention Feature Pyramid Network for Object Detection

Adaptive multi-level feature fusion and attention-based network for arbitrary-oriented object detection in remote sensing imagery

Feature Reassembly and Self-Attention for Oriented Object Detection in Remote Sensing Images

Oriented Object Detection in Remote Sensing Using an Enhanced Feature Pyramid Network

Multi Angle Rotation Object Detection for Remote Sensing Image Based on Modified Feature Pyramid Networks

Feature Enhancement Based Oriented Object Detection in Remote Sensing Images

Discriminative Feature Pyramid Network For Object Detection In Remote Sensing Images

Dual-Resolution and Deformable Multihead Network for Oriented Object Detection in Remote Sensing Images

Shallow Multiplexing and Multiscale Dilation Convolution Combined Attention Based Oriented Object Detection in Remote Sensing Images

Spatial-Coordinate Attention and Multi-Path Residual Block Based Oriented Object Detection in Remote Sensing Images

Feature Pyramid Full Granularity Attention Network for Object Detection in Remote Sensing Imagery

RADet: Refine Feature Pyramid Network and Multi-Layer Attention Network for Arbitrary-Oriented Object Detection of Remote Sensing Images

Multi-Modal Object Detection Method Based on Dual-Branch Asymmetric Attention Backbone and Feature Fusion Pyramid Network

A Dual-Path Multihead Feature Enhancement Detector for Oriented Object Detection in Remote Sensing Images.

Shared-Weight-Based Multi-Dimensional Feature Alignment Network for Oriented Object Detection in Remote Sensing Imagery

Multi-information Supervision in Optical Remote Sensing Images

A Pyramid Attention Network with Edge Information Injection for Remote-Sensing Object Detection

A Novel Nonlocal-Aware Pyramid and Multiscale Multitask Refinement Detector for Object Detection in Remote Sensing Images

Dual Attention Based Image Pyramid Network for Object Detection.

Oriented Object Detection in Remote Sensing Images with Anchor-Free Oriented Region Proposal Network