Abstract: Anchor-free aerial object detection methods have recently attracted much attention due to their simplicity and efficiency. However, their performance remains unsatisfactory because of two main limitations. First, anchor-free detectors employ ordinary convolution layers with axis-aligned receptive fields to extract object features, and therefore lack an internal mechanism for handling rotation variance. Second, these detectors sacrifice much semantic information to achieve faster detection, which leaves them unable to cope with the high interclass similarity and intraclass diversity of objects. To address these issues, in this article, we present an anchor-free detector, termed rotation-insensitive point representation (R²IPoints), in which a set of category-aware points encodes the spatial and semantic information of arbitrarily oriented objects. Specifically, we first devise a stacked rotation convolution module (SRM) that encourages the learning of R²IPoints by adaptively modeling orientation-agnostic interdependencies over stochastically rotated features. We further introduce a class-specific semantic enhancement module (CSM), which performs category-aware semantic activation to recalibrate features, making the point representation aware of object categories. By jointly optimizing the two proposed modules in an end-to-end manner, R²IPoints simultaneously generates rotation-insensitive and category-aware point representations. Extensive experiments on the challenging DIOR and DOTA datasets demonstrate the superiority of the proposed method: we achieve 72.7% mAP on DIOR and 74.34% mAP on DOTA, surpassing the baseline method by 2.4% mAP and 2.49% mAP, respectively. The code is available at https://github.com/shnew/R2IPoints.

Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision

PointOBB: Learning Oriented Object Detection via Single Point Supervision

PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection

P2RBox: Point Prompt Oriented Object Detection with SAM

End-to-end Point Supervised Object Detection with low-level instance features

H2RBox: Horizontal Box Annotation is All You Need for Oriented Object Detection

H2RBox-v2: Incorporating Symmetry for Boosting Horizontal Box Supervised Oriented Object Detection

On Improving Bounding Box Representations for Oriented Object Detection

A Dual-Branch Network for End-to-End Point-Supervised Object Detection on Remote Sensing Images

R²IPoints: Pursuing Rotation-Insensitive Point Representation for Aerial Object Detection

Spatial Self-Distillation for Object Detection with Inaccurate Bounding Boxes

Oriented RepPoints for Aerial Object Detection

ObjectBox: From Centers to Boxes for Anchor-Free Object Detection

Point-Based Weakly Supervised Learning for Object Detection in High Spatial Resolution Remote Sensing Images

DuBox: No-Prior Box Objection Detection via Residual Dual Scale Detectors

HODet: A New Detector for Arbitrary-Oriented Rectangular Object in Optical Remote Sensing Imagery

Point-Teaching: Weakly Semi-Supervised Object Detection with Point Annotations

Toward High Quality Multi-Object Tracking and Segmentation Without Mask Supervision

Learning Point-Guided Localization for Detection in Remote Sensing Images