Visual oriented object detection via feature alignment and Gaussian parameterization

Xue YANG, Junchi YAN
DOI: https://doi.org/10.1360/SSI-2022-0410
2023-01-01
Abstract: Oriented object detection is an active research topic in computer vision, with a wide range of applications in remote sensing, scene text detection, and related areas. The main challenges in this field are objects with large aspect ratios, dense arrangements, and arbitrary orientations. The authors present a refined oriented detector, R3DetGauss, built on a single-stage detection method, which uses coarse-to-fine progressive regression to locate objects quickly and accurately. To address the feature misalignment that arises in refined detectors, the paper designs a feature refinement module (FRM) that yields more accurate features and improves detection performance. Specifically, FRM re-encodes the position information of the currently refined bounding box into the corresponding feature points through pixel-wise feature interpolation, thereby reconstructing and aligning the features. The paper also designs a scale-invariant normalized Gaussian Wasserstein distance as the regression loss to further improve the quality of the predicted bounding boxes. In addition, it proposes an aspect-ratio-aware adaptive sampling strategy based on this distance, which improves the quality of sample assignment. Extensive quantitative and qualitative experiments show that R3DetGauss improves on existing baselines and achieves state-of-the-art detection accuracy on a variety of datasets. The models and code are implemented and released in the Chinese open-source deep learning framework Jittor, as well as in PyTorch and TensorFlow.
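To make the Gaussian parameterization concrete, the following is a minimal sketch, not the authors' released code, of how an oriented box can be mapped to a 2-D Gaussian and how the squared 2-Wasserstein distance between two such Gaussians can be computed in closed form. The box format (cx, cy, w, h, theta in radians), the covariance convention (squared half-extents on the rotated axes), and the function names `box_to_gaussian` and `gaussian_wasserstein_sq` are assumptions made for illustration. The scale-invariant normalization mentioned in the abstract is not specified there and is not reproduced here.

```python
import math


def box_to_gaussian(cx, cy, w, h, theta):
    """Map an oriented box to a 2-D Gaussian (mean, covariance).

    Assumed convention: Sigma = R * diag((w/2)^2, (h/2)^2) * R^T,
    with theta in radians. Returns the mean and the three distinct
    entries (sxx, sxy, syy) of the symmetric covariance matrix.
    """
    c, s = math.cos(theta), math.sin(theta)
    a, b = (w / 2.0) ** 2, (h / 2.0) ** 2  # squared half-extents
    sxx = a * c * c + b * s * s
    sxy = (a - b) * c * s
    syy = a * s * s + b * c * c
    return (cx, cy), (sxx, sxy, syy)


def gaussian_wasserstein_sq(box1, box2):
    """Squared 2-Wasserstein distance between the Gaussians of two boxes.

    Uses ||m1 - m2||^2 + tr(S1) + tr(S2) - 2 * tr((S1^(1/2) S2 S1^(1/2))^(1/2)).
    For 2x2 covariances the last trace equals
    sqrt(tr(S1 S2) + 2 * sqrt(det(S1) * det(S2))).
    """
    m1, (a1, b1, d1) = box_to_gaussian(*box1)
    m2, (a2, b2, d2) = box_to_gaussian(*box2)

    center_term = (m1[0] - m2[0]) ** 2 + (m1[1] - m2[1]) ** 2
    trace_sum = a1 + d1 + a2 + d2
    trace_prod = a1 * a2 + 2.0 * b1 * b2 + d1 * d2  # tr(S1 S2)
    det_prod = (a1 * d1 - b1 * b1) * (a2 * d2 - b2 * b2)

    # Clamp at zero to guard against tiny negative values from rounding.
    cross = math.sqrt(max(trace_prod + 2.0 * math.sqrt(max(det_prod, 0.0)), 0.0))
    return center_term + trace_sum - 2.0 * cross


if __name__ == "__main__":
    box_a = (0.0, 0.0, 4.0, 2.0, 0.0)
    box_b = (3.0, 4.0, 4.0, 2.0, 0.0)
    print(gaussian_wasserstein_sq(box_a, box_b))
```

Under this convention, a box rotated by 180 degrees, or a square box rotated by any angle, maps to the same Gaussian, so the distance between the two representations is zero. This removes the boundary discontinuity that a direct angle regression would have to handle.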