Abstract:Real-world data with skewed distributions poses a serious challenge to existing object detectors. The unbalanced label distribution leads to a bias towards dominate labels, resulting in the worse detection performance on the rare classes than the dominant classes. More unfortunately, the label samplers in these detectors shift the training label distributions to a new skewed distribution, thereby severely limiting the effectiveness of previous prior-based methods such as Logit Adjustment (Menon et al., in ICLR. OpenReview.net, 2021). Additionally, the tremendous ratio of the background samples to the samples per foreground category further hinders the learning of classification on foreground categories. To mitigate these issues, in this paper, we propose Logit Normalization (LogN), a simple technique to self-calibrate the classification logits of detectors in a similar way to Batch Normalization (BN). LogN first leverages the consistency between logit statistics and the training label distribution to eliminate the long-tail bias of detectors in a normalized manner. Second, based on the independence between fore-background imbalance and long-tail distribution, we also introduce a background calibration for LogN, which effectively improves the overall performance by restoring the background discriminability. In general, our LogN is training- and tuning-free ( i.e. require no extra training and tuning process), model- and label distribution-agnostic ( i.e. generalization to different kinds of detectors and datasets), and also plug-and-play ( i.e. direct application without any bells and whistles). Extensive experiments on the LVIS dataset demonstrate the superior performance of LogN to the state-of-the-art methods with various detectors ( e.g. two-stage detectors, one-stage detectors, query-based detectors) and backbones ( e.g. VITs, Swin Transformers). We also provide in-depth studies on different aspects of our LogN. We also conduct experiments on multiple datasets such as Open Images and ImageNet-LT. The results show that LogN can improve performance on other object detection datasets and the image classification task. Our LogN can serve as a strong baseline for long-tail object detection and is expected to inspire future research in this field.

Margin and Average Precision Loss Calibration for Long-Tail Object Detection

Searching Parameterized AP Loss for Object Detection

Long-tail Detection with Effective Class-Margins

Margin Calibration for Long-Tailed Visual Recognition

InterFace:Adjustable Angular Margin Inter-class Loss for Deep Face Recognition

On Model Calibration for Long-Tailed Object Detection and Instance Segmentation

Towards Calibrated Model for Long-Tailed Visual Recognition from Prior Perspective

Rectify the Regression Bias in Long-Tailed Object Detection

Calibrating Class Activation Maps for Long-Tailed Visual Recognition

AP-Loss for Accurate One-Stage Object Detection

Logit Normalization for Long-Tail Object Detection

Feature-Balanced Loss for Long-Tailed Visual Recognition

Balanced Classification: A Unified Framework for Long-Tailed Object Detection

Boosting Long-tailed Object Detection via Step-wise Learning on Smooth-tail Data

Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection

Towards Prior Gap and Representation Gap for Long-Tailed Recognition

Fractal Calibration for long-tailed object detection

Adaptive Class Suppression Loss for Long-Tail Object Detection

Long-Tailed Object Detection Pre-training: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction

The Equalization Losses: Gradient-Driven Training for Long-tailed Object Recognition