Autonomous intersection over union (IoU) loss: adaptive dynamic non-monotonic focal IoU loss
Yanchen Zhu,Wenhua Zheng,Jianqiang Du,Qiang Huang
DOI: https://doi.org/10.7717/peerj-cs.2347
2024-09-27
PeerJ Computer Science
Abstract:In object detection algorithms, the bounding box regression (BBR) loss directly influences the model's accuracy in predicting object positions. Initially, we conducted simulation experiments on the proposed intersection over union-based loss. Through in-depth analysis, we discovered that the width and height components of the distance term could hinder BBR. To address this, we proposed the AIoU-v1 loss, which decouples the width and height components of the distance term, thereby preventing the suppression of BBR loss by the distance term. Additionally, issues such as the imbalance between sample quantity and sample quality in the dataset, as well as labeling errors, can adversely affect BBR. To tackle these dataset problems, we designed an adaptive dynamic non-monotonic focusing mechanism with strong robustness and wide applicability. Finally, we proposed a post-processing algorithm that combines fusion and non-maximum suppression, resulting in more accurate bounding boxes during the post-processing stage. Our source code and data are available at https://github.com/Wenhua-Zheng/AIOU.
computer science, information systems, artificial intelligence, theory & methods