Near-duplicated Loss for Accurate Object Localization

He Liu,Xiaocheng Yang,Huaping Liu,Tao Kong,Fuchun Sun
DOI: https://doi.org/10.1109/dsaa49011.2020.00040
2020-01-01
Abstract:Multi-class object detection always involves the tasks of accurate target localization which is mainly related to bounding box regression. Smooth L1 loss is the most popular bounding box regression loss used in the current state-of-the-art object detection systems. However, such loss for regressing the parameters of a bounding box can’t accurately and consistently regress the bounding box to the associated ground truth well. We instead propose the near-duplicated loss, a loss that better evaluate the disparity between the bounding box and the ground truth consistently. We present an approximate algorithm associated with a kernel function that not only considers the absolute distance but also involves the relative overlap area between the two bounding boxes. The new loss doesn’t need additional supervision and is easy to embed into existing networks. Our final result, by incorporating the near-duplicated loss into the state-of-the-art object detection detectors (Faster RCNN, RetinaNet), shows consistent and significant improvements on popular object detection benchmarks (MS COCO and Pascal VOC).
What problem does this paper attempt to address?