YOLOv3 with Asymmetric Intersection over Union Based Loss Function for Human Detection

Heng Zhu,Dongchao Yang,Geng Huang,Qingyuan Wu,Teng Li,Bo Tao
DOI: https://doi.org/10.1145/3453800.3453814
2021-01-01
Abstract:In order to improve the performance of the target detection model based on convolutional neural network, and make the bounding box representing the object location contain as complete original target information as possible, this paper improved the loss function of the bounding box regression part of the YOLOv3 network. A new metric, Asymmetric Intersection over Union (AIoU), is proposed to measure the similarity between the prediction box and the truth. Theoretical analysis shows that AIoU based loss function can better guide the direction of bounding box regression after introducing asymmetry. Experiments show that the improved YOLOv3 network achieves higher accuracy and maintains fast convergence in human target detection. The examples of bounding box regression on the test dataset show that the target information loss can be reduced by the proposed method. In addition, distinguishing between the prediction and the truth in calculation may have reference significance for the loss function design of network models for other tasks
What problem does this paper attempt to address?