Bal-R$^2$CNN: High Quality Recurrent Object Detection with Balance Optimization

Xiaoyu Chen,Hongliang Li,Qingbo Wu,Fanman Meng,Heqian Qiu
DOI: https://doi.org/10.1109/tmm.2021.3067439
IF: 7.3
2021-01-01
IEEE Transactions on Multimedia
Abstract:It is a common practice to refine object detection results using recurrent detection paradigm. We evaluate the recurrent detection on Faster R-CNN, but the improvement is far away from expected. We consider that the performance bottleneck is from imbalance optimization caused by the biased distribution of training data. Low-IoU-skewed RPN proposals could suppress the contribution of High-IoU examples at the training stage. Besides, data imbalance and statistical discrepancy on regression targets between low-IoU and high-IoU examples are not considered in the regression task; this design could impede localization quality. In this work, we propose Bal-R $^2$ CNN for high-quality recurrent object detection. There are two new components in Bal-R $^2$ CNN. Self-iteration box sampling collects object boxes from recurrent steps and increases the number of high-IoU training examples. IoU-sensitive bounding-box regression sends proposal boxes with different IoUs to specified regression branches for more accurate bounding-box prediction. Both two new components could induce balanced optimization and be helpful. With the resulting Bal-R $^2$ CNN detector, evaluation on PASCAL VOC and MSCOCO reveal that our method has a significant improvement on the existing solution and could reach a better performance than several state-of-the-art methods.
What problem does this paper attempt to address?