Automatic Basketball Detection in Sport Video Based on R-FCN and Soft-NMS

Qiaokang Liang,Li Mei,Wanneng Wu,Wei Sun,Yaonan Wang,Dan Zhang
DOI: https://doi.org/10.1145/3351917.3351970
2019-01-01
Abstract:In basketball videos, the ball is always so small in the camera that its appearance feature is hard to be extracted. In this paper, we introduce a deep-learning technology to detect the basketball. Specifically, we train our basketball detection model based on the Region-based Fully Convolutional Networks (R-FCN) which uses the fully convolutional Residual Network (ResNet) as the backbone network. What's more, we apply some new techniques including Online Hard Example Mining (OHEM), Soft-NMS and multi-scale training strategy to achieve higher detection accuracy. In detail, the OHEM method can reduce the cost of fine-tuning during training by calculating the loss of the RoIs. Soft-NMS can reduce the false positive rate by decreasing the object detection score between the overlap object. And the multi-scale training can improve the detection performance by receiving the good feature from the object with different scale. Finally, we achieve a mean average precision (mAP) value of 89.7% on a public basketball dataset. It proves that applying the deep-learning approach to basketball detection is effective.
What problem does this paper attempt to address?