Multi-Scale YOLOv2 for Hand Detection in Complex Scenes

Jia Chen,Zihan Ni,Nong Sang
DOI: https://doi.org/10.1109/icarcv.2018.8581090
2018-01-01
Abstract:This paper presents a model named Multi-Scale YOLOv2 (MS-YOLOv2) for hand detection in complex scenes. The proposed MS-YOLOv2 is implemented by introducing three modules to YOLOv2, including a Multi-Scale Feature Refinement Module to acquire fine-grained features, a Channel Importance Evaluation Module to recalibrate feature channels and a Hard Example Punishment Module to get rid of hand interference areas. Experiment results show that the proposed MS-YOLOv2 makes much performance improvement to YOLOv2, but with little computational complexity gain. On our dataset, the proposed MS-YOLOv2 can achieve 98.2% of AP and 97.9% of AR. Moreover, on the VIVA challenge, the proposed MS-YOLOv2 achieves AP/AR of 85.1%/45.8% at Level-1 and 80.1%/45.9% at Level-2.
What problem does this paper attempt to address?