PHD: A Deep Learning Based Human Detection Framework for Panoramic Videos

Jinting Tang,Zhenhui Chen,Yongkai Huo,Peichang Zhang
DOI: https://doi.org/10.1109/wcsp.2019.8928143
2019-01-01
Abstract:Panoramic video has attracted substantial research attention as the coming video format. It is capable of providing 360 degree immersive experience of omnidirectional visual information. State-of-the-art detection networks may fail to detect humans on spherical images, which are normally represented in deformed rectangular shapes. In this paper, we propose a socalled Panoramic Human Detection (PHD) scheme to address the task of human detection in panoramic videos. Moreover, the PHD method is designed to detect humans by extracting multiple overlapping sub-images from each integral spherical image, where three-dimensional rotation of spherical images is employed to ensure consistency of sub-images. Two detection box filters are designed for removing redundant boxes. Our PHD method is capable of accomplishing the task of human detection in various panoramic video types. Experiments prove that our PHD method outperforms the baseline by 35% and 48.6% in terms of precision and recall, respectively.
What problem does this paper attempt to address?