TCM: Temporal Consistency Model for Head Detection in Complex Videos

Daud Khan,Ahmed B. Altamimi,Mohib Ullah,Habib Ullah,Faouzi Alaya Cheikh,Sultan Daud Khan
DOI: https://doi.org/10.1155/2020/8861296
IF: 2.336
2020-12-16
Journal of Sensors
Abstract:Head detection in real-world videos is a classical research problem in computer vision. Head detection in videos is challenging than in a single image due to many nuisances that are commonly observed in natural videos, including arbitrary poses, appearances, and scales. Generally, head detection is treated as a particular case of object detection in a single image. However, the performance of object detectors deteriorates in unconstrained videos. In this paper, we propose a temporal consistency model (TCM) to enhance the performance of a generic object detector by integrating spatial-temporal information that exists among subsequent frames of a particular video. Generally, our model takes detection from a generic detector as input and improves mean average precision (mAP) by recovering missed detection and suppressing false positives. We compare and evaluate the proposed framework on four challenging datasets, i.e., HollywoodHeads, Casablanca, BOSS, and PAMELA. Experimental evaluation shows that the performance is improved by employing the proposed TCM model. We demonstrate both qualitatively and quantitatively that our proposed framework obtains significant improvements over other methods.
engineering, electrical & electronic,instruments & instrumentation
What problem does this paper attempt to address?