Behaviour Detection in Crowded Classroom Scenes Via Enhancing Features Robust to Scale and Perspective Variations

Mingyu Liu, Fanman Meng, Qingbo Wu, Linfeng Xu, Qianghua Liao
DOI: https://doi.org/10.1049/ipr2.12318
IF: 2.3
2021-01-01
IET Image Processing
Abstract: Detecting human behaviours in images of crowded classroom scenes is challenging because of the large variations in the scale and pose perspective of the people shown. This paper proposes two modules, one for each source of variation. First, an attention-based region-of-interest (RoI) extractor is designed to handle scale variation: feature fusion and an attention mechanism enrich the RoI feature with more local and global information. Second, a transformation-based detection head is introduced to handle perspective variation: a spatial transformation is adopted to extract a consistent representation across perspectives. In addition, because proper datasets for human behaviour detection in classroom scenes are lacking, a new dataset named CLBD is created. Experiments on CLBD demonstrate that the two modules yield significant performance improvements over state-of-the-art detectors.
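To make the idea of attention-weighted feature fusion concrete, here is a minimal, illustrative sketch in plain Python. It is not the paper's RoI extractor, whose design the abstract does not describe. It assumes each feature-map level has already been pooled to a same-length RoI feature vector, scores each level by its mean activation, and fuses the levels with softmax weights. The function names, the scoring rule, and the sample values are all hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_roi_features(level_features):
    """Fuse same-length RoI feature vectors from several levels.

    Assumption: each level is scored by its mean activation and the
    scores become attention weights through a softmax. The abstract
    does not state the scoring function the authors actually use.
    """
    scores = [sum(f) / len(f) for f in level_features]
    weights = softmax(scores)
    dim = len(level_features[0])
    # Weighted sum across levels, computed per feature dimension.
    fused = [
        sum(w * f[i] for w, f in zip(weights, level_features))
        for i in range(dim)
    ]
    return fused, weights


# Hypothetical pooled RoI features from three feature-map levels.
features = [[1.0, 2.0], [1.0, 2.0], [1.0, 2.0]]
fused, weights = fuse_roi_features(features)
```

Because the weights always sum to one, the fused vector stays on the same scale as its inputs. Levels with stronger responses contribute more, which is one simple way to let a small or a large person draw on the most informative resolution.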