Multi-scale features fused network with multi-level supervised path for crowd counting
Yongjie Wang,Wei Zhang,Dongxiao Huang,Yanyan Liu,Jianghua Zhu
DOI: https://doi.org/10.1016/j.eswa.2022.116949
IF: 8.5
2022-08-01
Expert Systems with Applications
Abstract:Many CNN-based methods which utilize the density map to regress the count number of crowd are introduced to solve the crowd counting problem lately. Due to the head scale variations caused by the perspective change and background noise, these methods can not address these two problems well in highly crowded scenario. In order to solve these two problems, we introduce a multi-scale features fused network with multi-level supervised path to produce the high-quality density map in this paper. Our model utilizes the first 13 layers of VGG16 model as the backbone, the multi-level supervised path in our model employs the multi-level dilated convolution module (MLD) to supervise the whole network at multi-level, and generate the attention map for the density map, which is used to handle the scale variations. The other path is used to fuse multi-scale features to generate the density map with soft spatial-channel attention module (SSCA) which aims to produce a saliency weight map of same size. In the end, the final density map is captured by the feature map multiply the attention map. In addition, a new objective function is proposed to train our network. A large number of experimental results show that compared with other networks, our method achieves better experimental results on four challenging datasets (UCF_CC_50, ShanghaiTech, UCF-QRNF and WorldExpo'10 dataset).
computer science, artificial intelligence,engineering, electrical & electronic,operations research & management science