Crowd Distribution Estimation with Multi-scale Recursive Convolutional Neural Network.

Meng Wei,Yu Kang,Weiguo Song,Yang Cao
DOI: https://doi.org/10.1007/978-3-319-73603-7_12
2018-01-01
Abstract:Crowd distribution estimation has strong demands in surveillance applications, such as overcrowding detection, anomaly detection and traffic monitoring. Although a number of methods have been proposed for crowd counting, it is still a challenging task to estimate an accurate crowd distribution map which reflects the actual spatial intensity of the crowd in a real scene, due to the inhomogeneity of crowd distribution and the uncertainty of observation perspective. To address this problem, this paper proposes a multi-scale recursive convolutional neural network (MRCNN) based framework to map the image to its crowd distribution map. The proposed neural network is trained alternatively with two joint objectives, the estimation of crowd density map and perspective map. Since the scale size and scale variance of crowd are good cues for estimating both crowd density map and perspective map, formulating these two objectives together enables learning a strong feature representation for both tasks. By convolving a perspective-adaptive kernel on the crowd density map, we can generate a pixel-wise crowd distribution map in which the pixel value denotes the actual intensity of the crowd at the corresponding location in the real scene. An extension dataset from Shanghaitech crowd dataset B is introduced for the perspective map learning task, in which 700 images with about 3500 height-annotated pedestrians are labelled. Experimental results on Shanghaitech datasets (both A and B), UCF CC 50 dataset and UCSD dataset demonstrate the effectiveness and reliability of our proposed approach.
What problem does this paper attempt to address?