Flounder-Net: An efficient CNN for crowd counting by aerial photography

Jingyu Chen, Shengjie Xiu, Xiang Chen, Hao Guo, Xiaohua Xie
DOI: https://doi.org/10.1016/j.neucom.2020.09.001
IF: 6
2021-01-01
Neurocomputing
Abstract: Crowd counting on aerial images using an embedded system is a challenging task due to high-definition images, low computing power, and limited memory. To tackle this task, we propose an efficient deep learning model named Flounder-Net, structured like a flounder. In Flounder-Net, a novel interleaved group convolution is proposed to eliminate network redundancy, and a rapid shrink of feature maps is employed to handle the high-resolution problem. Because we want to investigate online aerial surveillance, we run our algorithm on the embedded system of a drone. We also use the vision system of this drone to collect a set of high-definition aerial photographs as a benchmark. Extensive experiments on existing datasets and our aerial dataset show that Flounder-Net achieves FCN-level accuracy on images from three types of photographic devices: handheld cameras, surveillance cameras, and drone-based cameras. Additionally, Flounder-Net has 17× fewer parameters than FCN, runs 20× faster, and accepts input images of arbitrary size.
computer science, artificial intelligence