MobileCount: An efficient encoder-decoder framework for real-time crowd counting

Peng Wang, Chenyu Gao, Yang Wang, Hui Li, Ye Gao
DOI: https://doi.org/10.1016/j.neucom.2020.05.056
IF: 6
2020-09-01
Neurocomputing
Abstract: In this work, we propose a computation-efficient encoder-decoder architecture, named MobileCount, which is specifically designed for high-accuracy real-time crowd counting on mobile or embedded devices with limited computation resources. For the encoder, MobileNetV2 is tailored to significantly reduce FLOPs at a small cost in performance; the tailored encoder has 4 bottleneck blocks preceded by a max pooling layer of stride 2. The design of the decoder is motivated by Light-weight RefineNet, and it further boosts counting performance with only a 10% increase in FLOPs. Compared with state-of-the-art methods, our proposed network achieves comparable counting performance with 1/10 of the FLOPs on a number of benchmarks. Finally, we propose a multi-layer knowledge distillation method to further boost the performance of MobileCount without increasing its FLOPs.
computer science, artificial intelligence