Cam-net: compressed attentive multi-granularity network for dynamic scene classification

Yue Li,Wenrui Ding,Yanjun Zhu,Yuanjun Huang,Yalong Jiang,Baochang Zhang
DOI: https://doi.org/10.1109/icip40778.2020.9191306
2020-01-01
Abstract:Dynamic scene classification on portable platforms is extremely challenging due to the contradiction between model complexity and computing resources. To resolve this long-standing dilemma, we propose the compressed attentive multi-granularity network (CAM-Net) in a two-step manner. First, we present a novel AM-Net based on multi-granularity attention units to boost the performance of the full-precision model. It captures and enhances both coarse and fine target-related information. Then, we introduce an efficient binary approximation to AM-Net to improve computing efficiency, leading to CAM-Net. Particularly, a grouping guidance approach is adopted to guide the reconstruction of full-precision weights from binary ones. With this guidance, CAM-Net can significantly reduce memory usage as well as CPU consumption, yet only cause a slight decline in accuracy. Extensive experiments have been conducted on three benchmark datasets, i.e., Maryland, YUPENN++ and ActivityNet, demonstrating the effectiveness and superiority of the proposed method on scene classification.
What problem does this paper attempt to address?