ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design

Ningning Ma,Xiangyu Zhang,Hai-Tao Zheng,Jian Sun
DOI: https://doi.org/10.48550/arXiv.1807.11164
2018-07-30
Computer Vision and Pattern Recognition
Abstract:Currently, the neural network architecture design is mostly guided by the \emph{indirect} metric of computation complexity, i.e., FLOPs. However, the \emph{direct} metric, e.g., speed, also depends on the other factors such as memory access cost and platform characterics. Thus, this work proposes to evaluate the direct metric on the target platform, beyond only considering FLOPs. Based on a series of controlled experiments, this work derives several practical \emph{guidelines} for efficient network design. Accordingly, a new architecture is presented, called \emph{ShuffleNet V2}. Comprehensive ablation experiments verify that our model is the state-of-the-art in terms of speed and accuracy tradeoff.
What problem does this paper attempt to address?