SmartTuning: Selecting Hyper-Parameters of a ConvNet System for Fast Training and Small Working Memory

Xiaqing Li,Guangyan Zhang,Weimin Zheng
DOI: https://doi.org/10.1109/TPDS.2020.3040723
IF: 5.3
2021-01-01
IEEE Transactions on Parallel and Distributed Systems
Abstract:It is desirable to deploy a ConvNet system with high inference accuracy, as well as fast training and small inference memory. However, existing approaches to hyper-parameter tuning only focus on high accuracy. Although achieving high accuracy, tuning poorly can significantly increase the performance burden, and thus degrade the overall performance of a ConvNet system. In this article, we propose SmartTuning, an approach to identifying the hyper-parameters of a ConvNet system for high training speed and small working memory, with the restriction of high inference accuracy. The key idea of SmartTuning is to build a new performance model for a ConvNet system, and to integrate Bayesian Optimization to learn the relationship between the overall performance and the hyper-parameters of a ConvNet system. In this way, SmartTuning can balance inference accuracy, training speed and inference memory usage during the tuning process, and thus maximizes the overall performance of a ConvNet system. Our experiments show that SmartTuning can stably identify the hyper-parameter sets that offer very close accuracy with faster training speed (i.e., 7×-11× over MNIST and 2×-3× over CIFAR-10) and much less inference memory usage (i.e., 17×-23× over MNIST and 4×-9× over CIFAR-10), compared with existing tuning approaches.
What problem does this paper attempt to address?