Performance Analysis of GPU-Based Convolutional Neural Networks

Xiaqing Li, Guangyan Zhang, H. Howie Huang, Zhufan Wang, Weimin Zheng
DOI: https://doi.org/10.1109/icpp.2016.15
2016-01-01
Abstract: As one of the most important deep learning models, convolutional neural networks (CNNs) have achieved great success in a number of applications such as image classification, speech recognition, and natural language understanding. Training CNNs on large data sets is computationally expensive, which has led to a flurry of research and development of open-source parallel implementations on GPUs. However, few studies have evaluated the performance characteristics of those implementations. In this paper, we conduct a comprehensive comparison of these implementations over a wide range of parameter configurations, investigate potential performance bottlenecks, and point out a number of opportunities for further optimization.