Performance of Convolution Neural Network based on Multiple GPUs with Different Data Communication Models

Che-Lun Hung,Yi-Yang Lin,Chuan Yi Tang,Chilung Wang,Ming-Chiang Chen
DOI: https://doi.org/10.1109/SNPD.2018.8441056
2018-01-01
Abstract:Recently, deep learning technologies have been utilized in many scientific domains successfully. Convolution neural networks are common used in image understanding problems. However, to train a convolution neural network model with huge amount of images is time-consuming task. Most of deep learning frameworks, such as Caffe, TensorFlow, Torch, Keras, MxNet, and so forth, support GPU to train model fast; especially executing these models on multiple GPUs. In this work, we present the comparison of computation performance of AlexNet among different GPU servers and hyperparameters. The results shows that GPU servers with high bandwidth rate, NVLINK, can achieve better performance than others.
What problem does this paper attempt to address?