Layrub

Bo Liu, Wenbin Jiang, Hai Jin, Xuanhua Shi, Yang Ma
DOI: https://doi.org/10.1145/3178487.3178528
2018-01-01
Abstract: Growing accuracy and robustness of Deep Neural Network (DNN) models are accompanied by growing model capacity (going deeper or wider). However, the high memory requirements of those models make it difficult to execute the training process on a single GPU. To address this, we first identify the memory usage characteristics of deep and wide convolutional networks, and demonstrate the opportunities for memory reuse at both the intra-layer and inter-layer levels. We then present Layrub, a runtime data placement strategy that orchestrates the execution of the training process. It achieves layer-centric reuse to reduce memory consumption for extreme-scale deep learning that cannot be run on a single GPU.