T1000: Mitigating the Memory Footprint of Convolution Neural Networks with Decomposition and Re-Fusion.

Changxi Liu,Hailong Yang,Rui Wang,Zhongzhi Luan,Depei Qian
DOI: https://doi.org/10.1016/j.future.2018.02.024
IF: 7.307
2018-01-01
Future Generation Computer Systems
Abstract:In recent years, convolution neural networks have significantly advanced the frontier of computer vision and other intelligent applications due to its promising accuracy. However, the improved accuracy comes with the formidable computation complexity with deeper convolution layers, which prevents its adoption on resource constrained system such as embedded and mobile. Although research efforts have been devoted to reduce the computation complexity of convolution neural networks through tensor decomposition, the volume of intermediate data generated by the tensor decomposition grows dramatically, which consumes more memory resource and has not been addressed by existing work.& para;& para;In this work, we propose T1000 to re-fuse the convolutions across tensors after applying the canonical polyadic decomposition to conventional convolution layers so that we can receive the benefit of reduced computation complexity, in the meanwhile mitigate the memory occupancy of the intermediate data. We demonstrate the effectiveness of our approach by applying canonical polyadic decomposition and re-fusion to the convolution layers of two well-known convolution neural networks, AlexNet and VGG-19 implemented with Caffe. Compared to the default canonical polyadic decomposition, our approach reduces the memory occupancy of the intermediate data by 84.6% and 77.4% for AlexNet and VGG-19 respectively. In addition, our approach improves the performance of AlexNet and VGG-19 by 1.77x and 1.4x respectively. (C) 2018 Elsevier B.V. All rights reserved.
What problem does this paper attempt to address?