Software-Hardware Codesign for Efficient Neural Network Acceleration

Kaiyuan Guo,Song Han,Song Yao,Yu Wang,Yuan Xie,Huazhong Yang
DOI: https://doi.org/10.1109/mm.2017.39
IF: 2.8212
2017-01-01
IEEE Micro
Abstract:Designers making deep learning computing more efficient cannot rely solely on hardware. Incorporating software-optimization techniques such as model compression leads to significant power savings and performance improvement. This article provides an overview of DeePhi's technology flow, including compression, compilation, and hardware acceleration. Two accelerators, named Aristotle and Descartes, are designed to achieve extremely high energy efficiency for both client and datacenter applications with convolutional neural network and recurrent neural network, respectively.
What problem does this paper attempt to address?