Tactics to Directly Map CNN Graphs on Embedded FPGAs

K. Abdelouahab,M. Pelcat,J. Sérot,C. Bourrasset,F. Berry,J. Serot
DOI: https://doi.org/10.1109/les.2017.2743247
IF: 1.524
2017-12-01
IEEE Embedded Systems Letters
Abstract:Deep convolutional neural networks (CNNs) are the state-of-the-art in image classification. Since CNN feed forward propagation involves highly regular parallel computation, it benefits from a significant speed-up when running on fine grain parallel programmable logic devices. As a consequence, several studies have proposed field-programmable gate array (FPGA)-based accelerators for CNNs. However, because of the large computational power required by CNNs, none of the previous studies has proposed a direct mapping of the CNN onto the physical resources of an FPGA, allocating each processing actor to its own hardware instance. In this letter, we demonstrate the feasibility of the so called direct hardware mapping (DHM) and discuss several tactics we explore to make DHM usable in practice. As a proof of concept, we introduce the HADDOC2 open source tool, that automatically transforms a CNN description into a synthesizable hardware description with platform-independent DHM.
computer science, software engineering, hardware & architecture
What problem does this paper attempt to address?