CNN Acceleration based on Dynamic Pruning and FPGAs Implementation

Qi Li, Hengyi Li, Lin Meng
DOI: https://doi.org/10.1109/ISOCC56007.2022.10031291
2022-01-01
Abstract: Convolutional Neural Networks (CNNs) have achieved exciting performance and are widely used in fields such as industry and cultural heritage protection. However, CNNs are computationally intensive and resource-consuming, which limits their application, especially on embedded systems. Therefore, this paper proposes a dynamic pruning method for compacting mainstream CNN models. Furthermore, we implement the compacted models on FPGAs for acceleration. Experimental results show that the proposed method compresses parameters and FLOPs by up to 80% and achieves an 80% reduction in inference time on FPGAs.