Efficient DNN Algorithm Design and Hardware Acceleration for Low-Level Vision

Wendong Mao,Siyu Zhang,Zhongfeng Wang
DOI: https://doi.org/10.5772/intechopen.1007291
2024-01-01
Abstract:Deep learning has garnered widespread attention in low-level vision tasks due to its automatic feature learning capabilities. However, high computational complexity and memory requirements hinder deployment at the edge. This chapter provides a top-down solution for efficiently implementing intelligent image and video computing across algorithm, hardware, and deployment levels. Firstly, we design high-accuracy DNN algorithms for specific visual tasks, exemplified by the first text style transfer algorithm using complex image features. Secondly, we explore hardware-friendly algorithm optimizations and design efficient DNN hardware accelerators for typical visual applications such as image generation, video compression, and super-resolution. Thirdly, we introduce a precision-scalable computing method and an agile hardware mapping framework to achieve efficient deployment of algorithms. We unify convolution, deconvolution, and deformable convolution into an input-adaptive deformable convolution and design a versatile accelerator for various image enhancement tasks. Finally, we provide extensive experimental results to validate the effectiveness of the proposed hardware acceleration solutions.
What problem does this paper attempt to address?