Efficient FPGA-Based Accelerator of the L-BFGS Algorithm for IoT Applications.

Huiyang Xiong, Bohang Xiong, Wenhao Wang, Jing Tian, Hao Zhu, Zhongfeng Wang
DOI: https://doi.org/10.1109/iscas46773.2023.10181544
2023-01-01
Abstract: Internet of Things (IoT)-centric applications, such as augmented reality and self-driving cars, require real-time task processing, large bandwidth, and low data transmission latency. FPGA-based edge computing is considered an effective solution to these challenges. Nonlinear optimization methods are an important tool in these applications, but their computation-intensive, data-dependent operations limit their use in real time. The limited-memory Broyden-Fletcher-Goldfarb-Shanno (L-BFGS) algorithm ranks among the most efficient algorithms for large-scale optimization problems. In this paper, we propose, for the first time, a highly parallel FPGA-based architecture for the two key parts of the L-BFGS algorithm: the search direction computation and the line search. Compared with the CPU implementation, the FPGA implementations of the search direction computation and the line search achieve speedups of $\mathbf{39.73}\times$ and $\mathbf{5.50}\times$, respectively. Compared with a straightforward GPU implementation, the search direction computation on FPGA obtains a speedup of $\mathbf{31.03}\times$.
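For readers unfamiliar with the two parts named in the abstract, the sketch below shows them in plain software form. It assumes the textbook two-loop recursion for the search direction and a simple Armijo backtracking line search; the abstract does not say which formulations the authors use, and this is not their hardware design. All function names, parameters, and default values here are hypothetical and chosen for illustration only.

```python
def dot(a, b):
    """Inner product of two equal-length vectors stored as lists."""
    return sum(x * y for x, y in zip(a, b))


def search_direction(grad, s_hist, y_hist):
    """Textbook two-loop recursion: returns d = -H * grad (assumed variant)."""
    q = list(grad)
    coeffs = []
    # First loop: walk the stored (s, y) pairs from newest to oldest.
    for s, y in zip(reversed(s_hist), reversed(y_hist)):
        rho = 1.0 / dot(y, s)
        alpha = rho * dot(s, q)
        q = [qi - alpha * yi for qi, yi in zip(q, y)]
        coeffs.append((rho, alpha))
    # Scale by gamma = s^T y / y^T y as the initial inverse-Hessian guess.
    gamma = 1.0
    if s_hist:
        gamma = dot(s_hist[-1], y_hist[-1]) / dot(y_hist[-1], y_hist[-1])
    r = [gamma * qi for qi in q]
    # Second loop: walk the pairs from oldest to newest.
    for (s, y), (rho, alpha) in zip(zip(s_hist, y_hist), reversed(coeffs)):
        beta = rho * dot(y, r)
        r = [ri + (alpha - beta) * si for ri, si in zip(r, s)]
    return [-ri for ri in r]


def backtracking_line_search(f, x, grad, d, step=1.0, c1=1e-4,
                             shrink=0.5, max_iter=50):
    """Shrink the step until the Armijo sufficient-decrease test passes."""
    fx = f(x)
    slope = dot(grad, d)
    for _ in range(max_iter):
        x_new = [xi + step * di for xi, di in zip(x, d)]
        if f(x_new) <= fx + c1 * step * slope:
            break
        step *= shrink
    return step


def lbfgs(f, grad_f, x0, m=5, iters=100, tol=1e-8):
    """Minimal L-BFGS driver that keeps at most m correction pairs."""
    x, g = list(x0), grad_f(x0)
    s_hist, y_hist = [], []
    for _ in range(iters):
        if dot(g, g) ** 0.5 < tol:
            break
        d = search_direction(g, s_hist, y_hist)
        t = backtracking_line_search(f, x, g, d)
        x_new = [xi + t * di for xi, di in zip(x, d)]
        g_new = grad_f(x_new)
        s = [a - b for a, b in zip(x_new, x)]
        y = [a - b for a, b in zip(g_new, g)]
        if dot(s, y) > 1e-12:  # keep only pairs with positive curvature
            s_hist.append(s)
            y_hist.append(y)
            if len(s_hist) > m:
                s_hist.pop(0)
                y_hist.pop(0)
        x, g = x_new, g_new
    return x


# Usage: minimize a small convex quadratic with minimizer (1, -2).
f = lambda v: (v[0] - 1.0) ** 2 + 10.0 * (v[1] + 2.0) ** 2
grad_f = lambda v: [2.0 * (v[0] - 1.0), 20.0 * (v[1] + 2.0)]
x_min = lbfgs(f, grad_f, [0.0, 0.0])
```

In this sketch the search direction consists almost entirely of inner products and scaled vector additions, while each loop iteration depends on the result of the previous one. That combination of parallel vector work and sequential dependency is consistent with the abstract's description of the operations as computation-intensive and data-dependent.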