Optimizing FPGA Logic Block Architectures for Arithmetic

Kevin E. Murray,Jason Luu,Matthew J. P. Walker,Conor McCullough,Sen Wang,Safeen Huda,Bo Yan,Charles Chiasson,Kenneth B. Kent,Jason Anderson,Jonathan Rose,Vaughn Betz
DOI: https://doi.org/10.1109/tvlsi.2020.2965772
2020-06-01
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Abstract:Hardened adder and carry logic is widely used in commercial field-programmable gate arrays (FPGAs) to improve the efficiency of arithmetic functions. There are many design choices and complexities associated with such hardening, including circuit design, FPGA architectural choices, and the computer-aided design (CAD) flow. However, these choices have not been studied much and hence we explore a number of possibilities. We also highlight front-end elaboration optimization that helps ameliorate the restrictions placed on logic synthesis by hardened arithmetic. We show that hard adders and carry chains increase the performance of simple adders by a factor of 4 or more, but on larger benchmark designs that contain arithmetic improve the overall performance by 15%. Our results also show that for complete application circuits simple hardened ripple-carry adders perform as well as more complex carry-lookahead adders. Our best non-fracturable lookup table (non-fLUT) architecture with hardened arithmetic yields 12% better area–delay product than architectures without hardened arithmetic. We also investigate the impact of fLUTs and their interaction with hardened arithmetic. We find that fLUTs offer significant (12%–15%) area reduction, which is complementary to the delay reduction of hardened arithmetic. Therefore, our best fLUT architectures which use two bits of hardened arithmetic achieve 25% better area–delay product than non-fLUT architectures without hardened arithmetic.
engineering, electrical & electronic,computer science, hardware & architecture
What problem does this paper attempt to address?