A Fused Continuous Floating-Point Mac On Fpga

Min Yuan,Qianjian Xing,Zhen-guo Ma,Feng Yu,Yingke Xu
DOI: https://doi.org/10.1587/transfun.E101.A.1594
2018-01-01
IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences
Abstract:In this letter, we present a novel single-precision floating-point multiply-accumulator (FNA-MAC) to achieve lower hardware resource, reduced computing latency and improved computing accuracy for continuous dot product operations. By further fusing the normalization and alignment in the traditional FMA algorithm, the proposed architecture eliminates the first N - 1 normalization and rounding operations for an N point dot product, and preserves the precision of interim results in a significant bit size that is twice of that in the traditional methods. The normalization and rounding of the final result is processed at the cost of consuming an additional multiply-add operation. The simulation results show that the improvement in computational accuracy is significant. Meanwhile, when comparing to a recently published FMA design, the proposed FNA-MAC can reduce the slice look-up table/ flip-flop resource and computing latency by a fact of 18%, 33.3%, respectively.
What problem does this paper attempt to address?