Procrastination Is All You Need: Exponent Indexed Accumulators for Floating Point, Posits and Logarithmic Numbers

Vincenzo Liguori

2024-06-10

Abstract:This paper discusses a simple and effective method for the summation of long sequences of floating point numbers. The method comprises two phases: an accumulation phase where the mantissas of the floating point numbers are added to accumulators indexed by the exponents and a reconstruction phase where the actual summation result is finalised. Various architectural details are given for both FPGAs and ASICs including fusing the operation with a multiplier, creating efficient MACs. Some results are presented for FPGAs, including a tensor core capable of multiplying and accumulating two 4x4 matrices of bfloat16 values every clock cycle using ~6,400 LUTs + 64 DSP48 in AMD FPGAs at 700+ MHz. The method is then extended to posits and logarithmic numbers.

Computer Vision and Pattern Recognition,Artificial Intelligence,Hardware Architecture

What problem does this paper attempt to address?

The paper aims to address the problem of summing long sequences of floating-point numbers. Specifically, it proposes a simple and effective method to handle the addition of a large number of floating-point numbers. This method is divided into two stages: the accumulation stage, where all mantissas with the same exponent are partially summed; and the reconstruction stage, where the total sum is derived from these partial sums. Additionally, the paper extends this method to Posits (a variable precision floating-point number representation) and logarithmic number representation. The paper points out that summing long sequences of floating-point numbers is an extremely important operation in fields such as computational science, convolutional neural networks, and large language models. Efficiently performing this operation is crucial for improving overall performance. To achieve this goal, the paper details hardware implementation schemes and explores optimization designs in FPGA and ASIC. Furthermore, it discusses the fusion of multiply-accumulate operations and application examples in different formats, such as the design of Tensor Cores. Finally, the paper investigates the advantages of logarithmic number representation in low-bit applications like neural networks and its compression effects.

Procrastination Is All You Need: Exponent Indexed Accumulators for Floating Point, Posits and Logarithmic Numbers

A Low Latency High Throughput Multiply-accumulator Unit for Float Point and Integer

Area-Efficient Iterative Logarithmic Approximate Multipliers for IEEE 754 and Posit Numbers

Implementation of a Structure-Efficient Multiple-Input Floating-Point Adder on FPGAs

Optimizing Logarithmic Arithmetic on FPGAs

Efficient implementation of signed multipliers on FPGAs

Comparing Floating-Point and Logarithmic Number Representations for Reconfigurable Acceleration

A Fused Continuous Floating-Point Mac On Fpga

FPGA Designs with Optimized Logarithmic Arithmetic

A Design Framework for Hardware-Efficient Logarithmic Floating-Point Multipliers

Optimized design of fast floating-point adder

A Synthesis Method of General Floating-Point Arithmetic Units by Aligned Partition

A Floating-point Coprocessor Configured by a FPGA in a Digital Platform Based on Fixed-point DSP for Power Electronics

Optimizing FPGA-Based DNN Accelerator with Shared Exponential Floating-Point Format

Design Of High Performance IEEE- 754 Single Precision (32 bit) Floating Point Adder Using VHDL

A Reconfigurable Floating-Point Compute-In-Memory with Analog Exponent Pre-Processes

Research of High-Speed Pipelined Floating-Point Multipfier Design

Floating-Point Multiply-Accumulative Processing Element on FPGAs

High-Efficiency Compressor Trees for Latest AMD FPGAs

Leveraging the bfloat16 Artificial Intelligence Datatype For Higher-Precision Computations

Research of Floating-point Summation and Dot-product Computing Architecture