Design and Implementation of a Delay Optimized Multiply-Accumulate Unit for High Speed DSPs

Sheraz Anjum,CHEN Jie,LI Hai-jun
DOI: https://doi.org/10.3969/j.issn.1005-9490.2007.04.064
2007-01-01
Abstract:The Multiply-Accumulate [MAC] unit is a critical element in the data path of any DSP processor and has been a great focus of optimization by the hardware engineers in the last few years. This paper describes the design and implementation of a speed optimized MAC unit that is capable of performing 16×16+40 operations on unsigned and signed two's complement operands and is intended to be used in a high speed VLIW DSP Core. The proposed MAC is superior to the other MAC units implemented with the same or different algorithmic technologies in terms of critical delay. The said MAC has successfully been implemented, synthesized using synopsis tools and compared with the stream line MAC units of same data width from the synopsis design ware library. The comparison results showed that the proposed architecture is faster than all the other implementations from the synopsis's design ware IP library and is suitable for use in any DSP Core especially those requiring high throughput. Note: The comparison was taken under the same attributes and compile options.
What problem does this paper attempt to address?