Customized Network-on-Chip for Message Reduction.

Hongwei Wang, Siyu Lu, Youhui Zhang, Guangwen Yang, Weimin Zheng
DOI: https://doi.org/10.1007/978-3-319-11197-1_41
2014-01-01
Abstract: This paper proposes a network-on-chip (NoC) design customized for message reduction. It enhances some common routers with a special Reduce Processing Unit (RPU) that completes reduce computations hop by hop and adaptively learns the transmission path of reduction messages. More specifically, for a reduction on a small data set, the corresponding data is transmitted through the NoC directly, so enhanced routers along the transmission path can complete the reduction in place, which both speeds up processing and coalesces messages. An adaptive method for the deterministic routing algorithm is also introduced so that these routers can learn transmission paths accurately, improving processing efficiency. We present the detailed micro-architecture design and evaluate its performance, power consumption, and chip area. Test results show that this design improves reduction / all_reduce performance by 2.67 to 11.76 times, while power consumption and chip-area cost both remain limited.
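To make the hop-by-hop idea concrete, the following is a minimal Python sketch, not the paper's design. It assumes a 2D mesh, XY (dimension-order) routing as the deterministic algorithm, and an associative, commutative reduce operator; the abstract specifies none of these. The function names (xy_path, reduce_at_root, reduce_in_network) are hypothetical. It only counts link traversals to illustrate message coalescing and does not model the RPU micro-architecture, the adaptive path learning, timing, or power.

```python
from operator import add


def xy_path(src, dst):
    """Routers visited from src to dst under XY routing (assumed here)."""
    x, y = src
    path = [(x, y)]
    while x != dst[0]:          # travel along X first
        x += 1 if dst[0] > x else -1
        path.append((x, y))
    while y != dst[1]:          # then along Y
        y += 1 if dst[1] > y else -1
        path.append((x, y))
    return path


def reduce_at_root(values, root, op=add):
    """Baseline: every message travels its full path; the root combines them."""
    link_traversals = 0
    result = None
    for src, value in values.items():
        link_traversals += len(xy_path(src, root)) - 1
        result = value if result is None else op(result, value)
    return result, link_traversals


def reduce_in_network(values, root, op=add):
    """Hop-by-hop: each router merges what it holds and forwards one message."""
    # With a deterministic route, every router has a single next hop to root.
    next_hop = {}
    for src in values:
        path = xy_path(src, root)
        for here, nxt in zip(path, path[1:]):
            next_hop[here] = nxt

    def distance(node):
        return abs(node[0] - root[0]) + abs(node[1] - root[1])

    partial = dict(values)      # partial result currently held at each router
    link_traversals = 0
    # Farthest routers first, so all inputs arrive before a router forwards.
    for node in sorted(next_hop, key=distance, reverse=True):
        value = partial.pop(node)
        nxt = next_hop[node]
        partial[nxt] = op(partial[nxt], value) if nxt in partial else value
        link_traversals += 1    # one coalesced message per outgoing link
    return partial.get(root), link_traversals


# 4x4 mesh, every node contributes one value, reduction rooted at (0, 0).
values = {(x, y): x + 4 * y for x in range(4) for y in range(4)}
print(reduce_at_root(values, (0, 0)))     # (120, 48)
print(reduce_in_network(values, (0, 0)))  # (120, 15)
```

In this toy setting both schemes produce the same sum, but the in-network version sends one coalesced message over each link of the routing tree (15 link traversals) instead of forwarding every message all the way to the root (48). These counts illustrate coalescing only and are unrelated to the speedups reported in the abstract.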