Customized Network-on-Chip Oriented to MPI Collective Operations

Siyu LU,Hongwei WANG,Youhui ZHANG,Guangwen YANG,Weimin ZHENG
DOI: https://doi.org/10.3969/j.issn.1000-3428.2017.06.001
2017-01-01
Abstract:According to the principle of computations approaching data,this paper proposes a design method of Networkon-Chip(NoC) oriented to MPI collective operations,which focuses on the hardware enhancement of common NoC routers to speed up MPI collective operations on the network layer.It designs MPI reduction,extends it to support more operations and combines it with an adaptive method for the deterministic routing algorithm,which can learn transmission paths of messages dynamically.Thus,enhanced routers can complete message processing in place,which not only speed up the processing procedure but also coalesce messages.The design method for detailed micro-architecture of NoC is presented.Different layout strategies of enhanced routers are compared and the corresponding performance,power consumption and extra chip-area are evaluated.Testing results show that,compared with ideal software-based implementation,the proposed method can improve the reduction performance by 6.4 ~ 41.7 times,broadcast by 15.3 ~ 31.2,global reduction by 5.4 ~ 9.7 times,and gather by 1.3 ~ 1.8 times,while the consumption of power and chip-area is limited.
What problem does this paper attempt to address?