Optimal Many-to-many Personalized Concurrent Communication in RapidIO-based Fat-Trees

Lin Shu,Jie Hao,Yafang Song,Chengcheng Li,Donglin Wang
DOI: https://doi.org/10.1109/snpd.2016.7515923
2016-01-01
Abstract:Many-to-many (M-to-N) personalized concurrent communication is one of the most dense traffic patterns in high performance computing system. Each node in group M sends different messages to every node in group N. Hot-spot congestions are the bottleneck of this communication pattern. In this paper, a proactive congestion avoidance mechanism is proposed to optimize many-to-many personalized concurrent traffic in RapidIO-based fat-trees. And a heuristic scheme of virtual destination is proposed to deal with the case that messages from each node in M group to every other node in N group are not of the same size. Our proactive congestion avoidance mechanism has been used in a practical network, which is a part of digital signal processing platform for “the TianLai Pathfinder”. The network delay is effectively improved, reducing from about 34.784 us to 1.484 us. And the communication overhead is decreased by twice.
What problem does this paper attempt to address?