Design and performance optimization of a popular high performance computing system KD-90

Ye Cai, Gang Liu, Rui Mao, Qiuming Luo, Guoliang Chen
DOI: https://doi.org/10.3724/SP.J.1249.2013.02138
2013-01-01
Shenzhen Daxue Xuebao (Ligong Ban)/Journal of Shenzhen University Science and Engineering
Abstract: The first of its kind and built on the 8-core Godson 3B CPU, KD-90 is a teraflops high-performance computer system. It offers high computing density, low power consumption, low cost, and compact size. KD-90 uses three levels of parallelism: SMP→CC-NUMA→Cluster. The system interconnect design integrates both a common communication protocol and a proprietary communication protocol; as a result, breakthroughs in the performance of the CC-NUMA cluster are achieved. The parallel programming model of the system combines a general-purpose processor with vector coprocessor acceleration technology. Linpack performance optimization based on architectural features and the operating system kernel is discussed in detail.