Optimization of BLAS based on Loongson 2F architecture

GU Nai-Jie,LI Kai,CHEN Guo-Liang,WU Chao
2008-01-01
Journal of University of Science and Technology of China
Abstract:BLAS are standard operations to efficiently solve the linear algebra problems on high performance computers.Some new optimization technologies on data prefetch and instruction scheduling developed specifically for Loongson 2F characteristics were proposed based on normal optimization technologies to give full play to develop the performance of Loongson 2F processer and implement a high performance BLAS on KD-50-I platform.According to the experiments,the actual double float operation peak of high performance BLAS on 750 MHz Loongson 2F processor(double float peak 3 Gflops) can reach 1.47 GHz,which is more than 6 times higher than BLAS,and 45% higher than ATLAS.
What problem does this paper attempt to address?