Optimization of Matrix Multiplication Based on a Multi-Core Architecture Extended with Vector Units

ZHU Haitao,CHEN Yunji,QIAN Cheng,WANG Ling,HU Weiwu
DOI: https://doi.org/10.3969/j.issn.0253-2778.2011.02.012
2011-01-01
Abstract:Based on the GODSON-3B 8-core processor,an optimized implementation and evaluation of matrix multiplication was proposed.For the memory access characteristic of each matrix in matrix multiplication,different methods were used to optimize the memory access behavior,hiding memory access time.The performance of optimized matrix multiplication achieves 122 Gflops,and an efficiency of 95.3%.
What problem does this paper attempt to address?