Performance and Energy Optimization of Matrix Multiplication on Asymmetric big.LITTLE Processors

Sandra Catalán,Francisco D. Igual,Rafael Mayo,Luis Piñuel,Enrique S. Quintana-Ortí,Rafael Rodríguez-Sánchez
DOI: https://doi.org/10.48550/arXiv.1507.05129
2015-07-18
Abstract:Asymmetric processors have emerged as an appealing technology for severely energy-constrained environments, especially in the mobile market where heterogeneity in applications is mainstream. In addition, given the growing interest on ultra low-power architectures for high performance computing, this type of platforms are also being investigated in the road towards the implementation of energy- efficient high-performance scientific applications. In this paper, we propose a first step towards a complete implementation of the BLAS interface adapted to asymmetric ARM <a class="link-external link-http" href="http://big.LITTLE" rel="external noopener nofollow">this http URL</a> processors, analyzing the trade-offs between performance and energy efficiency when compared to existing homogeneous (symmetric) multi-threaded BLAS implementations. Our experimental results reveal important gains in performance while maintaining the energy efficiency of homogeneous solutions by efficiently exploiting all the resources of the asymmetric processor.
Distributed, Parallel, and Cluster Computing
What problem does this paper attempt to address?