Computational Performance and Energy Efficiency of ARM based HPC servers

Oskar Schirmer
2024-06-12
Abstract:HPC world is dominated by x86 ISA CPUs. This monoculture is not necessarily justified by best performance evaluation, but may inherit from e.g. SW related restrictions on the choice of HW platforms. To avoid running (further) into path dependency, alternate HW platforms need to be evaluated for performance compared to existing HPC setup. As a result, it may turn out alternate HW platforms are more efficient for HPC. In any case, even if performance differences are low, avoiding path dependencies that stem from HW choice restrictions simplifies switching to different HW platforms in future, should suitable systems evolve. Moreover, broadening the perspective to generic HW platforms may trigger cooperation and wield influence on HW platform development, resulting in HW/SW co-design advantages.
Distributed, Parallel, and Cluster Computing
What problem does this paper attempt to address?
The problems that this paper attempts to solve mainly focus on evaluating the advantages and disadvantages of high - performance computing (HPC) servers based on the ARM architecture in terms of computing performance and energy efficiency compared to existing x86 - architecture HPC systems. Specifically, the paper focuses on the following aspects: 1. **Performance Evaluation**: By using the GROMACS molecular dynamics simulation software as a benchmarking tool, evaluate the performance differences of running the same task on different hardware platforms. This includes the measurement of running time and the comparison of performance indicators (such as ns/day). 2. **Energy Efficiency**: In addition to performance, the paper also pays special attention to the energy consumption of different systems when performing the same task, that is, energy efficiency. By measuring the power consumption of different systems and combining it with the running time to calculate the energy required for each nanosecond of calculation, the energy efficiency of the system is evaluated. 3. **Avoiding Path Dependence**: The paper points out that the dominance of the x86 architecture in the current HPC field may not be entirely based on the optimal performance choice, but is affected by factors such as software compatibility. Therefore, evaluating HPC systems based on the ARM architecture helps break this path dependence and provides more flexibility for future hardware selection. 4. **Hardware/Software Co - design**: By evaluating the performance of different hardware platforms, the co - design of hardware and software can be promoted, thereby driving the development of HPC technology. For example, through this evaluation, certain specific hardware characteristics that have a significant impact on performance or energy efficiency can be discovered, thus guiding the design direction of future hardware. In summary, the main purpose of this paper is to provide a comprehensive understanding of the advantages and disadvantages of ARM - architecture servers compared to traditional x86 - architecture servers in the HPC field through detailed performance and energy efficiency evaluations, thereby promoting the diversified development of HPC technology.