Parallel Optimization for Accelerating the Generation of Correctly Rounded Elementary Functions

Xianglin Wang,Xin Yi,Hengbiao Yu,Chun Huang,Lin Peng
DOI: https://doi.org/10.1145/3673038.3673125
2024-01-01
Abstract:Correctly rounded elementary mathematical functions are crucial for numerical computations and scientific applications. Generating these functions accurately is a challenging task. The latest methods automate this process by transforming the problem of generating correctly rounded elementary mathematical functions into a linear programming problem. However, this generation process is serial, and the inefficiency of serialization hinders the creation of new elementary mathematical functions and limits the broader application of the technique. This paper introduces a novel parallel method to accelerate the generation of correctly rounded elementary mathematical functions. The primary approach is to optimize the generation process through two dimensions: task decomposition and data parallelism, effectively mitigating data dependency and competition. This adaptation renders it suitable for conventional parallel models. We implemented our approach to generate six elementary functions. Our experimental results demonstrate that our parallel method can automate the generation of correctly rounded elementary math functions, achieving a speedup of 9.2x over the state-of-the-art approach, RLIBM.
What problem does this paper attempt to address?