Experiences Readying Applications for Exascale
Paul T. Bauman, Reuben D. Budiardja, Dmytro Bykov, Noel Chalmers, Jacqueline Chen, Nicholas Curtis, Marc Day, Markus Eisenbach, Lucas Esclapez, Alessandro Fanfarillo, William Freitag, Nicholas Frontiere, Antigoni Georgiadou, Joseph Glenski, Kalyana Gottiparthi, Marc T. Henry de Frahan, Gustav R. Jansen, Wayne Joubert, Justin G. Lietz, Jakub Kurzak, Nicholas Malaya, Bronson Messer, Damon McDougall, Paul Mullowney, Stephen Nichols, Matthew Norman, Thomas Papatheodore, Jon Rood, Philip C. Roth, Sarat Sreepathi, James White III, Noah Wolfe
2023-10-03
Abstract: The advent of exascale computing invites an assessment of existing best practices for developing application readiness on the world's largest supercomputers. This work details observations from the last four years of preparing scientific applications to run on the Oak Ridge Leadership Computing Facility's (OLCF) Frontier system. The paper addresses a range of software topics, including programmability, tuning, and portability considerations, that are key to moving applications from existing systems to future installations. A set of representative workloads provides case studies for general system and software testing. We evaluate the use of early access systems for development across several generations of hardware. Finally, we discuss how best practices were identified and disseminated to the community through a wide range of activities, including user guides and training sessions. We conclude with recommendations for ensuring application readiness on future leadership computing systems.
Distributed, Parallel, and Cluster Computing