Online Data Analysis and Reduction: an Important Co-design Motif for Extreme-Scale Computers.

Ian Foster, Mark Ainsworth, Julie Bessac, Franck Cappello, Jong Choi, Sheng Di, Zichao Di, Ali M. Gok, Hanqi Guo, Kevin A. Huck, Christopher Kelly, Scott Klasky, Kerstin Kleese van Dam, Xin Liang, Kshitij Mehta, Manish Parashar, Tom Peterka, Line Pouchard, Tong Shu, Ozan Tugluk, Hubertus van Dam, Lipeng Wan, Matthew Wolf, Justin M. Wozniak, Wei Xu, Igor Yakushin, Shinjae Yoo, Todd Munson
DOI: https://doi.org/10.1177/10943420211023549
2021-01-01
Abstract: A growing disparity between supercomputer computation speeds and I/O rates means that it is rapidly becoming infeasible to analyze supercomputer application output only after that output has been written to a file system. Instead, data-generating applications must run concurrently with data reduction and/or analysis operations, with which they exchange information via high-speed methods such as interprocess communications. The resulting parallel computing motif, online data analysis and reduction (ODAR), has important implications for both application and HPC systems design. Here we introduce the ODAR motif and its co-design concerns, describe a co-design process for identifying and addressing those concerns, present tools that assist in the co-design process, and present case studies to illustrate the use of the process and tools in practical settings.