Computing Just What You Need: Online Data Analysis and Reduction at Extreme Scales
Ian T. Foster,Mark Ainsworth,Bryce Allen,Julie Bessac,Franck Cappello,Jong Youl Choi,Emil M. Constantinescu,Philip E. Davis,Sheng Di,Wendy Di,Hanqi Guo,Scott Klasky,Kerstin Kleese van Dam,Tahsin M. Kurç,Qing Liu,Abid Malik,Kshitij Mehta,Klaus Mueller,Todd Munson,George Ostrouchov,Manish Parashar,Tom Peterka,Line Pouchard,Dingwen Tao,Ozan Tugluk,Stefan M. Wild,Matthew Wolf,Justin M. Wozniak,Wei Xu,Shinjae Yoo
DOI: https://doi.org/10.1109/HiPC.2017.00042
2017-01-01
Abstract:A growing disparity between supercomputer computation speeds and I/O rates makes it increasingly infeasible for applications to save all results for offline analysis. Instead, applications must analyze and reduce data online so as to output only those results needed to answer target scientific question(s). This change in focus complicates application and experiment design and introduces algorithmic, implementation, and programming model challenges that are unfamiliar to many scientists and that have major implications for the design of various elements of supercomputer systems. I review these challenges and describe methods and tools that various groups, including mine, are developing to enable experimental exploration of algorithmic, software, and system design alternatives.