Vertical, Temporal, and Horizontal Scaling of Hierarchical Hypersparse GraphBLAS Matrices

Jeremy Kepner,Tim Davis,Chansup Byun,William Arcand,David Bestor,William Bergeron,Vijay Gadepally,Matthew Hubbell,Michael Houle,Michael Jones,Anna Klein,Lauren Milechin,Julie Mullen,Andrew Prout,Albert Reuther,Antonio Rosa,Siddharth Samsi,Charles Yee,Peter Michaleas
DOI: https://doi.org/10.1109/HPEC49654.2021.9622802
2021-08-15
Abstract:Hypersparse matrices are a powerful enabler for a variety of network, health, finance, and social applications. Hierarchical hypersparse GraphBLAS matrices enable rapid streaming updates while preserving algebraic analytic power and convenience. In many contexts, the rate of these updates sets the bounds on performance. This paper explores hierarchical hypersparse update performance on a variety of hardware with identical software configurations. The high-level language bindings of the GraphBLAS readily enable performance experiments on simultaneous diverse hardware. The best single process performance measured was 4,000,000 updates per second. The best single node performance measured was 170,000,000 updates per second. The hardware used spans nearly a decade and allows a direct comparison of hardware improvements for this computation over this time range; showing a 2x increase in single-core performance, a 3x increase in single process performance, and a 5x increase in single node performance. Running on nearly 2,000 MIT SuperCloud nodes simultaneously achieved a sustained update rate of over 200,000,000,000 updates per second. Hierarchical hypersparse GraphBLAS allows the MIT SuperCloud to analyze extremely large streaming network data sets.
Distributed, Parallel, and Cluster Computing,Discrete Mathematics,Mathematical Software,Networking and Internet Architecture,Performance
What problem does this paper attempt to address?