Metall: A Persistent Memory Allocator For Data-Centric Analytics

Keita Iwabuchi,Karim Youssef,Kaushik Velusamy,Maya Gokhale,Roger Pearce
DOI: https://doi.org/10.48550/arXiv.2108.07223
2022-03-01
Abstract:Data analytics applications transform raw input data into analytics-specific data structures before performing analytics. Unfortunately, such data ingestion step is often more expensive than analytics. In addition, various types of NVRAM devices are already used in many HPC systems today. Such devices will be useful for storing and reusing data structures beyond a single process life cycle. We developed Metall, a persistent memory allocator built on top of the memory-mapped file mechanism. Metall enables applications to transparently allocate custom C++ data structures into various types of persistent memories. Metall incorporates a concise and high-performance memory management algorithm inspired by Supermalloc and the rich C++ interface developed by <a class="link-external link-http" href="http://Boost.Interprocess" rel="external noopener nofollow">this http URL</a> library. On a dynamic graph construction workload, Metall achieved up to 11.7x and 48.3x performance improvements over <a class="link-external link-http" href="http://Boost.Interprocess" rel="external noopener nofollow">this http URL</a> and memkind (PMEM kind), respectively. We also demonstrate Metall's high adaptability by integrating Metall into a graph processing framework, GraphBLAS Template Library. This study's outcomes indicate that Metall will be a strong tool for accelerating future large-scale data analytics by allowing applications to leverage persistent memory efficiently.
Distributed, Parallel, and Cluster Computing
What problem does this paper attempt to address?