TriPoll: Computing Surveys of Triangles in Massive-Scale Temporal Graphs with Metadata

Trevor Steil,Tahsin Reza,Keita Iwabuchi,Benjamin W. Priest,Geoffrey Sanders,Roger Pearce
DOI: https://doi.org/10.48550/arXiv.2107.12330
2021-07-27
Abstract:Understanding the higher-order interactions within network data is a key objective of network science. Surveys of metadata triangles (or patterned 3-cycles in metadata-enriched graphs) are often of interest in this pursuit. In this work, we develop TriPoll, a prototype distributed HPC system capable of surveying triangles in massive graphs containing metadata on their edges and vertices. We contrast our approach with much of the prior effort on triangle analysis, which often focuses on simple triangle counting, usually in simple graphs with no metadata. We assess the scalability of TriPoll when surveying triangles involving metadata on real and synthetic graphs with up to hundreds of billions of <a class="link-external link-http" href="http://edges.We" rel="external noopener nofollow">this http URL</a> utilize communication-reducing optimizations to demonstrate a triangle counting task on a 224 billion edge web graph in approximately half of the time of competing approaches, while additionally supporting metadata-aware capabilities.
Distributed, Parallel, and Cluster Computing
What problem does this paper attempt to address?