Spatial Temporal Analysis of 40,000,000,000,000 Internet Darkspace Packets
Jeremy Kepner,Michael Jones,Daniel Andersen,Aydin Buluc,Chansup Byun,K Claffy,Timothy Davis,William Arcand,Jonathan Bernays,David Bestor,William Bergeron,Vijay Gadepally,Micheal Houle,Matthew Hubbell,Anna Klein,Chad Meiners,Lauren Milechin,Julie Mullen,Sandeep Pisharody,Andrew Prout,Albert Reuther,Antonio Rosa,Siddharth Samsi,Doug Stetson,Adam Tse,Charles Yee,Peter Michaleas
DOI: https://doi.org/10.1109/HPEC49654.2021.9622790
2021-08-15
Abstract:The Internet has never been more important to our society, and understanding the behavior of the Internet is essential. The Center for Applied Internet Data Analysis (CAIDA) Telescope observes a continuous stream of packets from an unsolicited darkspace representing 1/256 of the Internet. During 2019 and 2020 over 40,000,000,000,000 unique packets were collected representing the largest ever assembled public corpus of Internet traffic. Using the combined resources of the Supercomputing Centers at UC San Diego, Lawrence Berkeley National Laboratory, and MIT, the spatial temporal structure of anonymized source-destination pairs from the CAIDA Telescope data has been analyzed with GraphBLAS hierarchical hypersparse matrices. These analyses provide unique insight on this unsolicited Internet darkspace traffic with the discovery of many previously unseen scaling relations. The data show a significant sustained increase in unsolicited traffic corresponding to the start of the COVID19 pandemic, but relatively little change in the underlying scaling relations associated with unique sources, source fan-outs, unique links, destination fan-ins, and unique destinations. This work provides a demonstration of the practical feasibility and benefit of the safe collection and analysis of significant quantities of anonymized Internet traffic.
Networking and Internet Architecture,Distributed, Parallel, and Cluster Computing,Performance,Social and Information Networks