Parallelizing Air Shower Simulation for Background Characterization in IceCube

Kevin Meagher,Jakob van Santen
DOI: https://doi.org/10.1051/epjconf/202429511016
2024-05-09
Abstract:The IceCube Neutrino Observatory is a cubic kilometer neutrino telescope located at the Geographic South Pole. For every observed neutrino event, there are over $10^6$ background events caused by cosmic ray air shower muons. In order to properly separate signal from background, it is necessary to produce Monte Carlo simulations of these air showers. Although to-date, IceCube has produced large quantities of background simulation, these studies still remain statistics limited. The first stage of simulation requires heavy CPU usage while the second stage requires heavy GPU usage. Processing both of these stages on the same node will result in an underutilized GPU but using different nodes will encounter bandwidth bottlenecks. Furthermore, due to the power-law energy spectrum of cosmic rays, the memory footprint of the detector response often exceeded the limit in unpredictable ways. This proceeding presents new client-server code which parallelizes the first stage onto multiple CPUs on the same node and then passes it on to the GPU for photon propagation. This results in GPU utilization of greater than 90% as well as more predictable memory usage and an overall factor of 20 improvement in speed over previous techniques.
High Energy Astrophysical Phenomena
What problem does this paper attempt to address?
This paper aims to solve the computational efficiency problems encountered in the background characterization of the IceCube Neutrino Observatory. Specifically, the paper focuses on how to improve the simulation efficiency of background events by parallelizing air - shower simulations. Background events are mainly muons produced by air showers caused by cosmic rays. The number of these background events far exceeds that of neutrino events, so a large amount of computational resources are required for accurate simulation in order to separate signals from the background. The paper points out that the traditional simulation chain has the following main problems: 1. **Inefficient use of computational resources**: In the traditional method, CPU - intensive tasks and GPU - intensive tasks are executed alternately, resulting in low GPU utilization. At the same time, due to excessive memory occupation, the problem of memory overflow often occurs. 2. **Input/Output bottlenecks**: Due to the high randomness of muon propagation in ice, most muons cannot reach the detector area, which leads to a large number of input/Output bottlenecks in the simulation process of low - energy primary particles. 3. **Memory management problems**: The memory occupation of high - energy events often exceeds the available memory, causing the calculation task to be interrupted and need to be resubmitted. To solve these problems, the paper proposes a new client - server simulation framework. The main improvements include: 1. **Parallelizing CPU tasks**: By parallelizing CPU tasks on the same node, the CPU utilization is improved. 2. **Optimizing GPU usage**: By batch - processing photon propagation, the GPU utilization reaches more than 90%. 3. **Reducing memory occupation**: By optimizing the data structure and parallelizing processing, the memory occupation is significantly reduced. 4. **Adaptive sampling**: An adaptive sampling scheme is introduced, which dynamically adjusts the simulation parameters according to the muon energy and zenith angle, further improving the computational efficiency. Through these improvements, the new simulation chain achieves a significant performance improvement while maintaining the consistency of scientific results, with an overall speed increase of 20 times.