Abstract: The IceCube Neutrino Observatory is a cubic-kilometer neutrino telescope located at the Geographic South Pole. For every observed neutrino event, there are over $10^6$ background events caused by cosmic ray air shower muons. In order to properly separate signal from background, it is necessary to produce Monte Carlo simulations of these air showers. Although IceCube has produced large quantities of background simulation to date, these studies remain statistics limited. The first stage of simulation requires heavy CPU usage, while the second stage requires heavy GPU usage. Processing both stages on the same node results in an underutilized GPU, but using different nodes runs into bandwidth bottlenecks. Furthermore, due to the power-law energy spectrum of cosmic rays, the memory footprint of the detector response often exceeds the limit in unpredictable ways. This proceeding presents new client-server code which parallelizes the first stage onto multiple CPUs on the same node and then passes the output on to the GPU for photon propagation. This results in GPU utilization of greater than 90%, more predictable memory usage, and an overall factor of 20 improvement in speed over previous techniques.

What problem does this paper attempt to address?

This paper addresses computational efficiency problems in background characterization for the IceCube Neutrino Observatory. Specifically, it focuses on improving the simulation efficiency of background events by parallelizing air-shower simulations. The background consists mainly of muons produced in air showers caused by cosmic rays. These events far outnumber neutrino events, so accurate simulation requires a large amount of computational resources in order to separate signal from background.

The paper identifies the following main problems with the traditional simulation chain:

1. **Inefficient use of computational resources**: In the traditional method, CPU-intensive and GPU-intensive tasks are executed alternately, resulting in low GPU utilization. In addition, excessive memory usage often causes jobs to run out of memory.
2. **Input/output bottlenecks**: Because muon propagation in ice is highly stochastic, most muons never reach the detector volume, which leads to input/output bottlenecks when simulating low-energy primary particles.
3. **Memory management problems**: The memory footprint of high-energy events often exceeds the available memory, so the job is interrupted and must be resubmitted.

To solve these problems, the paper proposes a new client-server simulation framework. The main improvements include:

1. **Parallelizing CPU tasks**: CPU tasks are parallelized on the same node, which improves CPU utilization.
2. **Optimizing GPU usage**: Photon propagation is processed in batches, so GPU utilization exceeds 90%.
3. **Reducing memory usage**: Optimized data structures and parallel processing significantly reduce the memory footprint.
4. **Adaptive sampling**: An adaptive sampling scheme dynamically adjusts the simulation parameters according to muon energy and zenith angle, further improving computational efficiency.

Through these improvements, the new simulation chain runs about 20 times faster overall while keeping the scientific results consistent.
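The client-server idea summarized above (several CPU-side producers feeding one batching GPU-side consumer) can be illustrated with a minimal producer/consumer sketch. This is not IceCube's actual code: all names (`cpu_worker`, `gpu_server`, `run_pipeline`, `BATCH_SIZE`, and so on) are hypothetical, Python threads stand in for separate CPU processes, and the "GPU" stage only groups events into batches instead of propagating photons. The bounded queue shows one way memory use could be kept predictable, since producers block once the buffer is full.

```python
import queue
import threading

# All constants below are illustrative assumptions, not values from the paper.
BATCH_SIZE = 4          # events handed to the "GPU" stage at once
N_WORKERS = 3           # CPU-side producers on the same node
EVENTS_PER_WORKER = 5   # events each producer generates
_DONE = object()        # sentinel: a producer has finished


def cpu_worker(worker_id, out_queue):
    """Stand-in for the CPU-heavy first stage (air shower generation)."""
    for i in range(EVENTS_PER_WORKER):
        # put() blocks while the queue is full, which caps buffered events.
        out_queue.put((worker_id, i))
    out_queue.put(_DONE)


def gpu_server(in_queue, n_workers, batch_size):
    """Stand-in for the GPU stage: drain events and group them into batches."""
    batches, batch, finished = [], [], 0
    while finished < n_workers:
        item = in_queue.get()
        if item is _DONE:
            finished += 1
            continue
        batch.append(item)
        if len(batch) == batch_size:
            batches.append(batch)   # a real server would launch a GPU job here
            batch = []
    if batch:                       # flush the final, possibly partial batch
        batches.append(batch)
    return batches


def run_pipeline(n_workers=N_WORKERS, batch_size=BATCH_SIZE):
    """Start the producers, run the batching consumer, and return all batches."""
    q = queue.Queue(maxsize=2 * batch_size)  # bounded buffer between stages
    workers = [threading.Thread(target=cpu_worker, args=(w, q))
               for w in range(n_workers)]
    for t in workers:
        t.start()
    batches = gpu_server(q, n_workers, batch_size)
    for t in workers:
        t.join()
    return batches


if __name__ == "__main__":
    for n, b in enumerate(run_pipeline()):
        print(f"batch {n}: {len(b)} events")
```

In this sketch every batch except possibly the last is full, which is the property that keeps a batched consumer busy. A real implementation would likely use separate processes for the CPU-bound stage and a network or shared-memory channel between client and server.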

Parallelizing Air Shower Simulation for Background Characterization in IceCube

A Simple Parallelization Scheme for Extensive Air Shower Simulations

HyGrid: A CPU-GPU Hybrid Convolution-Based Gridding Algorithm in Radio Astronomy.

Simulation of radio signals from cosmic-ray cascades in air and ice as observed by in-ice Askaryan radio detectors

Simulation of in-ice cosmic ray air shower induced particle cascades

In-ice Askaryan Emission from Air Showers: Implications for Radio Neutrino Detectors

IceCube experience using XRootD-based Origins with GPU workflows in PNRP

An Algorithm to Parallelise Parton Showers on a GPU

Towards a cosmic ray composition measurement with the IceAct telescopes at the IceCube Neutrino Observatory

CPU/GPU Computing for Long-Wave Radiation Physics on Large GPU Clusters.

Parallel processing of radio signals and detector arrays in CORSIKA 8

Multi-radiators Scattering Characteristic Solver via ARCS Theory and GPU Acceleration

GPU Acceleration of Monte Carlo Simulations for Polarized Photon Scattering in Anisotropic Turbid Media.

Prospects for a radio air-shower detector at South Pole

Studying the mass sensitivity of air-shower observables using simulated cosmic rays

Compute Unified Device Architecture (Cuda)-Based Parallelization of WRF Kessler Cloud Microphysics Scheme

Optical Photon Emission in Extended Airshowers -- Hybrid computing in the context of CORSIKA 8

Simulations of cross media showers with CORSIKA 8

Simulation and Reconstruction Study of a Future Surface Scintillator Array at the IceCube Neutrino Observatory

Improved modeling of in-ice particle showers for IceCube event reconstruction

Novel Simulation Framework for Analyzing Cosmic Ray Particle Distributions at a Global Scale