Abstract:Many scientific applications opt for particles instead of meshes as their basic primitives to model complex systems composed of billions of discrete entities. Such applications span a diverse array of scientific domains, including molecular dynamics, cosmology, computational fluid dynamics, and geology. The scale of the particles in those scientific applications increases substantially thanks to the ever-increasing computational power in high-performance computing (HPC) platforms. However, the actual gains from such increases are often undercut by obstacles in data management systems related to data storage, transfer, and processing. Lossy compression has been widely recognized as a promising solution to enhance scientific data management systems regarding such challenges, although most existing compression solutions are tailored for Cartesian grids and thus have sub-optimal results on discrete particle data. In this paper, we introduce LCP, an innovative lossy compressor designed for particle datasets, offering superior compression quality and higher speed than existing compression solutions. Specifically, our contribution is threefold. (1) We propose LCP-S, an error-bound aware block-wise spatial compressor to efficiently reduce particle data size. This approach is universally applicable to particle data across various domains. (2) We develop LCP, a hybrid compression solution for multi-frame particle data, featuring dynamic method selection and parameter optimization. (3) We evaluate our solution alongside eight state-of-the-art alternatives on eight real-world particle datasets from seven distinct domains. The results demonstrate that our solution achieves up to 104% improvement in compression ratios and up to 593% increase in speed compared to the second-best option, under the same error criteria.

Accelerating Lossy Compression on HPC Datasets Via Partitioning Computation for Parallel Processing

Accelerating Relative-error Bounded Lossy Compression for HPC Datasets with Precomputation-Based Mechanisms.

Performance Optimization for Relative-Error-Bounded Lossy Compression on Scientific Data.

cuSZ-$i$: High-Ratio Scientific Lossy Compression on GPUs with Optimized Multi-Level Interpolation

HoSZp: An Efficient Homomorphic Error-bounded Lossy Compressor for Scientific Data

SZ3: A Modular Framework for Composing Prediction-Based Error-Bounded Lossy Compressors

High-performance Effective Scientific Error-bounded Lossy Compression with Auto-tuned Multi-component Interpolation

Accelerating Parallel Write via Deeply Integrating Predictive Lossy Compression with HDF5

Total Variation Reduction for Lossless Compression of HPC Applications

Dynamic Quality Metric Oriented Error-bounded Lossy Compression for Scientific Datasets

FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Computing Applications on GPUs

Optimizing Huffman Decoding for Error-Bounded Lossy Compression on GPUs

NeurLZ: On Enhancing Lossy Compression Performance based on Error-Controlled Neural Learning for Scientific Data

LCP: Enhancing Scientific Data Management with Lossy Compression for Particles

Spatiotemporally adaptive compression for scientific dataset with feature preservation -- a case study on simulation data with extreme climate events analysis

A Survey on Error-Bounded Lossy Compression for Scientific Datasets

High-Ratio Lossy Compression: Exploring the Autoencoder to Compress Scientific Data

SDC Resilient Error-bounded Lossy Compressor

A High Speed Lossless Compression Algorithm Based on CPU and GPU Hybrid Platform

ZFP-X: Efficient Embedded Coding for Accelerating Lossy Floating Point Compression.

In-Depth Exploration of Single-Snapshot Lossy Compression Techniques for N-Body Simulations