Abstract:Data compression plays a critical role in operating systems and large-scale computing workloads. Its primary objective is to reduce network bandwidth consumption and memory/storage capacity utilization. Given the need to manipulate hash tables, and execute matching operations on extensive data volumes, data compression software has transformed into a resource-intensive CPU task. To tackle this challenge, numerous prior studies have introduced hardware acceleration methods. For example, they have utilized Content-Addressable Memory (CAM) for string matches, incorporated redundant historical copies for each matching component, and so on. While these methods amplify the compression throughput, they often compromise an essential aspect of compression performance: the compression ratio (C.R.). Moreover, hardware accelerators face significant resource costs, especially in memory, when dealing with new large sliding window algorithms. We introduce BeeZip, the first hardware acceleration system designed explicitly for compression with a large sliding window. BeeZip tackles the hardware-level challenge of optimizing both compression ratio and throughput. BeeZip offers architectural support for compression algorithms with the following distinctive attributes: 1) A two-stage compression algorithm adapted for accelerator parallelism, decoupling hash parallelism and match execution dependencies; 2) An organized hash hardware accelerator named BeeHash engine enhanced with dynamic scheduling, which orchestrates hash processes with structured parallelism; 3) A hardware accelerator named HiveMatch engine for the match process, which employs a new scalable parallelism approach and a heterogeneous scale processing unit design to reduce memory resource overhead. Experimental results show that on the Silesia dataset, BeeZip achieves an optimal throughput of 10.42GB/s (C.R. 2.96) and the best C.R. of 3.14 (throughput of 5.95GB/s). Under similar compression ratios, compared to single-threaded/36-threaded software implementations, BeeZip offers accelerator speedups of 23.2×/2.45×, respectively. Against all accelerators we know, BeeZip consistently demonstrates a superior compression ratio, improving by at least 9%.

BeeZip: Towards an Organized and Scalable Architecture for Data Compression

MetaZip: a high-throughput and efficient accelerator for DEFLATE

Hardware Implementation of Fast Huffman Coding Based on Different Sorting Methods

A High-Throughput Hardware Accelerator for Lempel-Ziv 4 Compression Algorithm

MetaZip

Data Compression and Storage under High Speed Network

A self-aware data compression system on FPGA in Hadoop

PIM-DH: Re RAM-based Processing-in-Memory Architecture for Deep Hashing Acceleration

Author Response for "refactoring BZIP2 on the New-Generation Sunway Supercomputer"

HybriDC: A Resource-Efficient CPU-FPGA Heterogeneous Acceleration System for Lossless Data Compression

A General SIMD-Based Approach to Accelerating Compression Algorithms.

BreadZip: a combination of network traffic data and bitmap index encoding algorithm

FPGA Acceleration of Zstd Compression Algorithm

Design and Implementation of A High-Performance Microprocessor Cache Compression Algorithm

C-Pack: A High-Performance Microprocessor Cache Compression Algorithm

Refine and Recycle: A Method to Increase Decompression Parallelism.

ZipCache: A DRAM/SSD Cache with Built-in Transparent Compression

Design and Optimization of Zstandard Algorithm Based on Concurrent Streaming of Multiple Hash Tables

FastqZip: An Improved Reference-Based Genome Sequence Lossy Compression Framework

Streaming Sorting Network Based BWT Acceleration on FPGA for Lossless Compression.

Efficient pipelined CABAC encoding architecture