Selective Run-Length Encoding

Xutan Peng, Yi Zhang, Dejia Peng, Jiafa Zhu
2023-12-28
Abstract: Run-Length Encoding (RLE) is one of the most fundamental tools in data compression. However, its compression power drops significantly when the sequence contains few runs of consecutive repeated elements. In extreme cases, the encoder's output may require more space than its input (a.k.a. size inflation). To alleviate this issue, we use combinatorics to quantify RLE's space savings for a given input distribution. With this insight, we develop the first algorithm that automatically identifies suitable symbols, then selectively encodes these symbols with RLE while storing the others directly, without RLE. Through experiments on real-world datasets of various modalities, we empirically validate that our method, which maintains RLE's efficiency advantage, can effectively mitigate the size inflation dilemma.
Data Structures and Algorithms, Information Theory, Image and Video Processing, Signal Processing
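To make the size inflation issue and the idea of selective encoding concrete, the following is a minimal Python sketch. It is not the authors' algorithm: the abstract does not spell out the selection criterion, so `choose_symbols` uses a hypothetical empirical rule (apply RLE to a symbol only if its symbol-count pairs would take fewer units than its raw copies), and all function names are illustrative assumptions. Sizes are counted in abstract "units" (one per stored symbol or count), and the cost of storing the selected set itself is ignored.

```python
from collections import Counter
from itertools import groupby


def choose_symbols(seq):
    """Hypothetical selection rule (an assumption, NOT the paper's criterion):
    select a symbol when 2 units per run costs less than 1 unit per occurrence."""
    occurrences = Counter(seq)
    runs = Counter(sym for sym, _ in groupby(seq))
    return {sym for sym in occurrences if 2 * runs[sym] < occurrences[sym]}


def selective_encode(seq, selected):
    """Flat output stream: `symbol, run_length` for selected symbols,
    one raw copy per occurrence for all other symbols."""
    out = []
    for sym, run in groupby(seq):
        n = len(list(run))
        if sym in selected:
            out += [sym, n]      # RLE pair: 2 units per run
        else:
            out += [sym] * n     # stored directly: 1 unit per occurrence
    return out


def selective_decode(stream, selected):
    """Invert selective_encode; the decoder must know the selected set."""
    out = []
    it = iter(stream)
    for sym in it:
        # A selected symbol is always followed by its run length.
        count = next(it) if sym in selected else 1
        out += [sym] * count
    return out


def rle_encode(seq):
    """Plain RLE is the special case in which every symbol is selected."""
    return selective_encode(seq, set(seq))


if __name__ == "__main__":
    data = list("aaaaaaaabcbcbcaaaa")          # 18 input units
    selected = choose_symbols(data)            # only "a" forms long runs
    print(len(rle_encode(data)))               # 16 units with plain RLE
    print(len(selective_encode(data, selected)))  # 10 units with selection
    print(len(rle_encode(list("abcabc"))))     # 12 units for 6 inputs: inflation
```

Under this simplified cost model, an input with no repeated neighbours doubles in size under plain RLE, whereas the selective variant falls back to storing every symbol directly and never exceeds the input length.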