Investigate the additivity of basic compression algorithms

Weiling Chang, Binxing Fang, Xiaochun Yun, Shupeng Wang, Xiangzhan Yu
2010-01-01
Abstract: A data stream contains two kinds of redundancy: statistical redundancy and non-statistical redundancy. Non-statistical redundancy includes redundancy derived from syntax, semantics and pragmatics. Order-1 statistics-based compressors remove statistical redundancy, while higher-order statistics-based and dictionary-based compression algorithms exploit both statistical and non-statistical redundancy. The additivity of lossless data compression algorithms is defined as the property that data compressed by one algorithm can be further compressed by another compression algorithm over some set S. We found that, for application data streams, dictionary-based and statistics-based compression algorithms, specifically LZ methods and Huffman coding, have good additivity. Their additivity can be attributed to inherent characteristics of the data stream. We also found that arithmetic coding has little additivity with other compression methods. ICIC International © 2010 ISSN 1881-803X.
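The notion of additivity described in the abstract can be illustrated with a small experiment: run a dictionary-based coder first, then check whether a statistics-based coder still shrinks its output. The sketch below is an assumption-laden illustration, not the authors' method. It uses a textbook LZW coder as a stand-in for "LZ methods", counts only the payload bits of an order-0 Huffman code (the code table is ignored), and the function names `lzw_codes` and `huffman_bits` are hypothetical.

```python
import heapq
from collections import Counter


def huffman_bits(symbols):
    """Payload size in bits of an order-0 Huffman code for `symbols`.

    The cost of storing the code table is ignored (an assumption made
    to keep the sketch short).
    """
    freq = Counter(symbols)
    if not freq:
        return 0
    if len(freq) == 1:
        # A single distinct symbol still needs one bit per occurrence.
        return len(symbols)
    heap = list(freq.values())
    heapq.heapify(heap)
    total = 0
    # Each merge adds one bit to every symbol under the merged node,
    # so the sum of merged weights equals the total coded length.
    while len(heap) > 1:
        a = heapq.heappop(heap)
        b = heapq.heappop(heap)
        total += a + b
        heapq.heappush(heap, a + b)
    return total


def lzw_codes(data: bytes):
    """Textbook LZW: return the list of integer dictionary codes."""
    table = {bytes([i]): i for i in range(256)}
    codes = []
    current = b""
    for byte in data:
        candidate = current + bytes([byte])
        if candidate in table:
            current = candidate
        else:
            codes.append(table[current])
            table[candidate] = len(table)  # grow the dictionary
            current = bytes([byte])
    if current:
        codes.append(table[current])
    return codes


if __name__ == "__main__":
    sample = b"GET /index.html HTTP/1.1\r\nHost: example.org\r\n\r\n" * 50
    codes = lzw_codes(sample)
    width = max(codes).bit_length()  # fixed-width cost per LZW code
    print("raw bits:            ", 8 * len(sample))
    print("Huffman only:        ", huffman_bits(sample))
    print("LZW, fixed width:    ", len(codes) * width)
    print("LZW then Huffman:    ", huffman_bits(codes))
```

If the last figure is noticeably smaller than the fixed-width LZW figure, the second stage has removed redundancy the first stage left behind, which is the behaviour the abstract calls good additivity. The sample input here is made up for illustration; results on real application data streams may differ.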