A Compression Algorithm for English-Text Based on Dynamic Dictionary

JIANG Li,SUN Jian-ling,WANG Xin-yu,YANG Chang-sheng
DOI: https://doi.org/10.3969/j.issn.1671-7147.2007.04.014
2007-01-01
Abstract:The classical text compression algorithm LZ77 and LZ78,as well as later improved LZW,are all based on single-character mode while collecting the tokens.This method will cause the algorithm to slowly learns.the correlations between words,and to directly influence the compress rate.Based on the LZW algorithm,the paper brings in an idea called "word-based" to solve the problem,also called as "Word-based LZW".Tested by an experiment,Word-based LZW algorithm is confirmed to have steady and better compression efficiency.The algorithm can be easily extended to compression for text of other languages as well.
What problem does this paper attempt to address?