Study of LZ-word Distribution and Its Application for Sequence Comparison

Qi Dai,Zhaofang Yan,Zhuoxing Shi,Xiaoqing Liu,Yuhua Yao,Pingan He
DOI: https://doi.org/10.1016/j.jtbi.2013.07.008
IF: 2.405
2013-01-01
Journal of Theoretical Biology
Abstract:Lempel-Ziv complexity has been widely used for sequence comparison and achieved promising results, but until now components' distribution in exhaustive history has not been studied. This paper investigated the whole distribution of LZ-words and presented a novel statistical method for sequence comparison. With the components' length in mind, we revised Lempel-Ziv complexity and obtained various sets of LZ-words. Instead of calculating the LZ-words' contents, we defined a series of set operations on LZ-word set to compare biological sequences. In order to assess the effectiveness of the proposed method, we performed two sets of experiments and compared it with alignment-based methods.
What problem does this paper attempt to address?