Long-range correlations of RNA polymerase II promoter sequences across organisms

P. Katsaloulis,T. Theoharis,W.M. Zheng,B.L. Hao,A. Bountis,Y. Almirantis,A. Provata
DOI: https://doi.org/10.1016/j.physa.2005.10.019
2006-01-01
Abstract:The statistical properties of the size distribution of DNA segments separating identical oligonucleotides are studied. For representative eukaryotes (Homo sapiens, Mus musculus, Saccharomyces cereviciae, Oryza sativa, Arabidopsis thaliana) we have demonstrated the existence of long-range correlations for the distances separating oligonucleotides of sizes 4, 5 and 6, which carry a promoter signature. This observation is independent of the consensus sequence used by the organism, as in the case of O. sativa (which mainly uses the CG promoter box) and A. thaliana (which mainly uses the TATA promoter box). If we use two parameters to characterise the size distribution separating oligonucleotides, we observe that oligonucleotides containing promoter signatures cluster together, away from the others.
What problem does this paper attempt to address?