Long-range Correlation Properties in Bacteria DNA Sequence

LU Xin,CHEN Huimin,LI Yanda
DOI: https://doi.org/10.3321/j.issn:1000-0054.1999.07.024
1999-01-01
Abstract:In order to explore the long range correlation properties of DNA sequences in a further step, some parametric methods are introduced to characterize the self similarity of DNA sequences. Compared with Fourier analysis, these methods perform statistically more stable and yield more reliable results. Using these methods, eight whole genomes of bacteria are analyzed. Long range correlation properties in the nucleotide density distribution along these DNA sequences are explored. Estimation results show that the long range correlation structure prevails through the entire molecule of DNA. Higher order statistics through coarse grain reveal that rather than multi fractal, there are only mono fractal phenomena presented in the sequences. Hence, the nucleotide density distribution can be modeled asymptotically as fractional Gaussian noise. This result brings about a new direction for analyzing and understanding the intrinsic structures of DNA sequences.
What problem does this paper attempt to address?