Multifractal characterisation of length sequences of coding and noncoding segments in a complete genome

Zu-Guo Yu, Vo Anh, Ka-Sing Lau
DOI: https://doi.org/10.1016/S0378-4371%2801%2900391-0
2001-08-28
Abstract: The coding and noncoding length sequences constructed from a complete genome are characterised by multifractal analysis. The dimension spectrum $D_{q}$ and its derivative, the 'analogous' specific heat $C_{q}$, are calculated for the coding and noncoding length sequences of bacteria, where $q$ is the moment order of the partition sum of the sequences. From the shape of the $D_{q}$ and $C_{q}$ curves, it is seen that there exists a clear difference between the coding/noncoding length sequences of all organisms considered and a completely random sequence. The complexity of noncoding length sequences is higher than that of coding length sequences for bacteria. Almost all $D_{q}$ curves for coding length sequences are flat, so their multifractality is small, whereas almost all $D_{q}$ curves for noncoding length sequences are multifractal-like. We propose to characterise the bacteria according to the types of the $C_{q}$ curves of their noncoding length sequences.
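The abstract names the quantities $D_{q}$ and $C_{q}$ but gives no formulas. The sketch below shows one common way such curves could be estimated from a length sequence. It rests on assumptions not stated in the abstract: the textbook partition sum $Z_{\epsilon}(q) = \sum_{B} \mu(B)^{q}$ over boxes of size $\epsilon$; $\tau(q)$ taken as the fitted slope of $\ln Z_{\epsilon}(q)$ against $\ln \epsilon$; $D_{q} = \tau(q)/(q-1)$ for $q \neq 1$; and $C_{q} \approx 2\tau(q) - \tau(q+1) - \tau(q-1)$ as a discrete stand-in for $-\partial^{2}\tau/\partial q^{2}$. The function names, the box counts and the toy input are hypothetical and may differ from the authors' procedure.

```python
import math

def box_masses(lengths, n_boxes):
    """Normalise the lengths to a probability measure on [0, 1) and
    return the mass of each non-empty box of size 1 / n_boxes."""
    total = float(sum(lengths))
    n = len(lengths)
    masses = [0.0] * n_boxes
    for i, length in enumerate(lengths):
        masses[i * n_boxes // n] += length / total
    return [m for m in masses if m > 0.0]

def slope(xs, ys):
    """Least-squares slope of ys against xs."""
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    num = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    den = sum((x - mx) ** 2 for x in xs)
    return num / den

def tau(lengths, q, box_counts=(2, 4, 8, 16)):
    """Assumed definition: slope of ln Z_eps(q) against ln eps."""
    log_eps = [math.log(1.0 / b) for b in box_counts]
    log_z = [math.log(sum(m ** q for m in box_masses(lengths, b)))
             for b in box_counts]
    return slope(log_eps, log_z)

def d_q(lengths, q, box_counts=(2, 4, 8, 16)):
    """Generalised dimension; q = 1 uses the sum of mu * ln(mu)."""
    if q == 1:
        log_eps = [math.log(1.0 / b) for b in box_counts]
        z1 = [sum(m * math.log(m) for m in box_masses(lengths, b))
              for b in box_counts]
        return slope(log_eps, z1)
    return tau(lengths, q, box_counts) / (q - 1)

def c_q(lengths, q, box_counts=(2, 4, 8, 16)):
    """Assumed discrete form of the 'analogous' specific heat."""
    return (2.0 * tau(lengths, q, box_counts)
            - tau(lengths, q + 1, box_counts)
            - tau(lengths, q - 1, box_counts))

# Toy input (made-up lengths, not genome data): print D_q and C_q for a few q.
toy_lengths = [(7 * i) % 13 + 1 for i in range(64)]
for q in (-2, 0, 1, 2):
    print(q, d_q(toy_lengths, q), c_q(toy_lengths, q))
```

Under these assumed definitions, a sequence of equal lengths gives $D_{q} = 1$ and $C_{q} = 0$ for every $q$, which makes a convenient sanity check before applying the functions to real coding or noncoding length sequences.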
Biological Physics, Quantitative Biology