AN INFORMATION ENTROPY ANALYSIS OF CONSERVATIVE SITESOF E.coli、 YEAST AND Drosophila GENES

吕军,李宏,马克健
DOI: https://doi.org/10.3321/j.issn:1000-6737.2002.01.014
2002-01-01
ACTA BIOPHYSICA SINICA
Abstract:The formulation of the single base information redundancy D1(l)and the adjacent base related information redundancy D2(l)are revised. For the sequences of upstream and downstream the start codon and the terminal codons of E.coli, yeast and Drosophila genes, the D1(l) and D2(l) for each site l (l=-30, -29, …, +32, +33) are calculated. The results shown that D2(l) have more information than D1(l). In site -3 of coding start sequences, D1(-3) and D2(-3) have a distinct peak value for yeast and Drosophila. In the SD region of E.coli gene sequences, D1(l) and D2(l)have obvious peak value distribution, which is consistent with the others' results. D2(l) in site +4 of coding start sequences in yeast also have a peak value, whose related mode is TC (the combined probability is 0.211). Therefore, the revised information redundancies applied in this thesis are feasible to confirm the conservative sites in DNA sequence.
What problem does this paper attempt to address?