The 6-D representation of RNA secondary structures and the analysis of similarity
BAI Feng-lan,YAO Yu-hua,SUN Li-Bo
2006-01-01
Abstract:According to the composition of RNA secondary structure,the RNA secondary structure is transformed into basic sequence by A′, U′, G′, C′ representing the A, U, G, C in the base-pairs of A-U, G-C and G-U, and we call it the characteristic sequence. On the basis of it, we define a function between the nucleotide sets and point sets in the 6-D space by the chemical structures of the bases of A, C, G, U. Then we get the 6-dimensional representation of RNA secondary structure in the 6-D space. Furthermore, we transform the representation into L/L matrix and characteristic vector P=(μ_x,μ_y,μ_z,μ_k,μ_l,μ_m), where μ_i means the average values of the corresponding sub-coordinate of the vector. In the end, the similarity of the RNA secondary structures of AIMV-3 and the other 8 kinds of viruses are analyzed and some better results are obtained making use of the matrix invariant: the leading eigenvalues of the L/L matrix and the distances between the characteristic vectors, which describe the invariance of the sequences or the structures.