Bioinformatic Analyses on Sequences of the Complete Mitochondrial Genomes of Taenia Species

JIA Wan-zhong,YAN Hong-bin,SHI Wan-gui,WANG Yu-chao,GUO Ai-jiang,ZHAN Fang,FU Bao-quan,CAI Xue-peng
DOI: https://doi.org/10.16303/j.cnki.1005-4545.2010.11.003
2010-01-01
Abstract:The current study is to analyze base composition,gene organization,codon usage and RNAs' structures of mitochondrial genomes of Taenia species,which provide a fundamental foundation for comparative mt genomics,systematic studies,molecular taxonomy and discrimination.Alignments were done by using the ClustalX(1.83)program.Percentage pairwise divergences of nucleotide and amino acid sequences were calculated using DNAStar software.The phylogenetic analyses were performed in MEGA4 program by the Neighbor-joining method with bootstrap and using Echinococcus granulosus as an outgroup.The evolutionary distances were computed using the Poisson correction method for amino acids and the Maximum Composite Likelihood method for nucleotides,respectively.The putative stem-loop structures of tRNAs and non-coding mitochondrial regions(LNR and SNR) were inferred using the ARWEN program and the RNAstructure program(Version 4.6),respectively.The mt genomes of Taenia species were from 13.4 kb to 13.7 kb long,coding 12 proteins,two ribosomal RNAs(rRNAs,a small and a large subunit) and 22 transfer RNAs(tRNAs),which were arranged in the same order.Overlapping regions are found among genes,such as between nad4L and nad4.The GTG initiation codon is used for some genes,for example,nad6 of T.multiceps.The size of the 12 protein-coding genes of the two Taenia tapeworms was conserved.The tRNA genes were 57-74 bp long,and although 18 predicted secondary structures have a typical clover-leaf shape with a paired dihydrouridine(DHU) arm,4 tRNAs have a D-loop structure without the DHU arm.The genes for the two mitochondrial rRNA subunit genes rrnL and rrnS in the two Taenia tapeworms are separated by trnC gene.The non-coding regions of the mt genomes of Taenia spp.consist of two regions: a short non-coding region(SNR) and a long non-coding region(LNR),respectively.The mt DNAs of Taenia spp.have a high level of T(over 45%) and a low level of C(below 9%).The overall nucleic sequence differences in the protein-coding genes of mt genomes among Taenia spp.were from 5.7% to 28.9%.Among the protein-coding genes,cox1 and nad4L genes were relatively conserved,while nad5 and nad6 varied much.Phylogenetic analysis of Taenia species showed that T.multiceps is more closely related to T.asiatica and T.saginata than T.solium using either the complete mitochondrial DNA or protein-coding or tRNAs' sequences,suggesting that T.asiatica,T.saginata and T.multiceps should be sister species.
What problem does this paper attempt to address?