Characterization and mining of EST-SNP from homologous EST sequences of Prunus mume,P.armeniaca and P.persica

LI Xiao-ying,WANG Yu-zhu,SHANGGUAN Ling-fei,Ning Ning,FANG Jing-gui
2012-01-01
Abstract:In this research,4 660,15 388,82 583 were downloaded from the published EST database among Prunus mume,P.armeniaca and P.persica in GenBank,and 4 456,5 595 and 24 243 congtigs were respectively obtained after splicing from original EST sequences,by using CAP 3 software.592 homologous sequences were found with a total length of 235 576 bp,and the average length and homology were 437.67 bp and 97.50%,respectively.The Blast results also showed that 340 of them had the corresponding functional annotation,183 were unknown proteins,and the remaining 69 had new gene sequence information.The amount and frequency of nucleotides were further analyzed,where 8 818 SNPs were found having a total frequency of 26.71 bp per SNP,which mainly comprised transitions and transversions.The amount of SNP compared in pairs was significantly less than the number among the homologous sequences.In addition,the cluster analysis result by using the obtained SNP information showed that the relationship between P.mume and P.armeniaca was closer,and they were distantly related with P.persica,which would provide representative information for understanding the characteristics of genetic evolution,of the three comparative genomics and phylogenetic relationship among these three species.
What problem does this paper attempt to address?