Prediction and Analysis on Secreted Proteins of Echinococcus multilocularis by Genome-Wide Bioinformatics Approaches

Zhang Ting, Chen Ying, Jia Lifang, Shen Haimo, Hu Wei, Liu Jie, Ao Wuliji
DOI: https://doi.org/10.3760/cma.j.issn.1673-4122.2014.06.003
2014-01-01
Abstract: Objective: To predict the secretome of Echinococcus multilocularis and to analyze its secreted proteins and signal peptides using genome-wide bioinformatics approaches, thereby providing a platform for finding biomarkers for the development of diagnostics and drugs. Methods: Signal peptides of E. multilocularis were identified from the whole genome sequence using the SignalP 4.1 program, and the proteins containing signal sequences were then analyzed stepwise with TMHMM v2.0, Phobius, big-PI Predictor and TargetP 1.1 to minimize false-positive predictions. Subsequently, the sequence features of both the signal peptides and the secreted proteins were analyzed statistically with SPSS 19.0 and Excel. Differences in the number of amino acids between the secreted and the non-secreted sequences were assessed with the Kolmogorov-Smirnov test (K-S test) and the t test. Finally, functional annotation and clustering of the secretome sequences were performed with KAAS (KEGG Automatic Annotation Server). Results: A total of 875 protein-coding sequences containing signal peptides were found among 10 780 E. multilocularis genome sequences. Of these, 307 sequences were membrane-bound proteins, 38 proteins contained a GPI anchor site and 12 proteins were located in mitochondria. Finally, the remaining 518 proteins were recognized as secreted proteins. The signal sequences mostly contained 11-53 amino acids, 61% of which were hydrophobic. The identified secreted proteins contained 38-7 809 amino acids, significantly fewer than the non-secreted proteins (11-11 194 amino acids) (t=0.203, P<0.01). KAAS analysis showed that these secreted proteins were mainly involved in human diseases, metabolism, environmental information processing, organismal systems, cellular processes and genetic information processing. Among them, 6 sequences were related to parasitic infection. Conclusion: The secretome of E. multilocularis, comprising 518 secreted protein sequences, was predicted and analyzed, providing a database for the further identification of diagnostic, vaccine and drug targets.
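The stepwise exclusion described in the Methods can be illustrated with a minimal Python sketch. It is not the authors' pipeline and does not call or parse output from SignalP, TMHMM, Phobius, big-PI Predictor or TargetP. The `Prediction` record, its boolean fields, the `stepwise_filter` function and the `make_toy_dataset` helper are all hypothetical names introduced for illustration, and the dataset is synthetic: it only mirrors the counts reported in the Results, so that the arithmetic 875 - 307 - 38 - 12 = 518 can be checked.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Prediction:
    """Hypothetical per-protein summary of tool calls (not a real tool format)."""
    protein_id: str
    has_signal_peptide: bool = False  # assumed to come from a SignalP-like step
    is_membrane_bound: bool = False   # assumed TMHMM/Phobius-like step
    has_gpi_anchor: bool = False      # assumed big-PI-like step
    is_mitochondrial: bool = False    # assumed TargetP-like step


def stepwise_filter(predictions):
    """Keep signal-peptide proteins, then exclude categories one step at a time.

    Returns the remaining (putatively secreted) records and a dict holding the
    number of records with a signal peptide, the number removed at each
    exclusion step, and the number left at the end.
    """
    remaining = [p for p in predictions if p.has_signal_peptide]
    counts = {"signal_peptide": len(remaining)}
    exclusion_steps = [
        ("membrane_bound", lambda p: p.is_membrane_bound),
        ("gpi_anchor", lambda p: p.has_gpi_anchor),
        ("mitochondrial", lambda p: p.is_mitochondrial),
    ]
    for name, is_excluded in exclusion_steps:
        kept = [p for p in remaining if not is_excluded(p)]
        counts[name] = len(remaining) - len(kept)
        remaining = kept
    counts["secreted"] = len(remaining)
    return remaining, counts


def make_toy_dataset():
    """Synthetic records whose group sizes mirror the counts in the abstract."""
    groups = [
        (307, dict(has_signal_peptide=True, is_membrane_bound=True)),
        (38, dict(has_signal_peptide=True, has_gpi_anchor=True)),
        (12, dict(has_signal_peptide=True, is_mitochondrial=True)),
        (518, dict(has_signal_peptide=True)),
        (10780 - 875, dict()),  # sequences without a predicted signal peptide
    ]
    records = []
    for size, flags in groups:
        for _ in range(size):
            records.append(Prediction(protein_id=f"toy_{len(records):05d}", **flags))
    return records


secreted, counts = stepwise_filter(make_toy_dataset())
print(counts)
```

In this sketch each synthetic record falls into at most one exclusion category; with real predictions a protein could be flagged by several tools, in which case the order of the steps determines at which step it is counted as removed.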