THE CONSTRUCTION OF STRUCTURAL TEMPLATES FOR THE MODELING OF CONSERVED PROTEIN DOMAINS
FA ZHANG, JINGCHUN CHEN, ZHIYONG LIU, BO YUAN
DOI: https://doi.org/10.1142/9789812702098_0042
2005-01-01
Published in: Series in Mathematical Biology and Medicine, Advances in Bioinformatics and Its Applications, pp. 459-469 (2005)

Affiliations:
FA ZHANG: Departments of Biomedical Informatics and Pharmacology, and Program in Pharmacogenomics, The Ohio State University, Columbus OH 43210, USA; Institute of Computing Technology, The Chinese Academy of Sciences, Beijing 100080, China; Graduate School, The Chinese Academy of Sciences, Beijing 100080, China
JINGCHUN CHEN: Departments of Biomedical Informatics and Pharmacology, and Program in Pharmacogenomics, The Ohio State University, Columbus OH 43210, USA
ZHIYONG LIU: Institute of Computing Technology, The Chinese Academy of Sciences, Beijing 100080, China
BO YUAN: Departments of Biomedical Informatics and Pharmacology, and Program in Pharmacogenomics, The Ohio State University, Columbus OH 43210, USA

Abstract: Protein sequences and their structures can largely be described as combinations of conserved protein domains. Although only a small number of protein structures (<20,000) have been determined experimentally, more than half of the known protein domains are already represented in the structural database.
This provides a rich source of three-dimensional templates for modeling at least half of all conserved protein domains. Here, we searched the entire Protein Data Bank (PDB) for all entries of InterPro, a protein domain database. Similar protein domains with structural information were clustered, thereby partitioning the PDB. In each of the resulting domain clusters, a multiple structural alignment was constructed based only on the 3D positions of the residues of the domains involved. The overall goal of this study is to use these structural alignments as anchors to increase the accuracy of the alignment between a query and its 3D template, as required in homology-based structural modeling. We report 1) the construction of such a structural library for all the known protein domains; 2) the use of a structural alignment (instead of a sequence alignment) to select and map optimal templates; and 3) our validation using known structures as benchmarks to assess the modeling outcome. Our preliminary results show that this is a promising method for predicting the structures of a majority of known protein domains.

Keywords: protein domain; multiple structure alignment; comparative modeling; RMSD
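
The clustering and template-selection ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the input layout (pairs of structure identifier and domain identifier), and the use of a plain RMSD over coordinates that are assumed to be already superposed and residue-matched are all assumptions made for illustration. The only formula used is the standard definition RMSD = sqrt((1/N) * sum_i |a_i - b_i|^2).

```python
import math
from collections import defaultdict


def cluster_by_domain(hits):
    """Group structures by domain identifier.

    `hits` is a hypothetical list of (structure_id, domain_id) pairs,
    e.g. the result of matching domain signatures against structures.
    """
    clusters = defaultdict(list)
    for structure_id, domain_id in hits:
        clusters[domain_id].append(structure_id)
    return dict(clusters)


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two lists of (x, y, z) points.

    Assumes both lists are residue-matched and already superposed;
    no optimal rotation or translation is computed here.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


def pick_template(reference, candidates):
    """Return (name, rmsd) for the candidate closest to `reference`.

    `candidates` maps a hypothetical template name to its coordinates.
    """
    scored = ((name, rmsd(reference, coords)) for name, coords in candidates.items())
    return min(scored, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Toy data only; identifiers and coordinates are invented.
    hits = [("s1", "d1"), ("s2", "d1"), ("s3", "d2")]
    print(cluster_by_domain(hits))

    reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    candidates = {
        "a": [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)],
        "b": [(0.0, 0.0, 0.5), (1.0, 0.0, 0.5)],
    }
    print(pick_template(reference, candidates))
```

In practice the coordinates would first need an optimal superposition (for example with the Kabsch algorithm) before RMSD values are comparable; that step is left out here to keep the sketch short.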