PCA for Prediction of Disulfide Connectivity Patterns

YI Dong-liang,ZHU Lin,YANG Jie,SHEN Hong-bin
DOI: https://doi.org/10.3969/j.issn.1006-9348.2010.02.052
2010-01-01
Abstract:Disulfide bonds are primary covalent crosslinks between two cysteine residues in proteins,it can occur in a protein peptide bond or between different protein peptide bonds.To many proteins,disulfide connectivity is the character of the final folding protein structure.It's an important step for the folding of proteins and it also affects the folding rate and way.Therefore,there is a great need to develop computational methods capable of accurately predicting disulfide connectivity patterns in proteins that have potentially important applications.A novel method is used to predict disulfide connectivity patterns from protein primary sequences,and a support vector regression (SVR) approach is used based on multiple sequence feature vectors to predict secondary structure information by the PSIPRED program.Since above method generates too many higher dimensional data,PCA is used to reduce feature dimensions.Compared to the former method without feature reduction,the accuracy is improved by using the method.
What problem does this paper attempt to address?