Prediction of Long-range Contacts from Sequence Profile

Peng Chen,Bing Wang,Hau-san Wong,De-shuang Huang
DOI: https://doi.org/10.1109/IJCNN.2007.4371084
2007-01-01
Abstract:Theoretic study in this paper shows that we can obtain exact long-range contacts by adopting one classifier if the centers of sequence profiles of residue pairs for long-range contacts and non-long-range contacts are known. The adopted classifier, referred to as multiple conditional probability mass function classifier (MCPMFC), can find an optimized transformation of the variables for each of the classes and therefore resulting in K separate classifiers. As a result, about 44.48% long-range contacts are around at the sequence profile (SP) centre for long-range contacts and about 20.9% long-range contacts are correctly predicted when considering the top L/5 (L is the protein sequence length) predicted contacts and the residue pair with 24 apart. The highest cluster result gives us a clue that SP center should be a sound pathway to investigate contact map in protein structures.
What problem does this paper attempt to address?