LRFragLib: an effective algorithm to identify fragments for de novo protein structure prediction.

Tong Wang,Yuedong Yang,Yaoqi Zhou,Haipeng Gong
DOI: https://doi.org/10.1093/bioinformatics/btw668
IF: 5.8
2017-01-01
Bioinformatics
Abstract:Motivation: The quality of fragment library determines the efficiency of fragment assembly, an approach that is widely used in most de novo protein-structure prediction algorithms. Conventional fragment libraries are constructed mainly based on the identities of amino acids, sometimes facilitated by predicted information including dihedral angles and secondary structures. However, it remains challenging to identify near-native fragment structures with low sequence homology. Results: We introduce a novel fragment-library-construction algorithm, LRFragLib, to improve the detection of near-native low-homology fragments of 7-10 residues, using a multi-stage, flexible selection protocol. Based on logistic regression scoring models, LRFragLib outperforms existing techniques by achieving a significantly higher precision and a comparable coverage on recent CASP protein sets in sampling near-native structures. The method also has a comparable computational efficiency to the fastest existing techniques with substantially reduced memory usage.
What problem does this paper attempt to address?