Micalign: A Sequence-To-Structure Alignment Tool Integrating Multiple Sources of Information in Conditional Random Fields

Xuefeng Xia,Song Zhang,Yu Su,Zhirong Sun
DOI: https://doi.org/10.1093/bioinformatics/btp251
IF: 5.8
2009-01-01
Bioinformatics
Abstract:Sequence-to-structure alignment in template-based protein structure modeling for remote homologs remains a difficult problem even following the correct recognition of folds. Here we present MICAlign, a sequence-to-structure alignment tool that incorporates multiple sources of information from local structural contexts of template, sequence profiles, predicted secondary structures, solvent accessibilities, potential-like terms (including residue-residue contacts and solvent exposures) and pre-aligned structures and sequences. These features, together with a position-specific gap scheme, were integrated into conditional random fields through which the optimal parameters were automatically learned. MICAlign showed improved alignment accuracy over several other state-of-the-art alignment tools based on comparisons by using independent datasets.
What problem does this paper attempt to address?