Deep-learning language models help to improve protein sequence alignment

DOI: https://doi.org/10.1038/s41592-022-01707-9
IF: 48
2022-12-16
Nature Methods
Abstract:We trained DEDAL, an algorithm based on deep-learning language models, to generate pairwise alignments of protein sequences taking into account the sequence-specific context of amino acid substitutions or gaps. DEDAL improved the alignment correctness on remote homologs by up to threefold and the discrimination of remote homologs from evolutionarily unrelated sequences.
biochemical research methods
What problem does this paper attempt to address?