Application of Sequence Embedding in Protein Sequence-Based Predictions

Nabil Ibtehaz,Daisuke Kihara
DOI: https://doi.org/10.48550/arXiv.2110.07609
2021-10-14
Quantitative Methods
Abstract:In sequence-based predictions, conventionally an input sequence is represented by a multiple sequence alignment (MSA) or a representation derived from MSA, such as a position-specific scoring matrix. Recently, inspired by the development in natural language processing, several applications of sequence embedding have been observed. Here, we review different approaches of protein sequence embeddings and their applications including protein contact prediction, secondary structure, prediction, and function prediction.
What problem does this paper attempt to address?