De novo sequencing of proteins by mass spectrometry

Rui Vitorino, Sofia Guedes, Fabio Trindade, Inês Correia, Gabriela Moura, Paulo Carvalho, Manuel A. S. Santos, Francisco Amado
DOI: https://doi.org/10.1080/14789450.2020.1831387
2020-08-02
Expert Review of Proteomics
Abstract:<span>Proteins are crucial for every cellular activity, and unraveling their sequence and structure is an essential step toward fully understanding their biology. Early methods of protein sequencing were mainly based on enzymatic or chemical degradation of peptide chains. With the completion of the human genome project and the expansion of the information available for each protein, various databases containing this sequence information were formed. <i>De novo</i> protein sequencing, shotgun proteomics and other mass-spectrometric techniques, along with various software tools, are currently available for proteogenomic analysis. Emphasis is placed on the methods for <i>de novo</i> sequencing, together with the potential and shortcomings of using databases for interpretation of protein sequence data. As mass-spectrometry sequencing performance improves with better software and hardware optimizations, combined with user-friendly interfaces, <i>de novo</i> protein sequencing becomes imperative in shotgun proteomic studies. Issues regarding unknown or mutated peptide sequences, as well as unexpected post-translational modifications (PTMs) and their identification through false discovery rate searches using the target/decoy strategy, need to be addressed. Ideally, <i>de novo</i> sequencing should become integrated in standard proteomic workflows as an add-on to conventional database search engines, which would then be able to provide improved identification.</span>
biochemical research methods