Unambiguous discrimination of all 20 proteinogenic amino acids and their modifications by nanopore

Kefan Wang,Shanyu Zhang,Xiao Zhou,Xian Yang,Xinyue Li,Yuqin Wang,Pingping Fan,Yunqi Xiao,Wen Sun,Panke Zhang,Wenfei Li,Shuo Huang
DOI: https://doi.org/10.1038/s41592-023-02021-8
Abstract:Natural proteins are composed of 20 proteinogenic amino acids and their post-translational modifications (PTMs). However, due to the lack of a suitable nanopore sensor that can simultaneously discriminate between all 20 amino acids and their PTMs, direct sequencing of protein with nanopores has not yet been realized. Here, we present an engineered hetero-octameric Mycobacterium smegmatis porin A (MspA) nanopore containing a sole Ni2+ modification. It enables full discrimination of all 20 proteinogenic amino acids and 4 representative modified amino acids, Nω,N'ω-dimethyl-arginine (Me-R), O-acetyl-threonine (Ac-T), N4-(β-N-acetyl-D-glucosaminyl)-asparagine (GlcNAc-N) and O-phosphoserine (P-S). Assisted by machine learning, an accuracy of 98.6% was achieved. Amino acid supplement tablets and peptidase-digested amino acids from peptides were also analyzed using this strategy. This capacity for simultaneous discrimination of all 20 proteinogenic amino acids and their PTMs suggests the potential to achieve protein sequencing using this nanopore-based strategy.
What problem does this paper attempt to address?