Combining mass spectrometry and machine learning to discover bioactive peptides
Christian T. Madsen,Jan C. Refsgaard,Felix G. Teufel,Sonny K. Kjærulff,Zhe Wang,Guangjun Meng,Carsten Jessen,Petteri Heljo,Qunfeng Jiang,Xin Zhao,Bo Wu,Xueping Zhou,Yang Tang,Jacob F. Jeppesen,Christian D. Kelstrup,Stephen T. Buckley,Søren Tullin,Jan Nygaard-Jensen,Xiaoli Chen,Fang Zhang,Jesper V. Olsen,Dan Han,Mads Grønborg,Ulrik de Lichtenberg
DOI: https://doi.org/10.1038/s41467-022-34031-z
IF: 16.6
2022-10-20
Nature Communications
Abstract:Abstract Peptides play important roles in regulating biological processes and form the basis of a multiplicity of therapeutic drugs. To date, only about 300 peptides in human have confirmed bioactivity, although tens of thousands have been reported in the literature. The majority of these are inactive degradation products of endogenous proteins and peptides, presenting a needle-in-a-haystack problem of identifying the most promising candidate peptides from large-scale peptidomics experiments to test for bioactivity. To address this challenge, we conducted a comprehensive analysis of the mammalian peptidome across seven tissues in four different mouse strains and used the data to train a machine learning model that predicts hundreds of peptide candidates based on patterns in the mass spectrometry data. We provide in silico validation examples and experimental confirmation of bioactivity for two peptides, demonstrating the utility of this resource for discovering lead peptides for further characterization and therapeutic development.
multidisciplinary sciences