Deep learning boosts sensitivity of mass spectrometry-based immunopeptidomics

Mathias Wilhelm,Daniel P Zolg,Michael Graber,Siegfried Gessulat,Tobias Schmidt,Karsten Schnatbaum,Celina Schwencke-Westphal,Philipp Seifert,Niklas de Andrade Krätzig,Johannes Zerweck,Tobias Knaute,Eva Bräunlein,Patroklos Samaras,Ludwig Lautenbacher,Susan Klaeger,Holger Wenschuh,Roland Rad,Bernard Delanghe,Andreas Huhmer,Steven A Carr,Karl R Clauser,Angela M Krackhardt,Ulf Reimer,Bernhard Kuster
DOI: https://doi.org/10.1038/s41467-021-23713-9
2021-06-07
Abstract:Characterizing the human leukocyte antigen (HLA) bound ligandome by mass spectrometry (MS) holds great promise for developing vaccines and drugs for immune-oncology. Still, the identification of non-tryptic peptides presents substantial computational challenges. To address these, we synthesized and analyzed >300,000 peptides by multi-modal LC-MS/MS within the ProteomeTools project representing HLA class I & II ligands and products of the proteases AspN and LysN. The resulting data enabled training of a single model using the deep learning framework Prosit, allowing the accurate prediction of fragment ion spectra for tryptic and non-tryptic peptides. Applying Prosit demonstrates that the identification of HLA peptides can be improved up to 7-fold, that 87% of the proposed proteasomally spliced HLA peptides may be incorrect and that dozens of additional immunogenic neo-epitopes can be identified from patient tumors in published data. Together, the provided peptides, spectra and computational tools substantially expand the analytical depth of immunopeptidomics workflows.
What problem does this paper attempt to address?