Abstract:In the realm of quantitative proteomics, data-independent acquisition (DIA) has emerged as a promising approach, offering enhanced reproducibility and quantitative accuracy compared to traditional data-dependent acquisition (DDA) methods. However, the analysis of DIA data is currently hindered by its reliance on project-specific spectral libraries derived from DDA analyses, which not only limits proteome coverage but also proves to be a time-intensive process. To overcome these challenges, we propose ProPept-MT, a novel deep learning-based multi-task prediction model designed to accurately forecast key features such as retention time (RT), ion intensity, and ion mobility (IM). Leveraging advanced techniques such as multi-head attention and BiLSTM for feature extraction, coupled with Nash-MTL for gradient coordination, ProPept-MT demonstrates superior prediction performance. Integrating ion mobility alongside RT, mass-to-charge ratio (m/z), and ion intensity forms 4D proteomics. Then, we outline a comprehensive workflow tailored for 4D DIA proteomics research, integrating the use of 4D in silico libraries predicted by ProPept-MT. Evaluation on a benchmark dataset showcases ProPept-MT’s exceptional predictive capabilities, with impressive results including a 99.9% Pearson correlation coefficient (PCC) for RT prediction, a median dot product (DP) of 96.0% for fragment ion intensity prediction, and a 99.3% PCC for IM prediction on the test set. Notably, ProPept-MT manifests efficacy in predicting both unmodified and phosphorylated peptides, underscoring its potential as a valuable tool for constructing high-quality 4D DIA in silico libraries.

DreamAI: algorithm for the imputation of proteomics data

ProteinInferencer: Confident protein identification and multiple experiment comparison for large scale proteomics projects

Imputation of label-free quantitative mass spectrometry-based proteomics data using self-supervised deep learning

Augmented Doubly Robust Post-Imputation Inference for Proteomic data

AIomics: exploring more of the proteome using mass spectral libraries extended by AI

Deep Learning in Proteomics

ProPept-MT: A Multi-Task Learning Model for Peptide Feature Prediction

AI-Assisted Processing Pipeline to Boost Protein Isoform Detection

Imputation of cancer proteomics data with a deep model that learns from many datasets

DeepIso: A Deep Learning Model for Peptide Feature Detection

A fully automated system with online sample loading, isotope dimethyl labeling and multidimensional separation for high-throughput quantitative proteome analysis.

Deep learning the collisional cross sections of the peptide universe from a million experimental values

High-Coverage Four-Dimensional Data-Independent Acquisition Proteomics and Phosphoproteomics Enabled by Deep Learning-Driven Multi-Dimensional Prediction

Abstract P326: an Innovative Peptide Spectral Library Search Engine for Cardiovascular Proteomics

DeepRescore: Leveraging Deep Learning to Improve Peptide Identification in Immunopeptidomics

Missing Values in Longitudinal Proteome Dynamics Studies: Making a Case for Data Multiple Imputation

Test-Time Training for Deep MS/MS Spectrum Prediction Improves Peptide Identification.

The Effects of Nonignorable Missing Data on Label-Free Mass Spectrometry Proteomics Experiments.

APIR: Aggregating Universal Proteomics Database Search Algorithms for Peptide Identification with FDR Control

AlphaDIA enables End-to-End Transfer Learning for Feature-Free Proteomics