Tiansheng Zhu,Yi Judy Zhu,Yue Xuan,Huanhuan Gao,Xue Cai,Sander R Piersma,Thang V Pham,Tim Schelfhorst,Richard R de Haas,Irene V Bijnsdorp,Rui Sun,Liang Yue,Guan Ruan,Qiushi Zhang,Mo Hu,Yue Zhou,Winan J Van Houdt,Tessa YS Le Large,Jacqueline Cloos,Anna Wojtuszkiewicz,Danijela Koppers-Lalic,Franziska Bottger,Chantal Scheepbouwer,RH Brakenhoff,Geert JLH van Leenders,Jan NM Ijzermans,John WM Martens,Renske DM Steenbergen,Nicole C Grieken,Sathiyamoorthy Selvarajan,Sangeeta Mantoo,Sze Sing Lee,Serene Jie Yi Yeow,Syed Muhammad Fahmy Alkaff,Xiang Nan,Yaoting Sun,Xiao Yi,Shaozheng Dai,Wei Liu,Tian Lu,Zhicheng Wu,Xiao Liang,Man Wang,Yingkuan Shao,Xi Zheng,Kailun Xu,Qin Yang,Yifan Meng,Cong Lu,Jiang Zhu,Jin e Zheng,Bo Wang,Sai Lou,Yibei Dai,Chao Xu,Chenhuan Yu,Huazhong Ying,Tony Kiat-hon Lim,Jianmin Wu,Xiaofei Gao,Zhongzhi Luan,Xiaodong Teng,Peng Wu,Shiang Huang,Zhihua Tao,N Gopalakrishna Iyer,Shuigeng Zhou,Wenguang Shao,Henry Lam,Ding Ma,Jiafu Ji,Oi Lian Kon,Shu Zheng,Ruedi Aebersold,Connie R Jimenez,Tiannan Guo

Abstract:To answer the increasing need for detecting and validating protein biomarkers in clinical specimens, proteomic techniques are required that support the fast, reproducible and quantitative analysis of large clinical sample cohorts. Targeted mass spectrometry techniques, specifically SRM, PRM and the massively parallel SWATH/DIA technique have emerged as a powerful method for biomarker research. For optimal performance, they require prior knowledge about the fragment ion spectra of targeted peptides. In this report, we describe a mass spectrometric (MS) pipeline and spectral resource to support data-independent acquisition (DIA) and parallel reaction monitoring (PRM) based biomarker studies. To build the spectral resource we integrated common open-source MS computational tools to assemble an open source computational workflow based on Docker. It was then applied to generate a comprehensive DIA pan-human library (DPHL) from 1,096 data dependent acquisition (DDA) MS raw files, and it comprises 242,476 unique peptide sequences from 14,782 protein groups and 10,943 SwissProt-annotated proteins expressed in 16 types of cancer samples. In particular, tissue specimens from patients with prostate cancer, cervical cancer, colorectal cancer, hepatocellular carcinoma, gastric cancer, lung adenocarcinoma, squamous cell lung carcinoma, diseased thyroid, glioblastoma multiforme, sarcoma and diffuse large B-cell lymphoma (DLBCL), as well as plasma samples from a range of hematologic malignancies were collected from multiple clinics in China, the Netherlands and Singapore and included in the resource. This extensive …

STAVER: a standardized benchmark dataset-based algorithm for effective variation reduction in large-scale DIA-MS data

ProteinInferencer: Confident protein identification and multiple experiment comparison for large scale proteomics projects

A Novel Spectral Library Workflow to Enhance Protein Identifications

A New Evaluation Metric for Quantitative Accuracy of LC-MS/MS-Based Proteomics with Data-Independent Acquisition

Benchmarking of analysis strategies for data-independent acquisition proteomics using a large-scale dataset comprising inter-patient heterogeneity

Data-Driven Tool for Cross-Run Ion Selection and Peak-Picking in Quantitative Proteomics with Data-Independent Acquisition LC-MS/MS

Binomial probability distribution model-based protein identification algorithm for tandem mass spectrometry utilizing peak intensity information.

An automated data analysis pipeline for GC-TOF-MS metabonomics studies.

MetaLab Platform Enables Comprehensive DDA and DIA Metaproteomics Analysis

MetaboAnalystR 3.0: Toward an Optimized Workflow for Global Metabolomics

SDA: a semi-parametric differential abundance analysis method for metabolomics and proteomics data

Statistical batch-aware embedded integration, dimension reduction and alignment for spatial transcriptomics

metaExpertPro: a computational workflow for metaproteomics spectral library construction and data-independent acquisition mass spectrometry data analysis

A Proteomics Pipeline for Generating Clinical Grade Biomarker Candidates from Data‐Independent Acquisition Mass Spectrometry (DIA‐MS) Discovery

AlphaDIA enables End-to-End Transfer Learning for Feature-Free Proteomics

MetaboAnalystR 4.0: a unified LC-MS workflow for global metabolomics

Rapid Development of Improved Data-Dependent Acquisition Strategies

DPHL: A pan human protein mass spectrometry library for robust biomarker discovery using Data Independent Acquisition and Parallel Reaction Monitoring

Paradigm shift in biomarker translation: a pipeline to generate clinical grade biomarker candidates from DIA-MS discovery

SILVER: an Efficient Tool for Stable Isotope Labeling LC-MS Data Quantitative Analysis with Quality Control Methods

Comprehensive Evaluation and Optimization of the Data-Dependent LC-MS/MS Workflow for Deep Proteome Profiling