STAVER: a standardized benchmark dataset-based algorithm for effective variation reduction in large-scale DIA-MS data

Peng Ran,Yunzhi Wang,Kai Li,Shiman He,Subei Tan,Jiacheng Lv,Jiajun Zhu,Shaoshuai Tang,Jinwen Feng,Zhaoyu Qin,Yan Li,Lin Huang,Yanan Yin,Lingli Zhu,Wenjun Yang,Chen Ding
DOI: https://doi.org/10.1093/bib/bbae553
IF: 9.5
2024-11-08
Briefings in Bioinformatics
Abstract:Mass spectrometry (MS)-based proteomics has become instrumental in comprehensively investigating complex biological systems. Data-independent acquisition (DIA)-MS, utilizing hybrid spectral library search strategies, allows for the simultaneous quantification of thousands of proteins, showing promise in enhancing protein identification and quantification precision. However, low-quality profiles can considerably undermine quantitative precision, resulting in inaccurate protein quantification. To tackle this challenge, we introduced STAVER, a novel algorithm that leverages standardized benchmark datasets to reduce non-biological variation in large-scale DIA-MS analyses. By eliminating unwanted noise in MS signals, STAVER significantly improved protein quantification precision, especially in hybrid spectral library searches. Moreover, we validated STAVER's robustness and applicability across multiple large-scale DIA datasets, demonstrating significantly enhanced precision and reproducibility of protein quantification. STAVER offers an innovative and effective approach for enhancing the quality of large-scale DIA proteomic data, facilitating cross-platform and cross-laboratory comparative analyses. This advancement significantly enhances the consistency and reliability of findings in clinical research. The complete package is available at https://github.com/Ran485/STAVER
biochemical research methods,mathematical & computational biology
What problem does this paper attempt to address?