Dear-DIAXMBD: Deep Autoencoder Enables Deconvolution of Data-Independent Acquisition Proteomics

Qingzu He,Chuan-Qi Zhong,Xiang Li,Huan Guo,Yiming Li,Mingxuan Gao,Rongshan Yu,Xianming Liu,Fangfei Zhang,Donghui Guo,Fangfu Ye,Tiannan Guo,Jianwei Shuai,Jiahuai Han
DOI: https://doi.org/10.34133/research.0179
2023-06-26
Abstract:Data-independent acquisition (DIA) technology for protein identification from mass spectrometry and related algorithms is developing rapidly. The spectrum-centric analysis of DIA data without the use of spectra library from data-dependent acquisition data represents a promising direction. In this paper, we proposed an untargeted analysis method, Dear-DIAXMBD, for direct analysis of DIA data. Dear-DIAXMBD first integrates the deep variational autoencoder and triplet loss to learn the representations of the extracted fragment ion chromatograms, then uses the k-means clustering algorithm to aggregate fragments with similar representations into the same classes, and finally establishes the inverted index tables to determine the precursors of fragment clusters between precursors and peptides and between fragments and peptides. We show that Dear-DIAXMBD performs superiorly with the highly complicated DIA data of different species obtained by different instrument platforms. Dear-DIAXMBD is publicly available at https://github.com/jianweishuai/Dear-DIA-XMBD.
What problem does this paper attempt to address?