A Full Window Data Independent Acquisition Method for Deeper Top-down Proteomics

Chen Sun, Wenjing Zhang, Mowei Zhou, Martin Myu, Wei Xu
DOI: https://doi.org/10.1101/2024.11.08.622616
2024-11-11
Abstract: Top-down proteomics (TDP) is emerging as a vital tool for the comprehensive characterization of proteoforms. However, its core technology, top-down mass spectrometry (TDMS), still faces significant analytical challenges. While data-independent acquisition (DIA) has revolutionized bottom-up proteomics and metabolomics, it is rarely employed in TDP. The unique features of protein ions in an electrospray mass spectrum, as well as the data complexity, require the development of new DIA strategies. This study introduces a machine-learning-assisted Full Window DIA (FW-DIA) method that eliminates precursor ion isolation, making it compatible with a wide range of commercial mass spectrometers. Moreover, FW-DIA leverages all precursor protein ions to generate high-quality tandem mass spectra, enhancing signal intensities by ~50-fold and protein sequence coverage by threefold in a modular protein analysis. The method was successfully applied to the analysis of a five-protein mixture under native conditions and to the characterization of Escherichia coli ribosomal proteoforms.
Bioinformatics
What problem does this paper attempt to address?
This paper addresses limitations of the traditional data-dependent acquisition (DDA) method in top-down proteomics (TDP). Specifically:

1. **Difficulty in detecting low-abundance ions**: Because DDA selects precursor ions stochastically, low-abundance ions are often missed.
2. **Influence of multiple charge states**: Protein ions usually occur in multiple charge states, but DDA uses only a small fraction of the ions generated by a given protein molecule, which weakens fragment-ion intensity.
3. **Information loss and poor repeatability**: DDA can lose information and shows poor repeatability in TDP, which affects the accuracy and comprehensiveness of proteome analysis.

To solve these problems, the researchers introduced a new data-independent acquisition (DIA) method, Full Window DIA (FW-DIA). The main features of FW-DIA include:

- **Eliminating precursor-ion isolation**: FW-DIA does not select or isolate precursor ions. Instead, it fragments all protein ions eluting from the liquid chromatography system, which makes it compatible with a wider range of commercial mass spectrometers.
- **Enhancing signal intensity and sequence coverage**: By making full use of all precursor protein ions, FW-DIA generates high-quality tandem mass spectra, increasing signal intensity by about 50-fold and protein sequence coverage by threefold.
- **Machine-learning-assisted data processing**: To establish precursor-fragment relationships and reduce interference, FW-DIA adopts a machine-learning-based data processing scheme, including feature detection and ion-pairing strategies, that correlates precursor protein molecules with their corresponding fragments.

In conclusion, this study aims to overcome the limitations of the traditional DDA method in top-down proteomics by developing the FW-DIA method, thereby achieving deeper and more comprehensive identification and characterization of proteoforms.
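The ion-pairing idea described above can be illustrated with a deliberately simple stand-in. The summary does not specify the paper's machine-learning model, so the sketch below instead uses plain Pearson correlation of elution profiles, a common heuristic in DIA data processing: a fragment is assigned to the precursor whose chromatographic trace it tracks most closely. The names `pearson` and `pair_fragments`, the `min_corr` threshold, and the toy intensity traces are all assumptions made for illustration.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length intensity traces."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    # A flat trace has no defined correlation; treat it as uncorrelated.
    return cov / (norm_x * norm_y) if norm_x and norm_y else 0.0


def pair_fragments(precursors, fragments, min_corr=0.9):
    """Assign each fragment to the best-correlated precursor, or None.

    min_corr is a hypothetical cutoff, not a value from the paper.
    """
    pairs = {}
    for frag_name, frag_trace in fragments.items():
        best_name, best_r = None, min_corr
        for prec_name, prec_trace in precursors.items():
            r = pearson(prec_trace, frag_trace)
            if r >= best_r:
                best_name, best_r = prec_name, r
        pairs[frag_name] = best_name
    return pairs


# Toy elution profiles (intensity per scan); all values are made up.
precursors = {
    "protein_A": [0, 2, 8, 10, 6, 1, 0, 0],
    "protein_B": [0, 0, 0, 1, 5, 9, 4, 0],
}
fragments = {
    "frag_1": [0, 1, 4, 5, 3, 0.5, 0, 0],      # co-elutes with protein_A
    "frag_2": [0, 0, 0, 0.5, 2.5, 4.5, 2, 0],  # co-elutes with protein_B
    "frag_3": [3, 3, 3, 3, 3, 3, 3, 3],        # flat background signal
}

pairs = pair_fragments(precursors, fragments)
print(pairs)
```

Because no precursor is isolated in FW-DIA, some step of this kind is needed to decide which fragments belong to which protein. A learned model would presumably draw on more features than profile shape alone, but that detail is not given here.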
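The charge-state limitation noted above can be made concrete with a small calculation. For a protein of neutral mass M carrying z protons, the ion appears at m/z = (M + z × 1.007276) / z, so one protein spreads its signal over a series of peaks. The sketch below computes that series and the fraction of signal captured when only the most intense charge state is isolated. The protein mass and the relative intensities are hypothetical values chosen for illustration, not data from the paper.

```python
PROTON_MASS = 1.007276  # mass of a proton in Da


def charge_state_mz(neutral_mass, charges):
    """Return {z: m/z} for the [M + zH]^z+ ion at each charge z."""
    return {z: (neutral_mass + z * PROTON_MASS) / z for z in charges}


# Hypothetical small protein and charge envelope (assumed values).
mass = 8565.0
relative_intensity = {5: 1, 6: 3, 7: 6, 8: 9, 9: 10, 10: 8, 11: 5, 12: 2, 13: 1}

series = charge_state_mz(mass, relative_intensity)

# Fraction of the protein's total signal carried by its most intense charge state.
fraction = max(relative_intensity.values()) / sum(relative_intensity.values())

for z, mz in series.items():
    print(f"z = {z:2d}  m/z = {mz:9.3f}")
print(f"single charge state carries {fraction:.1%} of the signal")
```

Under these assumed intensities, isolating one charge state discards most of the protein's ions before fragmentation, which is the loss that fragmenting all precursor ions is meant to avoid.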