Abstract:Reliable quantification of low-abundance proteins in complex proteomes is challenging largely owing to the limited number of spectra/peptides identified. In this study we developed a straightforward method to improve the quantitative accuracy and precision of proteins by strategically retrieving the less confident peptides that were previously filtered out using the standard target-decoy search strategy. The filtered-out MS/MS spectra matched to confidently-identified proteins were recovered, and the peptide-spectrum-match FDR were re-calculated and controlled at a confident level of FDR≤1%, while protein FDR maintained at ~1%. We evaluated the performance of this strategy in both spectral count- and ion current-based methods. >60% increase of total quantified spectra/peptides was respectively achieved for analyzing a spike-in sample set and a public dataset from CPTAC. Incorporating the peptide retrieval strategy significantly improved the quantitative accuracy and precision, especially for low-abundance proteins (e.g. one-hit proteins). Moreover, the capacity of confidently discovering significantly-altered proteins was also enhanced substantially, as demonstrated with two spike-in datasets. In summary, improved quantitative performance was achieved by this peptide recovery strategy without compromising confidence of protein identification, which can be readily implemented in a broad range of quantitative proteomics techniques including label-free or labeling approaches. SIGNIFICANCE:We hypothesize that more quantifiable spectra and peptides in a protein, even including less confident peptides, could help reduce variations and improve protein quantification. Hence the peptide retrieval strategy was developed and evaluated in two spike-in sample sets with different LC-MS/MS variations using both MS1- and MS2-based quantitative approach. The list of confidently identified proteins using the standard target-decoy search strategy was fixed and more spectra/peptides with less confidence matched to confident proteins were retrieved. However, the total peptide-spectrum-match false discovery rate (PSM FDR) after retrieval analysis was still controlled at a confident level of FDR≤1%. As expected, the penalty for occasionally incorporating incorrect peptide identifications is negligible by comparison with the improvements in quantitative performance. More quantifiable peptides, lower missing value rate, better quantitative accuracy and precision were significantly achieved for the same protein identifications by this simple strategy. This strategy is theoretically applicable for any quantitative approaches in proteomics and thereby provides more quantitative information, especially on low-abundance proteins.

Improve the Tandem Mass Spectra Identification and Reduce False Negative Rate Using Peptide Lc Retention Time.

Improving peptide identification using an empirical peptide retention time database.

A Three-Stage Search Strategy Combining Database Reduction and Retention Time Filtering to Improve the Sensitivity of Low-Input and Single-Cell Proteomic Analysis.

Increasing Your Peptide Identification Sensitivity Without Changing Your Mass Spectrometer

Application Of Peptide Retention Time In Proteome Research

Protein identification assisted by the prediction of retention time in liquid chromatography/tandem mass spectrometry.

Efficient discovery of abundant post-translational modifications and spectral pairs using peptide mass and retention time differences

Peptide Identification from Mixture Tandem Mass Spectra

Oscore: a Combined Score to Reduce False Negative Rates for Peptide Identification in Tandem Mass Spectrometry Analysis

Peptide Selection for Accurate Targeted Protein Quantification Via a Dimethylation High-Resolution Mass Spectrum Strategy with a Peptide Release Kinetic Model

A peptide-retrieval strategy enables significant improvement of quantitative performance without compromising confidence of identification.

Enhancing peptide identification confidence by combining search methods.

Improving Peptide Identification with Single-Stage Mass Spectrum Peaks.

Improving Deep Proteome and PTMome Coverage Using Tandem HILIC-HPRP Peptide Fractionation Strategy

Integrating de Novo Sequencing and Database Search for Peptide Identification

Increased confidence in large-scale phosphoproteomics data by complementary mass spectrometric techniques and matching of phosphopeptide data sets

PeakSelect: Preprocessing Tandem Mass Spectra for Better Peptide Identification.

Preprocessing Significantly Improves the Peptide/protein Identification Sensitivity of High-Resolution Isobarically Labeled Tandem Mass Spectrometry Data.

An Accurate and Efficient Algorithm for Peptide and Ptm Identification by Tandem Mass Spectrometry.

A Systematical Analysis of Tryptic Peptide Identification with Reverse Phase Liquid Chromatography and Electrospray Ion Trap Mass Spectrometry

Test-Time Training for Deep MS/MS Spectrum Prediction Improves Peptide Identification.