Abstract: The Chromosome-Centric Human Proteome Project (C-HPP) has made great progress in finding protein evidence (PE) for missing proteins (PE2-4 proteins as defined by neXtProt), a task that is becoming increasingly challenging. Because most samples tested in this field came from adult tissues or cells, proteins specific or relevant to particular developmental stages could be missed owing to the limited availability of biological sources. We posit that epigenetic interventions may partially bypass this limitation by stimulating the expression of "silenced" genes in adult cells, increasing the chance of finding missing proteins. In this study, we established in vitro human cell models to modify histone acetylation, demethylation, and methylation under near-physiological conditions. mRNA-seq analysis showed that histone modifications increased the overall number of expressed genes, and that the increase was evenly distributed across chromosomes. We identified 64 PE2-4 and 6 PE5 proteins by MaxQuant (FDR < 1% at both protein and peptide levels) and 44 PE2-4 and 7 PE5 proteins by Mascot (FDR < 1% at peptide level). However, only 24 PE2-4 and 5 PE5 proteins in the Mascot search, and 12 PE2-4 and 1 PE5 protein in the MaxQuant search, passed our stringent manual spectrum inspection. Collectively, 27 PE2-4 and 5 PE5 proteins were identified from the epigenetically modified cells; among them, 19 PE2-4 and 3 PE5 proteins passed FDR < 1% at both peptide and protein levels. Gene ontology analyses revealed that the PE2-4 proteins were significantly involved in development and spermatogenesis, although their physicochemical features did not differ statistically from the background. In addition, we present an example of a suspicious PE5 peptide spectrum match involving unusual amino acid (AA) substitutions related to post-translational modification.
In conclusion, epigenetically manipulated cell models should be a useful tool for finding missing proteins in the C-HPP. The mass spectrometry data have been deposited in the iProx database (accession number: IPX00020200).

Identifying PE2 and PE5 Proteins from Existing Mass Spectrometry Data Using pFind

Finding Missing Proteins from the Epigenetically Manipulated Human Cell with Stringent Quality Criteria.

Abstract P326: an Innovative Peptide Spectral Library Search Engine for Cardiovascular Proteomics

ProteinInferencer: Confident protein identification and multiple experiment comparison for large scale proteomics projects

Identification of Missing Proteins Defined by Chromosome-Centric Proteome Project in the Cytoplasmic Detergent-Insoluble Proteins.

Digging for Missing Proteins Using Low-Molecular-Weight Protein Enrichment and a “Mirror Protease” Strategy

Special Enrichment Strategies Greatly Increase the Efficiency of Missing Proteins Identification from Regular Proteome Samples

Multi-Protease Strategy Identifies Three PE2 Missing Proteins in Human Testis Tissue

Leveraging the Human Panproteome to Enhance Peptide and Protein Identification in Proteomics and Metaproteomics

Open-pFind Enhances the Identification of Missing Proteins from Human Testis Tissue

Open-pFind Verified Four Missing Proteins from Multi-Tissues

Comprehensive Identification of Peptides in Tandem Mass Spectra Using an Efficient Open Search Engine.

Identification of Phosphopeptides with Unknown Cleavage Specificity by a De Novo Sequencing Assisted Database Search Strategy.

Quest for Missing Proteins: Update 2015 on Chromosome-Centric Human Proteome Project

Deep Coverage Proteomics Identifies More Low-Abundance Missing Proteins in Human Testis Tissue with Q-Exactive HF Mass Spectrometer

Efficient discovery of abundant post-translational modifications and spectral pairs using peptide mass and retention time differences

DPHL V2: an Updated and Comprehensive DIA Pan-Human Assay Library for Quantifying More Than 14,000 Proteins

Development and Preliminary Application of a Peptide Mass Fingerprinting Technique in Proteome Research.

Improvement of Peptide Separation for Exploring the Missing Proteins Localized on Membranes

Integration with the human genome of peptide sequences obtained by high-throughput mass spectrometry

Resolving chromosome-centric human proteome with translating mRNA analysis: a strategic demonstration.