Abstract: An analysis of the structurally and catalytically diverse serine hydrolase protein family in the Saccharomyces cerevisiae proteome was undertaken using two independent but complementary, large-scale approaches. The first approach is based on computational analysis of serine hydrolase active site structures; the second utilizes the chemical reactivity of the serine hydrolase active site in complex mixtures. These proteomics approaches share the ability to fractionate the complex proteome into functional subsets. Each method identified a significant number of sequences, but 15 proteins were identified by both methods. Eight of these were unannotated in the Saccharomyces Genome Database at the time of this study and are thus novel serine hydrolase identifications. Three of the previously uncharacterized proteins are members of a eukaryotic serine hydrolase family, designated as Fsh (family of serine hydrolases), identified here for the first time. OVCA2, a potential human tumor suppressor, and DYR-SCHPO, a dihydrofolate reductase from Schizosaccharomyces pombe, are members of this family. Comparing the combined results to results of other proteomic methods showed that only four of the 15 proteins were identified in a recent large-scale, "shotgun" proteomic analysis and eight were identified using a related, similar approach (neither identifies function). Only 10 of the 15 were annotated using alternate motif-based computational tools. The results demonstrate the precision derived from combining complementary, function-based approaches to extract biological information from complex proteomes. The chemical proteomics technology indicates that a functional protein is being expressed in the cell, while the computational proteomics technology adds details about the specific type of function and residue that is likely being labeled.
The combination of synergistic methods facilitates analysis, enriches true positive results, and increases confidence in novel identifications. This work also highlights the risks inherent in annotation transfer and the use of scoring functions for determination of correct annotations.

Enrichment-Based Proteogenomics Identifies Microproteins, Missing Proteins, and Novel smORFs in Saccharomyces cerevisiae

Special Enrichment Strategies Greatly Increase the Efficiency of Missing Proteins Identification from Regular Proteome Samples

Identification of Novel Bacterial Microproteins Encoded by Small Open Reading Frames Using a Computational Proteogenomics Workflow

Digging More Missing Proteins Using an Enrichment Approach with ProteoMiner

Digging for Missing Proteins Using Low-Molecular-Weight Protein Enrichment and a “Mirror Protease” Strategy

Cysteinyl peptide capture for shotgun proteomics: global assessment of chemoselective fractionation

Mass-spectrometry-based near-complete draft of the Saccharomyces cerevisiae proteome

Identification of Microproteins in Saccharomyces cerevisiae under Different Stress Conditions

Synergistic computational and experimental proteomics approaches for more accurate detection of active serine hydrolases in yeast

The Cryptic Bacterial Microproteome

Proteomics‐driven Identification of Short Open Reading Frame‐encoded Peptides

A catalog of small proteins from the global microbiome

Pro-SMP finder–A systematic approach for discovering small membrane proteins in prokaryotes

Improving silkworm genome annotation using a proteogenomics approach

smORFunction: a Tool for Predicting Functions of Small Open Reading Frames and Microproteins

Proteogenomic Analysis and Global Discovery of Posttranslational Modifications in Prokaryotes

No country for old methods: New tools for studying microproteins

Microproteins: from behind the scenes to the spotlight

sPepFinder expedites genome-wide identification of small proteins in bacteria