Abstract:Background Many model proteomes or "complete" sets of proteins of given organisms are now publicly available. Much effort has been invested in computational annotation of those "draft" proteomes. Motif or domain based algorithms play a pivotal role in functional classification of proteins. Employing most available computational algorithms, mainly motif or domain recognition algorithms, we set up to develop an online proteome annotation system with integrated proteome annotation data to complement existing resources. Results We report here the development of PCAS (ProteinCentric Annotation System) as an online resource of pre-computed proteome annotation data. We applied most available motif or domain databases and their analysis methods, including hmmpfam search of HMMs in Pfam, SMART and TIGRFAM, RPS-PSIBLAST search of PSSMs in CDD, pfscan of PROSITE patterns and profiles, as well as PSI-BLAST search of SUPERFAMILY PSSMs. In addition, signal peptide and TM are predicted using SignalP and TMHMM respectively. We mapped SUPERFAMILY and COGs to InterPro , so the motif or domain databases are integrated through InterPro . PCAS displays table summaries of pre-computed data and a graphical presentation of motifs or domains relative to the protein. As of now, PCAS contains human IPI, mouse IPI, and rat IPI, A. thaliana , C. elegans , D. melanogaster , S. cerevisiae , and S. pombe proteome. PCAS is available at http://pak.cbi.pku.edu.cn/proteome/gca.php Conclusion PCAS gives better annotation coverage for model proteomes by employing a wider collection of available algorithms. Besides presenting the most confident annotation data, PCAS also allows customized query so users can inspect statistically less significant boundary information as well. Therefore, besides providing general annotation information, PCAS could be used as a discovery platform. We plan to update PCAS twice a year. We will upgrade PCAS when new proteome annotation algorithms identified.

CAPER 3.0: A Scalable Cloud-Based System for Data-Intensive Analysis of Chromosome-Centric Human Proteome Project Data Sets.

CAPER 2.0: an Interactive, Configurable, and Extensible Workflow-Based Platform to Analyze Data Sets from the Chromosome-centric Human Proteome Project

Caper: A Chromosome-Assembled Human Proteome Browser

Abstract P326: an Innovative Peptide Spectral Library Search Engine for Cardiovascular Proteomics

Abstract P327: COPa Library: A Proteomic Knowledge Base for Cardiovascular Biology and Medicine

EAP: a versatile cloud-based platform for comprehensive and interactive analysis of large-scale ChIP/ATAC-seq data sets

HiCOPS: High Performance Computing Framework for Tera-Scale Database Search of Mass Spectrometry based Omics Data

A fully automated system with online sample loading, isotope dimethyl labeling and multidimensional separation for high-throughput quantitative proteome analysis.

Identifying PE2 and PE5 Proteins from Existing Mass Spectrometry Data Using Pfind

CIMAGE2.0: An Expanded Tool for Quantitative Analysis of Activity-Based Protein Profiling (ABPP) Data

PepQuery Enables Fast, Accurate, and Convenient Proteomic Validation of Novel Genomic Alterations

ECL 3.0: a sensitive peptide identification tool for cross-linking mass spectrometry data analysis

Quest for Missing Proteins: Update 2015 on Chromosome-Centric Human Proteome Project

PCAS – a Precomputed Proteome Annotation Database Resource

PepQuery2 democratizes public MS proteomics data for rapid peptide searching

ProteomicsBrowser: MS/proteomics data visualization and investigation

MS-PyCloud: A Cloud Computing-Based Pipeline for Proteomic and Glycoproteomic Data Analyses

Cloud-enabled Scalable Analysis of Large Proteomics Cohorts

An Integrated Software System for Analyzing ChIP-chip and ChIP-seq Data

Cloud Computing for Protein-Ligand Binding Site Comparison

A Description of the Clinical Proteomic Tumor Analysis Consortium (CPTAC) Common Data Analysis Pipeline