Towards routine proteome profiling of FFPE tissue: Insights from a 1,200 case pan-cancer study

Johanna Tueshaus, Stephan Eckert, Marius Fraefel, Yuxiang Zhou, Pauline Pfeiffer, Christiane Halves, Federico Fusco, Johannes Weigl, Lisa Hönikl, Vicki Butenschön, Rumyana Todorova, Hilka Rauert-Wunderlich, Matthew The, Andreas Rosenwald, Volker Heinemann, Julian Walter Holch, Bernhard Meyer, Wilko Weichert, Carolin Mogler, Peer-Hendrik Kuhn, Bernhard Küster
DOI: https://doi.org/10.1101/2024.06.21.600043
2024-06-27
Abstract: Proteome profiling of formalin-fixed paraffin-embedded (FFPE) specimens has gained traction for the analysis of cancer tissue for the discovery of molecular biomarkers. However, reports so far focused on single cancer entities, comprised relatively few cases and did not assess the long-term performance of experimental workflows. Here, we did so by analyzing 1,220 tumors from six cancer entities processed over the course of three years. Key findings include the need for a new normalization method ensuring equal and reproducible sample loading for LC-MS/MS analysis across cohorts, showing that tumors can, on average, be profiled to a depth of >4,000 proteins and discovering that current software fails to process such large data sets. We report the first comprehensive pan-cancer proteome expression resource for FFPE material comprising 11,000 proteins which is of immediate utility to the scientific community by way of a web resource. It enables a range of analyses including quantitative comparisons of proteins between patients or cohorts or the discovery of protein fingerprints representing the tissue of origin, or proteins enriched in certain cancer entities.
Pathology
What problem does this paper attempt to address?
This paper addresses the technical and data-processing challenges of proteomic analysis of formalin-fixed paraffin-embedded (FFPE) tissue in routine clinical practice. Specifically, it focuses on the following aspects:

1. **Improving the consistency and reproducibility of sample loading**: To ensure equal sample loading for LC-MS/MS analysis across cohorts, the researchers introduced a new peptide quantification and normalization method based on the total ion chromatogram (TIC). The TIC of each sample is compared with a standard curve, and the sample volume is adjusted accordingly to achieve consistent loading.
2. **The ability to process large-scale data sets**: The study found that current software tools cannot process data sets of this size in a single run, that is, they cannot complete peptide and protein identification, quantification, and false discovery rate (FDR) control all at once. The researchers therefore processed the data from each cohort separately and then merged the results with the "Selective Proteome FDR" software to ensure consistent protein grouping and FDR control.
3. **Comprehensive analysis of proteomic expression characteristics**: By analyzing the proteomes of 1,220 tumor samples covering six cancer types, the researchers constructed the first comprehensive pan-cancer proteome expression resource for FFPE material, containing approximately 11,000 proteins. The data can be used for quantitative comparisons between patients or cohorts and for discovering protein fingerprints specific to a tissue of origin or a cancer type.
4. **The stability and reliability of the long-term experimental process**: The researchers evaluated the stability and reliability of the experimental workflow over a three-year period, especially where the time intervals between cohorts were long. Quality control (QC) samples of HeLa cell lysate were inserted at random to monitor the performance of the LC-FAIMS-MS/MS system and to ensure consistent, reliable data.
5. **Differences in protein expression among different cancer types**: Through UMAP analysis and quantitative comparison, the researchers found differences in protein expression among cancer types and identified some specific protein markers. These findings help in understanding the molecular mechanisms of different cancer types and provide potential biomarkers for future clinical diagnosis and treatment.

In conclusion, through large-scale proteomic analysis of FFPE tumor samples, this paper addresses several technical challenges and provides a valuable public data resource for the research community.
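
The TIC-based loading normalization described in point 1 can be illustrated with a small calculation. The following is only a minimal sketch, not the authors' implementation: it assumes the standard curve is a straight line relating injected peptide amount to integrated TIC, and all function names, units, and numbers are hypothetical values chosen for illustration.

```python
from statistics import mean


def fit_standard_curve(amounts_ng, tic_values):
    """Fit TIC = slope * amount + intercept by ordinary least squares.

    Assumption: the TIC response is linear over the calibrated range.
    """
    x_mean, y_mean = mean(amounts_ng), mean(tic_values)
    sxx = sum((x - x_mean) ** 2 for x in amounts_ng)
    sxy = sum((x - x_mean) * (y - y_mean)
              for x, y in zip(amounts_ng, tic_values))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return slope, intercept


def estimate_amount_ng(tic, slope, intercept):
    """Invert the standard curve to estimate the injected peptide amount."""
    return (tic - intercept) / slope


def injection_volume_ul(test_tic, test_volume_ul, target_ng, slope, intercept):
    """Volume to inject so that the sample delivers target_ng of peptide.

    test_tic is the integrated TIC of a test injection of test_volume_ul.
    """
    amount_ng = estimate_amount_ng(test_tic, slope, intercept)
    if amount_ng <= 0:
        raise ValueError("TIC is below the range of the standard curve")
    concentration_ng_per_ul = amount_ng / test_volume_ul
    return target_ng / concentration_ng_per_ul


if __name__ == "__main__":
    # Hypothetical standard: known peptide amounts and their integrated TICs.
    standard_ng = [50, 100, 200, 400]
    standard_tic = [1.0e9, 2.0e9, 4.0e9, 8.0e9]
    slope, intercept = fit_standard_curve(standard_ng, standard_tic)

    # Hypothetical samples: integrated TIC from a 1 uL test injection each.
    test_tics = {"tumor_A": 3.0e9, "tumor_B": 6.0e9}
    for name, tic in test_tics.items():
        volume = injection_volume_ul(tic, 1.0, 200.0, slope, intercept)
        print(f"{name}: inject {volume:.2f} uL")
```

In this sketch, a sample with a higher TIC per microliter is assigned a proportionally smaller injection volume, so every sample delivers roughly the same peptide amount to the LC-MS/MS system regardless of cohort.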