Comparison of Accuracy of Whole-Exome Sequencing with Formalin-Fixed Paraffin-Embedded and Fresh Frozen Tissue Samples

Ensel Oh,Yoon-La Choi,Mi Jeong Kwon,Ryong Nam Kim,Yu Jin Kim,Ji-Young Song,Kyung Soo Jung,Young Kee Shin
DOI: https://doi.org/10.1371/journal.pone.0144162
IF: 3.7
2015-12-07
PLoS ONE
Abstract:Formalin fixing with paraffin embedding (FFPE) has been a standard sample preparation method for decades, and archival FFPE samples are still very useful resources. Nonetheless, the use of FFPE samples in cancer genome analysis using next-generation sequencing, which is a powerful technique for the identification of genomic alterations at the nucleotide level, has been challenging due to poor DNA quality and artificial sequence alterations. In this study, we performed whole-exome sequencing of matched frozen samples and FFPE samples of tissues from 4 cancer patients and compared the next-generation sequencing data obtained from these samples. The major differences between data obtained from the 2 types of sample were the shorter insert size and artificial base alterations in the FFPE samples. A high proportion of short inserts in the FFPE samples resulted in overlapping paired reads, which could lead to overestimation of certain variants; >20% of the inserts in the FFPE samples were double sequenced. A large number of soft clipped reads was found in the sequencing data of the FFPE samples, and about 30% of total bases were soft clipped. The artificial base alterations, C>T and G>A, were observed in FFPE samples only, and the alteration rate ranged from 200 to 1,200 per 1M bases when sequencing errors were removed. Although high-confidence mutation calls in the FFPE samples were compatible to that in the frozen samples, caution should be exercised in terms of the artifacts, especially for low-confidence calls. Despite the clearly observed artifacts, archival FFPE samples can be a good resource for discovery or validation of biomarkers in cancer research based on whole-exome sequencing.
multidisciplinary sciences
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how the accuracy of whole - exome sequencing (WES) using formalin - fixed paraffin - embedded (FFPE) samples compares with that of fresh - frozen tissue samples in cancer genome analysis. Although FFPE samples are widely used in clinical research because of their easy preservation and rich historical data, their poor DNA quality and numerous artificial sequence variations pose challenges to next - generation sequencing (NGS). By comparing the WES data obtained from the matched FFPE and fresh - frozen tissue samples of 4 cancer patients, the paper evaluated the applicability and accuracy of FFPE samples in WES, especially focusing on the impact of DNA damage caused by FFPE treatment on variant detection. Specifically, the paper explored the following aspects: 1. **Insert size**: The insert size of FFPE samples is shorter, which may lead to over - estimation of certain alleles. 2. **Soft - clipped reads**: There are a large number of soft - clipped reads in FFPE samples, which may affect the accuracy of mutation detection. 3. **Base transitions**: The C > T and G > A base transitions specific to FFPE samples are caused by formalin fixation and are significantly increased in FFPE samples. 4. **Somatic mutation detection**: Somatic single - nucleotide variants (SNV) were detected by the MuTect tool to evaluate the mutation consistency between FFPE samples and fresh - frozen samples. Overall, this study aims to evaluate the reliability and potential limitations of FFPE samples in WES, especially the false - positive results that may occur in low - confidence mutation calls. Despite some technical challenges, the study shows that FFPE samples are still an important resource for biomarker discovery or validation in cancer research.