Efficient and effective identification of cancer neoantigens from tumor only RNA-seq

Danilo Tatoni,Mattia Dalsass,Giulia Brunelli,Guido Grandi,Mario Chiariello,Romina D’Aurizio
DOI: https://doi.org/10.1101/2024.08.08.607127
2024-08-09
Abstract:The growing accessibility of sequencing experiments has significantly accelerated the development of personalized immunotherapies based on the identification of cancer neoantigens. Still, the prediction of neoantigens involves lengthy and inefficient protocols, requiring simultaneous analysis of sequencing data from paired tumor/normal exomes and tumor transcriptome, often resulting in a low success rate. To date, the feasibility of adopting a more efficient strategy has not been fully evaluated. To this end, we developed ENEO, a computational approach to detect cancer neoantigens using solely the tumor RNA-seq data while addressing the lack of matched control through a Bayesian probabilistic model. ENEO was assessed on TESLA benchmark dataset, reporting efficient identification of DNA-alterations derived neoantigens and compelling results against state-of-art exome-based methods. We further validated the method on two independent cohorts, encompassing different tumor types and experimental procedures. Our work demonstrates that a tumor-only RNA-based approach, such as the one implemented in ENEO, maintains accuracy in identifying mutated peptides resulting from expressed genomic alterations, while also broadening the pool of potential pMHCs with RNAspecific mutations in a faster and cost-effective way. ENEO is freely available at
Bioinformatics
What problem does this paper attempt to address?
The main goal of this paper is to develop a new computational method (ENEO) for efficiently and effectively identifying cancer neoantigens solely from tumor RNA sequencing data. Traditional methods require paired whole-exome or whole-genome sequencing data from both tumor and normal samples, which is time-consuming, costly, and often has a low success rate. The proposed method in the paper aims to overcome these limitations, specifically including: 1. **Problems Addressed**: - Current neoantigen prediction workflows typically rely on paired sequencing data from tumor and normal samples, which is not only costly and complex but also often identifies only a small fraction of neoantigens that can truly elicit an immune response. - Neoantigen prediction based solely on DNA variations fails to fully account for all detected peptides, as variations at the RNA level provide a more comprehensive representation of the neoantigen epitope space. 2. **Proposed Solution**: - Developed the ENEO computational workflow to identify neoantigens using only tumor RNA sequencing data, without the need for matched normal samples. - Addressed the lack of matched control samples by employing a Bayesian probability model that can estimate the probability of a variant being of germline or somatic origin. - The ENEO workflow includes steps such as read preprocessing, variant calling, and neoantigen prediction, ultimately generating a list of candidate neoantigens along with their predicted binding affinities to patient-specific HLA molecules. 3. **Validation Method**: - The paper evaluated the effectiveness of ENEO on the TESLA benchmark dataset, showing that ENEO can efficiently identify neoantigens derived from DNA variations and performs well compared to existing whole-exome-based methods. - Further validated the ENEO method on two independent datasets, covering different types of tumors and experimental setups. 4. **Main Contributions**: - Demonstrated that a neoantigen prediction method based solely on tumor RNA sequencing data can maintain identification accuracy while expanding the pool of potential pMHC complexes, and can do so in a faster and more cost-effective manner. - Provided a rapid and cost-effective method for detecting tumor neoepitopes, enhancing the role of deep transcriptomic analysis in precision oncology. In summary, this study aims to address the limitations in current neoantigen prediction by developing the ENEO tool, which is particularly significant for the advancement of personalized immunotherapy.