Abstract:High-grade serous ovarian cancer (HGSOC) is one of the most lethal gynecological malignancies. Lack of common targetable oncogenic mutations has complicated the development of directed therapies to combat emerging resistance. This malignancy is mainly characterized by the mutation of gene TP53, which promotes genome instability for the emergence of extensive copy number variations (CNVs). However its impact on gene expression at the single-cell level is not well understood. In this study, we aim to investigate the effect of CNVs on transcriptomic signatures by taking advantage of variational autoencoders (VAE) ability for dimensionality reduction, unsupervised learning and feature extraction. The use of VAEs is becoming more popular for the analysis of scRNAseq data, and scVI is one of the most versatile VAE applications performing wide variety of tasks. Here, we used single-cell RNA sequencing (scRNA-seq) data from 90 longitudinal samples of 64 HGSOC patients and inferred CNVs in each cell using inferCNV, an established computational pipeline. Then, we modified scVI algorithm to allow the VAE to reconstruct CNVs from a latent space originated from gene expression profiles and viceversa. Our models were capable of reconstructing CNV profiles accurately from expression data and also remove batch effect. From these models we could observe how, after the integration of genomic information, the latent clusters produced from the transcriptomic space were influenced by the amplifications or deletions of certain genomic regions. Moreover, some of these clusters were characterized by the alteration of important oncogenes in HGSOC, such as KRAS and CCNE1, and allowed us to focus on the transcriptomic consequences of their amplifications. With the results from the approach presented here, we gained a more comprehensive picture of the impact of genomic alterations on HGSOC. As future work, we plan to validate of this results on external cohorts and link the identified signatures to clinically relevant features such as prognosis or chemotherapy response. Citation Format: Matias Marin Falco, Teemu Närhi, Erdogan Pekcan Erkan, Johanna Hynninen, Anna Vähärautio. Studying the impact of CNVs on expression at single-cell resolution in HGSOC using autoencoders [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2024; Part 1 (Regular s); 2024 Apr 5-10; San Diego, CA. Philadelphia (PA): AACR; Cancer Res 2024;84(6_Suppl) nr 893.

Evaluating deep variational autoencoders trained on pan-cancer gene expression

Extracting a biologically relevant latent space from cancer transcriptomes with variational autoencoders

Variational Autoencoders for Feature Exploration and Malignancy Prediction of Lung Lesions

Variational Autoencoder for Anti-Cancer Drug Response Prediction

Integrated multi-omics analysis of ovarian cancer using variational autoencoders

Variational and Explanatory Neural Networks for Encoding Cancer Profiles and Predicting Drug Responses

$Γ$-VAE: Curvature regularized variational autoencoders for uncovering emergent low dimensional geometric structure in high dimensional data

scVAE: variational auto-encoders for single-cell gene expression data

Variational autoencoders learn transferrable representations of metabolomics data

Abstract 893: Studying the impact of CNVs on expression at single-cell resolution in HGSOC using autoencoders

Cancer Subtyping by Improved Transcriptomic Features Using Vector Quantized Variational Autoencoder

Performance Comparison of Deep Learning Autoencoders for Cancer Subtype Detection Using Multi-Omics Data

XOmiVAE: an interpretable deep learning model for cancer classification using high-dimensional omics data

Incorporating Prior Knowledge in Deep Learning Models via Pathway Activity Autoencoders

Parameter tuning is a key part of dimensionality reduction via deep variational autoencoders for single cell RNA transcriptomics

DeepCancer: Detecting Cancer through Gene Expressions via Deep Generative Learning

Accurate Tumor Subtype Detection with Raman Spectroscopy via Variational Autoencoder and Machine Learning

Identification of monotonically expressed long non-coding RNA signatures for breast cancer using variational autoencoders

Autoencoders with shared and specific embeddings for multi-omics data integration

Integrated Multi-omics Analysis Using Variational Autoencoders: Application to Pan-cancer Classification

Explainable autoencoder-based representation learning for gene expression data