Abstract:Abstract Real-time quantitative PCR (RT-qPCR) is a widely used method for quantifying microRNA (miRNA) expression, but its accuracy depends on appropriate normalization. RT-qPCR is susceptible to technical variation from several sources, including sample collection and storage, miRNA quantity/quality, and extraction and amplification efficiencies. Normalization corrects for these factors, theoretically leaving behind only biological variation. With growing interest in miRNAs as biomarkers for the diagnosis and prognosis of various cancers, analyses must be accurate and consistent. Primarily, the comparative Ct method is used, which compares the expression of miRNAs to a reference value. Unfortunately, there is a lack of consistency on what to use as a reference. A standard recommendation is the average of multiple, stable miRNAs referred to as endogenous controls (ECs). Currently, there are no known universally stable miRNAs. Therefore, ECs should be selected on a per-disease and per-sample type basis. Multiple algorithms exist to identify stable ECs, but few studies indicate what, if any, steps were taken to implement them correctly. Also, many studies have used non-ideal reference values such as small nuclear RNAs or a single miRNA. This lack of agreement on protocol often leads to difficulty comparing studies and may produce incorrect results. This work addresses these inconsistencies by making recommendations for the normalization of RT-qPCR miRNA expression. We compared 2 widely used methods for EC selection, NormFinder and GeNorm, and to show how normalization influences results we also used an unstable miRNA. Only miRNAs with expression in all samples were considered as potential ECs. GeNorm is sensitive to correlated genes. Therefore, for any pair or trio of correlated miRNAs 1 was kept. NormFinder performs optimally with 5-10 candidate genes; therefore, we tested the 10 miRNAs with the lowest coefficients of variation (CV). The top 3 stable miRNAs were selected from each algorithm to serve as ECs, and we used their average expression to normalize each sample. For the single miRNA, samples were scaled based on its value. We applied these methods to 3 independent datasets: First, plasma samples from a study on canine osteosarcoma (OSA), including pre-amputation (n=45), post-amputation (n=27), and healthy controls (n=21). Second, tissue samples from a study on canine OSA, including primary tumour (n=42) and lung metastases (n=12). Third, serum samples from a study on canine lymphoma included B-cell (n=24), T-cell (n=16), and healthy controls (n=14). For datasets 1 and 3, NormFinder provided a better reduction in gene-specific CV and a more favourable cumulative distribution of the CV. For dataset 2, there was less initial variability, but NormFinder still had slightly better results. Both algorithms had pros and cons, but NormFinder consistently provides a better reduction in sample variation across datasets. Citation Format: Heather Treleaven, Latasha Ludwig, Alicia Viloria-Petit, R. Darren Wood, Ayesha Ali, Geoffrey A. Wood. Comparison of normalization methods for RT-qPCR microRNA expression in cancer datasets [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2024; Part 1 (Regular Abstracts); 2024 Apr 5-10; San Diego, CA. Philadelphia (PA): AACR; Cancer Res 2024;84(6_Suppl):Abstract nr 2350.

Effect of normalization methods on the performance of supervised learning algorithms applied to HTSeq-FPKM-UQ data sets: 7SK RNA expression as a predictor of survival in patients with colon adenocarcinoma

Performance evaluation of transcriptomics data normalization for survival risk prediction

Machine Learning Analysis of RNA-seq Data for Diagnostic and Prognostic Prediction of Colon Cancer

Normalization of Real-Time Quantitative Reverse Transcription-PCR Data: A Model-Based Variance Estimation Approach to Identify Genes Suited for Normalization, Applied to Bladder and Colon Cancer Data Sets

Application of RNA Processing Factors for Predicting Clinical Outcomes in Colon Cancer.

Evaluating Cross-Platform Normalization Methods for Integrated Microarray and RNA-seq Data Analysis

Deep learning survival model on transcriptomes level in patients with non-small cell lung cancer.

Abstract 2350: Comparison of normalization methods for RT-qPCR microRNA expression in cancer datasets

Ratio Adjustment and Calibration Scheme for Gene-Wise Normalization to Enhance Microarray Inter-Study Prediction

Effect of RNA-Seq data normalization on protein interactome mapping for Alzheimer's disease

Using Machine Learning Methods to Study Colorectal Cancer Tumor Micro-Environment and Its Biomarkers.

Impact of MRI radiomic feature normalization for prognostic modelling in uterine endometrial and cervical cancers

A Systematic Comparison of Normalization Methods for Eqtl Analysis

Deep learning-based cancer survival prognosis from RNA-seq data: approaches and evaluations

Performance analysis of data resampling on class imbalance and classification techniques on multi-omics data for cancer classification

Identification of Gene Expression in Different Stages of Breast Cancer with Machine Learning

Feature selection followed by a novel residuals-based normalization simplifies and improves single-cell gene expression analysis

Extensive increase of microarray signals in cancers calls for novel normalization assumptions

Depth Normalization of Small RNA Sequencing: Using Data and Biology to Select a Suitable Method

Performance Assessment of Machine Learning Classifiers Using Selective Feature Approaches for Cervical Cancer Detection

Analysis of RNA-Seq data using self-supervised learning for vital status prediction of colorectal cancer patients