Classification and Regression Trees to predict Transcription Factor Combinatorial Interaction in scRNA-seq data

Jean Baptiste Carluer,Laura Steinmann,Clément Carré,André Mas,Gabriel Krouk
DOI: https://doi.org/10.1101/2024.04.17.589552
2024-04-20
Abstract:Understanding the regulatory mechanisms that govern gene expression is crucial for deciphering cellular functions. Transcription factors (TFs) play a key role in regulating gene expression. In particular TF combinatorial interactions (TFCI) are now thought to largely shape genomic transcriptional responses, but predicting TFCI is still a difficult task. Single-cell RNA sequencing (scRNA-seq) has emerged as a powerful tool providing a whole new readout of gene regulatory effects. In this study, we propose a machine learning approach utilizing Classification and Regression Trees (CART) for predicting TFCI in >110k scRNA-seq data points yielded from root. The proposed methodology provides a valuable tool for pointing to new TFCI mechanisms and could advance our understanding of Gene Regulatory Networks’ functioning.
Bioinformatics
What problem does this paper attempt to address?