Predicting genome‐wide tissue‐specific enhancers via combinatorial transcription factor genomic occupancy analysis

Huma Shireen, Fatima Batool, Hizran Khatoon, Nazia Parveen, Noor Us Sehar, Irfan Hussain, Shahid Ali, Amir Ali Abbasi
DOI: https://doi.org/10.1002/1873-3468.15030
2024-10-05
FEBS Letters
Abstract: This study introduces a sequence‐based computational model to predict tissue‐specific enhancers using transcription factor genomic occupancy. Trained on epigenetic signatures and verified enhancer datasets, it identifies ~25 000 forebrain‐specific cis‐regulatory modules in the human genome. Validation through multiple datasets and transgenic zebrafish assays confirms the model's effectiveness, advancing knowledge of tissue‐specific gene regulation and enhancer discovery. Enhancers are non‐coding cis‐regulatory elements crucial for transcriptional regulation. Mutations in enhancers can disrupt gene regulation, leading to disease phenotypes. Identifying enhancers and their tissue‐specific activity is challenging because they lack stereotyped sequences. This study presents a sequence‐based computational model that uses combinatorial transcription factor (TF) genomic occupancy to predict tissue‐specific enhancers. Trained on diverse datasets, including ENCODE and VISTA Enhancer Browser data, the model predicted 25 000 forebrain‐specific cis‐regulatory modules (CRMs) in the human genome. Validation using biochemical features, disease‐associated SNPs, and in vivo zebrafish analysis confirmed its effectiveness. This model aids in predicting enhancers that lack well‐characterized chromatin features, complementing experimental approaches to tissue‐specific enhancer discovery.
cell biology, biochemistry & molecular biology, biophysics