A Comprehensive Analysis of 3’UTRs in

Emma Murari, Dalton Meadows, Nicholas Cuda, Marco Mangone
DOI: https://doi.org/10.1101/2024.02.15.580525
2024-02-17
Abstract: 3’ Untranslated Regions (3’UTRs) are essential portions of genes that contain elements necessary for pre-mRNA 3’ end processing and are involved in post-transcriptional gene regulation. Despite their importance, they remain poorly characterized in eukaryotes. Here, we have used a multi-pronged approach to extract and curate 3’UTR data from 11,533 publicly available datasets, corresponding to the entire collection of transcriptomes stored in the NCBI repository from 2009 to 2023, and present its complete 3’UTRome dataset sequenced at single-base resolution. This updated 3’UTRome is the most comprehensive resource in any metazoan, covering 97.4% of the 20,362 experimentally validated protein-coding genes with refined and updated 3’UTR boundaries for 23,489 3’UTR isoforms. We also used this novel dataset to identify and characterize sequence elements involved in pre-mRNA 3’ end processing and to update miRNA target predictions. This resource provides important insights into 3’UTR formation, function, and regulation in eukaryotes.
Genomics