Human cells contain myriad excised linear intron RNAs with links to gene regulation and potential utility as biomarkers

Jun Yao,Hengyi Xu,Elizabeth A. Ferrick-Kiddie,Ryan M. Nottingham,Douglas C. Wu,Manuel Ares,Alan M. Lambowitz
DOI: https://doi.org/10.1371/journal.pgen.1011416
IF: 4.5
2024-09-28
PLoS Genetics
Abstract:A previous study using Thermostable Group II Intron Reverse Transcriptase sequencing (TGIRT-seq) found human plasma contains short (≤300 nt) structured full-length excised linear intron (FLEXI) RNAs with potential to serve as blood-based biomarkers. Here, TGIRT-seq identified >9,000 different FLEXI RNAs in human cell lines, including relatively abundant FLEXIs with cell-type-specific expression patterns. Analysis of public CLIP-seq datasets identified 126 RNA-binding proteins (RBPs) that have binding sites within the region corresponding to the FLEXI or overlapping FLEXI splice sites in pre-mRNAs, including 53 RBPs with binding sites for ≥30 different FLEXIs. These included splicing factors, transcription factors, a chromatin remodeling protein, cellular growth regulators, and proteins with cytoplasmic functions. Analysis of ENCODE datasets identified subsets of these RBPs whose knockdown impacted FLEXI host gene mRNA levels or proximate alternative splicing, indicating functional interactions. Hierarchical clustering identified six subsets of RBPs whose FLEXI binding sites were co-enriched in six subsets of functionally related host genes: AGO1-4 and DICER, including but not limited to agotrons or mirtron pre-miRNAs; DKC1, NOLC1, SMNDC1, and AATF (Apoptosis Antagonizing Transcription Factor), including but not limited to snoRNA-encoding FLEXIs; two subsets of alternative splicing factors; and two subsets that included RBPs with cytoplasmic functions ( e . g ., LARP4, PABPC4, METAP2, and ZNF622) together with regulatory proteins. Cell fractionation experiments showed cytoplasmic enrichment of FLEXI RNAs with binding sites for RBPs with cytoplasmic functions. The subsets of host genes encoding FLEXIs with binding sites for different subsets of RBPs were co-enriched with non-FLEXI other short and long introns with binding sites for the same RBPs, suggesting overarching mechanisms for coordinately regulating expression of functionally related genes. Our findings identify FLEXIs as a previously unrecognized large class of cellular RNAs and provide a comprehensive roadmap for further analyzing their biological functions and the relationship of their RBPs to cellular regulatory mechanisms. We characterized a previously unrecognized, large class of full-length excised liner intron (FLEXI) RNAs that are ≤300 nucleotides in length and relatively stable in human cells and plasma. These included subsets of FLEXIs with binding sites for different subsets of RNA-binding proteins (RBPs) that have different cellular functions and are co-enriched in subsets of functionally related genes with other short and long introns with binding sites for the same RBPs, potentially enabling coordinate regulation of different gene sets. Many of the proteins that have binding sites in different subsets of FLEXIs were also identified as interacting with each other by STRING protein-network analysis, potentially enabling co-localization of subsets of proteins to different intracellular locations, including the cytoplasm. Our findings provide a comprehensive road map for further analysis of the functions of FLEXIs and their associated RBPs and for the identification of FLEXIs that can serve as biomarkers in human cells and plasma.
genetics & heredity
What problem does this paper attempt to address?