Predicting dynamic cellular protein-RNA interactions using deep learning and <i>in vivo</i> RNA structure
Lei Sun,Kui Xu,Wenze Huang,Yucheng Yang,Lei Tang,Tuanlin Xiong,Qiangfeng Cliff Zhang
DOI: https://doi.org/10.1101/2020.05.05.078774
2020-01-01
Abstract:Abstract Interactions with RNA-binding proteins (RBPs) are crucial for RNA regulation and function. While both RNA sequence and structure are critical determinants, RNA structure is dependent on cellular environment and especially important in regulating dynamic RBP bindings across various conditions. However, how distinct it contributes to RBP binding in vivo remains poorly understood. To address this issue, we obtained transcriptome-wide RNA secondary structure profiles in multiple cell-types, and established a deep neural network, PrismNet, that uses in vivo RNA structures to accurately predict cellular protein-RNA interactions. With a deep learning “attention” strategy, PrismNet discovers the exact binding nucleotides and their mutational effect. The predicted binding sites are highly conserved and enriched for rare, deleterious genetic variants. Remarkably, dynamic RBP binding sites are enriched for structure-changing variants (riboSNitches), which are often associated with disease, reflecting dysregulated RBP bindings. Our resource enables the analysis of cell-type-specific RNA regulation, with applications in human disease. Highlights 1, A big data resource of transcriptome-wide RNA secondary structure profiles in multiple cell types 2, PrismNet, a deep neural network, accurately models the sequence and structural combined patterns of protein-RNA interactions in vivo 3, RNA structural information in vivo is critical for the accurate prediction of dynamic RBP binding in various cellular conditions 4, PrismNet can dissect and predict how mutations affect RBP binding via RNA sequence or structure changes 5, RNA structure-changing RiboSNitches are enriched in dynamic RBP binding sites and often associated with disease, likely disrupting RBP-based regulation