A Deep Learning Method for Recovering Missing Signals in Transcriptome-Wide RNA Structure Profiles from Probing Experiments

Jing Gong,Kui Xu,Ziyuan Ma,Zhi John Lu,Qiangfeng Cliff Zhang
DOI: https://doi.org/10.1038/s42256-021-00412-0
IF: 23.8
2021-01-01
Nature Machine Intelligence
Abstract:Sequencing-based RNA structure probing can generate transcriptome-wide profiles of RNA secondary structures. Sufficient structural coverage is needed to obtain unbiased insights about RNA structures and functions, yet probing methods often yield uneven coverage, with missing structural scores across many transcripts. To overcome this barrier, we developed StructureImpute, a deep learning framework inspired by depth completion from computer vision that integrates an RNA sequence with available RNA structural information of neighbouring nucleotides to infer missing structure scores. We demonstrate the strong imputation performance of StructureImpute, with accuracy much superior to predictions based on RNA sequence alone. We also show that StructureImpute reliably reconstructs RNA structural patterns at biologically impactful RNA regulation regions, including protein-binding and RNA-modification sites. Strikingly, StructureImpute can use transfer learning to apply a model trained on one dataset to accurately infer missing structural scores in other datasets, even if they were generated with different technologies (for example, icSHAPE and DMS-seq).
What problem does this paper attempt to address?