An interpretable deep learning model predicts RNA-small molecule binding sites
Wenyu Xi,Ruheng Wang,Li Wang,Xiucai Ye,Mingyang Liu,Tetsuya Sakurai
DOI: https://doi.org/10.1016/j.future.2024.05.029
IF: 7.307
2024-05-19
Future Generation Computer Systems
Abstract:The intricate interplay between RNA molecules and small ligands plays a pivotal role in regulating biological processes, underscoring the necessity for accurate prediction models of RNA- small molecule binding sites. In this study, we develop CapBind, a novel interpretive deep learning framework utilizing a Capsule Network architecture, to precisely identify RNA- small molecule binding sites. This framework stands out for its multifaceted approach, which leverages both the sequence and structural information of RNA as input features, allowing for a thorough understanding of the complex interactions between RNA and small molecules. Leveraging multi-source bioinformatics data, including the distribution of RNA secondary structures, our model characterizes sequence and structural features with unparalleled depth. Through a unified alignment of these multi-source feature representations, we achieve dimensional homogeneity in the potential feature maps for both types of biological data. The crux of our methodology lies in the deployment of a multi-level Capsule Network, engineered to identify the nuanced interactions of RNA sequences with small molecules. The essence of our approach resides in the implementation of a multi-layered Capsule Network. Utilizing a sophisticated routing mechanism, this network demonstrates unparalleled efficacy in learning and identifying the intricate interactions between RNA sequences and small molecules. This mechanism enables dynamic adjustment of information flow across multiple levels, thereby capturing subtle features and patterns that might elude conventional neural network architectures. The experimental results demonstrate the superior performance of our model, particularly highlighting its progress in evaluation metrics compared to current state-of-the-art models. Case studies further validate the model's precision in identifying binding sites, offering potential insights into gene expression pathways and novel therapeutic avenues for neurodegenerative diseases. Moreover, an extensive interpretability analysis of our silicon model reveals the internal interaction mechanisms between specific RNA regions and small molecules, contributing valuable perspectives for drug design and molecular biology research. These findings underscore the model's predictive accuracy and biological applicability, establishing our approach as a significant leap forward in the computational prediction of RNA- small molecule binding sites.
computer science, theory & methods