AScirRNA: A novel computational approach to discover abiotic stress-responsive circular RNAs in plant genomes
Upendra Kumar Pradhan,Prasanjit Behera,Ritwika Das,Sanchita Naha,Ajit Gupta,Rajender Parsad,Sukanta Kumar Pradhan,Prabina Kumar Meher
DOI: https://doi.org/10.1016/j.compbiolchem.2024.108205
IF: 3.737
2024-09-09
Computational Biology and Chemistry
Abstract: In plant biology, understanding the regulatory mechanisms that govern stress responses is a central pursuit. Circular RNAs (circRNAs), emerging as critical players in gene regulation, have recently garnered attention for their potential roles in abiotic stress adaptation. A comprehensive grasp of circRNA functions in stress response offers breeders avenues to develop abiotic stress-resistant crop cultivars that can thrive in challenging climates. This study pioneers a machine learning-based model for predicting abiotic stress-responsive circRNAs. K-tuple nucleotide composition (KNC) and Pseudo KNC (PKNC) features were used to represent circRNAs numerically. Three feature selection strategies were employed to select relevant and non-redundant features. Eight shallow and four deep learning algorithms were evaluated to build the final predictive model. Under five-fold cross-validation, the XGBoost learning algorithm performed best with 260 KNC features selected by LightGBM (Accuracy: 74.55%, auROC: 81.23%, auPRC: 76.52%) and with 160 PKNC features selected by LightGBM (Accuracy: 74.32%, auROC: 81.04%, auPRC: 76.43%), outperforming the other combinations of learning algorithms and feature selection techniques. Further, the robustness of the developed models was evaluated using an independent test dataset, where the overall accuracy, auROC and auPRC were found to be 73.13%, 72.34% and 72.68% for the KNC feature set and 73.52%, 79.53% and 73.09% for the PKNC feature set, respectively. This computational approach was also integrated into an online prediction tool, AScirRNA (https://iasri-sg.icar.gov.in/ascirna/), so that users can run predictions easily. Both the proposed model and the developed tool are poised to augment ongoing efforts to identify stress-responsive circRNAs in plants.
biology, computer science, interdisciplinary applications
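
To make the KNC representation mentioned in the abstract concrete, the following is a minimal sketch of how a K-tuple nucleotide composition vector could be computed for one sequence. It is not the authors' implementation: the function name `knc_features`, the choice to normalize by the number of valid windows, the mapping of U to T, and the skipping of windows with ambiguous bases are all assumptions made for illustration. The abstract does not state which values of k were used, so k is left as a parameter.

```python
from itertools import product

ALPHABET = "ACGT"  # assumed alphabet; U is mapped to T below


def knc_features(sequence, k):
    """Return normalized k-mer frequencies, ordered lexicographically by k-mer.

    Hypothetical helper for illustration only.
    """
    seq = sequence.upper().replace("U", "T")
    # Enumerate all 4**k possible k-mers so every sequence maps to the
    # same fixed-length feature vector.
    kmers = ["".join(p) for p in product(ALPHABET, repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = 0
    # Slide a window of width k along the sequence, one base at a time.
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer in counts:  # skip windows containing ambiguous bases such as N
            counts[kmer] += 1
            total += 1
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[kmer] / total for kmer in kmers]


# Toy sequence, not taken from the study's data.
example = "ACGTACGTAC"
features = knc_features(example, 2)
```

With k = 2 the vector has 16 entries, one per dinucleotide. Vectors for several values of k could be concatenated before feature selection, though whether the study did so is not stated in the abstract.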