PlantDeepSEA, a deep learning-based web service to predict the regulatory effects of genomic variants in plants

Hu Zhao,Zhuo Tu,Yinmeng Liu,Zhanxiang Zong,Jiacheng Li,Hao Liu,Feng Xiong,Jinling Zhan,Xuehai Hu,Weibo Xie
DOI: https://doi.org/10.1093/nar/gkab383
IF: 14.9
2021-05-25
Nucleic Acids Research
Abstract:Abstract Characterizing regulatory effects of genomic variants in plants remains a challenge. Although several tools based on deep-learning models and large-scale chromatin-profiling data have been available to predict regulatory elements and variant effects, no dedicated tools or web services have been reported in plants. Here, we present PlantDeepSEA as a deep learning-based web service to predict regulatory effects of genomic variants in multiple tissues of six plant species (including four crops). PlantDeepSEA provides two main functions. One is called Variant Effector, which aims to predict the effects of sequence variants on chromatin accessibility. Another is Sequence Profiler, a utility that performs ‘in silico saturated mutagenesis’ analysis to discover high-impact sites (e.g., cis-regulatory elements) within a sequence. When validated on independent test sets, the area under receiver operating characteristic curve of deep learning models in PlantDeepSEA ranges from 0.93 to 0.99. We demonstrate the usability of the web service with two examples. PlantDeepSEA could help to prioritize regulatory causal variants and might improve our understanding of their mechanisms of action in different tissues in plants. PlantDeepSEA is available at http://plantdeepsea.ncpgr.cn/.
biochemistry & molecular biology
What problem does this paper attempt to address?
The paper aims to address the challenging problem of predicting the regulatory effects of genomic variations in plants. Specifically, although there are some tools based on deep learning models and large-scale chromatin mapping data available for predicting regulatory elements and variant effects, there are no dedicated tools or web services for this purpose in plants. Therefore, the authors developed PlantDeepSEA, a deep learning-based web service platform for predicting the regulatory effects of genomic variations in multiple tissues of six plant species (including four crops). PlantDeepSEA offers two main functionalities: 1. **Variant Effector**: Predicts the impact of sequence variations on chromatin accessibility. 2. **Sequence Profiler**: Performs "in silico saturation mutagenesis" analysis to identify high-impact sites in the sequence (e.g., cis-regulatory elements). Through validation on independent test sets, the deep learning models in PlantDeepSEA achieved an area under the receiver operating characteristic curve (AUROC) ranging from 0.93 to 0.99. This indicates that the platform has high predictive accuracy and can help prioritize regulatory causal variants, thereby improving our understanding of plant regulatory mechanisms in different tissues.