OrthReg: a Tool to Predict Cis-Regulatory Elements Based on Cross-Species Orthologous Sequence Conservation

Yun-Fei Ma,Cui-Ping Huang,Fang-Ru Lu,Jin-Xiu Li,Xu-Man Han,Adeniyi C Adeola,Yun Gao,Jia-Kun Deng,Hai-Bing Xie,Ya-Ping Zhang
DOI: https://doi.org/10.24272/j.issn.2095-8137.2020.099
2020-01-01
Zoological Research
Abstract:Cis-regulatory elements play an important role in the development of traits and disease in organisms (Ma et al., 2020; Woolfe et al., 2005) and their annotation could facilitate genetic studies. The Encyclopedia of DNA Elements (ENCODE) ( Davis et al., 2018) and Functional Annotation of Animal Genomes (FAANG) ( FAANG Consortium et al., 2015) offer pioneering data on regulatory elements in several species. Currently, however, regulatory element annotation data remain limited for most organisms. In this study, we developed a tool (OrthReg) for annotating conserved orthologous cis-regulatory elements in targeted genomes using an annotated reference genome. Cross-species validation of this annotation tool using human and mouse ENCODE data confirmed the robustness of this strategy. To explore the efficiency of the tool, we annotated the pig genome and identified more than 28 million regulatory annotation records using the reference human ENCODE data. With this regulatory annotation, some putative regulatory non-coding variants were identified within domestication sweeps in European and East Asian pigs. Thus, this tool can utilize data produced by ENCODE, FAANG, and similar projects, and can be easily extended to customized experimental data. The extensive application of this tool will help to identify informative single nucleotide polymorphisms (SNPs) in post-genome-wide association studies and resequencing analysis of organisms with limited regulatory annotation data.
What problem does this paper attempt to address?