DNA4mC-LIP: a Linear Integration Method to Identify N4-methylcytosine Site in Multiple Species

Qiang Tang, Juanjuan Kang, Jiaqing Yuan, Hua Tang, Xianhai Li, Hao Lin, Jian Huang, Wei Chen
DOI: https://doi.org/10.1093/bioinformatics/btaa143
IF: 5.8
2020-01-01
Bioinformatics
Abstract: MOTIVATION: DNA N4-methylcytosine (4mC) is a crucial epigenetic modification. However, knowledge about its biological functions is limited. Effective and accurate identification of 4mC sites will help to reveal its biological functions and mechanisms. Since experimental methods are costly and inefficient, a number of machine learning-based approaches have been proposed to detect 4mC sites. Although these methods yield acceptable accuracy, there is still room to improve the prediction performance and stability of existing methods in practical applications. RESULTS: In this work, we first systematically assessed the existing methods on an independent dataset. We then proposed DNA4mC-LIP, a linear integration method that combines existing predictors to identify 4mC sites in multiple species. The results obtained on the independent dataset demonstrated that DNA4mC-LIP outperformed existing methods for identifying 4mC sites. For the convenience of the scientific community, a web server for DNA4mC-LIP was developed. We anticipate that DNA4mC-LIP could serve as a powerful computational technique for identifying 4mC sites and facilitate the interpretation of 4mC mechanisms. AVAILABILITY AND IMPLEMENTATION: http://i.uestc.edu.cn/DNA4mC-LIP/. CONTACT: hlin@uestc.edu.cn or hj@uestc.edu.cn or chenweiimu@gmail.com. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
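The abstract does not give the integration formula, so the following is only a minimal sketch of what linearly combining existing predictors could look like in general: a weighted average of per-predictor scores compared against a cutoff. The function names, the example weights, and the 0.5 threshold are all hypothetical and are not taken from the paper.

```python
from typing import Sequence


def linear_integration(scores: Sequence[float], weights: Sequence[float]) -> float:
    """Return the weighted average of per-predictor scores.

    Assumption: each score is a probability-like value in [0, 1] produced by
    one existing 4mC predictor. The weights here are illustrative only.
    """
    if len(scores) != len(weights):
        raise ValueError("scores and weights must have the same length")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    # Normalise by the weight total so the result stays on the score scale.
    return sum(w * s for w, s in zip(weights, scores)) / total


def is_4mc_site(scores: Sequence[float], weights: Sequence[float],
                threshold: float = 0.5) -> bool:
    """Call a site positive when the combined score reaches the threshold.

    The default threshold of 0.5 is a hypothetical choice.
    """
    return linear_integration(scores, weights) >= threshold


# Hypothetical scores from three predictors for one candidate cytosine.
example_scores = [0.9, 0.4, 0.7]
example_weights = [1.0, 1.0, 2.0]
combined = linear_integration(example_scores, example_weights)
```

In a scheme like this, the weights would typically be chosen from each predictor's performance on held-out data; how DNA4mC-LIP actually sets them is described in the paper itself, not here.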