Precision DNA methylation typing via hierarchical clustering of Nanopore current signals and attention-based neural network

Qi Dai,Hu Chen,Wen-Jing Yi,Jia-Ning Zhao,Wei Zhang,Ping-An He,Xiao-Qing Liu,Ying-Feng Zheng,Zhuo-Xing Shi
DOI: https://doi.org/10.1093/bib/bbae596
IF: 9.5
2024-11-18
Briefings in Bioinformatics
Abstract:Decoding DNA methylation sites through nanopore sequencing has emerged as a cutting-edge technology in the field of DNA methylation research, as it enables direct sequencing of native DNA molecules without the need for prior enzymatic or chemical treatments. During nanopore sequencing, methylation modifications on DNA bases cause changes in electrical current intensity. Therefore, constructing deep neural network models to decode the electrical signals of nanopore sequencing has become a crucial step in methylation site identification. In this study, we utilized nanopore sequencing data containing diverse DNA methylation types and motif sequence diversity. We proposed a feature encoding method based on current signal clustering and leveraged the powerful attention mechanism in the Transformer framework to construct the PoreFormer model for identifying DNA methylation sites in nanopore sequencing. The model demonstrated excellent performance under conditions of multi-class methylation and motif sequence diversity, offering new insights into related research fields.
biochemical research methods,mathematical & computational biology
What problem does this paper attempt to address?