Convolutional Hybrid Kernel Network for in-vitro Transcription Factor Binding Sites.

Zihan Zhao,Chuanhuan Yin
DOI: https://doi.org/10.1145/3592686.3592693
2023-01-01
Abstract:Deep learning has become a prominent method for learning high-dimensional structured data representation in many domains, and it is gaining traction in bioinformatics. The discovery of transcription factor binding sites (TFBSs) is crucial for understanding the underlying binding mechanisms and cellular functions. To improve the performance of predicting TFBSs, many deep learning and kernel approaches are used for biological sequences. However, each of them has its limitations. In this paper, a convolutional kernel network based on the CKN-seq is extended for in-vitro TFBS prediction experiments and a hybrid framework is proposed, which integrates convolutional kernel network and convolutional neural network (CKNN) to jointly process DNA sequences and their corresponding shape information. The framework integrates DNA sequences and shape features appropriately to better understand protein-DNA binding preferences. Through transcription factor binding experiments, we find that the framework improves prediction accuracy and performs better on small datasets. The hybrid framework not only has the advantages of deep learning in specific tasks but also combines the strengths of high efficiency of the kernel function in small data sets, so it will have broad prospects in small datasets in biological information, intrusion detection, and other fields.
What problem does this paper attempt to address?