Cross-Cell-Type Prediction Of Tf-Binding Site By Integrating Convolutional Neural Network And Adversarial Network

Gongqiang Lan,Jiyun Zhou,Ruifeng Xu,Qin Lu,Hongpeng Wang
DOI: https://doi.org/10.3390/ijms20143425
IF: 5.6
2019-01-01
International Journal of Molecular Sciences
Abstract:Transcription factor binding sites (TFBSs) play an important role in gene expression regulation. Many computational methods for TFBS prediction need sufficient labeled data. However, many transcription factors (TFs) lack labeled data in cell types. We propose a novel method, referred to as DANN TF, for TFBS prediction. DANN TF consists of a feature extractor, a label predictor, and a domain classifier. The feature extractor and the domain classifier constitute an Adversarial Network, which ensures that learned features are common features across different cell types. DANN TF is evaluated on five TFs in five cell types with a total of 25 cell-type TF pairs and compared to a baseline method which does not use Adversarial Network. For both data augmentation and cross-cell-type prediction, DANN TF performs better than the baseline method on most cell-type TF pairs. DANN TF is further evaluated by an additional 13 TFs in the five cell types with a total of 65 cell-type TF pairs. Results show that DANN TF achieves significantly higher AUC than the baseline method on 96.9% pairs of the 65 cell-type TF pairs. This is a strong indication that DANN TF can indeed learn common features for cross-cell-type TFBS prediction.
What problem does this paper attempt to address?