Abstract

Motivation: Interactions among cis-regulatory elements such as enhancers and promoters are the main driving forces shaping context-specific chromatin structure and gene expression. Although computational methods exist for predicting gene expression from genomic and epigenomic information, most of them overlook long-range enhancer-promoter interactions because of the difficulty of precisely linking regulatory enhancers to their target genes. Recently, a novel high-throughput experimental approach named HiChIP has been developed and is generating comprehensive data on high-resolution interactions between promoters and distal enhancers. Meanwhile, many studies have suggested that deep learning achieves state-of-the-art performance in epigenomic signal prediction, thereby promoting the understanding of regulatory elements. In consideration of these two factors, we integrate proximal promoter sequences and HiChIP distal enhancer-promoter interactions to accurately model gene expression.

Results: We propose DeepExpression, a densely connected convolutional neural network that predicts gene expression using both promoter sequences and enhancer-promoter interactions. We demonstrate that our model consistently outperforms baseline methods, not only in the classification of binary gene expression status but also in the regression of continuous gene expression levels, in both cross-validation experiments and cross-cell-line predictions. We show that sequence-based promoter information is more informative than experimental enhancer information, while the most beneficial enhancer-promoter interactions are those within ±100 kbp of the TSS of a gene. Finally, we visualize motifs in both promoter and enhancer regions and show that the identified sequence signatures match known motifs. We expect to see a wide spectrum of applications using HiChIP data in deciphering the mechanism of gene regulation.

Availability: DeepExpression is freely available at https://github.com/wanwenzeng/DeepExpression.

Contact: ruijiang@tsinghua.edu.cn, ywang@amss.ac.cn

Supplementary information: Supplementary data are available at Bioinformatics online.
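
The two kinds of input named in the abstract, proximal promoter sequence and distal enhancer-promoter interactions restricted to a window around the TSS, can be illustrated with a short sketch. This is not code from the DeepExpression repository: the function names, the use of the anchor midpoint to measure distance, and the all-zero encoding of ambiguous bases are assumptions made only for illustration.

```python
# Illustrative sketch only; names and data layout are assumptions,
# not taken from the DeepExpression implementation.
from typing import List, Tuple

WINDOW_BP = 100_000  # the +/-100 kbp window mentioned in the abstract
BASES = "ACGT"


def one_hot(seq: str) -> List[List[int]]:
    """Encode a DNA sequence as one row of four indicators (A, C, G, T) per base.

    Bases outside A/C/G/T (for example N) become an all-zero row (assumed convention).
    """
    return [[1 if base == b else 0 for b in BASES] for base in seq.upper()]


def enhancers_near_tss(
    tss: int,
    anchors: List[Tuple[int, int]],
    window: int = WINDOW_BP,
) -> List[Tuple[int, int]]:
    """Keep enhancer anchors (start, end) whose midpoint lies within +/-window bp of the TSS."""
    kept = []
    for start, end in anchors:
        midpoint = (start + end) // 2
        if abs(midpoint - tss) <= window:
            kept.append((start, end))
    return kept
```

For a hypothetical gene with its TSS at position 1,000,000, `enhancers_near_tss(1_000_000, anchors)` would discard any anchor whose midpoint is more than 100 kbp away, and `one_hot` would turn the promoter sequence into a matrix suitable as input to a convolutional layer.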

DeepLncPro: an interpretable convolutional neural network model for identifying long non-coding RNA promoters

LncLSTA: A Versatile Predictor Unveiling Subcellular Localization of Lncrnas Through Long-Short Term Attention

iProL: identifying DNA promoters from sequence information based on Longformer pre-trained model

PromID: human promoter prediction by deep learning

GraphPro: An interpretable graph neural network-based model for identifying promoters in multiple species

Prediction of Prokaryotic and Eukaryotic Promoters Using Convolutional Deep Learning Neural Networks

LncDLSM: Identification of Long Non-Coding RNAs With Deep Learning-Based Sequence Model

LncADeep: an Ab Initio Lncrna Identification and Functional Annotation Tool Based on Deep Learning

Recognition of prokaryotic and eukaryotic promoters using convolutional deep learning neural networks

A novel deep learning identifier for promoters and their strength using heterogeneous features

A comprehensive survey on deep learning-based identification and predicting the interaction mechanism of long non-coding RNAs

EVlncRNA-Dpred: improved prediction of experimentally validated lncRNAs by deep learning

DeepLPI: a multimodal deep learning method for predicting the interactions between lncRNAs and protein isoforms

Integrating distal and proximal information to predict gene expression via a densely connected convolutional neural network

In-depth characterization and identification of translatable lncRNAs

iPromoter-CLA: Identifying promoters and their strength by deep capsule networks with bidirectional long short-term memory

Evaluation of deep-learning-based lncRNA identification tools

iPro-GAN: A novel model based on generative adversarial learning for identifying promoters and their strength

ncRNAInter: a novel strategy based on graph neural network to discover interactions between lncRNA and miRNA

DeepRegFinder: deep learning-based regulatory elements finder