Abstract:

BACKGROUND: The region adjacent to a transcription start site (TSS), known as the promoter, is primarily involved in the initiation and regulation of DNA transcription. Accurate promoter identification is therefore critical for understanding the networks that control genomic regulation. A number of methods for identifying promoters have been proposed. Nonetheless, because promoters are highly heterogeneous, the results of these methods remain unsatisfactory. To establish additional discriminative features and recognize promoters more reliably, we developed the hybrid model for promoter identification (HMPI), a hybrid deep learning model that characterizes both the native sequences of promoters and their structural profiles at the same time. The HMPI combines two components: the PSFN (promoter sequence features network), which characterizes native promoter sequences and derives sequence features, and the DSPN (deep structural profiles network), which is specifically designed to model promoters in terms of their structural profiles and to derive their structural features.

RESULTS: The HMPI was applied to human, plant and Escherichia coli K-12 strain datasets. The findings showed that the HMPI successfully extracted promoter features while greatly improving promoter identification performance. In addition, after synthetic sampling, transfer learning and label smoothing regularization were added, the improved HMPI models achieved good results in identifying promoter subtypes on prokaryotic promoter datasets.

CONCLUSIONS: The HMPI successfully extracted promoter features while greatly improving promoter identification performance on both eukaryotic and prokaryotic datasets, and the improved HMPI models performed well at identifying promoter subtypes on prokaryotic promoter datasets. The HMPI is also adaptable to other biological functional sequences and allows new features or models to be added.
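Label smoothing regularization, one of the improvements named in the abstract, can be illustrated with a minimal sketch. The abstract does not give the smoothing factor, the number of promoter subtypes, or any implementation details, so the function names, `epsilon=0.1`, and the four-class example below are assumptions chosen only for illustration. The sketch uses the standard formulation, in which a one-hot target y over K classes becomes (1 - epsilon) * y + epsilon / K.

```python
def one_hot_label(index, num_classes):
    """Build a one-hot target vector for the class at `index`."""
    return [1.0 if i == index else 0.0 for i in range(num_classes)]


def smooth_labels(one_hot, epsilon=0.1):
    """Apply standard label smoothing: (1 - epsilon) * y + epsilon / K.

    `epsilon` is an assumed value; the abstract does not state the one used.
    """
    num_classes = len(one_hot)
    return [(1.0 - epsilon) * y + epsilon / num_classes for y in one_hot]


# Hypothetical example: 4 promoter subtypes, true subtype at index 2.
target = smooth_labels(one_hot_label(2, 4), epsilon=0.1)
print(target)
```

The smoothed target still sums to 1, but the true class no longer carries the full probability mass, which discourages over-confident predictions when subtype classes are small or imbalanced.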

Promoter Analysis and Prediction in the Human Genome Using Sequence-Based Deep Learning Models.

PromID: human promoter prediction by deep learning

Promoter prediction and recognition in human genome based on hybrid neural networks

Prediction of Prokaryotic and Eukaryotic Promoters Using Convolutional Deep Learning Neural Networks

DeeProPre: A Promoter Predictor Based on Deep Learning

A Hybrid Neural Network System for Prediction and Recognition of Promoter Regions in Human Genome

Image-based Promoter Prediction: a Promoter Prediction Method Based on Evolutionarily Generated Patterns.

PromPredictor: A Hybrid Machine Learning System for Recognition and Location of Transcription Start Sites in Human Genome.

Recognition of prokaryotic and eukaryotic promoters using convolutional deep learning neural networks

Computational identification of eukaryotic promoters based on cascaded deep capsule neural networks

A Successful Hybrid Deep Learning Model Aiming at Promoter Identification

High-resolution Human Core-Promoter Prediction with CoreBoost_HM

DeepLncPro: an interpretable convolutional neural network model for identifying long non-coding RNA promoters

DPProm: A Two-Layer Predictor for Identifying Promoters and Their Types on Phage Genome Using Deep Learning

Critical assessment of computational tools for prokaryotic and eukaryotic promoter prediction

iProEP: A Computational Predictor for Predicting Promoter.

Eukaryotic and Prokaryotic Promoter Prediction Using Hybrid Approach

A novel deep learning identifier for promoters and their strength using heterogeneous features

Prediction of human promoter with Least Square Support Vector Machine based on Kernel Locality Preserving Projection

DeepRegFinder: deep learning-based regulatory elements finder

A Pattern-Based Nearest Neighbor Search Approach for Promoter Prediction Using DNA Structural Profiles