Operon Prediction Using Neural Network Based on Multiple Information of Log-Likelihoods

Wei Du, Yan Wang, Shuqin Wang, Xiumei Wang, Fangxun Sun, Chen Zhang, Chunguang Zhou, Chengquan Hu, Yanchun Liang
DOI: https://doi.org/10.1007/978-3-540-72383-7_77
2007-01-01
Abstract: Operons are a basic organizational unit of microbial genomes, and operon prediction is an important step in studying gene transcription and regulatory mechanisms in these genomes. This paper predicts operons in the Escherichia coli K12 genome using a neural network based on four types of genomic log-likelihood data. The method first estimates log-likelihood values for intergenic distances, COG gene functions, conserved gene pairs, and phylogenetic profiles, and then feeds this information to a generalized regression neural network that classifies gene pairs as lying within an operon (WO pairs) or at a transcription unit border (TUB pairs). Tested on E. coli K12, the method achieves average sensitivity, specificity, and accuracy of 85.9%, 89.2%, and 87.9%, respectively, which indicates that it performs well at operon prediction.
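As a rough illustration of the pipeline the abstract describes, the sketch below treats each gene pair as a four-value vector of log-likelihood scores (intergenic distance, COG function, conserved gene pair, phylogenetic profile) and scores it with a generalized regression neural network, written here in its usual form as Gaussian-kernel-weighted averaging over the training pairs. The function names, the smoothing width `sigma`, the 0.5 decision threshold, the toy feature values, and the standard confusion-matrix definitions of sensitivity and specificity are all assumptions made for illustration; the abstract does not specify them, and this is not the authors' implementation.

```python
import numpy as np


def grnn_predict(train_x, train_y, query_x, sigma=0.5):
    """Score query vectors with a GRNN (Gaussian-kernel weighted average).

    train_x: (n_train, n_features) log-likelihood features of labelled pairs.
    train_y: (n_train,) labels, 1 for WO pairs and 0 for TUB pairs.
    query_x: (n_query, n_features) features of the pairs to score.
    sigma:   kernel width (hypothetical default, would need tuning).
    """
    train_x = np.asarray(train_x, dtype=float)
    train_y = np.asarray(train_y, dtype=float)
    query_x = np.atleast_2d(np.asarray(query_x, dtype=float))

    # Squared Euclidean distance from every query to every training pair.
    diff = query_x[:, None, :] - train_x[None, :, :]
    sq_dist = np.sum(diff ** 2, axis=2)

    # Pattern layer: one Gaussian unit per training pair.
    weights = np.exp(-sq_dist / (2.0 * sigma ** 2))

    # Summation and output layers: weighted mean of the training labels.
    denom = weights.sum(axis=1)
    denom = np.where(denom == 0.0, np.finfo(float).tiny, denom)
    return (weights @ train_y) / denom


def confusion_metrics(y_true, y_pred):
    """Return (sensitivity, specificity, accuracy) with WO = positive class."""
    y_true = np.asarray(y_true, dtype=int)
    y_pred = np.asarray(y_pred, dtype=int)
    tp = int(np.sum((y_true == 1) & (y_pred == 1)))
    tn = int(np.sum((y_true == 0) & (y_pred == 0)))
    fp = int(np.sum((y_true == 0) & (y_pred == 1)))
    fn = int(np.sum((y_true == 1) & (y_pred == 0)))
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    accuracy = (tp + tn) / len(y_true)
    return sensitivity, specificity, accuracy


# Toy, made-up log-likelihood vectors (not data from the paper).
train_x = [
    [1.2, 0.8, 0.5, 0.9],      # WO pair
    [0.9, 1.1, 0.7, 0.4],      # WO pair
    [-1.0, -0.6, -0.8, -0.3],  # TUB pair
    [-0.7, -1.2, -0.4, -0.9],  # TUB pair
]
train_y = [1, 1, 0, 0]
query_x = [[1.0, 0.9, 0.6, 0.6], [-0.9, -0.8, -0.6, -0.5]]

scores = grnn_predict(train_x, train_y, query_x)
predicted = (scores >= 0.5).astype(int)  # 1 = WO, 0 = TUB
```

A GRNN has no iterative training step: the labelled pairs themselves form the pattern layer, so the only parameter to choose in this sketch is `sigma`, which would typically be selected by cross-validation.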