Gene Prediction in Metagenomic Fragments with Deep Learning

Shao-Wu Zhang,Xiang-Yang Jin,Teng Zhang
DOI: https://doi.org/10.1155/2017/4740354
2017-01-01
BioMed Research International
Abstract:Next generation sequencing technologies used in metagenomics yield numerous sequencing fragments which come from thousands of different species. Accurately identifying genes from metagenomics fragments is one of the most fundamental issues in metagenomics. In this article, by fusing multifeatures (i.e., monocodon usage, monoamino acid usage, ORF length coverage, and Z-curve features) and using deep stacking networks learning model, we present a novel method (called Meta-MFDL) to predict the metagenomic genes. The results with 10 CV and independent tests show that Meta-MFDL is a powerful tool for identifying genes from metagenomic fragments.
What problem does this paper attempt to address?