PPR-Meta: a tool for identifying phages and plasmids from metagenomic fragments using deep learning

Zhencheng Fang,Jie Tan,Shufang Wu,Mo Li,Congmin Xu,Zhongjie Xie,Huaiqiu Zhu
DOI: https://doi.org/10.1093/gigascience/giz066
IF: 7.658
2019-06-01
GigaScience
Abstract:Phages and plasmids are the major components of mobile genetic elements, and fragments from such elements generally co-exist with chromosome-derived fragments in sequenced metagenomic data. However, there is a lack of efficient methods that can simultaneously identify phages and plasmids in metagenomic data, and the existing tools identifying either phages or plasmids have not yet presented satisfactory performance.We present PPR-Meta, a 3-class classifier that allows simultaneous identification of both phage and plasmid fragments from metagenomic assemblies. PPR-Meta consists of several modules for predicting sequences of different lengths. Using deep learning, a novel network architecture, referred to as the Bi-path Convolutional Neural Network, is designed to improve the performance for short fragments. PPR-Meta demonstrates much better performance than currently available similar tools individually for phage or plasmid identification, while testing on both artificial contigs and real metagenomic data. PPR-Meta is freely available via <a class="link link-uri" href="http://cqb.pku.edu.cn/ZhuLab/PPR_Meta">http://cqb.pku.edu.cn/ZhuLab/PPR_Meta</a> or <a class="link link-uri" href="https://github.com/zhenchengfang/PPR-Meta">https://github.com/zhenchengfang/PPR-Meta</a>.To the best of our knowledge, PPR-Meta is the first tool that can simultaneously identify phage and plasmid fragments efficiently and reliably. The software is optimized and can be easily run on a local PC by non-computer professionals. We developed PPR-Meta to promote the research on mobile genetic elements and horizontal gene transfer.
multidisciplinary sciences
What problem does this paper attempt to address?