An improved genome assembly of the fluke Schistosoma japonicum
Fang Luo,Mingbo Yin,Xiaojin Mo,Chengsong Sun,Qunfeng Wu,Bingkuan Zhu,Manyu Xiang,Jipeng Wang,Yi Wang,Jian Li,Ting Zhang,Bin Xu,Huajun Zheng,Zheng Feng,Wei Hu
DOI: https://doi.org/10.1371/journal.pntd.0007612
2019-08-07
PLoS Neglected Tropical Diseases
Abstract:<span><em>Schistosoma japonicum</em> is a parasitic flatworm that causes human schistosomiasis, which is a significant cause of morbidity in China and the Philippines. A single draft genome was available for <em>S</em>. <em>japonicum</em>, yet this assembly is very fragmented and only covers 90% of the genome, which make it difficult to be applied as a reference in functional genome analysis and genes discovery.In this study, we present a high-quality assembly of the fluke <em>S</em>. <em>japonicum</em> genome by combining 20 G (~53X) long single molecule real time sequencing reads with 80 G (~ 213X) Illumina paired-end reads. This improved genome assembly is approximately 370.5 Mb, with contig and scaffold N50 length of 871.9 kb and 1.09 Mb, representing 142.4-fold and 6.2-fold improvement over the released WGS-based assembly, respectively. Additionally, our assembly captured 85.2% complete and 4.6% partial eukaryotic Benchmarking Universal Single-Copy Orthologs. Repetitive elements account for 46.80% of the genome, and 10,089 of the protein-coding genes were predicted from the improved genome, of which 96.5% have been functionally annotated. Lastly, using the improved assembly, we identified 20 significantly expanded gene families in <em>S</em>. <em>japonicum</em>, and those genes were primarily enriched in functions of proteolysis and protein glycosylation.Using the combination of PacBio and Illumina Sequencing technologies, we provided an improved high-quality genome of <em>S</em>. <em>japonicum</em>. This improved genome assembly, as well as the annotation, will be useful for the comparative genomics of the flukes and more importantly facilitate the molecular studies of this important parasite in the future.Schistosomiasis is an acute and chronic disease that remains one of the most prevalent and serious of the parasitic diseases in the world. Three major <em>Schistosoma</em> species cause human schistosomiasis, including <em>Schistosoma japonicum</em>, <em>S</em>. <em>mansoni</em> and <em>S</em>. <em>haematobium</em>. However, the three schistosome references or draft genomes were released in the last decade, which greatly facilitate the progress in the whole research field of schistosome. However, limited by the sequencing technique and mixture samples at that time, only a genome draft was suppled to <em>S</em>. <em>japonicum</em>, which is fragmented and difficult to be a reference in functional genome analysis and gene discovery. Here, using the combination of PacBio and Illumina Sequencing technologies, we present a high-quality assembly of <em>S</em>. <em>japonicum</em> with contig and scaffold N50 length of 871.9 kb and 1.09 Mb, representing 142.4-fold and 6.2-fold improvement over the released WGS-based assembly, respectively. The assembly genome with high quality will certainly supply a new reference genome of <em>S</em>. <em>japonicum</em> and be beneficial to functional genomic and comparative genomics of schistosome, as well as other helminths.</span>
tropical medicine,parasitology