Assembly of an early-matured japonica ( Geng ) rice genome, Suijing18, based on PacBio and Illumina sequencing

Shou-Jun Nie,Yu-Qiang Liu,Chun-Chao Wang,Shi-Wei Gao,Tian-Tian Xu,Qing Liu,Hui-Lin Chang,Yu-Bao Chen,Peng-Cheng Yan,Wei Peng,Tian-Qing Zheng,Jian-Long Xu,Zhi-Kang Li
DOI: https://doi.org/10.1038/sdata.2017.195
2017-01-01
Scientific Data
Abstract:The early-matured japonica ( Geng ) rice variety, Suijing18 (SJ18), carries multiple elite traits including durable blast resistance, good grain quality, and high yield. Using PacBio SMRT technology, we produced over 25 Gb of long-read sequencing raw data from SJ18 with a coverage of 62×. Using Illumina paired-end whole-genome shotgun sequencing technology, we generated 59 Gb of short-read sequencing data from SJ18 (23.6 Gb from a 200 bp library with a coverage of 59× and 35.4 Gb from an 800 bp library with a coverage of 88×). With these data, we assembled a single SJ18 genome and then generated a set of annotation data. These data sets can be used to test new programs for variation deep mining, and will provide new insights into the genome structure, function, and evolution of SJ18, and will provide essential support for biological research in general.
What problem does this paper attempt to address?