Single-molecule real-time transcript sequencing of developing cotton anthers facilitates genome annotation and fertility restoration candidate gene discovery

Ting Li, Xuexian Zhang, Liping Guo, Tingxiang Qi, Huini Tang, Hailin Wang, Xiuqin Qiao, Meng Zhang, Bingbing Zhang, Juanjuan Feng, Zhidan Zuo, Yongjie Zhang, Chaozhu Xing, Jianyong Wu
DOI: https://doi.org/10.1016/j.ygeno.2021.11.014
IF: 4.31
2021-11-01
Genomics
Abstract: Heterosis refers to the superior phenotypes observed in hybrids. The cytoplasmic male sterility (CMS) system plays an important role in cotton heterosis utilization. However, the global gene expression patterns of CMS-D2 and its interaction with the restorer gene Rf1 remain unclear. Here, full-length transcript sequencing was performed on anthers of the CMS-D2 restorer line using PacBio single-molecule real-time (SMRT) sequencing technology. By combining PacBio SMRT long-read isoforms with Illumina RNA-seq data, 107,066 isoforms from 44,338 loci were obtained, including 10,086 novel isoforms of novel genes and 66,419 new isoforms of known genes. In total, 56,572 alternative splicing (AS) events, 1146 lncRNAs, 61 fusion transcripts, 10,466 genes exhibiting alternative polyadenylation (APA), and 60,995 novel isoforms with predicted open reading frames (ORFs) were further identified. Furthermore, genes specifically expressed in the restorer line were selected and confirmed by qRT-PCR. These findings provide a basis for upland cotton genome annotation and transcriptome research, and will help to reveal the molecular mechanism of the interaction between Rf1 and the CMS-D2 cytoplasm.
genetics & heredity, biotechnology & applied microbiology