A barley pan-transcriptome reveals layers of genotype-dependent transcriptional complexity
Robbie Waugh,Wenbin Guo,Miriam Schreiber,Vanda Marosi,Paolo Bagnaresi,Kenneth Chalmers,Brett Chapman,Viet Dang,Christoph Dockter,Anne Fiebig,Agostino Fricano,John Fuller,Allison Haaning,Georg Haberer,Axel Himmelbach,Murukarthick Jayakodi,Yong Jia,Morten Jørgensen,Nadia Kamal,Peter Langridge,Chengdao Li,Qiongxian Lu,Thomas Lux,Martin Mascher,Klaus Mayer,Nicola McCallum,Linda Milne,Gary Muehlbauer,Sudharsan Padmarasu,Pai Pedas,Klaus Pillen,Curtis Pozniak,Kazuhiro Sato,Thomas Schmutzer,Uwe Scholz,Danuta Schüler,Hana Simkova,Birgitte Skadhauge,Nils Stein,Penghao Wang,Ronja Wonneberger,Xiao-Qi Zhang,Guoping Zhang,Luigi Cattivelli,Manuel Spannagl,Micha Bayer,Craig Simpson,Runxuan Zhang
DOI: https://doi.org/10.21203/rs.3.rs-3787876/v1
2024-01-01
Abstract:Abstract A pan-transcriptome describes the transcriptional and post-transcriptional consequences of genome diversity from multiple individuals within a species, revealing an assortment of functions that drive biological outcomes. We developed a barley pan-transcriptome using twenty inbred genotypes representing domesticated* barley diversity by generating and analysing extensive short- and long-read RNA sequencing datasets from multiple tissues. To overcome single reference bias and facilitate downstream analyses we constructed genotype-specific reference transcript datasets (RTDs) and integrated these into a linear pan-genome framework to create a single pan-RTD. Categorising transcripts based upon presence or absence across genotypes defined them as core (expressed in all), shell (absent in one or more) or cloud (expressed in only one). Focusing on the core we observed significant transcript abundance variation among tissues and between genotypes. We show that drivers of transcript abundance variation in this category include RNA processing, gene copy number, large structural rearrangements and degree of conservation of promotor motifs. We reveal conserved patterns of co-expression module-tissue correlations encompassing distinct biological functions, as well as frequent functional diversification. We complement the pan-transcriptome by integrating extensive and diverse replicated public RNA-seq datasets from the reference cultivar (cv.) Morex into a comprehensive gene-expression atlas