Genomic prediction of regional-scale performance in switchgrass ( Panicum virgatum ) by accounting for genotype-by-environment variation and yield surrogate traits
Neal W Tilhou,Jason Bonnette,Arvid R Boe,Philip A Fay,Felix B Fritschi,Robert B Mitchell,Francis M Rouquette,Yanqi Wu,Julie D Jastrow,Michael Ricketts,Shelley D Maher,Thomas E Juenger,David B Lowry
DOI: https://doi.org/10.1093/g3journal/jkae159
2024-07-19
Abstract:Abstract Switchgrass is a potential crop for bioenergy or carbon capture schemes, but further yield improvements through selective breeding are needed to encourage commercialization. To identify promising switchgrass germplasm for future breeding efforts, we conducted multi-site and multi-trait genomic prediction with a diversity panel of 630 genotypes from 4 switchgrass subpopulations (Gulf, Midwest, Coastal, and Texas), which were measured for spaced plant biomass yield across 10 sites. Our study focused on the use of genomic prediction to share information among traits and environments. Specifically, we evaluated the predictive ability of cross-validation (CV) schemes using only genetic data and the training set, (cross validation 1: CV1), a subset of the sites (cross validation 2: CV2), and/or with two yield surrogates (flowering time and fall plant height). We found that genotype-by-environment interactions were largely due to the north-south distribution of sites. The genetic correlations between yield surrogates and biomass yield were generally positive (mean height r=0.85; mean flowering time r=0.45) and did not vary due to subpopulation or growing region (North, Middle, South). Genomic prediction models had cross-validation predictive abilities of -0.02 for individuals using only genetic data (CV1) but 0.55, 0.69, 0.76, 0.81, and 0.84 for individuals with biomass performance data from one, two, three, four and five sites included in the training data (CV2), respectively. To simulate a resource-limited breeding program, we determined the predictive ability of models provided with: one site observation of flowering time (0.39), one site observation of flowering time and fall height (0.51), one site observation of fall height (0.52), one site observation of biomass (0.55), and five site observations of biomass yield (0.84). The ability to share information at a regional scale is very encouraging but further research is required to accurately translate spaced plant biomass to commercial-scale sward biomass performance.
genetics & heredity