Utilization of dynamic transcriptomics analysis for candidate gene mining of 100-seed weight in soybean

ZENG Jian,XU Xian-Chao,XU Yu-Fei,WANG Xiu-Cheng,YU Hai-Yan,FENG Bei-Bei,XING Guang-Nan
DOI: https://doi.org/10.3724/SP.J.1006.2021.04249
2021-01-01
ACTA AGRONOMICA SINICA
Abstract:100-seed weight of soybean is an important agronomic trait that affects yield, and it is of great significance to reveal its molecular basis and discover key candidate genes for soybean improvement breeding. In this study, weighted gene co-expression network analysis (WGCNA) was performed on the transcriptome data of 36 samples from 12 soybean varieties at three stages of seed development, and 20 gene co-expression modules were obtained. After correlating with 100-seed weight and four-seed shape traits, the green module was found to be most correlated with the phenotypes. Then 13 hub genes of green module were screened based on the Gene Significance (GS) and Eigengene Connectivity (kME) value. Gene differential expression of two groups of soybean varieties with extremely significant differences in 100-seed weight showed that the MAPK signaling pathway in the early and mid-term of seed development might regulate the 100-seed weight in soybean. According to SNPs/InDels calling and Gene Ontology (GO) annotation, Glyma.14G043900 and Glyma.15G217400 in the green module caused synonymous and non-synonymous coding mutations due to SNP mutations, and there were GO Terms and zinc finger domains related to gene expression regulation. These results suggested that they might regulate the 100-seed weight and seed shape of soybeans by regulating the hub gene and differentially expressed genes. Furthermore, Glyma.15G217400 was located in four reported QTLs of 100-seed weight, while Glyma.14G043900 was located in a reported seed protein content QTL and an oil content QTL. Compared with soybean public database, the increasing 100-seed weight alleles of the two genes were artificially selected and their frequency was gradually increased from wild accessions to landraces, resulting in the improved cultivars. These results provide new ideas for further discovering 100-seed weight candidate gene in soybean and its expression regulation mechanism.
What problem does this paper attempt to address?