Trait Ontology Analysis Based on Association Mapping Studies Bridges the Gap Between Crop Genomics and Phenomics

Qingchun Pan,Junfeng Wei,Feng Guo,Suiyong Huang,Yong Gong,Hao Liu,Jianxiao Liu,Lin Li
DOI: https://doi.org/10.1186/s12864-019-5812-0
IF: 4.547
2019-01-01
BMC Genomics
Abstract:Background Trait ontology (TO) analysis is a powerful system for functional annotation and enrichment analysis of genes. However, given the complexity of the molecular mechanisms underlying phenomes, only a few hundred gene-to-TO relationships in plants have been elucidated to date, limiting the pace of research in this “big data” era. Results Here, we curated all the available trait associated sites (TAS) information from 79 association mapping studies of maize ( Zea mays L.) and rice ( Oryza sativa L.) lines with diverse genetic backgrounds and built a large-scale TAS-derived TO system for functional annotation of genes in various crops. Our TO system contains information for up to 18,042 genes (6345 in maize at the 25 k level and 11,697 in rice at the 50 k level), including gene-to-TO relationships, which covers over one fifth of the annotated gene sets for maize and rice. A comparison of Gene Ontology (GO) vs. TO analysis demonstrated that the TAS-derived TO system is an efficient alternative tool for gene functional annotation and enrichment analysis. We therefore combined information from the TO, GO, metabolic pathway, and co-expression network databases and constructed the TAS system, which is publicly available at http://tas.hzau.edu.cn . TAS provides a user-friendly interface for functional annotation of genes, enrichment analysis, genome-wide extraction of trait-associated genes, and crosschecking of different functional annotation databases. Conclusions TAS bridges the gap between genomic and phenomic information in crops. This easy-to-use tool will be useful for geneticists, biologists, and breeders in the agricultural community, as it facilitates the dissection of molecular mechanisms conferring agronomic traits in an easy, genome-wide manner.
What problem does this paper attempt to address?