RicePilaf: a post-GWAS/QTL dashboard to integrate pangenomic, coexpression, regulatory, epigenomic, ontology, pathway, and text-mining information to provide functional insights into rice QTLs and GWAS loci

Anish M S Shrestha, Mark Edward M Gonzales, Phoebe Clare L Ong, Pierre Larmande, Hyun-Sook Lee, Ji-Ung Jeung, Ajay Kohli, Dmytro Chebotarov, Ramil P Mauleon, Jae-Sung Lee, Kenneth L McNally
DOI: https://doi.org/10.1093/gigascience/giae013
IF: 7.658
2024-06-06
GigaScience
Abstract: As the number of genome-wide association study (GWAS) and quantitative trait locus (QTL) mappings in rice continues to grow, so does the already long list of genomic loci associated with important agronomic traits. Typically, loci implicated by GWAS/QTL analysis contain tens to hundreds to thousands of single-nucleotide polymorphisms (SNPs)/genes, not all of which are causal and many of which are in noncoding regions. Unraveling the biological mechanisms that tie the GWAS regions and QTLs to the trait of interest is challenging, especially since it requires collating functional genomics information about the loci from multiple, disparate data sources. We present RicePilaf, a web app for post-GWAS/QTL analysis that performs a slew of novel bioinformatics analyses to cross-reference GWAS results and QTL mappings with a host of publicly available rice databases. In particular, it integrates (i) pangenomic information from high-quality genome builds of multiple rice varieties, (ii) coexpression information from genome-scale coexpression networks, (iii) ontology and pathway information, (iv) regulatory information from rice transcription factor databases, (v) epigenomic information from multiple high-throughput epigenetic experiments, and (vi) text-mining information extracted from scientific abstracts linking genes and traits. We demonstrate the utility of RicePilaf by applying it to analyze GWAS peaks of preharvest sprouting and genes underlying yield-under-drought QTLs. RicePilaf enables rice scientists and breeders to shed functional light on their GWAS regions and QTLs, and it provides them with a means to prioritize SNPs/genes for further experiments. The source code, a Docker image, and a demo version of RicePilaf are publicly available at https://github.com/bioinfodlsu/rice-pilaf.
multidisciplinary sciences
What problem does this paper attempt to address?
This paper introduces RicePilaf, a web app for post-GWAS (genome-wide association study) and QTL (quantitative trait locus) analysis that helps rice researchers and breeders understand the genomic regions associated with important agronomic traits. As the number of GWAS and QTL mappings in rice grows, the loci they implicate contain large numbers of genes and SNPs (single-nucleotide polymorphisms), and RicePilaf addresses the challenge of interpreting these results and identifying which genes and SNPs are likely to be functionally relevant. RicePilaf integrates several data sources: pangenomic information from genome builds of multiple rice varieties, coexpression networks, ontology and pathway information, transcription factor binding information, epigenomic information, and gene-trait associations extracted from scientific abstracts. Through this integration, the tool provides biological insights into GWAS regions and QTLs and helps prioritize SNPs/genes for further experiments. Specifically, the functionalities of RicePilaf include:
1. Retrieving gene models, their associated descriptions, and orthology information for a given genomic interval.
2. Obtaining a more comprehensive view of the genes by performing coordinate transformation (lift-over) between different rice genomes.
3. Analyzing coexpression networks to identify gene modules that might collectively contribute to a trait.
4. Detecting variants located in transcription factor binding sites.
5. Supplementing gene lists and discovering relevant literature using text-mining data.
The paper demonstrates RicePilaf on GWAS/QTL results for two key traits (yield under drought and preharvest sprouting), showing that the tool can identify novel candidate genes and reveal potential biological mechanisms. RicePilaf is open source and can be run locally in a browser or deployed as a web service.
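To make the first functionality above concrete (retrieving gene models for a given genomic interval), the following is a minimal Python sketch of an interval-overlap lookup. It is not RicePilaf's implementation: the `Gene` record, the `parse_interval` and `genes_in_interval` helpers, the `Chr01:2000-5000` interval syntax, and the annotation entries are all hypothetical and invented for illustration, and coordinates are assumed to be 1-based and inclusive.

```python
from typing import NamedTuple


class Gene(NamedTuple):
    """A minimal gene record (hypothetical structure, for illustration only)."""
    gene_id: str
    chrom: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive


def parse_interval(text: str) -> tuple[str, int, int]:
    """Parse a string such as 'Chr01:2000-5000' into (chrom, start, end)."""
    chrom, coords = text.strip().split(":")
    start_str, end_str = coords.split("-")
    start, end = int(start_str), int(end_str)
    if start > end:
        raise ValueError(f"start exceeds end in interval: {text!r}")
    return chrom, start, end


def genes_in_interval(genes: list[Gene], interval: str) -> list[Gene]:
    """Return the genes that overlap the interval by at least one base."""
    chrom, start, end = parse_interval(interval)
    return [
        g for g in genes
        if g.chrom == chrom and g.start <= end and g.end >= start
    ]


# Hypothetical annotation records; these are not real rice gene models.
ANNOTATION = [
    Gene("gene_A", "Chr01", 1_000, 2_500),
    Gene("gene_B", "Chr01", 4_800, 6_200),
    Gene("gene_C", "Chr01", 9_000, 9_900),
    Gene("gene_D", "Chr02", 1_200, 3_000),
]

hits = genes_in_interval(ANNOTATION, "Chr01:2000-5000")
print([g.gene_id for g in hits])  # ['gene_A', 'gene_B']
```

The linear scan keeps the sketch short. For a genome-scale annotation, an interval tree or a coordinate-sorted index would be a more realistic choice.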