Candidate gene identification of existing or induced mutations with pipelines applicable to large genomes

Jiaqiang Dong,Min Tu,Yaping Feng,Anna Zdepski,Fei Ge,Dibyendu Kumar,Janet P. Slovin,Joachim Messing
DOI: https://doi.org/10.1111/tpj.14153
IF: 7.2
2018-12-14
The Plant Journal
Abstract:Bulked segregant analysis (BSA) is used to identify existing or induced variants that are linked to phenotypes. Although it is widely used in Arabidopsis and rice, it remains challenging for crops with large genomes, such as maize. Moreover, analysis of huge data sets can present a bottleneck linking phenotypes to their molecular basis, especially for geneticists without programming experience. Here, we identified two genes of maize defective kernel mutants with newly developed analysis pipelines that require no programing skills and should be applicable to any large genome. In the 1970s, Neuffer and Sheridan generated a chemically induced defective kernel (dek) mutant collection with the potential to uncover critical genes for seed development. To locate such mutations, the dek phenotypes were introgressed into two inbred lines to take advantage of maize haplotype variations and their sequenced genomes. We generated two pipelines that take fastq files derived from next-generation (nextGen) paired-end DNA and cDNA sequencing as input, call on several well established and freely available genomic analysis tools to call SNPs and INDELs, and generate lists of the most likely causal mutations together with variant index plots to locate the mutation to a specific sequence position on a chromosome. The pipelines were validated with a known strawberry mutation before cloning the dek mutants, thereby enabling phenotypic analysis of large genomes by next-generation sequencing.
What problem does this paper attempt to address?