From FastQ Data to High‐Confidence Variant Calls: The Genome Analysis Toolkit Best Practices Pipeline
Geraldine A Van der Auwera,Mauricio O Carneiro,Christopher Hartl,Ryan Poplin,Guillermo Del Angel,Ami Levy-Moonshine,Tadeusz Jordan,Khalid Shakir,David Roazen,Joel Thibault,Eric Banks,Kiran V Garimella,David Altshuler,Stacey Gabriel,Mark A DePristo,Geraldine A. Auwera,Mauricio O. Carneiro,Guillermo del Angel,Ami Levy‐Moonshine,Kiran V. Garimella,Mark A. DePristo
DOI: https://doi.org/10.1002/0471250953.bi1110s43
2013-10-01
Current Protocols in Bioinformatics
Abstract:This unit describes how to use BWA and the Genome Analysis Toolkit (GATK) to map genome sequencing data to a reference and produce high-quality variant calls that can be used in downstream analyses. The complete workflow includes the core NGS data processing steps that are necessary to make the raw data suitable for analysis by the GATK, as well as the key methods involved in variant discovery using the GATK.