From FastQ Data to High‐Confidence Variant Calls: The Genome Analysis Toolkit Best Practices Pipeline

doi:10.1002/0471250953.BI1110S43

Journal Article•DOI•

From FastQ Data to High‐Confidence Variant Calls: The Genome Analysis Toolkit Best Practices Pipeline

Géraldine A. Van der Auwera¹, Mauricio O. Carneiro¹, Christopher Hartl¹, Ryan Poplin¹, Guillermo del Angel¹, Ami Levy-Moonshine¹, Tadeusz Jordan¹, Khalid Shakir¹, David Roazen¹, Joel Thibault¹, Eric Banks¹, Kiran V. Garimella², David Altshuler¹, Stacey Gabriel¹, Mark A. DePristo¹ - Show less +11 more•Institutions (2)

Broad Institute¹, Wellcome Trust Centre for Human Genetics²

15 Oct 2013-Current protocols in human genetics (Wiley-Blackwell)-Vol. 43, Iss: 1

TL;DR: This unit describes how to use BWA and the Genome Analysis Toolkit to map genome sequencing data to a reference and produce high‐quality variant calls that can be used in downstream analyses.

read less

Abstract: This unit describes how to use BWA and the Genome Analysis Toolkit (GATK) to map genome sequencing data to a reference and produce high-quality variant calls that can be used in downstream analyses. The complete workflow includes the core NGS data processing steps that are necessary to make the raw data suitable for analysis by the GATK, as well as the key methods involved in variant discovery using the GATK.

...read moreread less