Author

Kati J. Buckingham

Other affiliations: Western Washington University

Bio: Kati J. Buckingham is an academic researcher at the University of Washington. The author has contributed to research on exome sequencing and medicine, has an h-index of 17, and has co-authored 38 publications that have received 4,751 citations. Previous affiliations include Western Washington University.

Papers

Journal Article

Exome sequencing identifies the cause of a Mendelian disorder

Sarah B H Ng¹, Kati J. Buckingham¹, Choli Lee¹, Abigail W. Bigham¹, Holly K. Tabor¹,², Karin M. Dent³, Chad D. Huff³, Paul Shannon⁴, Ethylin Wang Jabs⁵,⁶, Deborah A. Nickerson¹, Jay Shendure¹, Michael J. Bamshad¹,²

University of Washington¹, Boston Children's Hospital², University of Utah³, Institute for Systems Biology⁴, Johns Hopkins University⁵, Icahn School of Medicine at Mount Sinai⁶

01 Jan 2010-Nature Genetics

TL;DR: Exome sequencing of a small number of unrelated affected individuals is a powerful, efficient strategy for identifying the genes underlying rare mendelian disorders and will likely transform the genetic analysis of monogenic traits.

Abstract: We demonstrate the first successful application of exome sequencing to discover the gene for a rare mendelian disorder of unknown cause, Miller syndrome (MIM%263750). For four affected individuals in three independent kindreds, we captured and sequenced coding regions to a mean coverage of 40x and sufficient depth to call variants at approximately 97% of each targeted exome. Filtering against public SNP databases and eight HapMap exomes for genes with two previously unknown variants in each of the four individuals identified a single candidate gene, DHODH, which encodes a key enzyme in the pyrimidine de novo biosynthesis pathway. Sanger sequencing confirmed the presence of DHODH mutations in three additional families with Miller syndrome. Exome sequencing of a small number of unrelated affected individuals is a powerful, efficient strategy for identifying the genes underlying rare mendelian disorders and will likely transform the genetic analysis of monogenic traits.

1,980 citations

Journal Article

Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome

Sarah B. Ng¹, Abigail W. Bigham¹, Kati J. Buckingham¹, Mark C. Hannibal¹,², Margaret J. McMillin¹, Heidi I. S. Gildersleeve¹, Anita E. Beck¹,³, Holly K. Tabor¹,³, Gregory M. Cooper¹, Heather C Mefford¹, Choli Lee¹, Emily H. Turner¹, Joshua D. Smith¹, Mark J. Rieder¹, Koh-ichiro Yoshiura⁴, Naomichi Matsumoto⁵, Tohru Ohta⁶, Norio Niikawa⁶, Deborah A. Nickerson¹, Michael J. Bamshad¹,³, Jay Shendure¹

University of Washington¹, Seattle Children's², Boston Children's Hospital³, Nagasaki University⁴, Yokohama City University⁵, Health Sciences University of Hokkaido⁶

01 Sep 2010-Nature Genetics

TL;DR: The results strongly suggest that mutations in MLL2, which encodes a Trithorax-group histone methyltransferase, are a major cause of Kabuki syndrome.

Abstract: We demonstrate the successful application of exome sequencing to discover a gene for an autosomal dominant disorder, Kabuki syndrome (OMIM%147920). We subjected the exomes of ten unrelated probands to massively parallel sequencing. After filtering against existing SNP databases, there was no compelling candidate gene containing previously unknown variants in all affected individuals. Less stringent filtering criteria allowed for the presence of modest genetic heterogeneity or missing data but also identified multiple candidate genes. However, genotypic and phenotypic stratification highlighted MLL2, which encodes a Trithorax-group histone methyltransferase: seven probands had newly identified nonsense or frameshift mutations in this gene. Follow-up Sanger sequencing detected MLL2 mutations in two of the three remaining individuals with Kabuki syndrome (cases) and in 26 of 43 additional cases. In families where parental DNA was available, the mutation was confirmed to be de novo (n = 12) or transmitted (n = 2) in concordance with phenotype. Our results strongly suggest that mutations in MLL2 are a major cause of Kabuki syndrome.

1,261 citations

Journal Article

The Genetic Basis of Mendelian Phenotypes: Discoveries, Challenges, and Opportunities

Jessica X. Chong¹, Kati J. Buckingham¹, Shalini N. Jhangiani², C. D. Boehm³, Nara Sobreira³, Joshua D. Smith¹, Tanya M. Harrell¹, Margaret J. McMillin¹, Wojciech Wiszniewski², Tomasz Gambin², Zeynep Coban Akdemir², Kimberly F. Doheny³, Alan F. Scott⁴, Dimitri Avramopoulos⁴, Aravinda Chakravarti⁴, Julie Hoover-Fong³, Debra J. H. Mathews³, P. Dane Witmer³, Hua Ling³, Kurt N. Hetrick³, Lee Watkins³, Karynne E. Patterson¹, Frederic Reinier¹, Elizabeth Blue¹, Donna M. Muzny², Martin Kircher¹, Kaya Bilguvar⁵, Francesc López-Giráldez⁵, V. Reid Sutton², Holly K. Tabor¹,⁶, Suzanne M. Leal², Murat Gunel⁵, Shrikant Mane⁵, Richard A. Gibbs², Eric Boerwinkle²,⁷, Ada Hamosh³, Jay Shendure¹, James R. Lupski², Richard P. Lifton⁵,⁸, David Valle³, Deborah A. Nickerson¹, Michael J. Bamshad¹,⁹

University of Washington¹, Baylor College of Medicine², Johns Hopkins University³, Johns Hopkins University School of Medicine⁴, Yale University⁵, Boston Children's Hospital⁶, University of Texas Health Science Center at Houston⁷, Howard Hughes Medical Institute⁸, Seattle Children's⁹

06 Aug 2015-American Journal of Human Genetics

TL;DR: This collaborative effort has identified 956 genes, including 375 not previously associated with human health, that underlie a Mendelian phenotype, providing insight into study design and analytical strategies, identifying novel mechanisms of disease, and revealing the extensive clinical variability of Mendelian phenotypes.

Abstract: Discovering the genetic basis of a Mendelian phenotype establishes a causal link between genotype and phenotype, making possible carrier and population screening and direct diagnosis. Such discoveries also contribute to our knowledge of gene function, gene regulation, development, and biological mechanisms that can be used for developing new therapeutics. As of February 2015, 2,937 genes underlying 4,163 Mendelian phenotypes have been discovered, but the genes underlying ∼50% (i.e., 3,152) of all known Mendelian phenotypes are still unknown, and many more Mendelian conditions have yet to be recognized. This is a formidable gap in biomedical knowledge. Accordingly, in December 2011, the NIH established the Centers for Mendelian Genomics (CMGs) to provide the collaborative framework and infrastructure necessary for undertaking large-scale whole-exome sequencing and discovery of the genetic variants responsible for Mendelian phenotypes. In partnership with 529 investigators from 261 institutions in 36 countries, the CMGs assessed 18,863 samples from 8,838 families representing 579 known and 470 novel Mendelian phenotypes as of January 2015. This collaborative effort has identified 956 genes, including 375 not previously associated with human health, that underlie a Mendelian phenotype. These results provide insight into study design and analytical strategies, identify novel mechanisms of disease, and reveal the extensive clinical variability of Mendelian phenotypes. Discovering the gene underlying every Mendelian phenotype will require tackling challenges such as worldwide ascertainment and phenotypic characterization of families affected by Mendelian conditions, improvement in sequencing and analytical techniques, and pervasive sharing of phenotypic and genomic data among researchers, clinicians, and families.

579 citations

Journal Article

Spectrum of MLL2 (ALR) mutations in 110 cases of Kabuki syndrome

Mark C. Hannibal¹, Kati J. Buckingham¹, Sarah B. Ng¹, Jeffrey E. Ming², Anita E. Beck¹,³, Margaret J. McMillin³, Heidi I. S. Gildersleeve¹, Abigail W. Bigham¹, Holly K. Tabor¹,³, Heather C Mefford¹,³, Joseph Cook¹, Koh-ichiro Yoshiura⁴, Tadashi Matsumoto⁴, Naomichi Matsumoto⁵, Noriko Miyake⁵, Hidefumi Tonoki, Kenji Naritomi⁶, Tadashi Kaname⁶, Toshiro Nagai⁷, Hirofumi Ohashi, Kenji Kurosawa, Jia Woei Hou⁸, Tohru Ohta⁹, Deshung Liang¹⁰, Akira Sudo, Colleen A. Morris¹¹, Siddharth Banka¹², Graeme C.M. Black¹², Jill Clayton-Smith¹², Deborah A. Nickerson¹, Elaine H. Zackai², Tamim H. Shaikh¹³, Dian Donnai¹², Norio Niikawa⁹, Jay Shendure¹, Michael J. Bamshad¹,³

University of Washington¹, University of Pennsylvania², Boston Children's Hospital³, Nagasaki University⁴, Yokohama City University⁵, University of the Ryukyus⁶, Dokkyo Medical University⁷, Chang Gung University⁸, Health Sciences University of Hokkaido⁹, Central South University¹⁰, University of Nevada, Reno¹¹, University of Manchester¹², University of Colorado Denver¹³

01 Jul 2011-American Journal of Medical Genetics Part A

TL;DR: Screening of 110 families with Kabuki syndrome found MLL2 mutations in 81 (74%) of them; MLL2 encodes a Trithorax-group histone methyltransferase, a protein important in the epigenetic control of active chromatin states.

Abstract: Kabuki syndrome is a rare, multiple malformation disorder characterized by a distinctive facial appearance, cardiac anomalies, skeletal abnormalities, and mild to moderate intellectual disability. Simplex cases make up the vast majority of the reported cases with Kabuki syndrome, but parent-to-child transmission in more than a half-dozen instances indicates that it is an autosomal dominant disorder. We recently reported that Kabuki syndrome is caused by mutations in MLL2, a gene that encodes a Trithorax-group histone methyltransferase, a protein important in the epigenetic control of active chromatin states. Here, we report on the screening of 110 families with Kabuki syndrome. MLL2 mutations were found in 81/110 (74%) of families. In simplex cases for which DNA was available from both parents, 25 mutations were confirmed to be de novo, while a transmitted MLL2 mutation was found in two of three familial cases. The majority of variants found to cause Kabuki syndrome were novel nonsense or frameshift mutations that are predicted to result in haploinsufficiency. The clinical characteristics of MLL2 mutation-positive cases did not differ significantly from MLL2 mutation-negative cases with the exception that renal anomalies were more common in MLL2 mutation-positive cases. These results are important for understanding the phenotypic consequences of MLL2 mutations for individuals and their families as well as for providing a basis for the identification of additional genes for Kabuki syndrome.

170 citations

Journal Article

Mutations in PIEZO2 Cause Gordon Syndrome, Marden-Walker Syndrome, and Distal Arthrogryposis Type 5

Margaret J. McMillin¹, Anita E. Beck¹,², Jessica X. Chong¹, Kathryn M. Shively¹, Kati J. Buckingham¹, Heidi I. S. Gildersleeve¹, Mariana Aracena³, Arthur S. Aylsworth⁴, Pierre Bitoun, John C. Carey⁵, Carol L. Clericuzio⁶, Yanick J. Crow⁷, Cynthia J. Curry⁸, Koenraad Devriendt⁹, David B. Everman, Alan Fryer², Kate Gibson¹⁰, Maria Luisa Giovannucci Uzielli¹¹, John M. Graham¹², Judith G. Hall¹³, Jacqueline T. Hecht¹⁴, Randall A. Heidenreich⁶, Jane A. Hurst¹⁵, Sarosh R. Irani¹⁶, Ingrid P.C. Krapels, Jules G. Leroy¹⁷, David Mowat²,¹⁸, Gordon T. Plant¹⁵, Stephen P. Robertson¹⁹, Elizabeth K. Schorry²⁰, Richard H Scott¹⁵, Laurie H. Seaver²¹, Elliott H. Sherr⁸, Miranda Splitt, Helen Stewart¹⁶, Constance T. R. M. Stumpel, Sehime Gulsun Temel²²,²³, David D. Weaver²⁴, Margo Whiteford²⁵, Marc S. Williams²⁶, Holly K. Tabor², Joshua D. Smith¹, Jay Shendure¹, Deborah A. Nickerson¹, Michael J. Bamshad¹,²

University of Washington¹, Boston Children's Hospital², Pontifical Catholic University of Chile³, University of North Carolina at Chapel Hill⁴, University of Utah⁵, University of New Mexico⁶, University of Manchester⁷, University of California, San Francisco⁸, Katholieke Universiteit Leuven⁹, Christchurch Hospital¹⁰, University of Florence¹¹, Cedars-Sinai Medical Center¹², University of British Columbia¹³, University of Texas Health Science Center at Houston¹⁴, University College London¹⁵, University of Oxford¹⁶, Ghent University¹⁷, University of New South Wales¹⁸, University of Otago¹⁹, Cincinnati Children's Hospital Medical Center²⁰, University of Hawaii at Manoa²¹, Uludağ University²², Near East University²³, Indiana University²⁴, Southern General Hospital²⁵, Geisinger Medical Center²⁶

01 May 2014-American Journal of Human Genetics

TL;DR: Findings indicate that GS, DA5, and MWS, although traditionally considered separate disorders, are etiologically related and perhaps represent variable expressivity of the same condition.

Abstract: Gordon syndrome (GS), or distal arthrogryposis type 3, is a rare, autosomal-dominant disorder characterized by cleft palate and congenital contractures of the hands and feet. Exome sequencing of five GS-affected families identified mutations in piezo-type mechanosensitive ion channel component 2 (PIEZO2) in each family. Sanger sequencing revealed PIEZO2 mutations in five of seven additional families studied (for a total of 10/12 [83%] individuals), and nine families had an identical c.8057G>A (p.Arg2686His) mutation. The phenotype of GS overlaps with distal arthrogryposis type 5 (DA5) and Marden-Walker syndrome (MWS). Using molecular inversion probes for targeted sequencing to screen PIEZO2, we found mutations in 24/29 (82%) DA5-affected families and one of two MWS-affected families. The presence of cleft palate was significantly associated with c.8057G>A (Fisher's exact test, adjusted p value < 0.0001). Collectively, although GS, DA5, and MWS have traditionally been considered separate disorders, our findings indicate that they are etiologically related and perhaps represent variable expressivity of the same condition.

160 citations

Cited by

Journal Article

ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data

Kai Wang¹, Mingyao Li¹, Hakon Hakonarson¹

Children's Hospital of Philadelphia¹

01 Sep 2010-Nucleic Acids Research

TL;DR: ANNOVAR annotates single nucleotide variants and insertions/deletions, for example by examining their functional consequence on genes, inferring cytogenetic bands, reporting functional importance scores, finding variants in conserved regions, or identifying variants reported in the 1000 Genomes Project and dbSNP.

Abstract: High-throughput sequencing platforms are generating massive amounts of genetic variation data for diverse genomes, but it remains a challenge to pinpoint a small subset of functionally important variants. To fill these unmet needs, we developed the ANNOVAR tool to annotate single nucleotide variants (SNVs) and insertions/deletions, such as examining their functional consequence on genes, inferring cytogenetic bands, reporting functional importance scores, finding variants in conserved regions, or identifying variants reported in the 1000 Genomes Project and dbSNP. ANNOVAR can utilize annotation databases from the UCSC Genome Browser or any annotation data set conforming to Generic Feature Format version 3 (GFF3). We also illustrate a 'variants reduction' protocol on 4.7 million SNVs and indels from a human genome, including two causal mutations for Miller syndrome, a rare recessive disease. Through a stepwise procedure, we excluded variants that are unlikely to be causal, and identified 20 candidate genes including the causal gene. Using a desktop computer, ANNOVAR requires ∼4 min to perform gene-based annotation and ∼15 min to perform variants reduction on 4.7 million variants, making it practical to handle hundreds of human genomes in a day. ANNOVAR is freely available at http://www.openbioinformatics.org/annovar/.

10,461 citations

Journal Article

A framework for variation discovery and genotyping using next-generation DNA sequencing data

Mark A. DePristo¹, Eric Banks¹, Ryan Poplin¹, Kiran V. Garimella¹, Jared Maguire¹, Christopher Hartl¹, Anthony A. Philippakis¹,²,³, Guillermo del Angel¹, Manuel A. Rivas¹,², Matt Hanna¹, Aaron McKenna¹, Timothy Fennell¹, Andrew Kernytsky¹, Andrey Sivachenko¹, Kristian Cibulskis¹, Stacey Gabriel¹, David Altshuler¹,², Mark J. Daly¹,²

Broad Institute¹, Harvard University², Brigham and Women's Hospital³

01 May 2011-Nature Genetics

TL;DR: A unified analytic framework to discover and genotype variation among multiple samples simultaneously that achieves sensitive and specific results across five sequencing technologies and three distinct, canonical experimental designs is presented.

Abstract: Recent advances in sequencing technology make it possible to comprehensively catalogue genetic variation in population samples, creating a foundation for understanding human disease, ancestry and evolution. The amounts of raw data produced are prodigious and many computational steps are required to translate this output into high-quality variant calls. We present a unified analytic framework to discover and genotype variation among multiple samples simultaneously that achieves sensitive and specific results across five sequencing technologies and three distinct, canonical experimental designs. Our process includes (1) initial read mapping; (2) local realignment around indels; (3) base quality score recalibration; (4) SNP discovery and genotyping to find all potential variants; and (5) machine learning to separate true segregating variation from machine artifacts common to next-generation sequencing technologies. We discuss the application of these tools, instantiated in the Genome Analysis Toolkit (GATK), to deep whole-genome, whole-exome capture, and multi-sample low-pass (~4×) 1000 Genomes Project datasets.

10,056 citations

Journal Article

Analysis of protein-coding genetic variation in 60,706 humans

Monkol Lek, Konrad J. Karczewski¹,², Eric Vallabh Minikel¹,², Kaitlin E. Samocha, Eric Banks², Timothy Fennell², Anne H. O’Donnell-Luria¹,²,³, James S. Ware, Andrew J. Hill¹,²,⁴, Beryl B. Cummings¹,², Taru Tukiainen¹,², Daniel P. Birnbaum², Jack A. Kosmicki, Laramie E. Duncan¹,², Karol Estrada¹,², Fengmei Zhao¹,², James Zou², Emma Pierce-Hoffman¹,², Joanne Berghout⁵, David Neil Cooper⁶, Nicole A. Deflaux⁷, Mark A. DePristo², Ron Do, Jason Flannick¹,², Menachem Fromer, Laura D. Gauthier², Jackie Goldstein¹,², Namrata Gupta², Daniel P. Howrigan¹,², Adam Kiezun², Mitja I. Kurki¹,², Ami Levy Moonshine², Pradeep Natarajan, Lorena Orozco, Gina M. Peloso¹,², Ryan Poplin², Manuel A. Rivas², Valentin Ruano-Rubio², Samuel A. Rose², Douglas M. Ruderfer⁸, Khalid Shakir², Peter D. Stenson⁶, Christine Stevens², Brett Thomas¹,², Grace Tiao², María Teresa Tusié-Luna, Ben Weisburd², Hong-Hee Won⁹, Dongmei Yu, David Altshuler²,¹⁰, Diego Ardissino, Michael Boehnke¹¹, John Danesh¹², Stacey Donnelly², Roberto Elosua, Jose C. Florez¹,², Stacey Gabriel², Gad Getz¹,², Stephen J. Glatt¹³, Christina M. Hultman¹⁴, Sekar Kathiresan, Markku Laakso¹⁵, Steven A. McCarroll¹,², Mark I. McCarthy¹⁶,¹⁷, Dermot P.B. McGovern¹⁸, Ruth McPherson¹⁹, Benjamin M. Neale¹,², Aarno Palotie, Shaun Purcell⁸, Danish Saleheen²⁰, Jeremiah M. Scharf, Pamela Sklar, Patrick F. Sullivan¹⁴,²¹, Jaakko Tuomilehto²², Ming T. Tsuang²³, Hugh Watkins¹⁶,¹⁷, James G. Wilson²⁴, Mark J. Daly¹,², Daniel G. MacArthur¹,²

Harvard University¹, Broad Institute², Boston Children's Hospital³, University of Washington⁴, University of Arizona⁵, Cardiff University⁶, Google⁷, Icahn School of Medicine at Mount Sinai⁸, Samsung Medical Center⁹, Vertex Pharmaceuticals¹⁰, University of Michigan¹¹, University of Cambridge¹², State University of New York Upstate Medical University¹³, Karolinska Institutet¹⁴, University of Eastern Finland¹⁵, University of Oxford¹⁶, Wellcome Trust Centre for Human Genetics¹⁷, Cedars-Sinai Medical Center¹⁸, University of Ottawa¹⁹, University of Pennsylvania²⁰, University of North Carolina at Chapel Hill²¹, University of Helsinki²², University of California, San Diego²³, University of Mississippi Medical Center²⁴

18 Aug 2016-Nature

TL;DR: The aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC) provides direct evidence for the presence of widespread mutational recurrence.

Abstract: Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.

8,758 citations

Journal Article

A general framework for estimating the relative pathogenicity of human genetic variants

Martin Kircher¹, Daniela Witten¹, Preti Jain, Brian J. O'Roak¹,², Gregory M. Cooper, Jay Shendure¹

University of Washington¹, Oregon Health & Science University²

01 Mar 2014-Nature Genetics

TL;DR: The ability of CADD to prioritize functional, deleterious and pathogenic variants across many functional categories, effect sizes and genetic architectures is unmatched by any current single-annotation method.

Abstract: Our capacity to sequence human genomes has exceeded our ability to interpret genetic variation. Current genomic annotations tend to exploit a single information type (e.g. conservation) and/or are restricted in scope (e.g. to missense changes). Here, we describe Combined Annotation Dependent Depletion (CADD), a framework that objectively integrates many diverse annotations into a single, quantitative score. We implement CADD as a support vector machine trained to differentiate 14.7 million high-frequency human derived alleles from 14.7 million simulated variants. We pre-compute “C-scores” for all 8.6 billion possible human single nucleotide variants and enable scoring of short insertions/deletions. C-scores correlate with allelic diversity, annotations of functionality, pathogenicity, disease severity, experimentally measured regulatory effects, and complex trait associations, and highly rank known pathogenic variants within individual genomes. The ability of CADD to prioritize functional, deleterious, and pathogenic variants across many functional categories, effect sizes and genetic architectures is unmatched by any current annotation.

4,956 citations

Journal Article

The mutational constraint spectrum quantified from variation in 141,456 humans

Konrad J. Karczewski¹, Laurent C. Francioli¹, Grace Tiao¹, Beryl B. Cummings¹, Jessica Alföldi¹, Qingbo Wang¹, Ryan L. Collins¹, Kristen M. Laricchia¹, Andrea Ganna¹, Daniel P. Birnbaum¹, Laura D. Gauthier¹, Harrison Brand¹, Matthew Solomonson¹, Nicholas A. Watts¹, Daniel R. Rhodes², Moriel Singer-Berk¹, Eleina M. England¹, Eleanor G. Seaby¹, Jack A. Kosmicki¹, Raymond K. Walters¹, Katherine Tashman¹, Yossi Farjoun¹, Eric Banks¹, Timothy Poterba¹, Arcturus Wang¹, Cotton Seed¹, Nicola Whiffin¹, Jessica X. Chong³, Kaitlin E. Samocha⁴, Emma Pierce-Hoffman¹, Zachary Zappala¹, Anne H. O’Donnell-Luria¹, Eric Vallabh Minikel¹, Ben Weisburd¹, Monkol Lek⁵, James S. Ware¹, Christopher Vittal⁶, Irina M. Armean¹, Louis Bergelson¹, Kristian Cibulskis¹, Kristen M. Connolly¹, Miguel Covarrubias¹, Stacey Donnelly¹, Steven Ferriera¹, Stacey Gabriel¹, Jeff Gentry¹, Namrata Gupta¹, Thibault Jeandet¹, Diane Kaplan¹, Christopher Llanwarne¹, Ruchi Munshi¹, Sam Novod¹, Nikelle Petrillo¹, David Roazen¹, Valentin Ruano-Rubio¹, Andrea Saltzman¹, Molly Schleicher¹, Jose Soto¹, Kathleen Tibbetts¹, Charlotte Tolonen¹, Gordon Wade¹, Michael E. Talkowski¹, Benjamin M. Neale¹, Mark J. Daly¹, Daniel G. MacArthur¹

Broad Institute¹, Queen Mary University of London², University of Washington³, Wellcome Trust Sanger Institute⁴, Yale University⁵, Harvard University⁶

27 May 2020-Nature

TL;DR: A catalogue of predicted loss-of-function variants in 125,748 whole-exome and 15,708 whole-genome sequencing datasets from the Genome Aggregation Database (gnomAD) reveals the spectrum of mutational constraints that affect these human protein-coding genes.

Abstract: Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences of gene disruption: genes that are crucial for the function of an organism will be depleted of such variants in natural populations, whereas non-essential genes will tolerate their accumulation. However, predicted loss-of-function variants are enriched for annotation errors, and tend to be found at extremely low frequencies, so their analysis requires careful variant annotation and very large sample sizes. Here we describe the aggregation of 125,748 exomes and 15,708 genomes from human sequencing studies into the Genome Aggregation Database (gnomAD). We identify 443,769 high-confidence predicted loss-of-function variants in this cohort after filtering for artefacts caused by sequencing and annotation errors. Using an improved model of human mutation rates, we classify human protein-coding genes along a spectrum that represents tolerance to inactivation, validate this classification using data from model organisms and engineered human cells, and show that it can be used to improve the power of gene discovery for both common and rare diseases.

4,913 citations
