Author

Niklas Krumm

Bio: Niklas Krumm is an academic researcher at the University of Washington whose research topics include the exome and exome sequencing. The author has an h-index of 17 and has co-authored 25 publications that have received 7,461 citations.
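
For readers unfamiliar with the metric quoted above: an h-index of h means the author has h papers with at least h citations each. The sketch below is a minimal illustration of that definition, not the site's own calculation; the function name `h_index` is an assumption introduced here. The sample input uses only the five citation counts visible on this page, so it cannot reproduce the full value of 17, which depends on all 25 publications.

```python
def h_index(citations):
    """Return the largest h such that h papers have at least h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank  # the top `rank` papers all have at least `rank` citations
        else:
            break
    return h


# Citation counts of the five papers listed on this page (not the full record).
listed = [2124, 2062, 1178, 567, 538]
print(h_index(listed))
```

With only five papers supplied, the result is capped at five regardless of how highly each paper is cited.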

Topics: Exome, Exome sequencing, Copy-number variation, Candidate gene, Gene ...

Papers

Journal Article•DOI•

The contribution of de novo coding mutations to autism spectrum disorder

Ivan Iossifov¹, Brian J. O'Roak², Stephen Sanders³,⁴, Michael Ronemus¹, Niklas Krumm², Dan Levy¹, Holly A.F. Stessman², Kali Witherspoon², Laura Vives², Karynne E. Patterson², Joshua D. Smith², Bryan W. Paeper², Deborah A. Nickerson², Jeanselle Dea⁴, Shan Dong³,⁵, Luis E. Gonzalez³, Jeffrey D. Mandell⁴, Shrikant Mane³, Michael T. Murtha³, Catherine A.W. Sullivan³, Michael F. Walker⁴, Zainulabedin Waqar³, Liping Wei⁵, A. Jeremy Willsey³,⁴, Boris Yamrom¹, Yoon-ha Lee¹, Ewa A. Grabowska¹, Ertugrul Dalkic¹,⁶, Zihua Wang¹, Steven Marks¹, Peter Andrews¹, Anthony Leotta¹, Jude Kendall¹, Inessa Hakker¹, Julie Rosenbaum¹, Beicong Ma¹, Linda Rodgers¹, Jennifer Troge¹, Giuseppe Narzisi¹, Seungtai Yoon¹, Michael C. Schatz¹, Kenny Ye⁷, W. Richard McCombie¹, Jay Shendure², Evan E. Eichler²,⁸, Matthew W. State³,⁴, Michael Wigler¹•Institutions (8)

Cold Spring Harbor Laboratory¹, University of Washington², Yale University³, University of California, San Francisco⁴, Peking University⁵, Zonguldak Karaelmas University⁶, Yeshiva University⁷, Howard Hughes Medical Institute⁸

13 Nov 2014-Nature

TL;DR: It is estimated that LGD mutation in about 400 genes can contribute to the joint class of affected females and males of lower IQ, with an overlapping and similar number of genes vulnerable to contributory missense mutation.

Abstract: Whole exome sequencing has proven to be a powerful tool for understanding the genetic architecture of human disease. Here we apply it to more than 2,500 simplex families, each having a child with an autistic spectrum disorder. By comparing affected to unaffected siblings, we show that 13% of de novo missense mutations and 43% of de novo likely gene-disrupting (LGD) mutations contribute to 12% and 9% of diagnoses, respectively. Including copy number variants, coding de novo mutations contribute to about 30% of all simplex and 45% of female diagnoses. Almost all LGD mutations occur opposite wild-type alleles. LGD targets in affected females significantly overlap the targets in males of lower intelligence quotient (IQ), but neither overlaps significantly with targets in males of higher IQ. We estimate that LGD mutation in about 400 genes can contribute to the joint class of affected females and males of lower IQ, with an overlapping and similar number of genes vulnerable to contributory missense mutation. LGD targets in the joint class overlap with published targets for intellectual disability and schizophrenia, and are enriched for chromatin modifiers, FMRP-associated genes and embryonically expressed genes. Most of the significance for the latter comes from affected females.

2,124 citations

Journal Article•DOI•

Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations

Brian J. O'Roak¹, Laura Vives¹, Santhosh Girirajan¹, Emre Karakoc¹, Niklas Krumm¹, Bradley P. Coe¹, Roie Levy¹, Arthur Ko¹, Choli Lee¹, Joshua D. Smith¹, Emily H. Turner¹, Ian B. Stanaway¹, Benjamin Vernot¹, Maika Malig¹, Carl Baker¹, Beau Reilly¹, Joshua M. Akey¹, Elhanan Borenstein¹,², Mark J. Rieder¹, Deborah A. Nickerson¹, Raphael Bernier¹, Jay Shendure¹, Evan E. Eichler¹,³•Institutions (3)

University of Washington¹, Santa Fe Institute², Howard Hughes Medical Institute³

10 May 2012-Nature

TL;DR: It is shown that de novo point mutations are overwhelmingly paternal in origin (4:1 bias) and positively correlated with paternal age, consistent with the modest increased risk for children of older fathers to develop ASD.

Abstract: It is well established that autism spectrum disorders (ASD) have a strong genetic component; however, for at least 70% of cases, the underlying genetic cause is unknown. Under the hypothesis that de novo mutations underlie a substantial fraction of the risk for developing ASD in families with no previous history of ASD or related phenotypes--so-called sporadic or simplex families--we sequenced all coding regions of the genome (the exome) for parent-child trios exhibiting sporadic ASD, including 189 new trios and 20 that were previously reported. Additionally, we also sequenced the exomes of 50 unaffected siblings corresponding to these new (n = 31) and previously reported trios (n = 19), for a total of 677 individual exomes from 209 families. Here we show that de novo point mutations are overwhelmingly paternal in origin (4:1 bias) and positively correlated with paternal age, consistent with the modest increased risk for children of older fathers to develop ASD. Moreover, 39% (49 of 126) of the most severe or disruptive de novo mutations map to a highly interconnected β-catenin/chromatin remodelling protein network ranked significantly for autism candidate genes. In proband exomes, recurrent protein-altering mutations were observed in two genes: CHD8 and NTNG1. Mutation screening of six candidate genes in 1,703 ASD probands identified additional de novo, protein-altering mutations in GRIN2B, LAMC3 and SCN1A. Combined with copy number variant (CNV) data, these results indicate extreme locus heterogeneity but also provide a target for future discovery, diagnostics and therapeutics.

2,062 citations

Journal Article•DOI•

Multiplex targeted sequencing identifies recurrently mutated genes in autism spectrum disorders.

Brian J. O'Roak¹, Laura Vives¹, Wenqing Fu¹, Jarrett D. Egertson¹, Ian B. Stanaway¹, Ian G. Phelps¹,², Gemma L. Carvill¹,², Akash Kumar¹, Choli Lee¹, Katy Ankenman¹, Jeff Munson¹, Joseph B. Hiatt¹, Emily H. Turner¹, Roie Levy¹, Diana R. O’Day¹, Niklas Krumm¹, Bradley P. Coe¹, Beth Martin¹, Elhanan Borenstein¹,³, Deborah A. Nickerson¹, Heather C Mefford¹,², Dan Doherty¹,², Joshua M. Akey¹, Raphael Bernier¹, Evan E. Eichler¹,⁴, Jay Shendure¹•Institutions (4)

University of Washington¹, Seattle Children's², Santa Fe Institute³, Howard Hughes Medical Institute⁴

21 Dec 2012-Science

TL;DR: The modified molecular inversion probe method was applied to 44 candidate genes to identify de novo mutations in a large cohort of individuals with and without autism spectrum disorder, supporting the notion that multiple genes underlie autism-spectrum disorders.

Abstract: Exome sequencing studies of autism spectrum disorders (ASDs) have identified many de novo mutations but few recurrently disrupted genes. We therefore developed a modified molecular inversion probe method enabling ultra-low-cost candidate gene resequencing in very large cohorts. To demonstrate the power of this approach, we captured and sequenced 44 candidate genes in 2,446 ASD probands. We discovered 27 de novo events in 16 genes, 59% of which are predicted to truncate proteins or disrupt splicing. We estimate that recurrent disruptive mutations in six genes (CHD8, DYRK1A, GRIN2B, TBR1, PTEN, and TBL1XR1) may contribute to 1% of sporadic ASDs. Our data support associations between specific genes and reciprocal subphenotypes (CHD8-macrocephaly and DYRK1A-microcephaly) and replicate the importance of a β-catenin-chromatin-remodeling network to ASD etiology.

1,178 citations

Journal Article•DOI•

Copy number variation detection and genotyping from exome sequence data

Niklas Krumm¹, Peter H. Sudmant¹, Arthur Ko¹, Brian J. O'Roak², Maika Malig¹, Bradley P. Coe¹, Aaron R. Quinlan³, Deborah A. Nickerson¹, Evan E. Eichler⁴•Institutions (4)

University of Washington¹, National Institutes of Health², University of Virginia³, Howard Hughes Medical Institute⁴

14 May 2012-Genome Research

TL;DR: A novel method using singular value decomposition (SVD) normalization to discover rare genic copy number variants (CNVs) as well as genotype copy number polymorphic (CNP) loci with high sensitivity and specificity from exome sequencing data is developed.

Abstract: While exome sequencing is readily amenable to single-nucleotide variant discovery, the sparse and nonuniform nature of the exome capture reaction has hindered exome-based detection and characterization of genic copy number variation. We developed a novel method using singular value decomposition (SVD) normalization to discover rare genic copy number variants (CNVs) as well as genotype copy number polymorphic (CNP) loci with high sensitivity and specificity from exome sequencing data. We estimate the precision of our algorithm using 122 trios (366 exomes) and show that this method can be used to reliably predict (94% overall precision) both de novo and inherited rare CNVs involving three or more consecutive exons. We demonstrate that exome-based genotyping of CNPs strongly correlates with whole-genome data (median r² = 0.91), especially for loci with fewer than eight copies, and can estimate the absolute copy number of multi-allelic genes with high accuracy (78% call level). The resulting user-friendly computational pipeline, CoNIFER (copy number inference from exome reads), can reliably be used to discover disruptive genic CNVs missed by standard approaches and should have broad application in human genetic studies of disease.
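
The idea behind SVD normalization in the abstract above can be illustrated in a few lines: the strongest singular component of a samples-by-exons coverage matrix tends to capture systematic capture and batch effects, so subtracting it leaves a cleaner signal. The sketch below is only an illustration under stated assumptions and is not CoNIFER's implementation. It removes a single component, uses a pure-Python power iteration in place of a library SVD routine, and the name `remove_top_component` is hypothetical.

```python
import math


def remove_top_component(matrix, iterations=200):
    """Subtract the leading singular component from a samples-by-exons matrix.

    Power iteration on A^T A approximates the top right singular vector v;
    the rank-one term (A v) v^T is then removed from A.
    """
    rows, cols = len(matrix), len(matrix[0])
    v = [1.0 / math.sqrt(cols)] * cols  # uniform starting vector
    for _ in range(iterations):
        # w = A v
        w = [sum(matrix[i][j] * v[j] for j in range(cols)) for i in range(rows)]
        # v_new = A^T w
        v_new = [sum(matrix[i][j] * w[i] for i in range(rows)) for j in range(cols)]
        norm = math.sqrt(sum(x * x for x in v_new))
        if norm == 0.0:
            break  # start vector is orthogonal to the data; nothing is removed
        v = [x / norm for x in v_new]
    # Projection of every sample (row) onto v
    proj = [sum(matrix[i][j] * v[j] for j in range(cols)) for i in range(rows)]
    return [[matrix[i][j] - proj[i] * v[j] for j in range(cols)] for i in range(rows)]


# Toy input: a pure rank-one "batch effect" across 3 samples and 4 exons.
sample_bias = [1.0, 2.0, 3.0]
exon_bias = [2.0, 1.0, 4.0, 3.0]
coverage = [[s * e for e in exon_bias] for s in sample_bias]
cleaned = remove_top_component(coverage)
```

A practical pipeline would typically call a tested SVD routine, remove several components rather than one, and choose how many to remove by inspecting the singular values.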

567 citations

Journal Article•DOI•

Excess of rare, inherited truncating mutations in autism.

Niklas Krumm¹, Tychele N. Turner¹, Carl Baker¹, Laura Vives¹, Kiana Mohajeri¹, Kali Witherspoon¹, Archana Raja¹, Bradley P. Coe¹, Holly A.F. Stessman¹, Zong Xiao He², Suzanne M. Leal², Raphael Bernier¹, Evan E. Eichler¹•Institutions (2)

University of Washington¹, Baylor College of Medicine²

01 Jun 2015-Nature Genetics

TL;DR: This analysis identifies a second class of candidate genes (for example, RIMS1, CUL7 and LZTR1) where transmitted mutations may create a sensitized background but are unlikely to be completely penetrant, and private truncating SNVs and rare, inherited CNVs are statistically independent risk factors for autism.

Abstract: To assess the relative impact of inherited and de novo variants on autism risk, we generated a comprehensive set of exonic single-nucleotide variants (SNVs) and copy number variants (CNVs) from 2,377 families with autism. We find that private, inherited truncating SNVs in conserved genes are enriched in probands (odds ratio = 1.14, P = 0.0002) in comparison to unaffected siblings, an effect involving significant maternal transmission bias to sons. We also observe a bias for inherited CNVs, specifically for small (<100 kb), maternally inherited events (P = 0.01) that are enriched in CHD8 target genes (P = 7.4 × 10⁻³). Using a logistic regression model, we show that private truncating SNVs and rare, inherited CNVs are statistically independent risk factors for autism, with odds ratios of 1.11 (P = 0.0002) and 1.23 (P = 0.01), respectively. This analysis identifies a second class of candidate genes (for example, RIMS1, CUL7 and LZTR1) where transmitted mutations may create a sensitized background but are unlikely to be completely penetrant.
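
The odds ratios quoted in the abstract above compare how often a variant class appears in probands versus unaffected siblings. As a minimal sketch of the arithmetic only, the function below computes an odds ratio from a 2×2 table. The counts are hypothetical placeholders chosen for illustration, not data from the paper, and the paper's estimates come from its own statistical models rather than from this simple ratio.

```python
def odds_ratio(case_carriers, case_noncarriers, control_carriers, control_noncarriers):
    """Odds of carrying the variant among cases divided by the odds among controls."""
    case_odds = case_carriers / case_noncarriers
    control_odds = control_carriers / control_noncarriers
    return case_odds / control_odds


# Hypothetical counts, for illustration only (not taken from the study).
ratio = odds_ratio(
    case_carriers=120, case_noncarriers=880,
    control_carriers=100, control_noncarriers=900,
)
print(round(ratio, 3))
```

A value above 1 indicates that carriers are more common among cases than controls; whether the difference is meaningful depends on the sample size and the accompanying P value.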

538 citations

Cited by

Journal Article•DOI•

A general framework for estimating the relative pathogenicity of human genetic variants

Martin Kircher¹, Daniela Witten¹, Preti Jain, Brian J. O'Roak¹,², Gregory M. Cooper, Jay Shendure¹•Institutions (2)

University of Washington¹, Oregon Health & Science University²

01 Mar 2014-Nature Genetics

TL;DR: The ability of CADD to prioritize functional, deleterious and pathogenic variants across many functional categories, effect sizes and genetic architectures is unmatched by any current single-annotation method.

Abstract: Our capacity to sequence human genomes has exceeded our ability to interpret genetic variation. Current genomic annotations tend to exploit a single information type (e.g. conservation) and/or are restricted in scope (e.g. to missense changes). Here, we describe Combined Annotation Dependent Depletion (CADD), a framework that objectively integrates many diverse annotations into a single, quantitative score. We implement CADD as a support vector machine trained to differentiate 14.7 million high-frequency human derived alleles from 14.7 million simulated variants. We pre-compute “C-scores” for all 8.6 billion possible human single nucleotide variants and enable scoring of short insertions/deletions. C-scores correlate with allelic diversity, annotations of functionality, pathogenicity, disease severity, experimentally measured regulatory effects, and complex trait associations, and highly rank known pathogenic variants within individual genomes. The ability of CADD to prioritize functional, deleterious, and pathogenic variants across many functional categories, effect sizes and genetic architectures is unmatched by any current annotation.

4,956 citations

Journal Article•DOI•

Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype

Daehwan Kim¹, Joseph M. Paggi², Chanhee Park¹, Christopher Bennett¹, Steven L. Salzberg³•Institutions (3)

University of Texas Southwestern Medical Center¹, Stanford University², Johns Hopkins University³

01 Aug 2019-Nature Biotechnology

TL;DR: This work presents a method named HISAT2 (hierarchical indexing for spliced alignment of transcripts 2) that can align both DNA and RNA sequences using a graph Ferragina Manzini index, and uses it to represent and search an expanded model of the human reference genome.

Abstract: The human reference genome represents only a small number of individuals, which limits its usefulness for genotyping. We present a method named HISAT2 (hierarchical indexing for spliced alignment of transcripts 2) that can align both DNA and RNA sequences using a graph Ferragina Manzini index. We use HISAT2 to represent and search an expanded model of the human reference genome in which over 14.5 million genomic variants in combination with haplotypes are incorporated into the data structure used for searching and alignment. We benchmark HISAT2 using simulated and real datasets to demonstrate that our strategy of representing a population of genomes, together with a fast, memory-efficient search algorithm, provides more detailed and accurate variant analyses than other methods. We apply HISAT2 for HLA typing and DNA fingerprinting; both applications form part of the HISAT-genotype software that enables analysis of haplotype-resolved genes or genomic regions. HISAT-genotype outperforms other computational methods and matches or exceeds the performance of laboratory-based assays. A graph-based genome indexing scheme enables variant-aware alignment of sequences with very low memory requirements.

4,855 citations

Journal Article•DOI•

Coming of age: ten years of next-generation sequencing technologies

Sara Goodwin¹, John Douglas McPherson², W. Richard McCombie¹•Institutions (2)

Cold Spring Harbor Laboratory¹, University of California, Davis²

01 Jun 2016-Nature Reviews Genetics

TL;DR: These and other strategies are providing researchers and clinicians a variety of tools to probe genomes in greater depth, leading to an enhanced understanding of how genome sequence variants underlie phenotype and disease.

Abstract: Since the completion of the human genome project in 2003, extraordinary progress has been made in genome sequencing technologies, which has led to a decreased cost per megabase and an increase in the number and diversity of sequenced genomes. An astonishing complexity of genome architecture has been revealed, bringing these sequencing technologies to even greater advancements. Some approaches maximize the number of bases sequenced in the least amount of time, generating a wealth of data that can be used to understand increasingly complex phenotypes. Alternatively, other approaches now aim to sequence longer contiguous pieces of DNA, which are essential for resolving structurally complex regions. These and other strategies are providing researchers and clinicians a variety of tools to probe genomes in greater depth, leading to an enhanced understanding of how genome sequence variants underlie phenotype and disease.

3,096 citations

Journal Article•

Patterns of Somatic Mutation in Human Cancer Genomes

Michael R. Stratton¹•Institutions (1)

Wellcome Trust Sanger Institute¹

15 Nov 2007-Clinical Cancer Research

TL;DR: In this paper, the coding exons of the family of 518 protein kinases were sequenced in 210 cancers of diverse histological types to explore the nature of the information that will be derived from cancer genome sequencing.

Abstract: (AACR Centennial Conference: Translational Cancer Medicine, Nov 4-8, 2007, Singapore; PL02-05.) All cancers are due to abnormalities in DNA. The availability of the human genome sequence has led to the proposal that resequencing of cancer genomes will reveal the full complement of somatic mutations and hence all the cancer genes. To explore the nature of the information that will be derived from cancer genome sequencing we have sequenced the coding exons of the family of 518 protein kinases, ~1.3 Mb DNA per cancer sample, in 210 cancers of diverse histological types. Despite the screen being directed toward the coding regions of a gene family that has previously been strongly implicated in oncogenesis, the results indicate that the majority of somatic mutations detected are “passengers”. There is considerable variation in the number and pattern of these mutations between individual cancers, indicating substantial diversity of processes of molecular evolution between cancers. The imprints of exogenous mutagenic exposures, mutagenic treatment regimes and DNA repair defects can all be seen in the distinctive mutational signatures of individual cancers. This systematic mutation screen and others have previously yielded a number of cancer genes that are frequently mutated in one or more cancer types and which are now anticancer drug targets (for example BRAF, PIK3CA, and EGFR). However, detailed analyses of the data from our screen additionally suggest that there exist a large number of additional “driver” mutations which are distributed across a substantial number of genes. It therefore appears that cells may be able to utilise mutations in a large repertoire of potential cancer genes to acquire the neoplastic phenotype. However, many of these genes are employed only infrequently. These findings may have implications for future anticancer drug development.

2,737 citations

Journal Article•DOI•

10 Years of GWAS Discovery: Biology, Function, and Translation

Peter M. Visscher¹, Naomi R. Wray¹, Qian Zhang¹, Pamela Sklar², Mark I. McCarthy³, Matthew A. Brown⁴, Jian Yang¹•Institutions (4)

University of Queensland¹, Icahn School of Medicine at Mount Sinai², Wellcome Trust Centre for Human Genetics³, Queensland University of Technology⁴

06 Jul 2017-American Journal of Human Genetics

TL;DR: This review covers the remarkable range of discoveries that GWASs have facilitated in population and complex-trait genetics, the biology of diseases, and translation toward new therapeutics.

Abstract: Application of the experimental design of genome-wide association studies (GWASs) is now 10 years old (young), and here we review the remarkable range of discoveries it has facilitated in population and complex-trait genetics, the biology of diseases, and translation toward new therapeutics. We predict the likely discoveries in the next 10 years, when GWASs will be based on millions of samples with array data imputed to a large fully sequenced reference panel and on hundreds of thousands of samples with whole-genome sequencing data.

2,669 citations
