Home
/
Authors
/
Joshua D. Smith

Author

Joshua D. Smith

Other affiliations: National Center for Health Statistics, University of Virginia, Harvard University

Bio: Joshua D. Smith is an academic researcher from University of Washington. The author has contributed to research in topics: Exome sequencing & Exome. The author has an hindex of 44, co-authored 92 publications receiving 16354 citations. Previous affiliations of Joshua D. Smith include National Center for Health Statistics & University of Virginia.

Papers published on a yearly basis

2021
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2006
2004
2003
1996

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Biological, clinical and population relevance of 95 loci for blood lipids

[...]

Tanya M. Teslovich¹, Kiran Musunuru, Albert V. Smith², Andrew C. Edmondson³ +215 more•Institutions (46)

05 Aug 2010-Nature

TL;DR: The results identify several novel loci associated with plasma lipids that are also associated with CAD and provide the foundation to develop a broader biological understanding of lipoprotein metabolism and to identify new therapeutic opportunities for the prevention of CAD.

...read moreread less

Abstract: Plasma concentrations of total cholesterol, low-density lipoprotein cholesterol, high-density lipoprotein cholesterol and triglycerides are among the most important risk factors for coronary artery disease (CAD) and are targets for therapeutic intervention. We screened the genome for common variants associated with plasma lipids in >100,000 individuals of European ancestry. Here we report 95 significantly associated loci (P < 5 x 10(-8)), with 59 showing genome-wide significant association with lipid traits for the first time. The newly reported associations include single nucleotide polymorphisms (SNPs) near known lipid regulators (for example, CYP7A1, NPC1L1 and SCARB1) as well as in scores of loci not previously implicated in lipoprotein metabolism. The 95 loci contribute not only to normal variation in lipid traits but also to extreme lipid phenotypes and have an impact on lipid traits in three non-European populations (East Asians, South Asians and African Americans). Our results identify several novel loci associated with plasma lipids that are also associated with CAD. Finally, we validated three of the novel genes-GALNT2, PPP1R3B and TTC39B-with experiments in mouse models. Taken together, our findings provide the foundation to develop a broader biological understanding of lipoprotein metabolism and to identify new therapeutic opportunities for the prevention of CAD.

...read moreread less

3,469 citations

Journal Article•DOI•

The contribution of de novo coding mutations to autism spectrum disorder

[...]

Ivan Iossifov¹, Brian J. O'Roak², Stephen Sanders³, Stephen Sanders⁴, Michael Ronemus¹, Niklas Krumm², Dan Levy¹, Holly A.F. Stessman², Kali Witherspoon², Laura Vives², Karynne E. Patterson², Joshua D. Smith², Bryan W. Paeper², Deborah A. Nickerson², Jeanselle Dea³, Shan Dong⁴, Shan Dong⁵, Luis E. Gonzalez⁴, Jeffrey D. Mandell³, Shrikant Mane⁴, Michael T. Murtha⁴, Catherine A.W. Sullivan⁴, Michael F. Walker³, Zainulabedin Waqar⁴, Liping Wei⁵, A. Jeremy Willsey⁴, A. Jeremy Willsey³, Boris Yamrom¹, Yoon-ha Lee¹, Ewa A. Grabowska¹, Ertugrul Dalkic⁶, Ertugrul Dalkic¹, Zihua Wang¹, Steven Marks¹, Peter Andrews¹, Anthony Leotta¹, Jude Kendall¹, Inessa Hakker¹, Julie Rosenbaum¹, Beicong Ma¹, Linda Rodgers¹, Jennifer Troge¹, Giuseppe Narzisi¹, Seungtai Yoon¹, Michael C. Schatz¹, Kenny Ye⁷, W. Richard McCombie¹, Jay Shendure², Evan E. Eichler², Evan E. Eichler⁸, Matthew W. State⁴, Matthew W. State³, Michael Wigler¹ - Show less +49 more•Institutions (8)

Cold Spring Harbor Laboratory¹, University of Washington², University of California, San Francisco³, Yale University⁴, Peking University⁵, Zonguldak Karaelmas University⁶, Yeshiva University⁷, Howard Hughes Medical Institute⁸

13 Nov 2014-Nature

TL;DR: It is estimated that LGD mutation in about 400 genes can contribute to the joint class of affected females and males of lower IQ, with an overlapping and similar number of genes vulnerable to contributory missense mutation.

...read moreread less

Abstract: Whole exome sequencing has proven to be a powerful tool for understanding the genetic architecture of human disease. Here we apply it to more than 2,500 simplex families, each having a child with an autistic spectrum disorder. By comparing affected to unaffected siblings, we show that 13% of de novo missense mutations and 43% of de novo likely gene-disrupting (LGD) mutations contribute to 12% and 9% of diagnoses, respectively. Including copy number variants, coding de novo mutations contribute to about 30% of all simplex and 45% of female diagnoses. Almost all LGD mutations occur opposite wild-type alleles. LGD targets in affected females significantly overlap the targets in males of lower intelligence quotient (IQ), but neither overlaps significantly with targets in males of higher IQ. We estimate that LGD mutation in about 400 genes can contribute to the joint class of affected females and males of lower IQ, with an overlapping and similar number of genes vulnerable to contributory missense mutation. LGD targets in the joint class overlap with published targets for intellectual disability and schizophrenia, and are enriched for chromatin modifiers, FMRP-associated genes and embryonically expressed genes. Most of the significance for the latter comes from affected females.

...read moreread less

2,124 citations

Journal Article•DOI•

Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations

[...]

Brian J. O'Roak¹, Laura Vives¹, Santhosh Girirajan¹, Emre Karakoc¹, Niklas Krumm¹, Bradley P. Coe¹, Roie Levy¹, Arthur Ko¹, Choli Lee¹, Joshua D. Smith¹, Emily H. Turner¹, Ian B. Stanaway¹, Benjamin Vernot¹, Maika Malig¹, Carl Baker¹, Beau Reilly¹, Joshua M. Akey¹, Elhanan Borenstein², Elhanan Borenstein¹, Mark J. Rieder¹, Deborah A. Nickerson¹, Raphael Bernier¹, Jay Shendure¹, Evan E. Eichler³, Evan E. Eichler¹ - Show less +21 more•Institutions (3)

University of Washington¹, Santa Fe Institute², Howard Hughes Medical Institute³

10 May 2012-Nature

TL;DR: It is shown that de novo point mutations are overwhelmingly paternal in origin (4:1 bias) and positively correlated with paternal age, consistent with the modest increased risk for children of older fathers to develop ASD.

...read moreread less

Abstract: It is well established that autism spectrum disorders (ASD) have a strong genetic component; however, for at least 70% of cases, the underlying genetic cause is unknown. Under the hypothesis that de novo mutations underlie a substantial fraction of the risk for developing ASD in families with no previous history of ASD or related phenotypes--so-called sporadic or simplex families--we sequenced all coding regions of the genome (the exome) for parent-child trios exhibiting sporadic ASD, including 189 new trios and 20 that were previously reported. Additionally, we also sequenced the exomes of 50 unaffected siblings corresponding to these new (n = 31) and previously reported trios (n = 19), for a total of 677 individual exomes from 209 families. Here we show that de novo point mutations are overwhelmingly paternal in origin (4:1 bias) and positively correlated with paternal age, consistent with the modest increased risk for children of older fathers to develop ASD. Moreover, 39% (49 of 126) of the most severe or disruptive de novo mutations map to a highly interconnected β-catenin/chromatin remodelling protein network ranked significantly for autism candidate genes. In proband exomes, recurrent protein-altering mutations were observed in two genes: CHD8 and NTNG1. Mutation screening of six candidate genes in 1,703 ASD probands identified additional de novo, protein-altering mutations in GRIN2B, LAMC3 and SCN1A. Combined with copy number variant (CNV) data, these results indicate extreme locus heterogeneity but also provide a target for future discovery, diagnostics and therapeutics.

...read moreread less

2,062 citations

Journal Article•DOI•

Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome

[...]

Sarah B. Ng¹, Abigail W. Bigham¹, Kati J. Buckingham¹, Mark C. Hannibal², Mark C. Hannibal¹, Margaret J. McMillin¹, Heidi I. S. Gildersleeve¹, Anita E. Beck¹, Anita E. Beck³, Holly K. Tabor¹, Holly K. Tabor³, Gregory M. Cooper¹, Heather C Mefford¹, Choli Lee¹, Emily H. Turner¹, Joshua D. Smith¹, Mark J. Rieder¹, Koh-ichiro Yoshiura⁴, Naomichi Matsumoto⁵, Tohru Ohta⁶, Norio Niikawa⁶, Deborah A. Nickerson¹, Michael J. Bamshad¹, Michael J. Bamshad³, Jay Shendure¹ - Show less +21 more•Institutions (6)

University of Washington¹, Seattle Children's², Boston Children's Hospital³, Nagasaki University⁴, Yokohama City University⁵, Health Sciences University of Hokkaido⁶

01 Sep 2010-Nature Genetics

TL;DR: The results strongly suggest that mutations in MLL2, which encodes a Trithorax-group histone methyltransferase, are a major cause of Kabuki syndrome.

...read moreread less

Abstract: We demonstrate the successful application of exome sequencing to discover a gene for an autosomal dominant disorder, Kabuki syndrome (OMIM%147920). We subjected the exomes of ten unrelated probands to massively parallel sequencing. After filtering against existing SNP databases, there was no compelling candidate gene containing previously unknown variants in all affected individuals. Less stringent filtering criteria allowed for the presence of modest genetic heterogeneity or missing data but also identified multiple candidate genes. However, genotypic and phenotypic stratification highlighted MLL2, which encodes a Trithorax-group histone methyltransferase: seven probands had newly identified nonsense or frameshift mutations in this gene. Follow-up Sanger sequencing detected MLL2 mutations in two of the three remaining individuals with Kabuki syndrome (cases) and in 26 of 43 additional cases. In families where parental DNA was available, the mutation was confirmed to be de novo (n = 12) or transmitted (n = 2) in concordance with phenotype. Our results strongly suggest that mutations in MLL2 are a major cause of Kabuki syndrome.

...read moreread less

1,261 citations

Journal Article•DOI•

Mapping and sequencing of structural variation from eight human genomes

[...]

Jeffrey M. Kidd¹, Gregory M. Cooper¹, William F. Donahue, Hillary S. Hayden¹, Nick Sampas², Tina Graves³, Nancy F. Hansen⁴, Brian Teague⁵, Can Alkan¹, Francesca Antonacci¹, Eric Haugen¹, Troy Zerr¹, N. Alice Yamada², Peter Tsang², Tera L. Newman¹, Eray Tüzün¹, Ze Cheng¹, Heather Ebling, Nadeem Tusneem, Robert David, Will D. Gillett¹, Karen A. Phelps¹, Molly Weaver¹, David J. Saranga, Adrianne Brand, Wei Tao, Erik Gustafson, Kevin McKernan, Lin Chen¹, Maika Malig¹, Joshua D. Smith¹, Joshua M. Korn⁶, Steven A. McCarroll⁶, David Altshuler⁶, Daniel A. Peiffer⁷, Michael O. Dorschner¹, John A. Stamatoyannopoulos¹, David C. Schwartz⁵, Deborah A. Nickerson¹, James C. Mullikin⁴, Richard K. Wilson³, Laurakay Bruhn², Maynard V. Olson¹, Rajinder Kaul¹, Douglas R. Smith, Evan E. Eichler¹ - Show less +42 more•Institutions (7)

University of Washington¹, Agilent Technologies², Washington University in St. Louis³, National Institutes of Health⁴, University of Wisconsin-Madison⁵, Broad Institute⁶, Illumina⁷

01 May 2008-Nature

TL;DR: This work employs a clone-based method to interrogate intermediate structural variation in eight individuals of diverse geographic ancestry and provides the first high-resolution sequence map of human structural variation—a standard for genotyping platforms and a prelude to future individual genome sequencing projects.

...read moreread less

Abstract: Genetic variation among individual humans occurs on many different scales, ranging from gross alterations in the human karyotype to single nucleotide changes. Here we explore variation on an intermediate scale--particularly insertions, deletions and inversions affecting from a few thousand to a few million base pairs. We employed a clone-based method to interrogate this intermediate structural variation in eight individuals of diverse geographic ancestry. Our analysis provides a comprehensive overview of the normal pattern of structural variation present in these genomes, refining the location of 1,695 structural variants. We find that 50% were seen in more than one individual and that nearly half lay outside regions of the genome previously described as structurally variant. We discover 525 new insertion sequences that are not present in the human reference genome and show that many of these are variable in copy number between individuals. Complete sequencing of 261 structural variants reveals considerable locus complexity and provides insights into the different mutational processes that have shaped the human genome. These data provide the first high-resolution sequence map of human structural variation--a standard for genotyping platforms and a prelude to future individual genome sequencing projects.

...read moreread less

1,183 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

A global reference for human genetic variation.

[...]

Adam Auton¹, Gonçalo R. Abecasis², David Altshuler³, Richard Durbin⁴ +514 more•Institutions (90)

01 Oct 2015-Nature

TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-generation sequencing, deep exome sequencing, and dense microarray genotyping.

...read moreread less

Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

...read moreread less

12,661 citations

Journal Article•

Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

[...]

Fumio Tajima¹•Institutions (1)

Kyushu University¹

30 Oct 1989-Genomics

TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.

...read moreread less

11,521 citations

Journal Article•DOI•

Analysis of protein-coding genetic variation in 60,706 humans

[...]

Monkol Lek, Konrad J. Karczewski¹, Konrad J. Karczewski², Eric Vallabh Minikel¹, Eric Vallabh Minikel², Kaitlin E. Samocha, Eric Banks¹, Timothy Fennell¹, Anne H. O’Donnell-Luria¹, Anne H. O’Donnell-Luria², Anne H. O’Donnell-Luria³, James S. Ware, Andrew J. Hill¹, Andrew J. Hill⁴, Andrew J. Hill², Beryl B. Cummings², Beryl B. Cummings¹, Taru Tukiainen², Taru Tukiainen¹, Daniel P. Birnbaum¹, Jack A. Kosmicki, Laramie E. Duncan¹, Laramie E. Duncan², Karol Estrada¹, Karol Estrada², Fengmei Zhao², Fengmei Zhao¹, James Zou¹, Emma Pierce-Hoffman², Emma Pierce-Hoffman¹, Joanne Berghout⁵, David Neil Cooper⁶, Nicole A. Deflaux⁷, Mark A. DePristo¹, Ron Do, Jason Flannick¹, Jason Flannick², Menachem Fromer, Laura D. Gauthier¹, Jackie Goldstein², Jackie Goldstein¹, Namrata Gupta¹, Daniel P. Howrigan¹, Daniel P. Howrigan², Adam Kiezun¹, Mitja I. Kurki¹, Mitja I. Kurki², Ami Levy Moonshine¹, Pradeep Natarajan, Lorena Orozco, Gina M. Peloso¹, Gina M. Peloso², Ryan Poplin¹, Manuel A. Rivas¹, Valentin Ruano-Rubio¹, Samuel A. Rose¹, Douglas M. Ruderfer⁸, Khalid Shakir¹, Peter D. Stenson⁶, Christine Stevens¹, Brett Thomas¹, Brett Thomas², Grace Tiao¹, María Teresa Tusié-Luna, Ben Weisburd¹, Hong-Hee Won⁹, Dongmei Yu, David Altshuler¹⁰, David Altshuler¹, Diego Ardissino, Michael Boehnke¹¹, John Danesh¹², Stacey Donnelly¹, Roberto Elosua, Jose C. Florez¹, Jose C. Florez², Stacey Gabriel¹, Gad Getz², Gad Getz¹, Stephen J. Glatt¹³, Christina M. Hultman¹⁴, Sekar Kathiresan, Markku Laakso¹⁵, Steven A. McCarroll¹, Steven A. McCarroll², Mark I. McCarthy¹⁶, Mark I. McCarthy¹⁷, Dermot P.B. McGovern¹⁸, Ruth McPherson¹⁹, Benjamin M. Neale², Benjamin M. Neale¹, Aarno Palotie, Shaun Purcell⁸, Danish Saleheen²⁰, Jeremiah M. Scharf, Pamela Sklar, Patrick F. Sullivan¹⁴, Patrick F. Sullivan²¹, Jaakko Tuomilehto²², Ming T. Tsuang²³, Hugh Watkins¹⁷, Hugh Watkins¹⁶, James G. Wilson²⁴, Mark J. Daly², Mark J. Daly¹, Daniel G. MacArthur², Daniel G. MacArthur¹ - Show less +103 more•Institutions (24)

Broad Institute¹, Harvard University², Boston Children's Hospital³, University of Washington⁴, University of Arizona⁵, Cardiff University⁶, Google⁷, Icahn School of Medicine at Mount Sinai⁸, Samsung Medical Center⁹, Vertex Pharmaceuticals¹⁰, University of Michigan¹¹, University of Cambridge¹², State University of New York Upstate Medical University¹³, Karolinska Institutet¹⁴, University of Eastern Finland¹⁵, Wellcome Trust Centre for Human Genetics¹⁶, University of Oxford¹⁷, Cedars-Sinai Medical Center¹⁸, University of Ottawa¹⁹, University of Pennsylvania²⁰, University of North Carolina at Chapel Hill²¹, University of Helsinki²², University of California, San Diego²³, University of Mississippi Medical Center²⁴

18 Aug 2016-Nature

TL;DR: The aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC) provides direct evidence for the presence of widespread mutational recurrence.

...read moreread less

Abstract: Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.

...read moreread less

8,758 citations

Journal Article•DOI•

Finding the missing heritability of complex diseases

[...]

Teri A. Manolio¹, Francis S. Collins¹, Nancy J. Cox², David Goldstein³, Lucia A. Hindorff¹, David J. Hunter⁴, Mark I. McCarthy⁵, Erin M. Ramos¹, Lon R. Cardon⁶, Aravinda Chakravarti⁷, Judy H. Cho⁸, Alan E. Guttmacher¹, Augustine Kong⁹, Leonid Kruglyak¹⁰, Leonid Kruglyak¹¹, Elaine R. Mardis¹², Charles N. Rotimi¹, Montgomery Slatkin¹³, David Valle⁷, Alice S. Whittemore¹⁴, Michael Boehnke¹⁵, Andrew G. Clark¹⁶, Evan E. Eichler¹⁷, Greg Gibson¹⁸, Jonathan L. Haines¹⁹, Trudy F. C. Mackay²⁰, Steven A. McCarroll⁴, Peter M. Visscher²¹ - Show less +24 more•Institutions (21)

National Institutes of Health¹, University of Chicago², Duke University³, Harvard University⁴, University of Oxford⁵, GlaxoSmithKline⁶, Johns Hopkins University⁷, Yale University⁸, deCODE genetics⁹, Princeton University¹⁰, Howard Hughes Medical Institute¹¹, Washington University in St. Louis¹², University of California, Berkeley¹³, Stanford University¹⁴, University of Michigan¹⁵, Cornell University¹⁶, University of Washington¹⁷, University of Queensland¹⁸, Vanderbilt University¹⁹, North Carolina State University²⁰, QIMR Berghofer Medical Research Institute²¹

08 Oct 2009-Nature

TL;DR: This paper examined potential sources of missing heritability and proposed research strategies, including and extending beyond current genome-wide association approaches, to illuminate the genetics of complex diseases and enhance its potential to enable effective disease prevention or treatment.

...read moreread less

Abstract: Genome-wide association studies have identified hundreds of genetic variants associated with complex human diseases and traits, and have provided valuable insights into their genetic architecture. Most variants identified so far confer relatively small increments in risk, and explain only a small proportion of familial clustering, leading many to question how the remaining, 'missing' heritability can be explained. Here we examine potential sources of missing heritability and propose research strategies, including and extending beyond current genome-wide association approaches, to illuminate the genetics of complex diseases and enhance its potential to enable effective disease prevention or treatment.

...read moreread less

7,797 citations

Journal Article•DOI•

Sequencing technologies-the next generation

[...]

Michael L. Metzker¹•Institutions (1)

Baylor College of Medicine¹

01 Jan 2010-Nature Reviews Genetics

TL;DR: A technical review of template preparation, sequencing and imaging, genome alignment and assembly approaches, and recent advances in current and near-term commercially available NGS instruments is presented.

...read moreread less

Abstract: Demand has never been greater for revolutionary technologies that deliver fast, inexpensive and accurate genome information. This challenge has catalysed the development of next-generation sequencing (NGS) technologies. The inexpensive production of large volumes of sequence data is the primary advantage over conventional methods. Here, I present a technical review of template preparation, sequencing and imaging, genome alignment and assembly approaches, and recent advances in current and near-term commercially available NGS instruments. I also outline the broad range of applications for NGS technologies, in addition to providing guidelines for platform selection to address biological questions of interest.

...read moreread less

7,023 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse