John Huddleston

Other affiliations: Howard Hughes Medical Institute, University of Washington

Bio: John Huddleston is an academic researcher at Fred Hutchinson Cancer Research Center. The author has contributed to research on topics including Human genome and Genome. The author has an h-index of 30 and has co-authored 57 publications receiving 21,575 citations. Previous affiliations of John Huddleston include Howard Hughes Medical Institute and University of Washington.

Topics: Human genome, Genome, Structural variation, Segmental duplication, Genomics

[Chart: papers published on a yearly basis, 2013–2023]

Papers

Journal Article•DOI•

A global reference for human genetic variation.

Adam Auton¹, Gonçalo R. Abecasis², David Altshuler³, Richard Durbin⁴ +514 more•Institutions (90)

01 Oct 2015-Nature

TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. It reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping.

Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

12,661 citations

Journal Article•DOI•

Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data

Chen-Shan Chin¹, David Alexander¹, Patrick Marks¹, Aaron Klammer¹, James P Drake¹, Cheryl Heiner¹, Alicia Clum², Alex Copeland², John Huddleston³, Evan E. Eichler³, Stephen Turner¹, Jonas Korlach¹

Pacific Biosciences¹, Joint Genome Institute², University of Washington³

01 Jun 2013-Nature Methods

TL;DR: This work presents a hierarchical genome-assembly process (HGAP) for high-quality de novo microbial genome assemblies using only a single, long-insert shotgun DNA library in conjunction with Single Molecule, Real-Time (SMRT) DNA sequencing.

Abstract: We present a hierarchical genome-assembly process (HGAP) for high-quality de novo microbial genome assemblies using only a single, long-insert shotgun DNA library in conjunction with Single Molecule, Real-Time (SMRT) DNA sequencing. Our method uses the longest reads as seeds to recruit all other reads for construction of highly accurate preassembled reads through a directed acyclic graph-based consensus procedure, which we follow with assembly using off-the-shelf long-read assemblers. In contrast to hybrid approaches, HGAP does not require highly accurate raw reads for error correction. We demonstrate efficient genome assembly for several microorganisms using as few as three SMRT Cell zero-mode waveguide arrays of sequencing and for BACs using just one SMRT Cell. Long repeat regions can be successfully resolved with this workflow. We also describe a consensus algorithm that incorporates SMRT sequencing primary quality values to produce de novo genome sequence exceeding 99.999% accuracy.

3,647 citations
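
The HGAP abstract above quotes a consensus accuracy exceeding 99.999%. As a rough illustration of what that figure means, the sketch below converts a per-base accuracy into a Phred-scaled quality value and into an expected number of errors for a genome of a given length. The function names and the 5 Mbp genome length are assumptions made for this example; they do not come from the paper or from any HGAP software.

```python
import math


def phred_quality(accuracy: float) -> float:
    """Convert a per-base accuracy (strictly between 0 and 1) to a Phred-scaled quality.

    Uses the standard definition Q = -10 * log10(error rate).
    """
    if not 0.0 < accuracy < 1.0:
        raise ValueError("accuracy must be strictly between 0 and 1")
    error_rate = 1.0 - accuracy
    return -10.0 * math.log10(error_rate)


def expected_errors(genome_length: int, accuracy: float) -> float:
    """Expected number of erroneous bases, assuming independent per-base errors."""
    return genome_length * (1.0 - accuracy)


if __name__ == "__main__":
    accuracy = 0.99999          # the 99.999% threshold quoted in the abstract
    genome_length = 5_000_000   # hypothetical 5 Mbp microbial genome
    print(f"Phred quality: about Q{phred_quality(accuracy):.0f}")
    print(f"Expected errors: about {expected_errors(genome_length, accuracy):.0f}")
```

Under these assumptions, 99.999% accuracy corresponds to roughly Q50, or about one error per 100,000 bases.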

A global reference for human genetic variation

Adam Auton, Gonçalo R. Abecasis, David Altshuler, Richard Durbin +476 more

01 Oct 2015

TL;DR: The 1000 Genomes Project provided a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. This paper reports the completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping.

3,247 citations

Journal Article•DOI•

Nextstrain: real-time tracking of pathogen evolution.

James Hadfield¹, Colin Megill¹, Sidney M Bell¹,², John Huddleston¹,², Barney Potter¹, Charlton Callender¹, Pavel Sagulenko³, Trevor Bedford¹, Richard A. Neher³,⁴,⁵

Fred Hutchinson Cancer Research Center¹, University of Washington², Max Planck Society³, Swiss Institute of Bioinformatics⁴, University of Basel⁵

01 Dec 2018-Bioinformatics

TL;DR: Nextstrain consists of a database of viral genomes, a bioinformatics pipeline for phylodynamics analysis, and an interactive visualization platform that presents a real-time view into the evolution and spread of a range of viral pathogens of high public health importance.

Abstract: Summary: Understanding the spread and evolution of pathogens is important for effective public health measures and surveillance. Nextstrain consists of a database of viral genomes, a bioinformatics pipeline for phylodynamics analysis, and an interactive visualization platform. Together these present a real-time view into the evolution and spread of a range of viral pathogens of high public health importance. The visualization integrates sequence data with other data types such as geographic information, serology, or host species. Nextstrain compiles our current understanding into a single accessible location, open to health professionals, epidemiologists, virologists and the public alike. Availability and implementation: All code (predominantly JavaScript and Python) is freely available from github.com/nextstrain and the web-application is available at nextstrain.org.

2,305 citations

Journal Article•DOI•

An integrated map of structural variation in 2,504 human genomes

Peter H. Sudmant¹, Tobias Rausch, Eugene J. Gardner², Robert E. Handsaker³,⁴, Alexej Abyzov⁵, John Huddleston¹, Yan Zhang⁶, Kai Ye⁷, Goo Jun⁸,⁹, Markus Hsi-Yang Fritz, Miriam K. Konkel¹⁰, Ankit Malhotra, Adrian M. Stütz, Xinghua Shi¹¹, Francesco Paolo Casale¹², Jieming Chen⁶, Fereydoun Hormozdiari¹, Gargi Dayama⁹, Ken Chen¹³, Maika Malig¹, Mark Chaisson¹, Klaudia Walter¹², Sascha Meiers, Seva Kashin³,⁴, Erik Garrison¹⁴, Adam Auton¹⁵, Hugo Y. K. Lam, Xinmeng Jasmine Mu³,⁶, Can Alkan¹⁶, Danny Antaki¹⁷, Taejeong Bae⁵, Eliza Cerveira, Peter S. Chines¹⁸, Zechen Chong¹³, Laura Clarke¹², Elif Dal¹⁶, Li Ding⁷, S. Emery⁹, Xian Fan¹³, Madhusudan Gujral¹⁷, Fatma Kahveci¹⁶, Jeffrey M. Kidd⁹, Yu Kong¹⁵, Eric-Wubbo Lameijer¹⁹, Shane A. McCarthy¹², Paul Flicek¹², Richard A. Gibbs²⁰, Gabor T. Marth¹⁴, Christopher E. Mason²¹, Androniki Menelaou²²,²³, Donna M. Muzny²⁴, Bradley J. Nelson¹, Amina Noor¹⁷, Nicholas F. Parrish²⁵, Matthew Pendleton²⁴, Andrew Quitadamo¹¹, Benjamin Raeder, Eric E. Schadt²⁴, Mallory Romanovitch, Andreas Schlattl, Robert Sebra²⁴, Andrey A. Shabalin²⁶, Andreas Untergasser²⁷, Jerilyn A. Walker¹⁰, Min Wang²⁰, Fuli Yu²⁰, Chengsheng Zhang, Jing Zhang⁶, Xiangqun Zheng-Bradley¹², Wanding Zhou¹³, Thomas Zichner, Jonathan Sebat¹⁷, Mark A. Batzer¹⁰, Steven A. McCarroll³,⁴, Ryan E. Mills⁹, Mark Gerstein⁶, Ali Bashir²⁴, Oliver Stegle¹², Scott E. Devine², Charles Lee²⁸, Evan E. Eichler¹, Jan O. Korbel¹²

University of Washington¹, University of Maryland, Baltimore², Broad Institute³, Harvard University⁴, Mayo Clinic⁵, Yale University⁶, Washington University in St. Louis⁷, University of Texas Health Science Center at Houston⁸, University of Michigan⁹, Louisiana State University¹⁰, University of North Carolina at Charlotte¹¹, Wellcome Trust¹², University of Texas MD Anderson Cancer Center¹³, Boston College¹⁴, Yeshiva University¹⁵, Bilkent University¹⁶, University of California, San Diego¹⁷, National Institutes of Health¹⁸, Leiden University¹⁹, Baylor College of Medicine²⁰, Cornell University²¹, University of Oxford²², Utrecht University²³, Icahn School of Medicine at Mount Sinai²⁴, Kyoto University²⁵, Virginia Commonwealth University²⁶, Heidelberg University²⁷, Ewha Womans University²⁸

01 Oct 2015-Nature

TL;DR: In this paper, the authors describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which are constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations.

Abstract: Structural variants are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations. Analysing this set, we identify numerous gene-intersecting structural variants exhibiting population stratification and describe naturally occurring homozygous gene knockouts that suggest the dispensability of a variety of human genes. We demonstrate that structural variants are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of structural variant complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex structural variants with multiple breakpoints likely to have formed through individual mutational events. Our catalogue will enhance future studies into structural variant demography, functional impact and disease association.

1,971 citations

Cited by

Journal Article•DOI•

A global reference for human genetic variation.

Adam Auton¹, Gonçalo R. Abecasis², David Altshuler³, Richard Durbin⁴ +514 more•Institutions (90)

01 Oct 2015-Nature

12,661 citations

SPAdes, a new genome assembly algorithm and its applications to single-cell sequencing (7th Annual SFAF Meeting, 2012)

Glenn Tesler

01 Jun 2012

TL;DR: SPAdes is a new assembler for both single-cell and standard (multicell) assembly that improves on the recently released E+V-SC assembler and on the popular assemblers Velvet and SoapDeNovo (for multicell data).

Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.

10,124 citations

Journal Article•DOI•

Analysis of protein-coding genetic variation in 60,706 humans

Monkol Lek, Konrad J. Karczewski¹,², Eric Vallabh Minikel¹,², Kaitlin E. Samocha, Eric Banks¹, Timothy Fennell¹, Anne H. O’Donnell-Luria¹,²,³, James S. Ware, Andrew J. Hill¹,²,⁴, Beryl B. Cummings¹,², Taru Tukiainen¹,², Daniel P. Birnbaum¹, Jack A. Kosmicki, Laramie E. Duncan¹,², Karol Estrada¹,², Fengmei Zhao¹,², James Zou¹, Emma Pierce-Hoffman¹,², Joanne Berghout⁵, David Neil Cooper⁶, Nicole A. Deflaux⁷, Mark A. DePristo¹, Ron Do, Jason Flannick¹,², Menachem Fromer, Laura D. Gauthier¹, Jackie Goldstein¹,², Namrata Gupta¹, Daniel P. Howrigan¹,², Adam Kiezun¹, Mitja I. Kurki¹,², Ami Levy Moonshine¹, Pradeep Natarajan, Lorena Orozco, Gina M. Peloso¹,², Ryan Poplin¹, Manuel A. Rivas¹, Valentin Ruano-Rubio¹, Samuel A. Rose¹, Douglas M. Ruderfer⁸, Khalid Shakir¹, Peter D. Stenson⁶, Christine Stevens¹, Brett Thomas¹,², Grace Tiao¹, María Teresa Tusié-Luna, Ben Weisburd¹, Hong-Hee Won⁹, Dongmei Yu, David Altshuler¹,¹⁰, Diego Ardissino, Michael Boehnke¹¹, John Danesh¹², Stacey Donnelly¹, Roberto Elosua, Jose C. Florez¹,², Stacey Gabriel¹, Gad Getz¹,², Stephen J. Glatt¹³, Christina M. Hultman¹⁴, Sekar Kathiresan, Markku Laakso¹⁵, Steven A. McCarroll¹,², Mark I. McCarthy¹⁶,¹⁷, Dermot P.B. McGovern¹⁸, Ruth McPherson¹⁹, Benjamin M. Neale¹,², Aarno Palotie, Shaun Purcell⁸, Danish Saleheen²⁰, Jeremiah M. Scharf, Pamela Sklar, Patrick F. Sullivan¹⁴,²¹, Jaakko Tuomilehto²², Ming T. Tsuang²³, Hugh Watkins¹⁶,¹⁷, James G. Wilson²⁴, Mark J. Daly¹,², Daniel G. MacArthur¹,²

Broad Institute¹, Harvard University², Boston Children's Hospital³, University of Washington⁴, University of Arizona⁵, Cardiff University⁶, Google⁷, Icahn School of Medicine at Mount Sinai⁸, Samsung Medical Center⁹, Vertex Pharmaceuticals¹⁰, University of Michigan¹¹, University of Cambridge¹², State University of New York Upstate Medical University¹³, Karolinska Institutet¹⁴, University of Eastern Finland¹⁵, University of Oxford¹⁶, Wellcome Trust Centre for Human Genetics¹⁷, Cedars-Sinai Medical Center¹⁸, University of Ottawa¹⁹, University of Pennsylvania²⁰, University of North Carolina at Chapel Hill²¹, University of Helsinki²², University of California, San Diego²³, University of Mississippi Medical Center²⁴

18 Aug 2016-Nature

TL;DR: The aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC) provides direct evidence for the presence of widespread mutational recurrence.

Abstract: Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.

8,758 citations

Introducing the special issue on “Bioinformatics”

장병탁, 김삼묘, 허철구

01 Aug 2000

TL;DR: A bioentrepreneur course on the assessment of medical technology in the context of commercialization, covering many issues unique to biomedical products.

Abstract: BIOE 402. Medical Technology Assessment. 2 or 3 hours. Bioentrepreneur course. Assessment of medical technology in the context of commercialization. Objectives, competition, market share, funding, pricing, manufacturing, growth, and intellectual property; many issues unique to biomedical products. Course Information: 2 undergraduate hours. 3 graduate hours. Prerequisite(s): Junior standing or above and consent of the instructor.

4,833 citations

Journal Article•DOI•

Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.

Sergey Koren¹, Brian P. Walenz¹, Konstantin Berlin², Jason R. Miller³, Nicholas H. Bergman, Adam M. Phillippy¹

National Institutes of Health¹, Invincea², J. Craig Venter Institute³

15 Mar 2017-Genome Research

TL;DR: Canu, a successor of Celera Assembler that is specifically designed for noisy single-molecule sequences, is presented, demonstrating that Canu can reliably assemble complete microbial genomes and near-complete eukaryotic chromosomes using either Pacific Biosciences or Oxford Nanopore technologies.

Abstract: Long-read single-molecule sequencing has revolutionized de novo genome assembly and enabled the automated reconstruction of reference-quality genomes. However, given the relatively high error rates of such technologies, efficient and accurate assembly of large repeats and closely related haplotypes remains challenging. We address these issues with Canu, a successor of Celera Assembler that is specifically designed for noisy single-molecule sequences. Canu introduces support for nanopore sequencing, halves depth-of-coverage requirements, and improves assembly continuity while simultaneously reducing runtime by an order of magnitude on large genomes versus Celera Assembler 8.2. These advances result from new overlapping and assembly algorithms, including an adaptive overlapping strategy based on tf-idf weighted MinHash and a sparse assembly graph construction that avoids collapsing diverged repeats and haplotypes. We demonstrate that Canu can reliably assemble complete microbial genomes and near-complete eukaryotic chromosomes using either Pacific Biosciences (PacBio) or Oxford Nanopore technologies and achieves a contig NG50 of >21 Mbp on both human and Drosophila melanogaster PacBio data sets. For assembly structures that cannot be linearly represented, Canu provides graph-based assembly outputs in graphical fragment assembly (GFA) format for analysis or integration with complementary phasing and scaffolding techniques. The combination of such highly resolved assembly graphs with long-range scaffolding information promises the complete and automated assembly of complex genomes.

4,806 citations
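
The Canu abstract above reports a contig NG50 of >21 Mbp. For readers unfamiliar with the metric, the following is a minimal sketch of how NG50 is commonly computed: contigs are sorted from longest to shortest, and NG50 is the length of the contig at which the running total first reaches half of the estimated genome size. The function name and the toy contig lengths are assumptions for illustration only and are not taken from Canu.

```python
from typing import Sequence


def ng50(contig_lengths: Sequence[int], genome_size: int) -> int:
    """Return the NG50 of an assembly.

    NG50 is the largest length L such that contigs of length >= L together
    cover at least half of the estimated genome size. Returns 0 when the
    assembly covers less than half of the genome.
    """
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    half_genome = genome_size / 2
    running_total = 0
    # Walk contigs from longest to shortest, accumulating covered bases.
    for length in sorted(contig_lengths, reverse=True):
        running_total += length
        if running_total >= half_genome:
            return length
    return 0


if __name__ == "__main__":
    contigs = [50, 40, 30, 20, 10]  # toy contig lengths, in arbitrary units
    print(ng50(contigs, genome_size=200))           # NG50 against an assumed genome size
    print(ng50(contigs, genome_size=sum(contigs)))  # N50: same rule, using assembly size
```

Passing the total assembly length as `genome_size` yields the related N50 statistic, which is why NG50 is the preferred measure when comparing assemblies of different total size against the same genome.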
