A global reference for human genetic variation.

doi:10.1038/NATURE15393

Home
/
Papers
/
A global reference for human genetic variation.

Journal Article•DOI•

A global reference for human genetic variation.

Adam Auton¹, Gonçalo R. Abecasis², David Altshuler³, Richard Durbin⁴ +514 more•Institutions (90)

01 Oct 2015-Nature (Nature Publishing Group)-Vol. 526, Iss: 7571, pp 68-74

TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-generation sequencing, deep exome sequencing, and dense microarray genotyping.

read less

Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

...read moreread less

Citations

PDF

Open Access

More filters

Journal Article•DOI•

Large-Scale Cognitive GWAS Meta-Analysis Reveals Tissue-Specific Neural Expression and Potential Nootropic Drug Targets

[...]

Max Lam, Joey W. Trampush, Jin Yu, Emma Knowles¹, Gail Davies², David C. Liewald², John M. Starr², Srdjan Djurovic³, Ingrid Melle³, Kjetil Sundet⁴, Kjetil Sundet³, Andrea Christoforou⁵, Ivar Reinvang⁴, Pamela DeRosse, Astri J. Lundervold⁶, Vidar M. Steen⁶, Vidar M. Steen⁵, Thomas Espeseth⁴, Thomas Espeseth³, Katri Räikkönen⁷, Elisabeth Widen⁷, Aarno Palotie⁷, Aarno Palotie⁸, Johan G. Eriksson⁹, Johan G. Eriksson⁷, Ina Giegling¹⁰, Bettina Konte¹⁰, Panos Roussos¹¹, Stella Giakoumaki¹², Katherine E. Burdick¹¹, Antony Payton¹³, William Ollier¹³, Ornit Chiba-Falek¹⁴, Deborah K. Attix¹⁴, Anna C. Need¹⁵, Elizabeth T. Cirulli¹⁶, Aristotle N. Voineskos¹⁷, Nikos C. Stefanis¹⁸, Dimitrios Avramopoulos¹⁹, Alex Hatzimanolis¹⁸, Dan E. Arking²⁰, Nikolaos Smyrnis¹⁸, Robert M. Bilder²¹, Nelson A. Freimer²¹, Tyrone D. Cannon¹, Edythe D. London²¹, Russell A. Poldrack²², Fred W. Sabb²³, Eliza Congdon²¹, Emily Drabant Conley, Matthew A. Scult¹⁴, Dwight Dickinson⁹, Richard E. Straub¹⁹, Gary Donohoe²⁴, Derek W. Morris²⁴, Aiden Corvin²⁵, Michael Gill²⁵, Ahmad R. Hariri¹⁴, Daniel R. Weinberger¹⁹, Neil Pendleton²⁶, Panos Bitsios¹², Dan Rujescu¹⁰, Jari Lahti⁷, Stephanie Le Hellard⁵, Stephanie Le Hellard⁶, Matthew C. Keller²⁷, Ole A. Andreassen³, Ian J. Deary², David C. Glahn¹, Anil K. Malhotra²⁸, Todd Lencz²⁸, Todd Lencz²⁹ - Show less +68 more•Institutions (29)

Yale University¹, University of Edinburgh², Oslo University Hospital³, University of Oslo⁴, Haukeland University Hospital⁵, University of Bergen⁶, University of Helsinki⁷, Wellcome Trust Sanger Institute⁸, National Institutes of Health⁹, Martin Luther University of Halle-Wittenberg¹⁰, Icahn School of Medicine at Mount Sinai¹¹, University of Crete¹², University of Manchester¹³, Duke University¹⁴, Imperial College London¹⁵, Durham University¹⁶, Centre for Addiction and Mental Health¹⁷, National and Kapodistrian University of Athens¹⁸, Johns Hopkins University¹⁹, Johns Hopkins University School of Medicine²⁰, Semel Institute for Neuroscience and Human Behavior²¹, Stanford University²², University of Oregon²³, National University of Ireland, Galway²⁴, Trinity College, Dublin²⁵, Manchester Academic Health Science Centre²⁶, University of Colorado Boulder²⁷, The Feinstein Institute for Medical Research²⁸, Hofstra University²⁹

28 Nov 2017-Cell Reports

TL;DR: A large-scale genome-wide association study (GWAS) of general cognitive ability was presented in this paper, which showed significant enrichment for genes causing Mendelian disorders with an intellectual disability phenotype.

...read moreread less

90 citations

Journal Article•DOI•

Single-cell ATAC-Seq in human pancreatic islets and deep learning upscaling of rare cells reveals cell-specific type 2 diabetes regulatory signatures.

[...]

Vivek Rai¹, Daniel Quang¹, Michael R. Erdos², Darren A. Cusanovich³, Riza M. Daza³, Narisu Narisu², Luli S. Zou², John P. Didion², Yuanfang Guan¹, Jay Shendure³, Stephen C. J. Parker¹, Francis S. Collins² - Show less +8 more•Institutions (3)

University of Michigan¹, National Institutes of Health², University of Washington³

01 Feb 2020-Molecular metabolism

TL;DR: In this article, a deep learning model based on U-Net architecture was proposed to accurately predict open chromatin peak calls in rare cell populations and identify cell-type-specific regulatory signatures underlying Type 2 diabetes.

...read moreread less

Abstract: Objective Type 2 diabetes (T2D) is a complex disease characterized by pancreatic islet dysfunction, insulin resistance, and disruption of blood glucose levels. Genome-wide association studies (GWAS) have identified > 400 independent signals that encode genetic predisposition. More than 90% of associated single-nucleotide polymorphisms (SNPs) localize to non-coding regions and are enriched in chromatin-defined islet enhancer elements, indicating a strong transcriptional regulatory component to disease susceptibility. Pancreatic islets are a mixture of cell types that express distinct hormonal programs, so each cell type may contribute differentially to the underlying regulatory processes that modulate T2D-associated transcriptional circuits. Existing chromatin profiling methods such as ATAC-seq and DNase-seq, applied to islets in bulk, produce aggregate profiles that mask important cellular and regulatory heterogeneity. Methods We present genome-wide single-cell chromatin accessibility profiles in >1,600 cells derived from a human pancreatic islet sample using single-cell combinatorial indexing ATAC-seq (sci-ATAC-seq). We also developed a deep learning model based on U-Net architecture to accurately predict open chromatin peak calls in rare cell populations. Results We show that sci-ATAC-seq profiles allow us to deconvolve alpha, beta, and delta cell populations and identify cell-type-specific regulatory signatures underlying T2D. Particularly, T2D GWAS SNPs are significantly enriched in beta cell-specific and across cell-type shared islet open chromatin, but not in alpha or delta cell-specific open chromatin. We also demonstrate, using less abundant delta cells, that deep learning models can improve signal recovery and feature reconstruction of rarer cell populations. Finally, we use co-accessibility measures to nominate the cell-specific target genes at 104 non-coding T2D GWAS signals. Conclusions Collectively, we identify the islet cell type of action across genetic signals of T2D predisposition and provide higher-resolution mechanistic insights into genetically encoded risk pathways.

...read moreread less

90 citations

Journal Article•DOI•

Hepatic NADH reductive stress underlies common variation in metabolic traits

[...]

Russell P. Goodman¹, Andrew L. Markhard¹, Hardik Shah¹, Rohit Sharma¹, Owen S. Skinner¹, Clary B. Clish², Amy Deik², Anupam Patgiri¹, Yu-Han H. Hsu², Yu-Han H. Hsu³, Yu-Han H. Hsu¹, Ricard Masia¹, Hye Lim Noh⁴, Sujin Suk⁴, Olga Goldberger¹, Joel N. Hirschhorn³, Joel N. Hirschhorn¹, Joel N. Hirschhorn², Gary Yellen¹, Jason K. Kim⁴, Vamsi K. Mootha¹, Vamsi K. Mootha², Vamsi K. Mootha⁵ - Show less +19 more•Institutions (5)

Harvard University¹, Broad Institute², Boston Children's Hospital³, University of Massachusetts Medical School⁴, Howard Hughes Medical Institute⁵

27 May 2020-Nature

TL;DR: It is demonstrated that NADH reductive stress mediates the effects of GCKR variation on many metabolic traits, including circulating triglyceride levels, glucose tolerance and FGF21 levels, and underscores the utility of genetic tools such as Lb NOX to empower studies of 'causal metabolism’.

...read moreread less

Abstract: The cellular NADH/NAD+ ratio is fundamental to biochemistry, but the extent to which it reflects versus drives metabolic physiology in vivo is poorly understood. Here we report the in vivo application of Lactobacillus brevis (Lb)NOX1, a bacterial water-forming NADH oxidase, to assess the metabolic consequences of directly lowering the hepatic cytosolic NADH/NAD+ ratio in mice. By combining this genetic tool with metabolomics, we identify circulating α-hydroxybutyrate levels as a robust marker of an elevated hepatic cytosolic NADH/NAD+ ratio, also known as reductive stress. In humans, elevations in circulating α-hydroxybutyrate levels have previously been associated with impaired glucose tolerance2, insulin resistance3 and mitochondrial disease4, and are associated with a common genetic variant in GCKR5, which has previously been associated with many seemingly disparate metabolic traits. Using LbNOX, we demonstrate that NADH reductive stress mediates the effects of GCKR variation on many metabolic traits, including circulating triglyceride levels, glucose tolerance and FGF21 levels. Our work identifies an elevated hepatic NADH/NAD+ ratio as a latent metabolic parameter that is shaped by human genetic variation and contributes causally to key metabolic traits and diseases. Moreover, it underscores the utility of genetic tools such as LbNOX to empower studies of ‘causal metabolism’. The authors identify an increased hepatic NADH/NAD+ ratio as an underlying metabolic parameter that is shaped by human genetic variation and contributes causally to key metabolic traits and diseases.

...read moreread less

89 citations

Journal Article•DOI•

Clinical and biochemical features of different molecular etiologies of familial chylomicronemia

[...]

Robert A. Hegele¹, Amanda J. Berberich¹, Matthew R. Ban¹, Jian Wang¹, Andres Digenio, Veronica J. Alexander², Laura D'Erasmo³, Marcello Arca³, Alan Jones⁴, Eric Bruckert⁵, Erik S.G. Stroes, Jean Bergeron⁶, Fernando Civeira⁷, Joseph L. Witztum⁸, Daniel Gaudet⁹ - Show less +11 more•Institutions (9)

University of Western Ontario¹, Isis Pharmaceuticals², Sapienza University of Rome³, Heart of England NHS Foundation Trust⁴, Institute of Chartered Accountants of Nigeria⁵, Laval University⁶, University of Zaragoza⁷, University of California, San Diego⁸, Université de Montréal⁹

01 May 2017-Journal of Clinical Lipidology

TL;DR: LPL FCS patients have lower postheparin LPL activity and a trend toward higher TGs, whereas low-density lipoprotein cholesterol was higher in non-LPL-FCS patients, according to a phase 3 randomized placebo-controlled trial of volanesorsen.

...read moreread less

89 citations

Journal Article•DOI•

Human TGF-β1 deficiency causes severe inflammatory bowel disease and encephalopathy.

[...]

Daniel Kotlarz¹, Benjamin Marquardt¹, Tuva Barøy², Tuva Barøy³, Way S. Lee⁴, Liza Konnikova¹, Sebastian Hollizeck¹, Thomas Magg¹, Anna S. Lehle¹, Christoph Walz⁵, Ingo Borggraefe¹, Fabian Hauck¹, Philip Bufler¹, Raffaele Conca¹, Sarah Wall¹, E.M. Schumacher², Doriana Misceo², Doriana Misceo³, Eirik Frengen³, Eirik Frengen², Beint S. Bentsen², Holm H. Uhlig⁶, Karl-Peter Hopfner⁵, Aleixo M. Muise⁷, Scott B. Snapper¹, Scott B. Snapper⁸, Scott B. Snapper⁹, Petter Strømme², Petter Strømme³, Christoph Klein¹ - Show less +26 more•Institutions (9)

Boston Children's Hospital¹, Oslo University Hospital², University of Oslo³, University of Malaya⁴, Ludwig Maximilian University of Munich⁵, University of Oxford⁶, University of Toronto⁷, Brigham and Women's Hospital⁸, Harvard University⁹

26 Feb 2018-Nature Genetics

TL;DR: The study shows that TGF-β1 has a critical and nonredundant role in the development and homeostasis of intestinal immunity and the CNS in humans.

...read moreread less

Abstract: Transforming growth factor (TGF)-β1 (encoded by TGFB1) is the prototypic member of the TGF-β family of 33 proteins that orchestrate embryogenesis, development and tissue homeostasis1,2. Following its discovery 3 , enormous interest and numerous controversies have emerged about the role of TGF-β in coordinating the balance of pro- and anti-oncogenic properties4,5, pro- and anti-inflammatory effects 6 , or pro- and anti-fibrinogenic characteristics 7 . Here we describe three individuals from two pedigrees with biallelic loss-of-function mutations in the TGFB1 gene who presented with severe infantile inflammatory bowel disease (IBD) and central nervous system (CNS) disease associated with epilepsy, brain atrophy and posterior leukoencephalopathy. The proteins encoded by the mutated TGFB1 alleles were characterized by impaired secretion, function or stability of the TGF-β1-LAP complex, which is suggestive of perturbed bioavailability of TGF-β1. Our study shows that TGF-β1 has a critical and nonredundant role in the development and homeostasis of intestinal immunity and the CNS in humans.

...read moreread less

89 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
…
151
152
153
154
155
156
157
…
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

References

PDF

Open Access

More filters

Journal Article•DOI•

Basic Local Alignment Search Tool

[...]

Stephen F. Altschul¹, Warren Gish¹, Webb Miller², Eugene W. Myers³, David J. Lipman¹ - Show less +1 more•Institutions (3)

National Institutes of Health¹, Pennsylvania State University², University of Arizona³

01 Oct 1990-Journal of Molecular Biology

TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.

...read moreread less

88,255 citations

Journal Article•DOI•

The Sequence Alignment/Map format and SAMtools

[...]

Heng Li¹, Bob Handsaker², Alec Wysoker², T. J. Fennell², Jue Ruan³, Nils Homer², Gabor T. Marth⁴, Gonçalo R. Abecasis², Richard Durbin¹ - Show less +5 more•Institutions (4)

Wellcome Trust Sanger Institute¹, University of California, Los Angeles², Chinese Academy of Sciences³, Boston College⁴

01 Aug 2009-Bioinformatics

TL;DR: SAMtools as discussed by the authors implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments.

...read moreread less

Abstract: Summary: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments. Availability: http://samtools.sourceforge.net Contact: [email protected]

...read moreread less

45,957 citations

Journal Article•DOI•

BEDTools: a flexible suite of utilities for comparing genomic features

[...]

Aaron R. Quinlan¹, Ira M. Hall¹•Institutions (1)

University of Virginia¹

15 Mar 2010-Bioinformatics

TL;DR: A new software suite for the comparison, manipulation and annotation of genomic features in Browser Extensible Data (BED) and General Feature Format (GFF) format, which allows the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks.

...read moreread less

Abstract: Motivation: Testing for correlations between different sets of genomic features is a fundamental task in genomics research. However, searching for overlaps between features with existing webbased methods is complicated by the massive datasets that are routinely produced with current sequencing technologies. Fast and flexible tools are therefore required to ask complex questions of these data in an efficient manner. Results: This article introduces a new software suite for the comparison, manipulation and annotation of genomic features in Browser Extensible Data (BED) and General Feature Format (GFF) format. BEDTools also supports the comparison of sequence alignments in BAM format to both BED and GFF features. The tools are extremely efficient and allow the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks. BEDTools can be combined with one another as well as with standard UNIX commands, thus facilitating routine genomics tasks as well as pipelines that can quickly answer intricate questions of large genomic datasets. Availability and implementation: BEDTools was written in C++. Source code and a comprehensive user manual are freely available at http://code.google.com/p/bedtools

...read moreread less

18,858 citations

Journal Article•DOI•

An integrated encyclopedia of DNA elements in the human genome

[...]

Principal investigators¹, Nhgri groups², Data production leads³, Lead analysts³•Institutions (3)

Wellcome Trust¹, University of Washington², Pennsylvania State University³

06 Sep 2012-Nature

TL;DR: The Encyclopedia of DNA Elements project provides new insights into the organization and regulation of the authors' genes and genome, and is an expansive resource of functional annotations for biomedical research.

...read moreread less

Abstract: The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall, the project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.

...read moreread less

13,548 citations

Journal Article•DOI•

The variant call format and VCFtools

[...]

Petr Danecek¹, Adam Auton², Gonçalo R. Abecasis³, Cornelis A. Albers¹, Eric Banks⁴, Mark A. DePristo⁴, Robert E. Handsaker⁴, Gerton Lunter², Gabor T. Marth⁵, Stephen T. Sherry⁶, Gilean McVean², Richard Durbin¹ - Show less +8 more•Institutions (6)

Wellcome Trust¹, University of Oxford², University of Michigan³, Broad Institute⁴, Boston College⁵, National Institutes of Health⁶

01 Aug 2011-Bioinformatics

TL;DR: VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API.

...read moreread less

Abstract: Summary: The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API. Availability: http://vcftools.sourceforge.net Contact: [email protected]

...read moreread less

10,164 citations