A global reference for human genetic variation.

doi:10.1038/NATURE15393

Journal Article•DOI•

Adam Auton¹, Gonçalo R. Abecasis², David Altshuler³, Richard Durbin⁴ +514 more•Institutions (90)

01 Oct 2015-Nature (Nature Publishing Group)-Vol. 526, Iss: 7571, pp 68-74

TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping.

Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

Citations

Journal Article•DOI•

Insights into the genetic architecture of the human face.

Julie D. White¹, Karlijne Indencleef², Sahin Naqvi³, Ryan J. Eller⁴, Hanne Hoskens², Jasmien Roosenboom⁵, Myoung Keun Lee⁵, Jiarui Li², Jaaved Mohammed³, Stephen Richmond⁶, Ellen E. Quillen⁷, Heather L. Norton⁸, Eleanor Feingold⁵, Tomek Swigut³, Mary L. Marazita⁵, Hilde Peeters², Greet Hens², John R. Shaffer⁵, Joanna Wysocka³, Susan Walsh⁴, Seth M. Weinberg⁵, Mark D. Shriver¹, Peter Claes•Institutions (8)

Pennsylvania State University¹, Katholieke Universiteit Leuven², Stanford University³, Indiana University – Purdue University Indianapolis⁴, University of Pittsburgh⁵, Cardiff University⁶, Wake Forest University⁷, University of Cincinnati⁸

31 Jan 2021-Nature Genetics

TL;DR: Analysis of a multivariate genome-wide association study meta-analysis of 8,246 European individuals provides insights into the understanding of how complex morphological traits are shaped by both individual and coordinated genetic actions.

Abstract: The human face is complex and multipartite, and characterization of its genetic architecture remains challenging. Using a multivariate genome-wide association study meta-analysis of 8,246 European individuals, we identified 203 genome-wide-significant signals (120 also study-wide significant) associated with normal-range facial variation. Follow-up analyses indicate that the regions surrounding these signals are enriched for enhancer activity in cranial neural crest cells and craniofacial tissues, several regions harbor multiple signals with associations to different facial phenotypes, and there is evidence for potential coordinated actions of variants. In summary, our analyses provide insights into the understanding of how complex morphological traits are shaped by both individual and coordinated genetic actions.

76 citations

Journal Article•DOI•

Mutation Rate Variation is a Primary Determinant of the Distribution of Allele Frequencies in Humans.

Arbel Harpak¹, Anand Bhaskar¹, Jonathan K. Pritchard¹,²•Institutions (2)

Stanford University¹, Howard Hughes Medical Institute²

15 Dec 2016-PLOS Genetics

TL;DR: It is shown that variable mutation rates are key determinants of the SFS in humans and this effect is largely due to sites with elevated mutation rates causing significant departures from the widely-used infinite sites mutation model.

Abstract: The site frequency spectrum (SFS) has long been used to study demographic history and natural selection. Here, we extend this summary by examining the SFS conditional on the alleles found at the same site in other species. We refer to this extension as the "phylogenetically-conditioned SFS" or cSFS. Using recent large-sample data from the Exome Aggregation Consortium (ExAC), combined with primate genome sequences, we find that human variants that occurred independently in closely related primate lineages are at higher frequencies in humans than variants with parallel substitutions in more distant primates. We show that this effect is largely due to sites with elevated mutation rates causing significant departures from the widely-used infinite sites mutation model. Our analysis also suggests substantial variation in mutation rates even among mutations involving the same nucleotide changes. In summary, we show that variable mutation rates are key determinants of the SFS in humans.

75 citations
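
As a rough illustration of the site frequency spectrum (SFS) described in the abstract above, the sketch below tallies how many segregating sites carry each derived-allele count in a sample. The function name `site_frequency_spectrum` and the toy counts are assumptions made for illustration; they are not taken from the paper, and the conditioning on other species that defines the cSFS is not attempted here.

```python
from collections import Counter

def site_frequency_spectrum(derived_counts, n_chromosomes):
    """Return a list sfs where sfs[k] is the number of sites whose derived
    allele is seen on exactly k of the n_chromosomes sampled chromosomes.
    Only segregating sites (0 < k < n_chromosomes) are counted."""
    tally = Counter(k for k in derived_counts if 0 < k < n_chromosomes)
    return [tally.get(k, 0) for k in range(n_chromosomes + 1)]

# Hypothetical data: derived-allele counts at 8 sites, 6 chromosomes sampled.
counts = [1, 1, 2, 1, 5, 3, 6, 0]
sfs = site_frequency_spectrum(counts, 6)
print(sfs)
```

Sites where the derived allele is absent or fixed in the sample (counts 0 and 6 here) are left out, since they are not variable within the sample.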

Journal Article•DOI•

Genetic Variation in HSD17B13 Reduces the Risk of Developing Cirrhosis and Hepatocellular Carcinoma in Alcohol Misusers.

Felix Stickel, Philipp Lutz¹, Stephan Buch², Hans Dieter Nischalke¹, Ines Silva³, Vanessa Rausch³, Janett Fischer, Karl Heinz Weiss⁴, Daniel Gotthardt⁴, Jonas Rosendahl, Astrid Marot⁵, Mona Elamly⁵, Marcin Krawczyk⁶,⁷, Markus Casper⁷, Frank Lammert⁷, Thomas W.M. Buckley⁸, Andrew McQuillin⁸, Ulrich Spengler¹, Florian Eyer⁹, Arndt Vogel¹⁰, Silke Marhenke¹⁰, Johann von Felden¹¹, Henning Wege¹¹, Rohini Sharma¹², Stephen R. Atkinson¹², Andre Franke¹³, S Nehring², V Moser², Clemens Schafmayer¹³, Laurent Spahr, Carolin Lackner¹⁴, Rudolf E. Stauber¹⁴, Ali Canbay¹⁵, Alexander Link¹⁵, Luca Valenti¹⁶,¹⁷, Jane I. Grove¹⁸,¹⁹, Guruprasad P. Aithal¹⁸,¹⁹, Jens U. Marquardt²⁰, Waleed Fateen¹⁸,¹⁹, Steffen Zopf²¹, Jean-François Dufour²², Jonel Trebicka²³, Christian Datz²⁴, Pierre Deltenre⁵, Sebastian Mueller³, Thomas Berg, Jochen Hampe², Marsha Y. Morgan⁸•Institutions (24)

University of Bonn¹, Dresden University of Technology², University Hospital Heidelberg³, Heidelberg University⁴, University Hospital of Lausanne⁵, Medical University of Warsaw⁶, Saarland University⁷, University College London⁸, Technische Universität München⁹, Hannover Medical School¹⁰, University of Hamburg¹¹, Imperial College London¹², University of Kiel¹³, University of Graz¹⁴, Ruhr University Bochum¹⁵, University of Milan¹⁶, Fondazione IRCCS Ca' Granda Ospedale Maggiore Policlinico¹⁷, University of Nottingham¹⁸, Nottingham University Hospitals NHS Trust¹⁹, University of Mainz²⁰, University of Erlangen-Nuremberg²¹, University of Bern²², Goethe University Frankfurt²³, Paracelsus Private Medical University of Salzburg²⁴

01 Jul 2020-Hepatology

TL;DR: This study explores the risk associations between these two genetic variants and the development of alcohol‐related cirrhosis and HCC.

75 citations

Journal Article•DOI•

What can genome-wide association studies tell us about the evolutionary forces maintaining genetic variation for quantitative traits?

Emily B. Josephs¹, John R. Stinchcombe², Stephen I. Wright²•Institutions (2)

University of California, Davis¹, University of Toronto²

01 Apr 2017-New Phytologist

TL;DR: This review covers the theoretical predictions for genetic architecture and additional signals of selection on genomic sequence for the loci that affect traits, and how plant GWAS have tested for the signatures of various selective scenarios.

Abstract: Understanding the evolutionary forces that shape genetic variation within species has long been a goal of evolutionary biology. Integrating data for the genetic architecture of traits from genome-wide association mapping studies (GWAS) along with the development of new population genetic methods for identifying selection in sequence data may allow us to evaluate the roles of mutation-selection balance and balancing selection in shaping genetic variation at various scales. Here, we review the theoretical predictions for genetic architecture and additional signals of selection on genomic sequence for the loci that affect traits. Next, we review how plant GWAS have tested for the signatures of various selective scenarios. Limited evidence to date suggests that within-population variation is maintained primarily by mutation-selection balance while variation across the landscape is the result of local adaptation. However, there are a number of inherent biases in these interpretations. We highlight these challenges and suggest ways forward to further understanding of the maintenance of variation.

75 citations

Journal Article•DOI•

Cholesterol and matrisome pathways dysregulated in astrocytes and microglia

Julia Tcw, Lu Qian, Nina H. Pipalia, Michael J. Chao, Shuang Liang, Yang Shi, Bharat R. Jain, Sarah Bertelsen, Manav Kapoor, Edoardo Marcora, Elizabeth Sikora, Elizabeth J. Andrews, Alessandra C. Martini, Celeste M. Karch, Elizabeth Head, David M. Holtzman, Bin Zhang, Minghui Wang, Frederick R. Maxfield, Wayne W. Poon, Alison Goate

01 Jun 2022-Cell

TL;DR: This paper investigates the effects of APOE4 on brain cell types derived from population and isogenic human induced pluripotent stem cells, post-mortem brain, and APOE targeted replacement mice.

75 citations

References

Journal Article•DOI•

Basic Local Alignment Search Tool

Stephen F. Altschul¹, Warren Gish¹, Webb Miller², Eugene W. Myers³, David J. Lipman¹•Institutions (3)

National Institutes of Health¹, Pennsylvania State University², University of Arizona³

01 Oct 1990-Journal of Molecular Biology

TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.

88,255 citations

Journal Article•DOI•

The Sequence Alignment/Map format and SAMtools

Heng Li¹, Bob Handsaker², Alec Wysoker², T. J. Fennell², Jue Ruan³, Nils Homer², Gabor T. Marth⁴, Gonçalo R. Abecasis², Richard Durbin¹•Institutions (4)

Wellcome Trust Sanger Institute¹, University of California, Los Angeles², Chinese Academy of Sciences³, Boston College⁴

01 Aug 2009-Bioinformatics

TL;DR: SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments.

Abstract: Summary: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments. Availability: http://samtools.sourceforge.net Contact: [email protected]

45,957 citations

Journal Article•DOI•

BEDTools: a flexible suite of utilities for comparing genomic features

Aaron R. Quinlan¹, Ira M. Hall¹•Institutions (1)

University of Virginia¹

15 Mar 2010-Bioinformatics

TL;DR: A new software suite for the comparison, manipulation and annotation of genomic features in Browser Extensible Data (BED) and General Feature Format (GFF) format, which allows the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks.

Abstract: Motivation: Testing for correlations between different sets of genomic features is a fundamental task in genomics research. However, searching for overlaps between features with existing web-based methods is complicated by the massive datasets that are routinely produced with current sequencing technologies. Fast and flexible tools are therefore required to ask complex questions of these data in an efficient manner. Results: This article introduces a new software suite for the comparison, manipulation and annotation of genomic features in Browser Extensible Data (BED) and General Feature Format (GFF) format. BEDTools also supports the comparison of sequence alignments in BAM format to both BED and GFF features. The tools are extremely efficient and allow the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks. BEDTools can be combined with one another as well as with standard UNIX commands, thus facilitating routine genomics tasks as well as pipelines that can quickly answer intricate questions of large genomic datasets. Availability and implementation: BEDTools was written in C++. Source code and a comprehensive user manual are freely available at http://code.google.com/p/bedtools

18,858 citations
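
To make the idea of "searching for overlaps between features" concrete, here is a minimal sketch of an overlap test on BED-style intervals, which use 0-based, half-open coordinates. The helper names `overlaps` and `intersect` and the sample features are hypothetical. The quadratic scan is chosen only for clarity and is not how BEDTools itself is implemented.

```python
def overlaps(a, b):
    """True if two (chrom, start, end) features share at least one base.
    Coordinates are 0-based and half-open, as in BED."""
    return a[0] == b[0] and a[1] < b[2] and b[1] < a[2]

def intersect(features_a, features_b):
    """Return every (a, b) pair that overlaps, using a naive all-pairs scan."""
    return [(a, b) for a in features_a for b in features_b if overlaps(a, b)]

# Hypothetical features for illustration.
features_a = [("chr1", 100, 200), ("chr1", 300, 400), ("chr2", 100, 200)]
features_b = [("chr1", 150, 250), ("chr1", 200, 300), ("chr2", 50, 100)]
pairs = intersect(features_a, features_b)
print(pairs)
```

Because the intervals are half-open, features that merely touch (for example one ending at 200 and another starting at 200) are not reported as overlapping.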

Journal Article•DOI•

An integrated encyclopedia of DNA elements in the human genome

Principal investigators¹, NHGRI groups², Data production leads³, Lead analysts³•Institutions (3)

Wellcome Trust¹, University of Washington², Pennsylvania State University³

06 Sep 2012-Nature

TL;DR: The Encyclopedia of DNA Elements project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.

Abstract: The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall, the project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.

13,548 citations

Journal Article•DOI•

The variant call format and VCFtools

Petr Danecek¹, Adam Auton², Gonçalo R. Abecasis³, Cornelis A. Albers¹, Eric Banks⁴, Mark A. DePristo⁴, Robert E. Handsaker⁴, Gerton Lunter², Gabor T. Marth⁵, Stephen T. Sherry⁶, Gilean McVean², Richard Durbin¹•Institutions (6)

Wellcome Trust¹, University of Oxford², University of Michigan³, Broad Institute⁴, Boston College⁵, National Institutes of Health⁶

01 Aug 2011-Bioinformatics

TL;DR: VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API.

Abstract: Summary: The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API. Availability: http://vcftools.sourceforge.net Contact: [email protected]

10,164 citations
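
For readers unfamiliar with the variant call format described in the abstract above, the sketch below splits one tab-delimited VCF data line into its eight fixed columns (CHROM, POS, ID, REF, ALT, QUAL, FILTER, INFO). The function name `parse_vcf_line` is an assumption for illustration and is not part of VCFtools. Header lines, genotype columns and type conversion of INFO values are deliberately left out.

```python
def parse_vcf_line(line):
    """Parse the eight fixed columns of a single VCF data line into a dict."""
    fields = line.rstrip("\n").split("\t")
    chrom, pos, vid, ref, alt, qual, flt, info = fields[:8]
    info_dict = {}
    for item in info.split(";"):
        if item == ".":  # "." marks a missing value in VCF
            continue
        key, sep, value = item.partition("=")
        # Keys without "=" are flags; store them as True.
        info_dict[key] = value if sep else True
    return {
        "chrom": chrom,
        "pos": int(pos),  # 1-based position on the reference
        "id": vid,
        "ref": ref,
        "alt": alt.split(","),  # several ALT alleles are comma-separated
        "qual": None if qual == "." else float(qual),
        "filter": flt,
        "info": info_dict,
    }

# Example data line in the style used by the VCF specification.
record = parse_vcf_line("20\t14370\trs6054257\tG\tA\t29\tPASS\tNS=3;DP=14;AF=0.5;DB")
print(record["chrom"], record["pos"], record["alt"], record["info"])
```

INFO values are kept as strings here; a fuller parser would consult the header's INFO declarations to decide each value's type.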