Michael Boehnke

Other affiliations include SUNY Downstate Medical Center, Norwegian University of Science and Technology, and National Institutes of Health.

Bio: Michael Boehnke is an academic researcher at the University of Michigan whose research topics include genome-wide association studies and type 2 diabetes. Boehnke has an h-index of 152 and has co-authored 511 publications that have received 136,681 citations. Previous affiliations include SUNY Downstate Medical Center and the Norwegian University of Science and Technology.

Papers published on a yearly basis: [chart; years shown: 1981, 1986-2023]

Papers

Journal Article

Ancestry-agnostic estimation of DNA sample contamination from sequence reads.

Fan Zhang¹, Matthew Flickinger¹, Sarah A Gagliano Taliun¹, Gonçalo R. Abecasis¹, Laura J. Scott¹, Steven A McCarroll²,³, Carlos N. Pato⁴, Michael Boehnke¹, Hyun Min Kang¹

University of Michigan¹, Harvard University², Broad Institute³, SUNY Downstate Medical Center⁴

24 Jan 2020-Genome Research

TL;DR: Proposes a robust statistical method that accurately estimates DNA contamination, is agnostic to the genetic ancestry of the intended or contaminating sample, and integrates the estimation of genetic ancestry and DNA contamination in a unified likelihood framework by leveraging individual-specific allele frequencies projected from reference genotypes onto principal component coordinates.

Abstract: Detecting and estimating DNA sample contamination are important steps to ensure high-quality genotype calls and reliable downstream analysis. Existing methods rely on population allele frequency information for accurate estimation of contamination rates. Correctly specifying population allele frequencies for each individual in early stage of sequence analysis is impractical or even impossible for large-scale sequencing centers that simultaneously process samples from multiple studies across diverse populations. On the other hand, incorrectly specified allele frequencies may result in substantial bias in estimated contamination rates. For example, we observed that existing methods often fail to identify 10% contaminated samples at a typical 3% contamination exclusion threshold when genetic ancestry is misspecified. Such an incomplete screening of contaminated samples substantially inflates the estimated rate of genotyping errors even in deeply sequenced genomes and exomes. We propose a robust statistical method that accurately estimates DNA contamination and is agnostic to genetic ancestry of the intended or contaminating sample. Our method integrates the estimation of genetic ancestry and DNA contamination in a unified likelihood framework by leveraging individual-specific allele frequencies projected from reference genotypes onto principal component coordinates. Our method can also be used for estimating genetic ancestries, similar to LASER or TRACE, but simultaneously accounting for potential contamination. We demonstrate that our method robustly estimates contamination rates and genetic ancestries across populations and contamination scenarios. We further demonstrate that, in the presence of contamination, genetic ancestry inference can be substantially biased with existing methods that ignore contamination, while our method corrects for such biases.

37 citations

Multi-ancestry study of blood lipid levels identifies four loci interacting with physical activity

Tuomas O. Kilpeläinen, Amy R. Bentley, Raymond Noordam, Yun Ju Sung +233 more

01 Jan 2019

TL;DR: Physical activity modifies the effects of four genetic loci on HDL or LDL cholesterol: higher levels of physical activity enhance the HDL cholesterol-increasing effects of the CLASP1, LHX1, and SNTA1 loci and attenuate the LDL cholesterol-increasing effect of the CNTNAP2 locus.

Abstract: Many genetic loci affect circulating lipid levels, but it remains unknown whether lifestyle factors, such as physical activity, modify these genetic effects. To identify lipid loci interacting with physical activity, we performed genome-wide analyses of circulating HDL cholesterol, LDL cholesterol, and triglyceride levels in up to 120,979 individuals of European, African, Asian, Hispanic, and Brazilian ancestry, with follow-up of suggestive associations in an additional 131,012 individuals. We find four loci, in/near CLASP1, LHX1, SNTA1, and CNTNAP2, that are associated with circulating lipid levels through interaction with physical activity; higher levels of physical activity enhance the HDL cholesterol-increasing effects of the CLASP1, LHX1, and SNTA1 loci and attenuate the LDL cholesterol-increasing effect of the CNTNAP2 locus. The CLASP1, LHX1, and SNTA1 regions harbor genes linked to muscle function and lipid metabolism. Our results elucidate the role of physical activity interactions in the genetic contribution to blood lipid levels. GWAS have identified more than 500 genetic loci associated with blood lipid levels. Here, the authors report a genome-wide analysis of interactions between genetic markers and physical activity, and find that physical activity modifies the effects of four genetic loci on HDL or LDL cholesterol.

37 citations

Journal Article

Colocalization of GWAS and eQTL signals at loci with multiple signals identifies additional candidate genes for body fat distribution.

Ying Wu¹, K. Alaine Broadaway¹, Chelsea K. Raulerson¹, Laura J. Scott², Calvin Pan³, Arthur Ko³, Aiqing He⁴, Charles Tilford⁴, Christian Fuchsberger²,⁵, Adam E. Locke²,⁶, Heather M. Stringham², Anne U. Jackson², Narisu Narisu⁷, Johanna Kuusisto⁸, Päivi Pajukanta³, Francis S. Collins⁷, Michael Boehnke², Markku Laakso⁸, Aldons J. Lusis³, Mete Civelek³,⁹, Karen L. Mohlke¹

University of North Carolina at Chapel Hill¹, University of Michigan², University of California, Los Angeles³, Bristol-Myers Squibb⁴, University of Lübeck⁵, Washington University in St. Louis⁶, National Institutes of Health⁷, University of Eastern Finland⁸, University of Virginia⁹

15 Dec 2019-Human Molecular Genetics

TL;DR: Evidence of colocalization is reevaluated using two approaches, conditional analysis and the Bayesian test COLOC, and it is shown that providing COLOC with approximate conditional summary statistics at multi-signal GWAS loci can reconcile disagreements in colocalization classification between the two tests.

Abstract: Integration of genome-wide association study (GWAS) signals with expression quantitative trait loci (eQTL) studies enables identification of candidate genes. However, evaluating whether nearby signals may share causal variants, termed colocalization, is affected by the presence of allelic heterogeneity, different variants at the same locus impacting the same phenotype. We previously identified eQTL in subcutaneous adipose tissue from 770 participants in the Metabolic Syndrome in Men (METSIM) study and detected 15 eQTL signals that colocalized with GWAS signals for waist-hip ratio adjusted for body mass index (WHRadjBMI) from the Genetic Investigation of Anthropometric Traits consortium. Here, we reevaluated evidence of colocalization using two approaches, conditional analysis and the Bayesian test COLOC, and show that providing COLOC with approximate conditional summary statistics at multi-signal GWAS loci can reconcile disagreements in colocalization classification between the two tests. Next, we performed conditional analysis on the METSIM subcutaneous adipose tissue data to identify conditionally distinct or secondary eQTL signals. We used the two approaches to test for colocalization with WHRadjBMI GWAS signals and evaluated the differences in colocalization classification between the two tests. Through these analyses, we identified four GWAS signals colocalized with secondary eQTL signals for FAM13A, SSR3, GRB14 and FMO1. Thus, at loci with multiple eQTL and/or GWAS signals, analyzing each signal independently enabled additional candidate genes to be identified.

36 citations

Journal Article

Genome-wide association studies of metabolites in Finnish men identify disease-relevant loci

Xianyong Yin, Lap Sum Chan, Debraj Bose, Anne U. Jackson, Peter VandeHaar, Adam E. Locke, Christian Fuchsberger, Heather M. Stringham, Ryan P. Welch, Ketian Yu, Lilian Fernandes Silva, S. Service, Daiwei Zhang, Emily C. Hector, Erica P. Young, Liron Ganel, Indraniel Das, Haley J. Abel, Michael R. Erdos, Lori L. Bonnycastle, Johanna Kuusisto, Nathan O. Stitziel, Ira M. Hall, Gregory R. Wagner, Samuli Ripatti, Aarno Palotie, Jian Kang, Jean Morrison, Charles F. Burant, Francis S. Collins, Nelson B. Freimer, Karen L. Mohlke, Laura J. Scott, Xiaoquan Wen, Eric B. Fauman, Markku Laakso, Michael Boehnke

28 Mar 2022-Nature Communications

TL;DR: The authors explored the impact of rare variants (minor allele frequency < 1%) on highly heritable plasma metabolites identified in metabolomic screens and identified 303 novel association signals, more than one third at variants rare or enriched in Finns.

Abstract: Few studies have explored the impact of rare variants (minor allele frequency < 1%) on highly heritable plasma metabolites identified in metabolomic screens. The Finnish population provides an ideal opportunity for such explorations, given the multiple bottlenecks and expansions that have shaped its history, and the enrichment for many otherwise rare alleles that has resulted. Here, we report genetic associations for 1391 plasma metabolites in 6136 men from the late-settlement region of Finland. We identify 303 novel association signals, more than one third at variants rare or enriched in Finns. Many of these signals identify genes not previously implicated in metabolite genome-wide association studies and suggest mechanisms for diseases and disease-related traits.

36 citations

Journal Article

Adipose Tissue Gene Expression Associations Reveal Hundreds of Candidate Genes for Cardiometabolic Traits.

Chelsea K. Raulerson¹, Arthur Ko², John C. Kidd¹, Kevin W Currin¹, Sarah M Brotman¹, Maren E Cannon¹, Ying Wu¹, Cassandra N. Spracklen¹, Anne U. Jackson³, Heather M. Stringham³, Ryan P. Welch³, Christian Fuchsberger⁴, Adam E. Locke⁵, Narisu Narisu⁶, Aldons J. Lusis², Mete Civelek⁷, Terrence S. Furey¹, Johanna Kuusisto⁸, Francis S. Collins⁶, Michael Boehnke³, Laura J. Scott³, Danyu Lin¹, Michael I. Love¹, Markku Laakso⁸, Päivi Pajukanta², Karen L. Mohlke¹

University of North Carolina at Chapel Hill¹, University of California, Los Angeles², University of Michigan³, University of Lübeck⁴, Washington University in St. Louis⁵, National Institutes of Health⁶, University of Virginia⁷, University of Eastern Finland⁸

03 Oct 2019-American Journal of Human Genetics

TL;DR: This work used subcutaneous adipose tissue RNA-seq data from 434 Finnish men from the METSIM study to identify 9,687 primary and 2,785 secondary cis-expression quantitative trait loci (eQTL), identifying hundreds of candidate genes that may act in adipose tissues to influence cardiometabolic traits.

Abstract: Genome-wide association studies (GWASs) have identified thousands of genetic loci associated with cardiometabolic traits including type 2 diabetes (T2D), lipid levels, body fat distribution, and adiposity, although most causal genes remain unknown. We used subcutaneous adipose tissue RNA-seq data from 434 Finnish men from the METSIM study to identify 9,687 primary and 2,785 secondary cis-expression quantitative trait loci (eQTL;

36 citations

Cited by

Journal Article

The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data

Aaron McKenna¹, Matthew Hanna, Eric Banks, Andrey Sivachenko, Kristian Cibulskis, Andrew Kernytsky, Kiran V. Garimella, David Altshuler, Stacey Gabriel, Mark J. Daly, Mark A. DePristo

Broad Institute¹

01 Sep 2010-Genome Research

TL;DR: The GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

Abstract: Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS—the 1000 Genome pilot alone includes nearly five terabases—make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

20,557 citations

Journal Article

2013 ESH/ESC Guidelines for the management of arterial hypertension: The Task Force for the management of arterial hypertension of the European Society of Hypertension (ESH) and of the European Society of Cardiology (ESC).

Giuseppe Mancia¹, Robert Fagard, Krzysztof Narkiewicz, Josep Redon, Alberto Zanchetti, Michael Böhm, Thierry Christiaens, Renata Cifkova, Guy De Backer, Anna F. Dominiczak, Maurizio Galderisi, Diederick E. Grobbee, Tiny Jaarsma, Paulus Kirchhof, Sverre E. Kjeldsen, Stéphane Laurent, Athanasios J. Manolis, Peter M. Nilsson, Luis M. Ruilope, Roland E. Schmieder, Per Anton Sirnes, Peter Sleight, Margus Viigimaa, Bernard Waeber, Faiez Zannad, Michel Burnier, Ettore Ambrosioni, Mark Caufield, Antonio Coca, Michael H. Olsen, Costas Tsioufis, Philippe van de Borne, José Luis Zamorano, Stephan Achenbach, Helmut Baumgartner, Jeroen J. Bax, Héctor Bueno, Veronica Dean, Christi Deaton, Çetin Erol, Roberto Ferrari, David Hasdai, Arno W. Hoes, Juhani Knuuti, Philippe Kolh², Patrizio Lancellotti, Aleš Linhart, Petros Nihoyannopoulos, Massimo F Piepoli, Piotr Ponikowski, Juan Tamargo, Michal Tendera, Adam Torbicki, William Wijns, Stephan Windecker, Denis Clement, Thierry C. Gillebert, Enrico Agabiti Rosei, Stefan D. Anker, Johann Bauersachs, Jana Brguljan Hitij, Mark J. Caulfield, Marc De Buyzere, Sabina De Geest, Geneviève Derumeaux, Serap Erdine, Csaba Farsang, Christian Funck-Brentano, Vjekoslav Gerc, Giuseppe Germanò, Stephan Gielen, Herman Haller, Jens Jordan, Thomas Kahan, Michel Komajda, Dragan Lovic, Heiko Mahrholdt, Jan Östergren, Gianfranco Parati, Joep Perk, Jorge Polónia, Bogdan A. Popescu, Zeljko Reiner, Lars Rydén, Yuriy Sirenko, Alice Stanton, Harry A.J. Struijker-Boudier, Charalambos Vlachopoulos, Massimo Volpe, David A. Wood

University of Milano-Bicocca¹, University of Liège²

21 Jul 2013-European Heart Journal

TL;DR: The 2013 guidelines on the management of arterial hypertension from the joint Task Force of the European Society of Hypertension (ESH) and the European Society of Cardiology (ESC).

Abstract: ABCD: Appropriate Blood pressure Control in Diabetes; ABI: ankle–brachial index; ABPM: ambulatory blood pressure monitoring; ACCESS: Acute Candesartan Cilexetil Therapy in Stroke Survival; ACCOMPLISH: Avoiding Cardiovascular Events in Combination Therapy in Patients Living with Systolic Hypertension; ACCORD: Action to Control Cardiovascular Risk in Diabetes; ACE: angiotensin-converting enzyme; ACTIVE I: Atrial Fibrillation Clopidogrel Trial with Irbesartan for Prevention of Vascular Events; ADVANCE: Action in Diabetes and Vascular Disease: Preterax and Diamicron-MR Controlled Evaluation; AHEAD: Action for HEAlth in Diabetes; ALLHAT: Antihypertensive and Lipid-Lowering Treatment to Prevent Heart ATtack; ALTITUDE: ALiskiren Trial In Type 2 Diabetes Using Cardio-renal Endpoints; ANTIPAF: ANgioTensin II Antagonist In Paroxysmal Atrial Fibrillation; APOLLO: A Randomized Controlled Trial of Aliskiren in the Prevention of Major Cardiovascular Events in Elderly People; ARB: angiotensin receptor blocker; ARIC: Atherosclerosis Risk In Communities; ARR: aldosterone renin ratio; ASCOT: Anglo-Scandinavian Cardiac Outcomes Trial; ASCOT-LLA: Anglo-Scandinavian Cardiac Outcomes Trial—Lipid Lowering Arm; ASTRAL: Angioplasty and STenting for Renal Artery Lesions; A-V: atrioventricular; BB: beta-blocker; BMI: body mass index; BP: blood pressure; BSA: body surface area; CA: calcium antagonist; CABG: coronary artery bypass graft; CAPPP: CAPtopril Prevention Project; CAPRAF: CAndesartan in the Prevention of Relapsing Atrial Fibrillation; CHD: coronary heart disease; CHHIPS: Controlling Hypertension and Hypertension Immediately Post-Stroke; CKD: chronic kidney disease; CKD-EPI: Chronic Kidney Disease—EPIdemiology collaboration; CONVINCE: Controlled ONset Verapamil INvestigation of CV Endpoints; CT: computed tomography; CV: cardiovascular; CVD: cardiovascular disease; D: diuretic; DASH: Dietary Approaches to Stop Hypertension; DBP: diastolic blood pressure; DCCT: Diabetes Control and Complications Study; DIRECT: DIabetic REtinopathy Candesartan Trials; DM: diabetes mellitus; DPP-4: dipeptidyl peptidase 4; EAS: European Atherosclerosis Society; EASD: European Association for the Study of Diabetes; ECG: electrocardiogram; EF: ejection fraction; eGFR: estimated glomerular filtration rate; ELSA: European Lacidipine Study on Atherosclerosis; ESC: European Society of Cardiology; ESH: European Society of Hypertension; ESRD: end-stage renal disease; EXPLOR: Amlodipine–Valsartan Combination Decreases Central Systolic Blood Pressure more Effectively than the Amlodipine–Atenolol Combination; FDA: U.S. Food and Drug Administration; FEVER: Felodipine EVent Reduction study; GISSI-AF: Gruppo Italiano per lo Studio della Sopravvivenza nell'Infarto Miocardico-Atrial Fibrillation; HbA1c: glycated haemoglobin; HBPM: home blood pressure monitoring; HOPE: Heart Outcomes Prevention Evaluation; HOT: Hypertension Optimal Treatment; HRT: hormone replacement therapy; HT: hypertension; HYVET: HYpertension in the Very Elderly Trial; IMT: intima-media thickness; I-PRESERVE: Irbesartan in Heart Failure with Preserved Systolic Function; INTERHEART: Effect of Potentially Modifiable Risk Factors associated with Myocardial Infarction in 52 Countries; INVEST: INternational VErapamil SR/T Trandolapril; ISH: Isolated systolic hypertension; JNC: Joint National Committee; JUPITER: Justification for the Use of Statins in Primary Prevention: an Intervention Trial Evaluating Rosuvastatin; LAVi: left atrial volume index; LIFE: Losartan Intervention For Endpoint Reduction in Hypertensives; LV: left ventricle/left ventricular; LVH: left ventricular hypertrophy; LVM: left ventricular mass; MDRD: Modification of Diet in Renal Disease; MRFIT: Multiple Risk Factor Intervention Trial; MRI: magnetic resonance imaging; NORDIL: The Nordic Diltiazem Intervention study; OC: oral contraceptive; OD: organ damage; ONTARGET: ONgoing Telmisartan Alone and in Combination with Ramipril Global Endpoint Trial; PAD: peripheral artery disease; PATHS: Prevention And Treatment of Hypertension Study; PCI: percutaneous coronary intervention; PPAR: peroxisome proliferator-activated receptor; PREVEND: Prevention of REnal and Vascular ENdstage Disease; PROFESS: Prevention Regimen for Effectively Avoiding Secondary Strokes; PROGRESS: Perindopril Protection Against Recurrent Stroke Study; PWV: pulse wave velocity; QALY: Quality adjusted life years; RAA: renin-angiotensin-aldosterone; RAS: renin-angiotensin system; RCT: randomized controlled trials; RF: risk factor; ROADMAP: Randomized Olmesartan And Diabetes MicroAlbuminuria Prevention; SBP: systolic blood pressure; SCAST: Angiotensin-Receptor Blocker Candesartan for Treatment of Acute STroke; SCOPE: Study on COgnition and Prognosis in the Elderly; SCORE: Systematic COronary Risk Evaluation; SHEP: Systolic Hypertension in the Elderly Program; STOP: Swedish Trials in Old Patients with Hypertension; STOP-2: The second Swedish Trial in Old Patients with Hypertension; SYSTCHINA: SYSTolic Hypertension in the Elderly: Chinese trial; SYSTEUR: SYSTolic Hypertension in Europe; TIA: transient ischaemic attack; TOHP: Trials Of Hypertension Prevention; TRANSCEND: Telmisartan Randomised AssessmeNt Study in ACE iNtolerant subjects with cardiovascular Disease; UKPDS: United Kingdom Prospective Diabetes Study; VADT: Veterans' Affairs Diabetes Trial; VALUE: Valsartan Antihypertensive Long-term Use Evaluation; WHO: World Health Organization.

1.1 Principles

The 2013 guidelines on hypertension of the European Society of Hypertension (ESH) and the European Society of Cardiology …

14,173 citations

Journal Article

Haploview: analysis and visualization of LD and haplotype maps

Jeffrey C. Barrett¹, Ben Fry¹, Julian Maller¹, Mark J. Daly¹

Massachusetts Institute of Technology¹

15 Jan 2005-Bioinformatics

TL;DR: Haploview is a software package that provides computation of linkage disequilibrium statistics and population haplotype patterns from primary genotype data in a visually appealing and interactive interface.

Abstract: Summary: Research over the last few years has revealed significant haplotype structure in the human genome. The characterization of these patterns, particularly in the context of medical genetic association studies, is becoming a routine research activity. Haploview is a software package that provides computation of linkage disequilibrium statistics and population haplotype patterns from primary genotype data in a visually appealing and interactive interface. Availability: http://www.broad.mit.edu/mpg/haploview/ Contact: jcbarret@broad.mit.edu

13,862 citations

Journal Article

DnaSP v5

Pablo Librado¹, Julio Rozas¹

University of Barcelona¹

01 Jun 2009-Bioinformatics

TL;DR: Version 5 implements a number of new features and analytical methods allowing extensive DNA polymorphism analyses on large datasets, including visualizing sliding window results integrated with available genome annotations in the UCSC browser.

Abstract: Motivation: DnaSP is a software package for a comprehensive analysis of DNA polymorphism data. Version 5 implements a number of new features and analytical methods allowing extensive DNA polymorphism analyses on large datasets. Among other features, the newly implemented methods allow for: (i) analyses on multiple data files; (ii) haplotype phasing; (iii) analyses on insertion/deletion polymorphism data; (iv) visualizing sliding window results integrated with available genome annotations in the UCSC browser. Availability: Freely available to academic users from: http://www.ub.edu/dnasp Contact: [email protected]

13,511 citations

Journal Article

A global reference for human genetic variation.

Adam Auton, Gonçalo R. Abecasis, David Altshuler, Richard Durbin +514 more (90 institutions)

01 Oct 2015-Nature

TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping.

Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

12,661 citations
