Home
/
Authors
/
Dennis G. Ballinger

Author

Dennis G. Ballinger

Other affiliations: International Computer Science Institute, University of California, San Diego

Bio: Dennis G. Ballinger is an academic researcher from California Institute of Technology. The author has contributed to research in topics: Single-nucleotide polymorphism & Population. The author has an hindex of 27, co-authored 50 publications receiving 21734 citations. Previous affiliations of Dennis G. Ballinger include International Computer Science Institute & University of California, San Diego.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

A haplotype map of the human genome

[...]

John W. Belmont¹, Andrew Boudreau, Suzanne M. Leal¹, Paul Hardenbol +229 more•Institutions (40)

27 Oct 2005

TL;DR: A public database of common variation in the human genome: more than one million single nucleotide polymorphisms for which accurate and complete genotypes have been obtained in 269 DNA samples from four populations, including ten 500-kilobase regions in which essentially all information about common DNA variation has been extracted.

...read moreread less

Abstract: Inherited genetic variation has a critical but as yet largely uncharacterized role in human disease. Here we report a public database of common variation in the human genome: more than one million single nucleotide polymorphisms (SNPs) for which accurate and complete genotypes have been obtained in 269 DNA samples from four populations, including ten 500-kilobase regions in which essentially all information about common DNA variation has been extracted. These data document the generality of recombination hotspots, a block-like structure of linkage disequilibrium and low haplotype diversity, leading to substantial correlations of SNPs with many of their neighbours. We show how the HapMap resource can guide the design and analysis of genetic association studies, shed light on structural variation and recombination, and identify loci that may have been subject to natural selection during human evolution.

...read moreread less

5,479 citations

Journal Article•DOI•

A second generation human haplotype map of over 3.1 million SNPs

[...]

Kelly A. Frazer¹, Dennis G. Ballinger, David R. Cox, David A. Hinds +234 more•Institutions (48)

18 Oct 2007-Nature

TL;DR: The Phase II HapMap is described, which characterizes over 3.1 million human single nucleotide polymorphisms genotyped in 270 individuals from four geographically diverse populations and includes 25–35% of common SNP variation in the populations surveyed, and increased differentiation at non-synonymous, compared to synonymous, SNPs is demonstrated.

...read moreread less

Abstract: We describe the Phase II HapMap, which characterizes over 3.1 million human single nucleotide polymorphisms (SNPs) genotyped in 270 individuals from four geographically diverse populations and includes 25-35% of common SNP variation in the populations surveyed. The map is estimated to capture untyped common variation with an average maximum r2 of between 0.9 and 0.96 depending on population. We demonstrate that the current generation of commercial genome-wide genotyping products captures common Phase II SNPs with an average maximum r2 of up to 0.8 in African and up to 0.95 in non-African populations, and that potential gains in power in association studies can be obtained through imputation. These data also reveal novel aspects of the structure of linkage disequilibrium. We show that 10-30% of pairs of individuals within a population share at least one region of extended genetic identity arising from recent ancestry and that up to 1% of all common variants are untaggable, primarily because they lie within recombination hotspots. We show that recombination rates vary systematically around genes and between genes of different function. Finally, we demonstrate increased differentiation at non-synonymous, compared to synonymous, SNPs, resulting from systematic differences in the strength or efficacy of natural selection between populations.

...read moreread less

4,565 citations

Journal Article•DOI•

Genome-wide association study identifies novel breast cancer susceptibility loci

[...]

Douglas F. Easton¹, Karen A. Pooley¹, Alison M. Dunning¹, Paul D.P. Pharoah¹, Deborah J. Thompson¹, Dennis G. Ballinger, Jeffery P. Struewing², Jonathan J. Morrison¹, Helen I. Field¹, Robert Luben¹, Nicholas J. Wareham¹, Shahana Ahmed¹, Catherine S. Healey¹, Richard Bowman, Kerstin B. Meyer¹, Christopher A. Haiman³, Laurence K. Kolonel, Brian E. Henderson³, Loic Le Marchand, Paul Brennan⁴, Suleeporn Sangrajrang, Valerie Gaborieau⁴, Fabrice Odefrey⁴, Chen-Yang Shen⁵, Pei-Ei Wu⁵, Hui-Chun Wang⁵, Diana Eccles⁶, D. Gareth Evans⁷, Julian Peto⁸, Olivia Fletcher⁹, Nichola Johnson⁹, Sheila Seal, Michael R. Stratton¹⁰, Nazneen Rahman, Georgia Chenevix-Trench¹¹, Georgia Chenevix-Trench¹², Stig E. Bojesen¹³, Børge G. Nordestgaard¹³, C K Axelsson¹³, Montserrat Garcia-Closas², Louise A. Brinton², Stephen J. Chanock², Jolanta Lissowska¹⁴, Beata Peplonska¹⁵, Heli Nevanlinna¹⁶, Rainer Fagerholm¹⁶, H Eerola¹⁶, Daehee Kang¹⁷, Keun-Young Yoo¹⁷, Dong-Young Noh¹⁷, Sei Hyun Ahn¹⁸, David J. Hunter¹⁹, Susan E. Hankinson¹⁹, David G. Cox¹⁹, Per Hall²⁰, Sara Wedrén²⁰, Jianjun Liu²¹, Yen-Ling Low²¹, Natalia Bogdanova²², Peter Schu¨rmann²², Do¨rk Do¨rk²², Rob A. E. M. Tollenaar²³, Catharina E. Jacobi²³, Peter Devilee²³, Jan G. M. Klijn²⁴, Alice J. Sigurdson², Michele M. Doody², Bruce H. Alexander²⁵, Jinghui Zhang², Angela Cox²⁶, Ian W. Brock²⁶, Gordon MacPherson²⁶, Malcolm W.R. Reed²⁶, Fergus J. Couch²⁷, Ellen L. Goode²⁷, Janet E. Olson²⁷, Hanne Meijers-Heijboer²⁸, Hanne Meijers-Heijboer²⁴, Ans M.W. van den Ouweland²⁴, André G. Uitterlinden²⁴, Fernando Rivadeneira²⁴, Roger L. Milne²⁹, Gloria Ribas²⁹, Anna González-Neira²⁹, Javier Benitez²⁹, John L. Hopper³⁰, Margaret R. E. McCredie¹², Margaret R. E. McCredie³¹, Margaret R. E. McCredie³², Melissa C. Southey¹², Melissa C. Southey³⁰, Graham G. Giles³³, Chris Schroen³⁰, Christina Justenhoven³⁴, Christina Justenhoven³⁵, Hiltrud Brauch³⁵, Hiltrud Brauch³⁴, Ute Hamann³⁶, Yon-Dschun Ko, Amanda B. Spurdle¹¹, Jonathan Beesley¹¹, Xiaoqing Chen¹¹, _ kConFab³⁷, Arto Mannermaa³⁷, Veli-Matti Kosma³⁷, Vesa Kataja³⁷, Jaana M. Hartikainen³⁷, Nicholas E. Day¹, David Cox, Bruce A.J. Ponder¹ - Show less +106 more•Institutions (37)

University of Cambridge¹, National Institutes of Health², University of Southern California³, International Agency for Research on Cancer⁴, Academia Sinica⁵, Princess Anne Hospital⁶, St Mary's Hospital⁷, University of London⁸, The Breast Cancer Research Foundation⁹, Wellcome Trust Sanger Institute¹⁰, QIMR Berghofer Medical Research Institute¹¹, Peter MacCallum Cancer Centre¹², University of Copenhagen¹³, Curie Institute¹⁴, Nofer Institute of Occupational Medicine¹⁵, University of Helsinki¹⁶, Seoul National University¹⁷, University of Ulsan¹⁸, Harvard University¹⁹, Karolinska Institutet²⁰, Agency for Science, Technology and Research²¹, Hannover Medical School²², Leiden University²³, Erasmus University Rotterdam²⁴, University of Minnesota²⁵, University of Sheffield²⁶, Mayo Clinic²⁷, VU University Amsterdam²⁸, Carlos III Health Institute²⁹, University of Melbourne³⁰, University of Otago³¹, Cancer Council New South Wales³², Cancer Council Victoria³³, University of Tübingen³⁴, Bosch³⁵, German Cancer Research Center³⁶, University of Eastern Finland³⁷

28 Jun 2007-Nature

TL;DR: To identify further susceptibility alleles, a two-stage genome-wide association study in 4,398 breast cancer cases and 4,316 controls was conducted, followed by a third stage in which 30 single nucleotide polymorphisms were tested for confirmation.

...read moreread less

Abstract: Breast cancer exhibits familial aggregation, consistent with variation in genetic susceptibility to the disease. Known susceptibility genes account for less than 25% of the familial risk of breast cancer, and the residual genetic variance is likely to be due to variants conferring more moderate risks. To identify further susceptibility alleles, we conducted a two-stage genome-wide association study in 4,398 breast cancer cases and 4,316 controls, followed by a third stage in which 30 single nucleotide polymorphisms (SNPs) were tested for confirmation in 21,860 cases and 22,578 controls from 22 studies. We used 227,876 SNPs that were estimated to correlate with 77% of known common SNPs in Europeans at r2.0.5. SNPs in five novel independent loci exhibited strong and consistent evidence of association with breast cancer (P,1027). Four of these contain plausible causative genes (FGFR2, TNRC9, MAP3K1 and LSP1). At the second stage, 1,792 SNPs were significant at the P,0.05 level compared with an estimated 1,343 that would be expected by chance, indicating that many additional common susceptibility alleles may be identifiable by this approach.

...read moreread less

2,288 citations

Journal Article•DOI•

Genome-wide detection and characterization of positive selection in human populations

[...]

Pardis C. Sabeti¹, Pardis C. Sabeti², Patrick Varilly², Patrick Varilly¹ +255 more•Institutions (50)

18 Oct 2007-Nature

TL;DR: ‘Long-range haplotype’ methods, which were developed to identify alleles segregating in a population that have undergone recent selection, and new methods that are based on cross-population comparisons to discover alleles that have swept to near-fixation within a population are developed.

...read moreread less

Abstract: With the advent of dense maps of human genetic variation, it is now possible to detect positive natural selection across the human genome. Here we report an analysis of over 3 million polymorphisms from the International HapMap Project Phase 2 (HapMap2). We used 'long-range haplotype' methods, which were developed to identify alleles segregating in a population that have undergone recent selection, and we also developed new methods that are based on cross-population comparisons to discover alleles that have swept to near-fixation within a population. The analysis reveals more than 300 strong candidate regions. Focusing on the strongest 22 regions, we develop a heuristic for scrutinizing these regions to identify candidate targets of selection. In a complementary analysis, we identify 26 non-synonymous, coding, single nucleotide polymorphisms showing regional evidence of positive selection. Examination of these candidates highlights three cases in which two genes in a common biological process have apparently undergone positive selection in the same population:LARGE and DMD, both related to infection by the Lassa virus, in West Africa;SLC24A5 and SLC45A2, both involved in skin pigmentation, in Europe; and EDAR and EDA2R, both involved in development of hair follicles, in Asia.

...read moreread less

1,778 citations

Journal Article•DOI•

Whole-Genome Patterns of Common DNA Variation in Three Human Populations

[...]

David A. Hinds¹, David A. Hinds², Laura L. Stuve¹, Laura L. Stuve², Geoffrey B. Nilsen¹, Geoffrey B. Nilsen², Eran Halperin², Eran Halperin¹, Eleazar Eskin², Eleazar Eskin¹, Dennis G. Ballinger², Dennis G. Ballinger¹, Kelly A. Frazer², Kelly A. Frazer¹, David R. Cox¹, David R. Cox² - Show less +12 more•Institutions (2)

International Computer Science Institute¹, University of California, San Diego²

18 Feb 2005-Science

TL;DR: This work has characterized whole-genome patterns of common human DNA variation by genotyping 1,586,383 single-nucleotide polymorphisms (SNPs) in 71 Americans of European, African, and Asian ancestry and indicates that these SNPs capture most common genetic variation as a result of linkage disequilibrium.

...read moreread less

Abstract: Individual differences in DNA sequence are the genetic basis of human variability. We have characterized whole-genome patterns of common human DNA variation by genotyping 1,586,383 single-nucleotide polymorphisms (SNPs) in 71 Americans of European, African, and Asian ancestry. Our results indicate that these SNPs capture most common genetic variation as a result of linkage disequilibrium, the correlation among common SNP alleles. We observe a strong correlation between extended regions of linkage disequilibrium and functional genomic elements. Our data provide a tool for exploring many questions that remain regarding the causal role of common human DNA variation in complex human traits and for investigating the nature of genetic variation within and between human populations.

...read moreread less

1,197 citations

1
2
3
4
…
5
6
7
8
9
10

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

A global reference for human genetic variation.

[...]

Adam Auton¹, Gonçalo R. Abecasis², David Altshuler³, Richard Durbin⁴ +514 more•Institutions (90)

01 Oct 2015-Nature

TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-generation sequencing, deep exome sequencing, and dense microarray genotyping.

...read moreread less

Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

...read moreread less

12,661 citations

Journal Article•DOI•

A framework for variation discovery and genotyping using next-generation DNA sequencing data

[...]

Mark A. DePristo¹, Eric Banks¹, Ryan Poplin¹, Kiran V. Garimella¹, Jared Maguire¹, Christopher Hartl¹, Anthony A. Philippakis², Anthony A. Philippakis¹, Anthony A. Philippakis³, Guillermo del Angel¹, Manuel A. Rivas³, Manuel A. Rivas¹, Matt Hanna¹, Aaron McKenna¹, Timothy Fennell¹, Andrew Kernytsky¹, Andrey Sivachenko¹, Kristian Cibulskis¹, Stacey Gabriel¹, David Altshuler¹, David Altshuler³, Mark J. Daly³, Mark J. Daly¹ - Show less +19 more•Institutions (3)

Broad Institute¹, Brigham and Women's Hospital², Harvard University³

01 May 2011-Nature Genetics

TL;DR: A unified analytic framework to discover and genotype variation among multiple samples simultaneously that achieves sensitive and specific results across five sequencing technologies and three distinct, canonical experimental designs is presented.

...read moreread less

Abstract: Recent advances in sequencing technology make it possible to comprehensively catalogue genetic variation in population samples, creating a foundation for understanding human disease, ancestry and evolution. The amounts of raw data produced are prodigious and many computational steps are required to translate this output into high-quality variant calls. We present a unified analytic framework to discover and genotype variation among multiple samples simultaneously that achieves sensitive and specific results across five sequencing technologies and three distinct, canonical experimental designs. Our process includes (1) initial read mapping; (2) local realignment around indels; (3) base quality score recalibration; (4) SNP discovery and genotyping to find all potential variants; and (5) machine learning to separate true segregating variation from machine artifacts common to next-generation sequencing technologies. We discuss the application of these tools, instantiated in the Genome Analysis Toolkit (GATK), to deep whole-genome, whole-exome capture, and multi-sample low-pass (~4×) 1000 Genomes Project datasets.

...read moreread less

10,056 citations

Journal Article•DOI•

Principal components analysis corrects for stratification in genome-wide association studies

[...]

Alkes L. Price¹, Alkes L. Price², Nick Patterson², Robert M. Plenge³, Robert M. Plenge², Michael E. Weinblatt³, Nancy A. Shadick³, David Reich², David Reich¹ - Show less +5 more•Institutions (3)

Harvard University¹, Broad Institute², Brigham and Women's Hospital³

23 Jul 2006-Nature Genetics

TL;DR: This work describes a method that enables explicit detection and correction of population stratification on a genome-wide scale and uses principal components analysis to explicitly model ancestry differences between cases and controls.

...read moreread less

Abstract: Population stratification—allele frequency differences between cases and controls due to systematic ancestry differences—can cause spurious associations in disease studies. We describe a method that enables explicit detection and correction of population stratification on a genome-wide scale. Our method uses principal components analysis to explicitly model ancestry differences between cases and controls. The resulting correction is specific to a candidate marker’s variation in frequency across ancestral populations, minimizing spurious associations while maximizing power to detect true associations. Our simple, efficient approach can easily be applied to disease studies with hundreds of thousands of markers. Population stratification—allele frequency differences between cases and controls due to systematic ancestry differences—can cause spurious associations in disease studies 1‐8 . Because the effects of stratification vary in proportion to the number of samples 9 , stratification will be an increasing problem in the large-scale association studies of the future, which will analyze thousands of samples in an effort to detect common genetic variants of weak effect. The two prevailing methods for dealing with stratification are genomic control and structured association 9‐14 . Although genomic control and structured association have proven useful in a variety of contexts, they have limitations. Genomic control corrects for stratification by adjusting association statistics at each marker by a uniform overall inflation factor. However, some markers differ in their allele frequencies across ancestral populations more than others. Thus, the uniform adjustment applied by genomic control may be insufficient at markers having unusually strong differentiation across ancestral populations and may be superfluous at markers devoid of such differentiation, leading to a loss in power. Structured association uses a program such as STRUCTURE 15 to assign the samples to discrete subpopulation clusters and then aggregates evidence of association within each cluster. If fractional membership in more than one cluster is allowed, the method cannot currently be applied to genome-wide association studies because of its intensive computational cost on large data sets. Furthermore, assignments of individuals to clusters are highly sensitive to the number of clusters, which is not well defined 14,16 .

...read moreread less

9,387 citations

Journal Article•DOI•

Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls

[...]

Paul Burton¹, David Clayton², Lon R. Cardon, Nicholas John Craddock³ +192 more•Institutions (4)

07 Jun 2007-Nature

TL;DR: This study has demonstrated that careful use of a shared control group represents a safe and effective approach to GWA analyses of multiple disease phenotypes; generated a genome-wide genotype database for future studies of common diseases in the British population; and shown that, provided individuals with non-European ancestry are excluded, the extent of population stratification in theBritish population is generally modest.

...read moreread less

Abstract: There is increasing evidence that genome-wide association ( GWA) studies represent a powerful approach to the identification of genes involved in common human diseases. We describe a joint GWA study ( using the Affymetrix GeneChip 500K Mapping Array Set) undertaken in the British population, which has examined similar to 2,000 individuals for each of 7 major diseases and a shared set of similar to 3,000 controls. Case-control comparisons identified 24 independent association signals at P < 5 X 10(-7): 1 in bipolar disorder, 1 in coronary artery disease, 9 in Crohn's disease, 3 in rheumatoid arthritis, 7 in type 1 diabetes and 3 in type 2 diabetes. On the basis of prior findings and replication studies thus-far completed, almost all of these signals reflect genuine susceptibility effects. We observed association at many previously identified loci, and found compelling evidence that some loci confer risk for more than one of the diseases studied. Across all diseases, we identified a large number of further signals ( including 58 loci with single-point P values between 10(-5) and 5 X 10(-7)) likely to yield additional susceptibility loci. The importance of appropriately large samples was confirmed by the modest effect sizes observed at most loci identified. This study thus represents a thorough validation of the GWA approach. It has also demonstrated that careful use of a shared control group represents a safe and effective approach to GWA analyses of multiple disease phenotypes; has generated a genome-wide genotype database for future studies of common diseases in the British population; and shown that, provided individuals with non-European ancestry are excluded, the extent of population stratification in the British population is generally modest. Our findings offer new avenues for exploring the pathophysiology of these important disorders. We anticipate that our data, results and software, which will be widely available to other investigators, will provide a powerful resource for human genetics research.

...read moreread less

9,244 citations

Journal Article•DOI•

Finding the missing heritability of complex diseases

[...]

Teri A. Manolio¹, Francis S. Collins¹, Nancy J. Cox², David Goldstein³, Lucia A. Hindorff¹, David J. Hunter⁴, Mark I. McCarthy⁵, Erin M. Ramos¹, Lon R. Cardon⁶, Aravinda Chakravarti⁷, Judy H. Cho⁸, Alan E. Guttmacher¹, Augustine Kong⁹, Leonid Kruglyak¹⁰, Leonid Kruglyak¹¹, Elaine R. Mardis¹², Charles N. Rotimi¹, Montgomery Slatkin¹³, David Valle⁷, Alice S. Whittemore¹⁴, Michael Boehnke¹⁵, Andrew G. Clark¹⁶, Evan E. Eichler¹⁷, Greg Gibson¹⁸, Jonathan L. Haines¹⁹, Trudy F. C. Mackay²⁰, Steven A. McCarroll⁴, Peter M. Visscher²¹ - Show less +24 more•Institutions (21)

National Institutes of Health¹, University of Chicago², Duke University³, Harvard University⁴, University of Oxford⁵, GlaxoSmithKline⁶, Johns Hopkins University⁷, Yale University⁸, deCODE genetics⁹, Princeton University¹⁰, Howard Hughes Medical Institute¹¹, Washington University in St. Louis¹², University of California, Berkeley¹³, Stanford University¹⁴, University of Michigan¹⁵, Cornell University¹⁶, University of Washington¹⁷, University of Queensland¹⁸, Vanderbilt University¹⁹, North Carolina State University²⁰, QIMR Berghofer Medical Research Institute²¹

08 Oct 2009-Nature

TL;DR: This paper examined potential sources of missing heritability and proposed research strategies, including and extending beyond current genome-wide association approaches, to illuminate the genetics of complex diseases and enhance its potential to enable effective disease prevention or treatment.

...read moreread less

Abstract: Genome-wide association studies have identified hundreds of genetic variants associated with complex human diseases and traits, and have provided valuable insights into their genetic architecture. Most variants identified so far confer relatively small increments in risk, and explain only a small proportion of familial clustering, leading many to question how the remaining, 'missing' heritability can be explained. Here we examine potential sources of missing heritability and propose research strategies, including and extending beyond current genome-wide association approaches, to illuminate the genetics of complex diseases and enhance its potential to enable effective disease prevention or treatment.

...read moreread less

7,797 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse