Home
/
Authors
/
Gonçalo R. Abecasis

Author

Gonçalo R. Abecasis

Other affiliations: Johns Hopkins University School of Medicine, Wellcome Trust Centre for Human Genetics, University of California, Los Angeles ...read more

Bio: Gonçalo R. Abecasis is an academic researcher from University of Michigan. The author has contributed to research in topics: Genome-wide association study & Population. The author has an hindex of 179, co-authored 595 publications receiving 230323 citations. Previous affiliations of Gonçalo R. Abecasis include Johns Hopkins University School of Medicine & Wellcome Trust Centre for Human Genetics.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Genetic variants regulating ORMDL3 expression contribute to the risk of childhood asthma

[...]

Miriam F. Moffatt¹, Michael Kabesch², Liming Liang, Anna L. Dixon³, David P. Strachan⁴, Simon Heath⁵, Martin Depner², Andrea von Berg, Albrecht Bufe⁶, Ernst Rietschel⁷, Andrea Heinzmann⁸, Burkard Simma, Thomas Frischer⁸, Saffron A.G. Willis-Owen¹, Kenny C. C. Wong¹, Thomas Illig, Christian Vogelberg⁹, Stephan K. Weiland¹⁰, Erika von Mutius², Gonçalo R. Abecasis, Martin Farrall³, Ivo Gut⁵, G. Mark Lathrop⁵, William O.C.M. Cookson¹ - Show less +20 more•Institutions (10)

National Institutes of Health¹, Ludwig Maximilian University of Munich², Wellcome Trust Centre for Human Genetics³, St George's, University of London⁴, French Alternative Energies and Atomic Energy Commission⁵, Ruhr University Bochum⁶, University of Cologne⁷, Boston Children's Hospital⁸, Dresden University of Technology⁹, University of Ulm¹⁰

26 Jul 2007-Nature

TL;DR: The results indicate that genetic variants regulating ORMDL3 expression are determinants of susceptibility to childhood asthma.

...read moreread less

Abstract: Rates of childhood asthma diagnosis are rising: 6% of children in the United States are sufferers. Both genetic and environmental factors are clearly important. To discover more about the genetic element, Moffatt et al. looked for genes linked to asthma in a genome-wide association scan. More than a third of children with asthma of onset below the age of seven showed variations in expression of the ORMDL3 gene on chromosome 17. Similar genes are found in yeast and other primitive organisms, suggesting that they may be components of an ancient and conserved immune mechanism. Variations in expression of the gene ORMDL3 were found to be associated with development of childhood asthma, suggesting this gene should be examined in more patient groups. Asthma is caused by a combination of poorly understood genetic and environmental factors1,2. We have systematically mapped the effects of single nucleotide polymorphisms (SNPs) on the presence of childhood onset asthma by genome-wide association. We characterized more than 317,000 SNPs in DNA from 994 patients with childhood onset asthma and 1,243 non-asthmatics, using family and case-referent panels. Here we show multiple markers on chromosome 17q21 to be strongly and reproducibly associated with childhood onset asthma in family and case-referent panels with a combined P value of P < 10-12. In independent replication studies the 17q21 locus showed strong association with diagnosis of childhood asthma in 2,320 subjects from a cohort of German children (P = 0.0003) and in 3,301 subjects from the British 1958 Birth Cohort (P = 0.0005). We systematically evaluated the relationships between markers of the 17q21 locus and transcript levels of genes in Epstein–Barr virus (EBV)-transformed lymphoblastoid cell lines from children in the asthma family panel used in our association study. The SNPs associated with childhood asthma were consistently and strongly associated (P < 10-22) in cis with transcript levels of ORMDL3, a member of a gene family that encodes transmembrane proteins anchored in the endoplasmic reticulum3. The results indicate that genetic variants regulating ORMDL3 expression are determinants of susceptibility to childhood asthma.

...read moreread less

1,515 citations

Journal Article•DOI•

Human polymorphism at microRNAs and microRNA target sites.

[...]

Liuqing Yang, Chunru Lin, Chunyu Jin, Joy C. Yang +165 more•Institutions (1)

01 Jan 2013-Frontiers in Genetics

1,514 citations

Journal Article•DOI•

A note on exact tests of Hardy-Weinberg equilibrium.

[...]

Janis E. Wigginton¹, David J. Cutler², Gonçalo R. Abecasis¹•Institutions (2)

University of Michigan¹, Johns Hopkins University School of Medicine²

01 May 2005-American Journal of Human Genetics

TL;DR: These methods adequately control type I error in large and small samples and are computationally efficient and will be useful for quality assessment of genotype data and for the detection of genetic association or population stratification in very large data sets.

...read moreread less

Abstract: Deviations from Hardy-Weinberg equilibrium (HWE) can indicate inbreeding, population stratification, and even problems in genotyping. In samples of affected individuals, these deviations can also provide evidence for association. Tests of HWE are commonly performed using a simple χ2 goodness-of-fit test. We show that this χ2 test can have inflated type I error rates, even in relatively large samples (e.g., samples of 1,000 individuals that include ∼100 copies of the minor allele). On the basis of previous work, we describe exact tests of HWE together with efficient computational methods for their implementation. Our methods adequately control type I error in large and small samples and are computationally efficient. They have been implemented in freely available code that will be useful for quality assessment of genotype data and for the detection of genetic association or population stratification in very large data sets.

...read moreread less

1,374 citations

Journal Article•DOI•

Common variants at 30 loci contribute to polygenic dyslipidemia.

[...]

Sekar Kathiresan¹, Sekar Kathiresan², Sekar Kathiresan³, Cristen J. Willer⁴, Gina M. Peloso², Serkalem Demissie², Kiran Musunuru³, Eric E. Schadt⁵, Lee M. Kaplan³, Derrick A Bennett⁶, Yun Li⁴, Toshiko Tanaka⁷, Benjamin F. Voight¹, Benjamin F. Voight³, Lori L. Bonnycastle⁷, Anne U. Jackson⁴, Gabriel Crawford¹, Aarti Surti¹, Candace Guiducci¹, Noël P. Burtt¹, Sarah Parish⁶, Robert Clarke⁶, Diana Zelenika, Kari Kubalanza⁷, Mario A. Morken⁷, Laura J. Scott⁴, Heather M. Stringham⁴, Pilar Galan⁸, Amy J. Swift⁷, Johanna Kuusisto⁹, Richard N. Bergman¹⁰, Jouko Sundvall¹¹, Markku Laakso⁹, Luigi Ferrucci⁷, Paul Scheet⁴, Serena Sanna, Manuela Uda, Qiong Yang², Kathryn L. Lunetta², Josée Dupuis², Paul I.W. de Bakker³, Christopher J. O'Donnell², Christopher J. O'Donnell⁷, John C. Chambers¹², Jaspal S. Kooner¹², Serge Hercberg⁸, Pierre Meneton, Edward G. Lakatta⁷, Angelo Scuteri, David Schlessinger⁷, Jaakko Tuomilehto¹¹, Francis S. Collins⁷, Leif Groop¹³, Leif Groop¹⁴, David Altshuler³, David Altshuler¹, Rory Collins⁶, G. Mark Lathrop, Olle Melander¹³, Veikko Salomaa¹¹, Leena Peltonen¹⁴, Leena Peltonen¹, Leena Peltonen¹⁵, Marju Orho-Melander¹³, Jose M. Ordovas¹⁶, Michael Boehnke⁴, Gonçalo R. Abecasis⁴, Karen L. Mohlke¹⁷, L. Adrienne Cupples² - Show less +65 more•Institutions (17)

Massachusetts Institute of Technology¹, Boston University², Harvard University³, University of Michigan⁴, Merck & Co.⁵, University of Oxford⁶, National Institutes of Health⁷, French Institute of Health and Medical Research⁸, University of Eastern Finland⁹, University of Southern California¹⁰, National Institute for Health and Welfare¹¹, Imperial College London¹², Lund University¹³, University of Helsinki¹⁴, Wellcome Trust Sanger Institute¹⁵, Tufts University¹⁶, University of North Carolina at Chapel Hill¹⁷

01 Jan 2009-Nature Genetics

TL;DR: The results suggest that the cumulative effect of multiple common variants contributes to polygenic dyslipidemia.

...read moreread less

Abstract: Blood low-density lipoprotein (LDL) cholesterol, high-density lipoprotein (HDL) cholesterol and triglyceride levels are risk factors for cardiovascular disease. To dissect the polygenic basis of these traits, we conducted genome-wide association screens in 19,840 individuals and replication in up to 20,623 individuals. We identified 30 distinct loci associated with lipoprotein concentrations (each with P < 5 x 10(-8)), including 11 loci that reached genome-wide significance for the first time. The 11 newly defined loci include common variants associated with LDL cholesterol near ABCG8, MAFB, HNF1A and TIMD4; with HDL cholesterol near ANGPTL4, FADS1-FADS2-FADS3, HNF4A, LCAT, PLTP and TTC39B; and with triglycerides near AMAC1L2, FADS1-FADS2-FADS3 and PLTP. The proportion of individuals exceeding clinical cut points for high LDL cholesterol, low HDL cholesterol and high triglycerides varied according to an allelic dosage score (P < 10(-15) for each trend). These results suggest that the cumulative effect of multiple common variants contributes to polygenic dyslipidemia.

...read moreread less

1,358 citations

Journal Article•DOI•

Replicating genotype–phenotype associations

[...]

Stephen J. Chanock¹, Teri A. Manolio¹, Michael Boehnke², Eric Boerwinkle³, David J. Hunter⁴, Gilles Thomas¹, Joel N. Hirschhorn⁵, Gonçalo R. Abecasis², David Altshuler⁵, Joan E. Bailey-Wilson¹, Lisa D. Brooks¹, Lon R. Cardon⁶, Mark J. Daly⁵, Peter Donnelly⁷, Joseph F. Fraumeni¹, Nelson B. Freimer⁸, Daniela S. Gerhard¹, Chris Gunter, Alan E. Guttmacher¹, Mark S. Guyer¹, Emily L. Harris¹, Josephine Hoh⁹, Robert N. Hoover¹, C. Augustine Kong¹⁰, Kathleen R. Merikangas¹, Cynthia C. Morton⁴, Lyle J. Palmer¹¹, Elizabeth G. Phimister, John P. Rice¹², Jerry Roberts¹, Charles N. Rotimi¹³, Margaret A. Tucker¹, Kyle Vogan, Sholom Wacholder¹, Ellen M. Wijsman¹⁴, Deborah M. Winn¹, Francis S. Collins¹ - Show less +33 more•Institutions (14)

National Institutes of Health¹, University of Michigan², University of Texas Health Science Center at Houston³, Harvard University⁴, Broad Institute⁵, Fred Hutchinson Cancer Research Center⁶, University of Oxford⁷, University of California, Los Angeles⁸, Yale University⁹, deCODE genetics¹⁰, University of Western Australia¹¹, Washington University in St. Louis¹², Howard University¹³, University of Washington¹⁴

07 Jun 2007-Nature

TL;DR: What constitutes replication of a genotype–phenotype association, and how best can it be achieved, is investigated.

...read moreread less

Abstract: What constitutes replication of a genotype–phenotype association, and how best can it be achieved?

...read moreread less

1,355 citations

1
2
3
4
5
…
6
7
8
9
10
11
12
…
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Fast and accurate short read alignment with Burrows–Wheeler transform

[...]

Heng Li¹, Richard Durbin¹•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Jul 2009-Bioinformatics

TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.

...read moreread less

Abstract: Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ~10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net Contact: [email protected]

...read moreread less

43,862 citations

Journal Article•DOI•

Fast gapped-read alignment with Bowtie 2

[...]

Ben Langmead¹, Steven L. Salzberg², Steven L. Salzberg¹, Steven L. Salzberg³•Institutions (3)

University of Maryland, College Park¹, Johns Hopkins University², Johns Hopkins University School of Medicine³

01 Apr 2012-Nature Methods

TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

...read moreread less

Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

...read moreread less

37,898 citations

Journal Article•DOI•

PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses

[...]

Shaun Purcell¹, Shaun Purcell², Benjamin M. Neale³, Benjamin M. Neale¹, Kathe Todd-Brown², Lori Thomas², Manuel A. R. Ferreira², David Bender¹, David Bender², Julian Maller¹, Julian Maller², Pamela Sklar¹, Pamela Sklar², Paul I.W. de Bakker², Paul I.W. de Bakker¹, Mark J. Daly¹, Mark J. Daly², Pak C. Sham⁴ - Show less +14 more•Institutions (4)

Massachusetts Institute of Technology¹, Harvard University², University of London³, University of Hong Kong⁴

01 Sep 2007-American Journal of Human Genetics

TL;DR: This work introduces PLINK, an open-source C/C++ WGAS tool set, and describes the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation, which focuses on the estimation and use of identity- by-state and identity/descent information in the context of population-based whole-genome studies.

...read moreread less

Abstract: Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.

...read moreread less

26,280 citations

Journal Article•DOI•

Initial sequencing and analysis of the human genome.

[...]

Eric S. Lander¹, Lauren Linton¹, Bruce W. Birren¹, Chad Nusbaum¹ +245 more•Institutions (29)

15 Feb 2001-Nature

TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.

...read moreread less

Abstract: The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.

...read moreread less

22,269 citations

Journal Article•DOI•

The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data

[...]

Aaron McKenna¹, Matthew Hanna, Eric Banks, Andrey Sivachenko, Kristian Cibulskis, Andrew Kernytsky, Kiran V. Garimella, David Altshuler, Stacey Gabriel, Mark J. Daly, Mark A. DePristo - Show less +7 more•Institutions (1)

Broad Institute¹

01 Sep 2010-Genome Research

TL;DR: The GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

...read moreread less

Abstract: Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS—the 1000 Genome pilot alone includes nearly five terabases—make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

...read moreread less

20,557 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse