Gonçalo R. Abecasis

Other affiliations: Johns Hopkins University School of Medicine, Wellcome Trust Centre for Human Genetics, University of California, Los Angeles ...

Bio: Gonçalo R. Abecasis is an academic researcher from University of Michigan. The author has contributed to research in topics: Genome-wide association study & Population. The author has an h-index of 179 and has co-authored 595 publications receiving 230,323 citations. Previous affiliations of Gonçalo R. Abecasis include Johns Hopkins University School of Medicine & Wellcome Trust Centre for Human Genetics.

[Chart: papers published on a yearly basis, 2000–2023]

Papers

Journal Article

Exome-wide association study reveals novel psoriasis susceptibility locus at TNFSF15 and rare protective alleles in genes contributing to type I IFN signalling

Nick Dand¹, Sören Mucha², Lam C. Tsoi³, Satveer K. Mahil¹, Philip E. Stuart, Andreas Arnold⁴, Hansjörg Baurecht², A. David Burden⁵, Kristina Callis Duffin⁶, Vinod Chandran, Charles Curtis¹,⁷, Sayantan Das³, David Ellinghaus², Eva Ellinghaus², Charlotta Enerbäck⁸, Tõnu Esko⁹, Dafna D. Gladman¹⁰,¹¹, Christopher E.M. Griffiths¹², Johann E. Gudjonsson, Per Hoffman¹³,¹⁴, Georg Homuth¹⁵, Ulrike Hüffmeier¹⁶, Gerald G. Krueger⁶, Matthias Laudes, Sang Hyuck Lee¹,⁷, Wolfgang Lieb², Henry W. Lim¹⁷, Sabine Löhr¹⁶, Ulrich Mrowietz², Martina Müller-Nurayid, Markus M. Nöthen¹⁴, Annette Peters, Proton Rahman¹⁸, André Reis¹⁶, Nick J. Reynolds¹⁹, Elke Rodriguez², Carsten Oliver Schmidt⁴, Sarah L. Spain¹, Konstantin Strauch, Trilokraj Tejasvi, John J. Voorhees, Richard B. Warren¹², Michael Weichenthal², Stephan Weidinger², Matthew Zawistowski³, Rajan P. Nair, Francesca Capon¹, Catherine H. Smith¹, Richard C. Trembath¹, Gonçalo R. Abecasis³, James T. Elder, Andre Franke², Michael A. Simpson¹, Jonathan Barker¹

King's College London¹, University of Kiel², University of Michigan³, Greifswald University Hospital⁴, University of Glasgow⁵, University of Utah⁶, South London and Maudsley NHS Foundation Trust⁷, Linköping University⁸, University of Tartu⁹, University of Toronto¹⁰, University Health Network¹¹, Manchester Academic Health Science Centre¹², University of Basel¹³, University of Bonn¹⁴, University of Greifswald¹⁵, University of Erlangen-Nuremberg¹⁶, Henry Ford Hospital¹⁷, Memorial University of Newfoundland¹⁸, Newcastle University¹⁹

01 Nov 2017-Human Molecular Genetics

TL;DR: Previous reports of protective low-frequency protein-altering variants within IFIH1 and TYK2 (encoding an innate antiviral receptor and Janus kinase) are validated, establishing a further series of protective rare variants.

Abstract: Psoriasis is a common inflammatory skin disorder for which multiple genetic susceptibility loci have been identified, but few resolved to specific functional variants. In this study, we sought to identify common and rare psoriasis-associated gene-centric variation. Using exome arrays we genotyped four independent cohorts, totalling 11 861 psoriasis cases and 28 610 controls, aggregating the dataset through statistical meta-analysis. Single variant analysis detected a previously unreported risk locus at TNFSF15 (rs6478108; P = 1.50 × 10⁻⁸, OR = 1.10), and association of common protein-altering variants at 11 loci previously implicated in psoriasis susceptibility. We validate previous reports of protective low-frequency protein-altering variants within IFIH1 (encoding an innate antiviral receptor) and TYK2 (encoding a Janus kinase), in each case establishing a further series of protective rare variants (minor allele frequency < 0.01) via gene-wide aggregation testing (IFIH1: p_burden = 2.53 × 10⁻⁷, OR = 0.707; TYK2: p_burden = 6.17 × 10⁻⁴, OR = 0.744). Both genes play significant roles in type I interferon (IFN) production and signalling. Several of the protective rare and low-frequency variants in IFIH1 and TYK2 disrupt conserved protein domains, highlighting potential mechanisms through which their effect may be exerted.

34 citations

Journal Article

Imputation-Aware Tag SNP Selection To Improve Power for Large-Scale, Multi-ethnic Association Studies

Genevieve L. Wojcik¹, Christian Fuchsberger²,³, Daniel Taliun², Ryan P. Welch², Alicia R. Martin¹, Suyash Shringarpure¹, Christopher S. Carlson⁴, Gonçalo R. Abecasis², Hyun Min Kang², Michael Boehnke², Carlos Bustamante¹, Christopher R. Gignoux¹, Eimear E. Kenny

Stanford University¹, University of Michigan², University of Lübeck³, Fred Hutchinson Cancer Research Center⁴

01 Oct 2018-G3: Genes, Genomes, Genetics

TL;DR: A novel framework to select tag SNPs using the reference panel of 26 populations from Phase 3 of the 1000 Genomes Project, which demonstrates increased imputation accuracy for rare variants and examines array design strategies that contrast multi-ethnic cohorts vs. single populations.

Abstract: The emergence of very large cohorts in genomic research has facilitated a focus on genotype-imputation strategies to power rare variant association. These strategies have benefited from improvements in imputation methods and association tests; however, little attention has been paid to ways in which array design can increase rare variant association power. Therefore, we developed a novel framework to select tag SNPs using the reference panel of 26 populations from Phase 3 of the 1000 Genomes Project. We evaluate tag SNP performance via mean imputed r² at untyped sites using leave-one-out internal validation and standard imputation methods, rather than pairwise linkage disequilibrium. Moving beyond pairwise metrics allows us to account for haplotype diversity across the genome to improve imputation accuracy and demonstrates population-specific biases from pairwise estimates. We also examine array design strategies that contrast multi-ethnic cohorts vs. single populations, and show a boost in performance for the former can be obtained by prioritizing tag SNPs that contribute information across multiple populations simultaneously. Using our framework, we demonstrate increased imputation accuracy for rare variants (frequency < 1%) by 0.5-3.1% for an array of one million sites and 0.7-7.1% for an array of 500,000 sites, depending on the population. Finally, we show how recent explosive growth in non-African populations means tag SNPs capture on average 30% fewer other variants than in African populations. The unified framework presented here will enable investigators to make informed decisions for the design of new arrays, and help empower the next phase of rare variant association for global health.

33 citations

Journal Article

Prevalence of CKD and Its Relationship to eGFR-Related Genetic Loci and Clinical Risk Factors in the SardiNIA Study Cohort

Antonello Pani, Jennifer L. Bragg-Gresham¹, Marco Masala, Doloretta Piras, Alice Atzeni, Maria Grazia Pilia, Liana Anna Pina Ferreli, Lenuta Balaci, Nicolò Curreli, Alessandro P Delitala, Francesco Loi, Gonçalo R. Abecasis¹, David Schlessinger², Francesco Cucca³

University of Michigan¹, National Institutes of Health², University of Sassari³

01 Jul 2014-Journal of The American Society of Nephrology

TL;DR: Diabetes was associated with CKD prevalence, whereas hypertension and hyperuricemia correlated more strongly with fast eGFR decline; diabetes, hypertension, hyperuricemia, and high baseline eGFR were associated with a decline of eGFR.

Abstract: The prevalence of CKD and of renal failure vary worldwide, yet parallel increases in leading risk factors explain only part of the differential prevalence. We measured CKD prevalence and eGFR, and their relationship with traditional and additional risk factors, in a Sardinian founder population cohort. The eGFR was calculated using equations from the CKD Epidemiology Collaboration and Modification of Diet in Renal Disease studies. With use of the Kidney Disease Improving Global Outcomes guidelines, a cross-sectional analysis of 4842 individuals showed that CKD prevalence was 15.1%, including 3.6% of patients in the high-risk and 0.46% in the very-high-risk categories. Longitudinal analyses performed on 4074 of these individuals who completed three visits with an average follow-up of 7 years revealed that, consistent with other populations, average eGFR slope was −0.79 ml/min per 1.73 m² per year, but 11.4% of the participants had an eGFR decline >2.3 ml/min per 1.73 m² per year (fast decline). A genetic score was generated from 13 reported eGFR- and CKD-related loci, and univariable and multivariable analyses were applied to assess the relationship between clinical, ultrasonographic, and genetic variables with three outcomes: CKD, change in eGFR, and fast eGFR decline. Genetic risk score, older age, and female sex independently correlated with each outcome. Diabetes was associated with CKD prevalence, whereas hypertension and hyperuricemia correlated more strongly with fast eGFR decline. Diabetes, hypertension, hyperuricemia, and high baseline eGFR were associated with a decline of eGFR. Along with differential health practices, population variations in this spectrum of risk factors probably contribute to the variable CKD prevalence worldwide.

33 citations

Journal Article

A Mixed-Effects Model for Powerful Association Tests in Integrative Functional Genomics.

Yu Ru Su¹, Chong-Zhi Di¹, Stephanie A. Bien¹, Licai Huang¹, Xinyuan Dong², Gonçalo R. Abecasis³, Sonja I. Berndt⁴, Stéphane Bézieau, Hermann Brenner⁵, Bette J. Caan⁶, Graham Casey⁷, Jenny Chang-Claude⁵, Stephen J. Chanock⁴, Sai Chen³, Charles M. Connolly¹, Keith R. Curtis¹, Jane C. Figueiredo⁸, Manish Gala⁹, Steven Gallinger¹⁰, Tabitha A. Harrison¹, Michael Hoffmeister⁵, John L. Hopper, Jeroen R. Huyghe¹, Mark A. Jenkins, Amit Joshi⁹, Loic Le Marchand¹¹, Polly A. Newcomb¹,², Deborah A. Nickerson², John D. Potter¹,², Robert E. Schoen¹², Martha L. Slattery¹³, Emily White¹,², Brent W. Zanke¹⁴, Ulrike Peters¹,², Li Hsu¹,²

Fred Hutchinson Cancer Research Center¹, University of Washington², University of Michigan³, National Institutes of Health⁴, German Cancer Research Center⁵, Kaiser Permanente⁶, University of Virginia⁷, Cedars-Sinai Medical Center⁸, Harvard University⁹, Mount Sinai Hospital, Toronto¹⁰, University of Hawaii¹¹, University of Pittsburgh¹², University of Utah¹³, University of Ottawa¹⁴

03 May 2018-American Journal of Human Genetics

TL;DR: A unified mixed effects model is considered that formulates the association of intermediate phenotypes such as imputed gene expression through fixed effects, while allowing residual effects of individual variants to be random, and two data-driven combination approaches to jointly test for the fixed and random effects are proposed.

Abstract: Genome-wide association studies (GWASs) have successfully identified thousands of genetic variants for many complex diseases; however, these variants explain only a small fraction of the heritability. Recently, genetic association studies that leverage external transcriptome data have received much attention and shown promise for discovering novel variants. One such approach, PrediXcan, is to use predicted gene expression through genetic regulation. However, there are limitations in this approach. The predicted gene expression may be biased, resulting from regularized regression applied to moderately sample-sized reference studies. Further, some variants can individually influence disease risk through alternative functional mechanisms besides expression. Thus, testing only the association of predicted gene expression as proposed in PrediXcan will potentially lose power. To tackle these challenges, we consider a unified mixed effects model that formulates the association of intermediate phenotypes such as imputed gene expression through fixed effects, while allowing residual effects of individual variants to be random. We consider a set-based score testing framework, MiST (mixed effects score test), and propose two data-driven combination approaches to jointly test for the fixed and random effects. We establish the asymptotic distributions, which enable rapid calculation of p values for genome-wide analyses, and provide p values for fixed and random effects separately to enhance interpretability over GWASs. Extensive simulations demonstrate that our approaches are more powerful than existing ones. We apply our approach to a large-scale GWAS of colorectal cancer and identify two genes, POU5F1B and ATF1, which would have otherwise been missed by PrediXcan, after adjusting for all known loci.

33 citations

Journal Article

Fine Mapping on Chromosome 13q32–34 and Brain Expression Analysis Implicates MYO16 in Schizophrenia

Laura Rodriguez-Murillo¹, Bin Xu¹, J. Louw Roos², Gonçalo R. Abecasis³, Joseph A. Gogos¹, Maria Karayiorgou¹

Columbia University¹, University of Pretoria², University of Michigan³

01 Mar 2014-Neuropsychopharmacology

TL;DR: The results suggest that common variation within MYO16 may contribute to the genetic liability to schizophrenia, and a significant association with a genetic variant within the gene encoding for the myosin heavy-chain Myr 8 (MYO16) is reported.

33 citations

Cited by

Journal Article

Fast and accurate short read alignment with Burrows–Wheeler transform

Heng Li¹, Richard Durbin¹

Wellcome Trust Sanger Institute¹

01 Jul 2009-Bioinformatics

TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.

Abstract: Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ~10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net
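Illustrative sketch (not taken from the paper or from BWA's source): the abstract above names backward search with the Burrows–Wheeler Transform as BWA's core idea. The toy Python below shows exact-match backward search only; the function names `bwt_index` and `backward_search` are hypothetical, the index is built naively by sorting suffixes, and mismatches and gaps, which BWA supports, are not handled.

```python
from collections import Counter


def bwt_index(text):
    """Build a toy index: suffix array, C table, and Occ table for text."""
    text += "$"  # sentinel; assumed absent from text and smaller than every symbol
    # Naive suffix array (fine for a demo, far too slow for a genome).
    sa = sorted(range(len(text)), key=lambda i: text[i:])
    # BWT: the character preceding each sorted suffix (wraps to "$" for i == 0).
    bwt = "".join(text[i - 1] for i in sa)
    # C[c] = number of characters in text strictly smaller than c.
    counts = Counter(text)
    C, total = {}, 0
    for ch in sorted(counts):
        C[ch] = total
        total += counts[ch]
    # occ[c][i] = number of occurrences of c in bwt[:i].
    occ = {ch: [0] * (len(bwt) + 1) for ch in counts}
    for i, ch in enumerate(bwt):
        for c in occ:
            occ[c][i + 1] = occ[c][i] + (1 if c == ch else 0)
    return sa, C, occ


def backward_search(pattern, sa, C, occ):
    """Return sorted start positions of exact occurrences of pattern."""
    lo, hi = 0, len(sa)  # half-open range of suffix-array rows
    for ch in reversed(pattern):  # consume the pattern right to left
        if ch not in C:
            return []
        lo = C[ch] + occ[ch][lo]
        hi = C[ch] + occ[ch][hi]
        if lo >= hi:
            return []
    return sorted(sa[lo:hi])


index = bwt_index("banana")
print(backward_search("ana", *index))  # [1, 3]
```

Each pattern character narrows the suffix-array range in constant time given the tables, so the search cost depends on the read length rather than on the reference length.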

43,862 citations

Journal Article

Fast gapped-read alignment with Bowtie 2

Ben Langmead¹, Steven L. Salzberg¹,²,³

University of Maryland, College Park¹, Johns Hopkins University², Johns Hopkins University School of Medicine³

01 Apr 2012-Nature Methods

TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

37,898 citations

Journal Article

PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses

Shaun Purcell¹,², Benjamin M. Neale¹,³, Kathe Todd-Brown², Lori Thomas², Manuel A. R. Ferreira², David Bender¹,², Julian Maller¹,², Pamela Sklar¹,², Paul I.W. de Bakker¹,², Mark J. Daly¹,², Pak C. Sham⁴

Massachusetts Institute of Technology¹, Harvard University², University of London³, University of Hong Kong⁴

01 Sep 2007-American Journal of Human Genetics

TL;DR: This work introduces PLINK, an open-source C/C++ WGAS tool set, and describes the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation, focusing on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies.

Abstract: Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.
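Illustrative sketch (not PLINK's implementation): the abstract above refers to identity-by-state (IBS) information. Assuming biallelic genotypes coded as minor-allele counts 0, 1 or 2, with `None` for a missing call, the number of alleles two individuals share by state at one marker is 2 minus the absolute difference of their codes. The function name `ibs_proportion` and the coding scheme are assumptions made for this example.

```python
def ibs_proportion(genotypes_a, genotypes_b):
    """Mean proportion of alleles shared identical by state between two people.

    Genotypes are allele counts (0, 1 or 2) per marker; None marks a missing call.
    Markers where either call is missing are skipped.
    """
    shared = 0
    markers = 0
    for a, b in zip(genotypes_a, genotypes_b):
        if a is None or b is None:
            continue
        shared += 2 - abs(a - b)  # 2, 1 or 0 alleles shared by state
        markers += 1
    if markers == 0:
        raise ValueError("no markers with both genotypes called")
    return shared / (2 * markers)


# Four markers sharing 2, 2, 0 and 1 alleles: 5 of 8 alleles in total.
print(ibs_proportion([0, 1, 2, 2], [0, 1, 0, 1]))  # 0.625
```

Identity by descent cannot be read off this way; it has to be estimated from IBS patterns together with allele frequencies, which is the estimation step the abstract describes.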

26,280 citations

Journal Article

Initial sequencing and analysis of the human genome.

Eric S. Lander¹, Lauren Linton¹, Bruce W. Birren¹, Chad Nusbaum¹, and 245 more authors (29 institutions)

15 Feb 2001-Nature

TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.

Abstract: The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.

22,269 citations

Journal Article

The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data

Aaron McKenna¹, Matthew Hanna, Eric Banks, Andrey Sivachenko, Kristian Cibulskis, Andrew Kernytsky, Kiran V. Garimella, David Altshuler, Stacey Gabriel, Mark J. Daly, Mark A. DePristo

Broad Institute¹

01 Sep 2010-Genome Research

TL;DR: The GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

Abstract: Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS—the 1000 Genome pilot alone includes nearly five terabases—make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

20,557 citations
