scispace - formally typeset
Search or ask a question
Author

Gonçalo R. Abecasis

Bio: Gonçalo R. Abecasis is an academic researcher from University of Michigan. The author has contributed to research in topics: Genome-wide association study & Population. The author has an hindex of 179, co-authored 595 publications receiving 230323 citations. Previous affiliations of Gonçalo R. Abecasis include Johns Hopkins University School of Medicine & Wellcome Trust Centre for Human Genetics.


Papers
More filters
Joshua C. Bis, Maryam Kavousi, Nora Franceschini, Aaron Isaacs, Gonçalo R. Abecasis, Ulf Schminke, Wendy S. Post, Albert V. Smith, L. Adrienne Cupples, Hugh S. Markus, Reinhold E. Schmidt, Jennifer E. Huffman, Terho Lehtimäki, Jens Baumert, Thomas Muenzel, Susan R. Heckbert, Abbas Dehghan, Kari E. North, Ben A. Oostra, Steve Bevan, Eva-Maria Stoegerer, Caroline Hayward, Olli T. Raitakari, Christa Meisinger, Arne Schillert, Serena Sanna, Henry Voelzke, Yu-Ching Cheng, Bolli Thorsson, Caroline S. Fox, Kenneth Rice, Fernando Rivadeneira, Vijay Nambi, Eran Halperin, K. Petrovic, Leena Peltonen, H.-Erich Wichmann, Renate B. Schnabel, Marcus Doerr, Afshin Parsa, Thor Aspelund, Serkalem Demissie, Sekar Kathiresan, Muredach P. Reilly, Kent D. Taylor, André G. Uitterlinden, David Couper, Matthias Sitzer, Mika Kähönen, Thomas Illig, Philipp S. Wild, Marco Orru, Jan Luedemann, Alan R. Shuldiner, Gudny Eiriksdottir, Charles C. White, Jerome I. Rotter, Albert Hofman, Jochen Seissler, Tanja Zeller, Gianluca Usala, Florian Ernst, Lenore J. Launer, Ralph B. D'Agostino, Daniel H. O'Leary, Christie M. Ballantyne, Joachim Thiery, Andreas Ziegler, Edward G. Lakatta, Ravi Kumar Chilukoti, Tamara B. Harris, Philip A. Wolf, Bruce M. Psaty, Joseph F. Polak, Xia Li, Wolfgang Rathmann, Manuela Uda, Eric Boerwinkle, Norman Klopp, Helena Schmidt, James F. Wilson, Jorma Viikari, Wolfgang Koenig, Stefan Blankenberg, Anne B. Newman, Jacqueline C.M. Witteman, Gerardo Heiss, Cornelia M. van Duijn, Angelo Scuteri, Georg Homuth, Braxton D. Mitchell, Vilmundur Gudnason, Christopher J. O'Donnell 
01 Jan 2011

186 citations

Journal ArticleDOI
TL;DR: The results provide a useful method for estimating the overlap between various eQTL studies and provide a catalog of cis-eQTLs in skin that can facilitate efforts to understand the functional impact of identified susceptibility variants on psoriasis and other skin traits.
Abstract: Psoriasis, an immune-mediated, inflammatory disease of the skin and joints, provides an ideal system for expression quantitative trait locus (eQTL) analysis, because it has a strong genetic basis and disease-relevant tissue (skin) is readily accessible. To better understand the role of genetic variants regulating cutaneous gene expression, we identified 841 cis-acting eQTLs using RNA extracted from skin biopsies of 53 psoriatic individuals and 57 healthy controls. We found substantial overlap between cis-eQTLs of normal control, uninvolved psoriatic, and lesional psoriatic skin. Consistent with recent studies and with the idea that control of gene expression can mediate relationships between genetic variants and disease risk, we found that eQTL SNPs are more likely to be associated with psoriasis than are randomly selected SNPs. To explore the tissue specificity of these eQTLs and hence to quantify the benefits of studying eQTLs in different tissues, we developed a refined statistical method for estimating eQTL overlap and used it to compare skin eQTLs to a published panel of lymphoblastoid cell line (LCL) eQTLs. Our method accounts for the fact that most eQTL studies are likely to miss some true eQTLs as a result of power limitations and shows that ∼70% of cis-eQTLs in LCLs are shared with skin, as compared with the naive estimate of < 50% sharing. Our results provide a useful method for estimating the overlap between various eQTL studies and provide a catalog of cis-eQTLs in skin that can facilitate efforts to understand the functional impact of identified susceptibility variants on psoriasis and other skin traits.

186 citations

Journal ArticleDOI
TL;DR: The new associations would have been missed in analyses based on 1000 Genomes Project data, underlining the advantages of large-scale sequencing in this founder population of Sardinians.
Abstract: We report ∼17.6 million genetic variants from whole-genome sequencing of 2,120 Sardinians; 22% are absent from previous sequencing-based compilations and are enriched for predicted functional consequences. Furthermore, ∼76,000 variants common in our sample (frequency >5%) are rare elsewhere (<0.5% in the 1000 Genomes Project). We assessed the impact of these variants on circulating lipid levels and five inflammatory biomarkers. We observe 14 signals, including 2 major new loci, for lipid levels and 19 signals, including 2 new loci, for inflammatory markers. The new associations would have been missed in analyses based on 1000 Genomes Project data, underlining the advantages of large-scale sequencing in this founder population.

186 citations

Journal ArticleDOI
TL;DR: The results consolidate the chromosomal locations of several AMD susceptibility loci and, together with previous reports, should facilitate the search for disease-associated sequence variants.
Abstract: Age-related macular degeneration (AMD) is a complex multifactorial disease that affects the central region of the retina. AMD is clinically heterogeneous, leading to geographic atrophy (GA) and/or choroidal neovascularization (CNV) at advanced stages. Considerable data exists in support of a genetic predisposition for AMD. Recent linkage studies have provided evidence in favor of several AMD susceptibility loci. We have performed a high-resolution (5-cM) genome scan of 412 affected relative pairs that were enriched for late-stage disease (GA and/or CNV). Nonparametric linkage analysis was performed using two different diagnostic criteria and also by dividing the affected individuals according to GA or CNV phenotype. Our results demonstrate evidence of linkage in regions that were suggested in at least one previous study at chromosomes 1q (236–240 cM in the Marshfield genetic map), 5p (40–50 cM), and 9q (111 cM). Multipoint analysis of affected relatives with CNV provided evidence of additional susceptibility loci on chromosomes 2p (10 cM) and 22q (25 cM). A recently identified Gln5345Arg change in HEMICENTIN-1 on chromosome 1q25 was not detected in 274 affected members in the restricted group with AMD, 346 additional patients with AMD, and 237 unaffected controls. Our results consolidate the chromosomal locations of several AMD susceptibility loci and, together with previous reports, should facilitate the search for disease-associated sequence variants.

184 citations

Journal ArticleDOI
TL;DR: Three additional genes, RLBP1, HIC1 and PARP12, after Bonferroni correction are identified and the Eye Genotype Expression database is established as a resource for post-GWAS interpretation of multifactorial ocular traits.
Abstract: Genome-wide association studies (GWAS) have identified genetic variants at 34 loci contributing to age-related macular degeneration (AMD)1-3. We generated transcriptional profiles of postmortem retinas from 453 controls and cases at distinct stages of AMD and integrated retinal transcriptomes, covering 13,662 protein-coding and 1,462 noncoding genes, with genotypes at more than 9 million common SNPs for expression quantitative trait loci (eQTL) analysis of a tissue not included in Genotype-Tissue Expression (GTEx) and other large datasets4,5. Cis-eQTL analysis identified 10,474 genes under genetic regulation, including 4,541 eQTLs detected only in the retina. Integrated analysis of AMD-GWAS with eQTLs ascertained likely target genes at six reported loci. Using transcriptome-wide association analysis (TWAS), we identified three additional genes, RLBP1, HIC1 and PARP12, after Bonferroni correction. Our studies expand the genetic landscape of AMD and establish the Eye Genotype Expression (EyeGEx) database as a resource for post-GWAS interpretation of multifactorial ocular traits.

181 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.
Abstract: Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ~10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net Contact: [email protected]

43,862 citations

Journal ArticleDOI
TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.
Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

37,898 citations

Journal ArticleDOI
TL;DR: This work introduces PLINK, an open-source C/C++ WGAS tool set, and describes the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation, which focuses on the estimation and use of identity- by-state and identity/descent information in the context of population-based whole-genome studies.
Abstract: Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.

26,280 citations

Journal ArticleDOI
Eric S. Lander1, Lauren Linton1, Bruce W. Birren1, Chad Nusbaum1  +245 moreInstitutions (29)
15 Feb 2001-Nature
TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.
Abstract: The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.

22,269 citations

Journal ArticleDOI
TL;DR: The GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.
Abstract: Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS—the 1000 Genome pilot alone includes nearly five terabases—make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

20,557 citations