scispace - formally typeset
Search or ask a question
Author

Gonçalo R. Abecasis

Bio: Gonçalo R. Abecasis is an academic researcher from University of Michigan. The author has contributed to research in topics: Genome-wide association study & Population. The author has an hindex of 179, co-authored 595 publications receiving 230323 citations. Previous affiliations of Gonçalo R. Abecasis include Johns Hopkins University School of Medicine & Wellcome Trust Centre for Human Genetics.


Papers
More filters
Journal ArticleDOI
14 Oct 2020-Nature
TL;DR: Analysis of high-coverage whole-genome sequences from 97,691 participants of diverse ancestries in the National Heart, Lung, and Blood Institute Trans-omics for Precision Medicine programme enables simultaneous identification of germline and somatic mutations that predispose individuals to clonal expansion of haematopoietic stem cells.
Abstract: Age is the dominant risk factor for most chronic human diseases, but the mechanisms through which ageing confers this risk are largely unknown1. The age-related acquisition of somatic mutations that lead to clonal expansion in regenerating haematopoietic stem cell populations has recently been associated with both haematological cancer2-4 and coronary heart disease5-this phenomenon is termed clonal haematopoiesis of indeterminate potential (CHIP)6. Simultaneous analyses of germline and somatic whole-genome sequences provide the opportunity to identify root causes of CHIP. Here we analyse high-coverage whole-genome sequences from 97,691 participants of diverse ancestries in the National Heart, Lung, and Blood Institute Trans-omics for Precision Medicine (TOPMed) programme, and identify 4,229 individuals with CHIP. We identify associations with blood cell, lipid and inflammatory traits that are specific to different CHIP driver genes. Association of a genome-wide set of germline genetic variants enabled the identification of three genetic loci associated with CHIP status, including one locus at TET2 that was specific to individuals of African ancestry. In silico-informed in vitro evaluation of the TET2 germline locus enabled the identification of a causal variant that disrupts a TET2 distal enhancer, resulting in increased self-renewal of haematopoietic stem cells. Overall, we observe that germline genetic variation shapes haematopoietic stem cell function, leading to CHIP through mechanisms that are specific to clonal haematopoiesis as well as shared mechanisms that lead to somatic mutations across tissues.

300 citations

Journal ArticleDOI
TL;DR: The usefulness of joint analyses of clinically distinct immune-mediated diseases and the map of shared genetic risk loci revealed in silico expression quantitative trait locus analyses revealed that the associations at ZMIZ1 and near SOCS1 have a potential functional effect on gene expression.
Abstract: Psoriasis (PS) and Crohn disease (CD) have been shown to be epidemiologically, pathologically, and therapeutically connected, but little is known about their shared genetic causes. We performed meta-analyses of five published genome-wide association studies on PS (2,529 cases and 4,955 controls) and CD (2,142 cases and 5,505 controls), followed up 20 loci that showed strongest evidence for shared disease association and, furthermore, tested cross-disease associations for previously reported PS and CD risk alleles in additional 6,115 PS cases, 4,073 CD cases, and 10,100 controls. We identified seven susceptibility loci outside the human leukocyte antigen region (9p24 near JAK2, 10q22 at ZMIZ1 , 11q13 near PRDX5 , 16p13 near SOCS1 , 17q21 at STAT3 , 19p13 near FUT2 , and 22q11 at YDJC ) shared between PS and CD with genome-wide significance (p −8 ) and confirmed four already established PS and CD risk loci ( IL23R , IL12B , REL , and TYK2 ). Three of the shared loci are also genome-wide significantly associated with PS alone (10q22 at ZMIZ1 , p rs1250544 = 3.53 × 10 −8 , 11q13 near PRDX5 , p rs694739 = 3.71 × 10 −09 , 22q11 at YDJC , p rs181359 = 8.02 × 10 −10 ). In addition, we identified one susceptibility locus for CD (16p13 near SOCS1 , p rs4780355 = 4.99 × 10 −8 ). Refinement of association signals identified shared genome-wide significant associations for exonic SNPs at 10q22 ( ZMIZ1 ) and in silico expression quantitative trait locus analyses revealed that the associations at ZMIZ1 and near SOCS1 have a potential functional effect on gene expression. Our results show the usefulness of joint analyses of clinically distinct immune-mediated diseases and enlarge the map of shared genetic risk loci.

300 citations

Journal ArticleDOI
TL;DR: In this article, the authors used high-throughput complementary DNA sequencing (RNA-seq) to assay the transcriptomes of lesional psoriatic and normal skin, which revealed multiple modules of coordinately expressed epidermal differentiation genes, overlapping significantly with genes regulated by the long noncoding RNA TINCR.

292 citations

Journal ArticleDOI
TL;DR: The epidemiology, immunopathology, and genetics of psoriasis are reviewed, and a disease model integrating its genetics and immunology is presented, using cases and controls rather than families.

288 citations

Journal ArticleDOI
Marleen H. M. de Moor1, Stéphanie Martine van den Berg2, Karin J. H. Verweij3, Karin J. H. Verweij1, Robert F. Krueger4, Michelle Luciano5, Alejandro Arias Vasquez6, Lindsay K. Matteson4, Jaime Derringer7, Tõnu Esko8, Najaf Amin9, Scott D. Gordon3, Narelle K. Hansell3, Amy B. Hart10, Ilkka Seppälä, Jennifer E. Huffman5, Bettina Konte11, Jari Lahti12, Minyoung Lee13, Michael B. Miller4, Teresa Nutile14, Toshiko Tanaka15, Alexander Teumer16, Alexander Viktorin17, Juho Wedenoja12, Gonçalo R. Abecasis18, Daniel E. Adkins13, Arpana Agrawal19, Jüri Allik20, Jüri Allik8, Katja Appel16, Timothy B. Bigdeli13, Fabio Busonero13, Harry Campbell5, Paul T. Costa21, George Davey Smith22, Gail Davies5, Harriet de Wit10, Jun Ding15, Barbara E. Engelhardt23, Johan G. Eriksson, Iryna O. Fedko1, Luigi Ferrucci15, Barbara Franke6, Ina Giegling11, Richard A. Grucza19, Annette M. Hartmann11, Andrew C. Heath19, Kati Heinonen12, Anjali K. Henders3, Georg Homuth16, Jouke-Jan Hottenga1, William G. Iacono4, Joost G. E. Janzing6, Markus Jokela12, Robert Karlsson17, John P. Kemp24, John P. Kemp22, Matthew G. Kirkpatrick10, Antti Latvala25, Antti Latvala12, Terho Lehtimäki, David C. Liewald5, Pamela A. F. Madden19, Chiara Magri26, Patrik K. E. Magnusson17, Jonathan Marten5, Andrea Maschio27, Sarah E. Medland3, Evelin Mihailov8, Yuri Milaneschi1, Grant W. Montgomery3, Matthias Nauck16, Klaasjan G. Ouwens1, Aarno Palotie28, Aarno Palotie12, Erik Pettersson17, Ozren Polasek29, Yong Qian15, Laura Pulkki-Råback12, Olli T. Raitakari30, Anu Realo8, Richard J. Rose31, Daniela Ruggiero14, Carsten Oliver Schmidt16, Wendy S. Slutske32, Rossella Sorice14, John M. Starr5, Beate St Pourcain22, Angelina R. Sutin15, Angelina R. Sutin33, Nicholas J. Timpson22, Holly Trochet5, Sita H. Vermeulen6, Eero Vuoksimaa12, Elisabeth Widen12, Jasper Wouda1, Jasper Wouda2, Margaret J. Wright3, Lina Zgaga34, Lina Zgaga5, David J. Porteous5, Alessandra Minelli26, Abraham A. Palmer10, Dan Rujescu11, Marina Ciullo14, Caroline Hayward5, Igor Rudan5, Andres Metspalu5, Jaakko Kaprio12, Jaakko Kaprio25, Ian J. Deary5, Katri Räikkönen12, James F. Wilson5, Liisa Keltikangas-Järvinen12, Laura J. Bierut19, John M. Hettema13, Hans Joergen Grabe13, Cornelia M. van Duijn9, David M. Evans22, David M. Evans24, David Schlessinger15, N. L. Pedersen14, Antonio Terracciano33, Matt McGue4, Matt McGue35, Brenda W.J.H. Penninx1, Nicholas G. Martin3, Dorret I. Boomsma1 
TL;DR: This study identifies a novel locus for neuroticism located in a known gene that has been associated with bipolar disorder and schizophrenia in previous studies and shows that neuroticism is influenced by many genetic variants of small effect that are either common or tagged by common variants.
Abstract: Importance Neuroticism is a pervasive risk factor for psychiatric conditions. It genetically overlaps with major depressive disorder (MDD) and is therefore an important phenotype for psychiatric genetics. The Genetics of Personality Consortium has created a resource for genome-wide association analyses of personality traits in more than 63 000 participants (including MDD cases). Objectives To identify genetic variants associated with neuroticism by performing a meta-analysis of genome-wide association results based on 1000 Genomes imputation; to evaluate whether common genetic variants as assessed by single-nucleotide polymorphisms (SNPs) explain variation in neuroticism by estimating SNP-based heritability; and to examine whether SNPs that predict neuroticism also predict MDD. Design, Setting, and Participants Genome-wide association meta-analysis of 30 cohorts with genome-wide genotype, personality, and MDD data from the Genetics of Personality Consortium. The study included 63 661 participants from 29 discovery cohorts and 9786 participants from a replication cohort. Participants came from Europe, the United States, or Australia. Analyses were conducted between 2012 and 2014. Main Outcomes and Measures Neuroticism scores harmonized across all 29 discovery cohorts by item response theory analysis, and clinical MDD case-control status in 2 of the cohorts. Results A genome-wide significant SNP was found on 3p14 in MAGI1 (rs35855737; P = 9.26 × 10−9 in the discovery meta-analysis). This association was not replicated (P = .32), but the SNP was still genome-wide significant in the meta-analysis of all 30 cohorts (P = 2.38 × 10−8). Common genetic variants explain 15% of the variance in neuroticism. Polygenic scores based on the meta-analysis of neuroticism in 27 cohorts significantly predicted neuroticism (1.09 × 10−12 < P < .05) and MDD (4.02 × 10−9 < P < .05) in the 2 other cohorts. Conclusions and Relevance This study identifies a novel locus for neuroticism. The variant is located in a known gene that has been associated with bipolar disorder and schizophrenia in previous studies. In addition, the study shows that neuroticism is influenced by many genetic variants of small effect that are either common or tagged by common variants. These genetic variants also influence MDD. Future studies should confirm the role of the MAGI1 locus for neuroticism and further investigate the association of MAGI1 and the polygenic association to a range of other psychiatric disorders that are phenotypically correlated with neuroticism

286 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.
Abstract: Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ~10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net Contact: [email protected]

43,862 citations

Journal ArticleDOI
TL;DR: Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.
Abstract: As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.

37,898 citations

Journal ArticleDOI
TL;DR: This work introduces PLINK, an open-source C/C++ WGAS tool set, and describes the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation, which focuses on the estimation and use of identity- by-state and identity/descent information in the context of population-based whole-genome studies.
Abstract: Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.

26,280 citations

Journal ArticleDOI
Eric S. Lander1, Lauren Linton1, Bruce W. Birren1, Chad Nusbaum1  +245 moreInstitutions (29)
15 Feb 2001-Nature
TL;DR: The results of an international collaboration to produce and make freely available a draft sequence of the human genome are reported and an initial analysis is presented, describing some of the insights that can be gleaned from the sequence.
Abstract: The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.

22,269 citations

Journal ArticleDOI
TL;DR: The GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.
Abstract: Next-generation DNA sequencing (NGS) projects, such as the 1000 Genomes Project, are already revolutionizing our understanding of genetic variation among individuals. However, the massive data sets generated by NGS—the 1000 Genome pilot alone includes nearly five terabases—make writing feature-rich, efficient, and robust analysis tools difficult for even computationally sophisticated individuals. Indeed, many professionals are limited in the scope and the ease with which they can answer scientific questions by the complexity of accessing and manipulating the data produced by these machines. Here, we discuss our Genome Analysis Toolkit (GATK), a structured programming framework designed to ease the development of efficient and robust analysis tools for next-generation DNA sequencers using the functional programming philosophy of MapReduce. The GATK provides a small but rich set of data access patterns that encompass the majority of analysis tool needs. Separating specific analysis calculations from common data management infrastructure enables us to optimize the GATK framework for correctness, stability, and CPU and memory efficiency and to enable distributed and shared memory parallelization. We highlight the capabilities of the GATK by describing the implementation and application of robust, scale-tolerant tools like coverage calculators and single nucleotide polymorphism (SNP) calling. We conclude that the GATK programming framework enables developers and analysts to quickly and easily write efficient and robust NGS tools, many of which have already been incorporated into large-scale sequencing projects like the 1000 Genomes Project and The Cancer Genome Atlas.

20,557 citations