Showing papers in "Nature Genetics in 2021"

PDF

Open Access

Journal Article•DOI•

ArchR is a scalable software package for integrative single-cell chromatin accessibility analysis.

[...]

Jeffrey M. Granja¹, M. Ryan Corces, Sarah E. Pierce¹, S. Tansu Bagdatli¹, Hani Choudhry², Howard Y. Chang¹, Howard Y. Chang³, William J. Greenleaf - Show less +4 more•Institutions (3)

Stanford University¹, King Abdulaziz University², Howard Hughes Medical Institute³

25 Feb 2021-Nature Genetics

TL;DR: ArchR as discussed by the authors is a software suite for single-cell analysis of regulatory chromatin in R (ArchR; https://www.archrproject.com/ ) that enables fast and comprehensive analysis of singlecell chromatin accessibility data.

...read moreread less

Abstract: The advent of single-cell chromatin accessibility profiling has accelerated the ability to map gene regulatory landscapes but has outpaced the development of scalable software to rapidly extract biological meaning from these data. Here we present a software suite for single-cell analysis of regulatory chromatin in R (ArchR; https://www.archrproject.com/ ) that enables fast and comprehensive analysis of single-cell chromatin accessibility data. ArchR provides an intuitive, user-focused interface for complex single-cell analyses, including doublet removal, single-cell clustering and cell type identification, unified peak set generation, cellular trajectory identification, DNA element-to-gene linkage, transcription factor footprinting, mRNA expression level prediction from chromatin accessibility and multi-omic integration with single-cell RNA sequencing (scRNA-seq). Enabling the analysis of over 1.2 million single cells within 8 h on a standard Unix laptop, ArchR is a comprehensive software suite for end-to-end analysis of single-cell chromatin accessibility that will accelerate the understanding of gene regulation at the resolution of individual cells.

...read moreread less

406 citations

Journal Article•DOI•

Genome-wide association study of more than 40,000 bipolar disorder cases provides new insights into the underlying biology

[...]

Niamh Mullins¹, Andreas J. Forstner², Andreas J. Forstner³, Andreas J. Forstner⁴ +396 more•Institutions (119)

17 May 2021-Nature Genetics

TL;DR: The authors performed a genome-wide association study of 41,917 bipolar disorder cases and 371,549 controls of European ancestry, which identified 64 associated genomic loci, including genes encoding targets of antipsychotics, calcium channel blockers, antiepileptics and anesthetics.

...read moreread less

Abstract: Bipolar disorder is a heritable mental illness with complex etiology. We performed a genome-wide association study of 41,917 bipolar disorder cases and 371,549 controls of European ancestry, which identified 64 associated genomic loci. Bipolar disorder risk alleles were enriched in genes in synaptic signaling pathways and brain-expressed genes, particularly those with high specificity of expression in neurons of the prefrontal cortex and hippocampus. Significant signal enrichment was found in genes encoding targets of antipsychotics, calcium channel blockers, antiepileptics and anesthetics. Integrating expression quantitative trait locus data implicated 15 genes robustly linked to bipolar disorder via gene expression, encoding druggable targets such as HTR6, MCHR1, DCLK3 and FURIN. Analyses of bipolar disorder subtypes indicated high but imperfect genetic correlation between bipolar disorder type I and II and identified additional associated loci. Together, these results advance our understanding of the biological etiology of bipolar disorder, identify novel therapeutic leads and prioritize genes for functional follow-up studies.

...read moreread less

378 citations

Journal Article•DOI•

Large-scale cis- and trans-eQTL analyses identify thousands of genetic loci and polygenic scores that regulate blood gene expression

[...]

Urmo Võsa¹, Annique Claringbould², Annique Claringbould³, Harm-Jan Westra¹, Marc Jan Bonder¹, Patrick Deelen, Biao Zeng⁴, Holger Kirsten⁵, Ashis Saha⁶, Roman Kreuzhuber⁷, Roman Kreuzhuber³, Roman Kreuzhuber⁸, Seyhan Yazar⁹, Harm Brugge¹, Roy Oelen¹, Dylan H. de Vries¹, Monique G. P. van der Wijst¹, Silva Kasela¹⁰, Natalia Pervjakova¹⁰, Isabel Alves¹¹, Marie-Julie Favé¹¹, Mawusse Agbessi¹¹, Mark W. Christiansen¹², Rick Jansen¹³, Ilkka Seppälä, Lin Tong¹⁴, Alexander Teumer¹⁵, Katharina Schramm¹⁶, Gibran Hemani¹⁷, Joost Verlouw¹⁸, Hanieh Yaghootkar¹⁹, Hanieh Yaghootkar²⁰, Hanieh Yaghootkar²¹, Reyhan Sönmez Flitman²², Reyhan Sönmez Flitman²³, Andrew A. Brown²⁴, Andrew A. Brown²⁵, Viktorija Kukushkina¹⁰, Anette Kalnapenkis¹⁰, Sina Rüeger²³, Eleonora Porcu²³, Jaanika Kronberg¹⁰, Johannes Kettunen, Bernett Lee²⁶, Futao Zhang²⁷, Ting Qi²⁷, Jose Alquicira Hernandez⁹, Wibowo Arindrarto²⁸, Frank Beutner⁵, Peter A C 't Hoen²⁹, Joyce B. J. van Meurs¹⁸, Jenny van Dongen¹³, Maarten van Iterson²⁸, Morris A. Swertz, Julia Dmitrieva³⁰, Mahmoud Elansary³⁰, Benjamin P. Fairfax³¹, Michel Georges³⁰, Bastiaan T. Heijmans²⁸, Alex W. Hewitt³², Mika Kähönen, Yungil Kim⁶, Yungil Kim³³, Julian C. Knight³¹, Peter Kovacs⁵, Knut Krohn⁵, Shuang Li¹, Markus Loeffler⁵, Urko M. Marigorta³⁴, Urko M. Marigorta⁴, Hailang Mei²⁸, Yukihide Momozawa³⁰, Martina Müller-Nurasyid¹⁶, Matthias Nauck¹⁵, Michel G. Nivard³⁵, Brenda W.J.H. Penninx¹³, Jonathan K. Pritchard³⁶, Olli T. Raitakari³⁷, Olli T. Raitakari³⁸, Olaf Rötzschke²⁶, Eline Slagboom²⁸, Coen D.A. Stehouwer³⁹, Michael Stumvoll⁵, Patrick F. Sullivan⁴⁰, Joachim Thiery⁵, Anke Tönjes⁵, Jan H. Veldink⁴¹, Uwe Völker¹⁵, Robert Warmerdam¹, Cisca Wijmenga¹, Morris Swertz, Anand Kumar Andiappan²⁶, Grant W. Montgomery²⁷, Samuli Ripatti⁴², Markus Perola⁴³, Zoltán Kutalik²³, Emmanouil T. Dermitzakis²², Emmanouil T. Dermitzakis²⁵, Sven Bergmann²², Sven Bergmann²³, Timothy M. Frayling²⁰, Holger Prokisch⁴⁴, Habibul Ahsan¹⁴, Brandon L. Pierce¹⁴, Terho Lehtimäki, Dorret I. Boomsma¹³, Bruce M. Psaty¹², Sina A. Gharib¹², Philip Awadalla¹¹, Lili Milani¹⁰, Willem H. Ouwehand⁴⁵, Willem H. Ouwehand⁸, Willem H. Ouwehand⁷, Kate Downes⁸, Kate Downes⁷, Oliver Stegle⁴⁶, Oliver Stegle³, Alexis Battle⁶, Peter M. Visscher²⁷, Jian Yang²⁷, Jian Yang⁴⁷, Markus Scholz⁵, Joseph E. Powell⁹, Joseph E. Powell⁴⁸, Greg Gibson⁴, Tõnu Esko¹⁰, Lude Franke¹ - Show less +123 more•Institutions (48)

University Medical Center Groningen¹, Netherlands Cancer Institute², European Bioinformatics Institute³, Georgia Institute of Technology⁴, Leipzig University⁵, Johns Hopkins University⁶, University of Cambridge⁷, NHS Blood and Transplant⁸, Garvan Institute of Medical Research⁹, University of Tartu¹⁰, Ontario Institute for Cancer Research¹¹, University of Washington¹², Public Health Research Institute¹³, University of Chicago¹⁴, Greifswald University Hospital¹⁵, Ludwig Maximilian University of Munich¹⁶, University of Bristol¹⁷, Erasmus University Rotterdam¹⁸, University of Westminster¹⁹, Royal Devon and Exeter Hospital²⁰, Luleå University of Technology²¹, Swiss Institute of Bioinformatics²², University of Lausanne²³, University of Dundee²⁴, University of Geneva²⁵, Agency for Science, Technology and Research²⁶, University of Queensland²⁷, Leiden University Medical Center²⁸, Radboud University Nijmegen²⁹, University of Liège³⁰, University of Oxford³¹, Menzies Research Institute³², Icahn School of Medicine at Mount Sinai³³, Ikerbasque³⁴, VU University Amsterdam³⁵, Stanford University³⁶, Turku University Hospital³⁷, University of Turku³⁸, Maastricht University³⁹, Karolinska Institutet⁴⁰, Utrecht University⁴¹, University of Helsinki⁴², National Institutes of Health⁴³, Technische Universität München⁴⁴, Wellcome Trust Sanger Institute⁴⁵, German Cancer Research Center⁴⁶, Westlake University⁴⁷, University of New South Wales⁴⁸

02 Sep 2021-Nature Genetics

TL;DR: In this article, the authors performed cis-and trans-expression quantitative trait locus (eQTL) analyses using blood-derived expression from 31,684 individuals through the eQTLGen Consortium.

...read moreread less

Abstract: Trait-associated genetic variants affect complex phenotypes primarily via regulatory mechanisms on the transcriptome. To investigate the genetics of gene expression, we performed cis- and trans-expression quantitative trait locus (eQTL) analyses using blood-derived expression from 31,684 individuals through the eQTLGen Consortium. We detected cis-eQTL for 88% of genes, and these were replicable in numerous tissues. Distal trans-eQTL (detected for 37% of 10,317 trait-associated variants tested) showed lower replication rates, partially due to low replication power and confounding by cell type composition. However, replication analyses in single-cell RNA-seq data prioritized intracellular trans-eQTL. Trans-eQTL exerted their effects via several mechanisms, primarily through regulation by transcription factors. Expression of 13% of the genes correlated with polygenic scores for 1,263 phenotypes, pinpointing potential drivers for those traits. In summary, this work represents a large eQTL resource, and its results serve as a starting point for in-depth interpretation of complex phenotypes.

...read moreread less

344 citations

Journal Article•DOI•

A single-cell and spatially resolved atlas of human breast cancers.

[...]

Sunny Z. Wu¹, Sunny Z. Wu², Ghamdan Al-Eryani², Ghamdan Al-Eryani¹, Daniel L. Roden², Daniel L. Roden¹, Simon Junankar², Simon Junankar¹, Kate Harvey¹, Alma Andersson³, Aatish Thennavan⁴, Chenfei Wang⁵, James R. Torpy², James R. Torpy¹, Nenad Bartonicek¹, Nenad Bartonicek², Taopeng Wang¹, Taopeng Wang², Ludvig Larsson³, Dominik C. Kaczorowski¹, Neil I. Weisenfeld, Cedric Uytingco, Jennifer Chew, Zachary Bent, Chia-Ling Chan¹, Vikkitharan Gnanasambandapillai¹, Charles-Antoine Dutertre⁶, Charles-Antoine Dutertre⁷, Laurence Gluch, Mun N. Hui¹, Jane Beith, Andrew Parker⁸, Andrew Parker², Elizabeth Robbins⁹, Davendra Segara⁸, Caroline Cooper¹⁰, Caroline Cooper¹¹, Cindy Mak¹², Belinda Chan, Sanjay Warrier¹², Florent Ginhoux¹³, Florent Ginhoux¹⁴, Florent Ginhoux¹⁵, Ewan K.A. Millar¹⁶, Ewan K.A. Millar¹², Ewan K.A. Millar², Joseph E. Powell¹, Joseph E. Powell², Stephen R. Williams, X. Shirley Liu⁵, Sandra A O'Toole, Elgene Lim¹, Elgene Lim², Elgene Lim⁸, Joakim Lundeberg³, Charles M. Perou⁴, Alexander Swarbrick¹, Alexander Swarbrick² - Show less +54 more•Institutions (16)

Garvan Institute of Medical Research¹, University of New South Wales², Royal Institute of Technology³, University of North Carolina at Chapel Hill⁴, Harvard University⁵, French Institute of Health and Medical Research⁶, Institut Gustave Roussy⁷, St. Vincent's Health System⁸, Royal Prince Alfred Hospital⁹, Princess Alexandra Hospital¹⁰, University of Queensland¹¹, University of Sydney¹², Shanghai Jiao Tong University¹³, National University of Singapore¹⁴, Agency for Science, Technology and Research¹⁵, St George's Hospital¹⁶

01 Sep 2021-Nature Genetics

TL;DR: In this paper, a single-cell and spatially resolved transcriptomics analysis of human breast cancers is presented, which reveals recurrent neoplastic cell heterogeneity and heterotypic interactions play central roles in disease progression.

...read moreread less

Abstract: Breast cancers are complex cellular ecosystems where heterotypic interactions play central roles in disease progression and response to therapy. However, our knowledge of their cellular composition and organization is limited. Here we present a single-cell and spatially resolved transcriptomics analysis of human breast cancers. We developed a single-cell method of intrinsic subtype classification (SCSubtype) to reveal recurrent neoplastic cell heterogeneity. Immunophenotyping using cellular indexing of transcriptomes and epitopes by sequencing (CITE-seq) provides high-resolution immune profiles, including new PD-L1/PD-L2+ macrophage populations associated with clinical outcome. Mesenchymal cells displayed diverse functions and cell-surface protein expression through differentiation within three major lineages. Stromal-immune niches were spatially organized in tumors, offering insights into antitumor immune regulation. Using single-cell signatures, we deconvoluted large breast cancer cohorts to stratify them into nine clusters, termed 'ecotypes', with unique cellular compositions and clinical outcomes. This study provides a comprehensive transcriptional atlas of the cellular architecture of breast cancer.

...read moreread less

303 citations

Journal Article•DOI•

A cross-population atlas of genetic associations for 220 human phenotypes

[...]

Saori Sakaue, Masahiro Kanai, Yosuke Tanigawa¹, Juha Karjalainen, Mitja I. Kurki, Seizo Koshiba², Akira Narita², Takahiro Konuma³, Kenichi Yamamoto³, Masato Akiyama⁴, Kazuyoshi Ishigaki, Akari Suzuki, Ken Suzuki³, Wataru Obara⁵, Ken Yamaji⁶, Kazuhisa Takahashi⁶, Satoshi Asai⁷, Yasuo Takahashi⁷, Takao Suzuki, Nobuaki Shinozaki, Hiroki Yamaguchi⁸, Shiro Minami⁸, Shigeo Murayama, Kozo Yoshimori, Satoshi Nagayama⁹, Daisuke Obata¹⁰, Masahiko Higashiyama, Akihide Masumoto, Yukihiro Koretsune, FinnGen, Kaoru Ito, Chikashi Terao, Toshimasa Yamauchi¹¹, Issei Komuro¹¹, Takashi Kadowaki¹¹, Gen Tamiya, Masayuki Yamamoto², Yusuke Nakamura¹¹, Yusuke Nakamura⁹, Michiaki Kubo, Yoshinori Murakami¹¹, Kazuhiko Yamamoto, Yoichiro Kamatani¹¹, Aarno Palotie¹², Aarno Palotie¹³, Aarno Palotie¹⁴, Manuel A. Rivas¹, Mark J. Daly, Koichi Matsuda¹¹, Yukinori Okada - Show less +46 more•Institutions (14)

Stanford University¹, Tohoku University², Osaka University³, Kyushu University⁴, Iwate Medical University⁵, Juntendo University⁶, Nihon University⁷, Nippon Medical School⁸, Japanese Foundation for Cancer Research⁹, Shiga University of Medical Science¹⁰, University of Tokyo¹¹, University of Helsinki¹², Broad Institute¹³, Harvard University¹⁴

30 Sep 2021-Nature Genetics

TL;DR: In this paper, the authors conducted 220 deep-phenotype genome-wide association studies (diseases, biomarkers and medication usage) in BioBank Japan (n = 179,000), by incorporating past medical history and text-mining of electronic medical records.

...read moreread less

Abstract: Current genome-wide association studies do not yet capture sufficient diversity in populations and scope of phenotypes. To expand an atlas of genetic associations in non-European populations, we conducted 220 deep-phenotype genome-wide association studies (diseases, biomarkers and medication usage) in BioBank Japan (n = 179,000), by incorporating past medical history and text-mining of electronic medical records. Meta-analyses with the UK Biobank and FinnGen (ntotal = 628,000) identified ~5,000 new loci, which improved the resolution of the genomic map of human traits. This atlas elucidated the landscape of pleiotropy as represented by the major histocompatibility complex locus, where we conducted HLA fine-mapping. Finally, we performed statistical decomposition of matrices of phenome-wide summary statistics, and identified latent genetic components, which pinpointed responsible variants and biological mechanisms underlying current disease classifications across populations. The decomposed components enabled genetically informed subtyping of similar diseases (for example, allergic diseases). Our study suggests a potential avenue for hypothesis-free re-investigation of human diseases through genetics.

...read moreread less

291 citations

Journal Article•DOI•

Large-scale association analyses identify host factors influencing human gut microbiome composition

[...]

Alexander Kurilshikov¹, Carolina Medina-Gomez², Rodrigo Bacigalupe³, Djawad Radjabzadeh², Jun Wang⁴, Jun Wang³, Ayse Demirkan¹, Ayse Demirkan⁵, Caroline I. Le Roy⁶, Juan Antonio Raygoza Garay⁷, Casey T. Finnicum⁸, Xingrong Liu⁹, Daria V. Zhernakova¹, Marc Jan Bonder¹, Tue H. Hansen¹⁰, Fabian Frost¹¹, Malte C. Rühlemann¹², Williams Turpin⁷, Jee-Young Moon¹³, Han-Na Kim¹⁴, Kreete Lüll¹⁵, Elad Barkan¹⁶, Shiraz A. Shah¹⁷, Myriam Fornage¹⁸, Joanna Szopinska-Tokov, Zachary D. Wallen¹⁹, Dmitrii Borisevich¹⁰, Lars Agréus⁹, Anna Andreasson²⁰, Corinna Bang¹², Larbi Bedrani⁷, Jordana T. Bell⁶, Hans Bisgaard¹⁷, Michael Boehnke²¹, Dorret I. Boomsma²², Robert D. Burk¹³, Annique Claringbould¹, Kenneth Croitoru⁷, Gareth E. Davies⁸, Gareth E. Davies²², Cornelia M. van Duijn²³, Cornelia M. van Duijn², Liesbeth Duijts², Gwen Falony³, Jingyuan Fu¹, Adriaan van der Graaf¹, Torben Hansen¹⁰, Georg Homuth¹¹, David A. Hughes²⁴, Richard G. IJzerman²⁵, Matthew A. Jackson²³, Matthew A. Jackson⁶, Vincent W. V. Jaddoe², Marie Joossens³, Torben Jørgensen¹⁰, Daniel Keszthelyi²⁶, Rob Knight²⁷, Markku Laakso²⁸, Matthias Laudes, Lenore J. Launer²⁹, Wolfgang Lieb¹², Aldons J. Lusis³⁰, Ad A.M. Masclee²⁶, Henriette A. Moll², Zlatan Mujagic²⁶, Qi Qibin¹³, Daphna Rothschild¹⁶, Hocheol Shin¹⁴, Søren J. Sørensen¹⁰, Claire J. Steves⁶, Jonathan Thorsen¹⁷, Nicholas J. Timpson²⁴, Raul Y. Tito³, Sara Vieira-Silva³, Uwe Völker¹¹, Henry Völzke¹¹, Urmo Võsa¹, Kaitlin H Wade²⁴, Susanna Walter³¹, Kyoko Watanabe²², Stefan Weiss¹¹, Frank Ulrich Weiss¹¹, Omer Weissbrod³², Harm-Jan Westra¹, Gonneke Willemsen²², Haydeh Payami¹⁹, Daisy Jonkers²⁶, Alejandro Arias Vasquez³³, Eco J. C. de Geus²², Katie A. Meyer³⁴, Jakob Stokholm¹⁷, Eran Segal¹⁶, Elin Org¹⁵, Cisca Wijmenga¹, Hyung Lae Kim³⁵, Robert C. Kaplan³⁶, Tim D. Spector⁶, André G. Uitterlinden², Fernando Rivadeneira², Andre Franke¹², Markus M. Lerch¹¹, Lude Franke¹, Serena Sanna³⁷, Serena Sanna¹, Mauro D'Amato, Oluf Pedersen¹⁰, Andrew D. Paterson⁷, Robert Kraaij², Jeroen Raes³, Alexandra Zhernakova¹ - Show less +106 more•Institutions (37)

University of Groningen¹, Erasmus University Rotterdam², Katholieke Universiteit Leuven³, Chinese Academy of Sciences⁴, University of Surrey⁵, King's College London⁶, University of Toronto⁷, Avera Health⁸, Karolinska Institutet⁹, University of Copenhagen¹⁰, University of Greifswald¹¹, University of Kiel¹², Yeshiva University¹³, Sungkyunkwan University¹⁴, University of Tartu¹⁵, Weizmann Institute of Science¹⁶, Copenhagen University Hospital¹⁷, University of Texas Health Science Center at Houston¹⁸, University of Alabama at Birmingham¹⁹, Stockholm University²⁰, University of Michigan²¹, VU University Amsterdam²², University of Oxford²³, University of Bristol²⁴, University of Amsterdam²⁵, Maastricht University²⁶, University of California, San Diego²⁷, University of Eastern Finland²⁸, National Institutes of Health²⁹, University of California, Los Angeles³⁰, Linköping University³¹, Harvard University³², Radboud University Nijmegen³³, University of North Carolina at Chapel Hill³⁴, Ewha Womans University³⁵, Fred Hutchinson Cancer Research Center³⁶, National Research Council³⁷

18 Jan 2021-Nature Genetics

TL;DR: In this article, the MiBioGen consortium curated and analyzed genome-wide genotypes and 16S fecal microbiome data from 18,340 individuals (24 cohorts) and found high variability across cohorts: only 9 of 410 genera were detected in more than 95% of samples.

...read moreread less

Abstract: To study the effect of host genetics on gut microbiome composition, the MiBioGen consortium curated and analyzed genome-wide genotypes and 16S fecal microbiome data from 18,340 individuals (24 cohorts). Microbial composition showed high variability across cohorts: only 9 of 410 genera were detected in more than 95% of samples. A genome-wide association study of host genetic variation regarding microbial taxa identified 31 loci affecting the microbiome at a genome-wide significant (P < 5 × 10−8) threshold. One locus, the lactase (LCT) gene locus, reached study-wide significance (genome-wide association study signal: P = 1.28 × 10−20), and it showed an age-dependent association with Bifidobacterium abundance. Other associations were suggestive (1.95 × 10−10 < P < 5 × 10−8) but enriched for taxa showing high heritability and for genes expressed in the intestine and brain. A phenome-wide association study and Mendelian randomization identified enrichment of microbiome trait loci in the metabolic, nutrition and environment domains and suggested the microbiome might have causal effects in ulcerative colitis and rheumatoid arthritis.

...read moreread less

287 citations

Journal Article•DOI•

A genome-wide association study with 1,126,563 individuals identifies new risk loci for Alzheimer's disease.

[...]

Douglas P Wightman¹, Iris E. Jansen¹, Jeanne E. Savage¹, Alexey A. Shadrin², Shahram Bahrami³, Shahram Bahrami², Dominic Holland⁴, Arvid Rongve⁵, Sigrid Børte³, Sigrid Børte⁶, Sigrid Børte², Bendik S. Winsvold³, Bendik S. Winsvold⁶, Ole Kristian Drange⁶, Amy E Martinsen⁶, Amy E Martinsen³, Amy E Martinsen², Anne Heidi Skogholt⁶, Cristen J. Willer⁷, Geir Bråthen⁶, Ingunn Bosnes⁸, Ingunn Bosnes⁶, Jonas B. Nielsen⁶, Jonas B. Nielsen⁹, Jonas B. Nielsen⁷, Lars G. Fritsche⁷, Laurent F. Thomas⁶, Linda M. Pedersen³, Maiken Elvestad Gabrielsen⁶, Marianne Bakke Johnsen², Marianne Bakke Johnsen⁶, Marianne Bakke Johnsen³, Tore Wergeland Meisingset⁶, Wei Zhou⁷, Wei Zhou¹⁰, Petroula Proitsi¹¹, Angela Hodges¹¹, Richard Dobson, Latha Velayudhan¹¹, Karl Heilbron, Adam Auton, Julia M. Sealock¹², Lea K. Davis¹², Nancy L. Pedersen¹³, Chandra A. Reynolds¹⁴, Ida K. Karlsson¹⁵, Ida K. Karlsson¹³, Sigurdur H. Magnusson¹⁶, Hreinn Stefansson¹⁶, Steinunn Thordardottir, Palmi V. Jonsson¹⁷, Jon Snaedal, Anna Zettergren¹⁸, Ingmar Skoog¹⁸, Ingmar Skoog¹⁹, Silke Kern¹⁹, Silke Kern¹⁸, Margda Waern¹⁹, Margda Waern¹⁸, Henrik Zetterberg, Kaj Blennow¹⁹, Kaj Blennow¹⁸, Eystein Stordal⁸, Eystein Stordal⁶, Kristian Hveem⁶, John-Anker Zwart⁶, John-Anker Zwart², John-Anker Zwart³, Lavinia Athanasiu³, Lavinia Athanasiu², Per Selnes²⁰, Ingvild Saltvedt⁶, Sigrid Botne Sando⁶, Ingun Ulstein³, Srdjan Djurovic³, Srdjan Djurovic⁵, Tormod Fladby², Tormod Fladby²⁰, Dag Aarsland¹¹, Dag Aarsland²¹, Geir Selbæk², Geir Selbæk³, Stephan Ripke¹⁰, Stephan Ripke²², Stephan Ripke²³, Kari Stefansson¹⁶, Ole A. Andreassen³, Ole A. Andreassen², Danielle Posthuma¹, Danielle Posthuma²⁴ - Show less +86 more•Institutions (24)

01 Sep 2021-Nature Genetics

TL;DR: This paper identified microglia, immune cells and protein catabolism as relevant genes for late-onset Alzheimer's disease, while identifying and prioritizing previously unidentified genes of potential interest.

...read moreread less

Abstract: Late-onset Alzheimer's disease is a prevalent age-related polygenic disease that accounts for 50-70% of dementia cases. Currently, only a fraction of the genetic variants underlying Alzheimer's disease have been identified. Here we show that increased sample sizes allowed identification of seven previously unidentified genetic loci contributing to Alzheimer's disease. This study highlights microglia, immune cells and protein catabolism as relevant to late-onset Alzheimer's disease, while identifying and prioritizing previously unidentified genes of potential interest. We anticipate that these results can be included in larger meta-analyses of Alzheimer's disease to identify further genetic variants that contribute to Alzheimer's pathology.

...read moreread less

269 citations

Journal Article•DOI•

Genetics of 35 blood and urine biomarkers in the UK Biobank

[...]

Nasa Sinnott-Armstrong¹, Nasa Sinnott-Armstrong², Nasa Sinnott-Armstrong³, Yosuke Tanigawa², David Amar², David Amar⁴, Nina Mars¹, Christian Benner¹, Matthew Aguirre², Guhan Venkataraman², Michael Wainberg², Hanna Ollila⁵, Hanna Ollila¹, Hanna Ollila², Tuomo Kiiskinen⁶, Tuomo Kiiskinen¹, Aki S. Havulinna¹, Aki S. Havulinna⁶, James P. Pirruccello⁷, James P. Pirruccello⁵, Junyang Qian², Anna Shcherbina¹, Anna Shcherbina⁴, FinnGen⁴, Fatima Rodriguez⁴, Themistocles L. Assimes⁴, Themistocles L. Assimes³, Vineeta Agarwala⁴, Robert Tibshirani², Trevor Hastie², Samuli Ripatti¹, Samuli Ripatti⁷, Jonathan K. Pritchard², Mark J. Daly⁵, Mark J. Daly¹, Mark J. Daly⁷, Manuel A. Rivas² - Show less +33 more•Institutions (7)

University of Helsinki¹, Stanford University², VA Palo Alto Healthcare System³, Cardiovascular Institute of the South⁴, Harvard University⁵, National Institute for Health and Welfare⁶, Broad Institute⁷

18 Jan 2021-Nature Genetics

TL;DR: In this article, the genetic basis of 35 blood and urine laboratory measurements in the UK Biobank (n = 363,228 individuals) was evaluated and the results delineate the genetic underlying of biomarkers and their causal influences on diseases and improve genetic risk stratification for common diseases.

...read moreread less

Abstract: Clinical laboratory tests are a critical component of the continuum of care. We evaluate the genetic basis of 35 blood and urine laboratory measurements in the UK Biobank (n = 363,228 individuals). We identify 1,857 loci associated with at least one trait, containing 3,374 fine-mapped associations and additional sets of large-effect (>0.1 s.d.) protein-altering, human leukocyte antigen (HLA) and copy number variant (CNV) associations. Through Mendelian randomization (MR) analysis, we discover 51 causal relationships, including previously known agonistic effects of urate on gout and cystatin C on stroke. Finally, we develop polygenic risk scores (PRSs) for each biomarker and build 'multi-PRS' models for diseases using 35 PRSs simultaneously, which improved chronic kidney disease, type 2 diabetes, gout and alcoholic cirrhosis genetic risk stratification in an independent dataset (FinnGen; n = 135,500) relative to single-disease PRSs. Together, our results delineate the genetic basis of biomarkers and their causal influences on diseases and improve genetic risk stratification for common diseases.

...read moreread less

262 citations

Journal Article•DOI•

Computationally efficient whole-genome regression for quantitative and binary traits.

[...]

Joelle Mbatchou, Leland Barnard, Joshua D. Backman, Anthony Marcketta, Jack A. Kosmicki, Andrey Ziyatdinov, Christian Benner, Colm O'Dushlaine, Mathew Barber, Boris Boutkov, Lukas Habegger, Manuel A. R. Ferreira, Aris Baras, Jeffrey S. Reid, Gonçalo R. Abecasis, Evan Maxwell, Jonathan Marchini - Show less +13 more

20 May 2021-Nature Genetics

TL;DR: RegenerIE as mentioned in this paper is a whole-genome regression method based on ridge regression that enables highly parallelized analysis of quantitative and binary traits in biobank-scale data with reduced computational requirements.

...read moreread less

Abstract: Genome-wide association analysis of cohorts with thousands of phenotypes is computationally expensive, particularly when accounting for sample relatedness or population structure. Here we present a novel machine-learning method called REGENIE for fitting a whole-genome regression model for quantitative and binary phenotypes that is substantially faster than alternatives in multi-trait analyses while maintaining statistical efficiency. The method naturally accommodates parallel analysis of multiple phenotypes and requires only local segments of the genotype matrix to be loaded in memory, in contrast to existing alternatives, which must load genome-wide matrices into memory. This results in substantial savings in compute time and memory usage. We introduce a fast, approximate Firth logistic regression test for unbalanced case–control phenotypes. The method is ideally suited to take advantage of distributed computing frameworks. We demonstrate the accuracy and computational benefits of this approach using the UK Biobank dataset with up to 407,746 individuals. REGENIE is a whole-genome regression method based on ridge regression that enables highly parallelized analysis of quantitative and binary traits in biobank-scale data with reduced computational requirements.

...read moreread less

239 citations

Journal Article•DOI•

Base-resolution models of transcription-factor binding reveal soft motif syntax.

[...]

Žiga Avsec¹, Žiga Avsec², Melanie Weilert³, Avanti Shrikumar⁴, Sabrina Krueger³, Amr Alexandari⁴, Khyati Dalal⁵, Khyati Dalal³, Robin Fropf³, Charles McAnany³, Julien Gagneur¹, Anshul Kundaje⁴, Julia Zeitlinger⁵, Julia Zeitlinger³ - Show less +10 more•Institutions (5)

Technische Universität München¹, Ludwig Maximilian University of Munich², Stowers Institute for Medical Research³, Stanford University⁴, University of Kansas⁵

18 Feb 2021-Nature Genetics

TL;DR: BPNet as discussed by the authors uses DNA sequence to predict base-resolution chromatin immunoprecipitation (ChIP)-nexus binding profiles of pluripotency transcription factor (TF) binding motifs.

...read moreread less

Abstract: The arrangement (syntax) of transcription factor (TF) binding motifs is an important part of the cis-regulatory code, yet remains elusive. We introduce a deep learning model, BPNet, that uses DNA sequence to predict base-resolution chromatin immunoprecipitation (ChIP)-nexus binding profiles of pluripotency TFs. We develop interpretation tools to learn predictive motif representations and identify soft syntax rules for cooperative TF binding interactions. Strikingly, Nanog preferentially binds with helical periodicity, and TFs often cooperate in a directional manner, which we validate using clustered regularly interspaced short palindromic repeat (CRISPR)-induced point mutations. Our model represents a powerful general approach to uncover the motifs and syntax of cis-regulatory sequences in genomics data.

...read moreread less

229 citations

Journal Article•DOI•

Chromothripsis as an on-target consequence of CRISPR-Cas9 genome editing.

[...]

Mitchell L. Leibowitz¹, Stamatis Papathanasiou², Phillip A. Doerfler³, Logan J. Blaine², Lili Sun², Yu Yao³, Cheng-Zhong Zhang², Mitchell J. Weiss³, David Pellman², David Pellman¹ - Show less +6 more•Institutions (3)

Howard Hughes Medical Institute¹, Harvard University², St. Jude Children's Research Hospital³

12 Apr 2021-Nature Genetics

TL;DR: In this paper, it was shown that CRISPR-Cas9 editing generates structural defects of the nucleus, micronuclei and chromosome bridges, which initiate a mutational process called chromothripsis.

...read moreread less

Abstract: Genome editing has therapeutic potential for treating genetic diseases and cancer. However, the currently most practicable approaches rely on the generation of DNA double-strand breaks (DSBs), which can give rise to a poorly characterized spectrum of chromosome structural abnormalities. Here, using model cells and single-cell whole-genome sequencing, as well as by editing at a clinically relevant locus in clinically relevant cells, we show that CRISPR-Cas9 editing generates structural defects of the nucleus, micronuclei and chromosome bridges, which initiate a mutational process called chromothripsis. Chromothripsis is extensive chromosome rearrangement restricted to one or a few chromosomes that can cause human congenital disease and cancer. These results demonstrate that chromothripsis is a previously unappreciated on-target consequence of CRISPR-Cas9-generated DSBs. As genome editing is implemented in the clinic, the potential for extensive chromosomal rearrangements should be considered and monitored.

...read moreread less

Journal Article•DOI•

Trans-ancestry genome-wide association meta-analysis of prostate cancer identifies new susceptibility loci and informs genetic risk prediction

[...]

David V. Conti¹, Burcu F. Darst¹, Lilit C. Moss¹, Edward J. Saunders² +251 more•Institutions (100)

04 Jan 2021-Nature Genetics

TL;DR: This paper conducted a meta-analysis of prostate cancer genome-wide association studies (107,247 cases and 127,006 controls) and identified 86 new genetic risk variants independently associated with prostate cancer risk, bringing the total to 269 known risk variants.

...read moreread less

Abstract: Prostate cancer is a highly heritable disease with large disparities in incidence rates across ancestry populations. We conducted a multiancestry meta-analysis of prostate cancer genome-wide association studies (107,247 cases and 127,006 controls) and identified 86 new genetic risk variants independently associated with prostate cancer risk, bringing the total to 269 known risk variants. The top genetic risk score (GRS) decile was associated with odds ratios that ranged from 5.06 (95% confidence interval (CI), 4.84–5.29) for men of European ancestry to 3.74 (95% CI, 3.36–4.17) for men of African ancestry. Men of African ancestry were estimated to have a mean GRS that was 2.18-times higher (95% CI, 2.14–2.22), and men of East Asian ancestry 0.73-times lower (95% CI, 0.71–0.76), than men of European ancestry. These findings support the role of germline variation contributing to population differences in prostate cancer risk, with the GRS offering an approach for personalized risk prediction.

...read moreread less

Journal Article•DOI•

Genome-wide meta-analysis, fine-mapping and integrative prioritization implicate new Alzheimer’s disease risk genes

[...]

Jeremy Schwartzentruber¹, Jeremy Schwartzentruber², Sarah Cooper², Jimmy Z. Liu³, Inigo Barrio-Hernandez¹, Erica Bello², Natsuhiko Kumasaka², Adam Young⁴, Robin J.M. Franklin⁴, Toby Johnson, K. Estrada⁵, Daniel J. Gaffney², Pedro Beltrao¹, Andrew R. Bassett² - Show less +10 more•Institutions (5)

European Bioinformatics Institute¹, Wellcome Trust Sanger Institute², Biogen Idec³, University of Cambridge⁴, Rafael Advanced Defense Systems⁵

15 Feb 2021-Nature Genetics

TL;DR: In this paper, the authors performed an updated genome-wide AD meta-analysis, which identified 37 risk loci, including new associations near CCDC6, TSPAN14, NCK2 and SPRED2.

...read moreread less

Abstract: Genome-wide association studies have discovered numerous genomic loci associated with Alzheimer's disease (AD); yet the causal genes and variants are incompletely identified. We performed an updated genome-wide AD meta-analysis, which identified 37 risk loci, including new associations near CCDC6, TSPAN14, NCK2 and SPRED2. Using three SNP-level fine-mapping methods, we identified 21 SNPs with >50% probability each of being causally involved in AD risk and others strongly suggested by functional annotation. We followed this with colocalization analyses across 109 gene expression quantitative trait loci datasets and prioritization of genes by using protein interaction networks and tissue-specific expression. Combining this information into a quantitative score, we found that evidence converged on likely causal genes, including the above four genes, and those at previously discovered AD loci, including BIN1, APH1B, PTK2B, PILRA and CASS4.

...read moreread less

Journal Article•DOI•

The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation.

[...]

Samuel A. Lambert, Laurent Gil¹, Laurent Gil², Laurent Gil³, Simon Jupp⁴, Scott C. Ritchie, Yu Xu³, Yu Xu², Annalisa Buniello⁴, Aoife McMahon⁴, Gad Abraham⁵, Gad Abraham³, Michael A Chapman³, Michael A Chapman², Michael A Chapman¹, Helen Parkinson⁴, Helen Parkinson³, John Danesh, Jacqueline A. L. MacArthur⁴, Michael Inouye - Show less +16 more•Institutions (5)

Wellcome Trust Sanger Institute¹, British Heart Foundation², University of Cambridge³, European Bioinformatics Institute⁴, Baker IDI Heart and Diabetes Institute⁵

01 Apr 2021-Nature Genetics

TL;DR: The Polygenic Score (PGS) catalog as discussed by the authors is an open resource of published scores (including variants, alleles and weights) and consistently curated metadata required for reproducibility and independent applications.

...read moreread less

Abstract: We present the Polygenic Score (PGS) Catalog ( https://www.PGSCatalog.org ), an open resource of published scores (including variants, alleles and weights) and consistently curated metadata required for reproducibility and independent applications. The PGS Catalog has capabilities for user deposition, expert curation and programmatic access, thus providing the community with a platform for PGS dissemination, research and translation.

...read moreread less

Journal Article•DOI•

Advancing human genetics research and drug discovery through exome sequencing of the UK Biobank.

[...]

Joseph D. Szustakowski¹, Suganthi Balasubramanian², Erika Kvikstad¹, Shareef Khalid², Paola G. Bronson³, Ariella Sasson¹, Emily Wong⁴, Daren Liu², J. Wade Davis, Carolina Haefliger⁵, A. Katrina Loomis⁶, Rajesh Mikkilineni⁴, Hyun Ji Noh, Samir Wadhawan¹, Xiaodong Bai², Alicia Hawes², Olga Krasheninina², Ricardo Ulloa², Alex Lopez², E. N. Smith⁴, Jeffrey F. Waring, Christopher D. Whelan³, Ellen A. Tsai³, John D. Overton², William J. Salerno², Howard J. Jacob, Sándor Szalma⁴, Heiko Runz³, Gregory Hinkle⁷, Paul Nioi⁷, Slavé Petrovski⁵, Melissa R. Miller⁶, Aris Baras², Lyndon J. Mitnaul², Jeffrey G. Reid² - Show less +31 more•Institutions (7)

Bristol-Myers Squibb¹, Regeneron², Biogen Idec³, Takeda Pharmaceutical Company⁴, AstraZeneca⁵, Pfizer⁶, Alnylam Pharmaceuticals⁷

28 Jun 2021-Nature Genetics

TL;DR: The UK Biobank Exome Sequencing Consortium (UKB-ESC) as mentioned in this paper is a private-public partnership between the UK Biopartition and eight biopharmaceutical companies that will complete the sequencing of exomes for all ~500,000 UKB participants.

...read moreread less

Abstract: The UK Biobank Exome Sequencing Consortium (UKB-ESC) is a private–public partnership between the UK Biobank (UKB) and eight biopharmaceutical companies that will complete the sequencing of exomes for all ~500,000 UKB participants. Here, we describe the early results from ~200,000 UKB participants and the features of this project that enabled its success. The biopharmaceutical industry has increasingly used human genetics to improve success in drug discovery. Recognizing the need for large-scale human genetics data, as well as the unique value of the data access and contribution terms of the UKB, the UKB-ESC was formed. As a result, exome data from 200,643 UKB enrollees are now available. These data include ~10 million exonic variants—a rich resource of rare coding variation that is particularly valuable for drug discovery. The UKB-ESC precompetitive collaboration has further strengthened academic and industry ties and has provided teams with an opportunity to interact with and learn from the wider research community. The UK Biobank Exome Sequencing Consortium aims to sequence all the exomes of approximately 500,000 UK Biobank participants. This Perspective describes the results from approximately 200,000 exomes and discusses the lessons learned from this UK Biobank–biopharmaceutical company collaboration.

...read moreread less

Journal Article•DOI•

The trans-ancestral genomic architecture of glycemic traits

[...]

Ji Chen¹, Ji Chen², Cassandra N. Spracklen³, Cassandra N. Spracklen⁴ +475 more•Institutions (146)

31 May 2021-Nature Genetics

TL;DR: This paper aggregated genome-wide association studies comprising up to 281,416 individuals without diabetes (30% non-European ancestry) for whom fasting glucose, 2-h glucose after an oral glucose challenge, glycated hemoglobin and fasting insulin data were available.

...read moreread less

Abstract: Glycemic traits are used to diagnose and monitor type 2 diabetes and cardiometabolic health. To date, most genetic studies of glycemic traits have focused on individuals of European ancestry. Here we aggregated genome-wide association studies comprising up to 281,416 individuals without diabetes (30% non-European ancestry) for whom fasting glucose, 2-h glucose after an oral glucose challenge, glycated hemoglobin and fasting insulin data were available. Trans-ancestry and single-ancestry meta-analyses identified 242 loci (99 novel; P < 5 × 10-8), 80% of which had no significant evidence of between-ancestry heterogeneity. Analyses restricted to individuals of European ancestry with equivalent sample size would have led to 24 fewer new loci. Compared with single-ancestry analyses, equivalent-sized trans-ancestry fine-mapping reduced the number of estimated variants in 99% credible sets by a median of 37.5%. Genomic-feature, gene-expression and gene-set analyses revealed distinct biological signatures for each trait, highlighting different underlying biological pathways. Our results increase our understanding of diabetes pathophysiology by using trans-ancestry studies for improved power and resolution.

...read moreread less

Journal Article•DOI•

Ultrafast Sample placement on Existing tRees (UShER) enables real-time phylogenetics for the SARS-CoV-2 pandemic.

[...]

Yatish Turakhia¹, Bryan Thornlow¹, Angie S. Hinrichs¹, Nicola De Maio², Landen Gozashti¹, Landen Gozashti³, Robert Lanfear⁴, David Haussler¹, Russell Corbett-Detig¹, Russell Corbett-Detig⁵ - Show less +6 more•Institutions (5)

University of California, Santa Cruz¹, European Bioinformatics Institute², Harvard University³, Australian National University⁴, National Research University – Higher School of Economics⁵

10 May 2021-Nature Genetics

TL;DR: In this article, a tree-based data structure encoding the inferred evolutionary history of the SARS-CoV-2 virus was proposed to enable real-time genomic contact tracing, which greatly improves the speed of phylogenetic placement of new samples and data visualization.

...read moreread less

Abstract: As the SARS-CoV-2 virus spreads through human populations, the unprecedented accumulation of viral genome sequences is ushering in a new era of 'genomic contact tracing'-that is, using viral genomes to trace local transmission dynamics. However, because the viral phylogeny is already so large-and will undoubtedly grow many fold-placing new sequences onto the tree has emerged as a barrier to real-time genomic contact tracing. Here, we resolve this challenge by building an efficient tree-based data structure encoding the inferred evolutionary history of the virus. We demonstrate that our approach greatly improves the speed of phylogenetic placement of new samples and data visualization, making it possible to complete the placements under the constraints of real-time contact tracing. Thus, our method addresses an important need for maintaining a fully updated reference phylogeny. We make these tools available to the research community through the University of California Santa Cruz SARS-CoV-2 Genome Browser to enable rapid cross-referencing of information in new virus sequences with an ever-expanding array of molecular and structural biology data. The methods described here will empower research and genomic contact tracing for SARS-CoV-2 specifically for laboratories worldwide.

...read moreread less

Journal Article•DOI•

Mapping the temporal and spatial dynamics of the human endometrium in vivo and in vitro.

[...]

Luz Garcia-Alonso¹, Louis-François Handfield¹, Kenny Roberts¹, Konstantina Nikolakopoulou², Konstantina Nikolakopoulou³, Ridma C. Fernando³, Ridma C. Fernando², Lucy Gardner², Benjamin Woodhams¹, Benjamin Woodhams⁴, Anna Arutyunyan¹, Anna Arutyunyan², Krzysztof Polanski¹, Regina Hoo¹, Regina Hoo², Carmen Sancho-Serra¹, Tong Li¹, Kwasi Kwakwa⁴, Elizabeth Tuck¹, Valentina Lorenzi¹, Hassan Massalha², Hassan Massalha¹, Martin Prete¹, Vitalii Kleshchevnikov¹, Aleksandra Tarkowska¹, Tarryn Porter¹, Cecilia Icoresi Mazzeo¹, Stijn van Dongen¹, Monika Dabrowska¹, Vasyl Vaskivskyi¹, Krishnaa T. Mahbubani², Jong-Eun Park¹, Mercedes Jimenez-Linan⁵, Lia S. Campos¹, Vladimir Yu. Kiselev¹, Cecilia Lindskog⁶, Paul Ayuk⁷, Elena Prigmore¹, Michael R. Stratton¹, Kourosh Saeb-Parsy², Ashley Moffett², Luiza Moore¹, Luiza Moore⁵, Omer Ali Bayraktar¹, Sarah A. Teichmann¹, Sarah A. Teichmann², Margherita Y. Turco², Margherita Y. Turco³, Roser Vento-Tormo¹, Roser Vento-Tormo² - Show less +46 more•Institutions (7)

Wellcome Trust Sanger Institute¹, University of Cambridge², Friedrich Miescher Institute for Biomedical Research³, European Bioinformatics Institute⁴, Cambridge University Hospitals NHS Foundation Trust⁵, Science for Life Laboratory⁶, Newcastle upon Tyne Hospitals NHS Foundation Trust⁷

02 Dec 2021-Nature Genetics

TL;DR: In this paper, the authors dissect the signaling pathways that determine cell fate of the epithelial lineages in the lumenal and glandular microenvironments of the endometrium.

...read moreread less

Abstract: The endometrium, the mucosal lining of the uterus, undergoes dynamic changes throughout the menstrual cycle in response to ovarian hormones. We have generated dense single-cell and spatial reference maps of the human uterus and three-dimensional endometrial organoid cultures. We dissect the signaling pathways that determine cell fate of the epithelial lineages in the lumenal and glandular microenvironments. Our benchmark of the endometrial organoids reveals the pathways and cell states regulating differentiation of the secretory and ciliated lineages both in vivo and in vitro. In vitro downregulation of WNT or NOTCH pathways increases the differentiation efficiency along the secretory and ciliated lineages, respectively. We utilize our cellular maps to deconvolute bulk data from endometrial cancers and endometriotic lesions, illuminating the cell types dominating in each of these disorders. These mechanistic insights provide a platform for future development of treatments for common conditions including endometriosis and endometrial carcinoma. Single-cell and spatial transcriptomic profiling of the human endometrium highlights pathways governing the proliferative and secretory phases of the menstrual cycle. Analyses of endometrial organoids show that WNT and NOTCH signaling modulate differentiation into the secretory and ciliated epithelial lineages, respectively.

...read moreread less

Journal Article•DOI•

Trans-ancestry analysis reveals genetic and nongenetic associations with COVID-19 susceptibility and severity.

[...]

Janie F. Shelton, Anjali J. Shastri, Chelsea Ye, Catherine H. Weldon, Teresa Filshtein-Sonmez, Daniella Coker, Antony Symons, Jorge Esparza-Gordillo, Stella Aslibekyan, Adam Auton - Show less +6 more

22 Apr 2021-Nature Genetics

TL;DR: In this paper, a study of 1,051,032 23andMe research participants was conducted to identify genetic and nongenetic associations with testing positive for SARS-CoV-2, respiratory symptoms and hospitalization.

...read moreread less

Abstract: COVID-19 presents with a wide range of severity, from asymptomatic in some individuals to fatal in others. Based on a study of 1,051,032 23andMe research participants, we report genetic and nongenetic associations with testing positive for SARS-CoV-2, respiratory symptoms and hospitalization. Using trans-ancestry genome-wide association studies, we identified a strong association between blood type and COVID-19 diagnosis, as well as a gene-rich locus on chromosome 3p21.31 that is more strongly associated with outcome severity. Hospitalization risk factors include advancing age, male sex, obesity, lower socioeconomic status, non-European ancestry and preexisting cardiometabolic conditions. While non-European ancestry was a significant risk factor for hospitalization after adjusting for sociodemographics and preexisting health conditions, we did not find evidence that these two primary genetic associations explain risk differences between populations for severe COVID-19 outcomes.

...read moreread less

Journal Article•DOI•

Genome-wide CRISPR screening identifies TMEM106B as a proviral host factor for SARS-CoV-2.

[...]

Jim Baggen¹, Leentje Persoons¹, Els Vanstreels¹, Sander Jansen¹, Dominique Van Looveren¹, Bram Boeckx¹, Vincent Geudens¹, Julie De Man¹, Dirk Jochmans¹, Joost Wauters¹, Els Wauters¹, Bart M. Vanaudenaerde¹, Diether Lambrechts¹, Johan Neyts¹, Kai Dallmeier¹, Hendrik Jan Thibaut¹, Maarten Jacquemyn¹, Piet Maes¹, Dirk Daelemans¹ - Show less +15 more•Institutions (1)

Katholieke Universiteit Leuven¹

08 Mar 2021-Nature Genetics

TL;DR: It is discovered that SARS-CoV-2 requires the lysosomal protein TMEM106B to infect human cell lines and primary lung cells, and single-cell RNA-sequencing of airway cells from patients with COVID-19 demonstrated that TMEM 106B expression correlates with Sars-Cov-2 infection.

...read moreread less

Abstract: The ongoing COVID-19 pandemic has caused a global economic and health crisis. To identify host factors essential for coronavirus infection, we performed genome-wide functional genetic screens with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and human coronavirus 229E. These screens uncovered virus-specific as well as shared host factors, including TMEM41B and PI3K type 3. We discovered that SARS-CoV-2 requires the lysosomal protein TMEM106B to infect human cell lines and primary lung cells. TMEM106B overexpression enhanced SARS-CoV-2 infection as well as pseudovirus infection, suggesting a role in viral entry. Furthermore, single-cell RNA-sequencing of airway cells from patients with COVID-19 demonstrated that TMEM106B expression correlates with SARS-CoV-2 infection. The present study uncovered a collection of coronavirus host factors that may be exploited to develop drugs against SARS-CoV-2 infection or future zoonotic coronavirus outbreaks.

...read moreread less

Journal Article•DOI•

Single-nucleus chromatin accessibility and transcriptomic characterization of Alzheimer's disease.

[...]

Samuel Morabito¹, Emily Miyoshi¹, Neethu Michael¹, Saba Shahin¹, Alessandra C. Martini¹, Elizabeth Head¹, Justine Silva¹, Kelsey Leavy¹, Mari Perez-Rosendahl¹, Vivek Swarup¹ - Show less +6 more•Institutions (1)

University of California, Irvine¹

08 Jul 2021-Nature Genetics

TL;DR: In this paper, a multi-omic single-nucleus study of 191,890 nuclei in late-stage Alzheimer's disease (AD), accessible through their web portal, profiling chromatin accessibility and gene expression in the same biological samples and uncovering vast cellular heterogeneity.

...read moreread less

Abstract: The gene-regulatory landscape of the brain is highly dynamic in health and disease, coordinating a menagerie of biological processes across distinct cell types. Here, we present a multi-omic single-nucleus study of 191,890 nuclei in late-stage Alzheimer’s disease (AD), accessible through our web portal, profiling chromatin accessibility and gene expression in the same biological samples and uncovering vast cellular heterogeneity. We identified cell-type-specific, disease-associated candidate cis-regulatory elements and their candidate target genes, including an oligodendrocyte-associated regulatory module containing links to APOE and CLU. We describe cis-regulatory relationships in specific cell types at a subset of AD risk loci defined by genome-wide association studies, demonstrating the utility of this multi-omic single-nucleus approach. Trajectory analysis of glial populations identified disease-relevant transcription factors, such as SREBF1, and their regulatory targets. Finally, we introduce single-nucleus consensus weighted gene coexpression analysis, a coexpression network analysis strategy robust to sparse single-cell data, and perform a systems-level analysis of the AD transcriptome. An integrative analysis of single-nucleus assay for transposase-accessible chromatin with sequencing and RNA sequencing in normal and Alzheimer’s disease brain tissue identifies cell-type-specific cis-regulatory elements and candidate target genes at disease-associated loci.

...read moreread less

Journal Article•DOI•

Common genetic variants and modifiable risk factors underpin hypertrophic cardiomyopathy susceptibility and expressivity

[...]

Andrew R. Harper¹, Anuj Goel¹, Christopher Grace¹, K Thomson¹, K Thomson², Steffen E. Petersen³, Xiaoling Xu⁴, Adam Waring¹, Elizabeth Ormondroyd¹, Christopher M. Kramer⁵, Carolyn Y. Ho⁶, Stefan Neubauer¹, Rafik Tadros⁷, James S. Ware⁴, Connie R. Bezzina, Martin Farrall¹, Hugh Watkins⁸, Hugh Watkins¹ - Show less +14 more•Institutions (8)

University of Oxford¹, Churchill Hospital², Queen Mary University of London³, National Institutes of Health⁴, University of Virginia Health System⁵, Brigham and Women's Hospital⁶, Montreal Heart Institute⁷, John Radcliffe Hospital⁸

25 Jan 2021-Nature Genetics

TL;DR: In this article, a genome-wide association study of 2,780 cases and 47,486 controls was conducted to identify 12 genome wide-significant susceptibility loci for HCM, and Mendelian randomization identified diastolic blood pressure as a key modifiable risk factor for sarcomere-negative HCM.

...read moreread less

Abstract: Hypertrophic cardiomyopathy (HCM) is a common, serious, genetic heart disorder. Rare pathogenic variants in sarcomere genes cause HCM, but with unexplained phenotypic heterogeneity. Moreover, most patients do not carry such variants. We report a genome-wide association study of 2,780 cases and 47,486 controls that identified 12 genome-wide-significant susceptibility loci for HCM. Single-nucleotide polymorphism heritability indicated a strong polygenic influence, especially for sarcomere-negative HCM (64% of cases; h2g = 0.34 ± 0.02). A genetic risk score showed substantial influence on the odds of HCM in a validation study, halving the odds in the lowest quintile and doubling them in the highest quintile, and also influenced phenotypic severity in sarcomere variant carriers. Mendelian randomization identified diastolic blood pressure (DBP) as a key modifiable risk factor for sarcomere-negative HCM, with a one standard deviation increase in DBP increasing the HCM risk fourfold. Common variants and modifiable risk factors have important roles in HCM that we suggest will be clinically actionable.

...read moreread less

Journal Article•DOI•

An open approach to systematically prioritize causal variants and genes at all published human GWAS trait-associated loci.

[...]

Edward Mountjoy¹, Ellen M. Schmidt¹, Miguel Carmona, Jeremy Schwartzentruber¹, Jeremy Schwartzentruber², Gareth Peat², Alfredo Miranda², Luca Fumis², James D. Hayhurst², Annalisa Buniello², Mohd Anisul Karim¹, Daniel Wright¹, Andrew Hercules², Eliseo Papa³, Eric B. Fauman⁴, Jeffrey C. Barrett¹, John A. Todd⁵, David Ochoa², Ian Dunham¹, Ian Dunham², Maya Ghoussaini¹ - Show less +17 more•Institutions (5)

Wellcome Trust Sanger Institute¹, European Bioinformatics Institute², Biogen Idec³, Pfizer⁴, University of Oxford⁵

28 Oct 2021-Nature Genetics

TL;DR: Open Targets Genetics as discussed by the authors is a community resource that provides systematic fine mapping at human GWAS loci, enabling users to prioritize genes at disease-associated regions and assess their potential as drug targets.

...read moreread less

Abstract: Genome-wide association studies (GWASs) have identified many variants associated with complex traits, but identifying the causal gene(s) is a major challenge. In the present study, we present an open resource that provides systematic fine mapping and gene prioritization across 133,441 published human GWAS loci. We integrate genetics (GWAS Catalog and UK Biobank) with transcriptomic, proteomic and epigenomic data, including systematic disease–disease and disease–molecular trait colocalization results across 92 cell types and tissues. We identify 729 loci fine mapped to a single-coding causal variant and colocalized with a single gene. We trained a machine-learning model using the fine-mapped genetics and functional genomics data and 445 gold-standard curated GWAS loci to distinguish causal genes from neighboring genes, outperforming a naive distance-based model. Our prioritized genes were enriched for known approved drug targets (odds ratio = 8.1, 95% confidence interval = 5.7, 11.5). These results are publicly available through a web portal ( http://genetics.opentargets.org ), enabling users to easily prioritize genes at disease-associated loci and assess their potential as drug targets. Open Targets Genetics is a community resource that provides systematic fine mapping at human GWAS loci, enabling users to prioritize genes at disease-associated regions and assess their potential as drug targets.

...read moreread less

Journal Article•DOI•

Genomic and phenotypic insights from an atlas of genetic effects on DNA methylation

[...]

J L Min¹, Gibran Hemani¹, Eilis Hannon², Koen F. Dekkers³ +173 more•Institutions (53)

06 Sep 2021-Nature Genetics

TL;DR: In this paper, the results of DNAm quantitative trait locus (mQTL) analyses on 32,851 participants were presented, identifying genetic variants associated with DNA methylation at 420,509 DNAm sites in blood.

...read moreread less

Abstract: Characterizing genetic influences on DNA methylation (DNAm) provides an opportunity to understand mechanisms underpinning gene regulation and disease. In the present study, we describe results of DNAm quantitative trait locus (mQTL) analyses on 32,851 participants, identifying genetic variants associated with DNAm at 420,509 DNAm sites in blood. We present a database of >270,000 independent mQTLs, of which 8.5% comprise long-range (trans) associations. Identified mQTL associations explain 15-17% of the additive genetic variance of DNAm. We show that the genetic architecture of DNAm levels is highly polygenic. Using shared genetic control between distal DNAm sites, we constructed networks, identifying 405 discrete genomic communities enriched for genomic annotations and complex traits. Shared genetic variants are associated with both DNAm levels and complex diseases, but only in a minority of cases do these associations reflect causal relationships from DNAm to trait or vice versa, indicating a more complex genotype-phenotype map than previously anticipated.

...read moreread less

Journal Article•DOI•

Genome sequencing analysis identifies new loci associated with Lewy body dementia and provides insights into its genetic architecture

[...]

Ruth Chia¹, Marya S. Sabir, Sara Bandres-Ciga¹, Sara Saez-Atienzar¹ +163 more•Institutions (55)

15 Feb 2021-Nature Genetics

TL;DR: This article performed whole-genome sequencing in large cohorts of Lewy body dementia (LBD) cases and neurologically healthy controls to study the genetic architecture of this understudied form of dementia, and to generate a resource for the scientific community.

...read moreread less

Abstract: The genetic basis of Lewy body dementia (LBD) is not well understood. Here, we performed whole-genome sequencing in large cohorts of LBD cases and neurologically healthy controls to study the genetic architecture of this understudied form of dementia, and to generate a resource for the scientific community. Genome-wide association analysis identified five independent risk loci, whereas genome-wide gene-aggregation tests implicated mutations in the gene GBA. Genetic risk scores demonstrate that LBD shares risk profiles and pathways with Alzheimer's disease and Parkinson's disease, providing a deeper molecular understanding of the complex genetic architecture of this age-related neurodegenerative condition.

...read moreread less

Journal Article•DOI•

Shared genetic pathways contribute to risk of hypertrophic and dilated cardiomyopathies with opposite directions of effect

[...]

Rafik Tadros¹, Rafik Tadros², Catherine Francis³, Catherine Francis⁴, Xiao Xu⁵, Alexa M.C. Vermeer¹, Andrew R. Harper⁶, Roy Huurman⁷, Ken Kelu Bisabu², Roddy Walsh¹, Edgar T. Hoorntje⁸, Wouter P. te Rijdt⁸, Rachel Buchan³, Rachel Buchan⁴, Hannah G. van Velzen⁷, Marjon van Slegtenhorst⁷, Jentien M Vermeulen¹, Joost A. Offerhaus¹, Wenjia Bai⁵, Antonio de Marvao⁵, Najim Lahrouchi¹, Leander Beekman¹, Jacco C. Karper⁸, Jan H. Veldink⁹, Elham Kayvanpour¹⁰, Antonis Pantazis³, A. John Baksi⁴, A. John Baksi³, Nicola Whiffin⁴, Nicola Whiffin³, Nicola Whiffin⁵, Francesco Mazzarotto, Geraldine Sloane³, Geraldine Sloane⁴, Hideaki Suzuki¹¹, Hideaki Suzuki⁵, Deborah Schneider-Luftman⁵, Deborah Schneider-Luftman¹², Paul Elliott⁵, Pascale Richard¹³, Flavie Ader¹³, Eric Villard¹³, Peter Lichtner, Thomas Meitinger¹⁴, Michael W.T. Tanck¹, J. Peter van Tintelen¹, J. Peter van Tintelen⁹, Andrew Thain¹⁵, David McCarty¹⁵, Robert A. Hegele¹⁵, Jason D. Roberts¹⁵, Julie Amyot², Marie-Pierre Dubé², Julia Cadrin-Tourigny², Geneviève Giraldeau², Philippe L. L’Allier², Patrick Garceau², Jean-Claude Tardif², S. Matthijs Boekholdt¹, R. Thomas Lumbers¹⁶, Folkert W. Asselbergs¹⁶, Folkert W. Asselbergs⁹, Paul J.R. Barton³, Paul J.R. Barton⁴, Stuart A. Cook, Sanjay K Prasad³, Sanjay K Prasad⁴, Declan P. O'Regan⁵, Jolanda van der Velden, Karin J. H. Verweij¹, Mario Talajic², Guillaume Lettre², Yigal M. Pinto¹, Benjamin Meder¹⁰, Philippe Charron¹³, Rudolf A. de Boer⁸, Imke Christiaans⁸, Michelle Michels⁷, Arthur A.M. Wilde¹, Hugh Watkins⁶, Paul M. Matthews⁵, James S. Ware³, James S. Ware⁵, James S. Ware⁴, Connie R. Bezzina¹ - Show less +81 more•Institutions (16)

University of Amsterdam¹, Montreal Heart Institute², National Health Service³, National Institutes of Health⁴, Imperial College London⁵, University of Oxford⁶, Erasmus University Medical Center⁷, University Medical Center Groningen⁸, Utrecht University⁹, Heidelberg University¹⁰, Tohoku University¹¹, Francis Crick Institute¹², University of Paris¹³, Technische Universität München¹⁴, University of Western Ontario¹⁵, University College London¹⁶

25 Jan 2021-Nature Genetics

TL;DR: In this paper, the authors conducted genome-wide association studies and multi-trait analyses in hypertrophic (HCM) and dilated (DCM) cardiomyopathies.

...read moreread less

Abstract: The heart muscle diseases hypertrophic (HCM) and dilated (DCM) cardiomyopathies are leading causes of sudden death and heart failure in young, otherwise healthy, individuals. We conducted genome-wide association studies and multi-trait analyses in HCM (1,733 cases), DCM (5,521 cases) and nine left ventricular (LV) traits (19,260 UK Biobank participants with structurally normal hearts). We identified 16 loci associated with HCM, 13 with DCM and 23 with LV traits. We show strong genetic correlations between LV traits and cardiomyopathies, with opposing effects in HCM and DCM. Two-sample Mendelian randomization supports a causal association linking increased LV contractility with HCM risk. A polygenic risk score explains a significant portion of phenotypic variability in carriers of HCM-causing rare variants. Our findings thus provide evidence that polygenic risk score may account for variability in Mendelian diseases. More broadly, we provide insights into how genetic pathways may lead to distinct disorders through opposing genetic effects.

...read moreread less

Journal Article•DOI•

A compendium of uniformly processed human gene expression and splicing quantitative trait loci.

[...]

Nurlan Kerimov¹, James D. Hayhurst², Kateryna Peikova¹, Jonathan R. Manning², Peter Walter², Liis Kolberg¹, Marija Samoviča¹, Manoj Pandian Sakthivel², Ivan Kuzmin¹, Stephen J. Trevanion², Tony Burdett², Simon Jupp², Helen Parkinson², Irene Papatheodorou², Andrew D. Yates², Daniel R. Zerbino², Kaur Alasoo¹ - Show less +13 more•Institutions (2)

University of Tartu¹, European Bioinformatics Institute²

06 Sep 2021-Nature Genetics

TL;DR: The eQTL Catalogue as discussed by the authors is a set of gene expression quantitative trait locus (eQTL) studies published their summary statistics, which can be used to gain insight into complex human traits by downstream analyses, such as fine mapping and co-localization.

...read moreread less

Abstract: Many gene expression quantitative trait locus (eQTL) studies have published their summary statistics, which can be used to gain insight into complex human traits by downstream analyses, such as fine mapping and co-localization. However, technical differences between these datasets are a barrier to their widespread use. Consequently, target genes for most genome-wide association study (GWAS) signals have still not been identified. In the present study, we present the eQTL Catalogue ( https://www.ebi.ac.uk/eqtl ), a resource of quality-controlled, uniformly re-computed gene expression and splicing QTLs from 21 studies. We find that, for matching cell types and tissues, the eQTL effect sizes are highly reproducible between studies. Although most QTLs were shared between most bulk tissues, we identified a greater diversity of cell-type-specific QTLs from purified cell types, a subset of which also manifested as new disease co-localizations. Our summary statistics are freely available to enable the systematic interpretation of human GWAS associations across many cell types and tissues.

...read moreread less

Journal Article•DOI•

Integrating human brain proteomes with genome-wide association data implicates new proteins in Alzheimer’s disease pathogenesis

[...]

Aliza P. Wingo¹, Aliza P. Wingo², Yue Liu², Ekaterina S. Gerasimov², Jake Gockley³, Benjamin A. Logsdon³, Duc M. Duong², Eric B. Dammer², Chloe Robins², Thomas G. Beach⁴, Eric M. Reiman⁵, Michael P. Epstein², Philip L. De Jager⁶, James J. Lah², David A. Bennett⁷, Nicholas T. Seyfried², Allan I. Levey², Thomas S. Wingo² - Show less +14 more•Institutions (7)

Veterans Health Administration¹, Emory University², Sage Bionetworks³, Banner Health⁴, Arizona State University⁵, Columbia University Medical Center⁶, Rush University Medical Center⁷

28 Jan 2021-Nature Genetics

TL;DR: In this paper, a proteome-wide association study (PWAS) of AD was performed, followed by Mendelian randomization and colocalization analysis to identify loci that confer AD risk through their effects on brain protein abundance to provide new insights into AD pathogenesis.

...read moreread less

Abstract: Genome-wide association studies (GWAS) have identified many risk loci for Alzheimer's disease (AD)1,2, but how these loci confer AD risk is unclear. Here, we aimed to identify loci that confer AD risk through their effects on brain protein abundance to provide new insights into AD pathogenesis. To that end, we integrated AD GWAS results with human brain proteomes to perform a proteome-wide association study (PWAS) of AD, followed by Mendelian randomization and colocalization analysis. We identified 11 genes that are consistent with being causal in AD, acting via their cis-regulated brain protein abundance. Nine replicated in a confirmation PWAS and eight represent new AD risk genes not identified before by AD GWAS. Furthermore, we demonstrated that our results were independent of APOE e4. Together, our findings provide new insights into AD pathogenesis and promising targets for further mechanistic and therapeutic studies.

...read moreread less

Journal Article•DOI•

A high-quality genome assembly highlights rye genomic characteristics and agronomically important genes.

[...]

Guangwei Li¹, Lijian Wang¹, Jianping Yang¹, Hang He², Huaibing Jin¹, Xuming Li, Tianheng Ren³, Zhenglong Ren³, Feng Li⁴, Xue Han², Xiaoge Zhao⁴, Lingli Dong⁴, Yiwen Li⁴, Zhongping Song³, Ze-Hong Yan³, Nannan Zheng¹, Cuilan Shi¹, Zhaohui Wang¹, Shuling Yang¹, Zijun Xiong¹, Menglan Zhang¹, Guanghua Sun¹, Xu Zheng¹, Mingyue Gou¹, Changmian Ji, Junkai Du, Hongkun Zheng, Jaroslav Doležel⁵, Xing Wang Deng², Nils Stein⁶, Nils Stein⁷, Qinghua Yang¹, Kunpu Zhang⁴, Kunpu Zhang¹, Daowen Wang⁴, Daowen Wang¹ - Show less +32 more•Institutions (7)

Henan Agricultural University¹, Peking University², Sichuan Agricultural University³, Chinese Academy of Sciences⁴, Academy of Sciences of the Czech Republic⁵, University of Göttingen⁶, Leibniz Association⁷

18 Mar 2021-Nature Genetics

TL;DR: In this paper, the genome of Weining rye, an elite Chinese rye variety, was sequenced and the assembled contigs (7.74 Gb) accounted for 98.47% of the estimated genome size with 93.67% assigned to seven chromosomes.

...read moreread less

Abstract: Rye is a valuable food and forage crop, an important genetic resource for wheat and triticale improvement and an indispensable material for efficient comparative genomic studies in grasses. Here, we sequenced the genome of Weining rye, an elite Chinese rye variety. The assembled contigs (7.74 Gb) accounted for 98.47% of the estimated genome size (7.86 Gb), with 93.67% of the contigs (7.25 Gb) assigned to seven chromosomes. Repetitive elements constituted 90.31% of the assembled genome. Compared to previously sequenced Triticeae genomes, Daniela, Sumaya and Sumana retrotransposons showed strong expansion in rye. Further analyses of the Weining assembly shed new light on genome-wide gene duplications and their impact on starch biosynthesis genes, physical organization of complex prolamin loci, gene expression features underlying early heading trait and putative domestication-associated chromosomal regions and loci in rye. This genome sequence promises to accelerate genomic and breeding studies in rye and related cereal crops. A high-quality genome assembly of Weining rye sheds new light on gene duplications and their effects on starch biosynthesis genes, gene expression features underlying early heading trait and putative domestication-associated chromosomal regions.

...read moreread less

Journal Article•DOI•

Population-scale single-cell RNA-seq profiling across dopaminergic neuron differentiation

[...]

Julie Jerber¹, Daniel D Seaton², Anna S E Cuomo², Natsuhiko Kumasaka¹, James Haldane¹, Juliette Steer¹, Minal Patel¹, Daniel Pearce¹, Malin Andersson¹, Marc Jan Bonder², Ed Mountjoy, Maya Ghoussaini, Madeline A. Lancaster³, John C. Marioni¹, John C. Marioni², John C. Marioni⁴, Florian T. Merkle⁴, Daniel J. Gaffney¹, Oliver Stegle - Show less +15 more•Institutions (4)

Wellcome Trust Sanger Institute¹, European Bioinformatics Institute², Laboratory of Molecular Biology³, University of Cambridge⁴

04 Mar 2021-Nature Genetics

TL;DR: In this article, the authors used an efficient multiplexing strategy to differentiate 215 human induced pluripotent stem cell (iPSC) lines toward a midbrain neural fate, including dopaminergic neurons, and use single-cell RNA sequencing (scRNA-seq) to profile over 1 million cells across three differentiation time points.

...read moreread less

Abstract: Studying the function of common genetic variants in primary human tissues and during development is challenging. To address this, we use an efficient multiplexing strategy to differentiate 215 human induced pluripotent stem cell (iPSC) lines toward a midbrain neural fate, including dopaminergic neurons, and use single-cell RNA sequencing (scRNA-seq) to profile over 1 million cells across three differentiation time points. The proportion of neurons produced by each cell line is highly reproducible and is predictable by robust molecular markers expressed in pluripotent cells. Expression quantitative trait loci (eQTL) were characterized at different stages of neuronal development and in response to rotenone-induced oxidative stress. Of these, 1,284 eQTL colocalize with known neurological trait risk loci, and 46% are not found in the Genotype-Tissue Expression (GTEx) catalog. Our study illustrates how coupling scRNA-seq with long-term iPSC differentiation enables mechanistic studies of human trait-associated genetic variants in otherwise inaccessible cell states.

...read moreread less

Collapse