Showing papers by "Wellcome Trust Sanger Institute published in 2021"

PDF

Open Access

Journal Article•DOI•

[...]

Petr Danecek¹, James K. Bonfield¹, Jennifer Liddle¹, John Marshall², Valeriu Ohan¹, Martin O. Pollard¹, Andrew Whitwham¹, Thomas M. Keane³, Shane A. McCarthy¹, Robert L. Davies¹, Heng Li⁴ - Show less +7 more•Institutions (4)

Wellcome Trust Sanger Institute¹, University of Glasgow², European Bioinformatics Institute³, Harvard University⁴

01 Feb 2021-GigaScience

TL;DR: The SAMtools and BCFtools packages represent a unique collection of tools that have been used in numerous other software projects and countless genomic pipelines and are freely available on GitHub under the permissive MIT licence, free for both noncommercial and commercial use.

...read moreread less

Abstract: Background: SAMtools and BCFtools are widely used programs for processing and analysing high-throughput sequencing data. They include tools for file format conversion and manipulation, sorting, querying, statistics, variant calling, and effect analysis amongst other methods. Findings: The first version appeared online 12 years ago and has been maintained and further developed ever since, with many new features and improvements added over the years. The SAMtools and BCFtools packages represent a unique collection of tools that have been used in numerous other software projects and countless genomic pipelines. Conclusion: Both SAMtools and BCFtools are freely available on GitHub under the permissive MIT licence, free for both non-commercial and commercial use. Both packages have been installed >1 million times via Bioconda. The source code and documentation are available from https://www.htslib.org.

...read moreread less

2,448 citations

Journal Article•DOI•

SARS-CoV-2 variants, spike mutations and immune escape.

[...]

William T. Harvey¹, Alessandro M Carabelli², Ben Jackson³, Ravindra K. Gupta², E. Thomson⁴, E. Thomson⁵, Ewan M. Harrison⁵, Ewan M. Harrison², Catherine Ludden², Richard Reeve¹, Andrew Rambaut³, Sharon J. Peacock², David Robertson¹ - Show less +9 more•Institutions (5)

University of Glasgow¹, University of Cambridge², University of Edinburgh³, University of London⁴, Wellcome Trust Sanger Institute⁵

01 Jun 2021-Nature Reviews Microbiology

TL;DR: A review of the literature on mutations of the SARS-CoV-2 spike protein, the primary antigen, focusing on their impacts on antigenicity and contextualizing them in the protein structure is presented in this article.

...read moreread less

Abstract: Although most mutations in the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genome are expected to be either deleterious and swiftly purged or relatively neutral, a small proportion will affect functional properties and may alter infectivity, disease severity or interactions with host immunity. The emergence of SARS-CoV-2 in late 2019 was followed by a period of relative evolutionary stasis lasting about 11 months. Since late 2020, however, SARS-CoV-2 evolution has been characterized by the emergence of sets of mutations, in the context of ‘variants of concern’, that impact virus characteristics, including transmissibility and antigenicity, probably in response to the changing immune profile of the human population. There is emerging evidence of reduced neutralization of some SARS-CoV-2 variants by postvaccination serum; however, a greater understanding of correlates of protection is required to evaluate how this may impact vaccine effectiveness. Nonetheless, manufacturers are preparing platforms for a possible update of vaccine sequences, and it is crucial that surveillance of genetic and antigenic changes in the global virus population is done alongside experiments to elucidate the phenotypic impacts of mutations. In this Review, we summarize the literature on mutations of the SARS-CoV-2 spike protein, the primary antigen, focusing on their impacts on antigenicity and contextualizing them in the protein structure, and discuss them in the context of observed mutation frequencies in global sequence datasets. The evolution of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has been characterized by the emergence of mutations and so-called variants of concern that impact virus characteristics, including transmissibility and antigenicity. In this Review, members of the COVID-19 Genomics UK (COG-UK) Consortium and colleagues summarize mutations of the SARS-CoV-2 spike protein, focusing on their impacts on antigenicity and contextualizing them in the protein structure, and discuss them in the context of observed mutation frequencies in global sequence datasets.

...read moreread less

2,047 citations

Journal Article•DOI•

Assessing transmissibility of SARS-CoV-2 lineage B.1.1.7 in England.

[...]

Erik M. Volz¹, Swapnil Mishra¹, Meera Chand², Jeffrey C. Barrett³, Robert Johnson¹, Lily Geidelberg¹, Wes Hinsley¹, Daniel J. Laydon¹, Gavin Dabrera², Áine O'Toole⁴, Roberto Amato³, Manon Ragonnet-Cronin¹, Ian Harrison², Ben Jackson⁴, Cristina V. Ariani³, Olivia Boyd¹, Nicholas J. Loman², Nicholas J. Loman⁵, John T. McCrone⁴, Sónia Gonçalves³, David Jorgensen¹, Richard M. Myers², Verity Hill⁴, David K. Jackson³, Katy A. M. Gaythorpe¹, Natalie Groves², John Sillitoe³, Dominic P. Kwiatkowski³, Seth Flaxman¹, Oliver Ratmann¹, Samir Bhatt¹, Samir Bhatt⁶, Susan Hopkins², Axel Gandy¹, Andrew Rambaut⁴, Neil M. Ferguson¹ - Show less +32 more•Institutions (6)

Imperial College London¹, Public Health England², Wellcome Trust Sanger Institute³, University of Edinburgh⁴, University of Birmingham⁵, University of Copenhagen⁶

25 Mar 2021-Nature

TL;DR: In this paper, the authors show that changes in VOC frequency inferred from genetic data correspond closely to changes inferred by S gene target failures (SGTF) in community-based diagnostic PCR testing.

...read moreread less

Abstract: The SARS-CoV-2 lineage B.1.1.7, designated variant of concern (VOC) 202012/01 by Public Health England1, was first identified in the UK in late summer to early autumn 20202. Whole-genome SARS-CoV-2 sequence data collected from community-based diagnostic testing for COVID-19 show an extremely rapid expansion of the B.1.1.7 lineage during autumn 2020, suggesting that it has a selective advantage. Here we show that changes in VOC frequency inferred from genetic data correspond closely to changes inferred by S gene target failures (SGTF) in community-based diagnostic PCR testing. Analysis of trends in SGTF and non-SGTF case numbers in local areas across England shows that B.1.1.7 has higher transmissibility than non-VOC lineages, even if it has a different latent period or generation time. The SGTF data indicate a transient shift in the age composition of reported cases, with cases of B.1.1.7 including a larger share of under 20-year-olds than non-VOC cases. We estimated time-varying reproduction numbers for B.1.1.7 and co-circulating lineages using SGTF and genomic data. The best-supported models did not indicate a substantial difference in VOC transmissibility among different age groups, but all analyses agreed that B.1.1.7 has a substantial transmission advantage over other lineages, with a 50% to 100% higher reproduction number.

...read moreread less

827 citations

Journal Article•DOI•

Towards complete and error-free genome assemblies of all vertebrate species

[...]

Arang Rhie¹, Shane A. McCarthy², Shane A. McCarthy³, Olivier Fedrigo⁴, Joana Damas⁵, Giulio Formenti⁴, Sergey Koren¹, Marcela Uliano-Silva⁶, William Chow³, Arkarachai Fungtammasan, J. H. Kim⁷, Chul Hee Lee⁷, Byung June Ko⁷, Mark Chaisson⁸, Gregory Gedman⁴, Lindsey J. Cantin⁴, Françoise Thibaud-Nissen¹, Leanne Haggerty⁹, Iliana Bista², Iliana Bista³, Michelle Smith³, Bettina Haase⁴, Jacquelyn Mountcastle⁴, Sylke Winkler¹⁰, Sylke Winkler¹¹, Sadye Paez⁴, Jason T. Howard, Sonja C. Vernes¹¹, Sonja C. Vernes¹², Sonja C. Vernes¹³, Tanya M. Lama¹⁴, Frank Grützner¹⁵, Wesley C. Warren¹⁶, Christopher N. Balakrishnan¹⁷, Dave W Burt¹⁸, Jimin George¹⁹, Matthew T. Biegler⁴, David Iorns, Andrew Digby, Daryl Eason, Bruce C. Robertson²⁰, Taylor Edwards²¹, Mark Wilkinson²², George F. Turner²³, Axel Meyer²⁴, Andreas F. Kautt²⁴, Andreas F. Kautt²⁵, Paolo Franchini²⁴, H. William Detrich²⁶, Hannes Svardal²⁷, Hannes Svardal²⁸, Maximilian Wagner²⁹, Gavin J. P. Naylor³⁰, Martin Pippel¹¹, Milan Malinsky³¹, Milan Malinsky³, Mark Mooney, Maria Simbirsky, Brett T. Hannigan, Trevor Pesout³², Marlys L. Houck³³, Ann C Misuraca³³, Sarah B. Kingan³⁴, Richard Hall³⁴, Zev N. Kronenberg³⁴, Ivan Sović³⁴, Christopher Dunn³⁴, Zemin Ning³, Alex Hastie, Joyce V. Lee, Siddarth Selvaraj, Richard E. Green³², Nicholas H. Putnam, Ivo Gut³⁵, Jay Ghurye³⁶, Erik Garrison³², Ying Sims³, Joanna Collins³, Sarah Pelan³, James Torrance³, Alan Tracey³, Jonathan Wood³, Robel E. Dagnew⁸, Dengfeng Guan³⁷, Dengfeng Guan², Sarah E. London³⁸, David F. Clayton¹⁹, Claudio V. Mello³⁹, Samantha R. Friedrich³⁹, Peter V. Lovell³⁹, Ekaterina Osipova¹¹, Farooq O. Al-Ajli⁴⁰, Farooq O. Al-Ajli⁴¹, Simona Secomandi⁴², Heebal Kim⁷, Constantina Theofanopoulou⁴, Michael Hiller⁴³, Yang Zhou, Robert S. Harris⁴⁴, Kateryna D. Makova⁴⁴, Paul Medvedev⁴⁴, Jinna Hoffman¹, Patrick Masterson¹, Karen Clark¹, Fergal J. Martin⁹, Kevin L. Howe⁹, Paul Flicek⁹, Brian P. Walenz¹, Woori Kwak, Hiram Clawson³², Mark Diekhans³², Luis R Nassar³², Benedict Paten³², Robert H. S. Kraus¹¹, Robert H. S. Kraus²⁴, Andrew J. Crawford⁴⁵, M. Thomas P. Gilbert⁴⁶, M. Thomas P. Gilbert⁴⁷, Guojie Zhang, Byrappa Venkatesh⁴⁸, Robert W. Murphy⁴⁹, Klaus-Peter Koepfli⁵⁰, Beth Shapiro³², Beth Shapiro⁵¹, Warren E. Johnson⁵⁰, Warren E. Johnson⁵², Federica Di Palma⁵³, Tomas Marques-Bonet, Emma C. Teeling⁵⁴, Tandy Warnow⁵⁵, Jennifer A. Marshall Graves⁵⁶, Oliver A. Ryder³³, Oliver A. Ryder⁵⁷, David Haussler³², Stephen J. O'Brien⁵⁸, Jonas Korlach³⁴, Harris A. Lewin⁵, Kerstin Howe³, Eugene W. Myers¹¹, Eugene W. Myers¹⁰, Richard Durbin³, Richard Durbin², Adam M. Phillippy¹, Erich D. Jarvis⁴, Erich D. Jarvis⁵¹ - Show less +141 more•Institutions (58)

National Institutes of Health¹, University of Cambridge², Wellcome Trust Sanger Institute³, Rockefeller University⁴, University of California, Davis⁵, Leibniz Association⁶, Seoul National University⁷, University of Southern California⁸, European Bioinformatics Institute⁹, Dresden University of Technology¹⁰, Max Planck Society¹¹, Radboud University Nijmegen¹², University of St Andrews¹³, University of Massachusetts Amherst¹⁴, University of Adelaide¹⁵, University of Missouri¹⁶, East Carolina University¹⁷, University of Queensland¹⁸, Clemson University¹⁹, University of Otago²⁰, University of Arizona²¹, Natural History Museum²², Bangor University²³, University of Konstanz²⁴, Harvard University²⁵, Northeastern University²⁶, National Museum of Natural History²⁷, University of Antwerp²⁸, University of Graz²⁹, University of Florida³⁰, University of Basel³¹, University of California, Santa Cruz³², Zoological Society of San Diego³³, Pacific Biosciences³⁴, Pompeu Fabra University³⁵, University of Maryland, College Park³⁶, Harbin Institute of Technology³⁷, University of Chicago³⁸, Oregon Health & Science University³⁹, Monash University Malaysia Campus⁴⁰, Qatar Airways⁴¹, University of Milan⁴², Goethe University Frankfurt⁴³, Pennsylvania State University⁴⁴, University of Los Andes⁴⁵, University of Copenhagen⁴⁶, Norwegian University of Science and Technology⁴⁷, Agency for Science, Technology and Research⁴⁸, Royal Ontario Museum⁴⁹, Smithsonian Institution⁵⁰, Howard Hughes Medical Institute⁵¹, Walter Reed Army Institute of Research⁵², University of East Anglia⁵³, University College Dublin⁵⁴, University of Illinois at Urbana–Champaign⁵⁵, La Trobe University⁵⁶, University of California, San Diego⁵⁷, Nova Southeastern University⁵⁸

28 Apr 2021-Nature

TL;DR: The Vertebrate Genomes Project (VGP) as mentioned in this paper is an international effort to generate high quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences.

...read moreread less

Abstract: High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species1-4. To address this issue, the international Genome 10K (G10K) consortium5,6 has worked over a five-year period to evaluate and develop cost-effective methods for assembling highly accurate and nearly complete reference genomes. Here we present lessons learned from generating assemblies for 16 species that represent six major vertebrate lineages. We confirm that long-read sequencing technologies are essential for maximizing genome quality, and that unresolved complex repeats and haplotype heterozygosity are major sources of assembly error when not handled correctly. Our assemblies correct substantial errors, add missing sequence in some of the best historical reference genomes, and reveal biological discoveries. These include the identification of many false gene duplications, increases in gene sizes, chromosome rearrangements that are specific to lineages, a repeated independent chromosome breakpoint in bat genomes, and a canonical GC-rich pattern in protein-coding genes and their regulatory regions. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences.

...read moreread less

647 citations

Posted Content•DOI•

Transmission of SARS-CoV-2 Lineage B.1.1.7 in England: Insights from linking epidemiological and genetic data

[...]

Erik M. Volz¹, Swapnil Mishra¹, Meera Chand², Jeffrey C. Barrett³, Robert Johnson¹, Lily Geidelberg¹, Wes Hinsley¹, Daniel J Laydon¹, Gavin Dabrera², Áine O'Toole⁴, Roberto Amato³, Manon Ragonnet-Cronin¹, Ian Harrison², Ben Jackson⁴, Cristina V. Ariani³, Olivia Boyd¹, Nicholas J. Loman², John T. McCrone⁴, Sónia Gonçalves³, David Jorgensen¹, Richard M. Myers², Verity Hill⁴, David K. Jackson³, Katy A. M. Gaythorpe¹, Natalie Groves², John Sillitoe³, Dominic P. Kwiatkowski³, Cog-Uk, Seth Flaxman¹, Oliver Ratman¹, Samir Bhatt¹, Susan Hopkins², Axel Gandy¹, Andrew Rambaut⁴, Neil M. Ferguson¹ - Show less +31 more•Institutions (4)

Imperial College London¹, Public Health England², Wellcome Trust Sanger Institute³, University of Edinburgh⁴

04 Jan 2021-medRxiv

TL;DR: The SARS-CoV-2 lineage B.7, now designated Variant of Concern 202012/01 (VOC) by Public Health England, originated in the UK in late Summer to early Autumn 2020 as mentioned in this paper.

...read moreread less

Abstract: The SARS-CoV-2 lineage B.1.1.7, now designated Variant of Concern 202012/01 (VOC) by Public Health England, originated in the UK in late Summer to early Autumn 2020. We examine epidemiological evidence for this VOC having a transmission advantage from several perspectives. First, whole genome sequence data collected from community-based diagnostic testing provides an indication of changing prevalence of different genetic variants through time. Phylodynamic modelling additionally indicates that genetic diversity of this lineage has changed in a manner consistent with exponential growth. Second, we find that changes in VOC frequency inferred from genetic data correspond closely to changes inferred by S-gene target failures (SGTF) in community-based diagnostic PCR testing. Third, we examine growth trends in SGTF and non-SGTF case numbers at local area level across England, and show that the VOC has higher transmissibility than non-VOC lineages, even if the VOC has a different latent period or generation time. Available SGTF data indicate a shift in the age composition of reported cases, with a larger share of under 20 year olds among reported VOC than non-VOC cases. Fourth, we assess the association of VOC frequency with independent estimates of the overall SARS-CoV-2 reproduction number through time. Finally, we fit a semi-mechanistic model directly to local VOC and non-VOC case incidence to estimate the reproduction numbers over time for each. There is a consensus among all analyses that the VOC has a substantial transmission advantage, with the estimated difference in reproduction numbers between VOC and non-VOC ranging between 0.4 and 0.7, and the ratio of reproduction numbers varying between 1.4 and 1.8. We note that these estimates of transmission advantage apply to a period where high levels of social distancing were in place in England; extrapolation to other transmission contexts therefore requires caution.

...read moreread less

547 citations

Journal Article•DOI•

Efficacy of ChAdOx1 nCoV-19 (AZD1222) vaccine against SARS-CoV-2 variant of concern 202012/01 (B.1.1.7): an exploratory analysis of a randomised controlled trial.

[...]

Katherine R. W. Emary¹, Tanya Golubchik¹, Parvinder K. Aley¹, Cristina V. Ariani², Brian Angus¹, S Bibi¹, Beth Blane³, David Bonsall¹, P Cicconi¹, Sue Charlton⁴, Elizabeth A. Clutterbuck¹, Andrea M. Collins⁵, Tony Cox, Thomas C. Darton⁶, Christina Dold¹, Alexander D. Douglas¹, Christopher J A Duncan⁷, Christopher J A Duncan⁸, Katie J. Ewer¹, Amy Flaxman¹, Saul N. Faust⁹, Saul N. Faust¹⁰, Daniela M. Ferreira⁵, Shuo Feng¹, Adam Finn¹¹, P M Folegatti¹, Michelle Fuskova¹, Eva P. Galiza¹², Anna L. Goodman¹³, Anna L. Goodman¹⁴, Catherine M. Green¹, Christopher A Green¹⁵, Melanie Greenland¹, Bassam Hallis⁴, Paul T. Heath¹², Jodie Hay¹⁶, Helen Hill⁵, D Jenkin¹, Simon Kerridge¹, Rajeka Lazarus¹⁷, Vincenzo Libri¹⁸, Patrick J. Lillie¹⁹, Catherine Ludden³, N G Marchevsky¹, Angela M. Minassian¹, Alastair McGregor²⁰, Yama F Mujadidi¹, Daniel J. Phillips¹, Emma Plested¹, Katrina M Pollock, Hannah Robinson¹, Andrew Smith²¹, R Song¹, Matthew D. Snape¹, Rebecca K. Sutherland²², E. Thomson¹⁶, E. Thomson¹⁷, Mark Toshner³, David P. J. Turner²³, David P. J. Turner²⁴, Johan Vekemans²⁵, Tonya Villafana²⁵, Christopher Williams²⁶, Christopher Williams²⁷, Adrian V. S. Hill¹, Teresa Lambe¹, Sarah C. Gilbert¹, Merryn Voysey¹, M N Ramasamy¹, Andrew J. Pollard¹ - Show less +66 more•Institutions (27)

University of Oxford¹, Wellcome Trust Sanger Institute², University of Cambridge³, Public Health England⁴, Liverpool School of Tropical Medicine⁵, University of Sheffield⁶, Newcastle upon Tyne Hospitals NHS Foundation Trust⁷, Newcastle University⁸, University Hospital Southampton NHS Foundation Trust⁹, University of Southampton¹⁰, University Hospitals Bristol NHS Foundation Trust¹¹, St George's, University of London¹², Guy's and St Thomas' NHS Foundation Trust¹³, University College London¹⁴, University Hospitals Birmingham NHS Foundation Trust¹⁵, University of Glasgow¹⁶, North Bristol NHS Trust¹⁷, University College Hospital¹⁸, University of Hull¹⁹, Northwest University (China)²⁰, Glasgow Dental Hospital and School²¹, Western General Hospital²², Nottingham University Hospitals NHS Trust²³, University of Nottingham²⁴, AstraZeneca²⁵, Aneurin Bevan University Health Board²⁶, Cardiff University²⁷

10 Apr 2021-The Lancet

TL;DR: A post-hoc analysis of the efficacy of the adenoviral vector vaccine, ChAdOx1 nCoV-19 (AZD1222), against B.1.7, emerged as the dominant cause of COVID-19 disease in the UK from November, 2020 as discussed by the authors.

...read moreread less

521 citations

Journal Article•DOI•

A unified catalog of 204,938 reference genomes from the human gut microbiome.

[...]

Alexandre Almeida¹, Alexandre Almeida², Stephen Nayfach³, Stephen Nayfach⁴, Miguel Boland¹, Francesco Strozzi, Martin Beracochea¹, Zhou Jason Shi⁵, Katherine S. Pollard, Ekaterina A. Sakharova¹, Donovan H. Parks⁶, Philip Hugenholtz⁶, Nicola Segata⁷, Nikos C. Kyrpides⁴, Nikos C. Kyrpides³, Robert D. Finn¹ - Show less +12 more•Institutions (7)

European Bioinformatics Institute¹, Wellcome Trust Sanger Institute², United States Department of Energy³, Lawrence Berkeley National Laboratory⁴, Gladstone Institutes⁵, University of Queensland⁶, University of Trento⁷

01 Jan 2021-Nature Biotechnology

TL;DR: The Unified Human Gastrointestinal Genome (UHGG) collection, comprising 204,938 nonredundant genomes from 4,644 gut prokaryotes, is presented, providing comprehensive resources for microbiome researchers.

...read moreread less

Abstract: Comprehensive, high-quality reference genomes are required for functional characterization and taxonomic assignment of the human gut microbiota. We present the Unified Human Gastrointestinal Genome (UHGG) collection, comprising 204,938 nonredundant genomes from 4,644 gut prokaryotes. These genomes encode >170 million protein sequences, which we collated in the Unified Human Gastrointestinal Protein (UHGP) catalog. The UHGP more than doubles the number of gut proteins in comparison to those present in the Integrated Gene Catalog. More than 70% of the UHGG species lack cultured representatives, and 40% of the UHGP lack functional annotations. Intraspecies genomic variation analyses revealed a large reservoir of accessory genes and single-nucleotide variants, many of which are specific to individual human populations. The UHGG and UHGP collections will enable studies linking genotypes to phenotypes in the human gut microbiome.

...read moreread less

485 citations

Journal Article•DOI•

SARS-CoV-2 infection of the oral cavity and saliva.

[...]

Ni Huang¹, Paola Perez², Takafumi Kato³, Yu Mikami³, Kenichi Okuda³, Rodney C. Gilmore³, Cecilia Domínguez Conde¹, Billel Gasmi², Sydney Stein², Margaret Beach², Eileen Pelayo², José O. Maldonado², Bernard A. P. Lafont², Shyh-Ing Jang², Nadia Nasir², Ricardo J. Padilla³, Valerie A. Murrah³, Robert Maile³, William Lovell³, Shannon M. Wallet³, Natalie M. Bowman³, Suzanne L Meinig³, Matthew C. Wolfgang³, Saibyasachi N. Choudhury⁴, Mark Novotny⁴, Brian D. Aevermann⁴, Richard H. Scheuermann⁴, Gabrielle Cannon³, Carlton W Anderson³, Rhianna E. Lee³, Julie T. Marchesan³, Mandy Bush³, Marcelo Freire⁴, Adam J. Kimple³, Daniel Herr⁵, Joseph Rabin⁵, Alison Grazioli², Sanchita Das², Benjamin N French², Thomas Pranzatelli², John A. Chiorini², David E. Kleiner², Stefania Pittaluga², Stephen M. Hewitt², Peter D. Burbelo², Daniel S. Chertow², Karen M. Frank², Janice Lee³, Richard C. Boucher³, Sarah A. Teichmann¹ - Show less +46 more•Institutions (5)

Wellcome Trust Sanger Institute¹, National Institutes of Health², University of North Carolina at Chapel Hill³, J. Craig Venter Institute⁴, University of Maryland, Baltimore⁵

25 Mar 2021-Nature Medicine

TL;DR: In this article, the authors generated and analyzed two single-cell RNA sequencing datasets of the human minor salivary glands and gingiva (9 samples, 13,824 cells), identifying 50 cell clusters.

...read moreread less

Abstract: Despite signs of infection-including taste loss, dry mouth and mucosal lesions such as ulcerations, enanthema and macules-the involvement of the oral cavity in coronavirus disease 2019 (COVID-19) is poorly understood. To address this, we generated and analyzed two single-cell RNA sequencing datasets of the human minor salivary glands and gingiva (9 samples, 13,824 cells), identifying 50 cell clusters. Using integrated cell normalization and annotation, we classified 34 unique cell subpopulations between glands and gingiva. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) viral entry factors such as ACE2 and TMPRSS members were broadly enriched in epithelial cells of the glands and oral mucosae. Using orthogonal RNA and protein expression assessments, we confirmed SARS-CoV-2 infection in the glands and mucosae. Saliva from SARS-CoV-2-infected individuals harbored epithelial cells exhibiting ACE2 and TMPRSS expression and sustained SARS-CoV-2 infection. Acellular and cellular salivary fractions from asymptomatic individuals were found to transmit SARS-CoV-2 ex vivo. Matched nasopharyngeal and saliva samples displayed distinct viral shedding dynamics, and salivary viral burden correlated with COVID-19 symptoms, including taste loss. Upon recovery, this asymptomatic cohort exhibited sustained salivary IgG antibodies against SARS-CoV-2. Collectively, these data show that the oral cavity is an important site for SARS-CoV-2 infection and implicate saliva as a potential route of SARS-CoV-2 transmission.

...read moreread less

417 citations

Journal Article•DOI•

Significantly improving the quality of genome assemblies through curation

[...]

Kerstin Howe¹, William Chow¹, Joanna Collins¹, Sarah Pelan¹, Damon-Lee Pointon¹, Ying Sims¹, James Torrance¹, Alan Tracey¹, Jonathan Wood¹ - Show less +5 more•Institutions (1)

Wellcome Trust Sanger Institute¹

09 Jan 2021-GigaScience

TL;DR: In this paper, a tried and tested approach for genome curation using gEVAL, the genome evaluation browser, is described and recommended for assembly curation in a GEVAL-independent context to facilitate the uptake of genome curations in the wider community.

...read moreread less

Abstract: Genome sequence assemblies provide the basis for our understanding of biology. Generating error-free assemblies is therefore the ultimate, but sadly still unachieved goal of a multitude of research projects. Despite the ever-advancing improvements in data generation, assembly algorithms and pipelines, no automated approach has so far reliably generated near error-free genome assemblies for eukaryotes. Whilst working towards improved datasets and fully automated pipelines, assembly evaluation and curation is actively used to bridge this shortcoming and significantly reduce the number of assembly errors. In addition to this increase in product value, the insights gained from assembly curation are fed back into the automated assembly strategy and contribute to notable improvements in genome assembly quality. We describe our tried and tested approach for assembly curation using gEVAL, the genome evaluation browser. We outline the procedures applied to genome curation using gEVAL and also our recommendations for assembly curation in a gEVAL-independent context to facilitate the uptake of genome curation in the wider community.

...read moreread less

373 citations

Journal Article•DOI•

Small-molecule inhibition of METTL3 as a strategy against myeloid leukaemia.

[...]

Eliza Yankova¹, Eliza Yankova², Wesley Blackaby, Mark Albertella, Justyna Rak², Justyna Rak¹, Etienne De Braekeleer², Etienne De Braekeleer¹, Georgia Tsagkogeorga², Ewa S. Pilka, Demetrios Aspris¹, Dan Leggate, Alan G. Hendrick, Natalie A. Webster, Byron Andrews, Richard Fosbeary, Patrick Guest, Nerea Irigoyen², Maria Eleftheriou², Malgorzata Gozdecka¹, João Lopes Dias², Andrew J. Bannister², Binje Vick, Irmela Jeremias³, George S. Vassiliou², George S. Vassiliou¹, Oliver Rausch, Konstantinos Tzelepis, Tony Kouzarides² - Show less +25 more•Institutions (3)

Wellcome Trust Sanger Institute¹, University of Cambridge², Ludwig Maximilian University of Munich³

26 Apr 2021-Nature

TL;DR: In this article, a first-in-class catalytic inhibitor of METTL3 was identified and characterized, and a crystal structure of STM2457 in complex with METTL 3 and METTL14 was presented.

...read moreread less

Abstract: N6-methyladenosine (m6A) is an abundant internal RNA modification1,2 that is catalysed predominantly by the METTL3–METTL14 methyltransferase complex3,4. The m6A methyltransferase METTL3 has been linked to the initiation and maintenance of acute myeloid leukaemia (AML), but the potential of therapeutic applications targeting this enzyme remains unknown5–7. Here we present the identification and characterization of STM2457, a highly potent and selective first-in-class catalytic inhibitor of METTL3, and a crystal structure of STM2457 in complex with METTL3–METTL14. Treatment of tumours with STM2457 leads to reduced AML growth and an increase in differentiation and apoptosis. These cellular effects are accompanied by selective reduction of m6A levels on known leukaemogenic mRNAs and a decrease in their expression consistent with a translational defect. We demonstrate that pharmacological inhibition of METTL3 in vivo leads to impaired engraftment and prolonged survival in various mouse models of AML, specifically targeting key stem cell subpopulations of AML. Collectively, these results reveal the inhibition of METTL3 as a potential therapeutic strategy against AML, and provide proof of concept that the targeting of RNA-modifying enzymes represents a promising avenue for anticancer therapy. Treatment with a specific inhibitor of the N6-methyladenosine methyltransferase METTL3 leads to reduced growth of cancer cells, indicating the potential of approaches targeting RNA-modifying enzymes for anticancer therapy.

...read moreread less

362 citations

Journal Article•DOI•

Large-scale cis- and trans-eQTL analyses identify thousands of genetic loci and polygenic scores that regulate blood gene expression

[...]

Urmo Võsa¹, Annique Claringbould², Annique Claringbould³, Harm-Jan Westra¹, Marc Jan Bonder¹, Patrick Deelen, Biao Zeng⁴, Holger Kirsten⁵, Ashis Saha⁶, Roman Kreuzhuber⁷, Roman Kreuzhuber², Roman Kreuzhuber⁸, Seyhan Yazar⁹, Harm Brugge¹, Roy Oelen¹, Dylan H. de Vries¹, Monique G. P. van der Wijst¹, Silva Kasela¹⁰, Natalia Pervjakova¹⁰, Isabel Alves¹¹, Marie-Julie Favé¹¹, Mawusse Agbessi¹¹, Mark W. Christiansen¹², Rick Jansen¹³, Ilkka Seppälä, Lin Tong¹⁴, Alexander Teumer¹⁵, Katharina Schramm¹⁶, Gibran Hemani¹⁷, Joost Verlouw¹⁸, Hanieh Yaghootkar¹⁹, Hanieh Yaghootkar²⁰, Hanieh Yaghootkar²¹, Reyhan Sönmez Flitman²², Reyhan Sönmez Flitman²³, Andrew A. Brown²⁴, Andrew A. Brown²⁵, Viktorija Kukushkina¹⁰, Anette Kalnapenkis¹⁰, Sina Rüeger²², Eleonora Porcu²², Jaanika Kronberg¹⁰, Johannes Kettunen, Bernett Lee²⁶, Futao Zhang²⁷, Ting Qi²⁷, Jose Alquicira Hernandez⁹, Wibowo Arindrarto²⁸, Frank Beutner⁵, Peter A C 't Hoen²⁹, Joyce B. J. van Meurs¹⁸, Jenny van Dongen¹³, Maarten van Iterson²⁸, Morris A. Swertz, Julia Dmitrieva³⁰, Mahmoud Elansary³⁰, Benjamin P. Fairfax³¹, Michel Georges³⁰, Bastiaan T. Heijmans²⁸, Alex W. Hewitt³², Mika Kähönen, Yungil Kim⁶, Yungil Kim³³, Julian C. Knight³¹, Peter Kovacs⁵, Knut Krohn⁵, Shuang Li¹, Markus Loeffler⁵, Urko M. Marigorta⁴, Urko M. Marigorta³⁴, Hailang Mei²⁸, Yukihide Momozawa³⁰, Martina Müller-Nurasyid¹⁶, Matthias Nauck¹⁵, Michel G. Nivard³⁵, Brenda W.J.H. Penninx¹³, Jonathan K. Pritchard³⁶, Olli T. Raitakari³⁷, Olli T. Raitakari³⁸, Olaf Rötzschke²⁶, Eline Slagboom²⁸, Coen D.A. Stehouwer³⁹, Michael Stumvoll⁵, Patrick F. Sullivan⁴⁰, Joachim Thiery⁵, Anke Tönjes⁵, Jan H. Veldink⁴¹, Uwe Völker¹⁵, Robert Warmerdam¹, Cisca Wijmenga¹, Morris Swertz, Anand Kumar Andiappan²⁶, Grant W. Montgomery²⁷, Samuli Ripatti⁴², Markus Perola⁴³, Zoltán Kutalik²², Emmanouil T. Dermitzakis²⁴, Emmanouil T. Dermitzakis²³, Sven Bergmann²³, Sven Bergmann²², Timothy M. Frayling²⁰, Holger Prokisch⁴⁴, Habibul Ahsan¹⁴, Brandon L. Pierce¹⁴, Terho Lehtimäki, Dorret I. Boomsma¹³, Bruce M. Psaty¹², Sina A. Gharib¹², Philip Awadalla¹¹, Lili Milani¹⁰, Willem H. Ouwehand⁴⁵, Willem H. Ouwehand⁷, Willem H. Ouwehand⁸, Kate Downes⁸, Kate Downes⁷, Oliver Stegle², Oliver Stegle⁴⁶, Alexis Battle⁶, Peter M. Visscher²⁷, Jian Yang⁴⁷, Jian Yang²⁷, Markus Scholz⁵, Joseph E. Powell⁹, Joseph E. Powell⁴⁸, Greg Gibson⁴, Tõnu Esko¹⁰, Lude Franke¹ - Show less +123 more•Institutions (48)

University Medical Center Groningen¹, European Bioinformatics Institute², Netherlands Cancer Institute³, Georgia Institute of Technology⁴, Leipzig University⁵, Johns Hopkins University⁶, NHS Blood and Transplant⁷, University of Cambridge⁸, Garvan Institute of Medical Research⁹, University of Tartu¹⁰, Ontario Institute for Cancer Research¹¹, University of Washington¹², Public Health Research Institute¹³, University of Chicago¹⁴, Greifswald University Hospital¹⁵, Ludwig Maximilian University of Munich¹⁶, University of Bristol¹⁷, Erasmus University Rotterdam¹⁸, Luleå University of Technology¹⁹, Royal Devon and Exeter Hospital²⁰, University of Westminster²¹, University of Lausanne²², Swiss Institute of Bioinformatics²³, University of Geneva²⁴, University of Dundee²⁵, Agency for Science, Technology and Research²⁶, University of Queensland²⁷, Leiden University Medical Center²⁸, Radboud University Nijmegen²⁹, University of Liège³⁰, University of Oxford³¹, Menzies Research Institute³², Icahn School of Medicine at Mount Sinai³³, Ikerbasque³⁴, VU University Amsterdam³⁵, Stanford University³⁶, University of Turku³⁷, Turku University Hospital³⁸, Maastricht University³⁹, Karolinska Institutet⁴⁰, Utrecht University⁴¹, University of Helsinki⁴², National Institutes of Health⁴³, Technische Universität München⁴⁴, Wellcome Trust Sanger Institute⁴⁵, German Cancer Research Center⁴⁶, Westlake University⁴⁷, University of New South Wales⁴⁸

02 Sep 2021-Nature Genetics

TL;DR: In this article, the authors performed cis-and trans-expression quantitative trait locus (eQTL) analyses using blood-derived expression from 31,684 individuals through the eQTLGen Consortium.

...read moreread less

Abstract: Trait-associated genetic variants affect complex phenotypes primarily via regulatory mechanisms on the transcriptome. To investigate the genetics of gene expression, we performed cis- and trans-expression quantitative trait locus (eQTL) analyses using blood-derived expression from 31,684 individuals through the eQTLGen Consortium. We detected cis-eQTL for 88% of genes, and these were replicable in numerous tissues. Distal trans-eQTL (detected for 37% of 10,317 trait-associated variants tested) showed lower replication rates, partially due to low replication power and confounding by cell type composition. However, replication analyses in single-cell RNA-seq data prioritized intracellular trans-eQTL. Trans-eQTL exerted their effects via several mechanisms, primarily through regulation by transcription factors. Expression of 13% of the genes correlated with polygenic scores for 1,263 phenotypes, pinpointing potential drivers for those traits. In summary, this work represents a large eQTL resource, and its results serve as a starting point for in-depth interpretation of complex phenotypes.

...read moreread less

Journal Article•DOI•

Single-cell multi-omics analysis of the immune response in COVID-19.

[...]

Emily Stephenson¹, Gary Reynolds¹, Rachel A. Botting¹, Fernando J Calero-Nieto², Michael D Morgan², Michael D Morgan³, Zewen K. Tuong², Zewen K. Tuong⁴, Karsten Bach², Karsten Bach³, Waradon Sungnak⁴, Kaylee B Worlock⁵, Masahiro Yoshida⁵, Natsuhiko Kumasaka⁴, Katarzyna D. Kania², Justin Engelbert¹, Bayanne Olabi¹, Jarmila Stremenova Spegarova¹, Nicola K. Wilson², Nicole Mende², Laura Jardine¹, Louis C.S. Gardner¹, Issac Goh¹, Dave Horsfall¹, Jim McGrath¹, Simone Webb¹, Michael W. Mather¹, Rik G.H. Lindeboom⁴, Emma Dann⁴, Ni Huang⁴, Krzysztof Polanski⁴, Elena Prigmore⁴, Florian Gothe¹, Florian Gothe⁶, Jonathan M. Scott¹, Rebecca Payne¹, Kenneth F Baker¹, Kenneth F Baker⁷, Aidan T Hanrath⁷, Aidan T Hanrath¹, Ina Schim van der Loeff¹, Andrew Barr⁷, Amada Sanchez-Gonzalez⁷, Laura Bergamaschi², Federica Mescia², Josephine Barnes⁵, Eliz Kilich⁸, Angus de Wilton⁸, A Saigal⁹, Aarash Saleh⁹, Sam M. Janes⁸, Sam M. Janes⁵, Claire Smith¹⁰, Nusayhah Gopee¹, Caroline Wilson¹, Caroline Wilson¹¹, Paul Coupland², Jonathan Coxhead¹ - Show less +54 more•Institutions (11)

Newcastle University¹, University of Cambridge², European Bioinformatics Institute³, Wellcome Trust Sanger Institute⁴, University College London⁵, Ludwig Maximilian University of Munich⁶, Newcastle upon Tyne Hospitals NHS Foundation Trust⁷, University College London Hospitals NHS Foundation Trust⁸, Royal Free Hospital⁹, UCL Institute of Child Health¹⁰, Harvard University¹¹

20 Apr 2021-Nature Medicine

TL;DR: In this article, the authors performed single-cell transcriptome, surface proteome and T and B lymphocyte antigen receptor analyses of over 780,000 peripheral blood mononuclear cells from a cross-sectional cohort of 130 patients with varying severities of COVID-19.

...read moreread less

Abstract: Analysis of human blood immune cells provides insights into the coordinated response to viral infections such as severe acute respiratory syndrome coronavirus 2, which causes coronavirus disease 2019 (COVID-19). We performed single-cell transcriptome, surface proteome and T and B lymphocyte antigen receptor analyses of over 780,000 peripheral blood mononuclear cells from a cross-sectional cohort of 130 patients with varying severities of COVID-19. We identified expansion of nonclassical monocytes expressing complement transcripts (CD16+C1QA/B/C+) that sequester platelets and were predicted to replenish the alveolar macrophage pool in COVID-19. Early, uncommitted CD34+ hematopoietic stem/progenitor cells were primed toward megakaryopoiesis, accompanied by expanded megakaryocyte-committed progenitors and increased platelet activation. Clonally expanded CD8+ T cells and an increased ratio of CD8+ effector T cells to effector memory T cells characterized severe disease, while circulating follicular helper T cells accompanied mild disease. We observed a relative loss of IgA2 in symptomatic disease despite an overall expansion of plasmablasts and plasma cells. Our study highlights the coordinated immune response that contributes to COVID-19 pathogenesis and reveals discrete cellular components that can be targeted for therapy.

...read moreread less

Journal Article•DOI•

Genome-wide association studies

[...]

Emil Uffelmann¹, Qin Qin Huang², Nchangwi Syntia Munung³, Jantina de Vries³, Yukinori Okada⁴, Alicia R. Martin⁵, Alicia R. Martin⁶, Hilary C. Martin², Tuuli Lappalainen⁷, Tuuli Lappalainen⁸, Danielle Posthuma¹ - Show less +7 more•Institutions (8)

VU University Amsterdam¹, Wellcome Trust Sanger Institute², University of Cape Town³, Osaka University⁴, Harvard University⁵, Broad Institute⁶, Columbia University⁷, Royal Institute of Technology⁸

26 Aug 2021

TL;DR: This Primer provides an introduction to genome-wide association studies (GWAS), techniques for deriving functional inferences from the results and applications of GWAS in understanding disease risk and trait architecture, and discusses important ethical considerations when considering GWAS populations and data.

...read moreread less

Abstract: Genome-wide association studies (GWAS) test hundreds of thousands of genetic variants across many genomes to find those statistically associated with a specific trait or disease. This methodology has generated a myriad of robust associations for a range of traits and diseases, and the number of associated variants is expected to grow steadily as GWAS sample sizes increase. GWAS results have a range of applications, such as gaining insight into a phenotype’s underlying biology, estimating its heritability, calculating genetic correlations, making clinical risk predictions, informing drug development programmes and inferring potential causal relationships between risk factors and health outcomes. In this Primer, we provide the reader with an introduction to GWAS, explaining their statistical basis and how they are conducted, describe state-of-the art approaches and discuss limitations and challenges, concluding with an overview of the current and future applications for GWAS results. Uffelmann et al. describe the key considerations and best practices for conducting genome-wide association studies (GWAS), techniques for deriving functional inferences from the results and applications of GWAS in understanding disease risk and trait architecture. The Primer also provides information on the best practices for data sharing and discusses important ethical considerations when considering GWAS populations and data.

...read moreread less

Journal Article•DOI•

Massive expansion of human gut bacteriophage diversity

[...]

Luis F. Camarillo-Guerrero¹, Alexandre Almeida², Guillermo Rangel-Pineros³, Robert D. Finn², Trevor D. Lawley¹ - Show less +1 more•Institutions (3)

Wellcome Trust Sanger Institute¹, European Bioinformatics Institute², University of Los Andes³

18 Feb 2021-Cell

TL;DR: The Gut Phage Database as discussed by the authors is a collection of ∼142,000 non-redundant viral genomes (>10 kb) obtained by mining a dataset of 28,060 globally distributed human gut metagenomes and 2,898 reference genomes of cultured gut bacteria.

...read moreread less

Journal Article•DOI•

Open Targets Platform: supporting systematic drug-target identification and prioritisation.

[...]

David Ochoa¹, Andrew Hercules¹, Miguel Carmona¹, Daniel Suveges¹, Asier Gonzalez-Uriarte¹, Cinzia Malangone¹, Alfredo Miranda¹, Luca Fumis¹, Denise Carvalho-Silva¹, Michaela Spitzer¹, Jarrod Baker¹, Javier Ferrer¹, Arwa Bin Raies¹, Olesya Razuvayevskaya¹, Adam Faulconbridge¹, Eirini Petsalaki¹, Prudence Mutowo², Sandra Machlitt-Northen², Gareth Peat¹, Elaine McAuley¹, Chuang Kee Ong¹, Edward Mountjoy³, Maya Ghoussaini³, Andrea Pierleoni¹, Eliseo Papa⁴, Miguel Pignatelli¹, Gautier Koscielny², Mohd Anisul Karim³, Jeremy Schwartzentruber³, David G. Hulcoop², Ian Dunham¹, Ian Dunham³, Ellen M. McDonagh¹ - Show less +29 more•Institutions (4)

European Bioinformatics Institute¹, GlaxoSmithKline², Wellcome Trust Sanger Institute³, Biogen Idec⁴

08 Jan 2021-Nucleic Acids Research

TL;DR: To aid the prioritisation of targets and inform on the potential impact of modulating a given target, evaluation of post-marketing adverse drug reactions and new curated information on target tractability and safety are added.

...read moreread less

Abstract: The Open Targets Platform (https://www.targetvalidation.org/) provides users with a queryable knowledgebase and user interface to aid systematic target identification and prioritisation for drug discovery based upon underlying evidence. It is publicly available and the underlying code is open source. Since our last update two years ago, we have had 10 releases to maintain and continuously improve evidence for target-disease relationships from 20 different data sources. In addition, we have integrated new evidence from key datasets, including prioritised targets identified from genome-wide CRISPR knockout screens in 300 cancer models (Project Score), and GWAS/UK BioBank statistical genetic analysis evidence from the Open Targets Genetics Portal. We have evolved our evidence scoring framework to improve target identification. To aid the prioritisation of targets and inform on the potential impact of modulating a given target, we have added evaluation of post-marketing adverse drug reactions and new curated information on target tractability and safety. We have also developed the user interface and backend technologies to improve performance and usability. In this article, we describe the latest enhancements to the Platform, to address the fundamental challenge that developing effective and safe drugs is difficult and expensive.

...read moreread less

Journal Article•DOI•

Open Targets Genetics: systematic identification of trait-associated genes using large-scale genetics and functional genomics.

[...]

Maya Ghoussaini¹, Edward Mountjoy¹, Miguel Carmona², Gareth Peat², Ellen M. Schmidt¹, Andrew Hercules², Luca Fumis², Alfredo Miranda², Denise Carvalho-Silva², Annalisa Buniello², Tony Burdett², James D. Hayhurst², Jarrod Baker², Javier Ferrer², Asier Gonzalez-Uriarte², Simon Jupp², Mohd Anisul Karim¹, Gautier Koscielny³, Sandra Machlitt-Northen³, Cinzia Malangone², Zoë May Pendlington², Paola Roncaglia², Daniel Suveges², Daniel Wright¹, Olga Vrousgou², Eliseo Papa⁴, Helen Parkinson², Jacqueline A. L. MacArthur², John A. Todd⁵, Jeffrey C. Barrett¹, Jeremy Schwartzentruber¹, David G. Hulcoop³, David Ochoa², Ellen M. McDonagh¹, Ellen M. McDonagh², Ian Dunham², Ian Dunham¹ - Show less +33 more•Institutions (5)

Wellcome Trust Sanger Institute¹, European Bioinformatics Institute², GlaxoSmithKline³, Biogen Idec⁴, University of Oxford⁵

08 Jan 2021-Nucleic Acids Research

TL;DR: Open Targets Genetics offers tools that enable users to prioritise causal variants and genes at disease-associated loci and access systematic cross-disease and disease-molecular trait colocalization analysis across 92 cell types and tissues including the eQTL Catalogue.

...read moreread less

Abstract: Open Targets Genetics (https://genetics.opentargets.org) is an open-access integrative resource that aggregates human GWAS and functional genomics data including gene expression, protein abundance, chromatin interaction and conformation data from a wide range of cell types and tissues to make robust connections between GWAS-associated loci, variants and likely causal genes. This enables systematic identification and prioritisation of likely causal variants and genes across all published trait-associated loci. In this paper, we describe the public resources we aggregate, the technology and analyses we use, and the functionality that the portal offers. Open Targets Genetics can be searched by variant, gene or study/phenotype. It offers tools that enable users to prioritise causal variants and genes at disease-associated loci and access systematic cross-disease and disease-molecular trait colocalization analysis across 92 cell types and tissues including the eQTL Catalogue. Data visualizations such as Manhattan-like plots, regional plots, credible sets overlap between studies and PheWAS plots enable users to explore GWAS signals in depth. The integrated data is made available through the web portal, for bulk download and via a GraphQL API, and the software is open source. Applications of this integrated data include identification of novel targets for drug discovery and drug repurposing.

...read moreread less

Journal Article•DOI•

Single-cell meta-analysis of SARS-CoV-2 entry genes across tissues and demographics.

[...]

Christoph Muus¹, Malte D Luecken, Gökcen Eraslan¹, Lisa Sikkema, Avinash Waghray², Graham Heimberg¹, Yoshihiko Kobayashi³, Eeshit Dhaval Vaishnav¹, Eeshit Dhaval Vaishnav⁴, Ayshwarya Subramanian¹, Christopher Smillie¹, Karthik A. Jagadeesh¹, Elizabeth Thu Duong⁵, Evgenij Fiskin¹, Elena Torlai Triglia¹, Meshal Ansari, Peiwen Cai⁶, Brian M. Lin², Justin Buchanan⁵, Sijia Chen⁷, Jian Shu¹, Adam L. Haber¹, Adam L. Haber², Hattie Chung¹, Daniel T. Montoro¹, Taylor Adams⁸, Hananeh Aliee, Samuel J. Allon⁹, Samuel J. Allon⁴, Samuel J. Allon¹, Zaneta Andrusivova¹⁰, Ilias Angelidis, Orr Ashenberg¹, Kevin Bassler¹¹, Christophe Bécavin¹², Inbal Benhar¹, Joseph Bergenstråhle¹⁰, Ludvig Bergenstråhle¹⁰, Liam Bolt¹³, Emelie Braun¹⁴, Linh T. Bui¹⁵, Steven Callori¹⁶, Mark Chaffin¹, Evgeny Chichelnitskiy¹⁷, Joshua Chiou⁵, Thomas M. Conlon, Michael S. Cuoco¹, Anna S E Cuomo¹⁸, Marie Deprez¹², Grant Duclos¹⁶, Denise Fine¹⁹, David S. Fischer²⁰, Shila Ghazanfar²¹, Astrid Gillich²² - Show less +50 more•Institutions (22)

02 Mar 2021-Nature Medicine

TL;DR: In this paper, cell-type-specific expression of ACE2, TMPRSS2 and CTSL across 107 single-cell RNA-sequencing studies from different tissues was assessed.

...read moreread less

Abstract: Angiotensin-converting enzyme 2 (ACE2) and accessory proteases (TMPRSS2 and CTSL) are needed for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) cellular entry, and their expression may shed light on viral tropism and impact across the body. We assessed the cell-type-specific expression of ACE2, TMPRSS2 and CTSL across 107 single-cell RNA-sequencing studies from different tissues. ACE2, TMPRSS2 and CTSL are coexpressed in specific subsets of respiratory epithelial cells in the nasal passages, airways and alveoli, and in cells from other organs associated with coronavirus disease 2019 (COVID-19) transmission or pathology. We performed a meta-analysis of 31 lung single-cell RNA-sequencing studies with 1,320,896 cells from 377 nasal, airway and lung parenchyma samples from 228 individuals. This revealed cell-type-specific associations of age, sex and smoking with expression levels of ACE2, TMPRSS2 and CTSL. Expression of entry factors increased with age and in males, including in airway secretory cells and alveolar type 2 cells. Expression programs shared by ACE2+TMPRSS2+ cells in nasal, lung and gut tissues included genes that may mediate viral entry, key immune functions and epithelial-macrophage cross-talk, such as genes involved in the interleukin-6, interleukin-1, tumor necrosis factor and complement pathways. Cell-type-specific expression patterns may contribute to the pathogenesis of COVID-19, and our work highlights putative molecular pathways for therapeutic intervention.

...read moreread less

Journal Article•DOI•

Trans-ancestry genome-wide association meta-analysis of prostate cancer identifies new susceptibility loci and informs genetic risk prediction

[...]

David V. Conti¹, Burcu F. Darst¹, Lilit C. Moss¹, Edward J. Saunders² +251 more•Institutions (100)

04 Jan 2021-Nature Genetics

TL;DR: This paper conducted a meta-analysis of prostate cancer genome-wide association studies (107,247 cases and 127,006 controls) and identified 86 new genetic risk variants independently associated with prostate cancer risk, bringing the total to 269 known risk variants.

...read moreread less

Abstract: Prostate cancer is a highly heritable disease with large disparities in incidence rates across ancestry populations. We conducted a multiancestry meta-analysis of prostate cancer genome-wide association studies (107,247 cases and 127,006 controls) and identified 86 new genetic risk variants independently associated with prostate cancer risk, bringing the total to 269 known risk variants. The top genetic risk score (GRS) decile was associated with odds ratios that ranged from 5.06 (95% confidence interval (CI), 4.84–5.29) for men of European ancestry to 3.74 (95% CI, 3.36–4.17) for men of African ancestry. Men of African ancestry were estimated to have a mean GRS that was 2.18-times higher (95% CI, 2.14–2.22), and men of East Asian ancestry 0.73-times lower (95% CI, 0.71–0.76), than men of European ancestry. These findings support the role of germline variation contributing to population differences in prostate cancer risk, with the GRS offering an approach for personalized risk prediction.

...read moreread less

Journal Article•DOI•

Characterizing genetic intra-tumor heterogeneity across 2,658 human cancer genomes

[...]

Stefan C. Dentro¹, Stefan C. Dentro², Stefan C. Dentro³, Ignaty Leshchiner⁴, Kerstin Haase¹, Maxime Tarabichi¹, Maxime Tarabichi², Jeff Wintersinger⁵, Amit G. Deshwar⁵, Kaixian Yu⁶, Yulia Rubanova⁵, Geoff Macintyre⁷, Jonas Demeulemeester¹, Jonas Demeulemeester⁸, Ignacio Vázquez-García, Kortine Kleinheinz⁹, Kortine Kleinheinz¹⁰, Dimitri Livitz⁴, Salem Malikic, Nilgun Donmez¹¹, Nilgun Donmez¹², Subhajit Sengupta¹³, Pavana Anur¹⁴, Clemency Jolly¹, Marek Cmero¹⁵, Marek Cmero¹⁶, Daniel Rosebrock⁴, Steven E. Schumacher⁴, Yu Fan⁶, Matthew Fittall¹, Ruben M. Drews⁷, Xiaotong Yao¹⁷, Thomas B.K. Watkins¹, Juhee Lee¹⁸, Matthias Schlesner¹⁰, Hongtu Zhu⁶, David J. Adams², Nicholas McGranahan¹⁹, Charles Swanton¹, Charles Swanton¹⁹, Gad Getz, Paul C. Boutros⁵, Paul C. Boutros²⁰, Paul C. Boutros²¹, Marcin Imielinski¹⁷, Rameen Beroukhim²², Rameen Beroukhim⁴, S. Cenk Sahinalp, Yuan Ji¹³, Yuan Ji²³, Martin Peifer²⁴, Inigo Martincorena², Florian Markowetz⁷, Ville Mustonen²⁵, Ke Yuan²⁶, Ke Yuan⁷, Moritz Gerstung²⁷, Moritz Gerstung², Paul T. Spellman¹⁴, Wenyi Wang⁶, Quaid Morris, David C. Wedge²⁸, David C. Wedge³, Peter Van Loo¹, Santiago Gonzalez, David D.L. Bowtell, Peter J. Campbell, Shaolong Cao, Elizabeth L. Christie, Yupeng Cun, Kevin J. Dawson, Roland Eils, Dale W. Garsed, Gavin Ha, Lara Jerman, Henry Lee-Six, Thomas J. Mitchell, Layla Oesper, Myron Peto, Benjamin J. Raphael, Adriana Salcedo, Ruian Shi, Seung Jun Shin, Lincoln Stein, Oliver Spiro, Shankar Vembu, David A. Wheeler, Tsun-Po Yang - Show less +84 more•Institutions (28)

Francis Crick Institute¹, Wellcome Trust Sanger Institute², University of Oxford³, Broad Institute⁴, University of Toronto⁵, University of Texas MD Anderson Cancer Center⁶, University of Cambridge⁷, Katholieke Universiteit Leuven⁸, Heidelberg University⁹, German Cancer Research Center¹⁰, Simon Fraser University¹¹, Vancouver Prostate Centre¹², NorthShore University HealthSystem¹³, Oregon Health & Science University¹⁴, University of Melbourne¹⁵, Walter and Eliza Hall Institute of Medical Research¹⁶, Cornell University¹⁷, University of California, Santa Cruz¹⁸, University College London¹⁹, University of California, Los Angeles²⁰, Ontario Institute for Cancer Research²¹, Harvard University²², University of Chicago²³, University of Cologne²⁴, University of Helsinki²⁵, University of Glasgow²⁶, European Bioinformatics Institute²⁷, University of Manchester²⁸

15 Apr 2021-Cell

TL;DR: In this article, the authors extensively characterize intra-tumor heterogeneity (ITH) across whole-genome sequences of 2,658 cancer samples spanning 38 cancer types and identify cancer type-specific subclonal patterns of driver gene mutations, fusions, structural variants, and copy number alterations.

...read moreread less

Journal Article•DOI•

Genome-wide meta-analysis, fine-mapping and integrative prioritization implicate new Alzheimer’s disease risk genes

[...]

Jeremy Schwartzentruber¹, Jeremy Schwartzentruber², Sarah Cooper¹, Jimmy Z. Liu³, Inigo Barrio-Hernandez², Erica Bello¹, Natsuhiko Kumasaka¹, Adam Young⁴, Robin J.M. Franklin⁴, Toby Johnson, K. Estrada⁵, Daniel J. Gaffney¹, Pedro Beltrao², Andrew R. Bassett¹ - Show less +10 more•Institutions (5)

Wellcome Trust Sanger Institute¹, European Bioinformatics Institute², Biogen Idec³, University of Cambridge⁴, Rafael Advanced Defense Systems⁵

15 Feb 2021-Nature Genetics

TL;DR: In this paper, the authors performed an updated genome-wide AD meta-analysis, which identified 37 risk loci, including new associations near CCDC6, TSPAN14, NCK2 and SPRED2.

...read moreread less

Abstract: Genome-wide association studies have discovered numerous genomic loci associated with Alzheimer's disease (AD); yet the causal genes and variants are incompletely identified. We performed an updated genome-wide AD meta-analysis, which identified 37 risk loci, including new associations near CCDC6, TSPAN14, NCK2 and SPRED2. Using three SNP-level fine-mapping methods, we identified 21 SNPs with >50% probability each of being causally involved in AD risk and others strongly suggested by functional annotation. We followed this with colocalization analyses across 109 gene expression quantitative trait loci datasets and prioritization of genes by using protein interaction networks and tissue-specific expression. Combining this information into a quantitative score, we found that evidence converged on likely causal genes, including the above four genes, and those at previously discovered AD loci, including BIN1, APH1B, PTK2B, PILRA and CASS4.

...read moreread less

Journal Article•DOI•

The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation.

[...]

Samuel A. Lambert, Laurent Gil¹, Laurent Gil², Laurent Gil³, Simon Jupp⁴, Scott C. Ritchie, Yu Xu², Yu Xu¹, Annalisa Buniello⁴, Aoife McMahon⁴, Gad Abraham¹, Gad Abraham⁵, Michael A Chapman³, Michael A Chapman², Michael A Chapman¹, Helen Parkinson¹, Helen Parkinson⁴, John Danesh, Jacqueline A. L. MacArthur⁴, Michael Inouye - Show less +16 more•Institutions (5)

University of Cambridge¹, British Heart Foundation², Wellcome Trust Sanger Institute³, European Bioinformatics Institute⁴, Baker IDI Heart and Diabetes Institute⁵

01 Apr 2021-Nature Genetics

TL;DR: The Polygenic Score (PGS) catalog as discussed by the authors is an open resource of published scores (including variants, alleles and weights) and consistently curated metadata required for reproducibility and independent applications.

...read moreread less

Abstract: We present the Polygenic Score (PGS) Catalog ( https://www.PGSCatalog.org ), an open resource of published scores (including variants, alleles and weights) and consistently curated metadata required for reproducibility and independent applications. The PGS Catalog has capabilities for user deposition, expert curation and programmatic access, thus providing the community with a platform for PGS dissemination, research and translation.

...read moreread less

Journal Article•DOI•

Developmental cell programs are co-opted in inflammatory skin disease

[...]

Gary Reynolds¹, Peter Vegh¹, James Fletcher¹, Elizabeth Poyner¹, Emily Stephenson¹, Issac Goh¹, Rachel A. Botting¹, Ni Huang², Bayanne Olabi³, Bayanne Olabi¹, Anna Dubois¹, David Dixon¹, Kile Green¹, Daniel Maunder¹, Justin Engelbert¹, Mirjana Efremova², Krzysztof Polanski², Laura Jardine¹, C Jones¹, Thomas Ness¹, Dave Horsfall¹, Jim McGrath¹, Christopher D. Carey¹, Dorin-Mirel Popescu¹, Simone Webb¹, Xiao-Nong Wang¹, Ben Sayer¹, Jong-Eun Park², Victor A. Negri⁴, Daria Belokhvostova⁴, Magnus D. Lynch⁴, David McDonald¹, Andrew Filby¹, Tzachi Hagai⁵, Kerstin B. Meyer², A. Husain⁶, Jonathan Coxhead¹, Roser Vento-Tormo², Sam Behjati², Sam Behjati⁷, Steven Lisgo¹, Alexandra-Chloé Villani⁸, Alexandra-Chloé Villani⁹, Jaume Bacardit¹, Philip H. Jones², Philip H. Jones⁷, Edel A. O'Toole¹⁰, Graham S. Ogg¹¹, Neil Rajan¹, Nick J. Reynolds¹, Sarah A. Teichmann², Sarah A. Teichmann⁷, Fiona M. Watt⁴, Muzlifah Haniffa², Muzlifah Haniffa¹ - Show less +51 more•Institutions (11)

Newcastle University¹, Wellcome Trust Sanger Institute², NHS Lothian³, King's College London⁴, Tel Aviv University⁵, Royal Victoria Infirmary⁶, University of Cambridge⁷, Broad Institute⁸, Harvard University⁹, Queen Mary University of London¹⁰, University of Oxford¹¹

22 Jan 2021-Science

TL;DR: In this paper, the transcriptomes of more than 500,000 single cells from developing human fetal skin, healthy adult skin, and adult skin with atopic dermatitis and psoriasis were compared across development, homeostasis, and disease.

...read moreread less

Abstract: The skin confers biophysical and immunological protection through a complex cellular network established early in embryonic development. We profiled the transcriptomes of more than 500,000 single cells from developing human fetal skin, healthy adult skin, and adult skin with atopic dermatitis and psoriasis. We leveraged these datasets to compare cell states across development, homeostasis, and disease. Our analysis revealed an enrichment of innate immune cells in skin during the first trimester and clonal expansion of disease-associated lymphocytes in atopic dermatitis and psoriasis. We uncovered and validated in situ a reemergence of prenatal vascular endothelial cell and macrophage cellular programs in atopic dermatitis and psoriasis lesional skin. These data illustrate the dynamism of cutaneous immunity and provide opportunities for targeting pathological developmental programs in inflammatory skin diseases.

...read moreread less

Journal Article•DOI•

Chromothripsis drives the evolution of gene amplification in cancer.

[...]

Ofer Shoshani¹, Simon F. Brunner², Rona Yaeger³, Peter Ly, Yael Nechemia-Arbely, Dong Hyun Kim¹, Rongxin Fang¹, Guillaume A. Castillon¹, Miao Yu¹, Julia S. Z. Li¹, Ying Sun¹, Mark H. Ellisman¹, Bing Ren¹, Peter J. Campbell², Peter J. Campbell⁴, Don W. Cleveland¹ - Show less +12 more•Institutions (4)

University of California, San Diego¹, Wellcome Trust Sanger Institute², Memorial Sloan Kettering Cancer Center³, University of Cambridge⁴

04 Mar 2021-Nature

TL;DR: In this paper, the authors used whole-genome sequencing of clonal cell isolates that developed chemotherapeutic resistance to show that chromothripsis is a major driver of circular extrachromosomal DNA (ecDNA) amplification through mechanisms that depend on poly(ADP-ribose) polymerases (PARP) and the catalytic subunit of DNA-dependent protein kinase (DNA-PKcs).

...read moreread less

Abstract: Focal chromosomal amplification contributes to the initiation of cancer by mediating overexpression of oncogenes1–3, and to the development of cancer therapy resistance by increasing the expression of genes whose action diminishes the efficacy of anti-cancer drugs. Here we used whole-genome sequencing of clonal cell isolates that developed chemotherapeutic resistance to show that chromothripsis is a major driver of circular extrachromosomal DNA (ecDNA) amplification (also known as double minutes) through mechanisms that depend on poly(ADP-ribose) polymerases (PARP) and the catalytic subunit of DNA-dependent protein kinase (DNA-PKcs). Longitudinal analyses revealed that a further increase in drug tolerance is achieved by structural evolution of ecDNAs through additional rounds of chromothripsis. In situ Hi-C sequencing showed that ecDNAs preferentially tether near chromosome ends, where they re-integrate when DNA damage is present. Intrachromosomal amplifications that formed initially under low-level drug selection underwent continuing breakage–fusion–bridge cycles, generating amplicons more than 100 megabases in length that became trapped within interphase bridges and then shattered, thereby producing micronuclei whose encapsulated ecDNAs are substrates for chromothripsis. We identified similar genome rearrangement profiles linked to localized gene amplification in human cancers with acquired drug resistance or oncogene amplifications. We propose that chromothripsis is a primary mechanism that accelerates genomic DNA rearrangement and amplification into ecDNA and enables rapid acquisition of tolerance to altered growth conditions. Chromothripsis—a process during which chromosomes are ‘shattered’—drives the evolution of gene amplification and subsequent drug resistance in cancer cells.

...read moreread less

Journal Article•DOI•

Somatic mutation landscapes at single-molecule resolution

[...]

Federico Abascal¹, Luke M. R. Harvey¹, Emily G. Mitchell², Emily G. Mitchell¹, Andrew R. J. Lawson¹, Stefanie V Lensing¹, Peter R. Ellis¹, Andrew Russell¹, Raul E. Alcantara¹, Adrian Baez-Ortega¹, Yichen Wang¹, Eugene Jing Kwa¹, Henry Lee-Six¹, Alex Cagan¹, Tim H. H. Coorens¹, Michael Spencer Chapman¹, Sigurgeir Olafsson¹, Steven Leonard¹, David T. Jones¹, Heather E. Machado¹, Megan Davies², Nina F. Øbro², Krishnaa T. Mahubani², Kieren Allinson, Moritz Gerstung³, Kourosh Saeb-Parsy², David G. Kent⁴, David G. Kent², Elisa Laurenti², Michael R. Stratton¹, Raheleh Rahbari¹, Peter J. Campbell², Peter J. Campbell¹, Robert J. Osborne¹, Inigo Martincorena¹ - Show less +31 more•Institutions (4)

Wellcome Trust Sanger Institute¹, University of Cambridge², European Bioinformatics Institute³, University of York⁴

28 Apr 2021-Nature

TL;DR: NanoSeq as discussed by the authors is a duplex sequencing protocol with error rates of less than five errors per billion base pairs in single DNA molecules from cell populations, enabling the study of somatic mutations in any tissue independently of clonality.

...read moreread less

Abstract: Somatic mutations drive the development of cancer and may contribute to ageing and other diseases1,2. Despite their importance, the difficulty of detecting mutations that are only present in single cells or small clones has limited our knowledge of somatic mutagenesis to a minority of tissues. Here, to overcome these limitations, we developed nanorate sequencing (NanoSeq), a duplex sequencing protocol with error rates of less than five errors per billion base pairs in single DNA molecules from cell populations. This rate is two orders of magnitude lower than typical somatic mutation loads, enabling the study of somatic mutations in any tissue independently of clonality. We used this single-molecule sensitivity to study somatic mutations in non-dividing cells across several tissues, comparing stem cells to differentiated cells and studying mutagenesis in the absence of cell division. Differentiated cells in blood and colon displayed remarkably similar mutation loads and signatures to their corresponding stem cells, despite mature blood cells having undergone considerably more divisions. We then characterized the mutational landscape of post-mitotic neurons and polyclonal smooth muscle, confirming that neurons accumulate somatic mutations at a constant rate throughout life without cell division, with similar rates to mitotically active tissues. Together, our results suggest that mutational processes that are independent of cell division are important contributors to somatic mutagenesis. We anticipate that the ability to reliably detect mutations in single DNA molecules could transform our understanding of somatic mutagenesis and enable non-invasive studies on large-scale cohorts. NanoSeq is used to detect mutations in single DNA molecules and analyses show that mutational processes that are independent of cell division are important contributors to somatic mutagenesis.

...read moreread less

Journal Article•DOI•

The trans-ancestral genomic architecture of glycemic traits

[...]

Ji Chen¹, Ji Chen², Cassandra N. Spracklen³, Cassandra N. Spracklen⁴ +475 more•Institutions (146)

31 May 2021-Nature Genetics

TL;DR: This paper aggregated genome-wide association studies comprising up to 281,416 individuals without diabetes (30% non-European ancestry) for whom fasting glucose, 2-h glucose after an oral glucose challenge, glycated hemoglobin and fasting insulin data were available.

...read moreread less

Abstract: Glycemic traits are used to diagnose and monitor type 2 diabetes and cardiometabolic health. To date, most genetic studies of glycemic traits have focused on individuals of European ancestry. Here we aggregated genome-wide association studies comprising up to 281,416 individuals without diabetes (30% non-European ancestry) for whom fasting glucose, 2-h glucose after an oral glucose challenge, glycated hemoglobin and fasting insulin data were available. Trans-ancestry and single-ancestry meta-analyses identified 242 loci (99 novel; P < 5 × 10-8), 80% of which had no significant evidence of between-ancestry heterogeneity. Analyses restricted to individuals of European ancestry with equivalent sample size would have led to 24 fewer new loci. Compared with single-ancestry analyses, equivalent-sized trans-ancestry fine-mapping reduced the number of estimated variants in 99% credible sets by a median of 37.5%. Genomic-feature, gene-expression and gene-set analyses revealed distinct biological signatures for each trait, highlighting different underlying biological pathways. Our results increase our understanding of diabetes pathophysiology by using trans-ancestry studies for improved power and resolution.

...read moreread less

Journal Article•DOI•

Ventilator-associated pneumonia in critically ill patients with COVID-19.

[...]

Mailis Maes¹, Ellen Higginson¹, Joana Pereira-Dias¹, Martin D. Curran², Surendra Parmar, Fahad A Khokhar¹, Delphine Cuchet-Lourenço¹, Janine Lux¹, Sapna Sharma-Hajela², Benjamin Ravenhill², Islam Hamed², Laura Heales², Razeen Mahroof², Amelia Solderholm¹, Sally Forrest¹, Sushmita Sridhar¹, Sushmita Sridhar³, Nicholas M. Brown, Stephen Baker¹, Vilas Navapurkar², Gordon Dougan¹, Josefin Bartholdson Scott¹, Andrew Conway Morris¹, Andrew Conway Morris² - Show less +20 more•Institutions (3)

University of Cambridge¹, Cambridge University Hospitals NHS Foundation Trust², Wellcome Trust Sanger Institute³

11 Jan 2021-Critical Care

TL;DR: In this article, the authors compared the incidence of VAP and secondary infections using a combination of microbial culture and a TaqMan multi-pathogen array, and determined the lung microbiome composition using 16S RNA analysis in a subset of samples.

...read moreread less

Abstract: Pandemic COVID-19 caused by the coronavirus SARS-CoV-2 has a high incidence of patients with severe acute respiratory syndrome (SARS). Many of these patients require admission to an intensive care unit (ICU) for invasive ventilation and are at significant risk of developing a secondary, ventilator-associated pneumonia (VAP). To study the incidence of VAP and bacterial lung microbiome composition of ventilated COVID-19 and non-COVID-19 patients. In this retrospective observational study, we compared the incidence of VAP and secondary infections using a combination of microbial culture and a TaqMan multi-pathogen array. In addition, we determined the lung microbiome composition using 16S RNA analysis in a subset of samples. The study involved 81 COVID-19 and 144 non-COVID-19 patients receiving invasive ventilation in a single University teaching hospital between March 15th 2020 and August 30th 2020. COVID-19 patients were significantly more likely to develop VAP than patients without COVID (Cox proportional hazard ratio 2.01 95% CI 1.14–3.54, p = 0.0015) with an incidence density of 28/1000 ventilator days versus 13/1000 for patients without COVID (p = 0.009). Although the distribution of organisms causing VAP was similar between the two groups, and the pulmonary microbiome was similar, we identified 3 cases of invasive aspergillosis amongst the patients with COVID-19 but none in the non-COVID-19 cohort. Herpesvirade activation was also numerically more frequent amongst patients with COVID-19. COVID-19 is associated with an increased risk of VAP, which is not fully explained by the prolonged duration of ventilation. The pulmonary dysbiosis caused by COVID-19, and the causative organisms of secondary pneumonia observed are similar to that seen in critically ill patients ventilated for other reasons.

...read moreread less

Journal Article•DOI•

Cells of the human intestinal tract mapped across space and time.

[...]

Rasa Elmentaite¹, Natsuhiko Kumasaka¹, Kenny Roberts¹, Aaron M. Fleming², Emma Dann¹, Hamish W King³, Vitalii Kleshchevnikov¹, Monika Dabrowska¹, Sophie Pritchard¹, Liam Bolt¹, Sara F. Vieira¹, Lira Mamanova¹, Ni Huang¹, Francesca Perrone⁴, Issac Goh Kai’En⁵, Steven Lisgo⁵, Matilda Katan⁶, Steven Leonard¹, Thomas R. W. Oliver⁷, Thomas R. W. Oliver¹, C. Elizabeth Hook⁷, Komal Nayak⁴, Lia S. Campos¹, Cecilia Domínguez Conde¹, Emily Stephenson⁵, Justin Engelbert⁵, Rachel A. Botting⁵, Krzysztof Polanski¹, Stijn van Dongen¹, Minal Patel¹, Michael D Morgan⁴, Michael D Morgan⁸, John C. Marioni⁴, John C. Marioni⁸, John C. Marioni¹, Omer Ali Bayraktar¹, Kerstin B Meyer¹, Xiaoling He⁴, Roger A. Barker⁴, Holm H. Uhlig⁹, Holm H. Uhlig¹⁰, Krishnaa T. Mahbubani⁴, Kourosh Saeb-Parsy⁴, Matthias Zilbauer⁴, Menna R. Clatworthy¹, Menna R. Clatworthy², Muzlifah Haniffa¹, Muzlifah Haniffa⁵, Kylie R. James¹, Kylie R. James¹¹, Sarah A. Teichmann¹, Sarah A. Teichmann⁴ - Show less +48 more•Institutions (11)

Wellcome Trust Sanger Institute¹, Laboratory of Molecular Biology², Queen Mary University of London³, University of Cambridge⁴, Newcastle University⁵, University College London⁶, Cambridge University Hospitals NHS Foundation Trust⁷, European Bioinformatics Institute⁸, University of Oxford⁹, John Radcliffe Hospital¹⁰, Garvan Institute of Medical Research¹¹

09 Sep 2021-Nature

TL;DR: The cellular landscape of the human intestinal tract is dynamic throughout life, developing in utero and changing in response to functional requirements and environmental exposures as discussed by the authors, using single-cell RNA sequencing and antigen receptor analysis of almost half a million cells from up to 5 anatomical regions of the developing and up to 11 distinct anatomical regions in the healthy human gut.

...read moreread less

Abstract: The cellular landscape of the human intestinal tract is dynamic throughout life, developing in utero and changing in response to functional requirements and environmental exposures. Here, to comprehensively map cell lineages, we use single-cell RNA sequencing and antigen receptor analysis of almost half a million cells from up to 5 anatomical regions in the developing and up to 11 distinct anatomical regions in the healthy paediatric and adult human gut. This reveals the existence of transcriptionally distinct BEST4 epithelial cells throughout the human intestinal tract. Furthermore, we implicate IgG sensing as a function of intestinal tuft cells. We describe neural cell populations in the developing enteric nervous system, and predict cell-type-specific expression of genes associated with Hirschsprung’s disease. Finally, using a systems approach, we identify key cell players that drive the formation of secondary lymphoid tissue in early human development. We show that these programs are adopted in inflammatory bowel disease to recruit and retain immune cells at the site of inflammation. This catalogue of intestinal cells will provide new insights into cellular programs in development, homeostasis and disease. Cells from embryonic, fetal, paediatric and adult human intestinal tissue are analysed at different locations along the intestinal tract to construct a single-cell atlas of the developing and adult human intestinal tract, encompassing all cell lineages.

...read moreread less

Journal Article•DOI•

Computational principles and challenges in single-cell data integration.

[...]

Ricard Argelaguet¹, Ricard Argelaguet², Anna S E Cuomo², Anna S E Cuomo³, Oliver Stegle³, Oliver Stegle⁴, John C. Marioni², John C. Marioni⁵, John C. Marioni³ - Show less +5 more•Institutions (5)

Babraham Institute¹, European Bioinformatics Institute², Wellcome Trust Sanger Institute³, German Cancer Research Center⁴, University of Cambridge⁵

03 May 2021-Nature Biotechnology

TL;DR: In this article, a broad collection of approaches ranging from batch correction of individual omics datasets to association of chromatin accessibility and genetic variation with transcription are reviewed, as the number of single-cell experiments with multiple data modalities increases.

...read moreread less

Abstract: The development of single-cell multimodal assays provides a powerful tool for investigating multiple dimensions of cellular heterogeneity, enabling new insights into development, tissue homeostasis and disease. A key challenge in the analysis of single-cell multimodal data is to devise appropriate strategies for tying together data across different modalities. The term ‘data integration’ has been used to describe this task, encompassing a broad collection of approaches ranging from batch correction of individual omics datasets to association of chromatin accessibility and genetic variation with transcription. Although existing integration strategies exploit similar mathematical ideas, they typically have distinct goals and rely on different principles and assumptions. Consequently, new definitions and concepts are needed to contextualize existing methods and to enable development of new methods. As the number of single-cell experiments with multiple data modalities increases, Argelaguet and colleagues review the concepts and challenges of data integration.

...read moreread less

Journal Article•DOI•

Mapping the temporal and spatial dynamics of the human endometrium in vivo and in vitro.

[...]

Luz Garcia-Alonso¹, Louis-François Handfield¹, Kenny Roberts¹, Konstantina Nikolakopoulou², Konstantina Nikolakopoulou³, Ridma C. Fernando², Ridma C. Fernando³, Lucy Gardner³, Benjamin Woodhams¹, Benjamin Woodhams⁴, Anna Arutyunyan³, Anna Arutyunyan¹, Krzysztof Polanski¹, Regina Hoo¹, Regina Hoo³, Carmen Sancho-Serra¹, Tong Li¹, Kwasi Kwakwa⁴, Elizabeth Tuck¹, Valentina Lorenzi¹, Hassan Massalha¹, Hassan Massalha³, Martin Prete¹, Vitalii Kleshchevnikov¹, Aleksandra Tarkowska¹, Tarryn Porter¹, Cecilia Icoresi Mazzeo¹, Stijn van Dongen¹, Monika Dabrowska¹, Vasyl Vaskivskyi¹, Krishnaa T. Mahbubani³, Jong-Eun Park¹, Mercedes Jimenez-Linan⁵, Lia S. Campos¹, Vladimir Yu. Kiselev¹, Cecilia Lindskog⁶, Paul Ayuk⁷, Elena Prigmore¹, Michael R. Stratton¹, Kourosh Saeb-Parsy³, Ashley Moffett³, Luiza Moore⁵, Luiza Moore¹, Omer Ali Bayraktar¹, Sarah A. Teichmann¹, Sarah A. Teichmann³, Margherita Y. Turco², Margherita Y. Turco³, Roser Vento-Tormo¹, Roser Vento-Tormo³ - Show less +46 more•Institutions (7)

Wellcome Trust Sanger Institute¹, Friedrich Miescher Institute for Biomedical Research², University of Cambridge³, European Bioinformatics Institute⁴, Cambridge University Hospitals NHS Foundation Trust⁵, Science for Life Laboratory⁶, Newcastle upon Tyne Hospitals NHS Foundation Trust⁷

02 Dec 2021-Nature Genetics

TL;DR: In this paper, the authors dissect the signaling pathways that determine cell fate of the epithelial lineages in the lumenal and glandular microenvironments of the endometrium.

...read moreread less

Abstract: The endometrium, the mucosal lining of the uterus, undergoes dynamic changes throughout the menstrual cycle in response to ovarian hormones. We have generated dense single-cell and spatial reference maps of the human uterus and three-dimensional endometrial organoid cultures. We dissect the signaling pathways that determine cell fate of the epithelial lineages in the lumenal and glandular microenvironments. Our benchmark of the endometrial organoids reveals the pathways and cell states regulating differentiation of the secretory and ciliated lineages both in vivo and in vitro. In vitro downregulation of WNT or NOTCH pathways increases the differentiation efficiency along the secretory and ciliated lineages, respectively. We utilize our cellular maps to deconvolute bulk data from endometrial cancers and endometriotic lesions, illuminating the cell types dominating in each of these disorders. These mechanistic insights provide a platform for future development of treatments for common conditions including endometriosis and endometrial carcinoma. Single-cell and spatial transcriptomic profiling of the human endometrium highlights pathways governing the proliferative and secretory phases of the menstrual cycle. Analyses of endometrial organoids show that WNT and NOTCH signaling modulate differentiation into the secretory and ciliated epithelial lineages, respectively.

...read moreread less

Journal Article•DOI•

Tutorial: guidelines for the computational analysis of single-cell RNA sequencing data.

[...]

Tallulah S. Andrews¹, Vladimir Yu. Kiselev¹, Davis J. McCarthy², Davis J. McCarthy³, Martin Hemberg¹ - Show less +1 more•Institutions (3)

Wellcome Trust Sanger Institute¹, St. Vincent's Institute of Medical Research², University of Melbourne³

01 Jan 2021-Nature Protocols

TL;DR: This tutorial provides a hands-on guide for experimentalists interested in analyzing their data as well as an overview for bioinformaticians seeking to develop new computational methods.

...read moreread less

Abstract: Single-cell RNA sequencing (scRNA-seq) is a popular and powerful technology that allows you to profile the whole transcriptome of a large number of individual cells. However, the analysis of the large volumes of data generated from these experiments requires specialized statistical and computational methods. Here we present an overview of the computational workflow involved in processing scRNA-seq data. We discuss some of the most common tasks and the tools available for addressing central biological questions. In this article and our companion website ( https://scrnaseq-course.cog.sanger.ac.uk/website/index.html ), we provide guidelines regarding best practices for performing computational analyses. This tutorial provides a hands-on guide for experimentalists interested in analyzing their data as well as an overview for bioinformaticians seeking to develop new computational methods.

...read moreread less

Collapse