Showing papers on "Gene published in 2020"

PDF

Open Access

Journal Article•DOI•

The mutational constraint spectrum quantified from variation in 141,456 humans

[...]

Konrad J. Karczewski¹, Laurent C. Francioli¹, Grace Tiao¹, Beryl B. Cummings¹, Jessica Alföldi¹, Qingbo Wang¹, Ryan L. Collins¹, Kristen M. Laricchia¹, Andrea Ganna¹, Daniel P. Birnbaum¹, Laura D. Gauthier¹, Harrison Brand¹, Matthew Solomonson¹, Nicholas A. Watts¹, Daniel R. Rhodes², Moriel Singer-Berk¹, Eleina M. England¹, Eleanor G. Seaby¹, Jack A. Kosmicki¹, Raymond K. Walters¹, Katherine Tashman¹, Yossi Farjoun¹, Eric Banks¹, Timothy Poterba¹, Arcturus Wang¹, Cotton Seed¹, Nicola Whiffin¹, Jessica X. Chong³, Kaitlin E. Samocha⁴, Emma Pierce-Hoffman¹, Zachary Zappala¹, Anne H. O’Donnell-Luria¹, Eric Vallabh Minikel¹, Ben Weisburd¹, Monkol Lek⁵, James S. Ware¹, Christopher Vittal⁶, Irina M. Armean¹, Louis Bergelson¹, Kristian Cibulskis¹, Kristen M. Connolly¹, Miguel Covarrubias¹, Stacey Donnelly¹, Steven Ferriera¹, Stacey Gabriel¹, Jeff Gentry¹, Namrata Gupta¹, Thibault Jeandet¹, Diane Kaplan¹, Christopher Llanwarne¹, Ruchi Munshi¹, Sam Novod¹, Nikelle Petrillo¹, David Roazen¹, Valentin Ruano-Rubio¹, Andrea Saltzman¹, Molly Schleicher¹, Jose Soto¹, Kathleen Tibbetts¹, Charlotte Tolonen¹, Gordon Wade¹, Michael E. Talkowski¹, Benjamin M. Neale¹, Mark J. Daly¹, Daniel G. MacArthur¹ - Show less +61 more•Institutions (6)

Broad Institute¹, Queen Mary University of London², University of Washington³, Wellcome Trust Sanger Institute⁴, Yale University⁵, Harvard University⁶

27 May 2020-Nature

TL;DR: A catalogue of predicted loss-of-function variants in 125,748 whole-exome and 15,708 whole-genome sequencing datasets from the Genome Aggregation Database (gnomAD) reveals the spectrum of mutational constraints that affect these human protein-coding genes.

...read moreread less

Abstract: Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences of gene disruption: genes that are crucial for the function of an organism will be depleted of such variants in natural populations, whereas non-essential genes will tolerate their accumulation. However, predicted loss-of-function variants are enriched for annotation errors, and tend to be found at extremely low frequencies, so their analysis requires careful variant annotation and very large sample sizes1. Here we describe the aggregation of 125,748 exomes and 15,708 genomes from human sequencing studies into the Genome Aggregation Database (gnomAD). We identify 443,769 high-confidence predicted loss-of-function variants in this cohort after filtering for artefacts caused by sequencing and annotation errors. Using an improved model of human mutation rates, we classify human protein-coding genes along a spectrum that represents tolerance to inactivation, validate this classification using data from model organisms and engineered human cells, and show that it can be used to improve the power of gene discovery for both common and rare diseases. A catalogue of predicted loss-of-function variants in 125,748 whole-exome and 15,708 whole-genome sequencing datasets from the Genome Aggregation Database (gnomAD) reveals the spectrum of mutational constraints that affect these human protein-coding genes.

...read moreread less

4,913 citations

Journal Article•DOI•

Generalizing RNA velocity to transient cell states through dynamical modeling.

[...]

Volker Bergen¹, Marius Lange¹, Stefan Peidli¹, F. Alexander Wolf, Fabian J. Theis¹ - Show less +1 more•Institutions (1)

Technische Universität München¹

03 Aug 2020-Nature Biotechnology

TL;DR: ScVelo reconstructs transient cell states and differentiation pathways from single-cell RNA-sequencing data, and infer gene-specific rates of transcription, splicing and degradation, recover each cell’s position in the underlying differentiation processes and detect putative driver genes.

...read moreread less

Abstract: RNA velocity has opened up new ways of studying cellular differentiation in single-cell RNA-sequencing data. It describes the rate of gene expression change for an individual gene at a given time point based on the ratio of its spliced and unspliced messenger RNA (mRNA). However, errors in velocity estimates arise if the central assumptions of a common splicing rate and the observation of the full splicing dynamics with steady-state mRNA levels are violated. Here we present scVelo, a method that overcomes these limitations by solving the full transcriptional dynamics of splicing kinetics using a likelihood-based dynamical model. This generalizes RNA velocity to systems with transient cell states, which are common in development and in response to perturbations. We apply scVelo to disentangling subpopulation kinetics in neurogenesis and pancreatic endocrinogenesis. We infer gene-specific rates of transcription, splicing and degradation, recover each cell's position in the underlying differentiation processes and detect putative driver genes. scVelo will facilitate the study of lineage decisions and gene regulation.

...read moreread less

1,041 citations

Journal Article•DOI•

Emerging SARS-CoV-2 mutation hot spots include a novel RNA-dependent-RNA polymerase variant.

[...]

Maria Pachetti¹, Maria Pachetti², Bruna Marini³, Francesca Benedetti⁴, Fabiola Giudici⁵, Elisabetta Mauro³, Paola Storici², Claudio Masciovecchio², Silvia Angeletti, Massimo Ciccozzi, Robert C. Gallo⁴, Robert C. Gallo⁶, Davide Zella⁴, Davide Zella⁶, Rudy Ippodrino³ - Show less +11 more•Institutions (6)

University of Trieste¹, Elettra Sincrotrone Trieste², AREA Science Park³, University of Maryland, Baltimore⁴, Health Science University⁵, Global Virus Network⁶

22 Apr 2020-Journal of Translational Medicine

TL;DR: The findings suggest that the virus is evolving and European, North American and Asian strains might coexist, each of them characterized by a different mutation pattern.

...read moreread less

Abstract: SARS-CoV-2 is a RNA coronavirus responsible for the pandemic of the Severe Acute Respiratory Syndrome (COVID-19). RNA viruses are characterized by a high mutation rate, up to a million times higher than that of their hosts. Virus mutagenic capability depends upon several factors, including the fidelity of viral enzymes that replicate nucleic acids, as SARS-CoV-2 RNA dependent RNA polymerase (RdRp). Mutation rate drives viral evolution and genome variability, thereby enabling viruses to escape host immunity and to develop drug resistance. We analyzed 220 genomic sequences from the GISAID database derived from patients infected by SARS-CoV-2 worldwide from December 2019 to mid-March 2020. SARS-CoV-2 reference genome was obtained from the GenBank database. Genomes alignment was performed using Clustal Omega. Mann–Whitney and Fisher-Exact tests were used to assess statistical significance. We characterized 8 novel recurrent mutations of SARS-CoV-2, located at positions 1397, 2891, 14408, 17746, 17857, 18060, 23403 and 28881. Mutations in 2891, 3036, 14408, 23403 and 28881 positions are predominantly observed in Europe, whereas those located at positions 17746, 17857 and 18060 are exclusively present in North America. We noticed for the first time a silent mutation in RdRp gene in England (UK) on February 9th, 2020 while a different mutation in RdRp changing its amino acid composition emerged on February 20th, 2020 in Italy (Lombardy). Viruses with RdRp mutation have a median of 3 point mutations [range: 2–5], otherwise they have a median of 1 mutation [range: 0–3] (p value < 0.001). These findings suggest that the virus is evolving and European, North American and Asian strains might coexist, each of them characterized by a different mutation pattern. The contribution of the mutated RdRp to this phenomenon needs to be investigated. To date, several drugs targeting RdRp enzymes are being employed for SARS-CoV-2 infection treatment. Some of them have a predicted binding moiety in a SARS-CoV-2 RdRp hydrophobic cleft, which is adjacent to the 14408 mutation we identified. Consequently, it is important to study and characterize SARS-CoV-2 RdRp mutation in order to assess possible drug-resistance viral phenotypes. It is also important to recognize whether the presence of some mutations might correlate with different SARS-CoV-2 mortality rates.

...read moreread less

842 citations

Journal Article•DOI•

A molecular cell atlas of the human lung from single-cell RNA sequencing.

[...]

Kyle J. Travaglini¹, Ahmad N. Nabhan², Ahmad N. Nabhan¹, Lolita Penland, Rahul Sinha¹, Astrid Gillich¹, Rene Sit, Stephen Chang¹, Stephanie D. Conley¹, Yasuo Mori¹, Yasuo Mori³, Jun Seita¹, Gerald J. Berry¹, Joseph B. Shrager¹, Ross J. Metzger¹, Christin S. Kuo¹, Norma Neff, Irving L. Weissman, Stephen R. Quake¹, Mark A. Krasnow¹ - Show less +16 more•Institutions (3)

Stanford University¹, Genentech², Kyushu University³

18 Nov 2020-Nature

TL;DR: Droplet- and plate-based single cell RNA sequencing applied to ~75,000 human cells across all lung tissue compartments and circulating blood, combined with a multi-pronged cell annotation approach, have allowed them to define the gene expression profiles and anatomical locations of 58 cell populations in the human lung.

...read moreread less

Abstract: Although single-cell RNA sequencing studies have begun to provide compendia of cell expression profiles1–9, it has been difficult to systematically identify and localize all molecular cell types in individual organs to create a full molecular cell atlas. Here, using droplet- and plate-based single-cell RNA sequencing of approximately 75,000 human cells across all lung tissue compartments and circulating blood, combined with a multi-pronged cell annotation approach, we create an extensive cell atlas of the human lung. We define the gene expression profiles and anatomical locations of 58 cell populations in the human lung, including 41 out of 45 previously known cell types and 14 previously unknown ones. This comprehensive molecular atlas identifies the biochemical functions of lung cells and the transcription factors and markers for making and monitoring them; defines the cell targets of circulating hormones and predicts local signalling interactions and immune cell homing; and identifies cell types that are directly affected by lung disease genes and respiratory viruses. By comparing human and mouse data, we identified 17 molecular cell types that have been gained or lost during lung evolution and others with substantially altered expression profiles, revealing extensive plasticity of cell types and cell-type-specific gene expression during organ evolution including expression switches between cell types. This atlas provides the molecular foundation for investigating how lung cell identities, functions and interactions are achieved in development and tissue engineering and altered in disease and evolution. Expression profiling on 75,000 single cells creates a comprehensive cell atlas of the human lung that includes 41 out of 45 previously known cell types and 14 new ones.

...read moreread less

795 citations

Journal Article•DOI•

NicheNet: modeling intercellular communication by linking ligands to target genes

[...]

Robin Browaeys¹, Wouter Saelens¹, Yvan Saeys¹•Institutions (1)

Ghent University¹

01 Feb 2020-Nature Methods

TL;DR: NicheNet is presented, a method that predicts ligand–target links between interacting cells by combining their expression data with prior knowledge on signaling and gene regulatory networks, and can infer active ligands and their gene regulatory effects on interacting cells.

...read moreread less

Abstract: Computational methods that model how gene expression of a cell is influenced by interacting cells are lacking. We present NicheNet (https://github.com/saeyslab/nichenetr), a method that predicts ligand-target links between interacting cells by combining their expression data with prior knowledge on signaling and gene regulatory networks. We applied NicheNet to tumor and immune cell microenvironment data and demonstrate that NicheNet can infer active ligands and their gene regulatory effects on interacting cells.

...read moreread less

681 citations

Posted Content•DOI•

Single-cell RNA expression profiling of ACE2, the putative receptor of Wuhan 2019-nCov

[...]

Yu Zhao¹, Zixian Zhao¹, Yujia Wang¹, Yueqing Zhou¹, Yu Ma, Wei Zuo², Wei Zuo¹ - Show less +3 more•Institutions (2)

Tongji University¹, Guangzhou Medical University²

26 Jan 2020-bioRxiv

TL;DR: A biological background for the epidemic investigation of the 2019-nCov infection disease is provided, and the result indicates that the ACE2 virus receptor expression is concentrated in a small population of type II alveolar cells (AT2).

...read moreread less

Abstract: A novel coronavirus (2019-nCov) was identified in Wuhan, Hubei Province, China in December of 2019. This new coronavirus has resulted in thousands of cases of lethal disease in China, with additional patients being identified in a rapidly growing number internationally. 2019-nCov was reported to share the same receptor, Angiotensin-converting enzyme 2 (ACE2), with SARS-Cov. Here based on the public database and the state-of-the-art single-cell RNA-Seq technique, we analyzed the ACE2 RNA expression profile in the normal human lungs. The result indicates that the ACE2 virus receptor expression is concentrated in a small population of type II alveolar cells (AT2). Surprisingly, we found that this population of ACE2-expressing AT2 also highly expressed many other genes that positively regulating viral reproduction and transmission. A comparison between eight individual samples demonstrated that the Asian male one has an extremely large number of ACE2-expressing cells in the lung. This study provides a biological background for the epidemic investigation of the 2019-nCov infection disease, and could be informative for future anti-ACE2 therapeutic strategy development.

...read moreread less

631 citations

Journal Article•DOI•

Single-Cell RNA Expression Profiling of ACE2, the Receptor of SARS-CoV-2.

[...]

Yu Zhao¹, Zixian Zhao¹, Yujia Wang¹, Yueqing Zhou¹, Yu Ma, Wei Zuo - Show less +2 more•Institutions (1)

Tongji University¹

01 Sep 2020-American Journal of Respiratory and Critical Care Medicine

TL;DR: The recently developed single-cell RNA-sequencing technology enables us to study the ACE2 expression in each cell type and provides quantitative information at a single- cell resolution, and shows that in the normal human lung, ACE2 is mainly expressed by type II alveolar (AT2) and type I alveolars (AT1) epithelial cells.

...read moreread less

Abstract: A novel coronavirus SARS-CoV-2 was identified in Wuhan, Hubei Province, China in December of 2019. According to WHO report, this new coronavirus has resulted in 76,392 confirmed infections and 2,348 deaths in China by 22 February, 2020, with additional patients being identified in a rapidly growing number internationally. SARS-CoV-2 was reported to share the same receptor, Angiotensin-converting enzyme 2 (ACE2), with SARS-CoV. Here based on the public database and the state-of-the-art single-cell RNA-Seq technique, we analyzed the ACE2 RNA expression profile in the normal human lungs. The result indicates that the ACE2 virus receptor expression is concentrated in a small population of type II alveolar cells (AT2). Surprisingly, we found that this population of ACE2-expressing AT2 also highly expressed many other genes that positively regulating viral entry, reproduction and transmission. This study provides a biological background for the epidemic investigation of the COVID-19, and could be informative for future anti-ACE2 therapeutic strategy development.

...read moreread less

610 citations

Journal Article•DOI•

The regulation and functions of DNA and RNA G-quadruplexes.

[...]

Dhaval Varshney, Jochen Spiegel, Katherine G. Zyner, David Tannahill, Shankar Balasubramanian¹ - Show less +1 more•Institutions (1)

University of Cambridge¹

20 Apr 2020-Nature Reviews Molecular Cell Biology

TL;DR: This Review discusses the identification of G4s and evidence for their formation in cells using chemical biology, imaging and genomic technologies, and discusses the connection between G4 formation and synthetic lethality in cancer cells, and recent progress towards considering G 4s as therapeutic targets in human diseases.

...read moreread less

Abstract: DNA and RNA can adopt various secondary structures. Four-stranded G-quadruplex (G4) structures form through self-recognition of guanines into stacked tetrads, and considerable biophysical and structural evidence exists for G4 formation in vitro. Computational studies and sequencing methods have revealed the prevalence of G4 sequence motifs at gene regulatory regions in various genomes, including in humans. Experiments using chemical, molecular and cell biology methods have demonstrated that G4s exist in chromatin DNA and in RNA, and have linked G4 formation with key biological processes ranging from transcription and translation to genome instability and cancer. In this Review, we first discuss the identification of G4s and evidence for their formation in cells using chemical biology, imaging and genomic technologies. We then discuss possible functions of DNA G4s and their interacting proteins, particularly in transcription, telomere biology and genome instability. Roles of RNA G4s in RNA biology, especially in translation, are also discussed. Furthermore, we consider the emerging relationships of G4s with chromatin and with RNA modifications. Finally, we discuss the connection between G4 formation and synthetic lethality in cancer cells, and recent progress towards considering G4s as therapeutic targets in human diseases.

...read moreread less

543 citations

Journal Article•DOI•

A large-scale binding and functional map of human RNA-binding proteins

[...]

Eric L. Van Nostrand¹, Peter Freese², Gabriel A. Pratt¹, Xiaofeng Wang, Xintao Wei³, Rui Xiao⁴, Rui Xiao¹, Steven M. Blue¹, Jia-Yu Chen¹, Neal A.L. Cody, Daniel Dominguez², Sara Olson³, Balaji Sundararaman¹, Lijun Zhan³, Cassandra Bazile², Louis Philip Benoit Bouvrette⁵, Julie Bergalet, Michael O. Duff³, Keri E. Garcia¹, Chelsea Gelboin-Burkhart¹, Myles Hochman², Nicole J. Lambert², Hairi Li¹, Michael P. McGurk², Thai B. Nguyen¹, Tsultrim Palden², Ines Rabano¹, Shashank Sathe¹, Rebecca Stanton¹, Amanda Su², Ruth Wang¹, Brian A. Yee¹, Bing Zhou¹, Ashley L. Louie¹, Stefan Aigner¹, Xiang-Dong Fu¹, Eric Lécuyer⁵, Eric Lécuyer⁶, Christopher B. Burge², Brenton R. Graveley³, Gene W. Yeo¹ - Show less +37 more•Institutions (6)

University of California, San Diego¹, Massachusetts Institute of Technology², University of Connecticut Health Center³, Wuhan University⁴, Université de Montréal⁵, McGill University⁶

29 Jul 2020-Nature

TL;DR: The spectrum of RBP binding throughout the transcriptome and the connections between these interactions and various aspects of RNA biology, including RNA stability, splicing regulation and RNA localization are described.

...read moreread less

Abstract: Many proteins regulate the expression of genes by binding to specific regions encoded in the genome1. Here we introduce a new data set of RNA elements in the human genome that are recognized by RNA-binding proteins (RBPs), generated as part of the Encyclopedia of DNA Elements (ENCODE) project phase III. This class of regulatory elements functions only when transcribed into RNA, as they serve as the binding sites for RBPs that control post-transcriptional processes such as splicing, cleavage and polyadenylation, and the editing, localization, stability and translation of mRNAs. We describe the mapping and characterization of RNA elements recognized by a large collection of human RBPs in K562 and HepG2 cells. Integrative analyses using five assays identify RBP binding sites on RNA and chromatin in vivo, the in vitro binding preferences of RBPs, the function of RBP binding sites and the subcellular localization of RBPs, producing 1,223 replicated data sets for 356 RBPs. We describe the spectrum of RBP binding throughout the transcriptome and the connections between these interactions and various aspects of RNA biology, including RNA stability, splicing regulation and RNA localization. These data expand the catalogue of functional elements encoded in the human genome by the addition of a large set of elements that function at the RNA level by interacting with RBPs.

...read moreread less

542 citations

Journal Article•DOI•

SARS-coronavirus-2 replication in Vero E6 cells: replication kinetics, rapid adaptation and cytopathology.

[...]

Natacha S. Ogando¹, Tim J. Dalebout¹, Jessika C. Zevenhoven-Dobbe¹, Ronald W. A. L. Limpens¹, Yvonne van der Meer¹, Leon Caly², Julian Druce², Jutte J.C. de Vries¹, Marjolein Kikkert¹, Montserrat Bárcena¹, Igor A. Sidorov¹, Eric J. Snijder¹ - Show less +8 more•Institutions (2)

Leiden University Medical Center¹, Royal Melbourne Hospital²

01 Sep 2020-Journal of General Virology

TL;DR: The sensitivity of the two viruses to three established inhibitors of coronavirus replication is very similar, but that SARS-CoV-2 infection was substantially more sensitive to pre-treatment of cells with pegylated interferon alpha.

...read moreread less

Abstract: The sudden emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) at the end of 2019 from the Chinese province of Hubei and its subsequent pandemic spread highlight the importance of understanding the full molecular details of coronavirus infection and pathogenesis Here, we compared a variety of replication features of SARS-CoV-2 and SARS-CoV and analysed the cytopathology caused by the two closely related viruses in the commonly used Vero E6 cell line Compared to SARS-CoV, SARS-CoV-2 generated higher levels of intracellular viral RNA, but strikingly about 50-fold less infectious viral progeny was recovered from the culture medium Immunofluorescence microscopy of SARS-CoV-2-infected cells established extensive cross-reactivity of antisera previously raised against a variety of non-structural proteins, membrane and nucleocapsid protein of SARS-CoV Electron microscopy revealed that the ultrastructural changes induced by the two SARS viruses are very similar and occur within comparable time frames after infection Furthermore, we determined that the sensitivity of the two viruses to three established inhibitors of coronavirus replication (remdesivir, alisporivir and chloroquine) is very similar, but that SARS-CoV-2 infection was substantially more sensitive to pre-treatment of cells with pegylated interferon alpha An important difference between the two viruses is the fact that - upon passaging in Vero E6 cells - SARS-CoV-2 apparently is under strong selection pressure to acquire adaptive mutations in its spike protein gene These mutations change or delete a putative furin-like cleavage site in the region connecting the S1 and S2 domains and result in a very prominent phenotypic change in plaque assays

...read moreread less

445 citations

Journal Article•DOI•

Somatic Mutations in UBA1 and Severe Adult-Onset Autoinflammatory Disease.

[...]

David B. Beck¹, Marcela A. Ferrada¹, Keith A. Sikora¹, Amanda K. Ombrello¹, Jason C. Collins¹, Wuhong Pei¹, Nicholas Balanda¹, Daron L. Ross¹, Daniela Ospina Cardona¹, Zhijie Wu¹, Bhavisha A Patel¹, Kalpana Manthiram¹, Emma M. Groarke¹, Fernanda Gutierrez-Rodrigues¹, Patrycja Hoffmann¹, Sofia Rosenzweig¹, Shuichiro Nakabo¹, Laura W. Dillon¹, Christopher S. Hourigan¹, Wanxia L. Tsai¹, Sarthak Gupta¹, Carmelo Carmona-Rivera¹, Anthony J. Asmar¹, Lisha Xu¹, Hirotsugu Oda¹, Wendy Goodspeed¹, Karyl S. Barron¹, Michele Nehrebecky¹, Anne Jones¹, Ryan S. Laird¹, Natalie Deuitch¹, Dorota Rowczenio², Emily Rominger¹, Kristina V. Wells¹, Chyi-Chia Richard Lee¹, Weixin Wang¹, Megan Trick¹, James C. Mullikin¹, Gustaf Wigerblad¹, Stephen R. Brooks¹, Stefania Dell'Orso¹, Zuoming Deng¹, Jae Jin Chae¹, Alina Dulau-Florea¹, May Christine V. Malicdan¹, Danica Novacic¹, Robert A. Colbert¹, Mariana J. Kaplan¹, Massimo Gadina¹, Sinisa Savic³, Helen J. Lachmann², Mones Abu-Asab¹, Benjamin D. Solomon¹, Kyle Retterer⁴, William A. Gahl¹, Shawn M. Burgess¹, Ivona Aksentijevich¹, Neal S. Young¹, Katherine R. Calvo¹, Achim Werner¹, Daniel L. Kastner¹, Peter C. Grayson¹ - Show less +58 more•Institutions (4)

National Institutes of Health¹, University College London², University of Leeds³, GeneDx⁴

27 Oct 2020-The New England Journal of Medicine

TL;DR: Using a genotype-driven approach, this disorder is identified that connects seemingly unrelated adult-onset inflammatory syndromes and is named the VEXAS (vacuoles, E1 enzyme, X-linked, autoinflammatory, somatic) syndrome.

...read moreread less

Abstract: Background Adult-onset inflammatory syndromes often manifest with overlapping clinical features. Variants in ubiquitin-related genes, previously implicated in autoinflammatory disease, may...

...read moreread less

Posted Content•

The ITensor Software Library for Tensor Network Calculations

[...]

Matthew Fishman, Steven R. White, E. Miles Stoudenmire

25 Jul 2020-arXiv: Mathematical Software

TL;DR: The philosophy behind ITensor, a system for programming tensor network calculations with an interface modeled on tensor diagram notation, and examples of each part of the interface including Index objects, the ITensor product operator, Tensor factorizations, tensor storage types, algorithms for matrix product state (MPS) and matrix product operator (MPO) tensor networks, and the NDTensors library are discussed.

...read moreread less

Abstract: ITensor is a system for programming tensor network calculations with an interface modeled on tensor diagram notation, which allows users to focus on the connectivity of a tensor network without manually bookkeeping tensor indices. The ITensor interface rules out common programming errors and enables rapid prototyping of tensor network algorithms. After discussing the philosophy behind the ITensor approach, we show examples of each part of the interface including Index objects, the ITensor product operator, tensor factorizations, tensor storage types, algorithms for matrix product state (MPS) and matrix product operator (MPO) tensor networks, quantum number conserving block-sparse tensors, and the NDTensors library. We also review publications that have used ITensor for quantum many-body physics and for other areas where tensor networks are increasingly applied. To conclude we discuss promising features and optimizations to be added in the future.

...read moreread less

Journal Article•DOI•

Global Organization and Proposed Megataxonomy of the Virus World

[...]

Eugene V. Koonin¹, Valerian V. Dolja², Mart Krupovic³, Arvind Varsani⁴, Arvind Varsani⁵, Yuri I. Wolf¹, Natalya Yutin¹, F. Murilo Zerbini⁶, Jens H. Kuhn¹ - Show less +5 more•Institutions (6)

National Institutes of Health¹, Oregon State University², Pasteur Institute³, Arizona State University⁴, University of Cape Town⁵, Universidade Federal de Viçosa⁶

20 May 2020-Microbiology and Molecular Biology Reviews

TL;DR: Phylogenetic analyses of virus hallmark genes combined with analyses of gene-sharing networks show that replication modules of five BCs evolved from a common ancestor that encoded an RNA-directed RNA polymerase or a reverse transcriptase, and propose a comprehensive hierarchical taxonomy of viruses.

...read moreread less

Abstract: Viruses and mobile genetic elements are molecular parasites or symbionts that coevolve with nearly all forms of cellular life. The route of virus replication and protein expression is determined by the viral genome type. Comparison of these routes led to the classification of viruses into seven "Baltimore classes" (BCs) that define the major features of virus reproduction. However, recent phylogenomic studies identified multiple evolutionary connections among viruses within each of the BCs as well as between different classes. Due to the modular organization of virus genomes, these relationships defy simple representation as lines of descent but rather form complex networks. Phylogenetic analyses of virus hallmark genes combined with analyses of gene-sharing networks show that replication modules of five BCs (three classes of RNA viruses and two classes of reverse-transcribing viruses) evolved from a common ancestor that encoded an RNA-directed RNA polymerase or a reverse transcriptase. Bona fide viruses evolved from this ancestor on multiple, independent occasions via the recruitment of distinct cellular proteins as capsid subunits and other structural components of virions. The single-stranded DNA (ssDNA) viruses are a polyphyletic class, with different groups evolving by recombination between rolling-circle-replicating plasmids, which contributed the replication protein, and positive-sense RNA viruses, which contributed the capsid protein. The double-stranded DNA (dsDNA) viruses are distributed among several large monophyletic groups and arose via the combination of distinct structural modules with equally diverse replication modules. Phylogenomic analyses reveal the finer structure of evolutionary connections among RNA viruses and reverse-transcribing viruses, ssDNA viruses, and large subsets of dsDNA viruses. Taken together, these analyses allow us to outline the global organization of the virus world. Here, we describe the key aspects of this organization and propose a comprehensive hierarchical taxonomy of viruses.

...read moreread less

Journal Article•DOI•

Roles and mechanisms of alternative splicing in cancer - implications for care.

[...]

Sophie Bonnal¹, Irene López-Oreja², Irene López-Oreja¹, Juan Valcárcel³, Juan Valcárcel¹ - Show less +1 more•Institutions (3)

Pompeu Fabra University¹, University of Barcelona², Catalan Institution for Research and Advanced Studies³

17 Apr 2020-Nature Reviews Clinical Oncology

TL;DR: Antisense oligonucleotides offer promise to modulate cancer-relevant alternative splicing decisions, with proof of concept for this type of therapy demonstrated by Nusinersen, a first-in-class treatment for patients with spinal muscular atrophy.

...read moreread less

Abstract: Removal of introns from messenger RNA precursors (pre-mRNA splicing) is an essential step for the expression of most eukaryotic genes. Alternative splicing enables the regulated generation of multiple mRNA and protein products from a single gene. Cancer cells have general as well as cancer type-specific and subtype-specific alterations in the splicing process that can have prognostic value and contribute to every hallmark of cancer progression, including cancer immune responses. These splicing alterations are often linked to the occurrence of cancer driver mutations in genes encoding either core components or regulators of the splicing machinery. Of therapeutic relevance, the transcriptomic landscape of cancer cells makes them particularly vulnerable to pharmacological inhibition of splicing. Small-molecule splicing modulators are currently in clinical trials and, in addition to splice site-switching antisense oligonucleotides, offer the promise of novel and personalized approaches to cancer treatment.

...read moreread less

Journal Article•DOI•

SARS-CoV-2 ORF3b Is a Potent Interferon Antagonist Whose Activity Is Increased by a Naturally Occurring Elongation Variant

[...]

Yoriyuki Konno¹, Izumi Kimura¹, Keiya Uriu¹, Masaya Fukushi², Takashi Irie², Yoshio Koyanagi³, Daniel Sauter⁴, Robert J. Gifford⁵, So Nakagawa⁶, Kei Sato¹ - Show less +6 more•Institutions (6)

University of Tokyo¹, Hiroshima University², Kyoto University³, University of Ulm⁴, University of Glasgow⁵, Tokai University⁶

22 Sep 2020-Cell Reports

TL;DR: These findings not only help to explain the poor interferon response in COVID-19 patients, but also describe the emergence of natural SARS-CoV-2 quasispecies with an extended ORF3b gene that may potentially affect CO VID-19 pathogenesis.

...read moreread less

Posted Content•DOI•

Mapping genomic loci prioritises genes and implicates synaptic biology in schizophrenia

[...]

Stephan Ripke¹, James T.R. Walters², Michael Conlon O'Donovan²•Institutions (2)

Charité¹, Cardiff University²

13 Sep 2020-medRxiv

TL;DR: This work identifies biological processes of pathophysiological relevance to schizophrenia, shows convergence of common and rare variant associations in schizophrenia and neurodevelopmental disorders, and provides a rich resource of priority genes and variants to advance mechanistic studies.

...read moreread less

Abstract: Schizophrenia is a psychiatric disorder whose pathophysiology is largely unknown. It has a heritability of 60-80%, much of which is attributable to common risk alleles, suggesting genome-wide association studies can inform our understanding of aetiology. Here, in 69,369 people with schizophrenia and 236,642 controls, we report common variant associations at 270 distinct loci. Using fine-mapping and functional genomic data, we prioritise 19 genes based on protein-coding or UTR variation, and 130 genes in total as likely to explain these associations. Fine-mapped candidates were enriched for genes associated with rare disruptive coding variants in people with schizophrenia, including the glutamate receptor subunit GRIN2A and transcription factor SP4, and were also enriched for genes implicated by such variants in autism and developmental disorder. Associations were concentrated in genes expressed in CNS neurons, both excitatory and inhibitory, but not other tissues or cell types, and implicated fundamental processes related to neuronal function, particularly synaptic organisation, differentiation and transmission. We identify biological processes of pathophysiological relevance to schizophrenia, show convergence of common and rare variant associations in schizophrenia and neurodevelopmental disorders, and provide a rich resource of priority genes and variants to advance mechanistic studies.

...read moreread less

Journal Article•DOI•

Single-cell RNA counting at allele and isoform resolution using Smart-seq3.

[...]

Michael Hagemann-Jensen¹, Christoph Ziegenhain¹, Ping Chen¹, Daniel Ramsköld¹, Gert-Jan Hendriks¹, Anton J. M. Larsson¹, Omid R. Faridani², Omid R. Faridani³, Omid R. Faridani¹, Rickard Sandberg¹ - Show less +6 more•Institutions (3)

Karolinska Institutet¹, University of New South Wales², Garvan Institute of Medical Research³

04 May 2020-Nature Biotechnology

TL;DR: Smart-seq3 is introduced, which combines full-length transcriptome coverage with a 5′ unique molecular identifier RNA counting strategy that enables in silico reconstruction of thousands of RNA molecules per cell.

...read moreread less

Abstract: Large-scale sequencing of RNA from individual cells can reveal patterns of gene, isoform and allelic expression across cell types and states1. However, current short-read single-cell RNA-sequencing methods have limited ability to count RNAs at allele and isoform resolution, and long-read sequencing techniques lack the depth required for large-scale applications across cells2,3. Here we introduce Smart-seq3, which combines full-length transcriptome coverage with a 5' unique molecular identifier RNA counting strategy that enables in silico reconstruction of thousands of RNA molecules per cell. Of the counted and reconstructed molecules, 60% could be directly assigned to allelic origin and 30-50% to specific isoforms, and we identified substantial differences in isoform usage in different mouse strains and human cell types. Smart-seq3 greatly increased sensitivity compared to Smart-seq2, typically detecting thousands more transcripts per cell. We expect that Smart-seq3 will enable large-scale characterization of cell types and states across tissues and organisms.

...read moreread less

Journal Article•DOI•

Producing polished prokaryotic pangenomes with the Panaroo pipeline

[...]

Gerry Tonkin-Hill¹, Gerry Tonkin-Hill², Neil MacAlasdair³, Neil MacAlasdair¹, Christopher Ruis³, Christopher Ruis⁴, Aaron Weimann, Gal Horesh¹, John A. Lees⁵, Rebecca A. Gladstone², Stephanie W. Lo¹, Christopher A. Beaudoin³, R. Andres Floto⁶, R. Andres Floto³, Simon D. W. Frost⁷, Simon D. W. Frost⁸, Jukka Corander⁹, Jukka Corander², Jukka Corander¹, Stephen D. Bentley¹, Julian Parkhill³ - Show less +17 more•Institutions (9)

Wellcome Trust Sanger Institute¹, University of Oslo², University of Cambridge³, Laboratory of Molecular Biology⁴, Imperial College London⁵, Papworth Hospital⁶, Microsoft⁷, University of London⁸, Helsinki Institute for Information Technology⁹

22 Jul 2020-Genome Biology

TL;DR: Panaroo is introduced, a graph-based pangenome clustering tool that is able to account for many of the sources of error introduced during the annotation of prokaryotic genome assemblies.

...read moreread less

Abstract: Population-level comparisons of prokaryotic genomes must take into account the substantial differences in gene content resulting from horizontal gene transfer, gene duplication and gene loss. However, the automated annotation of prokaryotic genomes is imperfect, and errors due to fragmented assemblies, contamination, diverse gene families and mis-assemblies accumulate over the population, leading to profound consequences when analysing the set of all genes found in a species. Here, we introduce Panaroo, a graph-based pangenome clustering tool that is able to account for many of the sources of error introduced during the annotation of prokaryotic genome assemblies. Panaroo is available at https://github.com/gtonkinhill/panaroo .

...read moreread less

Journal Article•DOI•

The impact of sex on gene expression across human tissues

[...]

Meritxell Oliva¹, Manuel Muñoz-Aguirre², Sarah Kim-Hellmuth³, Sarah Kim-Hellmuth⁴, Valentin Wucher, Ariel D. H. Gewirtz⁵, Daniel J. Cotter⁶, Princy Parsana⁷, Silva Kasela⁴, Brunilda Balliu⁸, Ana Viñuela⁹, Stephane E. Castel⁴, Pejman Mohammadi¹⁰, François Aguet¹¹, Yuxin Zou¹, Ekaterina A. Khramtsova¹, Ekaterina A. Khramtsova¹², Andrew D. Skol, Diego Garrido-Martín, Ferran Reverter¹³, Andrew A. Brown¹⁴, Patrick Evans¹⁵, Eric R. Gamazon¹⁵, Eric R. Gamazon¹⁶, Anthony Payne¹⁷, Rodrigo Bonazzola¹, Alvaro N. Barbeira¹, Andrew R Hamel¹¹, Andrew R Hamel¹⁸, Angel Martinez-Perez, José Manuel Soria, Brandon L. Pierce¹, Matthew Stephens¹, Eleazar Eskin⁸, Emmanouil T. Dermitzakis⁹, Ayellet V. Segrè¹⁸, Ayellet V. Segrè¹¹, Hae Kyung Im¹, Barbara E. Engelhardt⁵, Kristin G. Ardlie¹¹, Stephen B. Montgomery⁶, Alexis Battle⁷, Tuuli Lappalainen⁴, Roderic Guigó¹⁹, Barbara E. Stranger - Show less +41 more•Institutions (19)

University of Chicago¹, Polytechnic University of Catalonia², Max Planck Society³, Columbia University⁴, Princeton University⁵, Stanford University⁶, Johns Hopkins University⁷, University of California, Los Angeles⁸, University of Geneva⁹, Scripps Research Institute¹⁰, Broad Institute¹¹, Janssen Pharmaceutica¹², University of Barcelona¹³, University of Dundee¹⁴, Vanderbilt University Medical Center¹⁵, University of Cambridge¹⁶, University of Oxford¹⁷, Massachusetts Eye and Ear Infirmary¹⁸, Pompeu Fabra University¹⁹

11 Sep 2020-Science

TL;DR: A catalog of sex differences in gene expression and its genetic regulation across 44 human tissue sources surveyed by the GTEx project (v8 data release), analyzing 16,245 RNA-sequencing samples and genotypes of 838 adult individuals is generated.

...read moreread less

Abstract: Many complex human phenotypes exhibit sex-differentiated characteristics. However, the molecular mechanisms underlying these differences remain largely unknown. We generated a catalog of sex differences in gene expression and in the genetic regulation of gene expression across 44 human tissue sources surveyed by the Genotype-Tissue Expression project (GTEx, v8 release). We demonstrate that sex influences gene expression levels and cellular composition of tissue samples across the human body. A total of 37% of all genes exhibit sex-biased expression in at least one tissue. We identify cis expression quantitative trait loci (eQTLs) with sex-differentiated effects and characterize their cellular origin. By integrating sex-biased eQTLs with genome-wide association study data, we identify 58 gene-trait associations that are driven by genetic regulation of gene expression in a single sex. These findings provide an extensive characterization of sex differences in the human transcriptome and its genetic regulation.

...read moreread less

Journal Article•DOI•

Single cell RNA sequencing of human microglia uncovers a subset associated with Alzheimer’s disease

[...]

Marta Olah, Vilas Menon, Naomi Habib¹, Naomi Habib², Mariko Taga, Yiyi Ma, Christina J. Yung³, Maria Cimpean³, Anthony Khairallah³, Guillermo Coronas-Samano³, Roman Sankowski⁴, Dominic Grün⁵, Alexandra Kroshilina³, Danielle Dionne², Rani A. Sarkis⁶, Garth Rees Cosgrove⁶, Jeffrey Helgager⁶, Jeffrey A. Golden⁶, Page B. Pennell⁶, Marco Prinz⁴, Jean Paul Vonsattel³, Andrew F. Teich³, Julie A. Schneider⁷, David A. Bennett⁷, Aviv Regev, Wassim Elyaman², Wassim Elyaman³, Elizabeth M. Bradshaw², Elizabeth M. Bradshaw³, Philip L. De Jager - Show less +26 more•Institutions (7)

Hebrew University of Jerusalem¹, Broad Institute², Columbia University Medical Center³, University of Freiburg⁴, Max Planck Society⁵, Brigham and Women's Hospital⁶, Rush University Medical Center⁷

30 Nov 2020-Nature Communications

TL;DR: The population structure of live microglia purified from human cerebral cortex samples obtained at autopsy and during neurosurgical procedures is investigated, and it is found that some subsets are enriched for disease-related genes and RNA signatures.

...read moreread less

Abstract: The extent of microglial heterogeneity in humans remains a central yet poorly explored question in light of the development of therapies targeting this cell type. Here, we investigate the population structure of live microglia purified from human cerebral cortex samples obtained at autopsy and during neurosurgical procedures. Using single cell RNA sequencing, we find that some subsets are enriched for disease-related genes and RNA signatures. We confirm the presence of four of these microglial subpopulations histologically and illustrate the utility of our data by characterizing further microglial cluster 7, enriched for genes depleted in the cortex of individuals with Alzheimer's disease (AD). Histologically, these cluster 7 microglia are reduced in frequency in AD tissue, and we validate this observation in an independent set of single nucleus data. Thus, our live human microglia identify a range of subtypes, and we prioritize one of these as being altered in AD.

...read moreread less

Journal Article•DOI•

From GWAS to Function: Using Functional Genomics to Identify the Mechanisms Underlying Complex Diseases.

[...]

Eddie Cano-Gamez¹, Gosia Trynka¹•Institutions (1)

Wellcome Trust Sanger Institute¹

13 May 2020-Frontiers in Genetics

TL;DR: A review of how challenges of integrating GWAS results with single-cell sequencing read-outs, designing functionally informed polygenic risk scores (PRS), and validating disease associated genes using genetic engineering have been addressed over the last decade are summarized.

...read moreread less

Abstract: Genome-wide association studies (GWAS) have successfully mapped thousands of loci associated with complex traits. These associations could reveal the molecular mechanisms altered in common complex diseases and result in the identification of novel drug targets. However, GWAS have also left a number of outstanding questions. In particular, the majority of disease-associated loci lie in non-coding regions of the genome and, even though they are thought to play a role in gene expression regulation, it is unclear which genes they regulate and in which cell types or physiological contexts this regulation occurs. This has hindered the translation of GWAS findings into clinical interventions. In this review we summarize how these challenges have been addressed over the last decade, with a particular focus on the integration of GWAS results with functional genomics datasets. Firstly, we investigate how the tissues and cell types involved in diseases can be identified using methods that test for enrichment of GWAS variants in genomic annotations. Secondly, we explore how to find the genes regulated by GWAS loci using methods that test for colocalization of GWAS signals with molecular phenotypes such as quantitative trait loci (QTLs). Finally, we highlight potential future research avenues such as integrating GWAS results with single-cell sequencing read-outs, designing functionally informed polygenic risk scores (PRS), and validating disease associated genes using genetic engineering. These tools will be crucial to identify new drug targets for common complex diseases.

...read moreread less

Journal Article•DOI•

Genomic basis for RNA alterations in cancer

[...]

Claudia Calabrese¹, Natalie R. Davidson, Deniz Demircioğlu², Deniz Demircioğlu³, Nuno A. Fonseca¹, Yao He⁴, André Kahles, Kjong-Van Lehmann, Fenglin Liu⁴, Yuichi Shiraishi⁵, Cameron M. Soulette⁶, Lara Urban¹, Liliana Greger¹, Siliang Li, Dongbing Liu, Marc D. Perry⁷, Marc D. Perry⁸, Qian Xiang⁸, Fan Zhang⁴, Junjun Zhang⁸, Peter Bailey⁹, Serap Erkek, Katherine A. Hoadley¹⁰, Yong Hou, Matthew R. Huska¹¹, Helena Kilpinen¹², Jan O. Korbel, Maximillian G. Marin⁶, Julia Markowski¹¹, Tannistha Nandi², Qiang Pan-Hammarström¹³, Chandra Sekhar Pedamallu¹⁴, Chandra Sekhar Pedamallu¹⁵, Reiner Siebert¹⁶, Stefan G. Stark, Hong Su, Patrick Tan², Patrick Tan³, Sebastian M. Waszak, Christina K. Yung⁸, Shida Zhu, Philip Awadalla⁸, Philip Awadalla¹⁷, Chad J. Creighton¹⁸, Matthew Meyerson¹⁴, Matthew Meyerson¹⁵, B. F. Francis Ouellette¹⁷, Kui Wu, Huanming Yang, Alvis Brazma¹, Angela N. Brooks¹⁵, Angela N. Brooks⁶, Angela N. Brooks¹⁴, Jonathan Göke², Gunnar Rätsch, Roland F. Schwarz, Oliver Stegle¹, Oliver Stegle¹⁹, Zemin Zhang⁴ - Show less +55 more•Institutions (19)

European Bioinformatics Institute¹, Genome Institute of Singapore², National University of Singapore³, Peking University⁴, University of Tokyo⁵, University of California, Santa Cruz⁶, University of California, San Francisco⁷, Ontario Institute for Cancer Research⁸, University of Glasgow⁹, University of North Carolina at Chapel Hill¹⁰, Max Delbrück Center for Molecular Medicine¹¹, University College London¹², Karolinska Institutet¹³, Harvard University¹⁴, Broad Institute¹⁵, University of Ulm¹⁶, University of Toronto¹⁷, Baylor College of Medicine¹⁸, German Cancer Research Center¹⁹

06 Feb 2020-Nature

TL;DR: The most comprehensive catalogue of cancer-associated gene alterations to date, obtained by characterizing tumour transcriptomes from 1,188 donors of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Gome Atlas (TCGA) was presented in this article.

...read moreread less

Abstract: Transcript alterations often result from somatic changes in cancer genomes1. Various forms of RNA alterations have been described in cancer, including overexpression2, altered splicing3 and gene fusions4; however, it is difficult to attribute these to underlying genomic changes owing to heterogeneity among patients and tumour types, and the relatively small cohorts of patients for whom samples have been analysed by both transcriptome and whole-genome sequencing. Here we present, to our knowledge, the most comprehensive catalogue of cancer-associated gene alterations to date, obtained by characterizing tumour transcriptomes from 1,188 donors of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA)5. Using matched whole-genome sequencing data, we associated several categories of RNA alterations with germline and somatic DNA alterations, and identified probable genetic mechanisms. Somatic copy-number alterations were the major drivers of variations in total gene and allele-specific expression. We identified 649 associations of somatic single-nucleotide variants with gene expression in cis, of which 68.4% involved associations with flanking non-coding regions of the gene. We found 1,900 splicing alterations associated with somatic mutations, including the formation of exons within introns in proximity to Alu elements. In addition, 82% of gene fusions were associated with structural variants, including 75 of a new class, termed 'bridged' fusions, in which a third genomic location bridges two genes. We observed transcriptomic alteration signatures that differ between cancer types and have associations with variations in DNA mutational signatures. This compendium of RNA alterations in the genomic context provides a rich resource for identifying genes and mechanisms that are functionally implicated in cancer.

...read moreread less

Journal Article•DOI•

IDH mutation in glioma: molecular mechanisms and potential therapeutic targets.

[...]

Sue Han, Yang Liu, Sabrina J. Cai, Mingyu Qian, Jianyi Ding, Mioara Larion, Mark R. Gilbert, Chunzhang Yang - Show less +4 more

15 Apr 2020-British Journal of Cancer

TL;DR: An overview of the latest findings in IDH-mutated human malignancies is provided, with a focus on glioma, discussing unique biological signatures and proceedings in translational research.

...read moreread less

Abstract: Isocitrate dehydrogenase (IDH) enzymes catalyse the oxidative decarboxylation of isocitrate and therefore play key roles in the Krebs cycle and cellular homoeostasis. Major advances in cancer genetics over the past decade have revealed that the genes encoding IDHs are frequently mutated in a variety of human malignancies, including gliomas, acute myeloid leukaemia, cholangiocarcinoma, chondrosarcoma and thyroid carcinoma. A series of seminal studies further elucidated the biological impact of the IDH mutation and uncovered the potential role of IDH mutants in oncogenesis. Notably, the neomorphic activity of the IDH mutants establishes distinctive patterns in cancer metabolism, epigenetic shift and therapy resistance. Novel molecular targeting approaches have been developed to improve the efficacy of therapeutics against IDH-mutated cancers. Here we provide an overview of the latest findings in IDH-mutated human malignancies, with a focus on glioma, discussing unique biological signatures and proceedings in translational research.

...read moreread less

Journal Article•DOI•

Directed evolution of adenine base editors with increased activity and therapeutic application.

[...]

Nicole M. Gaudelli, Dieter K. Lam, Holly A. Rees, Noris M. Solá-Esteves, Luis A. Barrera, David A. Born, Aaron Edwards, Jason Michael Gehrke, Seung-Joo Lee, Alexander Liquori, Ryan Murray, Michael S. Packer, Conrad Rinaldi, Ian Slaymaker, Jonathan Yen¹, Lauren Young, Giuseppe Ciaramella - Show less +13 more•Institutions (1)

St. Jude Children's Research Hospital¹

13 Apr 2020-Nature Biotechnology

TL;DR: In primary human T cells, ABE8s achieve 98–99% target modification, which is maintained when multiplexed across three loci, and in human CD34 + cells, Abe8 can recreate a natural allele at the promoter of the γ-globin genes HBG1 and HBG2 with up to 60% efficiency.

...read moreread less

Abstract: The foundational adenine base editors (for example, ABE7.10) enable programmable A•T to G•C point mutations but editing efficiencies can be low at challenging loci in primary human cells. Here we further evolve ABE7.10 using a library of adenosine deaminase variants to create ABE8s. At NGG protospacer adjacent motif (PAM) sites, ABE8s result in ~1.5× higher editing at protospacer positions A5-A7 and ~3.2× higher editing at positions A3-A4 and A8-A10 compared with ABE7.10. Non-NGG PAM variants have a ~4.2-fold overall higher on-target editing efficiency than ABE7.10. In human CD34+ cells, ABE8 can recreate a natural allele at the promoter of the γ-globin genes HBG1 and HBG2 with up to 60% efficiency, causing persistence of fetal hemoglobin. In primary human T cells, ABE8s achieve 98-99% target modification, which is maintained when multiplexed across three loci. Delivered as messenger RNA, ABE8s induce no significant levels of single guide RNA (sgRNA)-independent off-target adenine deamination in genomic DNA and very low levels of adenine deamination in cellular mRNA.

...read moreread less

Journal Article•DOI•

A Genetic Map of the Response to DNA Damage in Human Cells

[...]

Michele Olivieri¹, Michele Olivieri², Tiffany Cho¹, Tiffany Cho², Alejandro Álvarez-Quilón², Kejiao Li³, Matthew J. Schellenberg⁴, Michal Zimmermann², Nicole Hustedt², Silvia Emma Rossi², Salomé Adam², Henrique Melo², Anne Margriet Heijink², Guillermo Sastre-Moreno², Nathalie Moatti², Rachel K. Szilard², Andrea McEwan², Alexanda K. Ling¹, Almudena Serrano-Benitez⁵, Tajinder Ubhi¹, Sumin Feng², Judy Pawling², Irene Delgado-Sainz⁵, Michael W. Ferguson¹, James W. Dennis¹, James W. Dennis², Grant W. Brown¹, Felipe Cortés-Ledesma⁵, R. Scott Williams⁴, Alberto Martin¹, Dongyi Xu³, Daniel Durocher¹, Daniel Durocher² - Show less +29 more•Institutions (5)

University of Toronto¹, Lunenfeld-Tanenbaum Research Institute², Peking University³, National Institutes of Health⁴, Spanish National Research Council⁵

23 Jul 2020-Cell

TL;DR: A map of the DNA damage response provides a rich resource to study this fundamental cellular system and has implications for the development and use of genotoxic agents in cancer therapy.

...read moreread less

Journal Article•DOI•

Pan-cancer characterization of immune-related lncRNAs identifies potential oncogenic biomarkers.

[...]

Yongsheng Li¹, Yongsheng Li², Tiantongfei Jiang², Weiwei Zhou², Junyi Li², Xinhui Li², Qi Wang², Xiaoyan Jin², Jiaqi Yin², Liuxin Chen², Yunpeng Zhang², Juan Xu¹, Juan Xu², Xia Li², Xia Li¹ - Show less +11 more•Institutions (2)

Hainan Medical University¹, Harbin Medical University²

21 Feb 2020-Nature Communications

TL;DR: An integrated algorithm, ImmLnc, is introduced that can help prioritise immune-related lncRNAs in cancer immunotherapy research and serve as a valuable resource for understanding lncRNA function and to advance identification of immunotherapy targets.

...read moreread less

Abstract: Long noncoding RNAs (lncRNAs) are emerging as critical regulators of gene expression and they play fundamental roles in immune regulation. Here we introduce an integrated algorithm, ImmLnc, for identifying lncRNA regulators of immune-related pathways. We comprehensively chart the landscape of lncRNA regulation in the immunome across 33 cancer types and show that cancers with similar tissue origin are likely to share lncRNA immune regulators. Moreover, the immune-related lncRNAs are likely to show expression perturbation in cancer and are significantly correlated with immune cell infiltration. ImmLnc can help prioritize cancer-related lncRNAs and further identify three molecular subtypes (proliferative, intermediate, and immunological) of non-small cell lung cancer. These subtypes are characterized by differences in mutation burden, immune cell infiltration, expression of immunomodulatory genes, response to chemotherapy, and prognosis. In summary, the ImmLnc pipeline and the resulting data serve as a valuable resource for understanding lncRNA function and to advance identification of immunotherapy targets. In cancer, long noncoding RNAs (lncRNAs) can regulate immune-related pathways. Here, the authors present ImmLnc, an algorithm that can help prioritise immune-related lncRNAs in cancer immunotherapy research

...read moreread less

Journal Article•DOI•

Comprehensive molecular characterization of mitochondrial genomes in human cancers

[...]

Yuan Yuan¹, Young Seok Ju², Young-Wook Kim³, Jun Li¹, Yumeng Wang¹, Yumeng Wang⁴, Christopher J. Yoon⁵, Yang Yang⁶, Inigo Martincorena², Chad J. Creighton⁴, John N. Weinstein¹, Yanxun Xu⁷, Leng Han⁶, Hyung Lae Kim⁸, Hidewaki Nakagawa, Keunchil Park⁹, Peter J. Campbell², Peter J. Campbell¹⁰, Han Liang⁴, Han Liang¹ - Show less +16 more•Institutions (10)

University of Texas MD Anderson Cancer Center¹, Wellcome Trust Sanger Institute², Sungkyunkwan University³, Baylor College of Medicine⁴, KAIST⁵, University of Texas Health Science Center at Houston⁶, Johns Hopkins University⁷, Ewha Womans University⁸, Samsung Medical Center⁹, Cambridge University Hospitals NHS Foundation Trust¹⁰

05 Feb 2020-Nature Genetics

TL;DR: This analysis presents the most definitive mutational landscape of mitochondrial genomes and identifies several hypermutated cases, frequent somatic nuclear transfer of mt DNA and high variability of mtDNA copy number in many cancers.

...read moreread less

Abstract: Mitochondria are essential cellular organelles that play critical roles in cancer. Here, as part of the International Cancer Genome Consortium/The Cancer Genome Atlas Pan-Cancer Analysis of Whole Genomes Consortium, which aggregated whole-genome sequencing data from 2,658 cancers across 38 tumor types, we performed a multidimensional, integrated characterization of mitochondrial genomes and related RNA sequencing data. Our analysis presents the most definitive mutational landscape of mitochondrial genomes and identifies several hypermutated cases. Truncating mutations are markedly enriched in kidney, colorectal and thyroid cancers, suggesting oncogenic effects with the activation of signaling pathways. We find frequent somatic nuclear transfers of mitochondrial DNA, some of which disrupt therapeutic target genes. Mitochondrial copy number varies greatly within and across cancers and correlates with clinical variables. Co-expression analysis highlights the function of mitochondrial genes in oxidative phosphorylation, DNA repair and the cell cycle, and shows their connections with clinically actionable genes. Our study lays a foundation for translating mitochondrial biology into clinical applications.

...read moreread less

Journal Article•DOI•

m6A-dependent glycolysis enhances colorectal cancer progression.

[...]

Chaoqin Shen¹, Baoqin Xuan¹, Tingting Yan¹, Yanru Ma¹, Pingping Xu¹, Xianglong Tian¹, Xinyu Zhang¹, Yingying Cao¹, Dan Ma¹, Xiaoqiang Zhu¹, Youwei Zhang², Jing-Yuan Fang¹, Haoyan Chen¹, Jie Hong¹ - Show less +10 more•Institutions (2)

Shanghai Jiao Tong University¹, Xuzhou Medical College²

03 Apr 2020-Molecular Cancer

TL;DR: Targeting METTL3 and its pathway offer alternative rational therapeutic targets in CRC patients with high glucose metabolism, as well as exploring the molecular mechanism ofMETTL3 in CRC.

...read moreread less

Abstract: Epigenetic alterations are involved in various aspects of colorectal carcinogenesis. N6-methyladenosine (m6A) modifications of RNAs are emerging as a new layer of epigenetic regulation. As the most abundant chemical modification of eukaryotic mRNA, m6A is essential for the regulation of mRNA stability, splicing, and translation. Alterations of m6A regulatory genes play important roles in the pathogenesis of a variety of human diseases. However, whether this mRNA modification participates in the glucose metabolism of colorectal cancer (CRC) remains uncharacterized. Transcriptome-sequencing and liquid chromatography-tandem mass spectrometry (LC-MS) were performed to evaluate the correlation between m6A modifications and glucose metabolism in CRC. Mass spectrometric metabolomics analysis, in vitro and in vivo experiments were conducted to investigate the effects of METTL3 on CRC glycolysis and tumorigenesis. RNA MeRIP-sequencing, immunoprecipitation and RNA stability assay were used to explore the molecular mechanism of METTL3 in CRC. A strong correlation between METTL3 and 18F-FDG uptake was observed in CRC patients from Xuzhou Central Hospital. METTL3 induced-CRC tumorigenesis depends on cell glycolysis in multiple CRC models. Mechanistically, METTL3 directly interacted with the 5′/3’UTR regions of HK2, and the 3’UTR region of SLC2A1 (GLUT1), then further stabilized these two genes and activated the glycolysis pathway. M6A-mediated HK2 and SLC2A1 (GLUT1) stabilization relied on the m6A reader IGF2BP2 or IGF2BP2/3, respectively. METTL3 is a functional and clinical oncogene in CRC. METTL3 stabilizes HK2 and SLC2A1 (GLUT1) expression in CRC through an m6A-IGF2BP2/3- dependent mechanism. Targeting METTL3 and its pathway offer alternative rational therapeutic targets in CRC patients with high glucose metabolism.

...read moreread less

Journal Article•DOI•

Interferons and viruses induce a novel truncated ACE2 isoform and not the full-length SARS-CoV-2 receptor.

[...]

Olusegun O. Onabajo¹, A. Rouf Banday¹, Megan L. Stanifer², Wusheng Yan¹, Adeola Obajemu¹, Deanna M. Santer³, Oscar Florez-Vargas¹, Helen Piontkivska⁴, Joselin M. Vargas¹, Timothy J. Ring¹, Carmon Kee⁵, Carmon Kee², Patricio Doldan², Patricio Doldan⁵, D. Lorne Tyrrell³, Juan L. Mendoza⁶, Steeve Boulant⁵, Steeve Boulant², Ludmila Prokunina-Olsson¹ - Show less +15 more•Institutions (6)

National Institutes of Health¹, University Hospital Heidelberg², University of Alberta³, Kent State University⁴, German Cancer Research Center⁵, University of Chicago⁶

19 Oct 2020-Nature Genetics

TL;DR: The results suggest that the ISG-type induction of dACE2 in IFN-high conditions created by treatments, an inflammatory tumor microenvironment or viral co-infections is unlikely to increase the cellular entry of SARS-CoV-2 and promote infection.

...read moreread less

Abstract: Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which causes COVID-19, utilizes angiotensin-converting enzyme 2 (ACE2) for entry into target cells. ACE2 has been proposed as an interferon-stimulated gene (ISG). Thus, interferon-induced variability in ACE2 expression levels could be important for susceptibility to COVID-19 or its outcomes. Here, we report the discovery of a novel, transcriptionally independent truncated isoform of ACE2, which we designate as deltaACE2 (dACE2). We demonstrate that dACE2, but not ACE2, is an ISG. In The Cancer Genome Atlas, the expression of dACE2 was enriched in squamous tumors of the respiratory, gastrointestinal and urogenital tracts. In vitro, dACE2, which lacks 356 amino-terminal amino acids, was non-functional in binding the SARS-CoV-2 spike protein and as a carboxypeptidase. Our results suggest that the ISG-type induction of dACE2 in IFN-high conditions created by treatments, an inflammatory tumor microenvironment or viral co-infections is unlikely to increase the cellular entry of SARS-CoV-2 and promote infection.

...read moreread less

Journal Article•DOI•

Global reference mapping of human transcription factor footprints.

[...]

Jeff Vierstra, John Lazar¹, Richard Sandstrom, Jessica Halow, Kristen Lee, Daniel Bates, Morgan Diegel, Douglas Dunn, Fidencio Neri, Eric Haugen, Eric Rynes, Alex Reynolds, J. K. Nelson, Audra K. Johnson, Mark Frerker, Michael Buckley, Rajinder Kaul, Wouter Meuleman, John A. Stamatoyannopoulos¹ - Show less +15 more•Institutions (1)

University of Washington¹

29 Jul 2020-Nature

TL;DR: A high-density DNase I cleavage map from 243 human cell and tissue types provides a genome-wide, nucleotide-resolution map of human transcription factor footprints, and shows that the enrichment of genetic variants associated with diseases or phenotypic traits in regulatory regions is almost entirely attributable to variants within footprints.

...read moreread less

Abstract: Combinatorial binding of transcription factors to regulatory DNA underpins gene regulation in all organisms. Genetic variation in regulatory regions has been connected with diseases and diverse phenotypic traits1, but it remains challenging to distinguish variants that affect regulatory function2. Genomic DNase I footprinting enables the quantitative, nucleotide-resolution delineation of sites of transcription factor occupancy within native chromatin3–6. However, only a small fraction of such sites have been precisely resolved on the human genome sequence6. Here, to enable comprehensive mapping of transcription factor footprints, we produced high-density DNase I cleavage maps from 243 human cell and tissue types and states and integrated these data to delineate about 4.5 million compact genomic elements that encode transcription factor occupancy at nucleotide resolution. We map the fine-scale structure within about 1.6 million DNase I-hypersensitive sites and show that the overwhelming majority are populated by well-spaced sites of single transcription factor–DNA interaction. Cell-context-dependent cis-regulation is chiefly executed by wholesale modulation of accessibility at regulatory DNA rather than by differential transcription factor occupancy within accessible elements. We also show that the enrichment of genetic variants associated with diseases or phenotypic traits in regulatory regions1,7 is almost entirely attributable to variants within footprints, and that functional variants that affect transcription factor occupancy are nearly evenly partitioned between loss- and gain-of-function alleles. Unexpectedly, we find increased density of human genetic variation within transcription factor footprints, revealing an unappreciated driver of cis-regulatory evolution. Our results provide a framework for both global and nucleotide-precision analyses of gene regulatory mechanisms and functional genetic variation. A high-density DNase I cleavage map from 243 human cell and tissue types provides a genome-wide, nucleotide-resolution map of human transcription factor footprints.

...read moreread less

Collapse