Showing papers on "Gene" published in 2019

Open Access

Journal Article•DOI•

Asymmetric paralog evolution between the “cryptic” gene Bmp16 and its well-studied sister genes Bmp2 and Bmp4

Nathalie Feiner¹, Fumio Motone², Axel Meyer¹, Shigehiro Kuraku¹•Institutions (2)

University of Konstanz¹, Kwansei Gakuin University²

28 Feb 2019-Scientific Reports

TL;DR: The phylogenetic analysis complemented with synteny analyses suggests that Bmp2, -4 and -16 are remnants of a gene quartet that originated during the two rounds of whole-genome duplication (2R-WGD) early in vertebrate evolution.

Abstract: The vertebrate gene repertoire is characterized by “cryptic” genes whose identification has been hampered by their absence from the genomes of well-studied species. One example is the Bmp16 gene, a paralog of the developmental key genes Bmp2 and -4. We focus on the Bmp2/4/16 group of genes to study the evolutionary dynamics following gen(om)e duplications with special emphasis on the poorly studied Bmp16 gene. We reveal the presence of Bmp16 in chondrichthyans in addition to previously reported teleost fishes and reptiles. Using comprehensive, vertebrate-wide gene sampling, our phylogenetic analysis complemented with synteny analyses suggests that Bmp2, -4 and -16 are remnants of a gene quartet that originated during the two rounds of whole-genome duplication (2R-WGD) early in vertebrate evolution. We confirm that Bmp16 genes were lost independently in at least three lineages (mammals, archelosaurs and amphibians) and report that they have elevated rates of sequence evolution. This finding agrees with their more “flexible” deployment during development; while Bmp16 has limited embryonic expression domains in the cloudy catshark, it is broadly expressed in the green anole lizard. Our study illustrates the dynamics of gene family evolution by integrating insights from sequence diversification, gene repertoire changes, and shuffling of expression domains.

1,376 citations

Posted Content•DOI•

Variation across 141,456 human exomes and genomes reveals the spectrum of loss-of-function intolerance across human protein-coding genes

Konrad J. Karczewski¹,², Laurent C. Francioli¹,², Grace Tiao¹,², Beryl B. Cummings¹,², Jessica Alföldi¹,², Qingbo Wang²,¹, Ryan L. Collins²,¹, Kristen M. Laricchia¹,², Andrea Ganna³,¹,², Daniel P. Birnbaum¹, Laura D. Gauthier¹, Harrison Brand²,¹, Matthew Solomonson²,¹, Nicholas A. Watts¹,², Daniel R. Rhodes⁴, Moriel Singer-Berk¹, Eleanor G. Seaby²,¹, Jack A. Kosmicki¹,², Raymond K. Walters¹,², Katherine Tashman²,¹, Yossi Farjoun¹, Eric Banks¹, Timothy Poterba²,¹, Arcturus Wang²,¹, Cotton Seed²,¹, Nicola Whiffin¹,⁵, Jessica X. Chong⁶, Kaitlin E. Samocha⁷, Emma Pierce-Hoffman¹, Zachary Zappala¹,⁸, Anne H. O’Donnell-Luria¹,⁹,², Eric Vallabh Minikel¹, Ben Weisburd¹, Monkol Lek¹,¹⁰, James S. Ware⁵,¹, Christopher Vittal²,¹, Irina M. Armean¹¹,²,¹, Louis Bergelson¹, Kristian Cibulskis¹, Kristen M. Connolly¹, Miguel Covarrubias¹, Stacey Donnelly¹, Steven Ferriera¹, Stacey Gabriel¹, Jeff Gentry¹, Namrata Gupta¹, Thibault Jeandet¹, Diane Kaplan¹, Christopher Llanwarne¹, Ruchi Munshi¹, Sam Novod¹, Nikelle Petrillo¹, David Roazen¹, Valentin Ruano-Rubio¹, Andrea Saltzman¹, Molly Schleicher¹, Jose Soto¹, Kathleen Tibbetts¹, Charlotte Tolonen¹, Gordon Wade¹, Michael E. Talkowski²,¹, Benjamin M. Neale²,¹, Mark J. Daly¹, Daniel G. MacArthur²,¹•Institutions (11)

Broad Institute¹, Harvard University², University of Helsinki³, Queen Mary University of London⁴, National Institutes of Health⁵, University of Washington⁶, Wellcome Trust Sanger Institute⁷, Vertex Pharmaceuticals⁸, Boston Children's Hospital⁹, Yale University¹⁰, European Bioinformatics Institute¹¹

30 Jan 2019-bioRxiv

TL;DR: Using an improved model of human mutation, the authors classify human protein-coding genes along a spectrum representing intolerance to inactivation, validate this classification using data from model organisms and engineered human cells, and show that it can be used to improve gene discovery power for both common and rare diseases.

Abstract: Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences of gene disruption: genes critical for an organism’s function will be depleted for such variants in natural populations, while non-essential genes will tolerate their accumulation. However, predicted loss-of-function (pLoF) variants are enriched for annotation errors, and tend to be found at extremely low frequencies, so their analysis requires careful variant annotation and very large sample sizes. Here, we describe the aggregation of 125,748 exomes and 15,708 genomes from human sequencing studies into the Genome Aggregation Database (gnomAD). We identify 443,769 high-confidence pLoF variants in this cohort after filtering for sequencing and annotation artifacts. Using an improved model of human mutation, we classify human protein-coding genes along a spectrum representing intolerance to inactivation, validate this classification using data from model organisms and engineered human cells, and show that it can be used to improve gene discovery power for both common and rare diseases.

1,128 citations

Journal Article•DOI•

The diverse roles of DNA methylation in mammalian development and disease

Maxim V. C. Greenberg¹, Déborah Bourc'his¹•Institutions (1)

Curie Institute¹

09 Aug 2019-Nature Reviews Molecular Cell Biology

TL;DR: The mechanisms and functions of DNA methylation and demethylation in both mice and humans at CpG-rich promoters, gene bodies and transposable elements are discussed and the dynamic erasure and re-establishment in embryonic, germline and somatic cell development is highlighted.

Abstract: DNA methylation is of paramount importance for mammalian embryonic development. DNA methylation has numerous functions: it is implicated in the repression of transposons and genes, but is also associated with actively transcribed gene bodies and, in some cases, with gene activation per se. In recent years, sensitive technologies have been developed that allow the interrogation of DNA methylation patterns from a small number of cells. The use of these technologies has greatly improved our knowledge of DNA methylation dynamics and heterogeneity in embryos and in specific tissues. Combined with genetic analyses, it is increasingly apparent that regulation of DNA methylation erasure and (re-)establishment varies considerably between different developmental stages. In this Review, we discuss the mechanisms and functions of DNA methylation and demethylation in both mice and humans at CpG-rich promoters, gene bodies and transposable elements. We highlight the dynamic erasure and re-establishment of DNA methylation in embryonic, germline and somatic cell development. Finally, we provide insights into DNA methylation gained from studying genetic diseases. DNA methylation is essential for mammalian embryogenesis owing to its repression of transposons and genes, but it is also associated with gene activation. The recent use of sensitive technologies has revealed that DNA methylation dynamics vary considerably between embryonic, germline and somatic cell development, with implications for genetic diseases and cancer.

1,039 citations

Posted Content•DOI•

Generalizing RNA velocity to transient cell states through dynamical modeling

Volker Bergen¹, Marius Lange¹, Stefan Peidli¹, F. Alexander Wolf, Fabian J. Theis¹•Institutions (1)

Technische Universität München¹

29 Oct 2019-bioRxiv

TL;DR: scVelo enables disentangling heterogeneous subpopulation kinetics with unprecedented resolution in hippocampal dentate gyrus neurogenesis and pancreatic endocrinogenesis, and the authors anticipate that it will greatly facilitate the study of lineage decisions, gene regulation, and pathway activity identification.

Abstract: The introduction of RNA velocity in single cells has opened up new ways of studying cellular differentiation. The originally proposed framework obtains velocities as the deviation of the observed ratio of spliced and unspliced mRNA from an inferred steady state. Errors in velocity estimates arise if the central assumptions of a common splicing rate and the observation of the full splicing dynamics with steady-state mRNA levels are violated. With scVelo (https://scvelo.org), we address these restrictions by solving the full transcriptional dynamics of splicing kinetics using a likelihood-based dynamical model. This generalizes RNA velocity to a wide variety of systems comprising transient cell states, which are common in development and in response to perturbations. We infer gene-specific rates of transcription, splicing and degradation, and recover the latent time of the underlying cellular processes. This latent time represents the cell’s internal clock and is based only on its transcriptional dynamics. Moreover, scVelo allows us to identify regimes of regulatory changes such as stages of cell fate commitment and, therein, systematically detects putative driver genes. We demonstrate that scVelo enables disentangling heterogeneous subpopulation kinetics with unprecedented resolution in hippocampal dentate gyrus neurogenesis and pancreatic endocrinogenesis. We anticipate that scVelo will greatly facilitate the study of lineage decisions, gene regulation, and pathway activity identification.
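
The steady-state idea the abstract attributes to the originally proposed framework can be illustrated with a minimal sketch. It is not scVelo's likelihood-based dynamical model and does not use the scVelo API. The function name `steady_state_velocity`, the toy counts, and the choice to fit the steady-state ratio by least squares through the origin over all cells are assumptions made for illustration only.

```python
def steady_state_velocity(unspliced, spliced):
    """Return (gamma, velocities) for one gene measured across cells.

    gamma is the inferred steady-state ratio of unspliced to spliced mRNA;
    each velocity is the deviation of a cell from that steady state.
    """
    # Least-squares slope through the origin: gamma = sum(u*s) / sum(s*s).
    # Assumption of this sketch: all cells contribute equally to the fit.
    denom = sum(s * s for s in spliced)
    if denom == 0:
        raise ValueError("spliced counts are all zero; gamma is undefined")
    gamma = sum(u * s for u, s in zip(unspliced, spliced)) / denom
    # Positive velocity: more unspliced mRNA than the steady state predicts.
    velocities = [u - gamma * s for u, s in zip(unspliced, spliced)]
    return gamma, velocities


# Hypothetical toy data: cells lying exactly on the steady-state line.
gamma_ss, v_ss = steady_state_velocity([0.5, 1.0, 2.0], [1.0, 2.0, 4.0])

# Hypothetical toy data: the first cell carries excess unspliced mRNA.
gamma_up, v_up = steady_state_velocity([1.0, 1.0, 2.0], [1.0, 2.0, 4.0])
```

In the first toy example every velocity is zero because the cells sit on the fitted line; in the second, the first cell has a positive velocity. A single shared ratio of this kind is what breaks down when the assumptions listed in the abstract are violated.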

712 citations

Journal Article•DOI•

Genetic compensation triggered by mutant mRNA degradation

Mohamed A. El-Brolosy¹, Zacharias Kontarakis¹, Andrea Rossi¹,², Carsten Kuenne¹, Stefan Günther¹, Nana Fukuda¹, Khrievono Kikhi¹, Giulia L. M. Boezio¹, Carter M. Takacs³,⁴, Shih-Lei Lai⁵,¹, Ryuichi Fukuda¹, Claudia Gerri⁶,¹, Antonio J. Giraldez³, Didier Y.R. Stainier¹•Institutions (6)

Max Planck Society¹, Leibniz Association², Yale University³, University of New Haven⁴, Academia Sinica⁵, Francis Crick Institute⁶

03 Apr 2019-Nature

TL;DR: Transcriptional adaptation, a genetic compensation process by which organisms respond to mutations by upregulating related genes, is triggered by mRNA decay and involves a sequence-dependent mechanism.

Abstract: Genetic robustness, or the ability of an organism to maintain fitness in the presence of harmful mutations, can be achieved via protein feedback loops. Previous work has suggested that organisms may also respond to mutations by transcriptional adaptation, a process by which related gene(s) are upregulated independently of protein feedback loops. However, the prevalence of transcriptional adaptation and its underlying molecular mechanisms are unknown. Here, by analysing several models of transcriptional adaptation in zebrafish and mouse, we uncover a requirement for mutant mRNA degradation. Alleles that fail to transcribe the mutated gene do not exhibit transcriptional adaptation, and these alleles give rise to more severe phenotypes than alleles displaying mutant mRNA decay. Transcriptome analysis in alleles displaying mutant mRNA decay reveals the upregulation of a substantial proportion of the genes that exhibit sequence similarity with the mutated gene's mRNA, suggesting a sequence-dependent mechanism. These findings have implications for our understanding of disease-causing mutations, and will help in the design of mutant alleles with minimal transcriptional adaptation-derived compensation.

679 citations

Journal Article•DOI•

The multiple mechanisms that regulate p53 activity and cell fate

Antonina Hafner¹, Martha L. Bulyk², Ashwini Jambhekar¹, Galit Lahav¹•Institutions (2)

Harvard University¹, Brigham and Women's Hospital²

01 Apr 2019-Nature Reviews Molecular Cell Biology

TL;DR: This Review discusses how the interaction of p53 with DNA and chromatin affects gene expression, and how p53 post-translational modifications, its temporal expression dynamics and its interactions with chromatin regulators and transcription factors influence cell fate.

Abstract: The tumour suppressor p53 has a central role in the response to cellular stress. Activated p53 transcriptionally regulates hundreds of genes that are involved in multiple biological processes, including in DNA damage repair, cell cycle arrest, apoptosis and senescence. In the context of DNA damage, p53 is thought to be a decision-making transcription factor that selectively activates genes as part of specific gene expression programmes to determine cellular outcomes. In this Review, we discuss the multiple molecular mechanisms of p53 regulation and how they modulate the induction of apoptosis or cell cycle arrest following DNA damage. Specifically, we discuss how the interaction of p53 with DNA and chromatin affects gene expression, and how p53 post-translational modifications, its temporal expression dynamics and its interactions with chromatin regulators and transcription factors influence cell fate. These multiple layers of regulation enable p53 to execute cellular responses that are appropriate for specific cellular states and environmental conditions.

611 citations

Journal Article•DOI•

Interferon-Stimulated Genes: What Do They All Do?

John W. Schoggins¹•Institutions (1)

University of Texas Southwestern Medical Center¹

29 Sep 2019-Annual Review of Virology

TL;DR: This review focuses on the antiviral activities of the IFN/ISG system, covering general paradigms of ISG function, supported by specific examples in the literature, as well as methodologies to identify and characterize ISG function.

Abstract: In the absence of an intact interferon (IFN) response, mammals may be susceptible to lethal viral infection. IFNs are secreted cytokines that activate a signal transduction cascade leading to the induction of hundreds of interferon-stimulated genes (ISGs). Remarkably, approximately 10% of the genes in the human genome have the potential to be regulated by IFNs. What do all of these genes do? It is a complex question without a simple answer. From decades of research, we know that many of the protein products encoded by these ISGs work alone or in concert to achieve one or more cellular outcomes, including antiviral defense, antiproliferative activities, and stimulation of adaptive immunity. The focus of this review is the antiviral activities of the IFN/ISG system. This includes general paradigms of ISG function, supported by specific examples in the literature, as well as methodologies to identify and characterize ISG function.

502 citations

Journal Article•DOI•

RNA-Seq Signatures Normalized by mRNA Abundance Allow Absolute Deconvolution of Human Immune Cell Types.

Gianni Monaco¹,²,³, Bernett Lee², Weili Xu², Seri Mustafah², You Yi Hwang², Christophe Carre⁴, Nicolas Burdin⁴, Lucian Visan⁴, Michele Ceccarelli⁵, Michael Poidinger², Alfred Zippelius¹, João Pedro de Magalhães³, Anis Larbi•Institutions (5)

University of Basel¹, Agency for Science, Technology and Research², University of Liverpool³, Sanofi Pasteur⁴, University of Sannio⁵

05 Feb 2019-Cell Reports

TL;DR: This work characterized 29 immune cell types within the peripheral blood mononuclear cell (PBMC) fraction of healthy donors using RNA-seq (RNA sequencing) and flow cytometry to identify sets of genes that are specific, are co-expressed, and have housekeeping roles across the 29 cell types.

497 citations

Journal Article•DOI•

OMIM.org: leveraging knowledge across phenotype-gene relationships.

Joanna S. Amberger¹, Carol A. Bocchini¹, Alan F. Scott¹, Ada Hamosh¹•Institutions (1)

Johns Hopkins University School of Medicine¹

08 Jan 2019-Nucleic Acids Research

TL;DR: OMIM.org provides interactive access to the Mendelian Inheritance in Man knowledge repository, including genomic coordinate searches of the gene map, views of genetic heterogeneity of phenotypes in Phenotypic Series, and side-by-side comparisons of clinical synopses.

Abstract: For over 50 years Mendelian Inheritance in Man has chronicled the collective knowledge of the field of medical genetics. It initially cataloged the known X-linked, autosomal recessive and autosomal dominant inherited disorders, but grew to be the primary repository of curated information on both genes and genetic phenotypes and the relationships between them. Each phenotype and gene is given a separate entry assigned a stable, unique identifier. The entries contain structured summaries of new and important information based on expert review of the biomedical literature. OMIM.org provides interactive access to the knowledge repository, including genomic coordinate searches of the gene map, views of genetic heterogeneity of phenotypes in Phenotypic Series, and side-by-side comparisons of clinical synopses. OMIM.org also supports computational queries via a robust API. All entries have extensive targeted links to other genomic resources and additional references. Updates to OMIM can be found on the update list or followed through the MIMmatch service. Updated user guides and tutorials are available on the website. As of September 2018, OMIM had over 24,600 entries, and the OMIM Morbid Map Scorecard had 6,259 molecularized phenotypes connected to 3,961 genes.

487 citations

Journal Article•DOI•

Pathogen-induced activation of disease-suppressive functions in the endophytic root microbiome

Víctor J. Carrión¹, Juan E. Pérez-Jaramillo², Viviane Cordovez¹, Vittorio Tracanna³, Mattias de Hollander, Daniel Ruiz-Buck, Lucas William Mendes⁴, Wilfred F. J. van IJcken⁵, Ruth Gomez-Exposito³, Somayah S. Elsayed¹, Prarthana Mohanraju³, Adini Q Arifah³, John van der Oost³, Joseph N. Paulson⁶, Rodrigo Mendes⁷, Gilles P. van Wezel¹, Marnix H. Medema³, Jos M. Raaijmakers¹•Institutions (7)

Leiden University¹, University of Antioquia², Wageningen University and Research Centre³, University of São Paulo⁴, Erasmus University Rotterdam⁵, Genentech⁶, Empresa Brasileira de Pesquisa Agropecuária⁷

01 Nov 2019-Science

TL;DR: The results highlight that endophytic root microbiomes harbor a wealth of as yet unknown functional traits that, in concert, can protect the plant inside out.

Abstract: Microorganisms living inside plants can promote plant growth and health, but their genomic and functional diversity remain largely elusive. Here, metagenomics and network inference show that fungal infection of plant roots enriched for Chitinophagaceae and Flavobacteriaceae in the root endosphere and for chitinase genes and various unknown biosynthetic gene clusters encoding the production of nonribosomal peptide synthetases (NRPSs) and polyketide synthases (PKSs). After strain-level genome reconstruction, a consortium of Chitinophaga and Flavobacterium was designed that consistently suppressed fungal root disease. Site-directed mutagenesis then revealed that a previously unidentified NRPS-PKS gene cluster from Flavobacterium was essential for disease suppression by the endophytic consortium. Our results highlight that endophytic root microbiomes harbor a wealth of as yet unknown functional traits that, in concert, can protect the plant inside out.

482 citations

Journal Article•DOI•

Prediction of functional microRNA targets by integrative modeling of microRNA binding and target expression data

Weijun Liu¹, Xiaowei Wang¹•Institutions (1)

Washington University in St. Louis¹

22 Jan 2019-Genome Biology

TL;DR: A large-scale RNA sequencing study is performed to experimentally identify genes that are downregulated by 25 miRNAs and an improved computational model for genome-wide miRNA target prediction is developed and validated.

Abstract: We perform a large-scale RNA sequencing study to experimentally identify genes that are downregulated by 25 miRNAs. This RNA-seq dataset is combined with public miRNA target binding data to systematically identify miRNA targeting features that are characteristic of both miRNA binding and target downregulation. By integrating these common features in a machine learning framework, we develop and validate an improved computational model for genome-wide miRNA target prediction. All prediction data can be accessed at miRDB ( http://mirdb.org ).

Journal Article•DOI•

Cistrome Data Browser: expanded datasets and new tools for gene regulatory analysis.

Rongbin Zheng¹, Changxin Wan¹, Shenglin Mei¹, Qian Qin¹, Qiu Wu¹, Hanfei Sun¹, Chen-Hao Chen², Myles Brown², Xiaoyan Zhang¹, Clifford A. Meyer², X. Shirley Liu¹,²•Institutions (2)

Tongji University¹, Harvard University²

08 Jan 2019-Nucleic Acids Research

TL;DR: The Cistrome DB has a new Toolkit module with several features that allow users to better utilize the large-scale ChIP-seq, DNase-seq and ATAC-seq data, and the new tools will greatly benefit the biomedical research community.

Abstract: The Cistrome Data Browser (DB) is a resource of human and mouse cis-regulatory information derived from ChIP-seq, DNase-seq and ATAC-seq chromatin profiling assays, which map the genome-wide locations of transcription factor binding sites, histone post-translational modifications and regions of chromatin accessible to endonuclease activity. Currently, the Cistrome DB contains approximately 47,000 human and mouse samples with about 24,000 newly collected datasets compared to the previous release two years ago. Furthermore, the Cistrome DB has a new Toolkit module with several features that allow users to better utilize the large-scale ChIP-seq, DNase-seq, and ATAC-seq data. First, users can query the factors which are likely to regulate a specific gene of interest. Second, the Cistrome DB Toolkit facilitates searches for factor binding, histone modifications, and chromatin accessibility in any given genomic interval shorter than 2Mb. Third, the Toolkit can determine the most similar ChIP-seq, DNase-seq, and ATAC-seq samples in terms of genomic interval overlaps with user-provided genomic interval sets. The Cistrome DB is a user-friendly, up-to-date, and well maintained resource, and the new tools will greatly benefit the biomedical research community. The database is freely available at http://cistrome.org/db, and the Toolkit is at http://dbtoolkit.cistrome.org.

Journal Article•DOI•

Gene duplication and evolution in recurring polyploidization-diploidization cycles in plants.

Xin Qiao¹, Qionghou Li¹, Hao Yin¹, Kaijie Qi¹, Leiting Li¹, Runze Wang¹, Shaoling Zhang¹, Andrew H. Paterson²•Institutions (2)

Nanjing Agricultural University¹, Plant Genome Mapping Laboratory²

21 Feb 2019-Genome Biology

TL;DR: A comprehensive landscape of different modes of gene duplication across the plant kingdom is identified by comparing 141 genomes, which provides a solid foundation for further investigation of the dynamic evolution of duplicate genes.

Abstract: The sharp increase of plant genome and transcriptome data provides valuable resources to investigate evolutionary consequences of gene duplication in a range of taxa, and unravel common principles underlying duplicate gene retention. We survey 141 sequenced plant genomes to elucidate consequences of gene and genome duplication, processes central to the evolution of biodiversity. We develop a pipeline named DupGen_finder to identify different modes of gene duplication in plants. Genes derived from whole-genome, tandem, proximal, transposed, or dispersed duplication differ in abundance, selection pressure, expression divergence, and gene conversion rate among genomes. The number of WGD-derived duplicate genes decreases exponentially with increasing age of duplication events; transposed duplication- and dispersed duplication-derived genes declined in parallel. In contrast, the frequency of tandem and proximal duplications showed no significant decrease over time, providing a continuous supply of variants available for adaptation to continuously changing environments. Moreover, tandem and proximal duplicates experienced stronger selective pressure than genes formed by other modes and evolved toward biased functional roles involved in plant self-defense. The rate of gene conversion among WGD-derived gene pairs declined over time, peaking shortly after polyploidization. To provide a platform for accessing duplicated gene pairs in different plants, we constructed the Plant Duplicate Gene Database. We identify a comprehensive landscape of different modes of gene duplication across the plant kingdom by comparing 141 genomes, which provides a solid foundation for further investigation of the dynamic evolution of duplicate genes.

Journal Article•DOI•

Large-Scale Exome Sequencing Study Implicates Both Developmental and Functional Changes in the Neurobiology of Autism

F. Kyle Satterstrom¹, Jack A. Kosmicki¹, Jiebiao Wang², Michael S. Breen³ +150 more•Institutions (45)

13 Apr 2019-Social Science Research Network

TL;DR: Using an enhanced Bayesian framework to integrate de novo and case-control rare variation, 102 risk genes are identified at a false discovery rate ≤ 0.1; their expression is enriched in both excitatory and inhibitory neuronal lineages, consistent with multiple paths to an excitatory/inhibitory imbalance underlying ASD.

Abstract: We present the largest exome sequencing study of autism spectrum disorder (ASD) to date (n=35,584 total samples, 11,986 with ASD). Using an enhanced Bayesian framework to integrate de novo and case-control rare variation, we identify 102 risk genes at a false discovery rate ≤ 0.1. Of these genes, 49 show higher frequencies of disruptive de novo variants in individuals ascertained for severe neurodevelopmental delay, while 53 show higher frequencies in individuals ascertained for ASD; comparing ASD cases with mutations in these groups reveals phenotypic differences. Expressed early in brain development, most of the risk genes have roles in regulation of gene expression or neuronal communication (i.e., mutations effect neurodevelopmental and neurophysiological changes), and 13 fall within loci recurrently hit by copy number variants. In human cortex single-cell gene expression data, expression of risk genes is enriched in both excitatory and inhibitory neuronal lineages, consistent with multiple paths to an excitatory/inhibitory imbalance underlying ASD.
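
To make the phrase "at a false discovery rate ≤ 0.1" concrete, the sketch below shows one generic way a Bayesian analysis can turn per-gene posterior probabilities into a gene list with a controlled FDR. The abstract does not describe the study's actual model, so everything here is an assumption for illustration: the function names, the gene labels, and the posterior values are hypothetical, and the rule used (q-value as the running mean of 1 minus the posterior over the top-ranked genes) is a standard textbook construction rather than the authors' method.

```python
def bayesian_fdr_qvalues(posteriors):
    """Map gene -> q-value from gene -> posterior probability of being a risk gene.

    Genes are ranked by decreasing posterior. The q-value of a gene is the
    mean local false discovery rate (1 - posterior) of all genes ranked at
    or above it, i.e. the expected share of false positives in that list.
    """
    ranked = sorted(posteriors, key=posteriors.get, reverse=True)
    qvalues = {}
    running_local_fdr = 0.0
    for rank, gene in enumerate(ranked, start=1):
        running_local_fdr += 1.0 - posteriors[gene]
        qvalues[gene] = running_local_fdr / rank
    return qvalues


def select_genes(posteriors, threshold=0.1):
    """Return the sorted names of genes whose q-value is at most threshold."""
    qvalues = bayesian_fdr_qvalues(posteriors)
    return sorted(gene for gene, q in qvalues.items() if q <= threshold)


# Hypothetical posterior probabilities, not taken from the study.
toy_posteriors = {"GENE_A": 0.99, "GENE_B": 0.95, "GENE_C": 0.80, "GENE_D": 0.30}
toy_qvalues = bayesian_fdr_qvalues(toy_posteriors)
toy_selected = select_genes(toy_posteriors, threshold=0.1)
```

With these toy values the first three genes pass the 0.1 threshold and the fourth does not, because including it would raise the expected share of false positives in the list to about 0.24.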

Journal Article•DOI•

METTL3 facilitates tumor progression via an m6A-IGF2BP2-dependent mechanism in colorectal carcinoma

Ting Li¹, Pei Shan Hu¹, Zhixiang Zuo¹, Jin Fei Lin¹, Xingyang Li¹, Qi Nian Wu¹, Zhan Hong Chen¹, Zhao Lei Zeng¹, Feng Wang¹, Jian Zheng¹, Demeng Chen¹, Bo Li¹, Tie Bang Kang¹, Dan Xie¹, Dongxin Lin¹,², Huai-Qiang Ju¹, Rui-Hua Xu¹•Institutions (2)

Sun Yat-sen University¹, Peking Union Medical College²

24 Jun 2019-Molecular Cancer

TL;DR: This study revealed that METTL3, acting as an oncogene, maintained SOX2 expression through an m6A-IGF2BP2-dependent mechanism in CRC cells, and indicated a potential biomarker panel for prognostic prediction in CRC.

Abstract: Colorectal carcinoma (CRC) is one of the most common malignant tumors, and its main cause of death is tumor metastasis. RNA N6-methyladenosine (m6A) is an emerging regulatory mechanism for gene expression and methyltransferase-like 3 (METTL3) participates in tumor progression in several cancer types. However, its role in CRC remains unexplored. Western blot, quantitative real-time PCR (RT-qPCR) and immunohistochemistry (IHC) were used to detect METTL3 expression in cell lines and patient tissues. Methylated RNA immunoprecipitation sequencing (MeRIP-seq) and transcriptomic RNA sequencing (RNA-seq) were used to screen the target genes of METTL3. The biological functions of METTL3 were investigated in vitro and in vivo. RNA pull-down and RNA immunoprecipitation assays were conducted to explore the specific binding of target genes. An RNA stability assay was used to detect the half-lives of the downstream genes of METTL3. Using the TCGA database, higher METTL3 expression was found in CRC metastatic tissues and was associated with a poor prognosis. MeRIP-seq revealed that SRY (sex determining region Y)-box 2 (SOX2) was the downstream gene of METTL3. METTL3 knockdown in CRC cells drastically inhibited cell self-renewal, stem cell frequency and migration in vitro and suppressed CRC tumorigenesis and metastasis in both cell-based models and PDX models. Mechanistically, methylated SOX2 transcripts, specifically the coding sequence (CDS) regions, were subsequently recognized by the specific m6A “reader”, insulin-like growth factor 2 mRNA binding protein 2 (IGF2BP2), to prevent SOX2 mRNA degradation. Further, SOX2 expression positively correlated with METTL3 and IGF2BP2 in CRC tissues. The combined IHC panel, including “writer”, “reader”, and “target”, exhibited a better prognostic value for CRC patients than any of these components individually. Overall, our study revealed that METTL3, acting as an oncogene, maintained SOX2 expression through an m6A-IGF2BP2-dependent mechanism in CRC cells, and indicated a potential biomarker panel for prognostic prediction in CRC.

Journal Article•DOI•

Brain cell type-specific enhancer-promoter interactome maps and disease-risk association.

Alexi Nott¹, Inge R. Holtman²,¹, Nicole G. Coufal³,¹, Johannes C. M. Schlachetzki¹, Miao Yu⁴, Rong Hu⁴, Claudia Z. Han¹, Monique Pena³, Jiayang Xiao³, Yin Wu³, Zahara Keulen³, Martina P. Pasillas¹, Carolyn O’Connor³, Christian K. Nickl¹, Simon T. Schafer³, Zeyang Shen¹, Robert A. Rissman⁵,¹, James B. Brewer¹, David Gosselin¹,⁶, David D. Gonda¹, Michael J. Levy¹, Michael G. Rosenfeld¹, Graham McVicker³, Fred H. Gage², Bing Ren³,¹, Christopher K. Glass¹•Institutions (6)

University of California, San Diego¹, University Medical Center Groningen², Salk Institute for Biological Studies³, Ludwig Institute for Cancer Research⁴, Veterans Health Administration⁵, Laval University⁶

29 Nov 2019-Science

TL;DR: Cell-type-specific enhancer-promoter interactome maps of the human brain revise and expand the list of genes likely to be influenced by noncoding variants in AD and suggest the probable cell types in which they function.

Abstract: Noncoding genetic variation is a major driver of phenotypic diversity, but functional interpretation is challenging. To better understand common genetic variation associated with brain diseases, we defined noncoding regulatory regions for major cell types of the human brain. Whereas psychiatric disorders were primarily associated with variants in transcriptional enhancers and promoters in neurons, sporadic Alzheimer's disease (AD) variants were largely confined to microglia enhancers. Interactome maps connecting disease-risk variants in cell-type-specific enhancers to promoters revealed an extended microglia gene network in AD. Deletion of a microglia-specific enhancer harboring AD-risk variants ablated BIN1 expression in microglia, but not in neurons or astrocytes. These findings revise and expand the list of genes likely to be influenced by noncoding variants in AD and suggest the probable cell types in which they function.

Journal Article•DOI•

Spatial transcriptome profiling by MERFISH reveals subcellular RNA compartmentalization and cell cycle-dependent gene expression.

Chenglong Xia¹, Jean Fan¹,², George Emanuel¹,², Junjie Hao¹,², Xiaowei Zhuang¹•Institutions (2)

Howard Hughes Medical Institute¹, Harvard University²

24 Sep 2019-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The ability to perform spatially resolved, genome-wide RNA profiling with high detection efficiency and accuracy by MERFISH could help address a wide array of questions ranging from the regulation of gene expression in cells to the development of cell fate and organization in tissues.

Abstract: The expression profiles and spatial distributions of RNAs regulate many cellular functions. Image-based transcriptomic approaches provide powerful means to measure both expression and spatial information of RNAs in individual cells within their native environment. Among these approaches, multiplexed error-robust fluorescence in situ hybridization (MERFISH) has achieved spatially resolved RNA quantification at transcriptome scale by massively multiplexing single-molecule FISH measurements. Here, we increased the gene throughput of MERFISH and demonstrated simultaneous measurements of RNA transcripts from ∼10,000 genes in individual cells with ∼80% detection efficiency and ∼4% misidentification rate. We combined MERFISH with cellular structure imaging to determine subcellular compartmentalization of RNAs. We validated this approach by showing enrichment of secretome transcripts at the endoplasmic reticulum, and further revealed enrichment of long noncoding RNAs, RNAs with retained introns, and a subgroup of protein-coding mRNAs in the cell nucleus. Leveraging spatially resolved RNA profiling, we developed an approach to determine RNA velocity in situ using the balance of nuclear versus cytoplasmic RNA counts. We applied this approach to infer pseudotime ordering of cells and identified cells at different cell-cycle states, revealing ∼1,600 genes with putative cell cycle-dependent expression and a gradual transcription profile change as cells progress through cell-cycle stages. Our analysis further revealed cell cycle-dependent and cell cycle-independent spatial heterogeneity of transcriptionally distinct cells. We envision that the ability to perform spatially resolved, genome-wide RNA profiling with high detection efficiency and accuracy by MERFISH could help address a wide array of questions ranging from the regulation of gene expression in cells to the development of cell fate and organization in tissues.

Journal Article•DOI•

Transcriptome-wide off-target RNA editing induced by CRISPR-guided DNA base editors

Julian Grünewald, Ronghao Zhou¹, Sara P. Garcia¹, Sowmya Iyer¹, Caleb A. Lareau¹, Martin J. Aryee, J. Keith Joung•Institutions (1)

Harvard University¹

17 Apr 2019-Nature

TL;DR: It is shown that a CBE with rat APOBEC1 can cause extensive transcriptome-wide deamination of RNA cytosines in human cells, inducing tens of thousands of C-to-U edits; the results suggest the need to more fully define and characterize the RNA off-target effects of deaminase enzymes in base editor platforms.

Abstract: CRISPR-Cas base-editor technology enables targeted nucleotide alterations, and is being increasingly used for research and potential therapeutic applications [1,2]. The most widely used cytosine base editors (CBEs) induce deamination of DNA cytosines using the rat APOBEC1 enzyme, which is targeted by a linked Cas protein-guide RNA complex [3,4]. Previous studies of the specificity of CBEs have identified off-target DNA edits in mammalian cells [5,6]. Here we show that a CBE with rat APOBEC1 can cause extensive transcriptome-wide deamination of RNA cytosines in human cells, inducing tens of thousands of C-to-U edits with frequencies ranging from 0.07% to 100% in 38-58% of expressed genes. CBE-induced RNA edits occur in both protein-coding and non-protein-coding sequences and generate missense, nonsense, splice site, and 5' and 3' untranslated region mutations. We engineered two CBE variants bearing mutations in rat APOBEC1 that substantially decreased the number of RNA edits (by more than 390-fold and more than 3,800-fold) in human cells. These variants also showed more precise on-target DNA editing than the wild-type CBE and, for most guide RNAs tested, no substantial reduction in editing efficiency. Finally, we show that an adenine base editor [7] can also induce transcriptome-wide RNA edits. These results have implications for the use of base editors in both research and clinical settings, illustrate the feasibility of engineering improved variants with reduced RNA editing activities, and suggest the need to more fully define and characterize the RNA off-target effects of deaminase enzymes in base editor platforms.

Journal Article•DOI•

A Genome-wide Framework for Mapping Gene Regulation via Cellular Genetic Screens.

Molly Gasperini¹, Andrew J. Hill¹, José L. McFaline-Figueroa¹, Beth Martin¹, Seungsoo Kim¹, Melissa D. Zhang¹, Dana Jackson¹, Anh Leith¹, Jacob Schreiber¹, William Stafford Noble¹, Cole Trapnell¹, Nadav Ahituv², Jay Shendure¹,³•Institutions (3)

University of Washington¹, University of California, San Francisco², Howard Hughes Medical Institute³

10 Jan 2019-Cell

TL;DR: This work presents a multiplex, expression quantitative trait locus (eQTL)-inspired framework for mapping enhancer-gene pairs by introducing random combinations of CRISPR/Cas9-mediated perturbations to each of many cells, followed by single-cell RNA sequencing (RNA-seq).

Journal Article•DOI•

ChEA3: transcription factor enrichment analysis by orthogonal omics integration.

Alexandra B Keenan¹, Denis Torre¹, Alexander Lachmann¹, Ariel K Leong¹, Megan L. Wojciechowicz¹, Vivian Utti¹, Kathleen M. Jagodnik¹, Eryk Kropiwnicki¹, Zichen Wang¹, Avi Ma'ayan¹•Institutions (1)

Icahn School of Medicine at Mount Sinai¹

02 Jul 2019-Nucleic Acids Research

TL;DR: The ChEA3 background database contains a collection of gene set libraries generated from multiple sources including TF–gene co-expression from RNA-seq studies, TF–target associations from ChIP-seq experiments, and TF–gene co-occurrence computed from crowd-submitted gene lists, which illuminate general transcription factor properties such as whether the TF behaves as an activator or a repressor.

Abstract: Identifying the transcription factors (TFs) responsible for observed changes in gene expression is an important step in understanding gene regulatory networks. ChIP-X Enrichment Analysis 3 (ChEA3) is a transcription factor enrichment analysis tool that ranks TFs associated with user-submitted gene sets. The ChEA3 background database contains a collection of gene set libraries generated from multiple sources including TF-gene co-expression from RNA-seq studies, TF-target associations from ChIP-seq experiments, and TF-gene co-occurrence computed from crowd-submitted gene lists. Enrichment results from these distinct sources are integrated to generate a composite rank that improves the prediction of the correct upstream TF compared to ranks produced by individual libraries. We compare ChEA3 with existing TF prediction tools and show that ChEA3 performs better. By integrating the ChEA3 libraries, we illuminate general transcription factor properties such as whether the TF behaves as an activator or a repressor. The ChEA3 web-server is available from https://amp.pharm.mssm.edu/ChEA3.

Journal Article•DOI•

Broad-spectrum resistance to bacterial blight in rice using genome editing

Ricardo Oliva¹, Chonghui Ji², Genelou Atienza-Grande¹,³, Jose C. Huguet-Tapia⁴, Alvaro L. Pérez-Quintero⁵,⁶, Ting Li⁷, Joon-Seob Eom⁸, Chenhao Li², Hanna Nguyen¹, Bo Liu², Florence Auguy⁶, Coline Sciallano⁶, Van Thi Luu⁸, Gerbert Sylvestre Dossa⁹, Sébastien Cunnac⁶, Sarah M. Schmidt⁸, Inez H. Slamet-Loedin¹, Casiana Vera Cruz¹, Boris Szurek⁶, Wolf B. Frommer⁸,¹⁰, Frank F. White⁴, Bing Yang²,¹¹•Institutions (11)

International Rice Research Institute¹, University of Missouri², University of the Philippines Los Baños³, University of Florida⁴, Colorado State University⁵, University of Montpellier⁶, Iowa State University⁷, Max Planck Society⁸, Frankfurt University of Applied Sciences⁹, Nagoya University¹⁰, Donald Danforth Plant Science Center¹¹

28 Oct 2019-Nature Biotechnology

TL;DR: Paddy trials showed that genome-edited SWEET promoters endow rice lines with robust, broad-spectrum resistance to all Xanthomonas bacterial blight strains tested.

Abstract: Bacterial blight of rice is an important disease in Asia and Africa. The pathogen, Xanthomonas oryzae pv. oryzae (Xoo), secretes one or more of six known transcription-activator-like effectors (TALes) that bind specific promoter sequences and induce, at minimum, one of the three host sucrose transporter genes SWEET11, SWEET13 and SWEET14, the expression of which is required for disease susceptibility. We used CRISPR-Cas9-mediated genome editing to introduce mutations in all three SWEET gene promoters. Editing was further informed by sequence analyses of TALe genes in 63 Xoo strains, which revealed multiple TALe variants for SWEET13 alleles. Mutations were also created in SWEET14, which is also targeted by two TALes from an African Xoo lineage. A total of five promoter mutations were simultaneously introduced into the rice line Kitaake and the elite mega varieties IR64 and Ciherang-Sub1. Paddy trials showed that genome-edited SWEET promoters endow rice lines with robust, broad-spectrum resistance.

Journal Article•DOI•

Organization and regulation of gene transcription.

Patrick Cramer¹•Institutions (1)

Max Planck Society¹

28 Aug 2019-Nature

TL;DR: Structural and microscopy studies of gene transcription underpin a model in which phosphorylation controls the shuttling of RNA polymerase II between promoter and gene-body condensates to regulate transcription initiation and elongation.

Abstract: The regulated transcription of genes determines cell identity and function. Recent structural studies have elucidated mechanisms that govern the regulation of transcription by RNA polymerases during the initiation and elongation phases. Microscopy studies have revealed that transcription involves the condensation of factors in the cell nucleus. A model is emerging for the transcription of protein-coding genes in which distinct transient condensates form at gene promoters and in gene bodies to concentrate the factors required for transcription initiation and elongation, respectively. The transcribing enzyme RNA polymerase II may shuttle between these condensates in a phosphorylation-dependent manner. Molecular principles are being defined that rationalize transcriptional organization and regulation, and that will guide future investigations. Structural and microscopy studies of gene transcription underpin a model in which phosphorylation controls the shuttling of RNA polymerase II between promoter and gene-body condensates to regulate transcription initiation and elongation.

Journal Article•DOI•

The tomato pan-genome uncovers new genes and a rare allele regulating fruit flavor.

Lei Gao¹, Itay Gonda¹,², Honghe Sun¹, Qiyue Ma¹, Kan Bao¹, Denise M. Tieman³, Elizabeth A. Burzynski-Chang⁴, Tara Fish⁵, Kaitlin A. Stromberg¹, Gavin L. Sacks⁴, Theodore W. Thannhauser⁵, Majid R. Foolad⁶, María José Díez⁷, José Blanca⁷, Joaquín Cañizares⁷, Yimin Xu¹, Esther van der Knaap⁸, Sanwen Huang, Harry J. Klee³, James J. Giovannoni¹,⁵, Zhangjun Fei¹,⁵•Institutions (8)

Boyce Thompson Institute for Plant Research¹, Agricultural Research Organization, Volcani Center², University of Florida³, Cornell University⁴, United States Department of Agriculture⁵, Pennsylvania State University⁶, Polytechnic University of Valencia⁷, University of Georgia⁸

13 May 2019-Nature Genetics

TL;DR: A tomato pan-genome constructed using genome sequences of 725 phylogenetically and geographically representative accessions captures 4,873 genes absent from the reference genome and identifies a rare allele of TomLoxC regulating fruit flavor.

Abstract: Modern tomatoes have narrow genetic diversity limiting their improvement potential. We present a tomato pan-genome constructed using genome sequences of 725 phylogenetically and geographically representative accessions, revealing 4,873 genes absent from the reference genome. Presence/absence variation analyses reveal substantial gene loss and intense negative selection of genes and promoters during tomato domestication and improvement. Lost or negatively selected genes are enriched for important traits, especially disease resistance. We identify a rare allele in the TomLoxC promoter selected against during domestication. Quantitative trait locus mapping and analysis of transgenic plants reveal a role for TomLoxC in apocarotenoid production, which contributes to desirable tomato flavor. In orange-stage fruit, accessions harboring both the rare and common TomLoxC alleles (heterozygotes) have higher TomLoxC expression than those homozygous for either and are resurgent in modern tomatoes. The tomato pan-genome adds depth and completeness to the reference genome, and is useful for future biological discovery and breeding.

Journal Article•DOI•

Illuminating G-Protein-Coupling Selectivity of GPCRs.

Asuka Inoue¹,², Francesco Raimondi³, Francois Marie Ngako Kadji², Gurdeep Singh³, Takayuki Kishi², Akiharu Uwamizu², Yuki Ono², Yuji Shinjo², Satoru Ishida², Nadia Arang⁴, Kouki Kawakami², J. Silvio Gutkind⁴, Junken Aoki², Robert B. Russell³•Institutions (4)

Japan Agency for Medical Research and Development¹, Tohoku University², Heidelberg University³, University of California, San Diego⁴

13 Jun 2019-Cell

TL;DR: This work systematically quantified ligand-induced interactions between 148 GPCRs and all 11 unique Gα subunit C termini, and identified sequence-based coupling specificity features, inside and outside the transmembrane domain, which were used to develop a coupling predictor that outperforms previous methods.

Journal Article•DOI•

Integrated Analysis of TP53 Gene and Pathway Alterations in The Cancer Genome Atlas.

Lawrence A. Donehower¹, Thierry Soussi², Anil Korkut³, Yuexin Liu³, Andre Schultz³, Maria F. Cardenas¹, Xubin Li³, Özgün Babur⁴, Teng-Kuei Hsu¹, Olivier Lichtarge¹, John N. Weinstein³, Rehan Akbani³, David A. Wheeler¹•Institutions (4)

Baylor College of Medicine¹, Karolinska Institutet², University of Texas MD Anderson Cancer Center³, Oregon Health & Science University⁴

30 Jul 2019-Cell Reports

TL;DR: Tumors with TP53 mutations differ from their non-mutated counterparts in RNA, miRNA, and protein expression patterns, with mutant TP53 tumors displaying enhanced expression of cell cycle progression genes and proteins.

Journal Article•DOI•

Visualizing DNA folding and RNA in embryos at single-cell resolution

Leslie J. Mateo¹, Sedona E. Murphy¹, Antonina Hafner¹, Isaac S. Cinquini¹, Carly A. Walker¹, Alistair N. Boettiger¹•Institutions (1)

Stanford University¹

18 Mar 2019-Nature

TL;DR: In this paper, optical reconstruction of chromatin architecture (ORCA) is used to trace the DNA path in single cells with nanoscale accuracy and genomic resolution reaching two kilobases.

Abstract: The establishment of cell types during development requires precise interactions between genes and distal regulatory sequences. We have a limited understanding of how these interactions look in three dimensions, vary across cell types in complex tissue, and relate to transcription. Here we describe optical reconstruction of chromatin architecture (ORCA), a method that can trace the DNA path in single cells with nanoscale accuracy and genomic resolution reaching two kilobases. We used ORCA to study a Hox gene cluster in cryosectioned Drosophila embryos and labelled around 30 RNA species in parallel. We identified cell-type-specific physical borders between active and Polycomb-repressed DNA, and unexpected Polycomb-independent borders. Deletion of Polycomb-independent borders led to ectopic enhancer-promoter contacts, aberrant gene expression, and developmental defects. Together, these results illustrate an approach for high-resolution, single-cell DNA domain analysis in vivo, identify domain structures that change with cell identity, and show that border elements contribute to the formation of physical domains in Drosophila.

Journal Article•DOI•

High-throughput single-cell ChIP-seq identifies heterogeneity of chromatin states in breast cancer.

Kevin Grosselin, Adeline Durand¹, Justine Marsolier¹, Adeline Poitou, Elisabetta Marangoni¹, Fariba Nemati¹, Ahmed Dahmani¹, Sonia Lameiras¹, Fabien Reyal¹, Olivia Frenoy², Yannick Pousse, Marcel Reichen³, Adam Woolfe, Colin Brenan, Andrew D. Griffiths⁴, Céline Vallot¹, Annabelle Gérard•Institutions (4)

Curie Institute¹, Pasteur Institute², Bayer³, Centre national de la recherche scientifique⁴

31 May 2019-Nature Genetics

TL;DR: A single-cell chromatin immunoprecipitation followed by sequencing approach paves the way to study the role of chromatin heterogeneity, not just in cancer but in other diseases and healthy systems, notably during cellular differentiation and development.

Abstract: Modulation of chromatin structure via histone modification is a major epigenetic mechanism and regulator of gene expression. However, the contribution of chromatin features to tumor heterogeneity and evolution remains unknown. Here we describe a high-throughput droplet microfluidics platform to profile chromatin landscapes of thousands of cells at single-cell resolution. Using patient-derived xenograft models of acquired resistance to chemotherapy and targeted therapy in breast cancer, we found that a subset of cells within untreated drug-sensitive tumors share a common chromatin signature with resistant cells, undetectable using bulk approaches. These cells, and cells from the resistant tumors, have lost chromatin marks (H3K27me3, which is associated with stable transcriptional repression) for genes known to promote resistance to treatment. This single-cell chromatin immunoprecipitation followed by sequencing approach paves the way to study the role of chromatin heterogeneity, not just in cancer but in other diseases and healthy systems, notably during cellular differentiation and development.

Journal Article•DOI•

Total synthesis of Escherichia coli with a recoded genome

Julius Fredens¹, Kaihang Wang¹,², Daniel de la Torre¹, Louise F. H. Funke¹, Wesley E. Robertson¹, Yonka Christova¹, Tiongsun Chia¹, Wolfgang H. Schmied¹, Daniel L. Dunkelmann¹, Václav Beránek¹, Chayasith Uttamapinant¹, Andres Gonzalez Llamazares¹, Thomas S. Elliott¹, Jason W. Chin¹•Institutions (2)

Laboratory of Molecular Biology¹, California Institute of Technology²

01 May 2019-Nature

TL;DR: The number of codons used to encode the canonical amino acids can be reduced through the genome-wide substitution of target codons by defined synonyms, as demonstrated by a high-fidelity convergent total synthesis of a recoded Escherichia coli genome.

Abstract: Nature uses 64 codons to encode the synthesis of proteins from the genome, and chooses 1 sense codon (out of up to 6 synonyms) to encode each amino acid. Synonymous codon choice has diverse and important roles, and many synonymous substitutions are detrimental. Here we demonstrate that the number of codons used to encode the canonical amino acids can be reduced, through the genome-wide substitution of target codons by defined synonyms. We create a variant of Escherichia coli with a four-megabase synthetic genome through a high-fidelity convergent total synthesis. Our synthetic genome implements a defined recoding and refactoring scheme (with simple corrections at just seven positions) to replace every known occurrence of two sense codons and a stop codon in the genome. Thus, we recode 18,214 codons to create an organism with a 61-codon genome; this organism uses 59 codons to encode the 20 amino acids, and enables the deletion of a previously essential transfer RNA.

Journal Article•DOI•

Off-target RNA mutation induced by DNA base editing and its elimination by mutagenesis

Changyang Zhou¹, Yidi Sun¹,², Rui Yan³, Yajing Liu¹,⁴, Erwei Zuo¹, Chan Gu³, Linxiao Han¹, Yu Wei¹, Xinde Hu¹, Rong Zeng¹,⁴, Yixue Li³,⁴,⁵, Haibo Zhou¹, Fan Guo³, Hui Yang¹•Institutions (5)

Chinese Academy of Sciences¹, CAS-MPG Partner Institute for Computational Biology², Sichuan University³, ShanghaiTech University⁴, Shanghai Jiao Tong University⁵

10 Jun 2019-Nature

TL;DR: The deaminases that are integral to commonly used DNA base editors often bind to RNA; this study quantitatively evaluated the RNA single nucleotide variations (SNVs) induced by CBEs or ABEs.

Abstract: Recently developed DNA base editing methods enable the direct generation of desired point mutations in genomic DNA without generating any double-strand breaks [1-3], but the issue of off-target edits has limited the application of these methods. Although several previous studies have evaluated off-target mutations in genomic DNA [4-8], it is now clear that the deaminases that are integral to commonly used DNA base editors often bind to RNA [9-13]. For example, the cytosine deaminase APOBEC1, which is used in cytosine base editors (CBEs), targets both DNA and RNA [12], and the adenine deaminase TadA, which is used in adenine base editors (ABEs), induces site-specific inosine formation on RNA [9,11]. However, any potential RNA mutations caused by DNA base editors have not been evaluated. Adeno-associated viruses are the most common delivery system for gene therapies that involve DNA editing; these viruses can sustain long-term gene expression in vivo, so the extent of potential RNA mutations induced by DNA base editors is of great concern [14-16]. Here we quantitatively evaluated RNA single nucleotide variations (SNVs) that were induced by CBEs or ABEs. Both the cytosine base editor BE3 and the adenine base editor ABE7.10 generated tens of thousands of off-target RNA SNVs. Subsequently, by engineering deaminases, we found that three CBE variants and one ABE variant showed a reduction in off-target RNA SNVs to the baseline while maintaining efficient DNA on-target activity. This study reveals a previously overlooked aspect of off-target effects in DNA editing and also demonstrates that such effects can be eliminated by engineering deaminases.

Journal Article•DOI•

A reference genome for pea provides insight into legume genome evolution

Jonathan Kreplak¹, Mohammed-Amin Madoui², Petr Cápal, Petr Novák³, Karine Labadie², Grégoire Aubert¹, Philipp E. Bayer⁴, Krishna K. Gali⁵, Robert A. Syme⁶, Dorrie Main⁷, Anthony Klein¹, Aurélie Bérard², Iva Vrbová³, Cyril Fournier¹, Leo d’Agata², Caroline Belser², Wahiba Berrabah², Helena Toegelová, Zbyněk Milec, Jan Vrána, HueyTyng Lee⁴,⁸, Ayité Kougbeadjo¹, Morgane Terezol¹, Cécile Huneau⁹, Chala J. Turo⁶, Nacer Mohellibi², Pavel Neumann³, Matthieu Falque¹⁰, Karine Gallardo¹, Rebecca J. McGee¹¹, Bunyamin Tar’an⁵, Abdelhafid Bendahmane¹⁰, Jean-Marc Aury², Jacqueline Batley⁴, Marie-Christine Le Paslier², Noel Ellis¹², Thomas D. Warkentin⁵, Clarice J. Coyne¹¹, Jérôme Salse⁹, David Edwards⁴, Judith Lichtenzveig⁴, Jiří Macas³, Jaroslav Doležel, Patrick Wincker², Judith Burstin¹•Institutions (12)

Institut national de la recherche agronomique¹, Université Paris-Saclay², Academy of Sciences of the Czech Republic³, University of Western Australia⁴, University of Saskatchewan⁵, Curtin University⁶, Washington State University⁷, University of Giessen⁸, University of Auvergne⁹, University of Paris-Sud¹⁰, Agricultural Research Service¹¹, University of Auckland¹²

01 Sep 2019-Nature Genetics

TL;DR: The first annotated chromosome-level reference genome assembly for pea, Gregor Mendel’s original genetic model, provides insights into legume genome evolution and the molecular basis of agricultural traits for pea improvement.

Abstract: We report the first annotated chromosome-level reference genome assembly for pea, Gregor Mendel’s original genetic model. Phylogenetics and paleogenomics show genomic rearrangements across legumes and suggest a major role for repetitive elements in pea genome evolution. Compared to other sequenced Leguminosae genomes, the pea genome shows intense gene dynamics, most likely associated with genome size expansion when the Fabeae diverged from its sister tribes. During Pisum evolution, translocation and transposition differentially occurred across lineages. This reference sequence will accelerate our understanding of the molecular basis of agronomically important traits and support crop improvement.
