Showing papers on "Gene published in 2014"

PDF

Open Access

Journal Article•DOI•

featureCounts: an efficient general-purpose program for assigning sequence reads to genomic features

[...]

Yang Liao¹, Gordon K. Smyth¹, Wei Shi¹•Institutions (1)

Walter and Eliza Hall Institute of Medical Research¹

01 Apr 2014-Bioinformatics

TL;DR: FeatureCounts as discussed by the authors is a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments, which implements highly efficient chromosome hashing and feature blocking techniques.

...read moreread less

Abstract: MOTIVATION: Next-generation sequencing technologies generate millions of short sequence reads, which are usually aligned to a reference genome. In many applications, the key information required for downstream analysis is the number of reads mapping to each genomic feature, for example to each exon or each gene. The process of counting reads is called read summarization. Read summarization is required for a great variety of genomic analyses but has so far received relatively little attention in the literature. RESULTS: We present featureCounts, a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments. featureCounts implements highly efficient chromosome hashing and feature blocking techniques. It is considerably faster than existing methods (by an order of magnitude for gene-level summarization) and requires far less computer memory. It works with either single or paired-end reads and provides a wide range of options appropriate for different sequencing applications. AVAILABILITY AND IMPLEMENTATION: featureCounts is available under GNU General Public License as part of the Subread (http://subread.sourceforge.net) or Rsubread (http://www.bioconductor.org) software packages.

...read moreread less

14,103 citations

Journal Article•DOI•

Genome-Scale CRISPR-Cas9 Knockout Screening in Human Cells

[...]

Ophir Shalem¹, Ophir Shalem², Neville E. Sanjana¹, Neville E. Sanjana², Ella Hartenian¹, Xi-Shun Shi¹, David A. Scott², David A. Scott¹, Tarjei S. Mikkelsen¹, Dirk Heckl³, Benjamin L. Ebert³, David E. Root¹, John G. Doench¹, Feng Zhang², Feng Zhang¹ - Show less +11 more•Institutions (3)

Broad Institute¹, McGovern Institute for Brain Research², Brigham and Women's Hospital³

03 Jan 2014-Science

TL;DR: This work shows that lentiviral delivery of a genome-scale CRISPR-Cas9 knockout (GeCKO) library targeting 18,080 genes with 64,751 unique guide sequences enables both negative and positive selection screening in human cells, and observes a high level of consistency between independent guide RNAs targeting the same gene and a high rate of hit confirmation.

...read moreread less

Abstract: The simplicity of programming the CRISPR (clustered regularly interspaced short palindromic repeats)–associated nuclease Cas9 to modify specific genomic loci suggests a new way to interrogate gene function on a genome-wide scale. We show that lentiviral delivery of a genome-scale CRISPR-Cas9 knockout (GeCKO) library targeting 18,080 genes with 64,751 unique guide sequences enables both negative and positive selection screening in human cells. First, we used the GeCKO library to identify genes essential for cell viability in cancer and pluripotent stem cells. Next, in a melanoma model, we screened for genes whose loss is involved in resistance to vemurafenib, a therapeutic RAF inhibitor. Our highest-ranking candidates include previously validated genes NF1 and MED12 , as well as novel hits NF2 , CUL3 , TADA2B , and TADA1. We observe a high level of consistency between independent guide RNAs targeting the same gene and a high rate of hit confirmation, demonstrating the promise of genome-scale screening with Cas9.

...read moreread less

4,147 citations

Journal Article•DOI•

Comprehensive molecular profiling of lung adenocarcinoma: The cancer genome atlas research network

[...]

Eric A. Collisson¹, Joshua D. Campbell², Angela N. Brooks³, Angela N. Brooks² +315 more•Institutions (41)

01 Jan 2014-Nature

TL;DR: In this paper, the authors report molecular profiling of 230 resected lung adnocarcinomas using messenger RNA, microRNA and DNA sequencing integrated with copy number, methylation and proteomic analyses.

...read moreread less

Abstract: Adenocarcinoma of the lung is the leading cause of cancer death worldwide. Here we report molecular profiling of 230 resected lung adenocarcinomas using messenger RNA, microRNA and DNA sequencing integrated with copy number, methylation and proteomic analyses. High rates of somatic mutation were seen (mean 8.9 mutations per megabase). Eighteen genes were statistically significantly mutated, including RIT1 activating mutations and newly described loss-of-function MGA mutations which are mutually exclusive with focal MYC amplification. EGFR mutations were more frequent in female patients, whereas mutations in RBM10 were more common in males. Aberrations in NF1, MET, ERBB2 and RIT1 occurred in 13% of cases and were enriched in samples otherwise lacking an activated oncogene, suggesting a driver role for these events in certain tumours. DNA and mRNA sequence from the same tumour highlighted splicing alterations driven by somatic genomic changes, including exon 14 skipping in MET mRNA in 4% of cases. MAPK and PI(3)K pathway activity, when measured at the protein level, was explained by known mutations in only a fraction of cases, suggesting additional, unexplained mechanisms of pathway activation. These data establish a foundation for classification and further investigations of lung adenocarcinoma molecular pathogenesis.

...read moreread less

4,104 citations

Journal Article•DOI•

An RNA-Sequencing Transcriptome and Splicing Database of Glia, Neurons, and Vascular Cells of the Cerebral Cortex

[...]

Ye Zhang¹, Kenian Chen², Steven A. Sloan¹, Mariko L. Bennett¹, Anja R. Scholze¹, Sean O'Keeffe³, Hemali Phatnani³, Paolo Guarnieri⁴, Christine Caneda¹, Nadine Ruderisch⁵, Shuyun Deng², Shane A. Liddelow¹, Chaolin Zhang³, Richard Daneman⁵, Tom Maniatis³, Ben A. Barres¹, Jian Qian Wu² - Show less +13 more•Institutions (5)

Stanford University¹, University of Texas at Austin², Columbia University Medical Center³, Columbia University⁴, University of California, San Francisco⁵

03 Sep 2014-The Journal of Neuroscience

TL;DR: The authors' data provide clues as to how neurons and astrocytes differ in their ability to dynamically regulate glycolytic flux and lactate generation attributable to unique splicing of PKM2, the gene encoding the glycoleytic enzyme pyruvate kinase.

...read moreread less

Abstract: The major cell classes of the brain differ in their developmental processes, metabolism, signaling, and function To better understand the functions and interactions of the cell types that comprise these classes, we acutely purified representative populations of neurons, astrocytes, oligodendrocyte precursor cells, newly formed oligodendrocytes, myelinating oligodendrocytes, microglia, endothelial cells, and pericytes from mouse cerebral cortex We generated a transcriptome database for these eight cell types by RNA sequencing and used a sensitive algorithm to detect alternative splicing events in each cell type Bioinformatic analyses identified thousands of new cell type-enriched genes and splicing isoforms that will provide novel markers for cell identification, tools for genetic manipulation, and insights into the biology of the brain For example, our data provide clues as to how neurons and astrocytes differ in their ability to dynamically regulate glycolytic flux and lactate generation attributable to unique splicing of PKM2, the gene encoding the glycolytic enzyme pyruvate kinase This dataset will provide a powerful new resource for understanding the development and function of the brain To ensure the widespread distribution of these datasets, we have created a user-friendly website (http://webstanfordedu/group/barres_lab/brain_rnaseqhtml) that provides a platform for analyzing and comparing transciption and alternative splicing profiles for various cell classes in the brain

...read moreread less

3,891 citations

Comprehensive molecular profiling of lung adenocarcinoma

[...]

Eric S. Lander

01 Jul 2014

TL;DR: High rates of somatic mutation were seen, including RIT1 activating mutations and newly described loss-of-function MGA mutations which are mutually exclusive with focal MYC amplification, and MAPK and PI(3)K pathway activity was explained by known mutations in only a fraction of cases, suggesting additional, unexplained mechanisms of pathway activation.

...read moreread less

Abstract: Adenocarcinoma of the lung is the leading cause of cancer death worldwide. Here we report molecular profiling of 230 resected lung adenocarcinomas using messenger RNA, microRNA and DNA sequencing integrated with copy number, methylation and proteomic analyses. High rates of somatic mutation were seen(mean 8.9 mutations per megabase). Eighteen genes were statistically significantly mutated, including RIT1 activating mutations and newly described loss-of-function MGA mutations which are mutually exclusive with focal MYC amplification. EGFR mutations were more frequent in female patients, whereas mutations in RBM10 were more common in males. Aberrations in NF1, MET, ERBB2 and RIT1 occurred in 13% of cases and were enriched in samples otherwise lacking an activated oncogene, suggesting a driver role for these events in certain tumours. DNA and mRNA sequence from the same tumour highlighted splicing alterations driven by somatic genomic changes, including exon 14 skipping in MET mRNA in 4% of cases. MAPK and PI(3)K pathway activity, when measured at the protein level, was explained by known mutations in only a fraction of cases, suggesting additional, unexplained mechanisms of pathway activation. These data establish a foundation for classification and further investigations of lung adenocarcinoma molecular pathogenesis.

...read moreread less

2,847 citations

Journal Article•DOI•

Discovery and saturation analysis of cancer genes across 21 tumour types

[...]

Michael S. Lawrence¹, Petar Stojanov², Craig H. Mermel², James T. Robinson¹, Levi A. Garraway², Todd R. Golub³, Matthew Meyerson², Stacey Gabriel¹, Eric S. Lander⁴, Gad Getz² - Show less +6 more•Institutions (4)

Broad Institute¹, Harvard University², Howard Hughes Medical Institute³, Massachusetts Institute of Technology⁴

23 Jan 2014-Nature

TL;DR: It is found that large-scale genomic analysis can identify nearly all known cancer genes in these cancer types and 33 genes that were not previously known to be significantly mutated in cancer, including genes related to proliferation, apoptosis, genome stability, chromatin regulation, immune evasion, RNA processing and protein homeostasis.

...read moreread less

Abstract: Although a few cancer genes are mutated in a high proportion of tumours of a given type (.20%), most are mutated at intermediate frequencies (2–20%). To explore the feasibility of creating a comprehensive catalogue of cancer genes, we analysed somatic point mutations in exome sequences from 4,742 human cancers and their matched normal-tissue samples across 21 cancer types. We found that large-scale genomic analysis can identify nearly all known cancer genes in these tumour types. Our analysis also identified 33 genes that were not previously known to be significantly mutated in cancer, including genes related to proliferation, apoptosis, genome stability, chromatin regulation, immune evasion, RNA processing and protein homeostasis. Down-sampling analysis indicates that larger sample sizes will reveal many more genes mutated at clinically important frequencies. We estimate that near-saturation may be achieved with 600– 5,000 samples per tumour type, depending on background mutation frequency. The results may help to guide the next stage of cancer genomics. Comprehensive knowledge of the genes underlying human cancers is a critical foundation for cancer diagnostics, therapeutics, clinical-trial design and selection of rational combination therapies. It is now possible to use genomic analysis to identify cancer genes in an unbiased fashion, based on the presence of somatic mutations at a rate significantly higher than the expected background level. Systematic studies have revealed many new cancer genes, as well as new classes of cancer genes 1,2 . They have also made clear that, although some cancer genes are mutated at high frequencies, most cancer genes in most patients occur at intermediate frequencies (2–20%) or lower. Accordingly, a complete catalogue of mutations in this frequency class will be essential for recognizing dysregulated pathways and optimal targets for therapeutic intervention. However, recent work suggests major gaps in our knowledge of cancer genes of intermediate frequency. For example, a study of 183 lung adenocarcinomas 3 found that 15% of patients lacked even a single mutation affecting any of the 10 known hallmarks of cancer, and 38% had 3 or fewer such mutations. In this paper, we analysed somatic point mutations (substitutions and small insertion and deletions) in nearly 5,000 human cancers and their matched normal-tissue samples (‘tumour–normal pairs’) across 21 tumour types. The questions that we examine here are: first, whether large-scale genomic analysis across tumour types can reliably identify all known cancer genes; second, whether it will reveal many new candidate cancer genes; and third, how far we are from having a complete catalogue of cancer genes (at least those of intermediate frequency). We used rigorous statistical methods to enumerate candidate cancer genes and then carefully inspected each gene to identify those with strong biological connections to cancer and mutational patterns consistent with the expected function. The analysis reveals nearly all known cancer genes and revealed 33 novel candidates, including genes related to proliferation, apoptosis, genome stability, chromatin regulation, immune evasion, RNA processing and protein homeostasis. Importantly, the data show that the

...read moreread less

2,565 citations

Journal Article•DOI•

Analysis of the human tissue-specific expression by genome-wide integration of transcriptomics and antibody-based proteomics

[...]

Linn Fagerberg¹, Björn M. Hallström¹, Per Oksvold¹, Caroline Kampf², Dijana Djureinovic², Jacob Odeberg¹, Masato Habuka¹, Simin Tahmasebpoor², Angelika Danielsson², Karolina Edlund², Anna Asplund², Evelina Sjöstedt², Emma Lundberg¹, Cristina Al-Khalili Szigyarto¹, Marie Skogs¹, Jenny Ottosson Takanen¹, Holger Berling¹, Hanna Tegel¹, Jan Mulder³, Peter Nilsson¹, Jochen M. Schwenk¹, Cecilia Lindskog², Frida Danielsson¹, Adil Mardinoglu⁴, Åsa Sivertsson¹, Kalle von Feilitzen¹, Mattias Forsberg¹, Martin Zwahlen¹, IngMarie Olsson², Sanjay Navani, Mikael Huss¹, Jens Nielsen⁴, Jens Nielsen¹, Fredrik Pontén², Mathias Uhlén¹ - Show less +31 more•Institutions (4)

Royal Institute of Technology¹, Uppsala University², Science for Life Laboratory³, Chalmers University of Technology⁴

01 Feb 2014-Molecular & Cellular Proteomics

TL;DR: A quantitative transcriptomics analysis (RNA-Seq) is used to classify the tissue-specific expression of genes across a representative set of all major human organs and tissues and combined this analysis with antibody-based profiling of the same tissues.

...read moreread less

2,512 citations

Journal Article•DOI•

Genetic Screens in Human Cells Using the CRISPR-Cas9 System

[...]

Timothy C. Wang, Jenny J. Wei¹, David M. Sabatini, Eric S. Lander², Eric S. Lander³, Eric S. Lander¹ - Show less +2 more•Institutions (3)

Massachusetts Institute of Technology¹, Harvard University², Broad Institute³

03 Jan 2014-Science

TL;DR: In this paper, a pooled, loss-of-function genetic screening approach suitable for both positive and negative selection that uses a genome-scale lentiviral single-guide RNA (sgRNA) library was described.

...read moreread less

Abstract: The bacterial clustered regularly interspaced short palindromic repeats (CRISPR)–Cas9 system for genome editing has greatly expanded the toolbox for mammalian genetics, enabling the rapid generation of isogenic cell lines and mice with modified alleles. Here, we describe a pooled, loss-of-function genetic screening approach suitable for both positive and negative selection that uses a genome-scale lentiviral single-guide RNA (sgRNA) library. sgRNA expression cassettes were stably integrated into the genome, which enabled a complex mutant pool to be tracked by massively parallel sequencing. We used a library containing 73,000 sgRNAs to generate knockout collections and performed screens in two human cell lines. A screen for resistance to the nucleotide analog 6-thioguanine identified all expected members of the DNA mismatch repair pathway, whereas another for the DNA topoisomerase II ( TOP2A ) poison etoposide identified TOP2A , as expected, and also cyclin-dependent kinase 6, CDK6. A negative selection screen for essential genes identified numerous gene sets corresponding to fundamental processes. Last, we show that sgRNA efficiency is associated with specific sequence motifs, enabling the prediction of more effective sgRNAs. Collectively, these results establish Cas9/sgRNA screens as a powerful tool for systematic genetic analysis in mammalian cells.

...read moreread less

2,487 citations

Journal Article•DOI•

Non-viral vectors for gene-based therapy

[...]

Hao Yin¹, Rosemary Lynn Kanasty¹, Ahmed A. Eltoukhy¹, Arturo J. Vegas¹, J. Robert Dorkin¹, Daniel G. Anderson¹ - Show less +2 more•Institutions (1)

Massachusetts Institute of Technology¹

01 Aug 2014-Nature Reviews Genetics

TL;DR: The biological barriers to gene delivery in vivo are introduced and recent advances in material sciences, nanotechnology and nucleic acid chemistry that have yielded promising non-viral delivery systems are discussed, some of which are currently undergoing testing in clinical trials.

...read moreread less

Abstract: Gene-based therapy is the intentional modulation of gene expression in specific cells to treat pathological conditions This modulation is accomplished by introducing exogenous nucleic acids such as DNA, mRNA, small interfering RNA (siRNA), microRNA (miRNA) or antisense oligonucleotides Given the large size and the negative charge of these macromolecules, their delivery is typically mediated by carriers or vectors In this Review, we introduce the biological barriers to gene delivery in vivo and discuss recent advances in material sciences, nanotechnology and nucleic acid chemistry that have yielded promising non-viral delivery systems, some of which are currently undergoing testing in clinical trials The diversity of these systems highlights the recent progress of gene-based therapy using non-viral approaches

...read moreread less

2,460 citations

Journal Article•DOI•

Synaptic, transcriptional and chromatin genes disrupted in autism

[...]

Silvia De Rubeis¹, Xin-Xin He², Arthur P. Goldberg¹, Christopher S. Poultney¹, Kaitlin E. Samocha³, A. Ercument Cicek², Yan Kou¹, Li Liu², Menachem Fromer¹, Menachem Fromer³, R. Susan Walker⁴, Tarjinder Singh⁵, Lambertus Klei⁶, Jack A. Kosmicki³, Shih-Chen Fu¹, Branko Aleksic⁷, Monica Biscaldi⁸, Patrick Bolton⁹, Jessica M. Brownfeld¹, Jinlu Cai¹, Nicholas G. Campbell¹⁰, Angel Carracedo¹¹, Angel Carracedo¹², Maria H. Chahrour³, Andreas G. Chiocchetti, Hilary Coon¹³, Emily L. Crawford¹⁰, Lucy Crooks⁵, Sarah Curran⁹, Geraldine Dawson¹⁴, Eftichia Duketis, Bridget A. Fernandez¹⁵, Louise Gallagher¹⁶, Evan T. Geller¹⁷, Stephen J. Guter¹⁸, R. Sean Hill¹⁹, R. Sean Hill³, Iuliana Ionita-Laza²⁰, Patricia Jiménez González, Helena Kilpinen, Sabine M. Klauck²¹, Alexander Kolevzon¹, Irene Lee²², Jing Lei², Terho Lehtimäki, Chiao-Feng Lin¹⁷, Avi Ma'ayan¹, Christian R. Marshall⁴, Alison L. McInnes²³, Benjamin M. Neale²⁴, Michael John Owen²⁵, Norio Ozaki⁷, Mara Parellada²⁶, Jeremy R. Parr²⁷, Shaun Purcell¹, Kaija Puura, Deepthi Rajagopalan⁴, Karola Rehnström⁵, Abraham Reichenberg¹, Aniko Sabo²⁸, Michael Sachse, Stephen Sanders²⁹, Chad M. Schafer², Martin Schulte-Rüther³⁰, David Skuse²², David Skuse³¹, Christine Stevens²⁴, Peter Szatmari³², Kristiina Tammimies⁴, Otto Valladares¹⁷, Annette Voran³³, Li-San Wang¹⁷, Lauren A. Weiss²⁹, A. Jeremy Willsey²⁹, Timothy W. Yu³, Timothy W. Yu¹⁹, Ryan K. C. Yuen⁴, Edwin H. Cook¹⁸, Christine M. Freitag, Michael Gill¹⁶, Christina M. Hultman³⁴, Thomas Lehner³⁵, Aarno Palotie³⁶, Aarno Palotie³, Aarno Palotie²⁴, Gerard D. Schellenberg¹⁷, Pamela Sklar¹, Matthew W. State²⁹, James S. Sutcliffe¹⁰, Christopher A. Walsh³, Christopher A. Walsh¹⁹, Stephen W. Scherer⁴, Michael E. Zwick³⁷, Jeffrey C. Barrett⁵, David J. Cutler³⁷, Kathryn Roeder², Bernie Devlin⁶, Mark J. Daly³, Mark J. Daly²⁴, Joseph D. Buxbaum¹ - Show less +96 more•Institutions (37)

Icahn School of Medicine at Mount Sinai¹, Carnegie Mellon University², Harvard University³, University of Toronto⁴, Wellcome Trust Sanger Institute⁵, University of Pittsburgh⁶, Nagoya University⁷, University of Freiburg⁸, King's College London⁹, Vanderbilt University¹⁰, University of Santiago de Compostela¹¹, King Abdulaziz University¹², University of Utah¹³, Duke University¹⁴, Memorial University of Newfoundland¹⁵, Trinity College, Dublin¹⁶, University of Pennsylvania¹⁷, University of Illinois at Chicago¹⁸, Boston Children's Hospital¹⁹, Columbia University²⁰, German Cancer Research Center²¹, University College London²², Kaiser Permanente²³, Broad Institute²⁴, Cardiff University²⁵, Complutense University of Madrid²⁶, Newcastle University²⁷, Baylor College of Medicine²⁸, University of California, San Francisco²⁹, RWTH Aachen University³⁰, National Health Service³¹, McMaster University³², Saarland University³³, Karolinska Institutet³⁴, National Institutes of Health³⁵, University of Helsinki³⁶, Emory University³⁷

13 Nov 2014-Nature

TL;DR: Using exome sequencing, it is shown that analysis of rare coding variation in 3,871 autism cases and 9,937 ancestry-matched or parental controls implicates 22 autosomal genes at a false discovery rate of < 0.05, plus a set of 107 genes strongly enriched for those likely to affect risk (FDR < 0.30).

...read moreread less

Abstract: The genetic architecture of autism spectrum disorder involves the interplay of common and rare variants and their impact on hundreds of genes. Using exome sequencing, here we show that analysis of rare coding variation in 3,871 autism cases and 9,937 ancestry-matched or parental controls implicates 22 autosomal genes at a false discovery rate (FDR) < 0.05, plus a set of 107 autosomal genes strongly enriched for those likely to affect risk (FDR < 0.30). These 107 genes, which show unusual evolutionary constraint against mutations, incur de novo loss-of-function mutations in over 5% of autistic subjects. Many of the genes implicated encode proteins for synaptic formation, transcriptional regulation and chromatin-remodelling pathways. These include voltage-gated ion channels regulating the propagation of action potentials, pacemaking and excitability-transcription coupling, as well as histone-modifying enzymes and chromatin remodellers-most prominently those that mediate post-translational lysine methylation/demethylation modifications of histones.

...read moreread less

2,228 citations

Journal Article•DOI•

Defining the role of common variation in the genomic and biological architecture of adult human height

[...]

Andrew R. Wood¹, Tõnu Esko², Jian Yang³, Sailaja Vedantam⁴ +441 more•Institutions (132)

01 Nov 2014-Nature Genetics

TL;DR: This article identified 697 variants at genome-wide significance that together explained one-fifth of the heritability for adult height, and all common variants together captured 60% of heritability.

...read moreread less

Abstract: Using genome-wide data from 253,288 individuals, we identified 697 variants at genome-wide significance that together explained one-fifth of the heritability for adult height. By testing different numbers of variants in independent studies, we show that the most strongly associated ∼2,000, ∼3,700 and ∼9,500 SNPs explained ∼21%, ∼24% and ∼29% of phenotypic variance. Furthermore, all common variants together captured 60% of heritability. The 697 variants clustered in 423 loci were enriched for genes, pathways and tissue types known to be involved in growth and together implicated genes and pathways not highlighted in earlier efforts, such as signaling by fibroblast growth factors, WNT/β-catenin and chondroitin sulfate-related genes. We identified several genes and pathways not previously connected with human skeletal growth, including mTOR, osteoglycin and binding of hyaluronic acid. Our results indicate a genetic architecture for human height that is characterized by a very large but finite number (thousands) of causal variants.

...read moreread less

Journal Article•DOI•

A promoter-level mammalian expression atlas

[...]

Alistair R. R. Forrest, Hideya Kawaji, Michael Rehli¹, J Kenneth Baillie² +277 more•Institutions (63)

27 Mar 2014-Nature

TL;DR: For example, the authors mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body.

...read moreread less

Abstract: Regulated transcription controls the diversity, developmental pathways and spatial organization of the hundreds of cell types that make up a mammal Using single-molecule cDNA sequencing, we mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body We find that few genes are truly 'housekeeping', whereas many mammalian promoters are composite entities composed of several closely separated TSSs, with independent cell-type-specific expression profiles TSSs specific to different cell types evolve at different rates, whereas promoters of broadly expressed genes are the most conserved Promoter-based expression analysis reveals key transcription factors defining cell states and links them to binding-site motifs The functions of identified novel transcripts can be predicted by coexpression and sample ontology enrichment analyses The functional annotation of the mammalian genome 5 (FANTOM5) project provides comprehensive expression profiles and functional annotation of mammalian cell-type-specific transcriptomes with wide applications in biomedical research

...read moreread less

Journal Article•DOI•

A circadian gene expression atlas in mammals: Implications for biology and medicine

[...]

Ray Zhang¹, Nicholas F. Lahens¹, Heather I. Ballance¹, Michael E. Hughes², John B. Hogenesch¹ - Show less +1 more•Institutions (2)

University of Pennsylvania¹, University of Missouri–St. Louis²

11 Nov 2014-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: High-resolution multiorgan expression data is generated showing that nearly half of all genes in the mouse genome oscillate with circadian rhythm somewhere in the body, and the majority of best-selling drugs and World Health Organization essential medicines directly target the products of rhythmic genes.

...read moreread less

Abstract: To characterize the role of the circadian clock in mouse physiology and behavior, we used RNA-seq and DNA arrays to quantify the transcriptomes of 12 mouse organs over time. We found 43% of all protein coding genes showed circadian rhythms in transcription somewhere in the body, largely in an organ-specific manner. In most organs, we noticed the expression of many oscillating genes peaked during transcriptional “rush hours” preceding dawn and dusk. Looking at the genomic landscape of rhythmic genes, we saw that they clustered together, were longer, and had more spliceforms than nonoscillating genes. Systems-level analysis revealed intricate rhythmic orchestration of gene pathways throughout the body. We also found oscillations in the expression of more than 1,000 known and novel noncoding RNAs (ncRNAs). Supporting their potential role in mediating clock function, ncRNAs conserved between mouse and human showed rhythmic expression in similar proportions as protein coding genes. Importantly, we also found that the majority of best-selling drugs and World Health Organization essential medicines directly target the products of rhythmic genes. Many of these drugs have short half-lives and may benefit from timed dosage. In sum, this study highlights critical, systemic, and surprising roles of the mammalian circadian clock and provides a blueprint for advancement in chronotherapy.

...read moreread less

Journal Article•DOI•

An integrated catalog of reference genes in the human gut microbiome

[...]

Junhua Li¹, Huijue Jia, Xianghang Cai, Huanzi Zhong, Qiang Feng², Shinichi Sunagawa, Manimozhiyan Arumugam², Jens Roat Kultima, Edi Prifti³, Trine Nielsen², Agnieszka S. Juncker⁴, Chaysavanh Manichanh, Bing Chen, Wenwei Zhang, Florence Levenez³, Juan Wang, Xun Xu, Liang Xiao, Suisha Liang, Dongya Zhang, Zhaoxi Zhang, Weineng Chen, Hailong Zhao, Jumana Y. Al-Aama⁵, Sherif Edris⁵, Huanming Yang, Jian Wang, Torben Hansen², Henrik Nielsen⁴, Søren Brunak⁴, Karsten Kristiansen², Francisco Guarner, Oluf Pedersen², Joël Doré³, S. Dusko Ehrlich³, Peer Bork, Jun Wang⁶ - Show less +33 more•Institutions (6)

South China University of Technology¹, University of Copenhagen², Institut national de la recherche agronomique³, Technical University of Denmark⁴, King Abdulaziz University⁵, Macau University of Science and Technology⁶

01 Aug 2014-Nature Biotechnology

TL;DR: The integrated gene catalog (IGC) is established comprising 9,879,896 genes, which includes close-to-complete sets of genes for most gut microbes, which are also of considerably higher quality than in previous catalogs.

...read moreread less

Abstract: Many analyses of the human gut microbiome depend on a catalog of reference genes. Existing catalogs for the human gut microbiome are based on samples from single cohorts or on reference genomes or protein sequences, which limits coverage of global microbiome diversity. Here we combined 249 newly sequenced samples of the Metagenomics of the Human Intestinal Tract (MetaHit) project with 1,018 previously sequenced samples to create a cohort from three continents that is at least threefold larger than cohorts used for previous gene catalogs. From this we established the integrated gene catalog (IGC) comprising 9,879,896 genes. The catalog includes close-to-complete sets of genes for most gut microbes, which are also of considerably higher quality than in previous catalogs. Analyses of a group of samples from Chinese and Danish individuals using the catalog revealed country-specific gut microbial signatures. This expanded catalog should facilitate quantitative characterization of metagenomic, metatranscriptomic and metaproteomic data from the gut microbiome to understand its variation across populations in human health and disease.

...read moreread less

Journal Article•DOI•

A chromosome-based draft sequence of the hexaploid bread wheat (Triticum aestivum) genome

[...]

Klaus F. X. Mayer, Jane Rogers, Jaroslav Doležel¹, Curtis J. Pozniak², Kellye Eversole, Catherine Feuillet³, Bikram S. Gill⁴, Bernd Friebe⁴, Adam J. Lukaszewski⁵, Pierre Sourdille⁶, Takashi R. Endo⁷, M. Kubaláková¹, Jarmila Číhalíková¹, Zdeňka Dubská¹, Jan Vrána¹, Romana Šperková¹, Hana Šimková¹, Melanie Febrer⁸, Leah Clissold, Kirsten McLay, Kuldeep Singh⁹, Parveen Chhuneja⁹, Nagendra K. Singh¹⁰, Jitendra P. Khurana¹¹, Eduard Akhunov⁴, Frédéric Choulet⁶, Adriana Alberti, Valérie Barbe, Patrick Wincker, Hiroyuki Kanamori¹², Fuminori Kobayashi¹², Takeshi Itoh¹², Takashi Matsumoto¹², Hiroaki Sakai¹², Tsuyoshi Tanaka¹², Jianzhong Wu¹², Yasunari Ogihara¹³, Hirokazu Handa¹², P. Ron Maclachlan², Andrew G. Sharpe¹⁴, Darrin Klassen¹⁴, David Edwards, Jacqueline Batley, Odd-Arne Olsen, Simen Rød Sandve¹⁵, Sigbjørn Lien¹⁵, Burkhard Steuernagel¹⁶, Brande B. H. Wulff¹⁶, Mario Caccamo, Sarah Ayling, Ricardo H. Ramirez-Gonzalez, Bernardo J. Clavijo, Jonathan M. Wright, Matthias Pfeifer, Manuel Spannagl, Mihaela Martis, Martin Mascher¹⁷, Jarrod Chapman¹⁸, Jesse Poland⁴, Uwe Scholz¹⁷, Kerrie Barry¹⁸, Robbie Waugh¹⁹, Daniel S. Rokhsar¹⁸, Gary J. Muehlbauer, Nils Stein¹⁷, Heidrun Gundlach, Matthias Zytnicki²⁰, Véronique Jamilloux²⁰, Hadi Quesneville²⁰, Thomas Wicker²¹, Primetta Faccioli, Moreno Colaiacovo, Antonio Michele Stanca, Hikmet Budak²², Luigi Cattivelli, Natasha Glover⁶, Lise Pingault⁶, Etienne Paux⁶, Sapna Sharma, Rudi Appels²³, Matthew I. Bellgard²³, Brett Chapman²³, Thomas Nussbaumer, Kai Christian Bader, Hélène Rimbert, Shichen Wang⁴, Ron Knox, Andrzej Kilian, Michael Alaux²⁰, Françoise Alfama²⁰, Loïc Couderc²⁰, Nicolas Guilhot⁶, Claire Viseux²⁰, Mikaël Loaec²⁰, Beat Keller²¹, Sébastien Praud - Show less +92 more•Institutions (23)

18 Jul 2014-Science

TL;DR: Insight into the genome biology of a polyploid crop provide a springboard for faster gene isolation, rapid genetic marker development, and precise breeding to meet the needs of increasing food demand worldwide.

...read moreread less

Abstract: An ordered draft sequence of the 17-gigabase hexaploid bread wheat (Triticum aestivum) genome has been produced by sequencing isolated chromosome arms. We have annotated 124,201 gene loci distributed nearly evenly across the homeologous chromosomes and subgenomes. Comparative gene analysis of wheat subgenomes and extant diploid and tetraploid wheat relatives showed that high sequence similarity and structural conservation are retained, with limited gene loss, after polyploidization. However, across the genomes there was evidence of dynamic gene gain, loss, and duplication since the divergence of the wheat lineages. A high degree of transcriptional autonomy and no global dominance was found for the subgenomes. These insights into the genome biology of a polyploid crop provide a springboard for faster gene isolation, rapid genetic marker development, and precise breeding to meet the needs of increasing food demand worldwide.

...read moreread less

Journal Article•DOI•

Rational design of highly active sgRNAs for CRISPR-Cas9–mediated gene inactivation

[...]

John G. Doench¹, Ella Hartenian¹, Daniel B. Graham¹, Zuzana Tothova², Mudra Hegde¹, Ian Smith¹, Meagan E Sullender¹, Benjamin L. Ebert², Ramnik J. Xavier³, David E. Root¹ - Show less +6 more•Institutions (3)

Broad Institute¹, Brigham and Women's Hospital², Harvard University³

01 Dec 2014-Nature Biotechnology

TL;DR: An online tool for the design of highly active sgRNAs for any gene of interest is provided, including a further optimization of the protospacer-adjacent motif (PAM) of Streptococcus pyogenes Cas9.

...read moreread less

Abstract: Components of the prokaryotic clustered, regularly interspaced, short palindromic repeats (CRISPR) loci have recently been repurposed for use in mammalian cells. The CRISPR-associated (Cas)9 can be programmed with a single guide RNA (sgRNA) to generate site-specific DNA breaks, but there are few known rules governing on-target efficacy of this system. We created a pool of sgRNAs, tiling across all possible target sites of a panel of six endogenous mouse and three endogenous human genes and quantitatively assessed their ability to produce null alleles of their target gene by antibody staining and flow cytometry. We discovered sequence features that improved activity, including a further optimization of the protospacer-adjacent motif (PAM) of Streptococcus pyogenes Cas9. The results from 1,841 sgRNAs were used to construct a predictive model of sgRNA activity to improve sgRNA design for gene editing and genetic screens. We provide an online tool for the design of highly active sgRNAs for any gene of interest.

...read moreread less

Journal Article•DOI•

A comparative encyclopedia of DNA elements in the mouse genome

[...]

Feng Yue¹, Feng Yue², Yong Cheng³, Alessandra Breschi, Jeff Vierstra⁴, Weisheng Wu⁵, Weisheng Wu², Tyrone Ryba⁶, Tyrone Ryba⁷, Richard Sandstrom⁴, Zhihai Ma³, Carrie A. Davis⁸, Benjamin D. Pope⁷, Yin Shen¹, Dmitri D. Pervouchine, Sarah Djebali, Robert E. Thurman⁴, Rajinder Kaul⁴, Eric Rynes⁴, Anthony Kirilusha⁹, Georgi K. Marinov⁹, Brian A. Williams⁹, Diane Trout⁹, Henry Amrhein⁹, Katherine I. Fisher-Aylor⁹, Igor Antoshechkin⁹, Gilberto DeSalvo⁹, Lei Hoon See⁸, Meagan Fastuca⁸, Jorg Drenkow⁸, Chris Zaleski⁸, Alexander Dobin⁸, Pablo Prieto, Julien Lagarde, Giovanni Bussotti, Andrea Tanzer¹⁰, Olgert Denas¹¹, Kanwei Li¹¹, M. A. Bender⁴, M. A. Bender¹², Miaohua Zhang¹², Rachel Byron¹², Mark Groudine¹², Mark Groudine⁴, David McCleary¹, Long Pham¹, Zhen Ye¹, Samantha Kuan¹, Lee Edsall¹, Yi-Chieh Wu¹³, Matthew D. Rasmussen¹³, Mukul S. Bansal¹³, Manolis Kellis¹⁴, Manolis Kellis¹³, Cheryl A. Keller², Christapher S. Morrissey², Tejaswini Mishra², Deepti Jain², Nergiz Dogan², Robert S. Harris², Philip Cayting³, Trupti Kawli³, Alan P. Boyle³, Alan P. Boyle⁵, Ghia Euskirchen³, Anshul Kundaje³, Shin Lin³, Yiing Lin³, Camden Jansen¹⁵, Venkat S. Malladi³, Melissa S. Cline¹⁶, Drew T. Erickson³, Vanessa M. Kirkup¹⁶, Katrina Learned¹⁶, Cricket A. Sloan³, Kate R. Rosenbloom¹⁶, Beatriz Lacerda de Sousa¹⁷, Kathryn Beal, Miguel Pignatelli, Paul Flicek, Jin Lian¹⁸, Tamer Kahveci¹⁹, Dongwon Lee²⁰, W. James Kent¹⁶, Miguel Santos¹⁷, Javier Herrero²¹, Cedric Notredame, Audra K. Johnson⁴, Shinny Vong⁴, Kristen Lee⁴, Daniel Bates⁴, Fidencio Neri⁴, Morgan Diegel⁴, Theresa K. Canfield⁴, Peter J. Sabo⁴, Matthew S. Wilken⁴, Thomas A. Reh⁴, Erika Giste⁴, Anthony Shafer⁴, Tanya Kutyavin⁴, Eric Haugen⁴, Douglas Dunn⁴, Alex Reynolds⁴, Shane Neph⁴, Richard Humbert⁴, R. Scott Hansen⁴, Marella F. T. R. de Bruijn²², Licia Selleri²³, Alexander Y. Rudensky²⁴, Steven Z. Josefowicz²⁴, Robert M. Samstein²⁴, Evan E. Eichler⁴, Stuart H. Orkin²⁵, Dana N. Levasseur²⁶, Thalia Papayannopoulou⁴, Kai Hsin Chang⁴, Arthur I. Skoultchi²⁷, Srikanta Gosh²⁷, Christine M. Disteche⁴, Piper M. Treuting⁴, Yanli Wang², Mitchell J. Weiss, Gerd A. Blobel²⁸, Xiaoyi Cao¹, Sheng Zhong¹, Ting Wang²⁹, Peter J. Good³⁰, Rebecca F. Lowdon²⁹, Rebecca F. Lowdon³⁰, Leslie B. Adams³⁰, Leslie B. Adams³¹, Xiao Qiao Zhou³⁰, Michael J. Pazin³⁰, Elise A. Feingold³⁰, Barbara J. Wold⁹, James Taylor¹¹, Ali Mortazavi¹⁵, Sherman M. Weissman¹⁸, John A. Stamatoyannopoulos⁴, Michael Snyder³, Roderic Guigó, Thomas R. Gingeras⁸, David M. Gilbert⁷, Ross C. Hardison², Michael A. Beer²⁰, Bing Ren¹ - Show less +142 more•Institutions (31)

University of California, San Diego¹, Pennsylvania State University², Stanford University³, University of Washington⁴, University of Michigan⁵, New College of Florida⁶, Florida State University⁷, Cold Spring Harbor Laboratory⁸, California Institute of Technology⁹, University of Vienna¹⁰, Emory University¹¹, Fred Hutchinson Cancer Research Center¹², Massachusetts Institute of Technology¹³, Broad Institute¹⁴, University of California, Irvine¹⁵, University of California, Santa Cruz¹⁶, University of California, San Francisco¹⁷, Yale University¹⁸, University of Florida¹⁹, Johns Hopkins University²⁰, University College London²¹, University of Oxford²², Cornell University²³, Memorial Sloan Kettering Cancer Center²⁴, Harvard University²⁵, University of Iowa²⁶, Yeshiva University²⁷, University of Pennsylvania²⁸, Washington University in St. Louis²⁹, National Institutes of Health³⁰, University of North Carolina at Chapel Hill³¹

20 Nov 2014-Nature

TL;DR: The mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types as mentioned in this paper.

...read moreread less

Abstract: The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases

...read moreread less

Journal Article•DOI•

Expanded identification and characterization of mammalian circular RNAs

[...]

Junjie U. Guo¹, Junjie U. Guo², Vikram Agarwal, Huili Guo, David P. Bartel², David P. Bartel¹ - Show less +2 more•Institutions (2)

Howard Hughes Medical Institute¹, Massachusetts Institute of Technology²

29 Jul 2014-Genome Biology

TL;DR: A computational pipeline to identifycircRNAs and quantify their relative abundance from RNA-seq data is developed, providing a new framework for future investigation of this intriguing topological isoform while raising doubts regarding a biological function of most circRNAs.

...read moreread less

Abstract: Background: The recent reports of two circular RNAs (circRNAs) with strong potential to act as microRNA (miRNA) sponges suggest that circRNAs might play important roles in regulating gene expression. However, the global properties of circRNAs are not well understood. Results: We developed a computational pipeline to identify circRNAs and quantify their relative abundance from RNA-seq data. Applying this pipeline to a large set of non-poly(A)-selected RNA-seq data from the ENCODE project, we annotated 7,112 human circRNAs that were estimated to comprise at least 10% of the transcripts accumulating from their loci. Most circRNAs are expressed in only a few cell types and at low abundance, but they are no more cell-type-specific than are mRNAs with similar overall expression levels. Although most circRNAs overlap protein-coding sequences, ribosome profiling provides no evidence for their translation. We also annotated 635 mouse circRNAs, and although 20% of them are orthologous to human circRNAs, the sequence conservation of these circRNA orthologs is no higher than that of their neighboring linear exons. The previously proposed miR-7 sponge, CDR1as, is one of only two circRNAs with more miRNA sites than expected by chance, with the next best miRNA-sponge candidate deriving from a gene encoding a primate-specific zinc-finger protein, ZNF91. Conclusions: Our results provide a new framework for future investigation of this intriguing topological isoform while raising doubts regarding a biological function of most circRNAs.

...read moreread less

Journal Article•DOI•

Dynamic regulation of genome-wide pre-mRNA splicing and stress tolerance by the Sm-like protein LSm5 in Arabidopsis.

[...]

Peng Cui¹, Shoudong Zhang¹, Feng Ding¹, Shahjahan Ali¹, Liming Xiong¹ - Show less +1 more•Institutions (1)

King Abdullah University of Science and Technology¹

07 Jan 2014-Genome Biology

TL;DR: It is concluded that SAD1 dynamically controls splicing efficiency and splice-site recognition in Arabidopsis, and it is proposed that this may contribute to S AD1-mediated stress tolerance through the metabolism of transcripts expressed from stress-responsive genes.

...read moreread less

Abstract: Sm-like proteins are highly conserved proteins that form the core of the U6 ribonucleoprotein and function in several mRNA metabolism processes, including pre-mRNA splicing. Despite their wide occurrence in all eukaryotes, little is known about the roles of Sm-like proteins in the regulation of splicing. Here, through comprehensive transcriptome analyses, we demonstrate that depletion of the Arabidopsis supersensitive to abscisic acid and drought 1 gene (SAD1), which encodes Sm-like protein 5 (LSm5), promotes an inaccurate selection of splice sites that leads to a genome-wide increase in alternative splicing. In contrast, overexpression of SAD1 strengthens the precision of splice-site recognition and globally inhibits alternative splicing. Further, SAD1 modulates the splicing of stress-responsive genes, particularly under salt-stress conditions. Finally, we find that overexpression of SAD1 in Arabidopsis improves salt tolerance in transgenic plants, which correlates with an increase in splicing accuracy and efficiency for stress-responsive genes. We conclude that SAD1 dynamically controls splicing efficiency and splice-site recognition in Arabidopsis, and propose that this may contribute to SAD1-mediated stress tolerance through the metabolism of transcripts expressed from stress-responsive genes. Our study not only provides novel insights into the function of Sm-like proteins in splicing, but also uncovers new means to improve splicing efficiency and to enhance stress tolerance in a higher eukaryote.

...read moreread less

Journal Article•DOI•

Single-cell RNA-seq reveals dynamic, random monoallelic gene expression in mammalian cells.

[...]

Qiaolin Deng¹, Daniel Ramsköld², Daniel Ramsköld¹, Björn Reinius¹, Björn Reinius², Rickard Sandberg², Rickard Sandberg¹ - Show less +3 more•Institutions (2)

Ludwig Institute for Cancer Research¹, Karolinska Institutet²

10 Jan 2014-Science

TL;DR: It is concluded that independent and stochastic allelic transcription generates abundant random monoallelic expression in the mammalian cell.

...read moreread less

Abstract: Expression from both alleles is generally observed in analyses of diploid cell populations, but studies addressing allelic expression patterns genome-wide in single cells are lacking. Here, we present global analyses of allelic expression across individual cells of mouse preimplantation embryos of mixed background (CAST/EiJ × C57BL/6J). We discovered abundant (12 to 24%) monoallelic expression of autosomal genes and that expression of the two alleles occurs independently. The monoallelic expression appeared random and dynamic because there was considerable variation among closely related embryonic cells. Similar patterns of monoallelic expression were observed in mature cells. Our allelic expression analysis also demonstrates the de novo inactivation of the paternal X chromosome. We conclude that independent and stochastic allelic transcription generates abundant random monoallelic expression in the mammalian cell.

...read moreread less

Journal Article•DOI•

The rise of regulatory RNA.

[...]

Kevin V. Morris¹, John S. Mattick²•Institutions (2)

University of New South Wales¹, Garvan Institute of Medical Research²

01 Jun 2014-Nature Reviews Genetics

TL;DR: A central role for RNA in human evolution and ontogeny is suggested and the emergence of the previously unsuspected world of regulatory RNA from a historical perspective is reviewed.

...read moreread less

Abstract: Discoveries over the past decade portend a paradigm shift in molecular biology. Evidence suggests that RNA is not only functional as a messenger between DNA and protein but also involved in the regulation of genome organization and gene expression, which is increasingly elaborate in complex organisms. Regulatory RNA seems to operate at many levels; in particular, it plays an important part in the epigenetic processes that control differentiation and development. These discoveries suggest a central role for RNA in human evolution and ontogeny. Here, we review the emergence of the previously unsuspected world of regulatory RNA from a historical perspective.

...read moreread less

A comparative encyclopedia of DNA elements in the mouse genome - eScholarship

[...]

Miguel Ramalho-Santos, Yin Shen, Sheng Zhong, Licia Selleri, F Yue, Y Cheng, A Breschi, Jeff Vierstra, W Wu, T Ryba, Richard Sandstrom, Z Ma, C Davis, BD Pope - Show less +10 more

20 Nov 2014

TL;DR: The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways.

...read moreread less

Abstract: © 2014 Macmillan Publishers Limited. All rights reserved.The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain

...read moreread less

Journal Article•DOI•

ARG-ANNOT, a New Bioinformatic Tool To Discover Antibiotic Resistance Genes in Bacterial Genomes

[...]

Sushim K. Gupta¹, Babu Roshan Padmanabhan¹, Seydina M. Diene¹, Rafael López-Rojas, Marie Kempf, Luce Landraud, Jean-Marc Rolain¹ - Show less +3 more•Institutions (1)

Aix-Marseille University¹

01 Jan 2014-Antimicrobial Agents and Chemotherapy

TL;DR: A concise database for BLAST using a Bio-Edit interface that can detect AR genetic determinants in bacterial genomes and can rapidly and easily discover putative new AR geneticeterminants is created.

...read moreread less

Abstract: ARG-ANNOT (Antibiotic Resistance Gene-ANNOTation) is a new bioinformatic tool that was created to detect existing and putative new antibiotic resistance (AR) genes in bacterial genomes. ARG-ANNOT uses a local BLAST program in Bio-Edit software that allows the user to analyze sequences without a Web interface. All AR genetic determinants were collected from published works and online resources; nucleotide and protein sequences were retrieved from the NCBI GenBank database. After building a database that includes 1,689 antibiotic resistance genes, the software was tested in a blind manner using 100 random sequences selected from the database to verify that the sensitivity and specificity were at 100% even when partial sequences were queried. Notably, BLAST analysis results obtained using the rmtF gene sequence (a new aminoglycoside-modifying enzyme gene sequence that is not included in the database) as a query revealed that the tool was able to link this sequence to short sequences (17 to 40 bp) found in other genes of the rmt family with significant E values. Finally, the analysis of 178 Acinetobacter baumannii and 20 Staphylococcus aureus genomes allowed the detection of a significantly higher number of AR genes than the Resfinder gene analyzer and 11 point mutations in target genes known to be associated with AR. The average time for the analysis of a genome was 3.35 ± 0.13 min. We have created a concise database for BLAST using a Bio-Edit interface that can detect AR genetic determinants in bacterial genomes and can rapidly and easily discover putative new AR genetic determinants.

...read moreread less

Journal Article•DOI•

A reference genome for common bean and genome-wide analysis of dual domestications

[...]

Jeremy Schmutz¹, Phillip E. McClean², Sujan Mamidi², G Albert Wu¹, Steven B. Cannon³, Jane Grimwood, Jerry Jenkins, Shengqiang Shu¹, Qijian Song³, Carolina Chavarro⁴, Mirayda Torres-Torres⁴, Valérie Geffroy⁵, Samira Mafi Moghaddam², Dongying Gao⁴, Brian Abernathy⁴, Kerrie Barry¹, Matthew W. Blair⁶, Mark A. Brick⁷, Mansi Chovatia¹, Paul Gepts⁸, David Goodstein¹, Michael D. Gonzales⁴, Uffe Hellsten¹, David L. Hyten³, Gaofeng Jia³, James D. Kelly⁹, Dave Kudrna¹⁰, Rian Lee², Manon M.S. Richard¹¹, Phillip N. Miklas³, Juan M. Osorno², Josiane Rodrigues³, Vincent Thareau¹¹, Carlos A. Urrea¹², Mei Wang¹, Yeisoo Yu¹⁰, Ming Zhang¹, Rod A. Wing¹⁰, Perry B. Cregan³, Daniel S. Rokhsar¹, Scott A. Jackson⁴ - Show less +37 more•Institutions (12)

United States Department of Energy¹, North Dakota State University², United States Department of Agriculture³, University of Georgia⁴, Institut national de la recherche agronomique⁵, Tennessee State University⁶, Colorado State University⁷, University of California, Davis⁸, Michigan State University⁹, University of Arizona¹⁰, University of Paris-Sud¹¹, University of Nebraska–Lincoln¹²

01 Jul 2014-Nature Genetics

TL;DR: 2 independent domestications from genetic pools that diverged before human colonization are confirmed and a set of genes linked with increased leaf and seed size are identified and combined with quantitative trait locus data from Mesoamerican cultivars.

...read moreread less

Abstract: Common bean (Phaseolus vulgaris L.) is the most important grain legume for human consumption and has a role in sustainable agriculture owing to its ability to fix atmospheric nitrogen. We assembled 473 Mb of the 587-Mb genome and genetically anchored 98% of this sequence in 11 chromosome-scale pseudomolecules. We compared the genome for the common bean against the soybean genome to find changes in soybean resulting from polyploidy. Using resequencing of 60 wild individuals and 100 landraces from the genetically differentiated Mesoamerican and Andean gene pools, we confirmed 2 independent domestications from genetic pools that diverged before human colonization. Less than 10% of the 74 Mb of sequence putatively involved in domestication was shared by the two domestication events. We identified a set of genes linked with increased leaf and seed size and combined these results with quantitative trait locus data from Mesoamerican cultivars. Genes affected by domestication may be useful for genomics-enabled crop improvement.

...read moreread less

Journal Article•DOI•

Genome-wide recessive genetic screening in mammalian cells with a lentiviral CRISPR-guide RNA library

[...]

Hiroko Koike-Yusa¹, Yang Li¹, E-Pien Tan¹, Martin Del Castillo Velasco-Herrera¹, Kosuke Yusa¹ - Show less +1 more•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Mar 2014-Nature Biotechnology

TL;DR: The results demonstrate the potential for efficient loss-of-function screening using the CRISPR-Cas9 system and identify 27 known and 4 previously unknown genes implicated in these phenotypes.

...read moreread less

Abstract: Identification of genes influencing a phenotype of interest is frequently achieved through genetic screening by RNA interference (RNAi) or knockouts. However, RNAi may only achieve partial depletion of gene activity, and knockout-based screens are difficult in diploid mammalian cells. Here we took advantage of the efficiency and high throughput of genome editing based on type II, clustered, regularly interspaced, short palindromic repeats (CRISPR)-CRISPR-associated (Cas) systems to introduce genome-wide targeted mutations in mouse embryonic stem cells (ESCs). We designed 87,897 guide RNAs (gRNAs) targeting 19,150 mouse protein-coding genes and used a lentiviral vector to express these gRNAs in ESCs that constitutively express Cas9. Screening the resulting ESC mutant libraries for resistance to either Clostridium septicum alpha-toxin or 6-thioguanine identified 27 known and 4 previously unknown genes implicated in these phenotypes. Our results demonstrate the potential for efficient loss-of-function screening using the CRISPR-Cas9 system.

...read moreread less

Journal Article•DOI•

Enhanced homology-directed human genome engineering by controlled timing of CRISPR/Cas9 delivery.

[...]

Steven Lin¹, Brett T. Staahl¹, Ravi K Alla¹, Jennifer A. Doudna¹•Institutions (1)

University of California, Berkeley¹

15 Dec 2014-eLife

TL;DR: It is shown here that new genetic information can be introduced site-specifically and with high efficiency by homology-directed repair (HDR) of Cas9-induced site- specific double-strand DNA breaks using timed delivery ofCas9-guide RNA ribonucleoprotein (RNP) complexes.

...read moreread less

Abstract: The CRISPR/Cas9 system is a robust genome editing technology that works in human cells, animals and plants based on the RNA-programmed DNA cleaving activity of the Cas9 enzyme. Building on previous work (Jinek et al., 2013), we show here that new genetic information can be introduced site-specifically and with high efficiency by homology-directed repair (HDR) of Cas9-induced site-specific double-strand DNA breaks using timed delivery of Cas9-guide RNA ribonucleoprotein (RNP) complexes. Cas9 RNP-mediated HDR in HEK293T, human primary neonatal fibroblast and human embryonic stem cells was increased dramatically relative to experiments in unsynchronized cells, with rates of HDR up to 38% observed in HEK293T cells. Sequencing of on- and potential off-target sites showed that editing occurred with high fidelity, while cell mortality was minimized. This approach provides a simple and highly effective strategy for enhancing site-specific genome engineering in both transformed and primary human cells.

...read moreread less

Journal Article•DOI•

A framework for the interpretation of de novo mutation in human disease

[...]

Kaitlin E. Samocha¹, Elise B. Robinson¹, Stephen Sanders², Christine Stevens³, Aniko Sabo⁴, Lauren M. McGrath¹, Jack A. Kosmicki⁵, Karola Rehnström⁶, Swapan Mallick¹, Andrew Kirby¹, Dennis P. Wall⁵, Daniel G. MacArthur¹, Daniel G. MacArthur³, Stacey Gabriel³, Mark A. DePristo, Shaun Purcell¹, Shaun Purcell³, Shaun Purcell⁷, Aarno Palotie⁶, Eric Boerwinkle⁸, Joseph D. Buxbaum⁷, Edwin H. Cook⁹, Richard A. Gibbs⁴, Gerard D. Schellenberg¹⁰, James S. Sutcliffe¹¹, Bernie Devlin¹², Kathryn Roeder¹³, Benjamin M. Neale¹, Benjamin M. Neale³, Mark J. Daly³, Mark J. Daly¹ - Show less +27 more•Institutions (13)

Harvard University¹, Yale University², Broad Institute³, Baylor College of Medicine⁴, Beth Israel Deaconess Medical Center⁵, Wellcome Trust Sanger Institute⁶, Icahn School of Medicine at Mount Sinai⁷, University of Texas Health Science Center at Houston⁸, University of Illinois at Chicago⁹, University of Pennsylvania¹⁰, Vanderbilt University¹¹, University of Pittsburgh¹², Carnegie Mellon University¹³

01 Sep 2014-Nature Genetics

TL;DR: This model is used to identify ∼1,000 genes that are significantly lacking in functional coding variation in non-ASD samples and are enriched for de novo loss-of-function mutations identified in ASD cases, suggesting that the role of de noVO mutations in ASDs might reside in fundamental neurodevelopmental processes.

...read moreread less

Abstract: Mark Daly and colleagues present a statistical framework to evaluate the role of de novo mutations in human disease by calibrating a model of de novo mutation rates at the individual gene level. The mutation probabilities defined by their model and list of constrained genes can be used to help identify genetic variants that have a significant role in disease.

...read moreread less

Journal Article•DOI•

Genome-wide binding of the CRISPR endonuclease Cas9 in mammalian cells

[...]

Xuebing Wu¹, David A. Scott², Andrea J. Kriz¹, Anthony C. Chiu¹, Patrick D. Hsu², Daniel B. Dadon¹, Albert W. Cheng¹, Alexandro E. Trevino², Silvana Konermann², Sidi Chen¹, Rudolf Jaenisch¹, Feng Zhang², Phillip A. Sharp¹ - Show less +9 more•Institutions (2)

Massachusetts Institute of Technology¹, McGovern Institute for Brain Research²

01 Jul 2014-Nature Biotechnology

TL;DR: A two-state model for Cas9 binding and cleavage is proposed, in which a seed match triggers binding but extensive pairing with target DNA is required for cleavage.

...read moreread less

Abstract: Bacterial type II CRISPR-Cas9 systems have been widely adapted for RNA-guided genome editing and transcription regulation in eukaryotic cells, yet their in vivo target specificity is poorly understood. Here we mapped genome-wide binding sites of a catalytically inactive Cas9 (dCas9) from Streptococcus pyogenes loaded with single guide RNAs (sgRNAs) in mouse embryonic stem cells (mESCs). Each of the four sgRNAs we tested targets dCas9 to between tens and thousands of genomic sites, frequently characterized by a 5-nucleotide seed region in the sgRNA and an NGG protospacer adjacent motif (PAM). Chromatin inaccessibility decreases dCas9 binding to other sites with matching seed sequences; thus 70% of off-target sites are associated with genes. Targeted sequencing of 295 dCas9 binding sites in mESCs transfected with catalytically active Cas9 identified only one site mutated above background levels. We propose a two-state model for Cas9 binding and cleavage, in which a seed match triggers binding but extensive pairing with target DNA is required for cleavage.

...read moreread less

Journal Article•DOI•

Highly multiplexed subcellular RNA sequencing in situ.

[...]

Je-Hyuk Lee¹, Evan R. Daugharthy¹, Jonathan Scheiman¹, Reza Kalhor¹, Joyce L. Yang¹, Thomas C. Ferrante¹, Richard C. Terry¹, Sauveur S. F. Jeanty¹, Chao Li¹, Ryoji Amamoto¹, Derek T. Peters¹, Brian M. Turczyk¹, Adam H. Marblestone¹, Samuel A. Inverso¹, Amy Bernard², Prashant Mali¹, Xavier Rios¹, John Aach¹, George M. Church¹ - Show less +15 more•Institutions (2)

Harvard University¹, Allen Institute for Brain Science²

21 Mar 2014-Science

TL;DR: FISSEQ is compatible with tissue sections and whole-mount embryos and reduces the limitations of optical resolution and noisy signals on single-molecule detection, and can be used to investigate cellular phenotype, gene regulation, and environment in situ.

...read moreread less

Abstract: Understanding the spatial organization of gene expression with single-nucleotide resolution requires localizing the sequences of expressed RNA transcripts within a cell in situ. Here, we describe fluorescent in situ RNA sequencing (FISSEQ), in which stably cross-linked complementary DNA (cDNA) amplicons are sequenced within a biological sample. Using 30-base reads from 8102 genes in situ, we examined RNA expression and localization in human primary fibroblasts with a simulated wound-healing assay. FISSEQ is compatible with tissue sections and whole-mount embryos and reduces the limitations of optical resolution and noisy signals on single-molecule detection. Our platform enables massively parallel detection of genetic elements, including gene transcripts and molecular barcodes, and can be used to investigate cellular phenotype, gene regulation, and environment in situ.

...read moreread less

Journal Article•DOI•

The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes

[...]

Shengyi Liu¹, Yumei Liu, Xinhua Yang, Chaobo Tong¹, David Edwards², Isobel A. P. Parkin³, Meixia Zhao¹, Jianxin Ma⁴, Jingyin Yu¹, Shunmou Huang¹, Xiyin Wang⁵, Junyi Wang, Kun Lu⁶, Zhiyuan Fang, Ian Bancroft⁷, Tae-Jin Yang⁸, Qiong Hu¹, Xinfa Wang¹, Zhen Yue, Haojie Li, Linfeng Yang, Jian Wu, Qing Zhou, Wanxin Wang, Graham J.W. King⁹, J. Chris Pires¹⁰, Changxin Lu, Zhangyan Wu, Perumal Sampath⁸, Zhuo Wang, Hui Guo⁵, Shengkai Pan, Limei Yang, Jiumeng Min, Dong Zhang⁵, Dianchuan Jin, Wanshun Li, Harry Belcram¹¹, Jinxing Tu¹², Mei Guan¹³, Cunkou Qi, Dezhi Du, Jiana Li⁶, Liangcai Jiang, Jacqueline Batley¹⁴, Andrew G. Sharpe¹⁵, Beom Seok Park, Pradeep Ruperao², Feng Cheng, Nomar Espinosa Waminal⁸, Yin Huang, Caihua Dong¹, Li Wang, Jingping Li⁵, Zhiyong Hu¹, Mu Zhuang, Yi Huang¹, Junyan Huang¹, Jiaqin Shi¹, Desheng Mei¹, Jing Liu¹, Tae-Ho Lee⁵, Jinpeng Wang, Huizhe Jin⁵, Zaiyun Li¹², Xun Li¹³, Jiefu Zhang, Lu Xiao, Yongming Zhou¹², Zhongsong Liu¹³, Xuequn Liu¹⁶, Rui Qin¹⁶, Xu Tang⁵, Wenbin Liu, Yupeng Wang⁵, Yangyong Zhang, Jonghoon Lee⁸, Hyun Hee Kim¹⁷, Xun Xu, Xinming Liang, Wei Hua¹, Xiaowu Wang, Jun Wang¹⁸, Boulos Chalhoub¹¹, Andrew H. Paterson⁵ - Show less +81 more•Institutions (18)

Crops Research Institute¹, Australian Centre for Plant Functional Genomics², Agriculture and Agri-Food Canada³, Purdue University⁴, Plant Genome Mapping Laboratory⁵, Southwest University⁶, University of York⁷, Seoul National University⁸, Southern Cross University⁹, University of Missouri¹⁰, Centre national de la recherche scientifique¹¹, Huazhong Agricultural University¹², Hunan Agricultural University¹³, University of Queensland¹⁴, National Research Council¹⁵, Central University, India¹⁶, Sahmyook University¹⁷, King Abdulaziz University¹⁸

23 May 2014-Nature Communications

TL;DR: A draft genome sequence of Brassica oleracea is described, comparing it with that of its sister species B. rapa to reveal numerous chromosome rearrangements and asymmetrical gene loss in duplicated genomic blocks.

...read moreread less

Abstract: Polyploidization has provided much genetic variation for plant adaptive evolution, but the mechanisms by which the molecular evolution of polyploid genomes establishes genetic architecture underlying species differentiation are unclear Brassica is an ideal model to increase knowledge of polyploid evolution Here we describe a draft genome sequence of Brassica oleracea, comparing it with that of its sister species B rapa to reveal numerous chromosome rearrangements and asymmetrical gene loss in duplicated genomic blocks, asymmetrical amplification of transposable elements, differential gene co-retention for specific pathways and variation in gene expression, including alternative splicing, among a large number of paralogous and orthologous genes Genes related to the production of anticancer phytochemicals and morphological variations illustrate consequences of genome duplication and gene divergence, imparting biochemical and morphological variation to B oleracea This study provides insights into Brassica genome evolution and will underpin research into the many important crops in this genus

...read moreread less

Collapse