Showing papers on "Genome" published in 2014

Open Access

Journal Article•DOI•

featureCounts: an efficient general-purpose program for assigning sequence reads to genomic features

Yang Liao¹, Gordon K. Smyth¹, Wei Shi¹•Institutions (1)

Walter and Eliza Hall Institute of Medical Research¹

01 Apr 2014-Bioinformatics

TL;DR: featureCounts is a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments; it implements highly efficient chromosome hashing and feature blocking techniques.

Abstract: MOTIVATION: Next-generation sequencing technologies generate millions of short sequence reads, which are usually aligned to a reference genome. In many applications, the key information required for downstream analysis is the number of reads mapping to each genomic feature, for example to each exon or each gene. The process of counting reads is called read summarization. Read summarization is required for a great variety of genomic analyses but has so far received relatively little attention in the literature. RESULTS: We present featureCounts, a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments. featureCounts implements highly efficient chromosome hashing and feature blocking techniques. It is considerably faster than existing methods (by an order of magnitude for gene-level summarization) and requires far less computer memory. It works with either single or paired-end reads and provides a wide range of options appropriate for different sequencing applications. AVAILABILITY AND IMPLEMENTATION: featureCounts is available under GNU General Public License as part of the Subread (http://subread.sourceforge.net) or Rsubread (http://www.bioconductor.org) software packages.
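
The read summarization step described in this abstract can be illustrated with a minimal Python sketch. It is not the featureCounts implementation: the fixed-size coordinate bins are only a simplified stand-in for the chromosome hashing and feature blocking named above, and the function names, the `bin_size` parameter, the toy coordinates and the choice to leave reads that overlap several features unassigned are all assumptions made for illustration.

```python
from collections import defaultdict

def build_index(features, bin_size=1000):
    """Group features by chromosome, then by fixed-size coordinate bin.

    Each feature is (name, chrom, start, end) with inclusive coordinates.
    """
    index = defaultdict(lambda: defaultdict(list))
    for name, chrom, start, end in features:
        for b in range(start // bin_size, end // bin_size + 1):
            index[chrom][b].append((name, start, end))
    return index

def count_reads(reads, index, bin_size=1000):
    """Count reads that overlap exactly one feature.

    Reads overlapping zero or several features are left unassigned
    (an assumption of this sketch).
    """
    counts = defaultdict(int)
    for chrom, start, end in reads:
        bins = index.get(chrom, {})
        hits = set()
        # Only features stored in the bins touched by the read are compared.
        for b in range(start // bin_size, end // bin_size + 1):
            for name, f_start, f_end in bins.get(b, ()):
                if start <= f_end and end >= f_start:
                    hits.add(name)
        if len(hits) == 1:
            counts[hits.pop()] += 1
    return dict(counts)

# Toy data (hypothetical coordinates).
features = [
    ("geneA", "chr1", 100, 500),
    ("geneB", "chr1", 450, 900),
    ("geneC", "chr2", 1, 2000),
]
reads = [
    ("chr1", 120, 170),    # overlaps geneA only
    ("chr1", 460, 510),    # overlaps geneA and geneB: unassigned
    ("chr2", 1500, 1550),  # overlaps geneC only
    ("chr3", 10, 60),      # chromosome without features: unassigned
]
counts = count_reads(reads, build_index(features))
print(counts)
```

Looking up the chromosome first and then only the bins a read touches keeps the number of interval comparisons per read small, which is the general idea behind indexing features before counting.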

14,103 citations

Journal Article•DOI•

A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping

Suhas S.P. Rao¹, Miriam H. Huntley¹, Neva C. Durand, Elena K. Stamenova, Ivan D. Bochkov¹, James T. Robinson¹,², Adrian L. Sanborn¹, Ido Machol¹,³, Arina D. Omer¹,³, Eric S. Lander²,⁴,⁵, Erez Lieberman Aiden•Institutions (5)

Baylor College of Medicine¹, Broad Institute², Rice University³, Harvard University⁴, Massachusetts Institute of Technology⁵

18 Dec 2014-Cell

TL;DR: In situ Hi-C is used to probe the 3D architecture of genomes, constructing haploid and diploid maps of nine cell types, identifying ∼10,000 loops that frequently link promoters and enhancers, correlate with gene activation, and show conservation across cell types and species.

5,945 citations

Journal Article•DOI•

Development and applications of CRISPR-Cas9 for genome engineering.

Patrick D. Hsu¹,²,³, Eric S. Lander¹, Feng Zhang¹,³•Institutions (3)

Broad Institute¹, Harvard University², McGovern Institute for Brain Research³

05 Jun 2014-Cell

TL;DR: This review describes the development and applications of Cas9 for a variety of research or translational applications, while highlighting challenges and future directions.

4,361 citations

Journal Article•DOI•

The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST)

Ross Overbeek¹, Robert Olson¹, Gordon D. Pusch¹, Gary J. Olsen¹, James J. Davis¹, Terry Disz¹, Robert Edwards², Svetlana Gerdes¹, Bruce Parrello¹, Maulik Shukla³, Veronika Vonstein¹, Alice R. Wattam³, Fangfang Xia¹, Rick Stevens¹•Institutions (3)

University of Illinois at Urbana–Champaign¹, San Diego State University², Virginia Tech³

01 Jan 2014-Nucleic Acids Research

TL;DR: The interconnectedness of the SEED database and RAST, the RAST annotation pipeline and updates to both resources are described.

Abstract: In 2004, the SEED (http://pubseed.theseed.org/) was created to provide consistent and accurate genome annotations across thousands of genomes and as a platform for discovering and developing de novo annotations. The SEED is a constantly updated integration of genomic data with a genome database, web front end, API and server scripts. It is used by many scientists for predicting gene functions and discovering new pathways. In addition to being a powerful database for bioinformatics research, the SEED also houses subsystems (collections of functionally related protein families) and their derived FIGfams (protein families), which represent the core of the RAST annotation engine (http://rast.nmpdr.org/). When a new genome is submitted to RAST, genes are called and their annotations are made by comparison to the FIGfam collection. If the genome is made public, it is then housed within the SEED and its proteins populate the FIGfam collection. This annotation cycle has proven to be a robust and scalable solution to the problem of annotating the exponentially increasing number of genomes. To date, >12 000 users worldwide have annotated >60 000 distinct genomes using RAST. Here we describe the interconnectedness of the SEED database and RAST, the RAST annotation pipeline and updates to both resources.

3,415 citations

Development and Applications of CRISPR-Cas9 for Genome Engineering

Patrick D. Hsu¹,²,³, Eric S. Lander³, Feng Zhang¹,³•Institutions (3)

McGovern Institute for Brain Research¹, Harvard University², Broad Institute³

01 Jun 2014

TL;DR: The development and applications of Cas9 are described for a variety of research or translational applications while highlighting challenges as well as future directions.

Abstract: Recent advances in genome engineering technologies based on the CRISPR-associated RNA-guided endonuclease Cas9 are enabling the systematic interrogation of mammalian genome function. Analogous to the search function in modern word processors, Cas9 can be guided to specific locations within complex genomes by a short RNA search string. Using this system, DNA sequences within the endogenous genome and their functional outputs are now easily edited or modulated in virtually any organism of choice. Cas9-mediated genetic perturbation is simple and scalable, empowering researchers to elucidate the functional organization of the genome at the systems level and establish causal linkages between genetic variations and biological phenotypes. In this Review, we describe the development and applications of Cas9 for a variety of research or translational applications while highlighting challenges as well as future directions. Derived from a remarkable microbial defense system, Cas9 is driving innovative applications from basic biology to biotechnology and medicine.

3,270 citations

Journal Article•DOI•

CRISPR-Cas systems for editing, regulating and targeting genomes

Jeffry D. Sander¹, J. Keith Joung¹•Institutions (1)

Harvard University¹

01 Apr 2014-Nature Biotechnology

TL;DR: CRISPR-based RNA-guided nucleases such as Cas9 enable rapid, efficient editing of endogenous genes, and a modified version of the CRISPR-Cas9 system has been developed to recruit heterologous domains that can regulate endogenous gene expression or label specific genomic loci in living cells.

Abstract: Targeted genome editing using engineered nucleases has rapidly gone from being a niche technology to a mainstream method used by many biological researchers. This widespread adoption has been largely fueled by the emergence of the clustered, regularly interspaced, short palindromic repeat (CRISPR) technology, an important new approach for generating RNA-guided nucleases, such as Cas9, with customizable specificities. Genome editing mediated by these nucleases has been used to rapidly, easily and efficiently modify endogenous genes in a wide variety of biomedically important cell types and in organisms that have traditionally been challenging to manipulate genetically. Furthermore, a modified version of the CRISPR-Cas9 system has been developed to recruit heterologous domains that can regulate endogenous gene expression or label specific genomic loci in living cells. Although the genome-wide specificities of CRISPR-Cas9 systems remain to be fully defined, the power of these systems to perform targeted, highly efficient alterations of genome sequence and gene expression will undoubtedly transform biological research and spur the development of novel molecular therapeutics for human disease.

2,930 citations

Journal Article•DOI•

Data, information, knowledge and principle: back to metabolism in KEGG

Minoru Kanehisa¹, Susumu Goto¹, Yoko Sato¹, Masayuki Kawashima¹, Miho Furumichi¹, Mao Tanabe¹•Institutions (1)

Kyoto University¹

01 Jan 2014-Nucleic Acids Research

TL;DR: The reaction modules, which represent chemical units of reactions, have been used to analyze design principles of metabolic networks and to improve the definition of K numbers and associated annotations.

Abstract: In the hierarchy of data, information and knowledge, computational methods play a major role in the initial processing of data to extract information, but they alone become less effective to compile knowledge from information. The Kyoto Encyclopedia of Genes and Genomes (KEGG) resource (http://www.kegg.jp/ or http://www.genome.jp/kegg/) has been developed as a reference knowledge base to assist this latter process. In particular, the KEGG pathway maps are widely used for biological interpretation of genome sequences and other high-throughput data. The link from genomes to pathways is made through the KEGG Orthology system, a collection of manually defined ortholog groups identified by K numbers. To better automate this interpretation process the KEGG modules defined by Boolean expressions of K numbers have been expanded and improved. Once genes in a genome are annotated with K numbers, the KEGG modules can be computationally evaluated revealing metabolic capacities and other phenotypic features. The reaction modules, which represent chemical units of reactions, have been used to analyze design principles of metabolic networks and also to improve the definition of K numbers and associated annotations. For translational bioinformatics, the KEGG MEDICUS resource has been developed by integrating drug labels (package inserts) used in society.
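
The idea that a module defined as a Boolean expression of K numbers can be evaluated computationally once a genome is annotated can be sketched as follows. This is only an illustration under stated assumptions: the nested-tuple representation is hypothetical and is not KEGG's own module notation, and the K numbers are used as placeholders, not as a real KEGG module definition.

```python
def module_complete(expr, annotated):
    """Evaluate a Boolean expression of K numbers against a genome's annotations.

    expr is either a K number string or a tuple ("and" | "or", sub-expression, ...).
    annotated is the set of K numbers assigned to the genome's genes.
    """
    if isinstance(expr, str):
        return expr in annotated
    op, *args = expr
    results = [module_complete(arg, annotated) for arg in args]
    if op == "and":
        return all(results)
    if op == "or":
        return any(results)
    raise ValueError(f"unknown operator: {op!r}")

# Hypothetical module: step 1 AND (either of two alternatives) AND step 3.
module = ("and", "K00001", ("or", "K00002", "K00003"), "K00004")

genome_a = {"K00001", "K00003", "K00004"}  # has one of the two alternatives
genome_b = {"K00001", "K00004"}            # lacks both alternatives

print(module_complete(module, genome_a))
print(module_complete(module, genome_b))
```

In this sketch a genome either satisfies the whole expression or does not; reporting which steps are missing would be a natural extension.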

2,808 citations

Journal Article•DOI•

Discovery and saturation analysis of cancer genes across 21 tumour types

Michael S. Lawrence¹, Petar Stojanov², Craig H. Mermel², James T. Robinson¹, Levi A. Garraway², Todd R. Golub³, Matthew Meyerson², Stacey Gabriel¹, Eric S. Lander⁴, Gad Getz²•Institutions (4)

Broad Institute¹, Harvard University², Howard Hughes Medical Institute³, Massachusetts Institute of Technology⁴

23 Jan 2014-Nature

TL;DR: It is found that large-scale genomic analysis can identify nearly all known cancer genes in these cancer types and 33 genes that were not previously known to be significantly mutated in cancer, including genes related to proliferation, apoptosis, genome stability, chromatin regulation, immune evasion, RNA processing and protein homeostasis.

Abstract: Although a few cancer genes are mutated in a high proportion of tumours of a given type (>20%), most are mutated at intermediate frequencies (2–20%). To explore the feasibility of creating a comprehensive catalogue of cancer genes, we analysed somatic point mutations in exome sequences from 4,742 human cancers and their matched normal-tissue samples across 21 cancer types. We found that large-scale genomic analysis can identify nearly all known cancer genes in these tumour types. Our analysis also identified 33 genes that were not previously known to be significantly mutated in cancer, including genes related to proliferation, apoptosis, genome stability, chromatin regulation, immune evasion, RNA processing and protein homeostasis. Down-sampling analysis indicates that larger sample sizes will reveal many more genes mutated at clinically important frequencies. We estimate that near-saturation may be achieved with 600–5,000 samples per tumour type, depending on background mutation frequency. The results may help to guide the next stage of cancer genomics. Comprehensive knowledge of the genes underlying human cancers is a critical foundation for cancer diagnostics, therapeutics, clinical-trial design and selection of rational combination therapies. It is now possible to use genomic analysis to identify cancer genes in an unbiased fashion, based on the presence of somatic mutations at a rate significantly higher than the expected background level. Systematic studies have revealed many new cancer genes, as well as new classes of cancer genes [1,2]. They have also made clear that, although some cancer genes are mutated at high frequencies, most cancer genes in most patients occur at intermediate frequencies (2–20%) or lower. Accordingly, a complete catalogue of mutations in this frequency class will be essential for recognizing dysregulated pathways and optimal targets for therapeutic intervention.
However, recent work suggests major gaps in our knowledge of cancer genes of intermediate frequency. For example, a study of 183 lung adenocarcinomas [3] found that 15% of patients lacked even a single mutation affecting any of the 10 known hallmarks of cancer, and 38% had 3 or fewer such mutations. In this paper, we analysed somatic point mutations (substitutions and small insertions and deletions) in nearly 5,000 human cancers and their matched normal-tissue samples (‘tumour–normal pairs’) across 21 tumour types. The questions that we examine here are: first, whether large-scale genomic analysis across tumour types can reliably identify all known cancer genes; second, whether it will reveal many new candidate cancer genes; and third, how far we are from having a complete catalogue of cancer genes (at least those of intermediate frequency). We used rigorous statistical methods to enumerate candidate cancer genes and then carefully inspected each gene to identify those with strong biological connections to cancer and mutational patterns consistent with the expected function. The analysis reveals nearly all known cancer genes and 33 novel candidates, including genes related to proliferation, apoptosis, genome stability, chromatin regulation, immune evasion, RNA processing and protein homeostasis. Importantly, the data show that the

2,565 citations

Journal Article•DOI•

Analysis of the human tissue-specific expression by genome-wide integration of transcriptomics and antibody-based proteomics

Linn Fagerberg¹, Björn M. Hallström¹, Per Oksvold¹, Caroline Kampf², Dijana Djureinovic², Jacob Odeberg¹, Masato Habuka¹, Simin Tahmasebpoor², Angelika Danielsson², Karolina Edlund², Anna Asplund², Evelina Sjöstedt², Emma Lundberg¹, Cristina Al-Khalili Szigyarto¹, Marie Skogs¹, Jenny Ottosson Takanen¹, Holger Berling¹, Hanna Tegel¹, Jan Mulder³, Peter Nilsson¹, Jochen M. Schwenk¹, Cecilia Lindskog², Frida Danielsson¹, Adil Mardinoglu⁴, Åsa Sivertsson¹, Kalle von Feilitzen¹, Mattias Forsberg¹, Martin Zwahlen¹, IngMarie Olsson², Sanjay Navani, Mikael Huss¹, Jens Nielsen¹,⁴, Fredrik Pontén², Mathias Uhlén¹•Institutions (4)

Royal Institute of Technology¹, Uppsala University², Science for Life Laboratory³, Chalmers University of Technology⁴

01 Feb 2014-Molecular & Cellular Proteomics

TL;DR: Quantitative transcriptomics analysis (RNA-Seq) is used to classify the tissue-specific expression of genes across a representative set of all major human organs and tissues, and this analysis is combined with antibody-based profiling of the same tissues.

2,512 citations

Journal Article•DOI•

Genetic Screens in Human Cells Using the CRISPR-Cas9 System

Timothy C. Wang, Jenny J. Wei¹, David M. Sabatini, Eric S. Lander¹,²,³•Institutions (3)

Massachusetts Institute of Technology¹, Harvard University², Broad Institute³

03 Jan 2014-Science

TL;DR: A pooled, loss-of-function genetic screening approach suitable for both positive and negative selection, which uses a genome-scale lentiviral single-guide RNA (sgRNA) library, is described.

Abstract: The bacterial clustered regularly interspaced short palindromic repeats (CRISPR)–Cas9 system for genome editing has greatly expanded the toolbox for mammalian genetics, enabling the rapid generation of isogenic cell lines and mice with modified alleles. Here, we describe a pooled, loss-of-function genetic screening approach suitable for both positive and negative selection that uses a genome-scale lentiviral single-guide RNA (sgRNA) library. sgRNA expression cassettes were stably integrated into the genome, which enabled a complex mutant pool to be tracked by massively parallel sequencing. We used a library containing 73,000 sgRNAs to generate knockout collections and performed screens in two human cell lines. A screen for resistance to the nucleotide analog 6-thioguanine identified all expected members of the DNA mismatch repair pathway, whereas another for the DNA topoisomerase II (TOP2A) poison etoposide identified TOP2A, as expected, and also cyclin-dependent kinase 6, CDK6. A negative selection screen for essential genes identified numerous gene sets corresponding to fundamental processes. Last, we show that sgRNA efficiency is associated with specific sequence motifs, enabling the prediction of more effective sgRNAs. Collectively, these results establish Cas9/sgRNA screens as a powerful tool for systematic genetic analysis in mammalian cells.

2,487 citations

Journal Article•DOI•

Towards a taxonomic coherence between average nucleotide identity and 16S rRNA gene sequence similarity for species demarcation of prokaryotes

Mincheol Kim¹, Hyunseok Oh¹, Sang-Cheol Park¹, Jongsik Chun¹•Institutions (1)

Seoul National University¹

01 Feb 2014-International Journal of Systematic and Evolutionary Microbiology

TL;DR: The overall distribution of ANI values generated by pairwise comparison of 6787 genomes of prokaryotes belonging to 22 phyla was investigated, finding an apparent distinction in the overall ANI distribution between intra- and interspecies relationships at around 95-96% ANI.

Abstract: Among available genome relatedness indices, average nucleotide identity (ANI) is one of the most robust measurements of genomic relatedness between strains, and has great potential in the taxonomy of bacteria and archaea as a substitute for the labour-intensive DNA–DNA hybridization (DDH) technique. An ANI threshold range (95–96 %) for species demarcation had previously been suggested based on comparative investigation between DDH and ANI values, albeit with rather limited datasets. Furthermore, its generality was not tested on all lineages of prokaryotes. Here, we investigated the overall distribution of ANI values generated by pairwise comparison of 6787 genomes of prokaryotes belonging to 22 phyla to see whether the suggested range can be applied to all species. There was an apparent distinction in the overall ANI distribution between intra- and interspecies relationships at around 95–96 % ANI. We went on to determine which level of 16S rRNA gene sequence similarity corresponds to the currently accepted ANI threshold for species demarcation using over one million comparisons. A twofold cross-validation statistical test revealed that 98.65 % 16S rRNA gene sequence similarity can be used as the threshold for differentiating two species, which is consistent with previous suggestions (98.2–99.0 %) derived from comparative studies between DDH and 16S rRNA gene sequence similarity. Our findings should be useful in accelerating the use of genomic sequence data in the taxonomy of bacteria and archaea.

Journal Article•DOI•

A draft map of the human proteome

Min-Sik Kim¹, Sneha M. Pinto, Derese Getnet¹, Raja Sekhar Nirujogi, Srikanth S. Manda, Raghothama Chaerkady², Anil K. Madugundu, Dhanashree S. Kelkar, Ruth Isserlin³, Shobhit Jain³, Joji Kurian Thomas, Babylakshmi Muthusamy, Pamela Leal-Rojas¹, Pamela Leal-Rojas⁴, Praveen Kumar, Nandini A. Sahasrabuddhe, Lavanya Balakrishnan, Jayshree Advani, Bijesh George, Santosh Renuse, Lakshmi Dhevi N. Selvan, Arun H. Patil, Vishalakshi Nanjappa, Aneesha Radhakrishnan, Samarjeet Prasad¹, Tejaswini Subbannayya, Rajesh Raju, Manish Kumar, Sreelakshmi K. Sreenivasamurthy, Arivusudar Marimuthu, Gajanan Sathe, Sandip Chavan, Keshava K. Datta, Yashwanth Subbannayya, Apeksha Sahu, Soujanya D. Yelamanchi, Savita Jayaram, Pavithra Rajagopalan, Jyoti Sharma, Krishna R Murthy, Nazia Syed, Renu Goel, Aafaque Ahmad Khan, Sartaj Ahmad, Gourav Dey, Keshav Mudgal⁵, Aditi Chatterjee, Tai-Chung Huang¹, Jun Zhong¹, Xinyan Wu², Patrick G. Shaw¹, Donald Freed¹, Muhammad Saddiq Zahari¹, Kanchan K Mukherjee⁶, Subramanian Shankar⁷, Anita Mahadevan⁸, Henry H N Lam⁹, Chris J. Mitchell¹, Susarla K. Shankar⁸, Parthasarathy Satishchandra⁸, John T. Schroeder¹, Ravi Sirdeshmukh, Anirban Maitra¹, Steven D. Leach¹, Charles G. Drake¹, Marc K. Halushka¹, T. S. Keshava Prasad, Ralph H. Hruban¹, Candace L. Kerr¹, Candace L. Kerr¹⁰, Gary D. Bader³, Christine A. Iacobuzio-Donahue¹, Harsha Gowda, Akhilesh Pandey - Show less +70 more•Institutions (10)

Johns Hopkins University¹, Johns Hopkins University School of Medicine², University of Toronto³, University of La Frontera⁴, Imperial College London⁵, Post Graduate Institute of Medical Education and Research⁶, Armed Forces Medical College⁷, National Institute of Mental Health and Neurosciences⁸, Hong Kong University of Science and Technology⁹, University of Maryland, Baltimore¹⁰

29 May 2014-Nature

TL;DR: A draft map of the human proteome is presented using high-resolution Fourier-transform mass spectrometry, leading to the discovery of a number of novel protein-coding regions, including translated pseudogenes, non-coding RNAs and upstream open reading frames.

Abstract: The availability of human genome sequence has transformed biomedical research over the past decade. However, an equivalent map for the human proteome with direct measurements of proteins and peptides does not exist yet. Here we present a draft map of the human proteome using high-resolution Fourier-transform mass spectrometry. In-depth proteomic profiling of 30 histologically normal human samples, including 17 adult tissues, 7 fetal tissues and 6 purified primary haematopoietic cells, resulted in identification of proteins encoded by 17,294 genes accounting for approximately 84% of the total annotated protein-coding genes in humans. A unique and comprehensive strategy for proteogenomic analysis enabled us to discover a number of novel protein-coding regions, which includes translated pseudogenes, non-coding RNAs and upstream open reading frames. This large human proteome catalogue (available as an interactive web-based resource at http://www.humanproteomemap.org) will complement available human genome and transcriptome data to accelerate biomedical research in health and disease.

Journal Article•DOI•

Cas-OFFinder: a fast and versatile algorithm that searches for potential off-target sites of Cas9 RNA-guided endonucleases

Sangsu Bae¹, Jeongbin Park¹, Jin-Soo Kim¹•Institutions (1)

Seoul National University¹

15 May 2014-Bioinformatics

TL;DR: Cas-OFFinder is a novel algorithm that searches for potential off-target sites in a given genome or user-defined sequences and allows variations in protospacer-adjacent motif sequences recognized by Cas9, the essential protein component in RGENs.

Abstract: Summary: The Type II clustered regularly interspaced short palindromic repeats (CRISPR)/Cas system is an adaptive immune response in prokaryotes, protecting host cells against invading phages or plasmids by cleaving these foreign DNA species in a targeted manner. CRISPR/Cas-derived RNA-guided engineered nucleases (RGENs) enable genome editing in cultured cells, animals and plants, but are limited by off-target mutations. Here, we present a novel algorithm termed Cas-OFFinder that searches for potential off-target sites in a given genome or user-defined sequences. Unlike other algorithms currently available for identification of RGEN off-target sites, Cas-OFFinder is not limited by the number of mismatches and allows variations in protospacer-adjacent motif sequences recognized by Cas9, the essential protein component in RGENs. Cas-OFFinder is available as a command-line program or accessible via our website. Availability and implementation: Cas-OFFinder free access at http://www.rgenome.net/cas-offinder. Contact: rk.ca.uns@uaseab or rk.ca.uns@10miksj
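
The kind of search described in this abstract, finding candidate sites that match a guide sequence up to a chosen number of mismatches next to a PAM that may contain degenerate bases, can be sketched in a few lines of Python. This is a brute-force illustration, not the Cas-OFFinder algorithm: it scans the forward strand only, and the function name, the toy 7-nt guide and the toy sequence are assumptions made for the example.

```python
# IUPAC nucleotide codes, used here to allow degenerate PAM patterns such as NGG.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

def find_off_targets(sequence, guide, pam, max_mismatches):
    """Scan the forward strand for guide+PAM sites.

    Returns (position, site, mismatches) for every window whose PAM matches
    the IUPAC pattern and whose protospacer differs from the guide at no
    more than max_mismatches positions.
    """
    hits = []
    site_len = len(guide) + len(pam)
    for i in range(len(sequence) - site_len + 1):
        protospacer = sequence[i:i + len(guide)]
        pam_seq = sequence[i + len(guide):i + site_len]
        # The PAM must match exactly, base by base, under the IUPAC pattern.
        if not all(base in IUPAC[code] for base, code in zip(pam_seq, pam)):
            continue
        mismatches = sum(1 for a, b in zip(protospacer, guide) if a != b)
        if mismatches <= max_mismatches:
            hits.append((i, protospacer + pam_seq, mismatches))
    return hits

# Toy example: a shortened guide (real guides are typically about 20 nt).
toy_sequence = "CCGATTACATGGAAGATCACAAGGTT"
hits = find_off_targets(toy_sequence, guide="GATTACA", pam="NGG", max_mismatches=1)
print(hits)
```

A fuller search would also scan the reverse-complement strand and would need a far more efficient strategy than this per-window comparison to handle whole genomes.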

Journal Article•DOI•

A chromosome-based draft sequence of the hexaploid bread wheat (Triticum aestivum) genome

Klaus F. X. Mayer, Jane Rogers, Jaroslav Doležel¹, Curtis J. Pozniak², Kellye Eversole, Catherine Feuillet³, Bikram S. Gill⁴, Bernd Friebe⁴, Adam J. Lukaszewski⁵, Pierre Sourdille⁶, Takashi R. Endo⁷, M. Kubaláková¹, Jarmila Číhalíková¹, Zdeňka Dubská¹, Jan Vrána¹, Romana Šperková¹, Hana Šimková¹, Melanie Febrer⁸, Leah Clissold, Kirsten McLay, Kuldeep Singh⁹, Parveen Chhuneja⁹, Nagendra K. Singh¹⁰, Jitendra P. Khurana¹¹, Eduard Akhunov⁴, Frédéric Choulet⁶, Adriana Alberti, Valérie Barbe, Patrick Wincker, Hiroyuki Kanamori¹², Fuminori Kobayashi¹², Takeshi Itoh¹², Takashi Matsumoto¹², Hiroaki Sakai¹², Tsuyoshi Tanaka¹², Jianzhong Wu¹², Yasunari Ogihara¹³, Hirokazu Handa¹², P. Ron Maclachlan², Andrew G. Sharpe¹⁴, Darrin Klassen¹⁴, David Edwards, Jacqueline Batley, Odd-Arne Olsen, Simen Rød Sandve¹⁵, Sigbjørn Lien¹⁵, Burkhard Steuernagel¹⁶, Brande B. H. Wulff¹⁶, Mario Caccamo, Sarah Ayling, Ricardo H. Ramirez-Gonzalez, Bernardo J. Clavijo, Jonathan M. Wright, Matthias Pfeifer, Manuel Spannagl, Mihaela Martis, Martin Mascher¹⁷, Jarrod Chapman¹⁸, Jesse Poland⁴, Uwe Scholz¹⁷, Kerrie Barry¹⁸, Robbie Waugh¹⁹, Daniel S. Rokhsar¹⁸, Gary J. Muehlbauer, Nils Stein¹⁷, Heidrun Gundlach, Matthias Zytnicki²⁰, Véronique Jamilloux²⁰, Hadi Quesneville²⁰, Thomas Wicker²¹, Primetta Faccioli, Moreno Colaiacovo, Antonio Michele Stanca, Hikmet Budak²², Luigi Cattivelli, Natasha Glover⁶, Lise Pingault⁶, Etienne Paux⁶, Sapna Sharma, Rudi Appels²³, Matthew I. Bellgard²³, Brett Chapman²³, Thomas Nussbaumer, Kai Christian Bader, Hélène Rimbert, Shichen Wang⁴, Ron Knox, Andrzej Kilian, Michael Alaux²⁰, Françoise Alfama²⁰, Loïc Couderc²⁰, Nicolas Guilhot⁶, Claire Viseux²⁰, Mikaël Loaec²⁰, Beat Keller²¹, Sébastien Praud - Show less +92 more•Institutions (23)

18 Jul 2014-Science

TL;DR: Insights into the genome biology of a polyploid crop provide a springboard for faster gene isolation, rapid genetic marker development, and precise breeding to meet the needs of increasing food demand worldwide.

Abstract: An ordered draft sequence of the 17-gigabase hexaploid bread wheat (Triticum aestivum) genome has been produced by sequencing isolated chromosome arms. We have annotated 124,201 gene loci distributed nearly evenly across the homeologous chromosomes and subgenomes. Comparative gene analysis of wheat subgenomes and extant diploid and tetraploid wheat relatives showed that high sequence similarity and structural conservation are retained, with limited gene loss, after polyploidization. However, across the genomes there was evidence of dynamic gene gain, loss, and duplication since the divergence of the wheat lineages. A high degree of transcriptional autonomy and no global dominance was found for the subgenomes. These insights into the genome biology of a polyploid crop provide a springboard for faster gene isolation, rapid genetic marker development, and precise breeding to meet the needs of increasing food demand worldwide.

Journal Article•DOI•

A comparative encyclopedia of DNA elements in the mouse genome

Feng Yue¹, Feng Yue², Yong Cheng³, Alessandra Breschi, Jeff Vierstra⁴, Weisheng Wu¹, Weisheng Wu⁵, Tyrone Ryba⁶, Tyrone Ryba⁷, Richard Sandstrom⁴, Zhihai Ma³, Carrie A. Davis⁸, Benjamin D. Pope⁷, Yin Shen², Dmitri D. Pervouchine, Sarah Djebali, Robert E. Thurman⁴, Rajinder Kaul⁴, Eric Rynes⁴, Anthony Kirilusha⁹, Georgi K. Marinov⁹, Brian A. Williams⁹, Diane Trout⁹, Henry Amrhein⁹, Katherine I. Fisher-Aylor⁹, Igor Antoshechkin⁹, Gilberto DeSalvo⁹, Lei Hoon See⁸, Meagan Fastuca⁸, Jorg Drenkow⁸, Chris Zaleski⁸, Alexander Dobin⁸, Pablo Prieto, Julien Lagarde, Giovanni Bussotti, Andrea Tanzer¹⁰, Olgert Denas¹¹, Kanwei Li¹¹, M. A. Bender⁴, M. A. Bender¹², Miaohua Zhang¹², Rachel Byron¹², Mark Groudine⁴, Mark Groudine¹², David McCleary², Long Pham², Zhen Ye², Samantha Kuan², Lee Edsall², Yi-Chieh Wu¹³, Matthew D. Rasmussen¹³, Mukul S. Bansal¹³, Manolis Kellis¹⁴, Manolis Kellis¹³, Cheryl A. Keller¹, Christapher S. Morrissey¹, Tejaswini Mishra¹, Deepti Jain¹, Nergiz Dogan¹, Robert S. Harris¹, Philip Cayting³, Trupti Kawli³, Alan P. Boyle⁵, Alan P. Boyle³, Ghia Euskirchen³, Anshul Kundaje³, Shin Lin³, Yiing Lin³, Camden Jansen¹⁵, Venkat S. Malladi³, Melissa S. Cline¹⁶, Drew T. Erickson³, Vanessa M. Kirkup¹⁶, Katrina Learned¹⁶, Cricket A. Sloan³, Kate R. Rosenbloom¹⁶, Beatriz Lacerda de Sousa¹⁷, Kathryn Beal, Miguel Pignatelli, Paul Flicek, Jin Lian¹⁸, Tamer Kahveci¹⁹, Dongwon Lee²⁰, W. James Kent¹⁶, Miguel Santos¹⁷, Javier Herrero²¹, Cedric Notredame, Audra K. Johnson⁴, Shinny Vong⁴, Kristen Lee⁴, Daniel Bates⁴, Fidencio Neri⁴, Morgan Diegel⁴, Theresa K. Canfield⁴, Peter J. Sabo⁴, Matthew S. Wilken⁴, Thomas A. Reh⁴, Erika Giste⁴, Anthony Shafer⁴, Tanya Kutyavin⁴, Eric Haugen⁴, Douglas Dunn⁴, Alex Reynolds⁴, Shane Neph⁴, Richard Humbert⁴, R. Scott Hansen⁴, Marella F. T. R. de Bruijn²², Licia Selleri²³, Alexander Y. Rudensky²⁴, Steven Z. Josefowicz²⁴, Robert M. Samstein²⁴, Evan E. Eichler⁴, Stuart H. Orkin²⁵, Dana N. 
Levasseur²⁶, Thalia Papayannopoulou⁴, Kai Hsin Chang⁴, Arthur I. Skoultchi²⁷, Srikanta Gosh²⁷, Christine M. Disteche⁴, Piper M. Treuting⁴, Yanli Wang¹, Mitchell J. Weiss, Gerd A. Blobel²⁸, Xiaoyi Cao², Sheng Zhong², Ting Wang²⁹, Peter J. Good³⁰, Rebecca F. Lowdon³⁰, Rebecca F. Lowdon²⁹, Leslie B. Adams³⁰, Leslie B. Adams³¹, Xiao Qiao Zhou³⁰, Michael J. Pazin³⁰, Elise A. Feingold³⁰, Barbara J. Wold⁹, James Taylor¹¹, Ali Mortazavi¹⁵, Sherman M. Weissman¹⁸, John A. Stamatoyannopoulos⁴, Michael Snyder³, Roderic Guigó, Thomas R. Gingeras⁸, David M. Gilbert⁷, Ross C. Hardison¹, Michael A. Beer²⁰, Bing Ren² - Show less +142 more•Institutions (31)

Pennsylvania State University¹, University of California, San Diego², Stanford University³, University of Washington⁴, University of Michigan⁵, New College of Florida⁶, Florida State University⁷, Cold Spring Harbor Laboratory⁸, California Institute of Technology⁹, University of Vienna¹⁰, Emory University¹¹, Fred Hutchinson Cancer Research Center¹², Massachusetts Institute of Technology¹³, Broad Institute¹⁴, University of California, Irvine¹⁵, University of California, Santa Cruz¹⁶, University of California, San Francisco¹⁷, Yale University¹⁸, University of Florida¹⁹, Johns Hopkins University²⁰, University College London²¹, University of Oxford²², Cornell University²³, Memorial Sloan Kettering Cancer Center²⁴, Harvard University²⁵, University of Iowa²⁶, Yeshiva University²⁷, University of Pennsylvania²⁸, Washington University in St. Louis²⁹, National Institutes of Health³⁰, University of North Carolina at Chapel Hill³¹

20 Nov 2014-Nature

TL;DR: The Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types.

Abstract: The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain greater insights into both shared and species-specific transcriptional and cellular regulatory programs in the mouse, the Mouse ENCODE Consortium has mapped transcription, DNase I hypersensitivity, transcription factor binding, chromatin modifications and replication domains throughout the mouse genome in diverse cell and tissue types. By comparing with the human genome, we not only confirm substantial conservation in the newly annotated potential functional sequences, but also find a large degree of divergence of sequences involved in transcriptional regulation, chromatin state and higher order chromatin organization. Our results illuminate the wide range of evolutionary forces acting on genes and their regulatory regions, and provide a general resource for research into mammalian biology and mechanisms of human diseases.

A comparative encyclopedia of DNA elements in the mouse genome - eScholarship

Miguel Ramalho-Santos, Yin Shen, Sheng Zhong, Licia Selleri, F Yue, Y Cheng, A Breschi, Jeff Vierstra, W Wu, T Ryba, Richard Sandstrom, Z Ma, C Davis, BD Pope

20 Nov 2014

TL;DR: The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways.

Abstract: © 2014 Macmillan Publishers Limited. All rights reserved. The laboratory mouse shares the majority of its protein-coding genes with humans, making it the premier model organism in biomedical research, yet the two mammals differ in significant ways. To gain

Journal Article•DOI•

ARG-ANNOT, a New Bioinformatic Tool To Discover Antibiotic Resistance Genes in Bacterial Genomes

Sushim K. Gupta¹, Babu Roshan Padmanabhan¹, Seydina M. Diene¹, Rafael López-Rojas, Marie Kempf, Luce Landraud, Jean-Marc Rolain¹•Institutions (1)

Aix-Marseille University¹

01 Jan 2014-Antimicrobial Agents and Chemotherapy

TL;DR: A concise database for BLAST using a Bio-Edit interface is created that can detect AR genetic determinants in bacterial genomes and can rapidly and easily discover putative new AR genetic determinants.

Abstract: ARG-ANNOT (Antibiotic Resistance Gene-ANNOTation) is a new bioinformatic tool that was created to detect existing and putative new antibiotic resistance (AR) genes in bacterial genomes. ARG-ANNOT uses a local BLAST program in Bio-Edit software that allows the user to analyze sequences without a Web interface. All AR genetic determinants were collected from published works and online resources; nucleotide and protein sequences were retrieved from the NCBI GenBank database. After building a database that includes 1,689 antibiotic resistance genes, the software was tested in a blind manner using 100 random sequences selected from the database to verify that the sensitivity and specificity were at 100% even when partial sequences were queried. Notably, BLAST analysis results obtained using the rmtF gene sequence (a new aminoglycoside-modifying enzyme gene sequence that is not included in the database) as a query revealed that the tool was able to link this sequence to short sequences (17 to 40 bp) found in other genes of the rmt family with significant E values. Finally, the analysis of 178 Acinetobacter baumannii and 20 Staphylococcus aureus genomes allowed the detection of a significantly higher number of AR genes than the Resfinder gene analyzer and 11 point mutations in target genes known to be associated with AR. The average time for the analysis of a genome was 3.35 ± 0.13 min. We have created a concise database for BLAST using a Bio-Edit interface that can detect AR genetic determinants in bacterial genomes and can rapidly and easily discover putative new AR genetic determinants.

Journal Article•DOI•

A reference genome for common bean and genome-wide analysis of dual domestications

Jeremy Schmutz¹, Phillip E. McClean², Sujan Mamidi², G Albert Wu¹, Steven B. Cannon³, Jane Grimwood, Jerry Jenkins, Shengqiang Shu¹, Qijian Song³, Carolina Chavarro⁴, Mirayda Torres-Torres⁴, Valérie Geffroy⁵, Samira Mafi Moghaddam², Dongying Gao⁴, Brian Abernathy⁴, Kerrie Barry¹, Matthew W. Blair⁶, Mark A. Brick⁷, Mansi Chovatia¹, Paul Gepts⁸, David Goodstein¹, Michael D. Gonzales⁴, Uffe Hellsten¹, David L. Hyten³, Gaofeng Jia³, James D. Kelly⁹, Dave Kudrna¹⁰, Rian Lee², Manon M.S. Richard¹¹, Phillip N. Miklas³, Juan M. Osorno², Josiane Rodrigues³, Vincent Thareau¹¹, Carlos A. Urrea¹², Mei Wang¹, Yeisoo Yu¹⁰, Ming Zhang¹, Rod A. Wing¹⁰, Perry B. Cregan³, Daniel S. Rokhsar¹, Scott A. Jackson⁴ - Show less +37 more•Institutions (12)

United States Department of Energy¹, North Dakota State University², United States Department of Agriculture³, University of Georgia⁴, Institut national de la recherche agronomique⁵, Tennessee State University⁶, Colorado State University⁷, University of California, Davis⁸, Michigan State University⁹, University of Arizona¹⁰, University of Paris-Sud¹¹, University of Nebraska–Lincoln¹²

01 Jul 2014-Nature Genetics

TL;DR: 2 independent domestications from genetic pools that diverged before human colonization are confirmed and a set of genes linked with increased leaf and seed size are identified and combined with quantitative trait locus data from Mesoamerican cultivars.

Abstract: Common bean (Phaseolus vulgaris L.) is the most important grain legume for human consumption and has a role in sustainable agriculture owing to its ability to fix atmospheric nitrogen. We assembled 473 Mb of the 587-Mb genome and genetically anchored 98% of this sequence in 11 chromosome-scale pseudomolecules. We compared the genome for the common bean against the soybean genome to find changes in soybean resulting from polyploidy. Using resequencing of 60 wild individuals and 100 landraces from the genetically differentiated Mesoamerican and Andean gene pools, we confirmed 2 independent domestications from genetic pools that diverged before human colonization. Less than 10% of the 74 Mb of sequence putatively involved in domestication was shared by the two domestication events. We identified a set of genes linked with increased leaf and seed size and combined these results with quantitative trait locus data from Mesoamerican cultivars. Genes affected by domestication may be useful for genomics-enabled crop improvement.

Journal Article•DOI•

Dimeric CRISPR RNA-guided FokI nucleases for highly specific genome editing

Shengdar Q. Tsai, Nicolas Wyvekens¹, Cyd Khayter¹, Jennifer A Foden¹, Vishal Thapar¹, Deepak Reyon, Mathew J. Goodwin¹, Martin J. Aryee¹, J. Keith Joung•Institutions (1)

Harvard University¹

01 Jun 2014-Nature Biotechnology

TL;DR: Dimeric RNA-guided FokI nucleases (RFNs) are described that can recognize extended sequences and edit endogenous genes with high efficiencies in human cells and are likely to be useful in applications that require highly precise genome editing.

Abstract: Monomeric CRISPR-Cas9 nucleases are widely used for targeted genome editing but can induce unwanted off-target mutations with high frequencies. Here we describe dimeric RNA-guided FokI Nucleases (RFNs) that recognize extended sequences and can edit endogenous genes with high efficiencies in human cells. The cleavage activity of an RFN depends strictly on the binding of two guide RNAs (gRNAs) to DNA with a defined spacing and orientation and therefore show improved specificities relative to wild-type Cas9 monomers. Importantly, direct comparisons show that RFNs guided by a single gRNA generally induce lower levels of unwanted mutations than matched monomeric Cas9 nickases. In addition, we describe a simple method for expressing multiple gRNAs bearing any 5′ end nucleotide, which gives dimeric RFNs a broad targeting range. RFNs combine the ease of RNA-based targeting with the specificity enhancement inherent to dimerization and are likely to be useful in applications that require highly precise genome editing.

Journal Article•DOI•

Genome-wide recessive genetic screening in mammalian cells with a lentiviral CRISPR-guide RNA library

Hiroko Koike-Yusa¹, Yang Li¹, E-Pien Tan¹, Martin Del Castillo Velasco-Herrera¹, Kosuke Yusa¹•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Mar 2014-Nature Biotechnology

TL;DR: The results demonstrate the potential for efficient loss-of-function screening using the CRISPR-Cas9 system and identify 27 known and 4 previously unknown genes implicated in these phenotypes.

Abstract: Identification of genes influencing a phenotype of interest is frequently achieved through genetic screening by RNA interference (RNAi) or knockouts. However, RNAi may only achieve partial depletion of gene activity, and knockout-based screens are difficult in diploid mammalian cells. Here we took advantage of the efficiency and high throughput of genome editing based on type II, clustered, regularly interspaced, short palindromic repeats (CRISPR)-CRISPR-associated (Cas) systems to introduce genome-wide targeted mutations in mouse embryonic stem cells (ESCs). We designed 87,897 guide RNAs (gRNAs) targeting 19,150 mouse protein-coding genes and used a lentiviral vector to express these gRNAs in ESCs that constitutively express Cas9. Screening the resulting ESC mutant libraries for resistance to either Clostridium septicum alpha-toxin or 6-thioguanine identified 27 known and 4 previously unknown genes implicated in these phenotypes. Our results demonstrate the potential for efficient loss-of-function screening using the CRISPR-Cas9 system.

Journal Article•DOI•

Optimized CRISPR/Cas tools for efficient germline and somatic genome engineering in Drosophila.

Fillip Port¹, Hui-Min Chen², Tzumin Lee², Simon L. Bullock¹•Institutions (2)

Laboratory of Molecular Biology¹, Janelia Farm Research Campus²

22 Jul 2014-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: A toolbox for high-efficiency genome engineering of Drosophila melanogaster consisting of transgenic Cas9 lines and versatile guide RNA (gRNA) expression plasmids is reported, which will facilitate the rapid evaluation of mutant phenotypes of specific genes and the precise modification of the genome with single-nucleotide precision.

Abstract: The type II clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) system has emerged recently as a powerful method to manipulate the genomes of various organisms. Here, we report a toolbox for high-efficiency genome engineering of Drosophila melanogaster consisting of transgenic Cas9 lines and versatile guide RNA (gRNA) expression plasmids. Systematic evaluation reveals Cas9 lines with ubiquitous or germ-line–restricted patterns of activity. We also demonstrate differential activity of the same gRNA expressed from different U6 snRNA promoters, with the previously untested U6:3 promoter giving the most potent effect. An appropriate combination of Cas9 and gRNA allows targeting of essential and nonessential genes with transmission rates ranging from 25–100%. We also demonstrate that our optimized CRISPR/Cas tools can be used for offset nicking-based mutagenesis. Furthermore, in combination with oligonucleotide or long double-stranded donor templates, our reagents allow precise genome editing by homology-directed repair with rates that make selection markers unnecessary. Last, we demonstrate a novel application of CRISPR/Cas-mediated technology in revealing loss-of-function phenotypes in somatic cells following efficient biallelic targeting by Cas9 expressed in a ubiquitous or tissue-restricted manner. Our CRISPR/Cas tools will facilitate the rapid evaluation of mutant phenotypes of specific genes and the precise modification of the genome with single-nucleotide precision. Our results also pave the way for high-throughput genetic screening with CRISPR/Cas.

Journal Article•DOI•

Insect Mitochondrial Genomics: Implications for Evolution and Phylogeny

Stephen L. Cameron¹•Institutions (1)

Queensland University of Technology¹

07 Jan 2014-Annual Review of Entomology

TL;DR: Insects are model systems for studying aberrant mt genomes, including truncated tRNAs and multichromosomal genomes, and greater integration of nuclear and mt genomic studies is necessary to further the understanding of insect genomic evolution.

Abstract: The mitochondrial (mt) genome is, to date, the most extensively studied genomic system in insects, outnumbering nuclear genomes tenfold and representing all orders versus very few. Phylogenomic analysis methods have been tested extensively, identifying compositional bias and rate variation, both within and between lineages, as the principal issues confronting accurate analyses. Major studies at both inter- and intraordinal levels have contributed to our understanding of phylogenetic relationships within many groups. Genome rearrangements are an additional data type for defining relationships, with rearrangement synapomorphies identified across multiple orders and at many different taxonomic levels. Hymenoptera and Psocodea have greatly elevated rates of rearrangement offering both opportunities and pitfalls for identifying rearrangement synapomorphies in each group. Finally, insects are model systems for studying aberrant mt genomes, including truncated tRNAs and multichromosomal genomes. Greater integration of nuclear and mt genomic studies is necessary to further our understanding of insect genomic evolution.

Journal Article•DOI•

The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes

Shengyi Liu¹, Yumei Liu, Xinhua Yang, Chaobo Tong¹, David Edwards², Isobel A. P. Parkin³, Meixia Zhao¹, Jianxin Ma⁴, Jingyin Yu¹, Shunmou Huang¹, Xiyin Wang⁵, Junyi Wang, Kun Lu⁶, Zhiyuan Fang, Ian Bancroft⁷, Tae-Jin Yang⁸, Qiong Hu¹, Xinfa Wang¹, Zhen Yue, Haojie Li, Linfeng Yang, Jian Wu, Qing Zhou, Wanxin Wang, Graham J.W. King⁹, J. Chris Pires¹⁰, Changxin Lu, Zhangyan Wu, Perumal Sampath⁸, Zhuo Wang, Hui Guo⁵, Shengkai Pan, Limei Yang, Jiumeng Min, Dong Zhang⁵, Dianchuan Jin, Wanshun Li, Harry Belcram¹¹, Jinxing Tu¹², Mei Guan¹³, Cunkou Qi, Dezhi Du, Jiana Li⁶, Liangcai Jiang, Jacqueline Batley¹⁴, Andrew G. Sharpe¹⁵, Beom Seok Park, Pradeep Ruperao², Feng Cheng, Nomar Espinosa Waminal⁸, Yin Huang, Caihua Dong¹, Li Wang, Jingping Li⁵, Zhiyong Hu¹, Mu Zhuang, Yi Huang¹, Junyan Huang¹, Jiaqin Shi¹, Desheng Mei¹, Jing Liu¹, Tae-Ho Lee⁵, Jinpeng Wang, Huizhe Jin⁵, Zaiyun Li¹², Xun Li¹³, Jiefu Zhang, Lu Xiao, Yongming Zhou¹², Zhongsong Liu¹³, Xuequn Liu¹⁶, Rui Qin¹⁶, Xu Tang⁵, Wenbin Liu, Yupeng Wang⁵, Yangyong Zhang, Jonghoon Lee⁸, Hyun Hee Kim¹⁷, Xun Xu, Xinming Liang, Wei Hua¹, Xiaowu Wang, Jun Wang¹⁸, Boulos Chalhoub¹¹, Andrew H. Paterson⁵ - Show less +81 more•Institutions (18)

Crops Research Institute¹, Australian Centre for Plant Functional Genomics², Agriculture and Agri-Food Canada³, Purdue University⁴, Plant Genome Mapping Laboratory⁵, Southwest University⁶, University of York⁷, Seoul National University⁸, Southern Cross University⁹, University of Missouri¹⁰, Centre national de la recherche scientifique¹¹, Huazhong Agricultural University¹², Hunan Agricultural University¹³, University of Queensland¹⁴, National Research Council¹⁵, Central University, India¹⁶, Sahmyook University¹⁷, King Abdulaziz University¹⁸

23 May 2014-Nature Communications

TL;DR: A draft genome sequence of Brassica oleracea is described, comparing it with that of its sister species B. rapa to reveal numerous chromosome rearrangements and asymmetrical gene loss in duplicated genomic blocks.

Abstract: Polyploidization has provided much genetic variation for plant adaptive evolution, but the mechanisms by which the molecular evolution of polyploid genomes establishes genetic architecture underlying species differentiation are unclear. Brassica is an ideal model to increase knowledge of polyploid evolution. Here we describe a draft genome sequence of Brassica oleracea, comparing it with that of its sister species B. rapa to reveal numerous chromosome rearrangements and asymmetrical gene loss in duplicated genomic blocks, asymmetrical amplification of transposable elements, differential gene co-retention for specific pathways and variation in gene expression, including alternative splicing, among a large number of paralogous and orthologous genes. Genes related to the production of anticancer phytochemicals and morphological variations illustrate consequences of genome duplication and gene divergence, imparting biochemical and morphological variation to B. oleracea. This study provides insights into Brassica genome evolution and will underpin research into the many important crops in this genus.

Journal Article•DOI•

Genome editing with Cas9 in adult mice corrects a disease mutation and phenotype

Hao Yin¹, Wen Xue¹, Sidi Chen¹, Roman L. Bogorad¹, Eric Benedetti², Markus Grompe², Victor Koteliansky³, Phillip A. Sharp¹, Tyler Jacks¹, Daniel G. Anderson¹•Institutions (3)

Massachusetts Institute of Technology¹, Oregon Health & Science University², Skolkovo Institute of Science and Technology³

01 Jun 2014-Nature Biotechnology

TL;DR: In this article, the authors demonstrate CRISPR-Cas9-mediated correction of a Fah mutation in hepatocytes in a mouse model of the human disease hereditary tyrosinemia.

Abstract: We demonstrate CRISPR-Cas9-mediated correction of a Fah mutation in hepatocytes in a mouse model of the human disease hereditary tyrosinemia. Delivery of components of the CRISPR-Cas9 system by hydrodynamic injection resulted in initial expression of the wild-type Fah protein in ∼1/250 liver cells. Expansion of Fah-positive hepatocytes rescued the body weight loss phenotype. Our study indicates that CRISPR-Cas9-mediated genome editing is possible in adult animals and has potential for correction of human genetic diseases.

Journal Article•DOI•

Comparative genomics reveals insights into avian genome evolution and adaptation.

Guojie Zhang¹, Guojie Zhang², Cai Li², Qiye Li², Bo Li², Denis M. Larkin³, Chul Hee Lee⁴, Jay F. Storz⁵, Agostinho Antunes⁶, Matthew J. Greenwold⁷, Robert W. Meredith⁸, Anders Ödeen⁹, Jie Cui¹⁰, Qi Zhou¹¹, Luohao Xu², Hailin Pan², Zongji Wang¹², Lijun Jin², Pei Zhang², Haofu Hu², Wei Yang², Jiang Hu², Jin Xiao², Zhikai Yang², Yang Liu², Qiaolin Xie², Hao Yu², Jinmin Lian², Ping Wen², Fang Zhang², Hui Li², Yongli Zeng², Zijun Xiong², Shiping Liu¹², Long Zhou², Zhiyong Huang², Na An², Jie Wang¹³, Qiumei Zheng², Yingqi Xiong², Guangbiao Wang², Bo Wang², Jingjing Wang², Yu Fan¹⁴, Rute R. da Fonseca¹, Alonzo Alfaro-Núñez¹, Mikkel Schubert¹, Ludovic Orlando¹, Tobias Mourier¹, Jason T. Howard¹⁵, Ganeshkumar Ganapathy¹⁵, Andreas R. Pfenning¹⁵, Osceola Whitney¹⁵, Miriam V. Rivas¹⁵, Erina Hara¹⁵, Julia Smith¹⁵, Marta Farré³, Jitendra Narayan¹⁶, Gancho T. Slavov¹⁶, Michael N Romanov¹⁷, Rui Borges⁶, João Paulo Machado⁶, Imran Khan⁶, Mark S. Springer¹⁸, John Gatesy¹⁸, Federico G. Hoffmann¹⁹, Juan C. Opazo²⁰, Olle Håstad²¹, Roger H. Sawyer⁷, Heebal Kim⁴, Kyu-Won Kim⁴, Hyeon Jeong Kim⁴, Seoae Cho⁴, Ning Li²², Yinhua Huang²², Michael William Bruford²³, Xiangjiang Zhan¹³, Andrew Dixon, Mads F. Bertelsen²⁴, Elizabeth P. Derryberry²⁵, Wesley C. Warren²⁶, Richard K. Wilson²⁶, Shengbin Li²⁷, David A. Ray¹⁹, Richard E. Green²⁸, Stephen J. O'Brien²⁹, Darren K. Griffin¹⁷, Warren E. Johnson³⁰, David Haussler²⁸, Oliver A. Ryder, Eske Willerslev¹, Gary R. Graves³¹, Per Alström²¹, Jon Fjeldså³², David P. Mindell³³, Scott V. Edwards³⁴, Edward L. Braun³⁵, Carsten Rahbek³², David W. Burt³⁶, Peter Houde³⁷, Yong Zhang², Huanming Yang³⁸, Jian Wang², Erich D. Jarvis¹⁵, M. Thomas P. Gilbert³⁹, M. Thomas P. Gilbert¹, Jun Wang - Show less +103 more•Institutions (39)

University of Copenhagen¹, Beijing Genomics Institute², Royal Veterinary College³, Seoul National University⁴, University of Nebraska–Lincoln⁵, University of Porto⁶, University of South Carolina⁷, Montclair State University⁸, Uppsala University⁹, National University of Singapore¹⁰, University of California, Berkeley¹¹, South China University of Technology¹², Chinese Academy of Sciences¹³, Kunming Institute of Zoology¹⁴, Howard Hughes Medical Institute¹⁵, Aberystwyth University¹⁶, University of Kent¹⁷, University of California, Riverside¹⁸, Mississippi State University¹⁹, Austral University of Chile²⁰, Swedish University of Agricultural Sciences²¹, China Agricultural University²², Cardiff University²³, Copenhagen Zoo²⁴, Louisiana State University²⁵, Washington University in St. Louis²⁶, Xi'an Jiaotong University²⁷, University of California, Santa Cruz²⁸, Nova Southeastern University Oceanographic Center²⁹, Smithsonian Conservation Biology Institute³⁰, National Museum of Natural History³¹, Natural History Museum³², University of California, San Francisco³³, Harvard University³⁴, University of Florida³⁵, University of Edinburgh³⁶, New Mexico State University³⁷, Macau University of Science and Technology³⁸, Curtin University³⁹

12 Dec 2014-Science

TL;DR: This work explored bird macroevolution using full genomes from 48 avian species representing all major extant clades to reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits.

Abstract: Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits.

Journal Article•DOI•

Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls

Justin M. Zook¹, Brad Chapman², Jason Wang, David Mittelman³, Oliver Hofmann², Winston Hide², Marc L. Salit¹•Institutions (3)

National Institute of Standards and Technology¹, Harvard University², Virginia Bioinformatics Institute³

01 Mar 2014-Nature Biotechnology

TL;DR: Methods to make high-confidence, single-nucleotide polymorphism (SNP), indel and homozygous reference genotype calls for NA12878, the pilot genome for the Genome in a Bottle Consortium are presented.

Abstract: Clinical adoption of human genome sequencing requires methods that output genotypes with known accuracy at millions or billions of positions across a genome. Because of substantial discordance among calls made by existing sequencing methods and algorithms, there is a need for a highly accurate set of genotypes across a genome that can be used as a benchmark. Here we present methods to make high-confidence, single-nucleotide polymorphism (SNP), indel and homozygous reference genotype calls for NA12878, the pilot genome for the Genome in a Bottle Consortium. We minimize bias toward any method by integrating and arbitrating between 14 data sets from five sequencing technologies, seven read mappers and three variant callers. We identify regions for which no confident genotype call could be made, and classify them into different categories based on reasons for uncertainty. Our genotype calls are publicly available on the Genome Comparison and Analytic Testing website to enable real-time benchmarking of any method.

Journal Article•DOI•

Efficient genome modification by CRISPR-Cas9 nickase with minimal off-target effects.

Bin Shen¹, Wensheng Zhang², Jun Zhang¹, Jiankui Zhou¹, Jianying Wang¹, Li Chen¹, Lu Wang³, Alex Hodgkins², Vivek Iyer², Xingxu Huang¹, William C. Skarnes²•Institutions (3)

Nanjing University¹, Wellcome Trust Sanger Institute², Beijing Institute of Genomics³

01 Apr 2014-Nature Methods

TL;DR: This work has shown that co-microinjection of mouse embryos with Cas9 mRNA and single guide RNAs induces on-target and off-target mutations that are transmissible to offspring, but Cas9 nickase can be used to efficiently mutate genes without detectable damage at known off-target sites.

Abstract: Bacterial RNA-directed Cas9 endonuclease is a versatile tool for site-specific genome modification in eukaryotes. Co-microinjection of mouse embryos with Cas9 mRNA and single guide RNAs induces on-target and off-target mutations that are transmissible to offspring. However, Cas9 nickase can be used to efficiently mutate genes without detectable damage at known off-target sites. This method is applicable for genome editing of any model organism and minimizes confounding problems of off-target mutations.

Journal Article•DOI•

The rainbow trout genome provides novel insights into evolution after whole-genome duplication in vertebrates

Camille Berthelot¹, Frédéric Brunet², Domitille Chalopin², Amélie Juanchich³, Maria Bernard³, Benjamin Noel, Pascal Bento, Corinne Da Silva, Karine Labadie, Adriana Alberti, Jean-Marc Aury, Alexandra Louis², Patrice Dehais³, Philippe Bardou³, Jérôme Montfort³, Christophe Klopp³, Cédric Cabau³, Christine Gaspin³, Gary H. Thorgaard⁴, Mekki Boussaha³, Edwige Quillet³, René Guyomard³, Delphine Galiana², Julien Bobe³, Jean-Nicolas Volff², Carine Genet³, Patrick Wincker⁵, Olivier Jaillon⁵, Hugues Roest Crollius², Yann Guiguen³ - Show less +26 more•Institutions (5)

European Bioinformatics Institute¹, École Normale Supérieure², Institut national de la recherche agronomique³, Washington State University⁴, Centre national de la recherche scientifique⁵

22 Apr 2014-Nature Communications

TL;DR: It is shown that after 100 million years of evolution the two ancestral subgenomes have remained extremely collinear, despite the loss of half of the duplicated protein-coding genes, mostly through pseudogenization.

Abstract: Vertebrate evolution has been shaped by several rounds of whole-genome duplications (WGDs) that are often suggested to be associated with adaptive radiations and evolutionary innovations. Due to an additional round of WGD, the rainbow trout genome offers a unique opportunity to investigate the early evolutionary fate of a duplicated vertebrate genome. Here we show that after 100 million years of evolution the two ancestral subgenomes have remained extremely collinear, despite the loss of half of the duplicated protein-coding genes, mostly through pseudogenization. In striking contrast is the fate of miRNA genes that have almost all been retained as duplicated copies. The slow and stepwise rediploidization process characterized here challenges the current hypothesis that WGD is followed by massive and rapid genomic reorganizations and gene deletions.

Journal Article•DOI•

Control of Cell Identity Genes Occurs in Insulated Neighborhoods in Mammalian Chromosomes

Jill M. Dowen¹, Zi Peng Fan¹, Denes Hnisz¹, Gang Ren², Gang Ren³, Brian J. Abraham¹, Lyndon Nuoxi Zhang¹, Abraham S. Weintraub¹, Jurian Schuijers¹, Tong Ihn Lee¹, Keji Zhao², Richard A. Young¹•Institutions (3)

Massachusetts Institute of Technology¹, National Institutes of Health², Northwest A&F University³

09 Oct 2014-Cell

TL;DR: Using ESC cohesin ChIA-PET data to identify the local chromosomal structures at both active and repressed genes across the genome produces a map of enhancer-promoter interactions and reveals that super-enhancer-driven genes generally occur within chromosome structures that are formed by the looping of two interacting CTCF sites co-occupied by cohesin.

Journal Article•DOI•

Genome sequence of the cultivated cotton Gossypium arboreum

Fuguang Li, Guangyi Fan, Kunbo Wang, Fengming Sun, Youlu Yuan, Guoli Song, Qin Li¹, Zhiying Ma², Cairui Lu, Changsong Zou, Wenbin Chen, Xinming Liang, Haihong Shang, Weiqing Liu, Chengcheng Shi, Guanghui Xiao¹, Caiyun Gou, Wuwei Ye, Xun Xu, Xueyan Zhang, Hengling Wei, Zhifang Li, Guiyin Zhang², Junyi Wang, Kun Liu, Russell J. Kohel³, Richard G. Percy³, John Z. Yu³, Yu-Xian Zhu¹, Jun Wang, Shuxun Yu•Institutions (3)

Peking University¹, Agricultural University of Hebei², United States Department of Agriculture³

01 Jun 2014-Nature Genetics

TL;DR: Comparative transcriptome studies showed the key role of the nucleotide binding site (NBS)-encoding gene family in resistance to Verticillium dahliae and the involvement of ethylene in the development of cotton fiber cells.

Abstract: Yu-Xian Zhu, Jun Wang, Shuxun Yu and colleagues report sequencing and assembly of the genome of cultivated cotton, Gossypium arboreum. Comparison with the Gossypium raimondii genome sequence provides insights into genome evolution and speciation, and identifies two shared whole-genome duplication events occurring before the speciation event around 2–13 million years ago.
