scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Improved vectors and genome-wide libraries for CRISPR screening.

01 Aug 2014-Nature Methods (NIH Public Access)-Vol. 11, Iss: 8, pp 783-784
TL;DR: In this paper, Zhang et al. used a Genome-scale CRISPR Knock-Out (GeCKO) library to identify loss-of-function mutations in a melanoma model.
Abstract: Genome-wide, targeted loss-of-function pooled screens using the CRISPR (clustered regularly interspaced short palindrome repeats)–associated nuclease Cas9 in human and mouse cells provide an alternative screening system to RNA interference (RNAi) and have been used to reveal new mechanisms in diverse biological models1-4. Previously, we used a Genome-scale CRISPR Knock-Out (GeCKO) library to identify loss-of-function mutations conferring vemurafenib resistance in a melanoma model1. However, initial lentiviral delivery systems for CRISPR screening had low viral titer or required a cell line already expressing Cas9, limiting the range of biological systems amenable to screening. Here, we sought to improve both the lentiviral packaging and choice of guide sequences in our original GeCKO library1, where a pooled library of synthesized oligonucleotides was cloned into a lentiviral backbone containing both the Streptococcus pyogenes Cas9 nuclease and the single guide RNA (sgRNA) scaffold. To create a new vector capable of producing higher-titer virus (lentiCRISPRv2), we made several modifications, including removal of one of the nuclear localization signals (NLS), human codon-optimization of the remaining NLS and P2A bicistronic linker sequences, and repositioning of the U6-driven sgRNA cassette (Fig. 1a). These changes resulted in a ~10-fold increase in functional viral titer over lentiCRISPRv11 (Fig. 1b). Figure 1 New lentiviral CRISPR designs produce viruses with higher functional titer. To further increase viral titer, we also cloned a two-vector system, in which Cas9 (lentiCas9-Blast) and sgRNA (lentiGuide-Puro) are delivered using separate viral vectors with distinct antibiotic selection markers (Fig. 1a). LentiGuide-Puro has a ~100-fold increase in functional viral titer over the original lentiCRISPRv1 (Fig. 1b). Both single and dual-vector systems mediate efficient knock-out of a genomically-integrated copy of EGFP in human cells (Supplementary Fig. 1). Whereas the dual vector system enables generation of Cas9-expressing cell lines which can be subsequently used for screens using lentiGuide-Puro, the single vector lentiCRISPRv2 may be better suited for in vivo or primary cell screening applications. In addition to the vector improvements, we designed and synthesized new human and mouse GeCKOv2 sgRNA libraries (Supplementary Methods) with several improvements (Table 1): First, for both human and mouse libraries, to target all genes with a uniform number of sgRNAs, we selected 6 sgRNAs per gene distributed over 3-4 constitutively expressed exons. Second, to further minimize off-target genome modification, we improved the calculation of off-target scores based on specificity analysis5. Third, to inactivate microRNAs (miRNAs) which play a key role in transcriptional regulation, we added sgRNAs to direct mutations to the pre-miRNA hairpin structure6. Finally, we targeted ~1000 additional genes not included in the original GeCKO library. Table 1 Comparison of new GeCKO v2 human and mouse sgRNA libraries with existing CRISPR libraries. Both libraries, mouse and human, are divided into 2 sub-libraries — containing 3 sgRNAs targeting each gene in the genome, as well as 1000 non-targeting control sgRNAs. Screens can be performed by combining both sub-libraries, yielding 6 sgRNAs per gene, for higher coverage. Alternatively, individual sub-libraries can be used in situations where cell numbers are limiting (eg. primary cells, in vivo screens). The human and mouse libraries have been cloned into lentiCRISPRv2 and into lentiGuide-Puro and deep sequenced to ensure uniform representation (Supplementary Fig. 2, 3). These new lentiCRISPR vectors and human and mouse libraries further improve the GeCKO reagents for diverse screening applications. Reagents are available to the academic community through Addgene and associated protocols, support forums, and computational tools are available via the Zhang lab website (www.genome-engineering.org).
Citations
More filters
Journal ArticleDOI
TL;DR: Recently devised sgRNA design rules are used to create human and mouse genome-wide libraries, perform positive and negative selection screens and observe that the use of these rules produced improved results, and a metric to predict off-target sites is developed.
Abstract: CRISPR-Cas9-based genetic screens are a powerful new tool in biology. By simply altering the sequence of the single-guide RNA (sgRNA), one can reprogram Cas9 to target different sites in the genome with relative ease, but the on-target activity and off-target effects of individual sgRNAs can vary widely. Here, we use recently devised sgRNA design rules to create human and mouse genome-wide libraries, perform positive and negative selection screens and observe that the use of these rules produced improved results. Additionally, we profile the off-target activity of thousands of sgRNAs and develop a metric to predict off-target sites. We incorporate these findings from large-scale, empirical data to improve our computational design rules and create optimized sgRNA libraries that maximize on-target activity and minimize off-target effects to enable more effective and efficient genetic screens and genome engineering.

2,866 citations

Journal ArticleDOI
09 Oct 2014-Cell
TL;DR: In vivo as well as ex vivo genome editing using adeno-associated virus, lentivirus, or particle-mediated delivery of guide RNA in neurons, immune cells, and endothelial cells is demonstrated, suggesting that Cas9 mice empower a wide range of biological and disease modeling applications.

1,476 citations

01 Sep 2014
TL;DR: In this paper, a Cre-dependent Cas9 knockin mouse was used to study the dynamics of KRAS, p53, and LKB1, the top three significantly mutated genes in lung adenocarcinoma.
Abstract: CRISPR-Cas9 is a versatile genome editing technology for studying the functions of genetic elements. To broadly enable the application of Cas9 in vivo, we established a Cre-dependent Cas9 knockin mouse. We demonstrated in vivo as well as ex vivo genome editing using adeno-associated virus (AAV)-, lentivirus-, or particle-mediated delivery of guide RNA in neurons, immune cells, and endothelial cells. Using these mice, we simultaneously modeled the dynamics of KRAS, p53, and LKB1, the top three significantly mutated genes in lung adenocarcinoma. Delivery of a single AAV vector in the lung generated loss-of-function mutations in p53 and Lkb1, as well as homology-directed repair-mediated KrasG12D mutations, leading to macroscopic tumors of adenocarcinoma pathology. Together, these results suggest that Cas9 mice empower a wide range of biological and disease modeling applications.

1,216 citations

Journal ArticleDOI
05 Jul 2017-Nature
TL;DR: Vulnerability to ferroptic cell death induced by inhibition of a lipid peroxidase pathway as a feature of therapy-resistant cancer cells across diverse mesenchymal cell-state contexts is identified.
Abstract: Cancer cells can assume different biological states, which can affect their resistance to therapies. A mesenchymal phenotype has been associated with drug resistance but the mechanism behind this state is not well understood. Stuart Schreiber and colleagues now show that tumour cells with a mesenchymal phenotype are selectively sensitive to inhibition of GPX4, an enzyme that alters lipid metabolism. GPX4 dissipates lipid peroxides and therefore prevents the iron-mediated reactions which induce ferroptotic cell death. These findings offer new perspectives on targeting cancers that have undergone a transition to a mesenchymal state to evade other therapeutic agents.

1,008 citations

Journal ArticleDOI
TL;DR: A review of the latest applications of CRISPR-Cas9 in mammalian functional genomics screens is presented in this article, which covers related genome-scale applications of Cas9 for either gene knockout or transcriptional modulation.
Abstract: CRISPR–Cas9 has been adopted as a powerful genome-editing technology in various species. By generating libraries of thousands of guide RNAs — which direct the Cas9 nuclease to chosen genomic loci — high-throughput genetic perturbations are now possible. This Review discusses the latest applications of CRISPR–Cas9 in mammalian functional genomics screens. It covers related genome-scale applications of Cas9 for either gene knockout or transcriptional modulation, and provides comparisons with complementary RNA interference (RNAi)-based approaches.

980 citations

References
More filters
Journal ArticleDOI
TL;DR: Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches and can be used simultaneously to achieve even greater alignment speeds.
Abstract: Bowtie is an ultrafast, memory-efficient alignment program for aligning short DNA sequence reads to large genomes. For the human genome, Burrows-Wheeler indexing allows Bowtie to align more than 25 million reads per CPU hour with a memory footprint of approximately 1.3 gigabytes. Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches. Multiple processor cores can be used simultaneously to achieve even greater alignment speeds. Bowtie is open source http://bowtie.cbcb.umd.edu.

20,335 citations

Journal ArticleDOI
TL;DR: The command-line tool cutadapt is developed, which supports 454, Illumina and SOLiD (color space) data, offers two adapter trimming algorithms, and has other useful features.
Abstract: When small RNA is sequenced on current sequencing machines, the resulting reads are usually longer than the RNA and therefore contain parts of the 3' adapter. That adapter must be found and removed error-tolerantly from each read before read mapping. Previous solutions are either hard to use or do not offer required features, in particular support for color space data. As an easy to use alternative, we developed the command-line tool cutadapt, which supports 454, Illumina and SOLiD (color space) data, offers two adapter trimming algorithms, and has other useful features. Cutadapt, including its MIT-licensed source code, is available for download at http://code.google.com/p/cutadapt/

20,255 citations

Journal ArticleDOI
TL;DR: The TopHat pipeline is much faster than previous systems, mapping nearly 2.2 million reads per CPU hour, which is sufficient to process an entire RNA-Seq experiment in less than a day on a standard desktop computer.
Abstract: Motivation: A new protocol for sequencing the messenger RNA in a cell, known as RNA-Seq, generates millions of short sequence fragments in a single run. These fragments, or ‘reads’, can be used to measure levels of gene expression and to identify novel splice variants of genes. However, current software for aligning RNA-Seq data to a genome relies on known splice junctions and cannot identify novel ones. TopHat is an efficient read-mapping algorithm designed to align reads from an RNA-Seq experiment to a reference genome without relying on known splice sites. Results: We mapped the RNA-Seq reads from a recent mammalian RNA-Seq experiment and recovered more than 72% of the splice junctions reported by the annotation-based software from that study, along with nearly 20 000 previously unreported junctions. The TopHat pipeline is much faster than previous systems, mapping nearly 2.2 million reads per CPU hour, which is sufficient to process an entire RNA-Seq experiment in less than a day on a standard desktop computer. We describe several challenges unique to ab initio splice site discovery from RNA-Seq reads that will require further algorithm development. Availability: TopHat is free, open-source software available from http://tophat.cbcb.umd.edu Contact: ude.dmu.sc@eloc Supplementary information: Supplementary data are available at Bioinformatics online.

11,473 citations

Journal ArticleDOI
TL;DR: This protocol begins with raw sequencing reads and produces a transcriptome assembly, lists of differentially expressed and regulated genes and transcripts, and publication-quality visualizations of analysis results, which takes less than 1 d of computer time for typical experiments and ∼1 h of hands-on time.
Abstract: Recent advances in high-throughput cDNA sequencing (RNA-seq) can reveal new genes and splice variants and quantify expression genome-wide in a single assay. The volume and complexity of data from RNA-seq experiments necessitate scalable, fast and mathematically principled analysis software. TopHat and Cufflinks are free, open-source software tools for gene discovery and comprehensive expression analysis of high-throughput mRNA sequencing (RNA-seq) data. Together, they allow biologists to identify new genes and new splice variants of known ones, as well as compare gene and transcript expression under two or more conditions. This protocol describes in detail how to use TopHat and Cufflinks to perform such analyses. It also covers several accessory tools and utilities that aid in managing data, including CummeRbund, a tool for visualizing RNA-seq analysis results. Although the procedure assumes basic informatics skills, these tools assume little to no background with RNA-seq analysis and are meant for novices and experts alike. The protocol begins with raw sequencing reads and produces a transcriptome assembly, lists of differentially expressed and regulated genes and transcripts, and publication-quality visualizations of analysis results. The protocol's execution time depends on the volume of transcriptome sequencing data and available computing resources but takes less than 1 d of computer time for typical experiments and ∼1 h of hands-on time.

10,913 citations

Journal ArticleDOI
TL;DR: An update of the miRBase database is described, including the collation and use of deep sequencing data sets to assign levels of confidence to miR base entries, and a high confidence subset of miR Base entries are provided, based on the pattern of mapped reads.
Abstract: We describe an update of the miRBase database (http://www.mirbase.org/), the primary microRNA sequence repository. The latest miRBase release (v20, June 2013) contains 24 521 microRNA loci from 206 species, processed to produce 30 424 mature microRNA products. The rate of deposition of novel microRNAs and the number of researchers involved in their discovery continue to increase, driven largely by small RNA deep sequencing experiments. In the face of these increases, and a range of microRNA annotation methods and criteria, maintaining the quality of the microRNA sequence data set is a significant challenge. Here, we describe recent developments of the miRBase database to address this issue. In particular, we describe the collation and use of deep sequencing data sets to assign levels of confidence to miRBase entries. We now provide a high confidence subset of miRBase entries, based on the pattern of mapped reads. The high confidence microRNA data set is available alongside the complete microRNA collection at http://www.mirbase.org/. We also describe embedding microRNA-specific Wikipedia pages on the miRBase website to encourage the microRNA community to contribute and share textual and functional information.

4,705 citations