scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Structural basis for CRISPR RNA-guided DNA recognition by Cascade

TL;DR: The composition and low-resolution structure of Cascade is presented and it is shown how it recognizes double-stranded DNA targets in a sequence-specific manner and suggests that continuous invader DNA surveillance takes place without energy investment.
Abstract: The CRISPR (clustered regularly interspaced short palindromic repeats) immune system in prokaryotes uses small guide RNAs to neutralize invading viruses and plasmids. In Escherichia coli, immunity depends on a ribonucleoprotein complex called Cascade. Here we present the composition and low-resolution structure of Cascade and show how it recognizes double-stranded DNA (dsDNA) targets in a sequence-specific manner. Cascade is a 405-kDa complex comprising five functionally essential CRISPR-associated (Cas) proteins (CasA(1)B(2)C(6)D(1)E(1)) and a 61-nucleotide CRISPR RNA (crRNA) with 5'-hydroxyl and 2',3'-cyclic phosphate termini. The crRNA guides Cascade to dsDNA target sequences by forming base pairs with the complementary DNA strand while displacing the noncomplementary strand to form an R-loop. Cascade recognizes target DNA without consuming ATP, which suggests that continuous invader DNA surveillance takes place without energy investment. The structure of Cascade shows an unusual seahorse shape that undergoes conformational changes when it binds target DNA.

Summary (3 min read)

Introduction

  • The CRISPR (clustered regularly interspaced short palindromic repeats) immune system in prokaryotes uses small guide RNAs to neutralize invading viruses and plasmids.
  • Cascade is a 405-kDa complex comprising five functionally essential CRISPRassociated (Cas) proteins (CasA1B2C6D1E1) and a 61-nucleotide CRISPR RNA with 5′-hydroxyl and 2′,3′-cyclic phosphate termini.

RESULTS Core subcomplexes of Cascade

  • The E. coli K12 CRISPR-Cas system (Cse-subtype) consists of a gene cluster that includes cas3, the Cascade genes , cas1, cas2 and a downstream CRISPR locus20 (Fig. 1a).
  • To investigate the role of the individual Cascade subunits, the authors first tested whether each subunit was required for antiviral defense.
  • The authors systematically overproduced and purified different combinations of Cascade subunits and checked for the presence of mature crRNA.
  • Cascade, unlike CasBCDE and CasCDE, always copurified with large nucleic acid molecules (>300 nucleotides; Fig. 1c).
  • DNase I treatment before gel filtration eliminated the void peak without disrupting the discrete Cascade peak, again indicating the presence of Cascade-bound DNA (Fig. 1e).

Architecture of crRNA

  • The characteristics of the mature crRNA species have been accurately determined by subjecting mature crRNAs isolated from Cascade to denaturing RNA chromatography25,26 and electrospray ionization mass spectrometry (ESI-MS).
  • The authors used chromatography to show the purity and homogeneity of the crRNA in Cascade (Fig. 2a), and the retention time was consistent with an approximate length of 60 nucleotides.

Target DNA recognition

  • The observation that DNA copurified with Cascade prompted us to analyze the DNA-binding behavior of Cascade in detail (Fig. 3).
  • At high competitor concentrations a proportion of Cascade lacking CasA bound target DNA, as was evident from the faster migration rate of the CasBCDE–DNA complex (Fig. 3a,c and Supplementary Fig. 4).
  • This difference was more pronounced when the authors used dsDNA target probes (data not shown).
  • Only the complementary strand shifted when the authors used short dsDNA targets (corresponding to the protospacer), which suggests that the formation of base pairs between crRNA and the complementary strand displaced the noncomplementary strand (Fig. 4c).
  • Endonuclease P1 footprints showed that an 18-base region (G27–C44) of the nontargeted strand corresponding to about half of the protospacer was susceptible to cleavage after binding by Cascade (Fig. 4d).

G59

  • The borders of the Cascade-protected regions are indicated by arrows.
  • Subsequent RNase H treatment restored the original gel mobility of the plasmid, indicating that Watson-Crick base-pairing occurred between the crRNA and one strand of the plasmid DNA (Fig. 4g).
  • The specific binding of Cascade thus led to the formation of an R-loop (Fig. 4h): a structure in which the spacer part of the crRNA was base paired with one DNA strand of a duplex, leaving the displaced strand single stranded30.

Subunit stoichiometry

  • To understand the structural basis of the interaction between Cascade and target DNA, the authors determined the composition of the Cascade protein assembly using mass spectrometric analyses.
  • In agreement with the mass difference of 55,966 Da, the two major complexes of 405 and 349 kDa probably resemble the intact Cascade and a Cascade subcomplex lacking CasA.
  • Selection and activation of the 349-kDa Cascade subcomplex showed the loss of one CasD subunit and up to two CasC subunits.
  • The total molecular weight of Cascade increased by the mass of a single ssDNA probe, indicating the presence of one accessible crRNA (Supplementary Fig. 6c).

Structure of Cascade

  • The authors gained insight into the structural organization of Cascade by singleparticle EM (Fig. 6).
  • The observed change suggests that Cascade undergoes conformational changes when it binds target DNA.
  • Apart from the six better-resolved CasC subunits in these projections (Fig. 6p,q), the difference map showed one region compatible with two CasB copies (Fig. 6r), consistent with the CasB homodimer observed with MS (Fig. 5a).
  • The combined observations result in a structural model in which the positions of CasA, CasB and CasC can be assigned with confidence, contributing the curled-up tail, nose and torso of the seahorse, respectively (Fig. 6t).
  • Raw electron micrographs and an overview of the particle analysis method are given in Supplementary Figures 7 and 8, respectively.

DISCUSSION Composition of Cascade

  • Cascade is a ribonucleoprotein complex that has a key role in CRISPRbased defense against mobile genetic elements.
  • In addition to sequence-specific DNA recognition, Cascade also interacts nonspecifically with DNA.
  • The sequence-specific interaction gives rise to the formation of an R-loop30—a structure that is known to prime replication of bacterial plasmids41, bacterial chromosomes42 and mitochondrial DNA43.
  • Mass spectrometry showed the existence of a 107-kDa subcomplex containing single copies of CasC, CasD, CasE and crRNA (Fig. 5b).the authors.
  • The authors expect that the 61-nucleotide molecule is bound by a substantial part of Cascade.

METHODS

  • Methods and any associated references are available in the online version of the paper at http://www.nature.com/nsmb/.
  • Supplementary information is available on the Nature Structural & Molecular Biology website, also known as Note.

AcKnOWLEDgMEntS

  • The authors thank M.H. Lai for experimental contributions, and E. Schaible, P. Zwart and M. Bokhove for technical support and for assistance with post processing of SAXS data.
  • The authors thank The Netherlands Proteomics Center for financial support.

AUtHOR cOntRIBUtIOnS

  • Karginov, F.V. & Hannon, G.J. The CRISPR system: small RNA-guided defense in bacteria and archaea.
  • Thomas, M., White, R.L. & Davis, R.W. Hybridization of RNA to double-stranded DNA: formation of R-loops.

ONLINE METHODS

  • Cascade complexes were produced and purified as described20 using the expression plasmids listed in Supplementary Table 3.
  • Proteins were sprayed from borosilicate glass capillaries and analyzed on a LCT electrospray time-of-flight or modified quadrupole time-offlight instruments (both from Waters) adjusted for optimal performance in high mass detection48,49.
  • The reaction was stopped by the addition of 4.8 µl β-mercaptoethanol and 5.3 µl 0.5 M EDTA, followed by phenol-chloroform extraction and precipitation with ethanol.
  • The cleavage products were separated on a denaturing 10% polyacrylamide gel together with G+A-sequencing of oligonucleotides50 and visualized by autoradiography.

Did you find this useful? Give us your feedback

Content maybe subject to copyright    Report

University of Groningen
Structural basis for CRISPR RNA-guided DNA recognition by Cascade
Jore, Matthijs M.; Lundgren, Magnus; van Duijn, Esther; Bultema, Jelle B.; Westra, Edze R.;
Waghmare, Sakharam P.; Wiedenheft, Blake; Pul, Uemit; Wurm, Reinhild; Wagner, Rolf
Published in:
Nature Structural & Molecular Biology
DOI:
10.1038/nsmb.2019
IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from
it. Please check the document version below.
Document Version
Publisher's PDF, also known as Version of record
Publication date:
2011
Link to publication in University of Groningen/UMCG research database
Citation for published version (APA):
Jore, M. M., Lundgren, M., van Duijn, E., Bultema, J. B., Westra, E. R., Waghmare, S. P., Wiedenheft, B.,
Pul, U., Wurm, R., Wagner, R., Beijer, M. R., Barendregt, A., Zhou, K., Snijders, A. P. L., Dickman, M. J.,
Doudna, J. A., Boekema, E. J., Heck, A. J. R., van der Oost, J., ... Pul, Ü. (2011). Structural basis for
CRISPR RNA-guided DNA recognition by Cascade.
Nature Structural & Molecular Biology
,
18
(5), 529-
U141. https://doi.org/10.1038/nsmb.2019
Copyright
Other than for strictly personal use, it is not permitted to download or to forward/distribute the text or part of it without the consent of the
author(s) and/or copyright holder(s), unless the work is under an open content license (like Creative Commons).
The publication may also be distributed here under the terms of Article 25fa of the Dutch Copyright Act, indicated by the “Taverne” license.
More information can be found on the University of Groningen website: https://www.rug.nl/library/open-access/self-archiving-pure/taverne-
amendment.
Take-down policy
If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately
and investigate your claim.
Downloaded from the University of Groningen/UMCG research database (Pure): http://www.rug.nl/research/portal. For technical reasons the
number of authors shown on this cover page is limited to 10 maximum.
Download date: 10-08-2022

A r t i c l e s
nAture structurAl & moleculAr biology VOLUME 18 NUMBER 5 MAY 2011 529
The constant pressure of invading viruses and conjugative plasmids
has shaped the evolution of host defense systems in prokaryotes. The
widely distributed CRISPR immune system represents a recently
discovered adaptive and inheritable defense strategy
1–5
. The system
consists of repeats that are interspersed with unique sequences called
spacers, which are derived from viral and plasmid DNA
6–8
.
The Cas protein machinery is encoded by gene clusters that are located
close to the CRISPR locus
9
. Multiple types of cas gene sets have been rec-
ognized
10,11
that correlate with specific families of repeat sequences
12
.
The mechanism by which CRISPR and Cas induce immunity has been
divided into three stages. In the first stage—CRISPR adaptation—the
host encounters an invader and integrates a random fragment of foreign
DNA nondirectionally into the CRISPR locus as a new spacer, resulting
in resistance to foreign genetic elements carrying this sequence
13–15
. The
metal-dependent DNase Cas1 might be involved in generating small
DNA fragments that are used as precursors for CRISPR adaptation
16
.
The second stage—CRISPR expression—involves transcription and
translation of the cas genes and transcription of the CRISPR, yielding a
precursor CRISPR RNA (pre-crRNA). The pre-crRNA is cleaved in the
repeat regions
17–19
by specific Cas endoRNases: CasE from E. coli
20
, Cas6
from Pyrococcus furiosus
21
and Csy4 from Pseudomonas aeruginosa
22
.
CasE-generated mature crRNAs remain bound to a protein complex
called Cascade (CRISPR-associated complex for antiviral defense),
which consists of five Cas proteins of the Cse-type (CasABCDE; Fig. 1a).
Cas6-generated crRNAs end up in the Cmr complex (Cmr1–Cmr6) and
are further trimmed at the 3 end from around 67 to 39 or 45 nucleotides.
The guide RNA-loaded Cmr complex cleaves single-stranded target RNA
sequence-specifically
23
. In E. coli, the third stage—CRISPR interference—
requires not only Cascade loaded with anti-invader crRNA, but also the
participation of the predicted nuclease-helicase Cas3. Because crRNAs
that are complementary to either strand of the phage DNA provide resist-
ance, it has been proposed that Cascade is a crRNA-guided complex that
targets DNA rather than mRNA
20
. In vivo experiments in Staphylococcus
epidermidis involving a conjugative plasmid proved that the Csm-type
CRISPR-Cas system targets DNA
24
. In Streptococcus thermophilus,
bacteriophage resistance and plasmid loss result from site-specific
cleavage of viral and plasmid DNA within the spacer-targeted sequence,
and this process requires the csn1-like gene
15
.
Although viral or plasmid DNA seems to be the target molecule
in these three bacterial systems, there is no biochemical evidence
1
Laboratory of Microbiology, Department of Agrotechnology and Food Sciences, Wageningen University, Wageningen, The Netherlands.
2
Biomolecular Mass
Spectrometry and Proteomics Group, Bijvoet Center for Biomolecular Research and the Utrecht Institute for Pharmaceutical Sciences, Utrecht University, Utrecht,
The Netherlands.
3
The Netherlands Proteomics Center, Utrecht, The Netherlands.
4
Department of Biophysical Chemistry, Groningen Biomolecular Sciences and
Biotechnology Institute, University of Groningen, Groningen, The Netherlands.
5
ChELSI Institute, Department of Chemical and Process Engineering, University
of Sheffield, Sheffield, UK.
6
Howard Hughes Medical Institute, University of California, Berkeley, Berkeley, California, USA.
7
Department of Molecular and Cell
Biology, University of California, Berkeley, Berkeley, California.
8
Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA.
9
Physikalische Biologie, Heinrich-Heine-Universität Düsseldorf, Düsseldorf, Germany.
10
Present addresses: Department of Cell and Molecular Biology, Uppsala
University, Uppsala, Sweden (M.L.); MRC Clinical Sciences Centre, Imperial College London, London, UK (A.P.L.S.).
11
These authors contributed equally to this work.
Correspondence should be addressed to J.v.d.O. (john.vanderoost@wur.nl).
Received 6 May 2010; accepted 24 January 2011; published online 3 April 2011; doi:10.1038/nsmb.2019
Structural basis for CRISPR RNA-guided DNA
recognition by Cascade
Matthijs M Jore
1,11
, Magnus Lundgren
1,10,11
, Esther van Duijn
2,3,11
, Jelle B Bultema
4,11
, Edze R Westra
1
,
Sakharam P Waghmare
5
, Blake Wiedenheft
6–8
, Ümit Pul
9
, Reinhild Wurm
9
, Rolf Wagner
9
, Marieke R Beijer
1
,
Arjan Barendregt
2,3
, Kaihong Zhou
6–8
, Ambrosius P L Snijders
5,10
, Mark J Dickman
5
, Jennifer A Doudna
6–8
,
Egbert J Boekema
4
, Albert J R Heck
2,3
, John van der Oost
1
& Stan J J Brouns
1
The CRISPR (clustered regularly interspaced short palindromic repeats) immune system in prokaryotes uses small guide RNAs to 
neutralize invading viruses and plasmids. In Escherichia coli, immunity depends on a ribonucleoprotein complex called Cascade. 
Here we present the composition and low-resolution structure of Cascade and show how it recognizes double-stranded DNA 
(dsDNA) targets in a sequence-specific manner. Cascade is a 405-kDa complex comprising five functionally essential CRISPR-
associated (Cas) proteins (CasA
1
B
2
C
6
D
1
E
1
) and a 61-nucleotide CRISPR RNA (crRNA) with 5-hydroxyl and 2,3-cyclic phosphate 
termini. The crRNA guides Cascade to dsDNA target sequences by forming base pairs with the complementary DNA strand while 
displacing the noncomplementary strand to form an R-loop. Cascade recognizes target DNA without consuming ATP, which 
suggests that continuous invader DNA surveillance takes place without energy investment. The structure of Cascade shows an 
unusual seahorse shape that undergoes conformational changes when it binds target DNA.
© 2011 Nature America, Inc. All rights reserved.

530 VOLUME 18 NUMBER 5 MAY 2011 nAture structur Al & moleculAr biology
A r t i c l e s
to explain how Cas protein complexes recognize their target DNA
molecules. Here we show that the molecular basis of the specificity
of CRISPR immune system relies on the formation of R-loops: that is,
the formation of Watson-Crick base pairs between the crRNA-spacer
sequence and the complementary strand of double-stranded target
DNA. We also present a structural model of Cascade that provides
insight into the architecture of Cas protein complexes.
RESULTS
Core subcomplexes of Cascade
The E. coli K12 CRISPR-Cas system (Cse-subtype) consists of a gene
cluster that includes cas3, the Cascade genes (casABCDE), cas1, cas2
and a downstream CRISPR locus
20
(Fig. 1a). To investigate the role of
the individual Cascade subunits, we first tested whether each subunit
was required for antiviral defense. Viral plaque assays with Cas3 and
Cascade lacking one type of protein subunit showed that all protein
components of Cascade are indispensible for the virus-resistant pheno-
type of E. coli (Supplementary Fig. 1).
We systematically overproduced and purified different combinations
of Cascade subunits and checked for the presence of mature crRNA.
We found that CasA or CasAB could be omitted without affecting the
apparent stoichiometry of the remaining subunits or the mature crRNA
(Fig. 1b,c). Cascade, unlike CasBCDE and CasCDE, always copurified
with large nucleic acid molecules (>300 nucleotides; Fig. 1c). Removal
of the Cas proteins followed by nuclease treat-
ments showed that RNase A only hydrolyzed
the crRNA, whereas DNase I removed the long
nucleic acids, thereby identifying the copurified
nucleic acid as DNA (Fig. 1d). Size exclusion
chromatography of the three types of complex
revealed that the majority of CasBCDE and
CasCDE were present in a single form, whereas
Cascade showed a substantial void peak in
addition to a discrete peak at ~11 ml (Fig. 1e).
DNase I treatment before gel filtration elimi-
nated the void peak without disrupting the
discrete Cascade peak, again indicating the
presence of Cascade-bound DNA (Fig. 1e).
Architecture of crRNA
The characteristics of the mature crRNA species
have been accurately determined by subjecting
mature crRNAs isolated from Cascade to dena-
turing RNA chromatography
25,26
and electro-
spray ionization mass spectrometry (ESI-MS).
To simplify the analysis, we obtained a uniform crRNA preparation by
coexpressing Cascade with a designed CRISPR containing eight repeats
and seven identical spacers (denoted R44 CRISPR (Supplementary
Fig. 2)). This setup resulted in a Cascade preparation in which each
molecule was loaded with the same crRNA. We used chromatography
to show the purity and homogeneity of the crRNA in Cascade (Fig. 2a),
and the retention time was consistent with an approximate length of
60 nucleotides. The ESI-MS spectra indicated that the crRNA had a
molecular weight of 19,660.80 Da (Fig. 2b), which corresponds well to
an expected molecular weight of 19,660.82 Da for a 61-nucleotide crRNA
resulting from single CasE endoRNase cleavage events in each repeat.
ESI-MS/MS analysis of crRNA treated with RNase T1 and RNase A
resulted in a number of oligoribonucleotide digests that could be
assigned to the mature crRNA sequence (Supplementary Fig. 3)
and were consistent with a cleavage site 5 of the terminal base of the
hairpin
20
. The molecular weight analysis of the crRNA indicated a 5-
hydroxyl group and a 2,3-cyclic phosphate terminus. We confirmed
the latter by acid treatment of the crRNA, which showed a mass shift
of 18 Da, corresponding to the hydrolysis of the cyclic phosphate to a
2 or 3 phosphate (Fig. 2c). Mature crRNA is 61 nucleotides long and
contains the 32-nucleotide spacer sequence, flanked by repeat-derived
sequences on either end: 8 bases at the 5 terminus (5 handle) and 21
bases forming a hairpin with a tetranucleotide loop at the 3 terminus
(3 handle; Fig. 2d).
0 2 4 6 8 10
Elution volume (ml)
12 14
1
1 2 12
1
0
100
CasA
300
150
80
50
crRNA
CasC*
CasC
CasD
CasE
CasB*
M
a e
b c d
cas3 casA casB casC
Cascade
CRISPR
casD cas1 cas2casE
M
Cascade
CasBCDE
CasCDE
Cascade
Cascade
CasBCDE
+ RNase
A
+ DNase l
CasCDE
75
50
25
20
15
37
CasCDE
CasBCDE
Cascade
Cascade +
DNase l
1
0
Normalized absorbance (A
280
)
1
0
0
Figure 1 Core complexes of Cascade retain
crRNA. (a) Schematic diagram of the CRISPR-
Cas locus in E. coli K12 containing cas3 (ygcB),
casA (cse1, ygcL), casB (cse2, ygcK), casC (cse4,
ygcJ), casD (cas5e, ygcI), casE (cse3, ygcH),
cas1 (ygbT) and cas2 (ygbF)
9,10
. (b) Coomassie
blue–stained SDS-polyacrylamide gel showing
StrepTactin-purified Cascade, CasBCDE and
CasCDE. Protein marker sizes in kDa. Asterisk,
Strep-tagged subunits. (c) Ethidium bromide
stained denaturing PAA-gel showing nucleic acids
isolated from purified Cascade (sub)complexes.
RNA marker sizes in nucleotides. (d) RNase A
or DNase I treatment of Cascade-bound nucleic
acids. (e) Size exclusion elution profiles of
CasCDE, CasBCDE and Cascade before and after
DNase I treatment.
200
a c
d
b
Absorbance (mV)
0
100
–28
–27
–26
–25
–24
–23
–22
–21
–21
–20
–19
–18
–17
–16
–15
1,091.269
Relative intensity (%)
0
700 800 900 1,000 1,100 1,200 m/z
m/z
0 10
Retention time (min)
20
1,091
5 handle
5OH
2, 3P
3 handle
Spacer
1,091.2
1,092.3
crRNA
Acid-treated crRNA
1,093.5
1,093.4
1,092.2
Observed M
w
= 19,660.80 Da
Observed M
w
= 19,678.81 Da
Theoretical M
w
= 19,678.83 Da
Theoretical M
w
= 19,660.82 Da
crRNA
1,092 1,093 1,094 1,095 1,096
Figure 2 Architecture of crRNA. (a) Ion-pair reversed-phase HPLC purification of mature R44
crRNA at 75 °C. (b) Multiple-charged ESI-MS spectrum of the purified mature crRNA. (c) Enhanced
view of the −18 charged species before (top) and after (bottom) acid treatment indicating hydrolysis
of the 2,3-cyclic phosphate. (d) Diagram of mature crRNA derived from the R44 CRISPR.
© 2011 Nature America, Inc. All rights reserved.

nAture structurAl & moleculAr biology VOLUME 18 NUMBER 5 MAY 2011 531
A r t i c l e s
Target DNA recognition
The observation that DNA copurified with Cascade prompted us
to analyze the DNA-binding behavior of Cascade in detail (Fig. 3).
Electrophoretic mobility shift assays (EMSAs) showed that Cascade
could bind single-stranded (ss) DNA containing a sequence complemen-
tary to the spacer sequence of the crRNA (Fig. 3a,c and Supplementary
Fig. 4). Cascade also bound double-stranded target DNA (Fig. 3b,d)
without the need for, or enhancement by, additional cofactors such as
divalent metal ions or ATP (data not shown). The dissociation constants
(K
d
) of Cascade for single- and double-stranded target DNA were 8 and
790 nM, respectively. Cascade also bound weakly to nontarget DNA
(Fig. 3c,d). Competitor DNA had little effect on preformed Cascade–
target DNA complexes, indicating that the interaction between Cascade
and complementary DNA substrates is stable. At high competitor con-
centrations a proportion of Cascade lacking CasA bound target DNA,
as was evident from the faster migration rate of the CasBCDE–DNA
complex (Fig. 3a,c and Supplementary Fig. 4).
Cascade subcomplexes that lacked CasA (CasBCDE and CasCDE)
showed only sequence-specific binding to ssDNA and dsDNA targets,
and were not affected by the addition of competitor DNA (Fig. 3eh).
The addition of purified CasA to CasBCDE preparations restored
Cascade-like nonspecific DNA binding (Supplementary Fig. 5), but
neither CasA alone (Supplementary Fig. 5)
nor a combination of CasA and CasCDE (not
shown) showed nonspecific DNA binding.
Competition assays between Cascade and
CasBCDE showed that a ratio of 1:25 was
required to distribute the ssDNA target
equally between the types of complexes
(Fig. 4a). This difference was more pro-
nounced when we used dsDNA target probes
(data not shown). Furthermore, much less
dsDNA target was shifted by CasBCDE than
by Cascade under equivalent conditions
(Fig. 3d,f), indicating again that CasA
enhanced target DNA localization.
It seems that Cascade recognizes 32-
nucleotide target sequences (referred to
as protospacers) in dsDNA molecules by
formation of base pairs between the crRNA spacer sequence and the
complementary DNA strand. Analysis of long dsDNA targets (proto-
spacer with 27-base-pair (bp) flanks) showed that both strands shifted
(Fig. 4b), probably owing to base-paired flanking regions. Only the
complementary strand shifted when we used short dsDNA targets
(corresponding to the protospacer), which suggests that the formation
of base pairs between crRNA and the complementary strand displaced
the noncomplementary strand (Fig. 4c).
We demonstrated that the noncomplementary strand had been dis-
placed by performing enzymatic and chemical footprint analyses specific
for ssDNA
27
using Cascade loaded with targeting (R44) and nontarget-
ing (K12) crRNA. Endonuclease P1 footprints showed that an 18-base
region (G27–C44) of the nontargeted strand corresponding to about half
of the protospacer was susceptible to cleavage after binding by Cascade
(Fig. 4d). In line with this observation, six thymines and one adenine
in the same region of the nontargeted strand were also sensitive to per-
manganate modification and subsequent piperidine cleavage (Fig. 4e).
These results indicate that only the first half of the nontargeted strand
of the protospacer was exposed in the Cascade–DNA complex and the
second half of the protospacer was shielded. The targeted strand was
sensitive to P1 in a region that overlapped the protospacer adjacent
motif (PAM)
28
(Fig. 4d), which may be indicative of a distortion in
Targeting Cascade
and ssDNA
a
c
e
g
i j
h
f
d
b
Competitor
concentration
Cascade–
DNA complex
CasBCDE–
DNA complex
Unbound DNA
Competitor
concentration
Cascade–
DNA complex
CasBCDE–
DNA complex
Cascade
concentration
Cascade–
RNA complex
CasBCDE–
DNA complex
CasCDE–
DNA complex
CasCDE
Cascade
CasBCDE
Cascade
Cascade
Unbound DNA
Unbound DNA
Unbound RNA
Unbound DNA
Nontargeting
Cascade and ssDNA
Target ssDNA Nontarget ssDNA Target dsDNA Nontarget dsDNA
Target ssRNANontarget ssRNA
Target dsRNANontarget dsRNA
Targeting Cascade
and dsDNA
Nontargeting
Cascade and dsDNA
Figure 3 Target recognition by Cascade.
(a,b) Effect of the type of crRNA bound.
Cascade was loaded with either targeting
crRNA (R44 CRISPR, Supplementary Fig. 2)
or nontargeting crRNA (K12 CRISPR).
The binding of these two types of Cascade
complex to one type of probe is shown. DNA
probes are 86-nucleotide ssDNA or dsDNA
sequences containing the R44 protospacer
(32 nucleotides) flanked by 27 nucleotides
on either end. (ch) Effect of uniform crRNA-
loaded (sub)complexes (R44 CRISPR) on the
binding of single- or double-stranded target
and nontarget DNA. Nontarget DNA probes
contain a scrambled R44 protospacer sequence.
(i,j) Effect of uniform crRNA-loaded Cascade
(R44 CRISPR) on the binding of target and
nontarget ssRNA and dsRNA. (ah) Labeled
probe concentration 1 nM; DNA competitor
concentration 2,500, 500, 50, 5 and
0.5 ng µl
−1
(highest concentration not used for
CasCDE); protein concentration 200–300 nM
except in i,j where the Cascade concentrations
were 200, 100, 50, 25 and 12.5 nM.
© 2011 Nature America, Inc. All rights reserved.

532 VOLUME 18 NUMBER 5 MAY 2011 nAture structur Al & moleculAr biology
A r t i c l e s
DNA conformation resulting from Cascade-induced strand opening
further downstream. Permanganate assays also indicated that a single
base of the targeted strand of the protospacer (T33) was sensitive to
modification (Fig. 4e), suggesting that this base is unpaired in the
hybrid crRNA–DNA duplex. Exonuclease III footprints showed that
the sequences that flank the protospacer are double-stranded and that
Cascade protects a region of around 9 bases including the PAM on one
flank and around 14 bases on the non-PAM flank (Fig. 4f).
To show that crRNA and DNA form base pairs, we used a 3-kb plasmid
containing the target sequence. Specific binding of R44-Cascade to the
supercoiled plasmid resulted in a mobility shift of the plasmid (Fig. 4g).
The retardation of the plasmid was partially relieved by proteolytic
a
Target ssDNA
without competitor
CasBCDE conc.
Cascade conc.
Cascade
conc.
Cascade–
DNA complex
Cascade-R44
+
+
+
+
+
+
+
+
+
+
+
+
Seq
Seq
Seq
Nontarget strand
G59
T21
T43
T28
T73
T20
T28
C59
G59
A28
T33
T38
T34
T32
A28
T28
C44
C59
C59
G59
G31
A28
G27
Target plasmid
3.5 kb OC
Target
strand
Exo. lll
3
5
Nontarget
strand
Cascade
PAM
Spacer/protospacer
Exo. lll
5OH
5
2, 3P
3
10
20
30
###### ## ####### # # #
######
**** ** *
*
40
50
60
70
80
L
SC
3.0 kb
2.5 kb
M
+ ++++++ +
+++
++ +
+++ +
+ +
Nontarget strandTarget strand
Seq
Target strand
Seq
Nontarget strand
Seq
Target strand
Cascade–
DNA complex
Cascade
conc.
Cascade–
DNA complex
CasBCDE–
DNA complex
Unbound DNA
Unbound DNA
Unbound DNA
Nuclease P1
KMnO
4
Exonuclease III
Target ssDNA
with competitor
Long dsDNA
complementary
strand labeled
Short dsDNA
complementary
strand labeled
Long dsDNA
noncomplementary
strand labeled
Short dsDNA
noncomplementary
strand labeled
b c
d
g h
e f
Cascade-R44
Cascade-K12
Proteinase K
RNaseH
Cascade-K12
Figure 4 R-loop formation by Cascade. (a) Competition assay between R44-crRNA–loaded Cascade and CasBCDE for R44 ssDNA target. Total protein
concentration was 500 nM in each reaction, and the Cascade:CasBCDE ratio was 1:0, 100:1, 10:1, 1:1, 1:10, 1:100 and 0:1. DNA competitor
concentration was 1 µg µl
−1
. (b,c) Effect of labeling the complementary or noncomplementary strand of a target dsDNA with 27-bp flanks (b) or without
protospacer-flanking sequences (c). Cascade concentrations were 1,500, 300, 60 and 12.5 nM. (d,e) Mapping of ssDNA regions in the Cascade–target
DNA complex using nuclease P1 and KMnO
4
. Sensitive regions are indicated by dashed lines and the protospacer by a solid line according to the G+A
sequencing lanes of each strand. Cascade loaded with K12-derived crRNA was used as a control. (f) Exonuclease III mapping of accessible dsDNA
regions upstream and downstream of the Cascade–DNA complex. The borders of the Cascade-protected regions are indicated by arrows. (g) Detection of
the R-loop in a target plasmid. Agarose gel indicating the mobility of the different plasmid forms (SC, supercoiled; L, linear; OC, open circular) and the
mobility shifts caused by R44-Cascade and R44-crRNA binding. (h) Schematic diagram of the R-loop formed in crRNA-guided dsDNA recognition by
Cascade. Regions sensitive to nuclease P1 and KMnO
4
are indicated by hash and asterisk signs, respectively.
© 2011 Nature America, Inc. All rights reserved.

Citations
More filters
Journal ArticleDOI
19 May 2016-Nature
TL;DR: E engineered fusions of CRISPR/Cas9 and a cytidine deaminase enzyme that retain the ability to be programmed with a guide RNA, do not induce dsDNA breaks, and mediate the direct conversion of cytidine to uridine, thereby effecting a C→T (or G→A) substitution.
Abstract: Current genome-editing technologies introduce double-stranded (ds) DNA breaks at a target locus as the first step to gene correction. Although most genetic diseases arise from point mutations, current approaches to point mutation correction are inefficient and typically induce an abundance of random insertions and deletions (indels) at the target locus resulting from the cellular response to dsDNA breaks. Here we report the development of 'base editing', a new approach to genome editing that enables the direct, irreversible conversion of one target DNA base into another in a programmable manner, without requiring dsDNA backbone cleavage or a donor template. We engineered fusions of CRISPR/Cas9 and a cytidine deaminase enzyme that retain the ability to be programmed with a guide RNA, do not induce dsDNA breaks, and mediate the direct conversion of cytidine to uridine, thereby effecting a C→T (or G→A) substitution. The resulting 'base editors' convert cytidines within a window of approximately five nucleotides, and can efficiently correct a variety of point mutations relevant to human disease. In four transformed human and murine cell lines, second- and third-generation base editors that fuse uracil glycosylase inhibitor, and that use a Cas9 nickase targeting the non-edited strand, manipulate the cellular DNA repair response to favour desired base-editing outcomes, resulting in permanent correction of ~15-75% of total cellular DNA with minimal (typically ≤1%) indel formation. Base editing expands the scope and efficiency of genome editing of point mutations.

3,384 citations

Journal ArticleDOI
TL;DR: It is demonstrated that the Cas9–crRNA complex of the Streptococcus thermophilus CRISPR3/Cas system introduces in vitro a double-strand break at a specific site in DNA containing a sequence complementary to crRNA, paving the way for engineering of universal programmable RNA-guided DNA endonucleases.
Abstract: Clustered, regularly interspaced, short palindromic repeats (CRISPR)/CRISPR-associated (Cas) systems provide adaptive immunity against viruses and plasmids in bacteria and archaea. The silencing of invading nucleic acids is executed by ribonucleoprotein complexes preloaded with small, interfering CRISPR RNAs (crRNAs) that act as guides for targeting and degradation of foreign nucleic acid. Here, we demonstrate that the Cas9–crRNA complex of the Streptococcus thermophilus CRISPR3/Cas system introduces in vitro a double-strand break at a specific site in DNA containing a sequence complementary to crRNA. DNA cleavage is executed by Cas9, which uses two distinct active sites, RuvC and HNH, to generate site-specific nicks on opposite DNA strands. Results demonstrate that the Cas9–crRNA complex functions as an RNA-guided endonuclease with RNA-directed target sequence recognition and protein-mediated DNA cleavage. These findings pave the way for engineering of universal programmable RNA-guided DNA endonucleases.

2,435 citations


Cites background from "Structural basis for CRISPR RNA-gui..."

  • ...The mechanism proposed here for the cleavage of dsDNA by the Cas9–crRNA complex differs significantly from that for the type I-E (former “Ecoli”) system (7)....

    [...]

  • ...coli type I-E system crRNA andCas proteins assemble into a large ribonucleoprotein complex, Cascade, that facilitates target recognition by enhancing sequence-specific hybridization between the crRNA and complementary target sequences (7)....

    [...]

  • ...crRNAs combine with Cas proteins to form an effector complex which recognizes the target sequence in the invasive nucleic acid by base pairing to the complementary strand (7) and induces sequence-specific cleavage, thereby preventing proliferation and propagation of foreign genetic elements....

    [...]

Journal ArticleDOI
16 Feb 2012-Nature
TL;DR: Understanding how small RNAs are used to find and destroy foreign nucleic acids will provide new insights into the diverse mechanisms of RNA-controlled genetic silencing systems.
Abstract: Clustered regularly interspaced short palindromic repeat (CRISPR) are essential components of nucleic-acid-based adaptive immune systems that are widespread in bacteria and archaea. Similar to RNA interference (RNAi) pathways in eukaryotes, CRISPR-mediated immune systems rely on small RNAs for sequence-specific detection and silencing of foreign nucleic acids, including viruses and plasmids. However, the mechanism of RNA-based bacterial immunity is distinct from RNAi. Understanding how small RNAs are used to find and destroy foreign nucleic acids will provide new insights into the diverse mechanisms of RNA-controlled genetic silencing systems.

1,720 citations

Journal ArticleDOI
14 Mar 2014-Science
TL;DR: To compare the architectures and domain organization of diverse Cas9 proteins, the atomic structures of Cas9 from Streptococcus pyogenes and Actinomyces naeslundii and AnaCas9 were determined by x-ray crystallography and three-dimensional reconstructions of apo-SpyCas9, SpyCas9:RNA, and SpyCas 9:RNA:DNA were obtained by negative-stain single-particle electron microscopy.
Abstract: Type II CRISPR (clustered regularly interspaced short palindromic repeats)-Cas (CRISPR-associated) systems use an RNA-guided DNA endonuclease, Cas9, to generate double-strand breaks in invasive DNA during an adaptive bacterial immune response. Cas9 has been harnessed as a powerful tool for genome editing and gene regulation in many eukaryotic organisms. We report 2.6 and 2.2 angstrom resolution crystal structures of two major Cas9 enzyme subtypes, revealing the structural core shared by all Cas9 family members. The architectures of Cas9 enzymes define nucleic acid binding clefts, and single-particle electron microscopy reconstructions show that the two structural lobes harboring these clefts undergo guide RNA-induced reorientation to form a central channel where DNA substrates are bound. The observation that extensive structural rearrangements occur before target DNA duplex binding implicates guide RNA loading as a key step in Cas9 activation.

1,034 citations


Cites background from "Structural basis for CRISPR RNA-gui..."

  • ...7B), as observed for other CRISPRCas targeting complexes (36)....

    [...]

Journal ArticleDOI
14 Jan 2016-Cell
TL;DR: Recent advances in understanding the diverse mechanisms by which Cas proteins respond to foreign nucleic acids are discussed and how these systems have been harnessed for precision genome manipulation in a wide array of organisms.

878 citations


Cites background from "Structural basis for CRISPR RNA-gui..."

  • ...After a spacer is acquired from a new invader, the resulting crRNA assembles with Cas proteins to form Cascade, the interference complex capable of targeting PAM-adjacent DNA sequences matching the spacer sequence of the crRNA (Brouns et al., 2008; Jore et al., 2011; Lintner et al., 2011)....

    [...]

  • ...…that polymerizes along the crRNA and determines the crescent-shaped, semi-helical architecture seen in all structurally characterized Cascade complexes (Hochstrasser et al., 2014; Jackson et al., 2014; Jore et al., 2011; Mulepati et al., 2014;Wiedenheft et al., 2011a; Zhao et al., 2014)....

    [...]

  • ...Cas6 (Cas6e in Type I-E systems) remains bound to the 30 hairpin following CRISPR maturation, while Cas5 binds the 50 handle (Brouns et al., 2008; Jore et al., 2011)....

    [...]

  • ...The Cas7 subunits form the ‘‘backbone’’ that polymerizes along the crRNA and determines the crescent-shaped, semi-helical architecture seen in all structurally characterized Cascade complexes (Hochstrasser et al., 2014; Jackson et al., 2014; Jore et al., 2011; Mulepati et al., 2014;Wiedenheft et al., 2011a; Zhao et al., 2014)....

    [...]

  • ...After a spacer is acquired from a new invader, the resulting crRNA assembles with Cas proteins to form Cascade, the interference complex capable of targeting PAM-adjacent DNA sequences matching the spacer sequence of the crRNA (Brouns et al., 2008; Jore et al., 2011; Lint- ner et al., 2011)....

    [...]

References
More filters
Book ChapterDOI
TL;DR: The chapter presents techniques for producing discrete DNA fragments, end-labeling DNA, segregating end- labeled fragments, extracting DNA from gels, and the protocols for partially cleaving it at specific bases using the chemical reactions.
Abstract: Publisher Summary This chapter discusses the sequencing end-labeled DNA with base-specific chemical cleavages. In the chemical DNA sequencing method, one end-labels the DNA, partially cleaves it at each of the four bases in four reactions, orders the products by size on a slab gel, and then reads the sequence from an autoradiogram by noting which base-specific agent cleaved at each successive nucleotide along the strand. This technique sequences the DNA made in and purified from cells. No enzymatic copying in vitro is required, and either single- or double-stranded DNA can be sequenced. Most chemical schemes that cleave at one or two of the four bases involve three consecutive steps: modification of a base, removal of the modified base from its sugar, and DNA strand scission at that sugar. Base-specific chemical cleavage is only one step in sequencing DNA. The chapter presents techniques for producing discrete DNA fragments, end-labeling DNA, segregating end-labeled fragments, extracting DNA from gels, and the protocols for partially cleaving it at specific bases using the chemical reactions. The chapter also discusses the electrophoresis of the chemical cleavage products on long-distance sequencing gels and a guide for troubleshooting problems in sequencing patterns.

12,321 citations


"Structural basis for CRISPR RNA-gui..." refers methods in this paper

  • ...The cleavage products were separated on a denaturing 10% polyacrylamide gel together with G+A-sequencing of oligonucleotide...

    [...]

Journal ArticleDOI
23 Mar 2007-Science
TL;DR: It is found that, after viral challenge, bacteria integrated new spacers derived from phage genomic sequences, and CRISPR provided resistance against phages, and resistance specificity is determined by spacer-phage sequence similarity.
Abstract: Clustered regularly interspaced short palindromic repeats (CRISPR) are a distinctive feature of the genomes of most Bacteria and Archaea and are thought to be involved in resistance to bacteriophages. We found that, after viral challenge, bacteria integrated new spacers derived from phage genomic sequences. Removal or addition of particular spacers modified the phage-resistance phenotype of the cell. Thus, CRISPR, together with associated cas genes, provided resistance against phages, and resistance specificity is determined by spacer-phage sequence similarity.

5,045 citations


"Structural basis for CRISPR RNA-gui..." refers background in this paper

  • ...In the first stage—CRISPR adaptation—the host encounters an invader and integrates a random fragment of foreign DNA nondirectionally into the CRISPR locus as a new spacer, resulting in resistance to foreign genetic elements carrying this sequenc...

    [...]

Journal ArticleDOI
TL;DR: In this paper, a method is proposed for the determination of the optimum value of the regularization parameter (Lagrange multiplier) when applying indirect transform techniques in small-angle scattering data analysis.
Abstract: A method is proposed for the determination of the optimum value of the regularization parameter (Lagrange multiplier) when applying indirect transform techniques in small-angle scattering data analysis. The method is based on perceptual criteria of what is the best solution. A set of simple criteria is used to construct a total estimate describing the quality of the solution. Maximization of the total estimate is straightforward. Model computations show the effectiveness of the technique. The method is implemented in the program GNOM [Svergun, Semenyuk & Feigin (1988). Acta Cryst. A44, 244–250].

3,477 citations

Journal ArticleDOI
TL;DR: A program suite for one-dimensional small-angle scattering data processing running on IBM-compatible PCs under Windows 9x/NT/2000/XP is presented and PRIMUS enables model-independent singular value decomposition or linear fitting if the scattering from the components is known.
Abstract: A program suite for one-dimensional small-angle scattering data processing running on IBM-compatible PCs under Windows 9x/NT/2000/XP is presented. The main program, PRIMUS, has a menu-driven graphical user interface calling computational modules to perform data manipulation and analysis. Experimental data in binary OTOKO format can be reduced by calling the program SAPOKO, which includes statistical analysis of time frames, averaging and scaling. Tools to generate the angular axis and detector response files from diffraction patterns of calibration samples, as well as binary to ASCII transformation programs, are available. Several types of ASCII files can be directly imported into PRIMUS, in particular, sasCIF or ILL-type files are read without modification. PRIMUS provides basic data manipulation functions (averaging, background subtraction, merging of data measured in different angular ranges, extrapolation to zero sample concentration, etc.) and computes invariants from Guinier and Porod plots. Several external modules coupled with PRIMUS via pop-up menus enable the user to evaluate the characteristic functions by indirect Fourier transformation, to perform peak analysis for partially ordered systems and to find shape approximations in terms of three-parametric geometrical bodies. For the analysis of mixtures, PRIMUS enables model-independent singular value decomposition or linear fitting if the scattering from the components is known. An interface is also provided to the general non-linear fitting program MIXTURE, which is designed for quantitative analysis of multicomponent systems represented by simple geometrical bodies, taking shape and size polydispersity as well as interparticle interference effects into account.

2,871 citations

Journal ArticleDOI
15 Aug 2008-Science
TL;DR: The results demonstrate that the formation of mature guide RNAs by the CRISPR RNA endonuclease subunit of Cascade is a mechanistic requirement for antiviral defense.
Abstract: Prokaryotes acquire virus resistance by integrating short fragments of viral nucleic acid into clusters of regularly interspaced short palindromic repeats (CRISPRs). Here we show how virus-derived sequences contained in CRISPRs are used by CRISPR-associated (Cas) proteins from the host to mediate an antiviral response that counteracts infection. After transcription of the CRISPR, a complex of Cas proteins termed Cascade cleaves a CRISPR RNA precursor in each repeat and retains the cleavage products containing the virus-derived sequence. Assisted by the helicase Cas3, these mature CRISPR RNAs then serve as small guide RNAs that enable Cascade to interfere with virus proliferation. Our results demonstrate that the formation of mature guide RNAs by the CRISPR RNA endonuclease subunit of Cascade is a mechanistic requirement for antiviral defense.

2,262 citations


Additional excerpts

  • ...c CasBCDE-DNA (1,009, 43,694, 3) d CasBCDE (773, 26,665, 1)...

    [...]

Frequently Asked Questions (16)
Q1. What are the contributions in "Structural basis for crispr rna-guided dna recognition by cascade" ?

Jore et al. this paper proposed a structural basis for CRISPR RNA-guided DNA recognition by Cascade. 

Hydrolysis of the cyclic phosphate terminus was performed by incubating the HPLC-purified crRNA in a final concentration of 0.1 M HCl at 4 °C for 1 h.ESI-MS analysis of crRNA. 

The backbone of the complex consists of six CasC (Cse4) subunits, which belong to COG1857 and are present in several Cas subtypes11. 

The crRNA is remarkably stable when bound by Cascade or the CasBCDE and CasCDE subcomplexes, indicating that it is tightly associated with the CasCDE core of the protein complex and well shielded from cellular RNase activities. 

The ATP-independence of this scanning process is in line with the absence of homology between the Cascade subunits and classical ATP-consuming helicases40. 

E. coli crRNAs are unmodified, unlike, for example, plant siRNAs and miRNAs or vertebrate piwiinteracting RNAs, which are methylated at the 2′-hydroxyl group of the 3′-terminal ribonucleotide to prevent uridylation and associated destabilization36,37. 

Eukaryotic small interfering RNA (siRNA) and microRNA (miRNA), by contrast, need to be 5′-phosphorylated in order to bind to Argonaute and serve as a guide for the RNAinduced silencing complex (RISC)35. 

As Cascade was loaded with only R44 crRNA, the authors could determine the number of bound crRNA molecules by adding a complementary ssDNA probe. 

Exonuclease III footprints showed that the sequences that flank the protospacer are double-stranded and that Cascade protects a region of around 9 bases including the PAM on one flank and around 14 bases on the non-PAM flank (Fig. 4f). 

Only the complementary strand shifted when the authors used short dsDNA targets (corresponding to the protospacer), which suggests that the formation of base pairs between crRNA and the complementary strand displaced the noncomplementary strand (Fig. 4c). 

The authors demonstrated that the noncomplementary strand had been displaced by performing enzymatic and chemical footprint analyses specific for ssDNA27 using Cascade loaded with targeting (R44) and nontargeting (K12) crRNA. 

D.H., Selengut, J., Mongodin, E.F. & Nelson, K.E. A guild of 45 CRISPRassociated (Cas) protein families and multiple CRISPR/Cas subtypes exist in prokaryotic genomes. 

Viral plaque assays with Cas3 and Cascade lacking one type of protein subunit showed that all protein components of Cascade are indispensible for the virus-resistant phenotype of E. coli (Supplementary Fig. 1). 

Size exclusion chromatography of the three types of complex revealed that the majority of CasBCDE and CasCDE were present in a single form, whereas Cascade showed a substantial void peak in addition to a discrete peak at ~11 ml (Fig. 1e). 

Analysis of long dsDNA targets (protospacer with 27-base-pair (bp) flanks) showed that both strands shifted (Fig. 4b), probably owing to base-paired flanking regions. 

These initial pre-crRNA endonuclease cleavage products are the mature form of the crRNA in E. coli, in contrast to crRNAs from P. furiosus, S. solfataricus and S. epidermidis, which are trimmed at the 3′ end19,23,38.