Development of chassis-independent recombinase-assisted genome engineering (CRAGE) enables the integration of plasmids encoding biosynthetic gene clusters into the chromosomes of diverse bacteria to optimize production of natural products in non-native strains.
Abstract:
It is generally believed that exchange of secondary metabolite biosynthetic gene clusters (BGCs) among closely related bacteria is an important driver of BGC evolution and diversification. Applying this idea may help researchers efficiently connect many BGCs to their products and characterize the products' roles in various environments. However, existing genetic tools support only a small fraction of these efforts. Here, we present the development of chassis-independent recombinase-assisted genome engineering (CRAGE), which enables single-step integration of large, complex BGC constructs directly into the chromosomes of diverse bacteria with high accuracy and efficiency. To demonstrate the efficacy of CRAGE, we expressed three known and six previously identified but experimentally elusive non-ribosomal peptide synthetase (NRPS) and NRPS-polyketide synthase (PKS) hybrid BGCs from Photorhabdus luminescens in 25 diverse γ-Proteobacteria species. Successful activation of six BGCs identified 22 products for which diversity and yield were greater when the BGCs were expressed in strains closely related to the native strain than when they were expressed in either native or more distantly related strains. Activation of these BGCs demonstrates the feasibility of exploiting their underlying catalytic activity and plasticity, and provides evidence that systematic approaches based on CRAGE will be useful for discovering and identifying previously uncharacterized metabolites.
neering (CRAGE) applicable to diverse bacterial species across
m
ultiple phyla (Fig. 1a). CRAGE is based on recombinase-assisted
genome engineering (RAGE) technology
34,35
. Use of RAGE has
allowed single-step integration of pathways comprising 48 kb
directly into the Escherichia coli chromosome without compromis-
ing integration efficiency. After simple antibiotic counter-selection,
t
he integration yield reached 100%. CRAGE extends RAGE by
enabling researchers to domesticate previously undomesticated
microbes, and substantially increases the chance of successful BGC
expression and discovery of previously uncharacterized secondary
metabolites.
Results
Developing a design principle for CRAGE. In RAGE, a landing
pad (LP) containing two mutually exclusive lox sites is first inte-
grated into a chromosome. The DNA constructs flanked by these
l
ox sites are then integrated into the LP, catalysed by Cre recombi-
nase. We modified RAGE
34,35
in three major ways to establish the
CRAGE design principle (Fig. 1a). First, we used transposon sys-
tems
36,37
(mariner system for Proteobacteria and Tn5 system for
DonorRecipient
Chromosome
Chromosome
pW17
T7RP
loxPlox5171
IRIR
KmRCre
Transposase
Chromosome
Chromosome
loxPlox5171
pW34
AprR
Chromosome
Chromosome
Chromosome
b
Step 1: Landing pad integration
Step 2: BGC integration
IPTG
luxCDABE
IPTG
Ctrl
T7RP
IRIR
KmRCre
Transposase
T7RP
loxPlox5171
IRIR
KmRCre
BGC
T7RP
IRIR
KmRCre
AprR
BGC
T7RP
AprR
BGC
a
XP01 P. luminescens subsp. laumondii TT01
XP02 P. luminescens subsp. luminescens
XP03 P. temperata subsp. khanii
XP04 X.nematophila corrig
XP05 X.doucetiae
EB01 D. zeae
EB02 D. solani
EB03 D. dadantii subsp. dadantii
EB04 D. dadantii subsp. dieffenbachiae
EB06 P. carotovorum subsp. odoriferum
EB10 S. odorifera
EB12 P. agglomerans strain Eh1087
EB13 E. pyrifoliae
EB15 E. oleae
EB16 Y. ruckeri
EB17 Y. bercovieri
EB18 Y. mollaretii
EB19 Y. aldovae
AM02 A. encheleia
AM03 A. salmonicida subsp. salmonicida
AM04 A. piscicola
AM05 A. salmonicida subsp. pectinolytica
PM01 P. simiae WCS417r
PM02 P. fluorescens Q8r1-96
PM03 P. putida KT2440
25 selected γ-Proteobacteria
Intensity (a.u.)
10
3
10
5
10
4
Fig. 1 |Chromosomal integration of BGCs mediated through CRAGE.a, Schematic for CRAGE, a genome engineering technology that allows complex
biological systems to be implemented in a broad range of microbial strains. We primarily focused on formulating the design principle combining all existing
technologies that have proven to work in a wide range of organisms. Step 1: A pW17 plasmid containing a mariner transposon and transposase was
generated. The transposon contained a Cre recombinase gene and a kanamycin-resistant gene (KmR) flanked by two mutually exclusive lox sites (loxP and
lox5171). In addition, a T7-RNA polymerase (T7RP) gene under the control of a lacUV5 regulon was incorporated into the transposon. The pW17 plasmid
was conjugated from donor E. coli strain BW29427 into the panel of recipient bacterial strains (Supplementary Tables 1 and 2), and the transposon was
integrated into their chromosomes. Step 2: A different plasmid, pW34 (R6Kr ori) or pW5 (BAC-based), encoding a BGC under control of the T7 promoter
and an apramycin-resistant gene (AprR) flanked by the two mutually exclusive lox sites, was conjugated into the recipient strain containing the LP. BGC
integration into the chromosome of these chassis strains was mediated through Cre recombinase activity. b, For 25 chassis strains containing only an LP
(control) and luxCDABE, luminescence activity was induced with four different IPTG concentrations (0, 0.01, 0.1 and 1 mM) and measured. All data were
generated from biological triplicates. The standard deviations were generally less than 10%. The colours used for b are coordinated to represent strains
classified in the same phylogenetic branches. a.u., arbitrary units.
Actinobacteria) to insert the LP into recipient bacteria chromo-
somes. Because transposon systems are commonly used in engi-
neering of diverse species ranging from prokaryotes to eukaryotes,
t
hey are suitable for the first step of domestication. Although the
transposon is randomly integrated into the chromosome of the
recipient strain, we can screen and select the transformants with the
LP integrated into the location minimally affecting the host strain’s
physiology.
Second, we used a Lac-T7 expression system
38
to control the
expression of BGCs. This system is orthogonal to native transcrip-
tion; genes under the control of a T7 promoter are not transcribed
un
less a T7 RNA polymerase (T7RNAP) is present. Using this
system, we can minimize the expression of BGCs whose prod-
ucts may be toxic to E. coli while we are assembling the BGC con-
structs. Additionally, although codon usage and ribosome binding
si
tes may need to be redesigned for each chassis strain to obtain
more optimal results, this design principle in general allows a high
degree of flexibility, so that any single construct in any CRAGE
strain can be expressed without re-cloning as long as the T7RNAP
is expressed under the control of promoters that function in the
recipient strains.
Several studies suggest that genomic integration location can
also affect the expression of integrated genes
34,39–41
. Therefore, inves-
tigating different integration locations is another viable approach
f
or exploring the effect of different expression levels on BGC activ-
ity. However, our previous study, as well as others, suggested that
t
he effect of different integration locations on enzyme and path-
way activity was generally small (at most less than about seven- to
eig
htfold, usually two- to threefold for enzyme activity and less than
two fold for pathway activity)
34,39–41
. In contrast, the Lac-T7 system
allows us to explore a much wider range of pathway activity (10-
to 1,000-fold) than we could if we explored according to integra-
tion location. Additionally, the approach of investigating different
in
tegration locations would complicate our workflow and make our
approach less attractive. Therefore, we specifically chose to use the
Lac-T7 system to explore the effect of different expression ranges
on BGC activity.
Third, we incorporated the origin of transfer (oriT) into both
the LP and BGCs carrying plasmids to use conjugation as a pri-
mary transformation method. Conjugation systems have been
u
sed for transformation of a wide spectrum of bacterial species
including those in the Proteobacteria, Actinobacteria, Firmicutes,
Bacteroidetes and Cyanobacteria phyla
42–44
. Although the present
study focuses on γ-Proteobacteria, our preliminary results demon-
strate that CRAGE can engineer bacteria across multiple phyla (for
exa
mple, α-Proteobacteria, β-Proteobacteria and Actinobacteria)
(Supplementary Figs. 1–3).
Selecting model biosynthetic gene clusters. Members of the
genus Photorhabdus, as well as the related genus Xenorhabdus, are
endosymbionts of soil-borne nematodes and are known to have
potent bioactivity toward a wide range of insects and insect larvae;
some of these entomopathogenic complexes are used as biological
in
secticides in agriculture
45,46
. These bacteria also produce numer-
ous secondary metabolites to inhibit the growth of competing
micr
obes within their hosts
47,48
. Genomes of ~50 Photorhabdus and
Xenorhabdus species have been sequenced and are accessible in
public databases. Computational analyses suggest that each of these
genomes contains ~20 to 50 putative BGCs
11,12
, and that many BGCs
divergently evolved among species within these two genera
49,50
. The
combination of extensive genomic resources and rich metabolic
potential makes these species ideal test cases for a purpose-engi-
neered multi-chassis strategy for BGC characterization.
W
e selected a model set of 10 NRPS and NRPS–PKS hybrid BGCs
from Photorhabdus luminescens subsp. laumondii TTO1 (Table 1,
Fig. 2 and Supplementary Fig. 4)
18,51
. Except for BGC6, these BGCs
had been previously cloned and heterologously expressed in E. coli.
However, only two of those nine BGCs had been successfully acti-
vated, making this study an ideal benchmark to test the efficacy
o
f the multi-chassis approach mediated by CRAGE. The selected
panel of pathways included four BGCs as controls that had been
previously studied successfully by single-chassis approaches (BGCs
4 and 9)
18,52–56
or by promoter replacement approaches in a native
strain (BGCs 1 and 6)
50,57,58
, as well as six putative BGCs that were
not functional using conventional chassis approaches (BGCs 2, 3, 5,
7, 8 and 10)
18
.
Preparing phylogenetically diverse chassis strain panels using
CRAGE. We selected 31 species of γ-Proteobacteria representing 10
different genera (Fig. 1, Supplementary Fig. 4 and Supplementary
Tables 1 and 2). The panel consisted of several Xenorhabdus and
Photorhabdus species (XPs) and many other species that are evolu-
tionarily slightly distant from XPs (other Enterobacteria (EB) and
b
acteria in the Aeromonas (AM) and Pseudomonas (PM) genera).
The panel also allowed us to systematically investigate correlations
between evolutionary relatedness and physiological compatibili-
ties of different chassis:BGC combinations. Selection criteria also
in
cluded the availability of complete or draft genome sequences
and classification as biosafety level 1. Furthermore, all the selected
strains had EntD- and/or Sfp-type phosphopantetheinyl transfer-
ases (PPTases), required for modifying NRPS and PKS proteins to
co
nvert their inactive apo forms into the enzymatically functional
holo forms
59
.
The LP on a transposon was first randomly integrated into the
chromosome of each strain and the integration site was determined
TL;DR: In this paper, the authors present a strategic blueprint to substantially improve our ability to discover and develop new antibiotics, and propose both short-term and long-term solutions to overcome the most urgent limitations in the various sectors of research and funding.
TL;DR: This review highlights recent advances in both bottom-up and top-down approaches to engineering non-model bacteria and microbiomes to promote beneficial plant-microbe interactions, as well as advances in strategies to evaluate these interactions.
TL;DR: The rapid accumulation of genomic information has revealed that the metabolic capacity of virtually all organisms is vastly underappreciated as discussed by the authors, and genome mining technologies are accelerating metabolite discovery in all organisms, including protists, plants and animals.
TL;DR: This review explores the many ecological and biotechnical facets of PGP research and attempts are currently underway to establish exclusive coupling of plant-bacteria interactions in the field, which will be crucial to optimise efficacy and establish biocontainment of engineered PGPR.
TL;DR: The asynchronous pipeline scheme provides other substantial advantages, including high flexibility, favorable processing speeds, choice of both all-in-memory and disk-bound processing, easy adaptation to different data formats, simpler software development and maintenance, and the ability to distribute processing tasks on multi-CPU computers and computer networks.
TL;DR: This review focuses on the architectures of bacterial chemical communication networks; how chemical information is integrated, processed, and transduced to control gene expression; how intra- and interspecies cell-cell communication is accomplished; and the intriguing possibility of prokaryote-eukaryote cross-communication.
TL;DR: A new generation of a popular open-source data processing toolbox, MZmine 2 is introduced, suitable for processing large batches of data and has been applied to both targeted and non-targeted metabolomic analyses.
Q1. What is the main reason for the evolution of BGCs?
It is generally believed that exchange of secondary metabolite biosynthetic gene clusters (BGCs) among closely related bacteria is an important driver of BGC evolution and diversification.
Q2. Why was the LB agar plate used to screen for BGCs?
Because successful integration of a BGC makes the recipient strains also sensitive to kanamycin, the authors counter-selected using an LB agar plate containing the appropriate concentration of kanamycin to screen for successful BGC integrants.
Q3. What are the abundant metabolites from BGC4 and BGC9?
Because Glidobactin A (2) and GameXPeptide A (11) are the most abundantly produced metabolites from BGC4 and BGC9, respectively, 2 and 11 were chosen as standards for quantification.
Q4. How many plasmids were cloned into S. cerevisi?
Approximately 50 ng of the plasmid DNA were mixed with the rest of the BGC fragments in 1 to 2 molar ratios and transformed into S. cerevisiae CEN.
Q5. How was the LB medium used to inoculate the recipient strains?
All recipient bacteria were inoculated in LB medium containing 10 μg ml−1 kanamycin and were grown at 28 °C in the incubation shaker at 200 r.p.m. until the late log phase.
Q6. What was the purpose of the cloning of plu2670?
Given that the function of plu2670 was recently characterized as kolossin A–C synthase58 by promoter replacement in the native strains, the authors decided not to pursue cloning of this BGC further.
Q7. What can be done to identify the unique features associated with each BGC?
The multi-chassis approach can further provide another criterion to filter out false positives and facilitate the untargeted analysis to identify unique features associated with each BGC.