Transcriptional regulation by nuclease-deficient CRISPR/Cas is a popular and valuable tool for routine control of gene expression. CRISPR interference in bacteria can be reliably achieved with high efficiencies. Yet, options for CRISPR activation (CRISPRa) remained limited in flexibility and activity because they relied on σ70 promoters. Here we report a eukaryote-like bacterial CRISPRa system based on σ54-dependent promoters, which supports long distance, and hence multi-input regulation with high dynamic ranges. Our CRISPRa device can activate σ54-dependent promoters with biotechnology relevance in non-model bacteria. It also supports orthogonal gene regulation on multiple levels. Combining our CRISPRa with dxCas9 further expands flexibility in DNA targeting, and boosts dynamic ranges into regimes that enable construction of cascaded CRISPRa circuits. Application-wise, we construct a reusable scanning platform for readily optimizing metabolic pathways without library reconstructions. This eukaryote-like CRISPRa system is therefore a powerful and versatile synthetic biology tool for diverse research and industrial applications.
In recent years, reprogrammable CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)-mediated genetic regulation has emerged as a powerful tool for synthetic biology research. Transcriptional regulations through CRISPR interference (CRISPRi) and activation (CRISPRa) were widely used to modulate gene expression1,2,3,4,5,6,7,8,9,10. Bacterial CRISPRi is a matured technology with plenty of examples but for bacterial CRISPRa, few have been demonstrated2,10. The established mechanisms have RNA polymerase-recruiting domains linked to a dCas9–sgRNA complex, and rely on direct interactions between the complex and a σ70 promoter. These designs suffered from low dynamic ranges of activation output and could not support multi-input activation because the operator must lie near the core promoter regions, and the σ70 promoter per se must be weak enough for activation to be observed10.
In contrast, σ54-dependent promoters were much overlooked potential targets for CRISPRa. A σ54 factor “locks” a σ54-dependent promoter in a stable closed complex that tightly blocks transcription initiation11,12. A σ54-activator binds to an upstream activating sequence (UAS) and performs long-distance regulation through a local DNA loop, and catalyzes ATP hydrolysis to unlock the complex for transcription initiation. This mechanism is similar to transcriptional activation involving the eukaryotic RNA polymerase II in several ways and is known as “eukaryote-like” regulation in bacteria13,14,15,16,17,18. Hence, σ54-activators are often termed “bacterial Enhancer Binding Proteins” (bEBPs)18,19.
Here, we construct and characterize a CRISPRa system based on a eukaryote-like activation mechanism in bacteria. This CRISPRa device shows strong activity, superior dynamic range (ON–OFF dynamic range > 1000 and > 70-fold between cognate and mismatch sgRNA) and remarkable tolerance to a wide range of UAS locations. The device can work in both Escherichia coli and non-model bacteria, and can activate multiple high value wild-type (WT) σ54-dependent promoters. Use of dxCas9 enables non-canonical PAM targeting and further improves the dynamic ranges of the system to practical application-required levels. Finally, we set up an efficient profile scanning platform for metabolic pathways, in which the sgRNA library and thus its encoded transcriptional activation profiles do not need to be rebuilt and can be applied to different pathways. Those profiles, when applied to multi-gene expression, show good stability and durability. This CRISPRa device not only enriches the genetic regulation toolbox, but also supports real-life application scenarios by its high performance.
Design of eukaryote-like CRISPR activation in bacteria
To build a eukaryote-like programmable activation system, our first step was to identify a functional σ54-activation domain. It has been proven that the bEBP PspF from E. coli has a highly modular structure—a truncated PspF remained as a functional activation domain in a three-hybrid system in vitro20,21. A similar strategy was adopted to bridge the sgRNA and the activator—an RNA-binding peptide λN22plus was fused to the PspF activation domain (PspFΔHTH) at the C-terminus to create PspFΔHTH::λN22plus. We designed the sgRNA scaffold to incorporate two BoxB RNA aptamers, which recruit λN22plus peptide fused proteins, at the Tetra-loop and the stem loop 2 (Fig. 1b)22,23. To optimize sgRNA scaffold, we also mutated the U-A pair to G-C at “+5”23. We then set up a test platform using the σ54-dependent PpspA promoter. In total, 3 bp downstream of the native UAS were mutated to an NGG PAM, which could be targeted by a dCas9–sgRNA complex. A sfGFP reporter was placed downstream to monitor the transcriptional output (Fig. 1a). We verified that this promoter could still be activated by its cognate activator PspF in E. coli MC1061ΔpspF (Fig. 1e). These components, when supplemented by dCas9 and transformed into E. coli, successfully generated a transcriptional activation response similar to that of wild-type PspF, demonstrating the proof-of-concept of our CRISPRa system on σ54-dependent promoters (Fig. 1e). We then generated another UAS-gRNA set by replacing the UAS of PpspA by two copies of UAS (G6) from the PhrpL promoter and modified the sgRNA correspondingly. A combinatorial activation test showed that the σ54-dependent promoters could only be activated when the UAS sequence matched the sequence of the spacer region in the sgRNA (Fig. 1e), proving that our CRISPRa system critically relied on sgRNA-UAS specificity and had little cross-talk issues.
Given that many CRISPRa systems in eukaryotes were built by fusing the activation domain to dCas9 directly3,5, we asked if the same could be applied to the PspF activation domain. The HTH domain truncated PspF was fused to dCas9 and CRISPRa was attempted, but was unsuccessful (Supplementary Fig. 1c). This result was within expectation given that PspF must assemble as a hexamer to function, and dCas9 probably interfered with the assembly process as it was relatively bulky24,25.
To demonstrate the necessity of dCas9, sgRNA, PspFΔHTH::λN22plus (or simply, activator) and the σ54 factor (RpoN) in our CRISPRa system, we constructed reporter circuits with or without the dCas9 generator, the sgRNA generator and the activator generator circuits on three different plasmids. These plasmids were combinatorially mixed with empty vector controls and co-transformed into cells. A strong response could only be achieved when all three components were present, with a dynamic range of 78-fold between the cognate and the mismatched sgRNA (Fig. 1c). Therefore, our CRISPRa activity can be controlled and tuned on three dimensions. This CRISPRa device was non-functional in a ΔrpoN strain and thus is σ54-dependent (Supplementary Fig. 1e). Regarding the requirement of hexamer assembly of the activator, we further asked if only one UAS would suffice to stabilize hexamer assembly, by providing only two instead of four RNA aptamers that act as local docking sites for activators. Our results show that a single UAS remains functional in its activity (Supplementary Fig. 1a).
One of our goals was to create a CRISPRa system that does not depend on the host strain, so the CRISPRa that successfully worked in MC1061ΔpspF was transformed into and tested in a general cloning strain E. coli TOP10. The dynamic range was similar to that in MC1061ΔpspF but the maximum activity was halved (Supplementary Fig. 2). We reasoned that it might be due to potential background expression of the endogenous PspA protein controlled by PpspA promoter on the genome, which is known to allosterically inhibit PspF20,26. This hypothesis was indirectly supported by the observation that expression of PspFΔHTH::λN22plus (without sgRNA for CRISPRa) led to a higher activation efficiency on wild-type PpspA in TOP10 than in MC1061ΔpspF (Supplementary Fig. 1f). As the engineered PspFΔHTH::λN22plus has no DNA binding domain to target PpspA on the genome, it was likely that the endogenous PspA expression was activated by heterohexamers comprising PspFΔHTH::λN22plus and PspF and increased levels of PspA in turn impaired the efficiency of the activator on our engineered promoter in TOP10. It is worthwhile to mention that the lower output level in TOP10 may also be caused by other unknown mechanisms.
Nonetheless, our CRISPRa system still retained significant activity for general purposes. The rest of this study, except for those in Fig. 3d (see below), were done in the strain MC1061ΔpspF as it provided a higher maximum output so small changes in activities could be readily detected. In addition, to simplify our system for further optimizations, we put the activator under the control of a constitutive Anderson promoter BBa_J2310627, which provided the best output function compared to three other promoters of different strengths tested (Supplementary Fig. 1b).
Optimization of target sites and sgRNA based dynamic ranges
Successful activation of the σ54-dependent promoter critically depends on its local DNA loop structure, which establishes a two-dimensional spatial relationship between the UAS and the transcription start site. This means that the spatial direction of the engineered activator relative to the axis of the DNA helix may affect the efficiency of our CRISPRa system (Fig. 2a). To understand the relationship between UAS positions and the spatial directions of the engineered PspF hexamer, we designed a library of PpspA where the synthetic UAS was progressively shifted upstream from its original location (defined as “0”), and measured the resulting efficacies of CRISPRa. CRISPRa remained functional within a shift window of 40 bp. Most interestingly, the activities alternated between crests and troughs spaced at 5 bp apart. This oscillation exactly matches the fact that the DNA helix completes a turn every 10 bp (Fig. 2a), and strongly suggested a phase-dependency between the UAS and the core promoter. We thus sought further evidences and augmented the library to zoom into the 0–10 bp region corresponding to half a wave length, and indeed observed a sinusoidal waveform. This might provide insights on the underlying molecular mechanism of our CRISPRa system.
In our CRISPRa output characterization, the metric dynamic range could be defined in different ways depending on what constitutes the OFF state. Our CRISPRa system achieved > 1000-fold dynamic range for all component induced versus non-induced (Supplementary Fig. 3a), > 70-fold for cognate versus mismatch sgRNA (Fig. 1c), and > 30-fold in the scenario where dCas9 and sgRNA were treated as a single inducible unit (Fig. 1d). These values surpassed the highest records to date10. However, these values are not practically relevant when CRISPRa needs to be applied to multi-gene regulation and transcriptional cascades, where dCas9 and the activator should be constantly available at a fixed level and the sgRNA circuits need to be pre-installed. Under this scenario, dynamic ranges are strictly defined by circuit outputs between sgRNA induced and sgRNA uninduced states. Our results indicated that leaky sgRNA transcription owing to a leaky inducible promoter activation explained the significant basal activation observed in the inducible device (Fig. 1c, d). To optimize the CRISPRa system and improve its dynamic ranges (defined by induced versus non-induced sgRNA), we first investigated its input–output relationship, and discovered that low induction levels for sgRNA already led to saturated CRISPRa responses (Supplementary Fig. 4). Therefore, we reasoned that a decrease in the range of sgRNA input, either by reducing sgRNA induction efficiencies or by reducing sgRNA’s affinities to dCas9, would decrease the leakiness of CRISPRa, whereas excessive sgRNA would maintain its maximum output. To this end, we explored three separate strategies in optimizing sgRNA-dependent dynamic ranges. First, we inserted various known terminators as buffer sequence (named as “buffer terminators”) between Plux2 and sgRNA to interfere the function of the promoter and the output ranges of sgRNA transcription. Some of them reduced sgRNA leakiness and significantly improved the dynamic ranges to >10-fold (Fig. 2b). Second, we employed a wild-type U-A pair at the “+5” site and introduced no, one or two nucleotide mismatches into the original sgRNA scaffold, and hypothesized a reduction in the resulting dCas9–sgRNA affinity. sgRNA with two mismatching nucleotides could increase the dynamic range significantly to ~ 10-fold (Fig. 2c). Third, we moved the RNA aptamers onto different positions of the sgRNA scaffold, but discovered that our initial design was both necessary and sufficient (Fig. 2d). We further applied our first and second strategies to different spacers and the “mismatch in sgRNA strategy” was generally more versatile (Supplementary Fig. 5).
CRISPRa is functional on different σ54-dependent promoters
Our success in swapping the wild-type UAS in mutant PpspA with an orthogonal UAS suggested that our CRISPRa system could be repurposed to activate any σ54-dependent promoters, provided that a PAM could be found near the native UAS. This would be immensely useful for studying σ54-dependent promoters, because unlike PspF, many σ54-activators from bacteria are response regulators for environmental stimuli28. Thus, activation of their cognate promoters could be hardly achieved without changing the culture condition, which inevitably introduces additional factors in an experiment.
To demonstrate the function of our CRISPRa system on σ54-dependent promoters, we identified and cloned several promoters with research relevance: The wild-type PpspA promoter was involved in phage shock response. PhrpL from Pseudomonas syringae regulates Type III Secretion System-mediated pathogenicity in plants29, and promoters PnifH and PnifJ control a nitrogen fixation pathway in Klebsiella oxytoca that responds to nitrogen and oxygen availabilities30. For these promoters, we intended to maintain sequence integrity, so we opted for PAM sites near the wild-type UAS instead of inserting new ones. All promoters could be successfully activated by our CRISPRa system to drive sfGFP expression (Fig. 3a). We further asked if our CRISPRa system could activate endogenous σ54-dependent promoters on E. coli genome, rather than just episomes. We targeted pspA in MC1061ΔpspF for activation and employed RT-qPCR to measure the relative transcription activities from PpspA and PnorVW, and results showed that the former was activated in a specific manner (Supplementary Fig. 6).
We also demonstrated that the CRISPRa systems with engineered PspFΔHTH::λN22plus built for PnifH and PnifJ were functional when transferred to the original host species K. oxytoca (Fig. 3b), proving that our CRISPRa system could be heterologously expressed to activate σ54-dependent promoters in other species. This should be helpful for bypassing their natural genetic regulatory networks31.
It is worth mentioning that the relative activation efficiencies on wild-type σ54-dependent promoters by the CRISPRa system versus efficiencies mediated by their wild-type cognate activators has no fixed value even when the transcription levels of the activator genes were similar (Supplementary Fig. 7). This might be affected by the differences in targeting affinities between the CRISPRa complexes and their wild-type cognate activators and suboptimal UAS sequences owing to limitations in the availability of NGG PAM positions on the wild-type promoters.
As an example of utilizing this CRISPRa as a tool for σ54-dependent promoter related research, we employed our CRISPRa system to research potential modularity of the DNA components within a σ54-dependent promoter. σ54-dependent promoters are diverse in structures, with different UAS (in strength and numbers), loop regions, and core sequences32,33. Based on consensus sequences, we standardized, in length, three DNA loop regions and three core sequences from different σ54-dependent promoters. We then connected them combinatorially, with identical linker DNA in between, and placed them downstream of a synthetic UAS (Fig. 3c). Our results showed that except for the PflhDp2 core sequence, DNA loop regions and core sequences are modular and could be shuffled to yield functional hybrid promoters (Supplementary Fig. 8). In addition, the relative ranks of dynamic ranges conferred by one core sequence is consistent when paired with different loop regions. This is also true when the same loop region is pair with different core sequences. So the PhrpL loop region and the PpspA core sequence, which were consistently the strongest when paired with different elements, could combine to give a hybrid promoter with the highest dynamic range (Fig. 3c).
Alternative σ54-activator mining by protein engineering
To minimize the influence of the host strain on our CRISPRa system owing to potential PspFΔHTH::λN22plus/PspF heterohexamers formation, we searched for other σ54-activators, which might be orthogonal to the PspF-based activator. This however posed a challenge, because many bEBPs contain a sensory module domain at their N-termini, which must be removed to avoid interference from their regulatory pathways in the host28.
One such example is the bEBP NorR from E. coli, involved in NO sensing34,35. NorR with its sensory module deleted (NorRΔGAF) strongly activates its cognate promoter PnorV in absence of NO36,37. We thus replaced the HTH domain in NorRΔGAF by λN22plus as we did for PspF and tested its activation function, but no significant activity was observed. Previous researches indicate that for two bEBPs, HrpS, and HrpR, the HTH domain may be required for assembly into the HrpRS hexamer29. We conjectured that the same applied to NorR, and so generated two constructs with most of the NorRΔGAF fused directly to the λN22plus peptide, which successfully yielded activities in CRISPRa (Fig. 3d). Subsequently, we tested three other bEBPs using the same approach, and identified a new functional activator based on WtsA, which regulates expression of pathogenic genes in Erwinia stewartii38 (Fig. 3d, Supplementary Fig. 10). Unlike NorRΔGAF::λN22plus, WtsA::λN22plus did not impose significant metabolic burden onto E. coli (Supplementary Fig. 9).
Orthogonality based on gRNA and UAS
Orthogonality is a prerequisite for multiplex gene regulation39,40. Our CRISPRa comprises multiple components and thus offered opportunities to engineer orthogonality on different levels. We first capitalized on the specificity between UAS and gRNA (Fig. 1e) to create a library of orthogonal UAS-gRNA pairs. We designed a library of synthetic PpspA and generated unique UAS by sequence randomization. Based on the functional window for UAS locations (Fig. 2a), we designed two synthetic enhancers with PAM at −108 to −110 (LEB) and at −128 to −130 (LEA) respectively. The 10 sgRNA generators with the 5 CRISPRa circuits carrying various UAS were co-transformed and tested for activation. The results confirmed that the synthetic PpspA could only be activated by its cognate sgRNA (Fig. 4a). This high orthogonality will allow construction of multiplex and layered CRISPRa with little cross-talks. We then measured orthogonality between two RNA aptamer–adaptor pairs, namely, BoxB-λN22plus against MS2-MCP, in triggering CRISPRa using the same design. Again, significant orthogonality was observed between the aptamer–adaptor pairs (Fig. 4b, Supplementary Fig. 11).
The locations of UAS can provide an additional layer of orthogonality. PnifJ and PnifH from the Klebsiella genera are divergently transcribed and share a common intergenic region with three UAS for their activator NifA. UAS2 and UAS3 belong to PnifJ, and UASnifH belongs to PnifH41. We thus hypothesized that our CRISPRa complex would find new and similarly orthogonal UAS. Nine PAMs that spanned across the intergenic region of PnifJ and PnifH were tested in E. coli (Fig. 4c). Spacer H1 strongly activated PnifH but not PnifJ, and spacer H7 and H9 activated PnifJ but not PnifH. The location of these spacers matched well with the locations of their corresponding native UAS and proved that our eukaryote-like CRISPRa system could benefit from orthogonality owing to distinct UAS locations.
PAM expansion and dynamic range optimization by dxCas9
Although our CRISPRa system allowed us to target wild-type σ54-dependent promoters, it demanded the presence of an NGG PAM next to the UAS, which might not be always satisfied. A recent work reported xCas9 as a mutated version of spdCas9 with flexibilities on PAM choices42. We thus mutated our dCas9 to dxCas9 based on xCas93.7 and asked if it would allow our CRISPRa to target UAS with non-canonical PAM.
The dCas9 or dxCas9-mediated CRISPRa could activate PpspA with the canonical NGG PAM to similar degrees. Contrary to reported, however, in two cases the dxCas9 could not activate promoters with NNG PAM (Supplementary Fig. 12a, b). We then moved on to test NGT PAM, and CRISPRa using dxCas9 gave remarkable activation (Fig. 5a, Supplementary Fig. 12c). Our goal was to check if dxCas9 could expand available PAM sites on wild-type σ54-dependent promoters. Therefore, we treated two TGT sequences on the wild-type PpspA as PAM, and attempted CRISPRa on those two UAS using dCas9 or dxCas9. The result confirmed that our CRISPRa using TGT PAM and dxCas9 was functional (Fig. 5b), and hence CRISPRa with dxCas9 allowed more flexible targeting of UAS on different wild-type σ54-dependent promoters.
In the above experiments, surprisingly, CRISPRa through dxCas9 showed much lower sensitivities to sgRNA leakiness. Their sgRNA-dependent dynamic ranges were improved by an order of magnitude (Fig. 5c). We wondered if this was related to our engineered sgRNA, which contained BoxB aptamers and thus set up a CRISPRi experiment using the dxCas9. We observed a reduction in repression efficiencies mirroring that of activation (Supplementary Fig. 13a–c). It is noteworthy that dxCas9 also worked poorly with sgRNA that contained two mismatches in the scaffold (Supplementary Fig. 14). This suggested that dxCas9 has lower affinities towards its sgRNA or the target DNA and the change in performance probably did not concern our engineered sgRNA scaffold.
Serendipitously, the lowered activation background by dxCas9-based CRISPRa enabled complex circuit designs, which were not attainable by its dCas9 counterpart. This was showcased in a cascaded regulation. dxCas9 could be used to build a two-layer circuit, but when dCas9 was used, background sgRNA expression was sufficiently amplified by the second layer to yield a constant ON state (Fig. 5d). Furthermore, a positive feedback loop, which is self-amplifying in nature and has little tolerance toward input leakiness, could be built from CRISPRa through dxCas9.
This positive feedback loop was derived from the single-layer circuit, but with an additional activator generator driven by the same σ54 promoter as that of the reporter. Therefore, sgRNA induction should activate not only the reporter but also the extra activator, which in turn would enhance the CRISPRa system per se. As a result, the positive feedback loop using dxCas9 rendered the activation response more digital-like and increased sgRNA-dependent dynamic range from ~ 50 to 200-fold (Fig. 5e). The same magnitude of fold change was not achieved when dCas9 was used. dxCas9 could suppress the undesired basal activation due to sgRNA leakiness and enhance the dynamic range of the positive feedback loop (Supplementary Fig. 13d).
CRISPRa for multi-gene expression profile screening
CRISPR activation and repression is often considered as a useful tool to control metabolic pathway. Many have demonstrated their work through tuning production of violacein, a purple pigment with antitrypanosomal and antitumoral activities4,43,44. Yet, most focused on reproducing known effects of flux redirection in this pathway to give different terminal products, by tweaking expression levels of the five enzymes involved. In an industrial setting, pathways of interest are often inadequately characterized with little information for flux optimization. By designing a CRISPRa-based optimization strategy, we could exhaust a metabolic design space, instead of testing flux redirection by trial-and-error.
To demonstrate this, genes encoding the five enzymes (vioA—vioE) were individually placed downstream of our synthetic PpspA promoters with orthogonal UAS. A total of five corresponding sgRNAs were need for CRISPRa, and three out of the five sgRNAs were combinatorially connected to constitutive promoters of three different strengths, with the remaining two connected to a strong constitutive promoter. The sgRNA generators were then mixed and assembled together in a one-step Golden Gate reaction to yield a sgRNA generator library. The library was thus a collection of diverse transcription patterns of sgRNAs, which would project different sets of activation patterns on the five genes (Fig. 6a). The sgRNA generator library with other CRISPRa components and the PpspA driven violacein-producing genes were then co-transformed into E. coli.
We observed that our transformants from the library had colonies with different intensities of purple color, which directly reflected the individual differences in violacein production (Fig. 6b). We then classified colonies into three bins of violacein production, and for each bin, retrieved their individual sgRNA transcription profiles by sequencing the sgRNA generator cassettes from the five colonies (Fig. 6b). We ended up analyzing four profiles for each bin (see Online Methods). Among the three genes experiencing different activation patterns, the promoter strengths for sgRNA activating vioA correlated most with violacein production (Fig. 6d), suggesting its role as the limiting factor in the production pathway. It should be noted that the overall strength of the violacein pathway was heavily influence by the level of the dxCas9 and the activator (Supplementary Fig. 15). These two factors thus served as additional gain tuning knobs for optimization of the entire pathway.
Next, a new library was created by mixing sgRNA generators for CRISPRa or CRISPRi, and co-transformed to project more diverse profiles onto the pathway during the profile-scanning process. This addition of CRISPRi introduced possibilities of suppressing the basal expressions of the pathway, and so generated larger differences in violacein production under the same culture condition (Fig. 6c, Supplementary Fig. 16). The ability for CRISPRa and CRISPRi to co-regulate is an advantage of engineering activator onto sgRNA scaffold instead of dxCas9. In addition, the strength of CRISPRa can be tuned independently without affecting that of CRISPRi, by modulating the activator level like tuning the gain knob. This experiment showed potential for intermediate metabolites and pathway redirection scanning in future applications. It is worth noting that, our sgRNA generators already encode all designed transcriptional activation profiles, and therefore it is reusable and optimization of a new pathway would not involve any new library construction—the only step necessary would be the connection of different genes to PpspA with different UAS.
To further demonstrate the application value of this system, we design a “rainbow” device to test if the same multi-gRNA generator library is reusable on a different multi-gene circuit. The library with mixed sgRNA generators for CRISPRa or CRISPRi above was co-transformed with a circuit including three fluorescent protein genes (the rainbow circuit). The fluorescent protein genes were controlled by the same promoters used to drive vioA, vioD, vioC in Fig. 6. Therefore, each gene may be activated to different extents: (1) No activation (the sgRNA designed for CRISPRi on violacein genes was effectively a mismatch sgRNA), (2) moderate activation, and (3) strong activation (Fig. 7a). As expected, strains isolated from the resulting co-transformed library had a variety of fluorescent protein expression profiles, which would equate a rainbow color on a population level if the strains were pooled. Sequencing results from the strains revealed good correspondences between fluorescent protein expression profiles and the sgRNA generator units from the multi-gRNA generator (Fig. 7c).
One advantage of our trans-regulated multi-gene control system is its ability to scale the global expression level of all target genes proportionally by tuning the activator induction level—under different CRISPRa efficiencies, absolute expression levels of all fluorescent proteins were altered but the relative expression profiles remained unchanged (Fig. 7b, c).
We also investigated our system’s durability, which is important for its application in an industry setting. We serially transferred the culture every 6 hr by diluting the grown culture into fresh medium and measured fluorescence after 6 hr of growth. The result showed that all unique rainbow color profiles from individual strains were stable across the four transfers (Fig. 7d). In addition, there was no detectable metabolic burdens conferred by our CRISPRa-based expression profile scanning device, as evinced by the A600 data of this experiment (Supplementary Fig. 17).
In this work, we designed and constructed a CRISPR activation system based on a eukaryote-like enhancer in bacteria. Owing to the unique “unlock” mechanism of AAA + ATPase transcriptional activation domain on σ54, these promoters inherently possess large dynamic ranges that CRISPRa could harness. Therefore, the CRISPRa output dynamic range was over 1000-fold (dCas9/activator/sgRNA tuning), and was over 70-fold for targeting sgRNA versus non-target sgRNA, which exceeded previous prokaryotic CRISPRa devices. When employed for multi-channel gene regulation, our CRISPRa system utilizing dxCas9 achieved ~ 50-fold output increase when only sgRNA was varied (Fig. 5e), and permitted cascade of genes where no other CRISPRa to date has demonstrated.
σ54-dependent promoters play important roles in a variety of high-value physiological functions in bacteria, such as nitrogen assimilation and fixation41,45, pathogenicity29,46, host colonization47, motility46,47, biofilm formation47, environmental bioremediation48, and stress responses45. In principle, our CRISPRa system can activate any σ54-dependent promoters. Four typical, high value wild-type σ54-dependent promoters (PpspA/PhrpL/PnifH/PnifJ) from three species have been successfully activated in E. coli. PnifH and PnifJ are two key promoters controlling nitrogen fixation in K. oxytoca and usually require anoxic conditions to be activated31. Our CRISPRa system in K. oxytoca could activate these two promoters under aerobic condition, which proved that our device can bypass host regulation mechanisms on natural σ54-dependent regulation system in non-model bacteria. Such context-independent regulation control provides a powerful means to study σ54-mediated gene regulatory networks in situ, which has been difficult to achieve by traditional molecular biology methods.
Similar to activation mechanisms in eukaryotes, our CRISPRa involves long distance regulation and a flexible DNA loop structure. Therefore, unlike previous CRISPRa systems based on the σ70 factor, this CRISPRa tolerates a wide range of UAS locations (at least 40 bp). It not only accommodates two orthogonal synthetic UAS for input integration on a single promoter, but also provides large flexibility for PAM selection on wild-type σ54-dependent promoters. Introduction of dxCas9 into our system further enhanced this flexibility by permitting use of non-canonical PAM.
A CRISPRa-based metabolic pathway screening tool has been built to optimize a five-gene pathway of the antitumor chemical violacein in a single step. The scanning and sequencing results demonstrated that the outcomes of violacein production could be explained by transcriptional activation patterns projected onto the pathway. Most importantly, our tool is reusable—the same sgRNA generator library could assay different metabolic pathways if their enzymes are driven by our PpspA-derived promoters. A “rainbow” device base on above design was achieved, showing the excellent stability and durability of the multi-gene expression profile control. This showed that our design stores multi-gene expression profile diversities on a universally applicable library, which can be mass produced conveniently. Our tool should be of high value to commercial and industrial applications.
There may be some limitations of this system. Our engineered activator PspFΔHTH::λN22plus was compatible with the σ54-depedent promoters in both E. coli and K. oxytoca but this compatibility might be challenged in distantly-related micro-organisms. In such scenarios, it might be necessary to introduce the heterologous σ54 factor from E. coli or K. oxytoca, or test other σ54 activators we developed in this study (engineered PspF, NorR, WtsA). In addition, when activating a wild-type σ54 promoter, the available PAM sites may limit the maximum activation efficiency of CRISPRa. Nevertheless, whether the activation efficiency is sufficient depends on the specific engineering objectives, which likely vary from one application to another.
Our work also revealed several directions for further study: The fact that spacer H9 for PnifJ was functional in E. coli (Fig. 3a) but less functional in K. oxytoca (Fig. 3b) hinted at fundamental differences in promoter activation processes in the two species. They may involve differences in DNA bending angles, or other unknown factors, which warranted additional studies. In addition, WtsA from E. stewartii, being an orthogonal activator for CRISPRa in E. coli, might give higher activation efficiencies if engineered by directed evolution or other rational design. For dynamic range optimization, saturation mutagenesis on the core sequences of σ54-dependent promoters may improve their dynamic ranges and output levels. We noted that the expression ratio between PspF ΔHTH::λN22plus and dCas9 may affect the formation of activator hexamer through competition, and this ratio may be further optimized in the future. These further optimizations should realize the full potential of this tool for industrial biotechnology and synthetic biology research.
Strains and growth conditions
Experiments on activator engineering, heterologous activator scanning, and functional tests in different hosts. The experiments in Fig. 3d, Supplementary Figs. 1f and 2 were performed in the E. coli strain TOP10 as indicated. The experiments in Supplementary Fig. 1e were performed in the E. coli strain JW3169-1ΔrpoN730::kan from the Keio collection49. Experiments for PnifJ and PnifH activation in K. oxytoca was performed in the K. oxytoca strain M5a1. All other assays were performed in E. coli MC1061ΔpspF. E. coli MC1061ΔpspF was generated via P1 phage transduction using BW25113ΔpspF739::kan from the Keio collection as the donor strain49.
For growth of E. coli, cells were cultured in Lennox’s Lysogeny Broth (LB-Lennox) medium (10 g L−1 peptone (EMD Millipore), 5 g L−1 yeast extract (EMD Millipore), 5 g L−1 NaCl (Fisher Scientific)) with appropriate antibiotics. Antibiotics were applied to liquid cultures for assays at final concentrations of: 50 µg mL−1 ampicillin (Sigma-Aldrich), 25 µg mL−1 kanamycin (Sigma-Aldrich), and 12.5 µg mL−1 chloramphenicol (Sigma-Aldrich). For circuit construction and selection on agar plates, antibiotics were applied at final concentrations of: 100 µg mL−1 ampicillin, 50 µg mL−1 kanamycin, and 25 µg mL−1 chloramphenicol. For experiments on metabolic pathway profile scanning (Fig. 6, Supplementary Figs. 15 and 16), the agar plates contained 50 µg mL−1 ampicillin and 25 µg mL−1 kanamycin.
For circuit characterization in E. coli, a single colony of co-transformant carrying test circuits was picked from an agar plate and suspended in 30 µL of LB medium with appropriate antibiotics. In total, 5 µL of the cell suspension was then inoculated into 195 µL of liquid LB medium with antibiotics in a transparent and flat-bottom 96-well plate (CytoOne). To avoid abnormal growth at the edges of a plate, outermost wells were not used (filled by 200 µL of medium). The plates were cultured at 37 °C, 1000 rpm on a plate shaker (MB100-4A) for 18 hr overnight. The next day, 2 µL of overnight cultures were added into 198 µL of LB medium with appropriate antibiotics and inducers (N-(3-Oxohexanoyl)-L-homoserine lactone, AHL (Sigma-Aldrich), aTc (CAYMAN Chemical), rhamnose (Alfa Aesar), arabinose (Acros Organics)) on a flat and clear bottom 96-well plate with black walls (Greiner Bio-one). The dilution factor was 100-fold. The cultures were then grown for 6 hr at 37 °C, 1000 rpm and then fixed for flow cytometry analysis. All inducers for liquid culture in this research were prepared as 40 × stock solutions. Inducers were diluted to 1 × final concentrations by adding 5 µL of stock solutions into a final volume of 200 µL of assay volume. There was an exception in metabolic pathway profile scanning (Fig. 6): rhamnose was prepared as a 200 × solution and 200 µL of the stock solution was added to 40 mL of molten LB agar to reach 1 × final concentration. The aTc was prepared as a 160 × solution and 250 µL of the stock solution was added to 40 mL of molten LB agar to reach 1 × final concentration. All experiments were done in triplicates. Three colonies were picked and cultured separately on different 96-well plates for induction assays.
K. oxytoca M5a1 carries natural ampicillin resistance. 100 µg mL−1 ampicillin was always added to the agar plate during the co-transformation step, and 50 µg mL−1 ampicillin was used for overnight culture. For plasmid maintenance, 50 µg mL−1 kanamycin and 25 µg mL−1 chloramphenicol were used during the final culturing step (6.5 hr) before fixation (without ampicillin).
All reagents (i.e., antibiotics and inducers) were purchased in powder and were prepared by dissolving in double distilled water or ethanol followed by filtration through 0.22 µM syringe filters (Millipore).
Plasmid circuit construction
Standard molecular biology protocols were used in this study for circuit constructions. All plasmids and their maps and sequences have been listed in Supplementary Data 1. The typical composition of the plasmids used in this study is illustrated in Supplementary Fig. 18. Key primers used in this study are listed in Supplementary Data 1. Different dCas9 generators, activator generators and reporter circuits were combined on the BioBrick vector pSB4A350 for assays in E. coli. All sgRNA generators were hosted on a separate and compatible vector p15AC. Variants of each component were chosen according to the specific needs and details of a particular experiment. Qiaspin Miniprep Kit (Qiagen) and Wizard SV Gel and PCR Clean-Up System (Promega) were used in this study for DNA purification.
The dCas9 generator circuit (Supplementary Data 1) was constructed from pdCas9-bacteria (pdCas9-bacteria was a gift from Stanley Qi, Addgene plasmid # 442491). Two synonymous mutations at R63 (CGT > CGC) and R447 (CGA > CGT) were introduced into the dCas9 coding DNA sequence (CDS). One synonymous mutation at S2 (TCT > TCA) was introduced into the TetR CDS, and a base substitution (+29 site of PtetA, C > G) was introduced between the PtetA transcription start site and the ribosome-binding site (RBS) for dcas9. The terminator for the dcas9 operon was changed to a synthetic transcriptional terminator L3S3P00, and another synthetic terminator L3S3P22 was used for termination of tetR. The BioBrick RFC10 prefix and suffix flanked the dcas9 and tetR operons so the entire cluster conformed to BioBrick standards. The dxCas9 generator has the same structure with the dxCas9 CDS mutated by overlap PCR (ProFlex PCR system) (primers provided in Supplementary Data 1). Non-synonymous mutations encoding E1219V, M694I, E543D, E480K, S409I, R324L, and A262T from xCas9 version 3.742 were introduced into the dCas9 CDS.
The CDS of PspFΔHTH (1-296 of PspF) was amplified from the genome of E. coli TOP10. Through PCR cloning, a BioBrick RBS B0032 was added upstream and the CDS of λN22plus peptide (MNARTRRRERRAEKQAQWKAAN), along with a synthetic terminator L3S3P22 were added downstream. The resulting unit was assembled with either a PrhaB promoter or an Anderson promoter through BioBrick RFC10 assembly51. Structures and sequences of the Anderson promoters and the standardized PrhaB are available in Supplementary Data 1. The three versions of NorR CDS were amplified from the genome of E. coli TOP10 by PCR (primers provided in Supplementary Data 1). The RBS B0032, the λN22plus CDS and the synthetic terminator L3S3P22 were added by PCR cloning. The NifA2 CDS was amplified from the genome of K. oxytoca by PCR (primers provided in Supplementary Data 1), RBS B0032 and terminator L3S3P22 were added by the primer directly. WtsA::GGGS::λN22plus and ZraR::GGGS::λN22plus, both driven by the promoter BBa_J2310652 and the RBS B0032, terminated by the L3S3P22 terminator, were synthesized by GeneArt with codons optimized for E. coli. The engineered HrpRS generator was built from a CRISPRi circuit used in Supplementary Fig. 13. The λN22plus CDS, the RBS B0032, and the terminator L3S3P22 were added or modified by PCR cloning. PspFΔHTH::MCP was constructed by Gibson Assembly, in which the MCP domain was amplified from plasmid pHAGE-EFS-MCP-3XBFPnls which was a gift from Thoru Pederson (Addgene plasmid # 7538423). A synonymous mutation N86 (AAT > AAC) was introduced into the CDS of the MCP domain.
The reporter circuits have different promoters. They were all connected to an identical downstream circuit that contains an insulator RiboJ, the RBS B0030, a sfGFP CDS with the SsrA degradation tag (ASV). The wild-type PpspA promoter was synthesized by annealing of oligonucleotides (j5 protocol53). The synthetic PpspA with different UAS were constructed by PCR-based modifications on the UAS regions. The wild-type PhrpL was from our previous work54 and the mutated PhrpL with a modified DNA loop, tested in the PspFΔHTH::dCas9 and the CRISPRi assay, was constructed by reverse PCR. Promoters PnifH and PnifJ were amplified from the genome of K. oxytoca M5a1 by PCR (primers in Supplementary Data 1). All buffer terminators and hybrid promoters were constructed by annealing oligonucleotides or PCR.
The CRISPRi circuit includes a dCas9 generator, a HrpRS generator (RA32T in Supplementary Data 1) and a reporter circuit. RBS B0030 and B0032 drive the translation of hrpR and hrpS, respectively, in a bicistronic operon that is terminated by the terminator B0015. A mutated PhrpL was used in the reporter circuit. Its core region was mutated (−2, T > A; −5, G > A; −14, C > T; −21, T > C) to improve its maximum output. Furthermore, the − 65 & − 66 sites (AA > CC) and the − 90 & − 91 sites (TA > CC) of this promoter were mutated to create NGG PAM within the DNA loop region. The 16 bp fragment between the − 24 box and the IHF region (− 42 to − 63), and the 19 bp fragment from − 70 to − 88 were replaced by random sequences (Supplementary Data 1) for orthogonal CRISPRi targeting.
For constructions of sgRNA generators, the vector p15AC was amplified from pdCas9-bacteria and was flanked by two BbsI sites, which would create sticky ends for EcoRI and SpeI, respectively. The sgRNA scaffolds with BoxB or MS2 aptamers were synthesized by annealing oligonucleotides. Each generator contains a Plux2 or PBAD promoter and a L3S2P21 terminator. Whenever necessary, the spacer sequence was modified by PCR cloning. All buffer terminators were inserted 5’ of sgRNA spacer by PCR cloning. For modularity test of buffer terminators, the B0062-R terminator was synthesized by annealing oligonucleotides and assembled with the insulator RiboJ, which was then inserted into sgRNA generator. Modifications of the sgRNA scaffold, including introduction of mismatches, were performed by PCR cloning.
For heterologous expression of CRISPRa in K. oxytoca M5a1, the activator generator was driven by the promoter BBa_J23106, as PrhaB is non-functional in K. oxytoca. Circuits on pSB4A3 in E. coli were migrated into pSB4K555 owing to the endogenous ampicillin resistance of K. oxytoca M5a1. No vector change was necessary for sgRNA generators.
All constructs in this study were sequenced by Sanger sequencing service provided by Source Bioscience or MRC PPU DNA sequencing and services.
Metabolic pathway profile scanning
Libraries of tandem sgRNA generators for transcriptional activation profile scanning were constructed through the following steps: First, basic sgRNA generator units with three components were built, each includes an upstream terminator, an Anderson promoter and the sgRNA sequence. The upstream terminator, with BbsI restriction sites that would create unique overhangs (Supplementary Data 1), and Anderson promoters with different strength were made by annealing synthesized oligonucleotides. They were then ligated upstream to an amplified and digested sgRNA sequence on the vector pSB1K356 (sgRNA fragments were digested by XbaI and PstI). Second, sgRNA generator units that contain an identical spacer but different promoters (BBa_J23100 /BBa_J23108 /BBa_J23114)27 were mixed in equimolar ratios to give one set of sgRNA generator units. Each set corresponds to a single target gene in which expression would be tuned. Five sets of sgRNA generator units were then mixed and assembled in a one-step Golden Gate Assembly system into pSB4A3. It should be noted in Fig. 6, two out of the five sets had no variation in promoter choice and only used the promoter BBa_J23100. Third, the entire Golden Gate Assembly reaction was transformed into E. coli TOP10 and cultured on an agar plate with ampicillin for over 20 hr. All colonies on plate (with > 250 colonies, which should cover 27 possible combinations) were scraped and washed down from the plate and subjected to plasmid extraction, which yielded a crude library. Fourth, the crude library was cleaned up by size selection—plasmids were digested by EcoRI and PstI, and then resolved on an agarose gel by gel electrophoresis. Digested fragments, with sizes that correspond to the desired assembly product, were excised and self-ligated, which was then transformed into E. coli TOP10 again. The final library was obtained by scrapping all transformants followed by plasmid extraction. The positive control sgRNA generator circuit was made by following the same steps with the five kinds of sgRNA generator units that only used the promoter BBa_J23100.
The above steps were repeated in generating the CRISPRa + i profile scanning library with the following exception: When building a sgRNA generator unit set, sgRNA generators that contain standard sgRNA scaffold plus spacers that target CDS of vioA/vioD/vioC were added to the mixture of CRISPRa sgRNA generators.
To construct the violacein production pathway for optimization on metabolic engineering, the CDS of vioA, vioB, vioE, vioD were amplified from pΔ_lvl2_vioABDE57 and the CDS of vioC gene was amplified from pKMV-VioC (Addgene plasmid #6534958). The RBS B0030 and five different terminators were added to each gene through PCR cloning (L3S2P21 for vioA, L3S2P11 for vioB, L3S3P22 for vioE, L3S3P21 for vioD, and L3S2P55 for vioC). These translational units for VioA/B/D/E/C were placed downstream of PpspA-LEA1B1, PpspA-LEA2B2, PpspA-LEA3B3, PpspA-LEA4B4, and PpspA-LEA5B5, respectively. They were then assembled with a dxCas9 generator and the PspFΔHTH::λN22 generator on the vector pSEVA221 (GenBank: JX560327).
Circuits including CRISPRa device, the sgRNA library and the violacein production pathway were co-transformed into E. coli MC1061ΔpspF, which were then spread on LB-Lennox agar plates with ampicillin, kanamycin, aTc and rhamnose. Three kinds of plates with different inducer concentrations were made and used for culturing co-transformants: (1) 5 µM rhamnose + 0.63 ng mL−1 aTc, (2) 1 µM rhamnose + 0.63 ng mL−1 aTc, and (3) 0 µM rhamnose + 0.63 ng mL−1 aTc. All plates were incubated at 37 °C overnight. Pictures of the agar plates with colonies were then taken by a digital SONY NEX-5R camera under normal lighting conditions in a laboratory after incubation durations of 14, 16, 18, and 20 hr. For post-acquisition processing, brightness and contrast of photos were adjusted in Photoshop CS3. Each photo was then scaled using a five pence coin that was taken in the same shot so that colony sizes across different plates would be comparable. All scaled photos were cropped to areas of interest. Colonies to be picked and sequenced were selected from an area of interest at 16 (CRISPRa) or 20 (CRISPRa + i) hr post-incubation and their positions were recorded. They were then picked and inoculated after the last photograph time point has passed.
For the expression profile test on the rainbow device, the above CRISPRa + i library was co-transformed with the rainbow circuit with the same steps. The agar plate did not include inducers. Colonies were picked and suspended in 30 µL of LB medium. For each strain, 5 µL of cell suspension was added into 195 µL LB medium with ampicillin and kanamycin for overnight (18 hr) liquid culture in a 96-well plate, which then was diluted 100-fold into fresh medium with antibiotics and inducers and grown under 37 °C, 1000 rpm for 6 hr for data collection. Part of the cultured strains (LY1–LY9) were serially transferred every 6 hr (2 µL of grown culture into 198 µL of fresh medium) with antibiotics and inducers up to 24 hr. The plate was read every 6 hr after each transfer by a plate reader (BMG FLUOstar fluorometry).
Gene expression assays
For all the experiments except rainbow device, only one standardized reporter sfGFP was used to characterize promoter outputs in this work. All fluorescence data (except the rainbow device and the experiment of Supplementary Fig. 1f) was measured in an Attune NxT Flow Cytometer (equipped with Attune NxT Autosampler). For fixation, 2 µL of cultured bacteria was added to 198 µL 1 × phosphate-buffered saline (K813-500ML, VWR) in a 96-well U-bottom plate (Thermo Fisher Scientific) with 1 mg mL−1 kanamycin and the samples were stored in 4 °C for at least 1 hr before data acquisition.
Fixed samples were read by the Flow Cytometer, equipped with 488 nm excitation laser and 530/30 nm emission filter for green fluorescence measurements. For each sample reading, the cytometer was instructed to wait for 4 s before recording single cell fluorescence, and at least 10,000 gated events were recorded using a flow rate of 25 µL min−1. Flow cytometry data were analyzed by FlowJo (v7.6.1) and appropriate forward and side scatter gates were applied to all samples for each experiment. The geometric means of fluorescence were taken as population fluorescence. For the UAS shift test in Fig. 2a, median values were taken as population fluorescence to avoid impact of extreme values. For rainbow device, we used a plate reader (BMG FLUOstar fluorometry) to collect fluorescence and A600 data, and calculate fluorescence/A600.
For the measurement of expression profiles of the rainbow strains and the experiment of Supplementary Fig. 1f, data were acquired from a plate reader equipped with a 405–10 nm excitation laser and a 460 nm emission filter for blue fluorescence measurements (Gain 1372), a 485 nm excitation laser and a 520 nm emission filter for green fluorescence measurements (Gain 1263 for the rainbow strains; Gain 700 for the experiment of Supplementary Fig. 1f), and a 584 nm excitation laser and a 620–10 nm emission filter for red fluorescence measurements (Gain 2240).
Absorbance (A 600) assays
Cell densities, measured by absorbance (A600), were used as a proxy of cellular burden. A600 values were measured using a plate reader (BMG FLUOstar fluorometry). To set up blanks for measurements, 200 µL of liquid LB medium was added to a blank well without inoculation, in which the volumes to be occupied by inducers were filled with water. This step would be done at the same time when cells were diluted to give assay cultures. Therefore, effects of evaporation and inducer dilution should apply equally to both the blanks and the cell cultures. The A600 values were processed and analyzed in Microsoft Excel 2013.
The strains were cultured in 200 µL Lennox’s LB medium with appropriate antibiotics under 37 °C 1000 rpm overnight (16–18 hr). They were then diluted 100-fold into 1 mL fresh medium with appropriate antibiotics and inducers in a 96-well deep-well plate (Starlab). The diluted culture was incubated under 37 °C 1000 rpm for 6 hr. Then the total RNA was extracted by the Aurum™ Total RNA Mini Kit (Bio-Rad). The RQ1 RNase-Free DNase (Promega) was employed to digest the residual genomic DNA, and the iScript™ Reverse Transcription Supermix (Bio-Rad) was used to generate cDNA in 10 µL reaction systems through reverse transcription. The iTaq™ Universal SYBR® Green Supermix was used for qPCR in 10 µL reaction systems with 60 °C as the annealing temperature, and 30 s were set as the extension time. The housekeeping gene rho was used as the reference gene. All the genes in RT-qPCR experiments were isolated from the genome of E. coli and were cloned on a high copy-number plasmid pSB1K3 as the standard sample for standard curve generation. The qPCR reactions were run on a LightCycler 480 qPCR system (Roche). The relative expression levels of target genes were calculated using the LightCycler 480 software and Excel 2013 by the Pfaffl method (normalized by the reference gene rho). The primers for gene isolation and qPCR are provided in Supplementary Data 1.
Calculation of fluorescence intensities
The RT-qPCR experiment includes 6 biological repeats. All attempts at replication were successful. All geometric mean of fluorescence of a population were corrected by background fluorescence of cells with suitable empty plasmids grown in identical conditions but without inducers. Corrected fluorescence values were then averaged in GraphPad Prism 7.05 (except rainbow device) to give the medians or means with standard deviations (SD), which were used for plotting graphs. There were two exceptions: the data for heat maps (Fig. 4a, b) and the rainbow device (Fig. 7): For heat maps, means were calculated in Microsoft Excel 2013 and then imported into GraphPad Prism for plotting the heat maps. Two-tailed t tests were performed using GraphPad Prism 7.05. For the rainbow device and the experiment of Supplementary Fig. 1f, the mean of fluorescence values and OD values were corrected by a blank negative control (with corresponding volumes of medium and with water replacing inducers). The fluorescence/A600 for each well was then calculated. The fluorescence/A600 value was corrected by that of a fluorescence-free strain. All the calculation was done in Microsoft Excel 2013. Error calculation follows the propagation of error.
Calculation of cell densities (absorbance A 600)
All A600 values were corrected by blank absorbance of liquid LB medium without cells. Corrected A600 values were then averaged in GraphPad Prism 7.05 to give means and standard deviations (SD), which were used for plotting graphs.
Calculation of dynamic ranges
For all dynamic range calculations, the means and standard deviations (SD) of geometric mean fluorescence from all samples were first calculated by the built-in formulae AVERAGE and STDEV in Microsoft Excel 2013. The calculated mean was then corrected by the mean of background fluorescence of cells. Error propagation was calculated by the following formula:
SDcor, standard deviation of corrected mean; SD (x), SD of the three geometric mean fluorescence values from particular sample x; SD (nc), SD of that from negative control sample.
The dynamic range DR was calculated by the corrected mean value of fluorescence:
Fluo (on), the corrected means of three geometric mean fluorescence values from samples under on state; Fluo (off), the corrected means of three geometric mean fluorescence values from samples under off state; DR, dynamic range.
The standard deviation of DR (SD (DR)) was calculated by the uncertainty propagation formula:
The final mean (DR) and standard deviation (SD (DR)) values were imported into GraphPad Prism 7.05 for plotting graphs.
Sequencing analysis of metabolic activation profile scanning
For CRISPRa-based metabolic activation profile scanning, and for each bin of purple color intensity, five colonies were identified and marked after 16 hr of growth on agar plates. The same was applied to CRISPRa+i based profile scanning, but colonies were identified and marked after 20 hr of growth. For each scan, those previously marked colonies were picked and each was inoculated into 10 mL of LB medium with 100 µg mL−1 ampicillin in a 30 mL universal container (Starlab). After overnight culture at 37 °C, 160 rpm, plasmids were extracted from each culture and were sequenced by Sanger sequencing (Source BioScience). The sequencing results were aligned to known sequences of sgRNA generators by the software SnapGene 3.2.1 to recover the identities of the promoters of the sgRNA generator, or the identity of the CRISPRi sgRNA. Results that failed to give proper reads or indicated mixed clones were excluded from downstream analysis. For fair comparison, each bin used the same number of colonies for analysis.
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
The previously constructed plasmids pdCas9-bacteria (Addgene #44249), pHAGE-EFS-MCP-3XBFPnls (Addgene #75384), and pKMV-VioC (Addgene #65349) used in this study are available from Addgene. Key plasmids constructed in this study are available from Addgene at https://www.addgene.org/Baojun_Wang/. All other data and materials from this work are available from the corresponding author upon reasonable request. The source data underlying Figs. 1c–e, 2a–d, 3a–d, 4a–c, 5a–e, 6b, c, and 7a–d, and Supplementary Figs 1a–f, 2a, b, 3a, b, 4a, b, 5a–f, 6a, b, 7a–c, 8a, b, 9a, b, 10a, b, 11a, b, 12a–c, 13a–d, 14–17 are provided as a Source Data file.
Qi, L. S. et al. Repurposing CRISPR as an RNA-guided platform for sequence-specific control of gene expression. Cell 152, 1173–1183 (2013).
Bikard, D. et al. Programmable repression and activation of bacterial gene expression using an engineered CRISPR-Cas system. Nucleic Acids Res. 41, 7429–7437 (2013).
Gilbert, L. A. et al. CRISPR-mediated modular RNA-guided regulation of transcription in eukaryotes. Cell 154, 442–451 (2013).
Zalatan, J. G. et al. Engineering complex synthetic transcriptional programs with CRISPR RNA scaffolds. Cell 160, 339–350 (2015).
Chavez, A. et al. Highly efficient Cas9-mediated transcriptional programming. Nat. Methods 12, 326–328 (2015).
Tak, Y. E. et al. Inducible and multiplex gene regulation using CRISPR–Cpf1-based transcription factors. Nat. Methods 14, 1163–1166 (2017).
Zhang, X. et al. Multiplex gene regulation by CRISPR-ddCpf1. Cell Discov. 3, 17018 (2017).
Yeo, N. C. et al. An enhanced CRISPR repressor for targeted mammalian gene regulation. Nat. Methods 15, 611–616 (2018).
McCutcheon, S. R., Chiu, K. L., Lewis, D. D. & Tan, C. CRISPR-Cas expands dynamic range of gene expression from T7RNAP promoters. Biotechnol. J. 13, 1700167 (2018).
Dong, C., Fontana, J., Patel, A., Carothers, J. M. & Zalatan, J. G. Synthetic CRISPR-Cas gene activators for transcriptional reprogramming in bacteria. Nat. Commun. 9, 2489 (2018).
Rappas, M., Bose, D. & Zhang, X. D. Bacterial enhancer-binding proteins: unlocking σ54-dependent gene transcription. Curr. Opin. Struct. Biol. 17, 110–116 (2007).
Yang, Y. et al. Structures of the RNA polymerase-σ54 reveal new and conserved regulatory strategies. Science 349, 882–885 (2015).
Sassedwight, S. & Gralla, J. D. Role of eukaryotic-type functional domains found in the prokaryotic enhancer receptor factor σ54. Cell 62, 945–954 (1990).
Cases, I., Ussery, D. W. & De Lorenzo, V. The σ54 regulon (sigmulon) of Pseudomonas putida. Environ. Microbiol. 5, 1281–1293 (2003).
Kroos, L. Eukaryotic-like signaling and gene regulation in a prokaryote that undergoes multicellular development. Proc. Natl. Acad. Sci. USA 102, 2681–2682 (2005).
Wigneshweraraj, S. et al. Modus operandi of the bacterial RNA polymerase containing the σ54 promoter-specificity factor. Mol. Microbiol. 68, 538–546 (2008).
Jones, S. Sigma 54 minds the gap. Nat. Rev. Microbiol. 7, 3–3 (2009).
Joly, N., Zhang, N., Buck, M., Zhang, X. D. Coupling AAA protein function to regulated gene expression. BBA Mol. Cell Res. 1823, 108–116 (2012).
Zhang, N., Darbari, V. C., Glyde, R., Zhang, X. D. & Buck, M. The bacterial enhancer-dependent RNA polymerase. Biochem. J. 473, 3741–3753 (2016).
Elderkin, S., Jones, S., Schumacher, J., Studholme, D. & Buck, M. Mechanism of action of the Escherichia coli phage shock protein PspA in repression of the AAA family transcription factor PspF. J. Mol. Biol. 320, 23–37 (2002).
Zhou, Y. et al. Engineering bacterial transcription regulation to create a synthetic in vitro two-hybrid system for protein interaction assays. J. Am. Chem. Soc. 136, 14031–14038 (2014).
Austin, R. J., Xia, T. B., Ren, J. S., Takahashi, T. T. & Roberts, R. W. Designed arginine-rich RNA-binding peptides with picomolar affinity. J. Am. Chem. Soc. 124, 10966–10967 (2002).
Ma, H. H. et al. Multiplexed labeling of genomic loci with dCas9 and engineered sgRNAs using CRISPRainbow. Nat. Biotechnol. 34, 528–530 (2016).
Schumacher, J., Zhang, X., Jones, S., Bordes, P. & Buck, M. ATP-dependent transcriptional activation by bacterial PspF AAA+protein. J. Mol. Biol. 338, 863–875 (2004).
Joly, N., Zhang, N. & Buck, M. ATPase site architecture is required for self-assembly and remodeling activity of a hexameric AAA+ transcriptional activator. Mol. Cell 47, 484–490 (2012).
Dworkin, J., Jovanovic, G. & Model, P. The PspA protein of Escherichia coli is a negative regulator of σ54-dependent transcription. J. Bacteriol. 182, 311–319 (2000).
Anderson promoter collection. Available: http://parts.igem.org/Promoters/Catalog/Anderson
Studholme, D. J. & Dixon, R. Domain architectures of σ54-dependent transcriptional activators. J. Bacteriol. 185, 1757–1767 (2003).
Jovanovic, M. et al. Regulation of the co-evolved HrpR and HrpS AAA+ proteins required for Pseudomonas syringae pathogenicity. Nat. Commun. 2, 177 (2011).
Wang, X. et al. Using synthetic biology to distinguish and overcome regulatory and functional barriers related to nitrogen fixation. PLoS ONE 8, e68677 (2013).
Reyes Ramirez, F., Little, R. & Dixon, R. Mutant forms of the Azotobacter vinelandii transcriptional activator NifA resistant to inhibition by the NifL regulatory protein. J. Bacteriol. 184, 6777–6785 (2002).
Dworkin, J., Ninfa, A. J. & Model, P. A protein-induced DNA bend increases the specificity of a prokaryotic enhancer-binding protein. Gene Dev. 12, 894–900 (1998).
Huo, Y. X. et al. IHF-binding sites inhibit DNA loop formation and transcription initiation. Nucleic Acids Res. 37, 3878–3886 (2009).
Tucker, N. P., D’Autréaux, B., Studholme, D. J., Spiro, S. & Dixon, R. DNA binding activity of the Escherichia coli nitric oxide sensor NorR suggests a conserved target sequence in diverse proteobacteria. J. Bacteriol. 186, 6656–6660 (2004).
D’Autréaux, B., Tucker, N. P., Dixon, R. & Spiro, S. A non-haem iron centre in the transcription factor NorR senses nitric oxide. Nature 437, 769–772 (2005).
Tucker, N. P., Ghosh, T., Bush, M., Zhang, X. D. & Dixon, R. Essential roles of three enhancer sites in σ54-dependent transcription by the nitric oxide sensing regulatory protein NorR. Nucleic Acids Res. 38, 1182–1194 (2010).
Bush, M. et al. The structural basis for enhancer-dependent assembly and activation of the AAA transcriptional activator NorR. Mol. Microbiol. 95, 17–30 (2015).
Frederick, R. D., Majerczak, D. R. & Coplin, D. L. Erwinia stewartii Wtsa, a positive regulator of pathogenicity gene expression, is similar to Pseudomonas syringae pv. phaseolicola HrpS. Mol. Microbiol. 9, 477–485 (1993).
Wang, B., Kitney, R. I., Joly, N. & Buck, M. Engineering modular and orthogonal genetic logic gates for robust digital-like synthetic biology. Nat. Commun. 2, 508 (2011).
Brödel, A. K., Jaramillo, A. & Isalan, M. Engineering orthogonal dual transcription factors for multi-input synthetic promoters. Nat. Commun. 7, 13858 (2016).
Charlton, W., Cannon, W. & Buck, M. The Klebsiella pneumoniae nifJ promoter: analysis of promoter elements regulating activation by the Nif A promoter. Mol. Microbiol. 7, 1007–1021 (1993).
Hu, J. H. et al. Evolved Cas9 variants with broad PAM compatibility and high DNA specificity. Nature 556, 57–63 (2018).
Zhang, X. & Enomoto, K. Characterization of a gene cluster and its putative promoter region for violacein biosynthesis in Pseudoalteromonas sp. 520P1. Appl. Microbiol. Biotech. 90, 1963–1971 (2011).
Cress, B. F. et al. Rapid generation of CRISPR/dCas9-regulated, orthogonally repressible hybrid T7-lac promoters for modular, tuneable control of metabolic pathway fluxes in Escherichia coli. Nucleic Acids Res. 44, 4472–4485 (2016).
Reitzer, L. & Schneider, B. L. Metabolic context and possible physiological themes of σ54-dependent genes in Escherichia coli. Microbiol. Mol. Biol. Rev. 65, 422–444 (2001).
Fisher, M. A. et al. Borrelia burgdorferi σ54 is required for mammalian infection and vector transmission but not for tick colonization. Proc. Natl. Acad. Sci. USA 102, 5162–5167 (2005).
Wolfe, A. J., Millikan, D. S., Campbell, J. M. & Visick, K. L. Vibrio fischeri σ54 controls motility, biofilm formation, luminescence, and colonization. Appl. Environ. Microb. 70, 2520–2524 (2004).
Leang, C. et al. Genome-wide analysis of the RpoN regulon in Geobacter sulfurreducens. BMC Genomics 10, 331 (2009).
Baba, T. et al. Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection. Mol. Syst. Biol. 2, 2006.0008 (2006).
Registry of standard biological parts—Part:pSB4A3. Available: http://parts.igem.org/Part:pSB4A3.
Knight, T. Idempotent vector design for standard assembly of Biobricks. MIT Synthetic Biology Working Group Technical Reports, (2003), Available: http://hdl.handle.net/1721.1/21168.
Wang, B., Barahona, M. & Buck, M. Amplification of small molecule-inducible gene expression via tuning of intracellular receptor densities. Nucleic Acids Res. 43, 1955–1964 (2015).
Hillson, N. J., Rosengarten, R. D. & Keasling, J. D. j5 DNA assembly design automation software. ACS Synth. Biol. 1, 14–21 (2012).
Wang, B. J., Barahona, M. & Buck, M. Engineering modular and tunable genetic amplifiers for scaling transcriptional signals in cascaded gene networks. Nucleic Acids Res. 42, 9484–9492 (2014).
Shetty, R. P., Endy, D. & Knight, T. F. Jr. Engineering BioBrick vectors from BioBrick parts. J. Biol. Eng. 2, 5 (2008).
Registry of standard biological parts—Part:pSB1K3. Available: http://parts.igem.org/Part:pSB1K3.
Andreou, A. I. & Nakayama, N. Mobius assembly: a versatile golden-gate framework towards universal DNA assembly. PLoS ONE 13, e0189892 (2018).
Guo, Y. et al. YeastFab: the design and construction of standard biological parts for metabolic engineering in Saccharomyces cerevisiae. Nucleic Acids Res. 43, e88 (2015).
Chen, Y.-J. et al. Characterization of 582 natural and synthetic terminators and quantification of their design constraints. Nat. Methods 10, 659–664 (2013).
Zhao, K., Liu, M. Z. & Burgess, R. R. Promoter and regulon analysis of nitrogen assimilation factor, σ54, reveal alternative strategy for E. coli MG1655 flagellar biosynthesis. Nucleic Acids Res. 38, 1273–1283 (2010).
We thank Trevor Y. H. Ho for his help in preparing the manuscript and valuable suggestions. We thank Dr Jorg Schumacher (Imperial College London) for supplying the Klebsiella oxytoca M5a1 strain, Andreas Andreou (University of Edinburgh) for providing plasmid pΔ_lvl2_vioABDE and Dr Attila Molnar for critical reviewing of the manuscript. This work was supported by the UK Biotechnology and Biological Sciences Research Council grant [BB/N007212/1], UK Research and Innovation [MR/S018875/1], Leverhulme Trust grant [RPG-2015-445], and Wellcome Trust Seed Award in Science [202078/Z/16/Z]. X.W. was supported by scholarships from the China Scholarship Council and the Scottish Universities Life Sciences Alliance.
The authors declare no competing interests.
Peer review information: Nature Communications thanks Alfonso Jaramillo and the other anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Liu, Y., Wan, X. & Wang, B. Engineered CRISPRa enables programmable eukaryote-like gene activation in bacteria. Nat Commun 10, 3693 (2019). https://doi.org/10.1038/s41467-019-11479-0
Biotechnology and Applied Biochemistry (2020)
Synthetic Biology and Metabolic Engineering Employing Escherichia coli for C2–C6 Bioalcohol Production
Frontiers in Bioengineering and Biotechnology (2020)
Effective CRISPRa-mediated control of gene expression in bacteria must overcome strict target site requirements
Nature Communications (2020)
Microbial Cell Factories (2020)