Optimising expression of the large dynamic range FRET pair mNeonGreen and superfolder mTurquoise2ox for use in the Escherichia coli cytoplasm

Mertens, Laureen M. Y.; den Blaauwen, Tanneke

doi:10.1038/s41598-022-22918-2

Download PDF

Article
Open access
Published: 26 October 2022

Optimising expression of the large dynamic range FRET pair mNeonGreen and superfolder mTurquoise2^ox for use in the Escherichia coli cytoplasm

Laureen M. Y. Mertens¹ &
Tanneke den Blaauwen¹

Scientific Reports volume 12, Article number: 17977 (2022) Cite this article

2277 Accesses
1 Citations
9 Altmetric
Metrics details

Subjects

Abstract

The fluorescent proteins superfolder mTurquoise2^ox (sfTq2^ox) and mNeonGreen function excellently in mammalian cells, but are not well expressed in E. coli when forming the N-terminus of constructs. Expression was increased by decreasing structures at the start of their coding sequences in the mRNA. Unfortunately, the expression of mNeonGreen started from methionine at position ten as optimisation introduced an alternative RBS in the GFP N-terminus of mNeonGreen. The original start-codon was not deleted, which caused the expression of isomers starting at the original start-codon and at the start-codon at the beginning of the GFP N-terminus. By omitting the GFP N-terminus of mNeonGreen and optimising the structure of its mRNA, the expression of a mixture of isomers was avoided, and up to ~ 50-fold higher expression rates were achieved for mNeonGreen. Both fluorescent proteins can now be expressed at readily detectable levels in E. coli and can be used for various purposes. One explored application is the detection of in-cell protein interactions by FRET. mNeonGreen and sfTq2^ox form a FRET pair with a larger dynamic range than commonly used donor–acceptor pairs, allowing for an excellent signal-to-noise ratio, which supports the detection of conformational changes that affect the distance between the interacting proteins.

FAST, a method based on split-GFP for the detection in solution of proteins synthesized in cell-free expression systems

Article Open access 05 April 2024

Design of a palette of SNAP-tag mimics of fluorescent proteins and their use as cell reporters

Article Open access 13 June 2023

Chemically stable fluorescent proteins for advanced microscopy

Article Open access 07 November 2022

Introduction

Fluorescent proteins genetically fused to proteins of interest are widely used in molecular biology to study protein localisation, transport, interactions, dynamics and many other facets of life. Fluorescent proteins are used across all domains of life and in cells of organisms as diverse as bacteria¹, plants², and humans³. Most fluorescent proteins, however, are developed for use in a mammalian context and are in our experience not well expressed in Escherichia coli specifically, or bacteria in general. Expanding of available fluorescent proteins (FPs) for use in bacteria allows for more complex experiments, like detecting the concurrent expression of multiple fusions, or for more complex techniques such as FRET, high-resolution microscopy or FRAP -all current methods used by microbiologists working on complex and relevant scientific questions^4,5,6,7,8,9.

Our group has previously developed the donor superfolder turquoise 2 ox (sfTq2^ox), and the acceptor mNeonGreen (mNG) FRET pair for the periplasm¹⁰. It has a superior dynamic range to previously used FRET pairs for E. coli¹⁰. This FRET pair was also a promising avenue for developing a more sensitive FRET assay for the cytoplasm than the thus far used cytoplasmic donor mKusabira-Orange (mKO) and acceptor mCherry (mCh) FRET pair¹¹. The experimentally determined dynamic range of mKO-mCh was 1.5–31%, whereas the dynamic range of sfTq2^ox-mNG in the cytoplasmic was 1.5–66%^10,11. A larger dynamic range allows for a better distinction between FRET and no-FRET and, therefore, a better signal-to-noise ratio and more sensitive measurements. The chromophore of the fluorescent proteins is in the centre of their beta-barrel structure. Consequently, the distance between the donor and acceptor chromophore will always be more than approximately 5 nm. The Förster distance R₀ is the distance between the chromophores that yields 50% FRET. The sfTq2^ox-mNG pair has a larger R₀ than the mKO-mCh pair resulting in a better dynamic range.

However, when repurposing the FRET pair sfTq2^ox and mNG for the cytoplasm of E. coli, we encountered deficient expression of N-terminal cytoplasmic fusions to sfTq2^ox. So far, in E. coli, sfTq2^ox was only employed in the periplasmic space or C-terminal fusions. Consequently, the coding sequence for sfTq2^ox was never the start of an open reading frame (ORF). It was always preceded by either a signal sequence for its targeting to the Sec-machinery (in the case of N-terminal fusions) or by another protein (C-terminal fusions, including the mNG-sfTq2^ox cytoplasmic tandem). sfTq2^ox was developed from mTq2¹⁰, a distant derivate of enhanced GFP (EGFP)¹², a GFP variant optimised for expression in human cells³. As the expression of sfTq2^ox for periplasmic purposes was up to par, it was not codon-optimized for E. coli. We hypothesise and prove that the initiation of translation is the bottleneck in sfTq2^ox expression, and therefore we could enhance expression by optimising specifically the first 20 codons of the protein.

The expression of fluorescent protein fusions is restricted to a small window of appropriate amounts. Too low expression does not result in detectable signals; too high expression of fluorescent protein(fusion)s can be detrimental to cells. There is an additional requirement for a FRET assay: the produced amounts of the two interaction partners should be close to a 1:1 ratio. To fulfil this latter requirement, the expression of mNG had to be increased as well. For our studies of bacterial cell cycle-related events, the fluorescent fusion proteins are expressed from low copy-number plasmids under the control of an IPTG inducible downregulated pTRC99A promoter (P_trcdown) or by their native promoter (region)^10,11,13. Using the P_trcdown expression system, we optimised the expression of the sfTq2^ox-mNG FRET pair. During the optimisation of the expression of these proteins, we confirmed that GFP-based terminal fragments added to many fluorescent proteins might provide alternative ribosome binding sites (RBSs) and lead to the expression of isoforms. A Standard Operating Procedure (SOP) for optimising fluorescent protein expression in bacteria has been formulated, (Supplementary Fig. S1).

The broader purpose of this manuscript is to create awareness of a relatively easy solution to common problems amongst scientists working with fluorescent proteins in bacteria and possibly archaea. Observing that the expression of a fluorescent protein does not result in detectable fluorescence is now often a dead end for scientists. A problem which we demonstrate in this work can be easily overcome, resulting in dramatic increases in expression levels.

Results

Increasing sfTq2^ox for N-terminal, cytoplasmic expression

The low expression level of open reading frames (ORFs) starting with sfTq2^ox was first observed for the fusion sfTq2^ox-FtsZ (Fig. 1A, B). Several successful N-terminal fusions to the bacterial cell division protein FtsZ that forms a ring structure at midcell have been made before to other fluorescent proteins such as mCherry¹¹, SYFP2¹¹ and mNG (unpublished). However, sfTq2^ox-FtsZ fusions did not result in apparent midcell localisation, nor did the detected fluorescence level increase above the autofluorescence of cells transformed with a void plasmid (Fig. 1A, C). This was also the case for unfused, cytoplasmic sfTq2^ox (Fig. 1A–C). Since expression of C-terminal sfTq2^ox fusions in E. coli was higher and previous work has shown that synonymous mutations in the N-terminal region of GFP have a serious impact on its expression levels¹⁴, the codons in the N-terminus of sfTq2^ox were exchanged to improve its expression level. sfTq2^ox is a far derivative of human-optimized GFP (EGFP), and the amino acids at its N-terminus still match those of EGFP completely^12,15,16,17. The first 20 codons of sfTq2^ox were replaced by synonymous codons used in gfp-mut2, a GFP variant well expressed in E. coli (sequences are aligned in Fig. 1E)¹. The version of sfTq2^ox with the first 20 codons replaced by those of gfp-mut2 is termed sfTq2^oxopt.

The expression levels of unfused sfTq2^oxopt and sfTq2^oxopt-FtsZ were compared with their sfTq2^ox equivalents by quantification of fluorescence microscopy images (Fig. 1A, C) and western blotting (Fig. 1D) of cells grown exponentially in rich medium. The sfTq^oxopt variants were expressed much better than the sfTq2^ox variants (Fig. 1C). The enhanced brightness image of Fig. 1A shows that the cells expressing sfTq2^ox exhibit similar levels of fluorescence as the cells with the void plasmid and that sfTq2^ox-FtsZ expressing cells show some midcell localisation, albeit scarcely distinguishable from the cell’s autofluorescence. The increase in expression of constructs commencing with sfTq2^oxopt is sufficient to detect fluorescence and to visualise localisation in the cells. Cells still have a regular morphology and grow as wild type, indicating that the expression levels are below a toxic threshold.

The hypothesis that the start of translation was the limiting factor for expressing constructs commencing with sfTq2^ox, was further examined by introducing sfTq2^oxopt in a C-terminal fusion to the outer membrane protein OmpA (Fig. 1B). Since the translation start and amino acids of OmpA-sfTq2^ox and OmpA-sfTq2^oxopt are the same, their expression level should not be different despite the changes made in the sfTq2^oxopt. Indeed, the images in the lowest panel of Fig. 1A and their quantification in Fig. 1C show that the amount of fluorescence signal for those fusions remained similar. This is further confirmed by the bands with similar intensity in the western blot in Fig. 1D.

The mutations introduced in sfTq2^ox to create sfTq2^oxopt all but one increased the amount of A/T-nucleotides and reduced the amount of G/C-nucleotides. This likely lowers secondary structures in this part of the mRNA and facilitates the binding of ribosomes and translation initiation. To verify whether the mutations impact the secondary structure, the 5′ UTR sequence until the fiftieth codon were entered in the RBS Calculator tool provided by SalisLab.net¹⁸. Results (Supplementary Table S1) show that secondary structures surrounding the start codon in the unbound mRNA (ΔG_mRNA) are less stable for sfTq2^oxopt than for sfTq2^ox. Also, the transition from unbound mRNA to ribosome-bound mRNA (ΔG_total) requires less energy input for sfTq2^oxopt than sfTq2^ox.

In conclusion, the expression of N-terminal sfTq2^ox fusion has been improved to a detectable level by introducing alternative codons derived from gfp-mut2. This increase in expression occurs only when sfTq2^oxopt forms the N-terminus of the expression fusion, not when it is present as a C-terminal fusion. This result agrees with the strong correlation of a lower GC-content near the start codon with increased expression of GFP¹⁴. General recommendations for optimising the expression of EGFP derived fluorescent proteins in bacteria can be found in Supplementary Fig. S1.

Optimising the N-terminus of mNeonGreen for better expression

mNG was developed from lanYFP, found in Branchiostoma lanceolatum¹⁹. During the development of mNG, short termini of EGFP were added to the termini of lanYFP (Fig. 1A, C). This is a common intervention, also found in (amongst others) mCh, its derivatives and other members of the mFruits family of fluorescent proteins²⁰. For the comprehensibility of the present study, the different versions of mNG discussed are named mNG-version1-6 (abbreviated as mNG-v1-6).

As with sfTq2^ox, C-terminal fusions of mNG (mNG-v1) were well expressed^10,13. Constructs with mNG (the FRET partner of sfTq2^ox) at the N-terminus were initially expressed at undetectable levels in E. coli. This was overcome by omitting the GFP-derived N-terminus of mNG, resulting in mNG-v2¹³. mNG-v2 was also used in the cytoplasmic tandem to prove that the dynamic range of mNG-sfTq2^ox was 1.5–66%^10,11. However, to match the expression level of constructs commencing with sfTq2^oxopt, a requirement for FRET measurements, this intervention was not sufficient. Therefore, the expression of mNG had to be further enhanced.

mNG-v1 contains the commonly used N-terminal (E)GFP terminus MVSKGEEDN. The mNG-v1 expression has also been increased by exchanging codons for V2 to E6 with synonymous codons with less C and G nucleotides, resulting in mNG-v3 (Fig. 2A, C show sequences)²¹. mNG-v3 was developed for use in promotor-reporter assays. mNG-v2 was previously used for localisation studies and as the N-terminus of an mNG-sfTq2^ox tandem for FRET. mNG-v3 and mNG-v2 were expressed at similar levels (Fig. 2B and Supplementary Fig. S3). mNG-v2 was used as the starting point for optimisation, as it has already performed well in N-terminal fusions for localisation and tandem-FRET purposes. Codons A11 to D20 in mNG-v2 were optimised by replacing C and G nucleotides with A or T nucleotides in synonymous codons. The resulting mNG-v4 was expressed at a higher level than mNG-v2 and mNG-v3. The signal of a mid-cell localising fusion was also better (Supplementary Fig. S4). However, when used as a FRET pair with sfTq2^oxopt, the expression of mNG-v4 was not sufficiently improved to match the expression level of sfTq2^oxopt.

Therefore, we re-introduced the N-terminal GFP sequence to mNG-v4, but now encoded by the codons used in gfp-mut2 and AT-optimised one of the two spacer codons (N9, see Fig. 2A, C). The resulting mNG-v5 showed a much higher fluorescence per µm³ cell (Fig. 2B and Supplementary Fig. S3) than mNG-v4. mNG-v5 was then used as a FRET acceptor for sfTq2^oxopt. The expression of mNG-v5 was now too high for FRET with sfTq2^oxopt. To reduce the expression of mNG-v5 and obtain a ratio of approximately 1:1 with sfTq^oxopt, the optimised GFP N-terminus was combined with non-optimised codons A11-D29, resulting in mNG-v6 (Fig. 2A, C). This optimisation resulted in a reduced expression level similar to that of mNG-v4 (Fig. 2B and Supplementary Fig. S3), and mNG-v6 is therefore unsuitable for FRET with sfTq2^oxopt.

Before further steps were taken to optimise mNG expression levels suitable for FRET with sfTq2^oxopt, it was established that the differences in expression were not caused by differences in mRNA concentration. A western blot confirmed that the increase in fluorescence detected for mNG-v5 was due to more proteins (Fig. 2D). Additionally, a qPCR was performed on whole RNA-extract from cultures expressing mNG-v2, mNG-v5 and mNG-v6. Differences among the normalized threshold cycles (Fig. 2E) are not significant. The detected threshold sequences for mNG were normalised with established primers²² amplifying part of idnT (encoding a putative L-idonate and D-gluconate transporter). This confirms that regulation of expression occurs at the level of translation.

When re-analysing the best-expressed mNG so far (mNG-v5), it was noticed that the gfp-mut2 sequence coding for the GFP peptide contained a putative Shine-Dalgarno (SD) sequence (underlined in Fig. 2A). This was confirmed in silico by the RBS Calculator¹⁸, predicting an ORF starting at Met10 with a higher predicted translation rate than the ORF commencing at Met1 (Supplementary Table S1 for the predicted translation rates). The original mNG-v1 and mNG-v3 also contain a putative SD sequence upstream of Met10, but their predicted translation rates were far less efficient (Supplementary Table S1). The additional putative SD-sequence in the GFP N-terminus of mNG-v5 could possibly create an alternative, more efficient start of translation.

Alternative ribosome binding site pivotal to increase in expression of mNeonGreen

We speculated that mNG-v5 contains an alternative RBS and that translation started at Met10 rather than at Met1. To prove this, mNG-v5 mutants Met1K (AUG > AAG) and Met10I (AUG > ATA) were constructed, restricting translation to start at either Met10 or Met1, respectively. The fluorescence of strains expressing these mutants was compared with strains expressing mNG-v5. Figure 3A shows that mNG-v5 M1K is still expressed, albeit not as much as mNG-v5. Cells expressing mNG-v5 M10I did not fluoresce above the background of cells with a void plasmid. mNG-v5 is, to a large extend, expressed from the alternative RBS and the ORF starting at Met10. It confirms the prediction of the RBS Calculator that Met10 is an efficient start of translation (Supplementary Table S1).

Notably, mNG-v6 also contains the alternative RBS but is not expressed as well as mNG-v5 (Fig. 2B, C). This means that in addition to a stronger RBS, AT-optimization of codons A11-D29 is also required to achieve the high expression levels of mNG-v5. We, therefore, continued our search for the desired expression level of mNG by replacing the original RBS of mNG-v4 with a short part (G5-D8) or a long part (V2-N9) of mNG-v5. Creating RBS^short-mNG-v4 and RBS^long-mNG-v4 (sequences in Fig. 3B). RBS^short misses part of the putative RBS sequences and could therefore regulate mNG expression to obtain a good ratio with sfTq2^oxopt. The RBS^short-mNG-v4 construct resulted indeed in a lowered expression compared to mNG-v5 and is a potential candidate for FRET with sfTq2^oxopt. RBS^long-mNG-v4 resulted in more than double the measured fluorescence per µm³ cell as for mNG-v5 and is therefore not valid for the purpose of a balanced FRET pair. However, it might have a purpose in experiments that require high expression levels, such as reporter fusions. We have drafted a Standard Operating Procedure (Supplementary Fig. S1) for optimising the expression of fluorescent proteins with a GFP-derived N-terminus, like mNG.

mNG-RBS^short-v4 and sfTq2^oxopt form a balanced FRET pair

The last step of this project was to confirm that the expression ratio between RBS^short-mNG-v4 and sfTq2^oxopt does allow for FRET measurements. A ~ 1:1 ratio of mNG:sfTq2^ox is required to reliably detect FRET, meaning an mNG variant was needed that expressed ~ 3 times fewer proteins than the highly expressed mNG-v5. The previous section showed that RBS^short-mNG-v4 approximately meets that requirement (Fig. 3A), and it was used in a FRET experiment. PBP2 and RodA have previously been used in successful FRET experiments fused to the mKO-mCh FRET pair yielding an acceptor FRET efficiency (Ef_A, where the increase in fluorescence of the acceptor fluorescent protein is the measure of FRET) of about 13%²³. The protein fusions sfTq2^oxopt-PBP2 and RBS^short-mNG-v4-RodA were constructed to test the new FRET pair. Positive control is the tandem RBSshort-mNG-v4-sfTq2ox and negative control is the non-interacting RBS^short-mNG-v4-RodA and sfTq2^oxopt-GlpT were used (schematically shown in Fig. 4A, and Supplementary Table S2). As expected, the Ef_A for the RodA-PBP2 pair was increased to 20% due to the larger R₀ of the sfTq2-mNG FRET pair compared to the mKO-mCh FRET pair (Fig. 4B and Supplementary Table S3). The FRET efficiency based on the donor (using the decrease in donor fluorescent protein fluorescence as a measure for FRET) was 25%: indicating that the sfTq2 to mNG ratio was 0.8, which was sufficiently close to the desired 1:1 ratio to function as FRET pair to study protein interactions.

In conclusion, a more efficiently expressed variant sfTq2^ox and a matching mNG have been constructed that can function to study the interaction between proteins in the cytoplasm of E. coli and possibly other bacterial species. In addition, an exceptionally well expressed mNG version was constructed (RBS^long-mNG-v4) that could be useful as promoter reporters or whenever higher expression rates are required.

Discussion

Fluorescent proteins have been applied across all domains of life. This is possible since the genetic code, converted from DNA to RNA, to proteins, as described by the central dogma²⁴ of molecular biology, is virtually universal to all known life forms. However, the types of machinery and, therefore, (regulatory) mechanisms of protein expression vary amongst the different domains of life. In eukaryotes -with some exceptions like internal ribosome entry sites (IRES²⁵)- translation is initiated by the binding of a ribosome to the CAP at the 5′ terminus of an mRNA and its subsequent scanning for the first start-codon, where translation starts²⁶. In bacteria, the ribosomal subunits assemble to form a functional ribosome by the complementarity of the anti-SD sequence on the rRNA to an SD sequence on the mRNA upstream of an ORF²⁶. These SD sequences can be located near the 5′ end of an mRNA but can be found along the entire length of mRNAs (allowing for the polycistronic nature of bacterial mRNAs). These differences have now been proven to be problematic in the heterologous expression of fluorescent proteins in bacteria.

Increasing the expression of the EGFP-derived sfTq2^ox in E. coli was relatively easy. Rather than replacing all codons in such a protein, this study confirms¹⁴ that replacing only the first twenty codons with synonymous codons from gfp-mut2 is sufficient to increase the expression of translational units starting with such fluorescent proteins. The codon replacement did not result in the most frequently used codons but decreased the use of C- and G-nucleotides and increased A- and T-nucleotides usage. As reported by Kudla and colleagues, such replacements in the vicinity of the start-codon correlate with an increase in translation initiation, likely through a decrease in strong mRNA structures near the start-codon¹⁴. The sequence of the first 20 codons in gfp-mut2 is a generally applicable sequence which can be used to increase the expression of GFP-derived fluorescent proteins in E. coli. Such mutations can easily be introduced with mutagenesis primers using various PCR techniques.

As most fluorescent proteins are initially developed for applications in eukaryotic species (particularly humans and mice), researchers often overlook bacterial-specific requirements or intricacies. The introduction of synonymous codons with an increase in AT-content in the GFP N-terminus added to many non-GFP-derived fluorescent proteins^19,20,27,28, like mNG, may result in the expression of unwanted isoforms. The introduced GFP N-terminus is MVSKGEEXN (where the X is either L, D or N) and the added GFP C-terminus GMDLYK. Specifically, the KGEEX coding nucleotides provide a putative RBS. The rudimental methionine at position ten, present in mNG¹⁹, RFP1.3-derivates including mCherry²⁰, mFruits^20,27 and many others²⁸, will not be an alternative start of translation in eukaryotes, but it can be in bacteria. Ribosome assembly in bacteria is not restricted to the 5′ end of mRNAs. These N-termini, therefore, introduce an additional RBS at an opportune distance from a rudimental methionine²⁹. This can lead to the expression of 9 amino acids shorter isoforms or even the translation of non-fused fluorescent proteins in C-terminal fusions.

Coinciding with our observation of the expression of mNG from a rudimental methionine at the end of a GFP-derived N-terminus, it was also described for mCherry in a preprint by Fages-Lartaud and colleagues³⁰. These scientists used an E. coli optimised version of mCh, which contained a similar RBS upstream of Met10. Also, coinciding with the preparations of this manuscript, researchers optimised mNG expression for Saccharomyces cerevisiae by replacing codons to match the most used codons in S. cerevisiae (ymNG_WT) and subsequently by decreasing structures in the mRNA coding for the mNG N-terminus (ymNG_RM)³¹. They also introduced ymNG_RM in E. coli and observed an increase in expression. However, the codon replacements made to decrease structures in the 5′ end of the mRNA are likely introducing an alternative RBS so translation can start from Met10, rather than an increased expression of full-length mNG. We verified this in silico using the RBS predictor¹⁸ and found that, indeed Met10 is a more likely start of translation than Met1 (Supplementary Table S1).

The expression of a 9-amino acid shorter isoform is not problematic by itself: proteins are still fluorescent, and the GFP N-terminus is not likely to have any effects on other behaviours of the protein. However, there are two arguments why one would want to avoid this. The first being that expressing a mixture of two isoforms—and not knowing which ratio is inept and can be easily avoided by omitting the GFP N-terminus altogether. The second being that in C-terminal fusions, one can theoretically express a relatively large amount of unfused fluorescent protein (starting at Met10 of the fluorescent protein) and not only the expression of the intended fusions protein. This leads to a large amount of background fluorescence. This second problem was not addressed in this study, but researchers should be aware of this risk.

To avoid the expression of isoforms and uncertainty over the exact start of translation, we propose to abolish and/or adapt these GFP-derived peptide linkers at the N-termini of fluorescent proteins when working in bacteria. In N-terminal fusions (FP-protein), the GFP-derived sequence has no function and could be abolished without any consequences.

Another concern we wish to address is that it is often (almost) undecipherable from publications whether fluorescent proteins have been codon-optimized and to what extent. The amino acid sequence and, therefore the resulting protein may be the same, but as this study and the work of Fages-Lartaud and colleagues³⁰ shows: it does matter and should be reported more clearly. sfTq2^ox and sfTq2^oxopt have the same amino acid sequence and can therefore consider to be the same protein: however, their expression in E. coli differs hugely. In the case of mNG-v5 versus mNG-v1, the introduction of synonymous codons leads to the expression of two different proteins, emphasising that the current naming system for proteins not allowing for differentiating between those two is too limited. In addition, the recent finding that synonymous mutations often impact fitness as much as non-synonymous mutations in yeast implicates that codon usage has a more significant impact than has been considered so far³².

A complicating factor in obtaining a good ratio between mNG and sfTq2^ox for FRET is the different effects of fixation on sfTq2^oxopt and mNG. It was previously shown that sfTq2^ox continues maturing to its fluorescent state after fixation¹⁰. Cells expressing mNG, however lose approximately 60% of their fluorescence per µm³ cell upon fixation and only reach about 70% of the fluorescence per µm³ cell volume after overnight maturation (Supplementary Fig. S8). This means that 30% of mNG molecules do not recover to a fluorescent state after fixation. A 1:1 ratio of the fluorescent proteins, in this case, does therefore not mean that there is a 1:1 ratio of proteins that fluoresce. The combination of protein fusions starting with RBS^short-mNG-v4 and sfTq2^oxopt, however is sufficiently balanced to be suitable for cytoplasmic FRET.

The improvements in the expression of sfTq2^ox and mNG discussed in this manuscript may have broader applications than solely those two fluorescent proteins expressed in specifically E. coli. When expression of a fluorescent protein in a bacterium does not result in detectable fluorescence, it may be worth attempting to increase its expression. In Supplementary Fig. S1, the reader can find a flowchart describing steps to take, resources and some potential pitfalls when optimizing expression of fluorescent protein expression for bacterial species, based on this work. The SOP is centred around what worked for E. coli but it is constructed in such a manner that it can also be used as a starting point for improving the expression of FPs in other bacterial species. Since translation by the kingdoms of bacteria and archaea is extremely diverse³³, we cannot guarantee its general applicability. Even in the well-studied bacterium E. coli, precise predictions about translation rates are difficult—as evidenced by the deviations between the predicted translation rates (Supplementary Table S1) and the measured fluorescence (Figs. 1, 2, 3). Amongst different bacterial species, more factors have to be taken into account (e.g. some species express mainly leaderless mRNAs, some almost none)³³. The anti-SD sequence is quite strongly conserved, whereas enormous diversity amongst bacterial species SDs exists, which implies that our findings for the alternative RBS found in the first nine codons of mNG (and other non-GFP derived FPs) is relevant for some, but not all bacterial species.

Methods

Strains and growth conditions

This study used two E. coli K-12 strains (described in Table 1). LMC500 was used for the expression of FP(-fusion)s, for both establishing expression levels and the FRET assay. For the generation and selection of cloning products, DH5α was utilised.

Table 1 Strains used in this study.

Full size table

Strains were grown in either minimal GB1 medium (1 L contains: 4 g glucose (Roth, Karlsruhe, Germany), 4.83 g K₂HPO₄ (VWR), 2.95 g KH₂PO₄ (Fisher Chemical), 1.05 g (NH₄)₂SO₄ (Sigma-Aldrich), 0.10 g MgSO₄·7H₂O (Roth), 0.28 mg FeSO₄·7H₂O (Sigma-Aldrich), 7.1 mg Ca(NO₃)₂·4H₂O (Sigma-Aldrich), 4 mg thiamine (Sigma) and 50 mg lysine (Sigma)) at 28 °C for steady-state growth required for FRET experiments, or in rich medium (TY, 1 L contains: 10 g tryptone (Duchefa), 5 g yeast extract (Fisher Bioreagents) and 5 g NaCl (Acros Organics)) at 37 °C for all other purposes. Liquid cultures were grown under constant agitation (200 rpm), whereas media were solidified for selective growth after transformations by adding 20 g agar (Sphaero Q) per litre medium. When needed for plasmid maintenance, media were supplemented with relevant antibiotics: 100 µg/L ampicillin (Roth) and 25 µg/L chloramphenicol (Sigma). The expression of proteins from plasmids was induced with 15 or 50 µM isopropyl ß-D-1-thiogalactopyranoside (IPTG, Duchefa).

Plasmids and plasmid construction

All plasmids and primers can be found in Supplementary Tables S4 and S5, respectively.

Site-directed mutagenesis by inverse PCR was performed on plasmids to introduce mutations in sfTq2^ox, and mNG and replace RBSs³⁴. To overcome primer-dimerization issues, mutagenesis PCR was performed in two stages. The nine first PCR cycles were performed at low annealing temperature, allowing the mutagenesis primers to anneal to their mismatched template. These were followed by 26 cycles where the annealing temperature was increased for primer binding to the fully matching templates generated in the first nine cycles. The PCR product was isolated and digested with DpnI (New England Biolabs) to degrade the template plasmid. Circularisation was realized upon transformation in DH5α cells as reported before³⁴.

Other plasmids were constructed by restriction-ligation cloning (Supplementary Table S4). When necessary for efficiency or introduction of alternative restriction sites, inserts were first amplified by PCR prior to digestion with restriction enzymes (all from New England Biolabs). Ligation was performed with T4 DNA ligase (New England Biolabs), and the product was selected by the transformation in DH5α cells.

Plasmid constructs were also generated with Gibson assembly when appropriate (Supplementary Table S4). PCR reactions generating fragments with matching overhangs were followed by DpnI digestion and band excision from agarose gels. Assembly was performed in homemade Gibson mix (all components from NEB), and successful products were subsequently selected by the transformation in DH5α.

All cloning products were checked by sequencing with the mix2seq kit (Eurofins) before subsequent use in experiments.

Comparative expression experiments

Experiments comparing FP expression levels were all performed in a rich medium. Overnight cultures were diluted 1:500 in 25 ml of fresh medium, grown to mid-exponential phase (OD₆₀₀ = 0.2–0.3), diluted to an OD₆₀₀ of 0.04 and subsequently induced with 15 µM IPTG. After two mass doublings, living cells were harvested for microscopy and western blot analysis. The remainder of the culture was fixed by adding 2.8% formaldehyde and 0.04% glutardialdehyde (final concentrations).

Living cells intended for microscopy were washed twice in Milli-Q water to remove the yellow background signal stemming from the TY medium. Living cells for western blot analysis were pelleted by centrifugation and frozen at -20 °C after the supernatant was removed. After adding fixatives to the cultures, cultures were fixed for 30 min, under constant agitation. Cultures were subsequently washed in PBS (pH = 7.2), a fraction of each sample was imaged immediately (termed ‘fixed’), and the remainder was stored in a dark place at room temperature for overnight fluorophore maturation and imaged the next day (termed ‘matured’).

Microscopy

Microscopy slides were prepared by trapping cell suspensions between a 1% agarose (Sphaero Q) slab on an object glass and a cover slip³⁵. For living cells, this agarose was added to Milli-Q water, and for fixed cells to PBS, pH = 7.2. Cells were imaged in a phase contrast and fluorescence microscopy channel with an Olympus BX-60 microscope (Tokyo, Japan), equipped with a Hamamatsu ORCAFlash-4.0LT CMOS camera (Hamamatsu, Naka-ku, Japan) and a UPlanApo 100×/N.A. 1.35 oil Iris Ph3 objective. The following filter cubes were used to imaging sfTq2^ox expression and mNG expression, respectively: Cyan-GFPv2 (cyan, ex436/20, dic455LP, em480/40) and EN-GFP (green, ex470/40, dic495LP, em525/50).

For two experiments, a different microscope was used. The maturation of mNG-v4 (Supplementary Fig. S8) and the FRET samples (Supplementary Fig. S6) were imaged with a Nikon Eclipse Ti microscope equipped with a C11440-22CU Hamamatsu ORCA camera, a CFI Plan Apochromat DM 100 × oil objective, an Intensilight HG 130 W lamp and the NIS elements software (version 4.20.01). The following filter cubes were used for the imaging of sfTq2^ox expression and mNG expression, respectively: CFP MXU96210/C153677 (ex426-446, dic455LP, em460-500) and GFP-B MBE44740 (ex460-500, dic505, em510-560).

Microscopy data were analysed with the ImageJ plugin ‘Coli-Inspector’³⁶. The fluorescence per µm³ reported as an indication of expression levels of fluorescent proteins is given by ‘Concentration Total’.

Western blot

Biomass concentrations in frozen cell pellets were equalized by adding appropriate volumes of Lämmli buffer (0.0625 M Tris–HCl (VWR) pH = 6.8, 2% SDS (Merck), 10% glycerol (Biosolve Chimie SARL,), 0.1 M dithiothreitol (Sigma-Aldrich) and 0.001% bromophenol blue (Fisher Scientific)). Samples were kept on ice, heated for 5 min at 99 °C and centrifuged at 20,000 rcf for 5 min before loading. SDS-PAGES were hand-casted (4% stacking gel, 12% separating gel). Proteins were separated by electrophoresis and transferred to a 0.2 µm nitrocellulose membrane (Bio-Rad). sfTq2^ox was detected with primary antibody αGFP (1:2000, Fisher Scientific), FtsZ with a polyclonal antibody generated in rabbit against E. coli FtsZ³⁵. Both rabbit-antibodies were subsequently detected with the secondary Peroxidase AffiniPure Donkey Anti-Rabbit IgG (H + L) (Jackson ImmunoResearch). mNG was detected with anti-mNG (Chromotek) and secondary goat anti-mouse IgG (H + L) HRP (Invitrogen). The conjugated horse radish peroxidase (HRP) of the secondary antibody was subsequently visualised with the Pierce™ ECL Western blotting substrate (Thermo Scientific) in a Licor Odyssey machine (700 nm channel for the Precision Plus Protein Dual Color Standards ladder (Bio-Rad), chemiluminescence channel for HRP activity).

qPCR

The amount of mNG encoding mRNAs was established with qPCR. Total RNA was extracted from exponentially growing cells expressing mNG-v2-sfTq2^ox, mNG-v5-sfTq2^ox and mNG-v6-sfTq2^ox. Cells were harvested by centrifugation, lysed by cryogenic pulverisation and RNA was isolated from the lysate by phenol–chloroform isoamyl alcohol extraction, followed by isopropanol precipitation^37,38. The concentration of extracted RNA was determined with a Nanodrop machine, and equal amounts were used as the template for cDNA synthesis with the First Strand cDNA Synthesis Kit (Thermo Scientific), implementing the random hexamer primers provided in the kit for amplification. The cDNA functioned subsequently as a template for a qPCR, with primers amplifying part of the mng sequence. For normalisation, established primers amplifying idnT were used²² in the DyNAmo HS SYBR Green qPCR Kit (Thermo Scientific). Three technical replicates per cDNA reaction and three technical replicates per qPCR resulted in nine qPCR reactions per sample. Normalisation was done by subtracting the mean threshold cycle (Ct) of the three replicate qPCRs per cDNA for idnT from the mean Ct for mng of the same cDNA template (ΔCt method). Values are negative as the threshold cycles for mng were reached earlier than those for idnT.

Predicting translation efficiencies and mRNA structures in silico

Translation efficiency calculations were performed with RBS Calculator (version 2.1) accessed through https://salislab.net/software/predict_rbs_calculator, with standard settings and host organism Escherichia coli str. K-12 substr. MG1655¹⁸. Inserted sequences used for calculations span from the start of transcription until the fiftieth codon (in the T50 for sfTq2^ox variants and D50 for mNG variants). The exact input sequences can be found following Supplementary Table S1 in the supplementary materials.

FRET experiment

FRET experiments were performed as previously described^11,39. In short: LMC500 cells transformed with two plasmids (Supplementary Table S2) were grown to a steady state in GB1 medium supplied with ampicillin and chloramphenicol (for plasmid maintenance) at 28 °C. Subsequently, the cultures were diluted to an OD_450nm of 0.05 and induced with 15 µM IPTG (50 µM IPTG for sfTq2^oxopt and mNG references) for two mass doublings and then fixed with 2.8% formaldehyde and 0.04% glutardialdehyde (final concentrations). Fixed cells were washed with PBS and kept at RT for overnight maturation of the fluorescent proteins. Emission spectra were measured with the Spectrofluorometer Quantamaster 2000-4 (Photon Technology International) and unmixed as previously published³⁹. For the purpose of this experiment, we used a known pair of interacting proteins: RodA and PBP2²³. RodA with GlpT was the negative control, since it was previously proven that those two proteins do not interact²³.

Data availability

The datasets generated and/or analysed during the current study are available from the corresponding author on reasonable request.

References

Cormack, B. P., Valdivia, R. H. & Falkow, S. FACS-optimized mutants of the green fluorescent protein (GFP). Gene 173, 33–38 (1996).
Article CAS PubMed Google Scholar
Chiu, W. et al. Engineered GFP as a vital reporter in plants. Curr. Biol. 6, 325–330 (1996).
Article CAS PubMed Google Scholar
Tsien, R. Y. The green fluorescent protein. Annu. Rev. Biochem. 67, 509–544 (1998).
Article CAS PubMed Google Scholar
Mamou, G. et al. Peptidoglycan maturation controls outer membrane protein assembly. Nature 606, 953–959 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Söderström, B., Pittorino, M. J., Daley, D. O. & Duggin, I. G. Assembly dynamics of FtsZ and DamX during infection-related filamentation and division in uropathogenic E. coli. Nat. Commun. 13, 3648 (2022).
Article ADS PubMed PubMed Central Google Scholar
Martin, N. R. CrvA and CrvB form a curvature-inducing module sufficient to induce cell-shape complexity in Gram-negative bacteria. Nat. Microbiol. 6, 24 (2021).
Article Google Scholar
Gallay, C. et al. CcrZ is a pneumococcal spatiotemporal cell cycle regulator that interacts with FtsZ and controls DNA replication by modulating the activity of DnaA. Nat. Microbiol. 6, 1175–1187 (2021).
Article CAS PubMed PubMed Central Google Scholar
Szczepaniak, J. et al. The lipoprotein Pal stabilises the bacterial outer membrane during constriction by a mobilisation-and-capture mechanism. Nat. Commun. 11, 1305 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Navarro, P. P. et al. Cell wall synthesis and remodelling dynamics determine division site architecture and cell shape in Escherichia coli. Nat. Microbiol. 7, 1621–1634 (2022).
Article CAS PubMed PubMed Central Google Scholar
Meiresonne, N. Y. et al. Superfolder mTurquoise2 ^ox optimized for the bacterial periplasm allows high efficiency in vivo FRET of cell division antibiotic targets. Mol. Microbiol. 111, 1025–1038 (2019).
Article CAS PubMed PubMed Central Google Scholar
Alexeeva, S., Gadella, T. W. J., Verheul, J., Verhoeven, G. S. & Den Blaauwen, T. Direct interactions of early and late assembling division proteins in Escherichia coli cells resolved by FRET: bacterial division studied by spectral FRET. Mol. Microbiol. 77, 384–398 (2010).
Article CAS PubMed Google Scholar
Goedhart, J. et al. Bright cyan fluorescent protein variants identified by fluorescence lifetime screening. Nat. Methods 7, 137–139 (2010).
Article CAS PubMed Google Scholar
Meiresonne, N. Y., van der Ploeg, R., Hink, M. A. & den Blaauwen, T. Activity-related conformational changes in d,d-carboxypeptidases revealed by in vivo periplasmic förster resonance energy transfer assay in Escherichia coli. mBio 8, e01089-17 (2017).
Article CAS PubMed PubMed Central Google Scholar
Kudla, G., Murray, A. W., Tollervey, D. & Plotkin, J. B. Coding-sequence determinants of gene expression in Escherichia coli. Science 324, 255–258 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Kremers, G.-J., Goedhart, J., van Munster, E. B. & Gadella, T. W. J. Cyan and yellow super fluorescent proteins with improved brightness, protein folding, and FRET förster radius. Biochemistry 45, 6570–6580 (2006).
Article CAS PubMed Google Scholar
Prasher, D. C., Eckenrode, V. K., Ward, W. W., Prendergast, F. G. & Cormier, M. J. Primary structure of the Aequorea victoria green-fluorescent protein. Gene 111, 229–233 (1992).
Article CAS PubMed Google Scholar
Nagai, T. et al. A variant of yellow fluorescent protein with fast and efficient maturation for cell-biological applications. Nat. Biotechnol. 20, 87–90 (2002).
Article CAS PubMed Google Scholar
Reis, A. C. & Salis, H. M. An automated model test system for systematic development and improvement of gene expression models. ACS Synth. Biol. 9, 3145–3156 (2020).
Article CAS PubMed Google Scholar
Shaner, N. C. et al. A bright monomeric green fluorescent protein derived from Branchiostoma lanceolatum. Nat. Methods 10, 407–409 (2013).
Article CAS PubMed PubMed Central Google Scholar
Shaner, N. C. et al. Improved monomeric red, orange and yellow fluorescent proteins derived from Discosoma sp. red fluorescent protein. Nat. Biotechnol. 22, 1567–1572 (2004).
Article CAS PubMed Google Scholar
Steenhuis, M. et al. Inhibition of autotransporter biogenesis by small molecules. Mol. Microbiol. 112, 81–98 (2019).
Article CAS PubMed PubMed Central Google Scholar
Zhou, K. et al. Novel reference genes for quantifying transcriptional responses of Escherichia coli to protein overexpression by quantitative PCR. BMC Mol. Biol. 12, 18 (2011).
Article CAS PubMed PubMed Central Google Scholar
Liu, X., Biboy, J., Consoli, E., Vollmer, W. & den Blaauwen, T. MreC and MreD balance the interaction between the elongasome proteins PBP2 and RodA. PLOS Genet. 16, e1009276 (2020).
Article CAS PubMed PubMed Central Google Scholar
Crick, F. H. C. On protein synthesis. Mol. Biol. 27 (1956).
Hellen, C. U. T. & Sarnow, P. Internal ribosome entry sites in eukaryotic mRNA molecules. Genes Dev. 15, 1593–1612 (2001).
Article CAS PubMed Google Scholar
Melnikov, S. et al. One core, two shells: bacterial and eukaryotic ribosomes. Nat. Struct. Mol. Biol. 19, 560–567 (2012).
Article CAS PubMed Google Scholar
Ai, H., Shaner, N. C., Cheng, Z., Tsien, R. Y. & Campbell, R. E. Exploration of new chromophore structures leads to the identification of improved blue fluorescent proteins. Biochemistry 46, 5904–5910 (2007).
Article CAS PubMed Google Scholar
Lin, M. Z. et al. Autofluorescent proteins with excitation in the optical window for intravital imaging in mammals. Chem. Biol. 16, 1169–1179 (2009).
Article CAS PubMed PubMed Central Google Scholar
Shultzaberger, R. K., Bucheimer, R. E., Rudd, K. E. & Schneider, T. D. Anatomy of Escherichia coli ribosome binding sites. J. Mol. Biol. 313, 215–228 (2001).
Article CAS PubMed Google Scholar
Fages-Lartaud, M., Tietze, L., Elie, F., Lale, R. & Hohmann-Marriott, M. F. mCherry contains a fluorescent protein isoform that interferes with its reporter function. Front. Bioeng. Biotechnol. 10, 892138 (2022).
Article PubMed PubMed Central Google Scholar
Hector, R. E., Mertens, J. A. & Nichols, N. N. Increased expression of the fluorescent reporter protein ymNeonGreen in Saccharomyces cerevisiae by reducing RNA secondary structure near the start codon. Biotechnol. Rep. 33, e00697 (2022).
Article CAS Google Scholar
Shen, X., Song, S., Li, C. & Zhang, J. Synonymous mutations in representative yeast genes are mostly strongly non-neutral. Nature 606, 725–731 (2022).
Article ADS CAS PubMed Google Scholar
Wen, J.-D., Kuo, S.-T. & Chou, H.-H.D. The diversity of Shine-Dalgarno sequences sheds light on the evolution of translation initiation. RNA Biol. 18, 1489–1500 (2021).
Article CAS PubMed Google Scholar
Liu, H. & Naismith, J. H. An efficient one-step site-directed deletion, insertion, single and multiple-site plasmid mutagenesis protocol. BMC Biotechnol. 8, 91 (2008).
Article PubMed PubMed Central Google Scholar
Koppelman, C.-M. et al. R174 of Escherichia coli FtsZ is involved in membrane interaction and protofilament bundling, and is essential for cell division: membrane interaction of FtsZ. Mol. Microbiol. 51, 645–657 (2003).
Article Google Scholar
Vischer, N. O. E. et al. Cell age dependent concentration of Escherichia coli divisome proteins analyzed with ImageJ and ObjectJ. Front. Microbiol. (2015).
Heera, R. et al. Efficient extraction of small and large RNAs in bacteria for excellent total RNA sequencing and comprehensive transcriptome analysis. BMC Res. Notes 8, 754 (2015).
Article PubMed PubMed Central Google Scholar
Rivas, R. et al. An effective, rapid and simple method for total RNA extraction from bacteria and yeast. J. Microbiol. Methods 47, 59–63 (2001).
Article CAS PubMed Google Scholar
Meiresonne, N., Consoli, E., Mertens, L. & den Blaauwen, T. (2019) Detection of in vivo protein interactions in all bacterial compartments by förster resonance energy transfer with the superfolder mTurquoise2 ox-mNeongreen FRET pair. Bio-Protocol. 9 (2019).
Cormarck, B. P., Valdivia, R. H. & Falkow, S. FACS-optimized mutants of the green fluorescent protein (GFP). Gene 173, 33–38 (1996).
Article Google Scholar
Taschner, P. E., Huls, P. G., Pas, E. & Woldringh, C. L. Division behavior and shape changes in isogenic ftsZ, ftsQ, ftsA, pbpB, and ftsE cell division mutants of Escherichia coli during temperature shift experiments. J. Bacteriol. 170, 1533–1540 (1988).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors would like to acknowledge the following people for their contributions to this work: Timo Engelhardt and Idris Iritaş were excellent interns in our group and contributed to the experimental work. We would like to thank Yaozu Han and Alberto Agnolin for the extraction of RNA for qPCR. Nils Meiresonne for the construction of the plasmid pNM200 and pNM201 and his insightful comments on this project. Xiaolong Lui for the plasmid pXL30. Joen Luirink for providing a plasmid with mNG-v3 and his input during our discussions. This research was funded by the European Union’s Horizon 2020 research and innovation program under the Marie Skłodowska-Curie grant agreement No 721484 via the International Training Network Train2Target.

Author information

Authors and Affiliations

Bacterial Cell Biology and Physiology, Swammerdam Institute for Life Science, University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, The Netherlands
Laureen M. Y. Mertens & Tanneke den Blaauwen

Authors

Laureen M. Y. Mertens
View author publications
You can also search for this author in PubMed Google Scholar
Tanneke den Blaauwen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualisation, formal analysis, investigation, methodology, and manuscript preparation by L.M.Y.M. and T.d.B. Data curation and visualisation by L.M.Y.M. Funding acquisition and supervision by T.d.B.

Corresponding author

Correspondence to Tanneke den Blaauwen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Mertens, L.M.Y., den Blaauwen, T. Optimising expression of the large dynamic range FRET pair mNeonGreen and superfolder mTurquoise2^ox for use in the Escherichia coli cytoplasm. Sci Rep 12, 17977 (2022). https://doi.org/10.1038/s41598-022-22918-2

Download citation

Received: 12 July 2022
Accepted: 20 October 2022
Published: 26 October 2022
DOI: https://doi.org/10.1038/s41598-022-22918-2

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.