Abstract
Resource competition can be the cause of unintended coupling between co-expressed genetic constructs. Here we report the quantification of the resource load imposed by different mammalian genetic components and identify construct designs with increased performance and reduced resource footprint. We use these to generate improved synthetic circuits and optimise the co-expression of transfected cassettes, shedding light on how this can be useful for bioproduction and biotherapeutic applications. This work provides the scientific community with a framework to consider resource demand when designing mammalian constructs to achieve robust and optimised gene expression.
Similar content being viewed by others
Introduction
Resource competition arises from the simultaneous use of cellular gene expression resources (i.e. polymerases, ribosomes, and other cellular factors) by different expression cassettes, leading to a non-intuitive indirect coupling of their output levels and behaviour1,2,3. In bacteria, the phenomenon has been extensively assessed and quantified4,5,6, and many experimental and computational strategies towards its mitigation have been proposed7,8,9,10,11,12,13. Among these, identification of genetic designs with lower impact on the host resources was shown to be feasible and desirable for optimising synthetic construct performance5,14.
Although robust heterologous expression is central to mammalian cell engineering and its applications, resource competition in eukaryotes has only recently started to be investigated. Evidence confirmed that gene expression resources are as limiting in mammalian cells as they are in bacteria and that unobvious coupling of genetic circuits can arise due to synthetic cassettes drawing from the same pool of resources15,16,17,18,19. However, accurate characterisation of the demand that different construct components place on cellular resources is still missing in mammalian cells, limiting our ability to rationally design genetic constructs with minimised resource footprint towards robust and predictable engineering of these organisms. Understanding resource competition and its impact on construct performance is particularly important when co-expression of different cassettes is desired, something that is often sought after when engineering mammalian systems, from the use of transfection controls in fundamental research to the expression of multi-gene constructs for bioproduction and biotherapeutic applications17,20,21.
Results
A framework to quantify the resource load of genetic parts
To advance resource-aware construct design in mammalian cells, we developed a framework for quantification of the load imposed by different designs on cellular resources.
To visualise and quantify intracellular resource usage, a test plasmid is co-transfected along with a fluorescence-based capacity monitor, a CMVp-controlled cassette for mKATE expression15. When co-expressed in the cell, the monitor and the test plasmid rely on the same pool of cellular resources for gene expression. As the test plasmid uses more resources, less are available for the expression of the capacity monitor, leading to a decrease in its expression levels. Since the monitor cassette lacks regulation, changes in its expression levels can be used as a proxy measure of the available cellular resources (Fig. 1a). We designed a modular test plasmid architecture to enable easy substitution of genetic parts of interest (Fig. S1). This was used to generate a library of synthetic constructs for constitutive and inducible expression that were individually co-transfected with the capacity monitor in HEK293T and CHO-K1 to identify genetic designs with maximised expression and minimised resource load (Fig. 1b).
We first investigated the sensitivity of the capacity monitor to resource loading by co-transfecting synthetic designs with diverse compositions (Fig. 1c). Specifically, we compared the levels of the capacity monitor in competition with an empty vector (pEmpty) as control of minimal resource demand (i), a constitutively expressed tetracycline transactivator (rtTA) alone (ii), or in combination with its cognate promoter (TETp) driving EGFP expression in the absence (iii) or presence (iv) of the small-molecule inducer doxycycline (dox)22. In both HEK293T and CHO-K1, capacity monitor expression decreased as the complexity of the competing test plasmid increased (Fig. 1c, Fig. S2). While co-transfection of the rtTA-expressing plasmid and the uninduced TET-ON-controlled EGFP resulted in a decrease in capacity monitor expression of the same order of magnitude, a further reduction is observed upon dox addition (Fig. 1c). This is consistent with the expression of two transcriptional units, as opposed to only one in the less complex designs. Reducing the dosage of transfected plasmids did not rescue resource competition (Figure S3a), and induction of a single-copy integrated gene led to resource loading on the capacity monitor (Figure S3b), indicating that resource competition is not limited to co-transfected systems.
We then investigated the role of different genetic elements in driving resource competition. We used our modular backbone to generate a library of EGFP-expressing synthetic designs bearing different promoters, Kozaks, and polyA configurations. We selected seven constitutive promoters (pJB42CAT5, PGKp, hACTBp, SV40p, UBp, CMVp, EF1ap) and one inducible promoter (TETp), three Kozak sequences with different translational efficiency, and six polyA sequences (SV40pA, SV40pA in reverse orientation—SV40pA_rv-, PGKpA, BGHpA, RBpA, HGHpA) (Supplementary data file 1), reasoning they would likely represent critical parameters that influence the resource demand that synthetic constructs place on cellular gene expression resources at the transcriptional and translational levels.
First, we tested the effect that promoter strength imposes on cellular resources. We individually co-transfected a library of test plasmids bearing one of eight promoters along with the capacity monitor. Although, as previously reported23, equal promoters often display different outputs when adopted in different cell lines, promoter strength negatively correlated with monitor expression levels in HEK293T and CHO-K1 (Fig. 2a).
This suggests that promoter strength is a crucial parameter in defining the resource footprint of a synthetic construct, with stronger promoters requiring more resources than weaker ones. A few promoters showed better performance than others (i.e. UBp in CHO-K1), where “performance” is defined as the maximisation of both the test plasmid and the capacity monitor expression (Fig. 2a). Similar to what observed for constitutive expression, when the TET-ON system was induced with increasing dox concentrations, increased test plasmid production and corresponding decreased capacity monitor expression levels were observed (Fig. 2b).
To show that our findings are not dependent on the capacity monitor design, we tested different capacity monitors in which the mKATE is controlled by different promoters, namely SV40p, UBp, and EF1ap (Fig. S4). When tested with the promoter library, the additional monitor designs displayed similar behaviour to the original CMV-mKATE capacity monitor, with increasing transcriptional rate from the test plasmid leading to decreased capacity monitor expression. In addition, promoter and cell line-specific effects were observed. These are discussed in Supplementary Note 2.
PolyAs play an essential role in the regulation of eukaryotic gene expression, as they mediate transcriptional termination but also impact mRNA stability, localisation and translation24,25. When six different polyAs (SV40pA, SV40pA_rv, PGKpA, BGHpA, RBpA, HGHpA) were assembled within the modular library, diverse impact on the test plasmid output was observed (Fig. 2c, Fig. S5). This is in line with recent work highlighting polyA-dependent tuning of gene expression in mammalian cells26,27. Although some polyA sequences consistently resulted in increased or suppressed test plasmid outputs, we observed variability in combination with different promoter sequences (Fig. 2c, Fig. S5)28. We also individuated specific polyAs which consistently resulted in suppressed expression levels of the competing capacity monitor (i.e. PGKpA, SV40pA_rv in HEK293T, and SV40pA, HGHpA in CHO-K1) and observed that polyA-based interference on the capacity monitor expression is more dramatic in combination with stronger promoters (especially in CHO-K1) (Fig. 2c). It is worth noting that the effect of equal promoter-polyA combinations on the expression of the competing monitor is cell line-dependent (Fig. S5). To investigate this further, we designed and cloned four additional capacity monitors bearing different polyAs (i.e. SV40pA_rv, HGHpA, PGKpA and RBpA). By conducting competition experiments between the new monitor designs and five shortlisted test plasmids bearing different polyA configurations, we identified distinct polyA-specific competition profiles in addition to the previously observed cell line-specific effects (Fig. S6, Supplementary Note 3).
Although both promoter and polyA selection impact the expression of the competing capacity monitor, whether they do this directly by recruiting transcriptional resources or indirectly by impacting downstream translational resources requires further investigation.
Kozak sequences are directly linked to translational resources as they represent the translation start sites in eukaryotes. Kozak motifs with different translational efficiency have been designed and reported29. We thus adopted three previously reported Kozak sequences30 and tested their behaviour when assembled downstream of different promoters (Fig. S7). We observed that Kz1 and Kz3 displayed similar output for most promoter combinations, whereas Kz2 led to lower test plasmid expression (Fig. S7). When comparing different promoter-Kozak combinations, most of the variation in the capacity monitor expression was explainable by the change in promoter, and significant changes in the monitor expression due to Kozak variation were primarily observed in combination with strong promoters (Fig. 2d, Fig. S7). This suggests that transcriptional resources might be more limiting than translational resources in the tested cell lines, and that the latter can accommodate a higher demand from synthetic constructs before being saturated. To investigate this further, we cloned three hairpins of increasing minimum free energy (MFE) (Fig. S8a). The hairpins impede translation by sequestering the Kozak sequence. While increasing MFE led to increased suppression of EGFP translation, we observed minimal impact on the capacity monitor expression (Fig S8a), in accordance with our previous data. Interestingly, when similar changes in EGFP expression were obtained by a change in promoter sequence, the capacity monitor expression was dramatically altered (Fig S8a, red asterisks). Finally, we conducted mRNA co-transfection experiments to disentangle the contribution of transcriptional and translational resources to the resource competition landscape. Specifically, we generated two test mRNAs bearing Kozak sequences with different translational efficiency (Kz1-EGFP and Kz2-EGFP) and one monitor mRNA (Kz1-mKATE). Upon individual co-transfection of each test mRNA with the monitor mRNA, we observed no alteration in the mKATE expression levels despite significant differences in EGFP output (Fig. S8b). Altogether, these results are in striking contrast with those previously reported in bacteria, where translational resources play a major role in shaping resource competition5,8,31,32,33,34.
Identification of designs with minimised footprint
Combining all the library designs can clarify the overall competition landscape and inform on the presence of more efficient design options (Fig. 3a, Fig. S9). Although we observed that greater output from the test plasmids was generally matched by decreased expression of the capacity monitor, some designs appeared more performant than others, enabling the identification of regions where co-expression of both cassettes can be optimised (Fig. 3a, shaded area). To assess how various genetic components affect a construct’s resource footprint, we conducted a pairwise comparison of constructs differing in one of the three design elements considered in this study (promoters, Kozaks, or polyA tails) (Fig. 3b). We measured the differences in EGFP and mKATE (ΔEGFP and ΔmKATE) between each pair of constructs bearing (i) different promoters, but identical Kozaks and polyAs; (ii) different Kozaks, but identical promoters and polyAs; and (iii) different polyAs, but identical promoters and Kozaks. These pairs were then grouped into quartiles based on their ΔEGFP, revealing how similar changes in EGFP expression have a different impact on mKATE depending on whether the modification is made to the promoter, Kozak or polyA sequences (Fig. 3b–d). In both cell lines, the promoter is the genetic component with the greatest impact on the capacity monitor expression. Kozak efficiency minorly affects a construct’s resource footprint, and only for high ΔEGFP values. This is consistent with our prior observations indicating that translational resources play a minor role in driving resource competition. The role of polyAs is cell-line dependent, and while in HEK293T, increasing ΔEGFP due to polyA variation leads to increased monitor expression, in CHO-K1, polyA-based interference displays the opposite behaviour. We noted that while changing each of the tested genetic parts can lead to the same gain in test plasmid expression, promoters do so by impacting the capacity monitor to a greater extent than Kozaks and polyAs (Fig. 3b–d).
Multi-gene constructs are often needed for the production of biotherapeutic proteins. Different configurations are adopted for the co-expression of two or more genes, such as multiple transcriptional unit arrangements or bi-cistronic modules that use internal ribosome entry sites (IRES) or 2A peptides35.
Starting from the observation that transcriptional resources are more easily saturated than translational resources in mammalian cells, we compared different design configurations for multi-gene expression to identify designs with lower resource footprints.
We assembled a small library of constructs for the co-expression of EGFP and EBFP either from two separate transcriptional units (TUs) or from one TU using 2A peptides or an IRES sequence (Fig. 3e, f). In both HEK293T and CHO-K1, a stepwise decrease in the monitor expression was observed upon competition with an increasing number of TUs, with the pEmpty retaining higher capacity than a construct expressing one TU and two TUs, with 100%, 48% and 26% of remaining capacity in HEK293T, respectively (Fig. 3e, f). When EBFP and EGFP are arranged in a bi-cistronic configuration by use of IRES and 2A peptides, capacity is rescued to levels comparable to or higher than the expression of a single EGFP-expressing TU (i.e. ≥48% in HEK293T), despite the latter producing only one fluorescent protein (Fig. 3e, f, Fig. S10).
This reinforces our findings, as reducing the number of promoters in the circuit frees up transcriptional resources and dramatically reduces the resource footprint of the multi-gene configuration. However, for both cell lines, although the use of the IRES2 allows higher capacity than 2A peptides, this comes at the cost of reduced output levels of the coding sequence in second position (Fig. 3e, f, S10). This is in line with previous work demonstrating that IRES-dependent translation is lower than CAP-dependent translation, thus selection of appropriate bi-cistronic configurations (IRES vs 2A peptides) for specific applications may not always line up with the alternative with the lowest resource footprint.
Changing gene syntax on the test plasmid also resulted in different resource uptake (Fig. S11). Placing two TUs on the same plasmid yielded higher monitor expression than identical TUs split between two separate plasmids. This was mirrored by the decreased output of one of the two TUs in the linked arrangement compared to the split one (Fig. S11). Interference between adjacent transcriptional units is a well-reported phenomenon36, but effects on a competing gene were never observed before.
Optimisation of a genetic switch and co-expression systems
Our results can prove highly relevant to optimise the co-expression of different cassettes. To prove this, we chose two examples demonstrating that (i) it is possible to decrease the resource load of a gene switch widely adopted for gene expression control and that (ii) best-performing designs hold when adopted on a protein of interest for bioproduction and gene therapy applications.
We started by re-considering the TET-ON doxycycline-responsive system adopted earlier in this study, given that this system is widely used in mammalian cell engineering for controlled gene expression (i.e. miRNAs, biotherapeutics, reporters for foundational studies). The system is based on two TUs, one expressing the rtTA transactivator, which, in the presence of dox, can bind and activate an inducible TETp promoter driving the expression of a gene of interest (i.e. an EGFP in this case). In our experiments, the TET-ON system imposed a considerable load on cellular resources (Fig. 1c). We speculate that this is due to the expression of two transcriptional units needed for its functioning.
We designed a new configuration where transcription of both the rtTA and the EGFP is initiated from a single TETp through an IRES or 2A peptide and terminated by different polyAs selected to yield different test plasmid outputs. We observed increased monitor expression levels in both the uninduced and induced conditions when using a single TU configuration compared to the two TUs (Fig. 4a, Fig. S12). Although the resource-efficient designs demonstrated lower EGFP expression in induced conditions, an improvement in fold change (defined as the ratio of EGFP output in the presence and absence of dox) was observed in HEK293T cells. Additionally, we noticed that designs incorporating the more efficient RBpA and HGHpA exhibited higher fold changes than designs carrying PGKpA (Fig. 4a, Fig. S12). Increasing dox and rtTA concentration did not improve EGFP expression for the resource-aware designs (Fig. S13a, b). However, we did observe that the percentage of EGFP-expressing cells was higher when additional rtTA was present for both IRES2 and P2A designs in both cell lines (Fig. S13c). These findings point to the need to find a balance between the desired output and resource footprint, tailored to the specific application.
Our redesigned TET-ON stands as a proof-of-concept that genetic circuits for gene expression control can be designed to impose a lower demand on cellular resources while at the same time displaying improved fold change. Future work could focus on developing circuits with higher maximum output if desired for the application.
Finally, we confirmed that our findings are independent of the type of gene in use. This is highly relevant as biotherapeutic proteins often require the co-expression of multiple cassettes (i.e. antibodies, VLPs, enzymatic complexes), which will inevitably compete for cellular resources. We chose EBNA1, a viral protein often adopted in bioprocessing and gene therapy to improve transient transfection37,38,39,40,41,42. Based on data in Fig. 3a, we designed eight EBNA1-expressing constructs and co-transfected them with the capacity monitor, working as a proxy for any biotherapeutic of interest. Although we observed no correlation between the EBNA1 and EGFP expression levels of the selected designs, this is in accordance with previous work outlining lack of modularity when the genetic context is changed43. However, the expression levels of the capacity monitor did show correlation when in competition with either EGFP or EBNA1 (Fig. S14a-b). This indicates that the capacity monitor output is a good predictor of the resource footprint of a specific design independently of the gene of use. Five of the eight shortlisted designs were selected to yield similar EBNA1 output while displaying different resource footprints (Fig. 4b, top panels). In accordance with our previous characterisation (Fig. 3a–d), designs carrying strong promoters resulted in a higher resource footprint than designs with weaker promoters, with 13% remaining capacity for TETp, 53% for CMVp, and 68-85% for UBp and hACTBp designs (Fig. 4b, top panels). This is in line with the key findings we inferred from Fig. 3a–c. In addition, four designs bearing CMVp-Kz1 and one of four polyA configurations were selected to yield different EBNA1 outputs while maintaining the same resource footprint (Fig. 4b, bottom panels). In line with previous findings (Fig. 2c, Fig. S5), the change of the polyA to more favourable configurations allowed for higher EBNA1 production without increasing the resource footprint.
Discussion
Robust and predictable heterologous gene expression is critical to foundational and applied biology. Limiting intracellular resources represent a bottleneck to predictable performance when two or more heterologous cassettes are co-expressed in mammalian cells. Here, we present a framework for quantifying the resource load of construct libraries for expression in mammalian cells. By screening a library of synthetic constructs bearing different promoters, Kozaks and polyAs, we show that the design of genetic circuits directly affects their resource footprint. We characterise widely adopted parts for multi-cistronic expression, such as IRES and 2A peptides, and verify that these parts are crucial in minimising the resource footprint of multi-gene constructs. We apply our findings to the optimisation of genetic circuit performance (i.e. the TET-ON system), and we demonstrate that our framework remains relevant when applied to other proteins of interest for bioproduction and biotherapy, such as EBNA1. Specifically, we carefully selected genetic parts within the library (i.e. promoters, Kozaks, polyAs) to achieve (i) designs with minimised resource load on similar EBNA1 expression, (ii) designs with increased EBNA1 expression on similar resource load, (iii) designs with increased EBNA1 expression and minimised resource load. Our applications demonstrate the broad applicability of our findings and make this work relevant to anyone working with co-expressed systems in mammalian cells.
Although previous work used a transient capacity monitor to visualise resource competition in mammalian cells15,16, a direct link between construct design and resource footprint was not demonstrated. Our work gives the scientific community insights into resource allocation in mammalian cells. While earlier work showed that resources are limiting in mammalian cells, contrasting evidence was provided on the role of transcriptional and translational resources15,16. By studying circuit components directly linked to transcription or translation, we show that promoters impact resources the most compared to Kozak sequences. This contrasts with what is reported for bacteria, where translational resources were recognised as the primary source of gene expression burden5. Our work also highlights the importance of polyAs in defining the resource landscape of synthetic constructs and points to the need for a deeper understanding of the contribution of different expression resources in different genetic backgrounds. Future work investigating these effects in a wider panel of cell lines could provide a comprehensive analysis of cell line-specific effects. While we report initial evidence that competition arises also between transient and integrated systems, further work will also be needed to expand on this and provide characterisation and analysis of how our findings translate to integrated systems. This will be of invaluable importance considering the wide adoption of stably integrated constructs in mammalian cell engineering and its applications.
Mitigation strategies have been proposed to address the shortcomings caused by resource competition in mammalian cells15,16,44,45,46,47. One such example is the use of incoherent feed-forward loops (iFFLs). Although iFFLs have been implemented to mitigate unwanted coupling between competing cassettes, they come at the cost of dramatic output suppression of the regulated gene. This limits the implementation of such network motif, as it is unsuitable in settings where output maximisation is sought. By implementing our framework, coupling effects can be mitigated without engineering complex circuits and avoiding suppression of the protein expression levels.
Differently from bacteria, mammalian cells possess compartments and complex regulatory machinery, thus creating a more complex resource allocation scenario than the simple transcription versus translation contribution. Approaches to assess and investigate competition for different types of resources, such as the host-aware framework demonstrated here, provide a way to manage this complexity to enable diverse applications in mammalian synthetic biology.
Ultimately, we provide direct evidence that different synthetic designs can be characterised based on the resource load they impose on mammalian cells. By focusing on genetic parts that make up any functional mammalian gene expression system (i.e. promoters, Kozaks and polyAs), we demonstrate that our framework can be used to optimise co-expression systems, whether targeted towards fundamental or applied biological research.
Methods
Bacterial culture and modular cloning
E. coli DH5a were used for routine cloning. A modular plasmid architecture was first cloned using the EMMA kit35 and was then used to generate the construct library. A complete list of plasmids constructed in this study can be found in Supplementary data file 1. Parts and sequences used in this paper can be found in Supplementary data file 2. Primers used in this study can be found in Supplementary data file 5.
Cell culture
HEK293T (ATCC-CRL-3216) were cultured in DMEM Glutamax (Thermo Fisher 31966047) supplemented with 10% FBS (Life Technologies, 16000-044). CHO-K1 were cultured in MEM α (Sigma Aldrich, M4526) supplemented with 10% FBS, 1% non-essential amino acids (Thermo Fisher, 11140050), and 1% L-glutamine (Thermo Fisher, 25030081). Both cell lines were cultured at 37 °C and 5% CO2.
DNA transfections
Transfections were performed in a 24-well plate format. Cells were seeded approximately 24 h before transfection in complete media. HEK293T and CHO-K1 were transfected using xTREME GENE HP DNA (Merck, 6366236001) and TransIT-X2 (Mirus, MIR 6003) as transfection reagents, respectively. The number of transfected plasmids was normalised by molar ratio (further information on the transfection protocol can be found in Supplementary data file 3). Cells were detached and prepared for flow cytometry analysis two days after transfection. For rtTA-based induction experiments, unless otherwise specified, 1 ng/µL of doxycycline hyclate (Sigma Aldrich, D9891-25G) was added to the media upon transfection.
RNA production and transfections
RNAs were transcribed, capped and polyadenylated in vitro using the HiScribe® T7 ARCA mRNA Kit (NEB, E2060S) following the manufacturer’s instructions. RNAs were then purified from the in vitro reaction using RNA Clean & Concentrator-25 (Zymo Research, R1017) as per the manufacturer’s instructions. The purified RNAs were then co-transfected in HEK293T using TransIT®-mRNA Transfection Kit (Mirus, MIR 2225). 0.25ug of test and monitor RNAs (i.e. 0.5ug in total) were transfected per well (further information on transfection protocol can be found in Supplementary data file 3). Cells were detached using Trypsin (Thermo Fisher, 15400054) and prepared for flow cytometry analysis two days after transfection.
Flow cytometry and data analysis
Transfected cells were detached, centrifuged, resuspended in DPBS (Thermo Fisher 14190250), and filtered to disrupt any cell clumps. HEK293T and CHO-K1 were analysed with the Attune NxT flow cytometer (Thermo Fisher). For each sample, 10000 viable cells were recorded. Transfected cells were selected using an OR gate approach with fuzzy logic (see Supplementary Note 1). All data points lower than Q1-1.5*IQR and higher than Q3 + 1.5*IQR- where IQR stands for the interquartile range—were considered outliers and excluded from further analysis. Mann–Whitney test was used to assess statistical significance. The code used for data analysis is freely available at https://github.com/sfurini/gating_ResourceAwareDesign.git. Capacity was calculated as a percentage of the total capacity monitor expression measured upon co-transfection of the pEmpty plasmid (i.e. in the absence of competition). Error is expressed as ± % and is calculated from standard deviation values.
Statistics and reproducibility
No statistical method was used to predetermine sample size. The Investigators were not blinded to allocation during experiments and outcome assessment. All data points lower than Q1-1.5*IQR and higher than Q3-1.5*IQR-where IQR stands for interquartile range-where considered outliers and excluded from further analysis. Number of repeats for each sample are reported in Supplementary data file 3. Exact P-values are reported in Supplementary data file 4.
Western Blot
Transfected HEK293T cells were detached, centrifuged, and washed in ice-cold DPBS. Cell pellets were lysed in RIPA buffer (Sigma) containing 1% v/v protease inhibitor cocktail (Sigma). The protein concentration of clarified lysates was determined using a BCA assay (Thermo Fisher). Lysates were denatured, and 20 μg were loaded into SDS PAGE gels. Proteins were transferred onto nitrocellulose membranes (Thermo Fisher) and probed for EBNA1 (Merck, clone 1H4, monoclonal, cat MABF2800, dilution 1:1000) and a loading control, Vinculin (Sigma, clone hVIN-1, monoclonal, cat V9264, dilution 1:5000). Secondary antibodies (Life Technologies, Goat anti-Rat IgG A18868, polyclonal, and Abcam Rabbit Anti-Mouse IgG H&L (Alkaline Phosphatase) ab6729, dilutions 1:10000) were detected using the Novex™ AP Chromogenic Substrate (Invitrogen). Image J48 was used for densitometry analysis. To obtain a normalisation factor specific to each lane, the Vinculin signal for each lane was normalised against the lane with the strongest signal. This was then multiplied by the EBNA1 signal of the same lane to get a normalised EBNA1 experimental signal. Finally, to control for inter-blot variability, normalised EBNA1 values were divided once more against the average normalised signal of each lane present within the blot (except the negative control).
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
All source data for the data sets and figures presented in the manuscript are available at “Roberto, Di Blasi; Mara, Pisani; Tedeschi, Fabiana; Marbiah, Masue; Polizzi, Karen Marie; Furini, Simone; et al. (2023): Resource-aware construct design in mammalian cells. Figshare. Dataset. https://doi.org/10.6084/m9.figshare.22659211” and the provided Source Data file. Constructs are available upon request to the corresponding author. Plasmid maps are available in the source data. Source data are provided with this paper.
Code availability
The code used for data analysis is freely available at https://github.com/sfurini/gating_ResourceAwareDesign.git and https://doi.org/10.5281/zenodo.7956982.
References
Alice Boo, T. E. & Stan, G. B. Host-aware synthetic biology. Curr. Opin. Syst. Biol. 14, 66–72 (2019).
Borkowski, O., Ceroni, F., Stan, G. B. & Ellis, T. Overloaded and stressed: whole-cell considerations for bacterial synthetic biology. Curr. Opin. Microbiol 33, 123–130 (2016).
Tan, C., Marguet, P. & You, L. Emergent bistability by a growth-modulating positive feedback circuit. Nat. Chem. Biol. 5, 842–848 (2009).
Gyorgy, A. et al. Isocost lines describe the cellular economy of genetic circuits. Biophys. J. 109, 639–646 (2015).
Ceroni, F., Algar, R., Stan, G. B. & Ellis, T. Quantifying cellular capacity identifies gene expression designs with reduced burden. Nat. Methods 12, 415 (2015).
Ceroni, F. et al. Burden-driven feedback control of gene expression. Nat. Methods 15, 387 (2018).
Alice Grob, R. D. B. & Ceroni, Francesca Experimental tools to reduce the burden of bacterial synthetic biology. Curr. Opin. Syst. Biol. 28, 100393 (2021).
Gorochowski, T. E., Avcilar-Kucukgoze, I., Bovenberg, R. A., Roubos, J. A. & Ignatova, Z. A minimal model of ribosome allocation dynamics captures trade-offs in expression between endogenous and synthetic genes. Acs Synth. Biol. 5, 710–720 (2016).
Darlington, A. P. S., Kim, J., Jimenez, J. I. & Bates, D. G. Engineering translational resource allocation controllers: mechanistic models, design guidelines, and potential biological implementations. ACS Synth. Biol. 7, 2485–2496 (2018).
Carlson, E. D. et al. Engineered ribosomes with tethered subunits for expanding biological function. Nat. Commun. 10, 3920 (2019).
Rugbjerg, P., Sarup-Lytzen, K., Nagy, M. & Sommer, M. O. A. Synthetic addiction extends the productive life time of engineered Escherichia coli populations. Proc. Natl Acad. Sci. USA 115, 2347–2352 (2018).
Huang, H. H., Qian, Y. & Del Vecchio, D. A quasi-integral controller for adaptation of genetic modules to variable ribosome demand. Nat. Commun. 9, 5415 (2018).
Lastiri-Pancardo, G., Mercado-Hernandez, J. S., Kim, J., Jimenez, J. I. & Utrilla, J. A quantitative method for proteome reallocation using minimal regulatory interventions. Nat. Chem. Biol. 16, 1026–1033 (2020).
Pasini, M. et al. Using promoter libraries to reduce metabolic burden due to plasmid-encoded proteins in recombinant Escherichia coli. N. Biotechnol. 33, 78–90 (2016).
Frei, T. et al. Characterization and mitigation of gene expression burden in mammalian cells. Nat. Commun. 11, 4641 (2020).
Jones, R. D. et al. An endoribonuclease-based feedforward controller for decoupling resource-limited genetic modules in mammalian cells. Nat. Commun. 11, 5690 (2020).
Di Blasi, R., Marbiah, M. M., Siciliano, V., Polizzi, K. & Ceroni, F. A call for caution in analysing mammalian co-transfection experiments and implications of resource competition in data misinterpretation. Nat. Commun. 12, 2545 (2021).
Cella, F. et al. MIRELLA: a mathematical model explains the effect of microRNA-mediated synthetic genes regulation on intracellular resource allocation. Nucleic Acids Res. https://doi.org/10.1093/nar/gkad151 (2023).
Qin, C. et al. Precise programming of multigene expression stoichiometry in mammalian cells by a modular and programmable transcriptional system. Nat. Commun. 14, 1500 (2023).
Huliak, I., Sike, A., Zencir, S. & Boros, I. M. The objectivity of reporters: interference between physically unlinked promoters affects reporter gene expression in transient transfection experiments. DNA Cell Biol. 31, 1580–1584 (2012).
Farr, A. & Roman, A. A pitfall of using a second plasmid to determine transfection efficiency. Nucleic Acids Res. 20, 920 (1992).
Loew, R., Heinz, N., Hampf, M., Bujard, H. & Gossen, M. Improved Tet-responsive promoters with minimized background expression. BMC Biotechnol. 10, 81 (2010).
Qin, J. Y. et al. Systematic comparison of constitutive promoters and the doxycycline-inducible promoter. PLoS One 5, e10611 (2010).
Passmore, L. A. & Coller, J. Roles of mRNA poly(A) tails in regulation of eukaryotic gene expression. Nat. Rev. Mol. Cell Biol. https://doi.org/10.1038/s41580-021-00417-y (2021).
Lewis, J. D., Gunderson, S. I. & Mattaj, I. W. The influence of 5' and 3' end structures on pre-mRNA metabolism. J. Cell Sci. Suppl. 19, 13–19 (1995).
Wang, X. Y. et al. Enhanced transgene expression by optimization of poly A in transfected CHO cells. Front. Bioeng. Biotechnol. 10, 722722 (2022).
Fonseca, J. P. et al. A toolkit for rapid modular construction of biological circuits in mammalian cells. ACS Synth. Biol. 8, 2593–2606 (2019).
Lou, C., Stanton, B., Chen, Y. J., Munsky, B. & Voigt, C. A. Ribozyme-based insulator parts buffer synthetic circuits from genetic context. Nat. Biotechnol. 30, 1137–1142 (2012).
Noderer, W. L. et al. Quantitative analysis of mammalian translation initiation sites by FACS-seq. Mol. Syst. Biol. 10, 748 (2014).
Blanco, N. et al. Tailoring translational strength using Kozak sequence variants improves bispecific antibody assembly and reduces product-related impurities in CHO cells. Biotechnol. Bioeng. 117, 1946–1960 (2020).
Orelle, C. et al. Protein synthesis by ribosomes with tethered subunits. Nature 524, 119–124 (2015).
Sarvari, P., Ingram, D. & Stan, G. B. A modelling framework linking resource-based stochastic translation to the optimal Design of synthetic constructs. Biology (Basel) https://doi.org/10.3390/biology10010037 (2021).
Weisse, A. Y., Oyarzun, D. A., Danos, V. & Swain, P. S. Mechanistic links between cellular trade-offs, gene expression, and growth. Proc. Natl Acad. Sci. USA 112, E1038–E1047 (2015).
Nikolados, E. M., Weisse, A. Y., Ceroni, F. & Oyarzun, D. A. Growth defects and loss-of-function in synthetic gene circuits. ACS Synth. Biol. 8, 1231–1240 (2019).
Martella, A., Matjusaitis, M., Auxillos, J., Pollard, S. M. & Cai, Y. Z. EMMA: An extensible mammalian modular assembly toolkit for the rapid design and production of diverse expression vectors. ACS Synth. Biol. 6, 1380–1392 (2017).
Johnstone, C. P. & Galloway, A. K. E. Supercoiling-mediated feedback rapidly couples and tunes transcription. bioRxiv https://doi.org/10.1101/2022.04.20.488937 (2022).
Black, J. & Vos, J. M. Establishment of an oriP/EBNA1-based episomal vector transcribing human genomic beta-globin in cultured murine fibroblasts. Gene Ther. 9, 1447–1454 (2002).
Budge, J. D., Young, R. J. & Smales, C. M. Engineering of Chinese hamster ovary cells with NDPK-A to enhance DNA nuclear delivery combined with EBNA1 plasmid maintenance gives improved exogenous transient reporter, mAb and SARS-CoV-2 spike protein expression. Front. Bioeng. Biotechnol. 9, 679448 (2021).
Tom, R., Bisson, L. & Durocher, Y. Culture of HEK293-EBNA1 cells for production of recombinant proteins. CSH Protoc. https://doi.org/10.1101/pdb.prot4976 (2008).
Kiyosue, K. & Miwa, Y. Epstein-Barr virus-derived vector suitable for long-term expression in neurons. Heliyon 6, e03504 (2020).
Ren, C. et al. Establishment and applications of epstein-barr virus-based episomal vectors in human embryonic. Stem Cells 24, 1338–1347 (2006).
Nassor, F. et al. Long term gene expression in human induced pluripotent stem cells and cerebral organoids to model a neurodegenerative disease. Front. Cell Neurosci. 14, 14 (2020).
Shakiba, N., Jones, R. D., Weiss, R. & Del Vecchio, D. Context-aware synthetic biology by controller design: Engineering the mammalian cell. Cell Syst. 12, 561–592 (2021).
Du, Z. M. et al. Non-invasive UPR monitoring system and its applications in CHO production cultures. Biotechnol. Bioeng. 110, 2184–2194 (2013).
Roy, G. et al. Development of a fluorescent reporter system for monitoring ER stress in Chinese hamster ovary cells and its application for therapeutic protein production. Plos One 12, e0183694 (2017). ARTN.
Donaldson, J. S., Dale, M. P. & Rosser, S. J. Decoupling growth and protein production in CHO cells: a targeted approach. Front. Bioeng. Biotechnol. 9, 658325 (2021).
Kol, S. et al. Multiplex secretome engineering enhances recombinant protein production and purity. Nat. Commun. 11, 1908 (2020).
Rasband, W. S. ImageJ. U. S. National Institutes of Health. https://imagej.nih.gov/ij/ (2018).
Acknowledgements
The authors thank Tom Ellis and Cleo Kontoravdi for their support, thoughts and advice during the project. R.D.B. was supported by the Imperial College Chemical Engineering PhD scholarship. M.M.M. and K.P. were supported by the Biotechnology and Biological Sciences Research Council (grant BB/S006206/1). V.S. was supported by the ERC Starting Grant Synthetic-TrEX #852012.
Author information
Authors and Affiliations
Contributions
F.C., V.S. and R.D.B. designed the research; R.D.B., M.P., F.T. and M.M.M. performed experiments; R.D.B., S.F. and F.C. analysed the data; F.C., V.S., K.P. contributed funding; F.C., R.D.B. and S.F. wrote the paper; all authors read and edited the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Di Blasi, R., Pisani, M., Tedeschi, F. et al. Resource-aware construct design in mammalian cells. Nat Commun 14, 3576 (2023). https://doi.org/10.1038/s41467-023-39252-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-023-39252-4
This article is cited by
-
Development of multiplexed orthogonal base editor (MOBE) systems
Nature Biotechnology (2024)
-
Understanding resource competition to achieve predictable synthetic gene expression in eukaryotes
Nature Reviews Bioengineering (2024)
-
Competition and evolutionary selection among core regulatory motifs in gene expression control
Nature Communications (2023)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.