Optimization of culture conditions for the expression of three different insoluble proteins in Escherichia coli

Gutiérrez-González, Matías; Farías, Camila; Tello, Samantha; Pérez-Etcheverry, Diana; Romero, Alfonso; Zúñiga, Roberto; Ribeiro, Carolina H.; Lorenzo-Ferreiro, Carmen; Molina, María Carmen

doi:10.1038/s41598-019-53200-7

Download PDF

Article
Open access
Published: 14 November 2019

Optimization of culture conditions for the expression of three different insoluble proteins in Escherichia coli

Matías Gutiérrez-González^1,2,
Camila Farías¹,
Samantha Tello¹,
Diana Pérez-Etcheverry³,
Alfonso Romero¹,
Roberto Zúñiga¹,
Carolina H. Ribeiro¹,
Carmen Lorenzo-Ferreiro³ &
…
María Carmen Molina¹

Scientific Reports volume 9, Article number: 16850 (2019) Cite this article

30k Accesses
46 Citations
Metrics details

Subjects

Abstract

Recombinant protein expression for structural and therapeutic applications requires the use of systems with high expression yields. Escherichia coli is considered the workhorse for this purpose, given its fast growth rate and feasible manipulation. However, bacterial inclusion body formation remains a challenge for further protein purification. We analyzed and optimized the expression conditions for three different proteins: an anti-MICA scFv, MICA, and p19 subunit of IL-23. We used a response surface methodology based on a three-level Box-Behnken design, which included three factors: post-induction temperature, post-induction time and IPTG concentration. Comparing this information with soluble protein data in a principal component analysis revealed that insoluble and soluble proteins have different optimal conditions for post-induction temperature, post-induction time, IPTG concentration and in amino acid sequence features. Finally, we optimized the refolding conditions of the least expressed protein, anti-MICA scFv, using a fast dilution protocol with different additives, obtaining soluble and active scFv for binding assays. These results allowed us to obtain higher yields of proteins expressed in inclusion bodies. Further studies using the system proposed in this study may lead to the identification of optimal environmental factors for a given protein sequence, favoring the acceleration of bioprocess development and structural studies.

Streptomyces umbrella toxin particles block hyphal growth of competing species

Article Open access 17 April 2024

Elucidation of genes enhancing natural product biosynthesis through co-evolution analysis

Article 12 April 2024

Diverse and abundant phages exploit conjugative plasmids

Article Open access 12 April 2024

Introduction

It is well established that systems of high recombinant protein expression levels are required for structural studies and therapeutic uses. Biological expression systems that are currently used include: prokaryotic, plant-based and eukaryotic expression systems, each with well-known advantages and disadvantages^1,2,3. Among prokaryotic expression systems, Escherichia coli remains the workhorse for several applications, given its fast growth, high densities achieved and feasible manipulation¹. However, not all proteins are efficiently produced in this system, as low solubility of the target protein and subsequent inclusion bodies (IB) formation may restrict its successful application³. Several strategies have been developed to overcome this undesirable limitation, which target environmental parameters, such as culture temperature or inducer concentration, as well as intrinsic protein variables, such as relative codon abundance or fusion to more soluble proteins⁴. However, there is no “one size fits all” strategy a priori to obtain an active, soluble protein and, as a consequence, empirical observations for each protein is needed, which can be both costly and time consuming.

In some situations, the accumulation of recombinant protein in IBs is unavoidable, and it represents a challenging condition when recombinant proteins are needed in a fast and reliable fashion. The technical procedures to obtain soluble and active proteins from IBs are labor intensive and require a combination of rational and empirical knowledge. In this sense, a valuable approach would be to increase the yield of recombinant protein in this state, as IBs can, in fact, protect the recombinant protein from proteolytic degradation and prevent the bacteria from recombinant protein toxicity. With several batches of correctly stored IBs, a researcher can explore some alternatives to obtain a final, soluble protein preparation⁵.

Bioprocess improvement can be achieved by changing one factor at a time (OFAT). However, although attractively simple, this is a limited methodology, given the complex nature of the determinants of protein expression, solubility and folding. In this scenario, OFAT is not the most efficient approach to obtain information on the operation space, as changing one input can have unexpected effects on the outcomes of other, unrelated, variables⁶. The Design of Experiments (DoE) methodology is a more appropriate approach, as it requires less resources and systematizes interaction discovery. Importantly, there are several DoE settings, each with its own advantages and disadvantages. Three-level Box-Behnken methods are a type of incomplete factorial designs, with slightly more efficiency than Central Composite Designs and much more effective than full factorial designs⁷. The application of this methodology results in less experiments aiming to obtain the coefficients for the estimated model.

MHC class I chain–related protein A (MICA) is a transmembrane protein expressed as a result of cellular stress. NKG2D receptor, present on the surface of natural killer and cytotoxic cells, can recognize MICA and trigger target cell lysis. However, tumor cells can escape this immunosurveillance mechanism by expressing a soluble form of MICA, which downregulates NKG2D expression on effector cells. Moreover, it has been observed that high serum levels of MICA are correlated with disease progression in a variety of human cancers⁸. This led us to develop a single chain variable antibody (scFv), isolated from a phage display library, directed against the recognition interface between MICA and NKG2D; by preventing MICA-mediated NKG2D downregulation, this scFv could potentially serve as therapy in MICA expressing cancers⁹. scFvs are composed of variable regions from heavy and light chains from immunoglobulins, and fused with a flexible linker. This protein format can be expressed in E. coli, and direct modification of its amino acidic sequence can be carried out for affinity maturation¹⁰.

IL-23 is a heterodimeric protein member of the IL-12 cytokine family, sharing with this last cytokine the p40 subunit^11,12. The p19 subunit, on the other hand, is unique to IL-23, and it is an interesting therapeutic target, as IL-23 has been linked to immune-related diseases, such as Crohn’s disease and psoriasis^{13,14,15,16,17,18}. Thus, expression of this protein at large scales is attractive for the development of new and effective treatments for these diseases.

Here, we present the analysis and optimization of the expression conditions for three different proteins, a anti-MICA scFv, MICA, and p19 subunit of IL-23, which are expressed as insoluble recombinant proteins in E. coli. We favored speed of analysis using a three-level Box-Behnken design, with post-induction temperature, post-induction time and IPTG (Isopropyl β-D-1-thiogalactopyranoside) concentration as factors, generating 15 experimental runs. The resulting models allowed us to obtain the optimum environmental variables for each protein, and to compare the behavior of these insoluble proteins with data from soluble proteins available in the literature. We further optimized protein refolding conditions, which resulted in the generation of soluble and active scFv for binding assays. We also performed a multivariate analysis of the sequence-derived features and optimal environmental variables for protein expression and compared soluble and insoluble proteins, which revealed important differences in terms of favorable environmental variables and amino acid sequence features.

Results

anti-MICA scFv, MICA and IL-23p19 are expressed as inclusion bodies in E. coli

A low yield of soluble proteins was obtained when MICA, anti-MICA scFv and IL-23p19 were expressed in E. coli, whereas proteins in inclusion bodies represented more than 90% of recombinant proteins (Fig. 1). Attempts in protocol optimization, including changes in culture temperature, IPTG concentration or induction time were unsuccessful to obtain soluble proteins. However, we could effectively purify these proteins from inclusion bodies (data not shown), and decided to optimize protein expression from this compartment. Optimization was first carried out by selecting the most appropriate operating space for each protein. For MICA and anti-MICA scFv, previous experiments showed that the operating space lies between 3–6 h (post-induction time) and 0.1–1 mM IPTG, which were used in the present designs. In the case of IL-23p19, a range of 3–5 h (post-induction time) and 0.2–1 mM IPTG was applied for protein expression.

Optimization of anti-MICA scFv, MICA and IL-23p19 production in inclusion bodies in E. coli using response surface methodology

We developed a model to determine optimal bacterial culture temperature, IPTG concentration and post-induction time using response surface methodology (RSM). This technique is useful when a response of interest, in this case protein concentration, is dependent on several independent factors¹⁹. The design matrix was generated using RcmdrPlugin.DoE in a R environment. Table 1 shows detailed information on the design matrix, including coded and actual variables for each run. For each protein, the expression values obtained were analyzed by RSM, which retrieved the following equations:

$$\begin{array}{ccc}Y[scFv(\mu g/mL)] & = & 0.179-0.022A+0.014B+0.012C-0.049{A}^{2}\\ & & -0.018{B}^{2}-0.039{C}^{2}\end{array}$$

(1)

$$Y[MICA(\mu g/mL)]=0.315+0.145A+0.087B-0.068C$$

(2)

$$\begin{array}{ccc}Y[IL23-p19(\mu g/mL)] & = & 2.5+1.125A-0.750B+0.025C-0.375AB\\ & & -0.25AC-0.175BC\,+\,0.762{A}^{2}-0.588{B}^{2}-0.188{C}^{2}\end{array}$$

(3)

where Y is the response variable (protein in µg/mL, in eluate), A refers to post-induction temperature, B refers to post-induction time, and C represents IPTG concentration.

Table 1 Box-Behnken design.

Full size table

The yield for each protein varied significantly, as displayed in Table 2, with IL-23p19 showing the greatest protein concentration. More importantly, every protein data point was fitted with different models and particular optimal conditions could be determined. The model obtained for scFv expression (r² = 0.7524, p = 0.036) shows that the most significant variables were post-induction temperature and IPTG concentration, both in their quadratic forms. In the case of MICA, a first order model (r² = 0.8262, p = 1.72 × 10⁻⁴) was selected, in which the three variables showed a high impact in the observed yields. Finally, in the case of IL-23p19, a complete second order model (r² = 0.9773, p = 1.36 × 10⁻³) indicated that the most significant variables were temperature and time, both in first order and quadratic forms. All fitted models are shown in Fig. 2. Based on the normal Q-Q plot shown in Fig. S1, we concluded that the residuals are normally distributed in the three models generated.

Table 2 Experimental runs for Box-Behnken design.

Full size table

Multivariate analysis of soluble and insoluble proteins

Since different optimal expression conditions for each protein tested could be detected using our model, we decided to test whether there was any relationship between protein solubility and favored environmental factors by performing a principal component analysis (PCA). To this end, a literature search was carried out in order to select those publications in which the post-induction temperature, post-induction time and IPTG concentration were regarded as the optimal variables for bacterial culture, and whose numerical values for protein concentration were provided. We were able to select 10 reports (Table 3), which described different conditions for protein optimization. Interestingly, insoluble proteins from our work clustered together when plotting the first two principal components (78.5% of the total variance) (Fig. 3a). Furthermore, insoluble proteins were correlated with higher temperatures and lower expression times (Fig. 3b).

Table 3 Literature search of proteins optimized by a DoE approach.

Full size table

Besides environmental variables, we retrieved the protein sequence reported in the literature and constructed FASTA sequences. With this information, we derived 33 protein features normally used to predict protein solubility in E. coli, using a Biopython script^20,21. We performed a second PCA with this information, observing again a separation of soluble and insoluble proteins into two different clusters (Fig. 3c). In this case, the protein NS31b, a protease domain from hepatitis C virus NS3 protein, was also clustered with insoluble proteins from our work²². In the PCA, components PC1 and PC2 explained 53,3% of the total variance, and the most important variables included amino acids content (K and E), and composite amino acids content K − R (KmR), D + E (DpE), K + R + D + E (PpN).

A correlation analysis between environmental variables and sequence-derived features revealed that there are a significant (p < 0.05), positive correlation between IPTG levels and cysteine, proline and the composite amino acid content K + R − D − E in the whole dataset (Fig. S2). No further significant correlations were found between these two types of variables.

Optimization of refolding conditions

Although the DoE strategy adopted by us revealed the optimal conditions for protein expression, we still faced the challenge of a limited expression of anti-MICA scFv, which we attributed to the in-column refolding protocol (see materials and methods). Therefore, we decided to use a fast-dilution protocol, which provides the possibility to assess several buffers and additives. Refolding conditions were optimized using small-scale refolding assays in 96-well plate format. The chosen screening conditions were based on the literature and restricted to most probable positive conditions²³. The success of refolding was tested by analyzing the soluble fraction of the protein by size-exclusion chromatography, as it retrieves information on the amount of protein and its aggregation state. As shown in Fig. 4a, the use of arginine and the redox pair GSH/GSSG (reduced gluthatione and oxidized gluthatione) resulted in the greatest increase in soluble protein concentration. Of note, all conditions with the redox pair showed a higher concentration of total protein, consistent with the presence of disulfide bonds in most scFvs²⁴. Importantly, these conditions allowed us to obtain 1.1 mg/mL of anti-MICA scFv. The conditions were repeated with MICA, and the functional activity of the protein pairs was analyzed by ELISA. Our results showed that scFv, refolded in NaCl, arginine and glycerol (with redox pairs), was able to specifically detect MICA (Fig. 4b).

Discussion

In this work, we employed a design of experimental methodology to optimize the production of recombinant anti-MICA scFv, MICA and IL-23p19 in E. coli. As expected, we found important yield differences at different environmental variables, with distinctive optimal conditions for each protein. In addition, for each protein studied, environmental variables had different effects on the generated models, reflecting some intrinsic protein properties that can affect protein biosynthesis and/or folding. Moreover, multivariate analysis showed that insoluble proteins were markedly different at the preferred environmental conditions and amino acid content from a set of soluble proteins described in the literature. Finally, we showed that a simple refolding experiment can be successfully applied to obtain the best conditions for full-scale protein refolding.

Obtaining high yields of soluble protein can be challenging, as the majority of proteins tend to aggregate as inclusion bodies²⁵. This can delay a successful expression experiment, given the rather complex steps needed to obtain native protein in this scenario. However, inclusion bodies have several advantages, such as high levels of protein expression and purity. Optimizing the expression of recombinant proteins in this compartment, as we show in the present work, is relatively uncommon, although it can be a very promising strategy. The three-level Box-Behnken design is relatively easy to run with standard laboratory equipment. After a three-day experiment, including protein expression, inclusion bodies purification and solubilization, purification and refolding, a researcher can have complete working space and obtain important insights to develop a more robust and productive expression system. The yields obtained for each protein varied significantly, as expected; however, more importantly, they varied between each run. These differences should be interpreted as room for improvement, and to the fact that obtaining protein in inclusion bodies can be fine-tuned by different environmental variables.

Although there are other factors that can be optimized, including OD at induction, pre-induction time, the use of different plasmid constructions, the use of different hosts and codon optimization, we favored the speed of our analysis using factors widely reported in the literature^{22,26,27,28,29}. Our main objective was to quickly find the most appropriate conditions for protein expression in order to perform further biological assays, and not to force the expression of the protein in the soluble compartment, which is not always successful and can significantly delay its production.

Multivariate analysis showed that soluble and insoluble proteins are distinct in their optimal environmental features and in their sequences. The expression of insoluble proteins correlated with higher temperatures and lower amount of time, probably reflecting the need to rapidly accumulate protein in inclusion bodies and avoid bacterial damage²⁹. In the case of sequence-derived features of recombinant proteins, those that accounted for most variance were represented by the amino acid content (lysine, glutamic acid) and composites (Lysine minus arginine, DpE: Aspartic acid plus glutamic acid. PpN: Lyisine plus arginine plus aspartic acid plus glutamic acid). We expected more contributions from predicted features, such as length and absolute charge, which are regarded as the most informative characteristics for predicting protein solubility²¹. Interestingly, the protease NS3 appeared inside the concentration ellipse of insoluble proteins. Proteases tend to accumulate in inclusion bodies due to their toxicity to the host, which could explain this finding³⁰. However, NS3 showed markedly different optimal environmental variables, suggesting that its optimal expression conditions cannot be predicted only from sequence features.

Our protocol for protein refolding was adapted from commercially available high-throughput methods. We favored a fast protocol, in which relevant conditions for protein refolding were assayed. In this respect, the most studied variables in the culture medium are NaCl concentration, since it can stabilize proteins by either hydration or exclusion of water³¹; the presence of arginine, which aids the refolding process, although its exact mechanism is still unclear³²; and glycerol, which enhances hydrophobic interactions by ordering the solvent around the protein³². The use of the redox pair GSH/GSSG was considered as the scFv structures tend to have disulfide bonds³³. We tested our protocol in scFv, since it is the most complex structure and showed the lowest expression yields between the three proteins studied. As expected, adding GSH/GSSG dramatically increased the yield of soluble protein, which was further enhanced by the addition of arginine. This experiment took only two days to complete, but offered invaluable information for further scaling the expression of scFv. More importantly, all conditions with additive/redox pair resulted in a similar ability to bind MICA, thus complementing our refolding chromatographic assay with a functional assay.

The use of statistical approaches for insoluble protein expression is a powerful strategy to improve the yield of recombinant protein. We successfully developed models to predict protein yields from inclusion bodies, which revealed that each protein has different requirements for the environmental variables tested. Moreover, the proteins tested in this work, which were expressed in inclusion bodies, showed a differed behavior in terms of sequence features and environmental variables for optimal expression, as compared with soluble proteins. Further protein processing, including refolding, was effectively applied to the most difficult-to-express protein in our set, using a simple approach with commonly used additives. The use of a redox pair is highlighted as a necessary strategy when disulfide bonds are suspected to be in the protein structure, as it dramatically improves the yield of soluble protein. Further work in insoluble proteins may unveil whether the behavior observed for this set of proteins is replicable, and potentially reveal optimal environmental factors for a given protein sequence, which will accelerate bioprocesses development and structural studies.

Materials and Methods

Design of experiments

A Box-Behnken design was generated in R³⁴, using RcmdrPlugin.DoE package³⁵. Three factors, which included temperature after induction, time after induction and IPTG concentration, with three levels each, were considered, accounting for 15 sets of experiments. This design was used to model the expression of anti-MICA scFv, MICA, and IL-23p19. Table 1 shows the coded and actual variables derived from the Box-Behnken design.

Protein production

Proteins were expressed in BL21(DE3) bacteria as a HisTag fusion proteins. MICA was expressed as a truncated form using the extracellular domains α1 and α2³⁶. Starting cultures were generated from pipette tip punctured glycerol stocks in 50 mL of 2xYT (BD Biosciences, USA) supplemented with corresponding antibiotics (ampicillin/kanamycin 50 µg/mL) (United States Biological). This bacterial culture was grown overnight at 37 °C under shaking (200 rpm). Next, 1 mL of starting culture was added to 200 mL of 2xYT/ampicillin/kanamycin and grown to mid-log phase (OD600 = 0.6). The bacterial suspension was then separated into 15 subcultures and protein expression was induced using IPTG. After this point, cultures were separated into their respective experimental runs. After protein expression, cell cultures were harvested by centrifugation, washed with ice-cold PBS, and resuspended in lysis buffer (25 mM Tris at pH 8.0, 100 mM NaCl, 5 mM imidazole, 1% Triton X-100, lysozyme plus protease inhibitors). This bacterial paste was stored at −80 °C until further processing. Bacterial lysis was carried out by sonication of thawed bacterial paste, on ice, with 8 cycles of 20 seconds and 40 resting seconds. Sonicated samples were centrifuged and the pellet (inclusion bodies) were harvested, washed once with washing buffer (50 mM Tris, 100 mM NaCl, 1% Triton X-100, 0.1% sodium deoxycholate (DOC), pH 8.0), and treated with denaturation buffer (50 mM Tris, 500 mM NaCl, 6 M guanidine hydrochloride, 5 mM imidazole, pH 8.0) overnight at 4 °C. In the case of IL-23p19, inclusion bodies were firstly incubated in washing buffer (50 mM Tris, 100 mM NaCl, 1% Triton X-100, pH 8.0) during 30 min at 37 °C and centrifuged. The resulting insoluble fraction was washed three times with the same buffer without Triton X-100 and treated with the anionic detergent N-Lauroylsarcosine (Thermo Fisher Scientific, USA) as denaturant agent (50 mM Tris, 500 mM NaCl, 1% N-Lauroylsarcosine, 5 mM imidazole, pH 8.0) for 1 h at 37 °C. Resuspended inclusion bodies were centrifuged; the supernatants were collected, filtrated through a 0.22 µm syringe filter unit (Advantec MFS, Japan), and loaded into a pre-equilibrated Ni-NTA matrix (Thermo Fisher Scientific, USA).

The purification process differed for each protein. For MICA, purification was carried out in denaturing conditions, washing with matrix denaturing washing buffer (50 mM Tris, 500 mM NaCl, 5 mM imidazole, 6 M guanidine hydrochloride, pH 8.0) and eluting with 6 column volumes of elution buffer (50 mM Tris, 500 mM NaCl, 300 mM imidazole, 6 M guanidine hydrochloride, pH 8.0). scFv was refolded in-column by serial exchange of matrix denaturing washing buffer to matrix washing buffer (50 mM Tris, 500 mM NaCl, 5 mM imidazole, pH 8.0). Refolded protein was eluted with 6 column volumes of elution buffer (50 mM Tris, 500 mM NaCl, 300 mM imidazole, pH 8.0). Finally, in the case of IL23-p19, purification was carried out in the presence of 0.2% N-Lauroylsarcosine. The concentration of detergent was brought to 0.2% by slow dilution of the sample and then applied to the matrix equilibrated in the same condition. Washing was performed with buffer containing 0.2% N-Lauroylsarcosine and 20 mM imidazole, followed by serial exchange of buffer until total dilution of N-Lauroylsarcosine. Elution was performed with 6 column volumes of elution buffer (50 mM Tris, 500 mM NaCl, 600 mM imidazole, pH 8.0).

Protein expression was assessed in bacterial cultures by SDS-PAGE. Each step of protein expression and purification was loaded into 10 % acrylamide gels. In the case of MICA and anti-MICA scFv, identity of the protein was confirmed by western blot, using a mouse anti-HisTag antibody, followed by anti-mouse IgG-HRP detection. Identity of IL23-p19 was confirmed by mass spectrometry using a MALDI-TOF/TOF 4800 Analyzer (Applied Biosystems, Framingham, USA).

Protein renaturation

Denatured MICA fractions were pooled and refolded by rapid dilution in 100 mL of renaturing buffer (50 mM Tris, 500 mM NaCl, 3 mM GSH, 0.3 mM GSSG, pH 7.4) in order to achieve a protein concentration <10 µg/mL. Diluted protein was mixed overnight at room temperature. Next, the mix was filtrated through a 0.22 µm syringe filter unit and concentrated using 10 kDa Centricon centrifugal filters (Merck Milipore, Germany) to a final volume <1 mL. Renaturation was verified by size-exclusion chromatography in an Äkta Purifier FPLC with a Superdex 5/150 GL column (GE Healthcare, USA).

Model generation

Data obtained from Box-Behnken design was fitted using rsm package³⁷. The expression level of each protein was assumed to be influenced by the following independent variables: post-induction temperature, post-induction time and IPTG concentration. Model quality was assessed by r-squared values, lack-of-fit tests and normal Q-Q plots. Contour plots (conditions versus yield) were generated for each variable.

Multivariate analysis

A literature search was carried out, in PubMed, in order to find optimized expression conditions for other proteins. The inclusion criteria were: (1) Protein was expressed in E. coli, (2) Protocols that were optimized through a DoE methodology including post-induction temperature, post-induction time and IPTG concentration, (3) Protein was purified and quantified and (4) Protein sequence was readily available in the work or through references. With this information, FASTA sequences were retrieved. Thirty-five features were predicted from protein sequences using a Biopython script²⁰. Briefly, these features included amino acid content, 7 composites K − R, D − E, K + R, D + E, K + R − D − E, K + R + D + E, F + W + Y, length, pI, hydropathy, absolute charge at pH 7, fold propensity, disorder, sequence entropy, and β-strand propensity. All these features where selected from previous literature as potential indicators for protein solubility²¹. Data were stored as csv file and loaded on R³⁴ for PCA analysis, using FactoMineR package³⁸.

Data availability

All of the data analysed during this study are included in this published article (and its Supplementary Information files).

References

Rosano, G. L. & Ceccarelli, E. A. Recombinant protein expression in Escherichia coli: advances and challenges. Front Microbiol 5, 172, https://doi.org/10.3389/fmicb.2014.00172 (2014).
Article PubMed PubMed Central Google Scholar
Ahmad, M., Hirz, M., Pichler, H. & Schwab, H. Protein expression in Pichia pastoris: recent achievements and perspectives for heterologous protein production. Appl Microbiol Biotechnol 98, 5301–5317, https://doi.org/10.1007/s00253-014-5732-5 (2014).
Article CAS PubMed PubMed Central Google Scholar
Almo, S. C. & Love, J. D. Better and faster: improvements and optimization for mammalian recombinant protein production. Curr Opin Struct Biol 26, 39–43, https://doi.org/10.1016/j.sbi.2014.03.006 (2014).
Article CAS PubMed PubMed Central Google Scholar
Zhu, S. et al. A simple and effective strategy for solving the problem of inclusion bodies in recombinant protein technology: His-tag deletions enhance soluble expression. Appl Microbiol Biotechnol 97, 837–845, https://doi.org/10.1007/s00253-012-4630-y (2013).
Article CAS PubMed Google Scholar
Basu, A., Li, X. & Leong, S. S. Refolding of proteins from inclusion bodies: rational design and recipes. Appl Microbiol Biotechnol 92, 241–251, https://doi.org/10.1007/s00253-011-3513-y (2011).
Article CAS PubMed Google Scholar
Czitrom, V. One-Factor-at-a-Time versus Designed Experiments. The American Statistician 53, 126–131, https://doi.org/10.1080/00031305.1999.10474445 (1999).
Article Google Scholar
Ferreira, S. L. et al. Box-Behnken design: an alternative for the optimization of analytical methods. Anal Chim Acta 597, 179–186, https://doi.org/10.1016/j.aca.2007.07.011 (2007).
Article CAS PubMed Google Scholar
Ferrari de Andrade, L. et al. Antibody-mediated inhibition of MICA and MICB shedding promotes NK cell-driven tumor immunity. Science 359, 1537–1542, https://doi.org/10.1126/science.aao0505 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Sotelo, P. et al. An efficient method for variable region assembly in the construction of scFv phage display libraries using independent strand amplification. MAbs 4, 542–550, https://doi.org/10.4161/mabs.20653 (2012).
Article PubMed PubMed Central Google Scholar
Ahmad, Z. A. et al. scFv antibody: principles and clinical application. Clin Dev Immunol 2012, 980250, https://doi.org/10.1155/2012/980250 (2012).
Article CAS PubMed PubMed Central Google Scholar
Croxford, A. L., Mair, F. & Becher, B. IL-23: one cytokine in control of autoimmunity. Eur J Immunol 42, 2263–2273, https://doi.org/10.1002/eji.201242598 (2012).
Article CAS PubMed Google Scholar
Oppmann, B. et al. Novel p19 protein engages IL-12p40 to form a cytokine, IL-23, with biological activities similar as well as distinct from IL-12. Immunity 13, 715–725, https://doi.org/10.1016/S1074-7613(00)00070-4 (2000).
Article CAS PubMed Google Scholar
Harrington, L. E., Mangan, P. R. & Weaver, C. T. Expanding the effector CD4 T-cell repertoire: the Th17 lineage. Curr Opin Immunol 18, 349–356 (2006).
Article CAS PubMed Google Scholar
Teng, M. W. L. et al. IL-12 and IL-23 cytokines: from discovery to targeted therapies for immune-mediated inflammatory diseases. Nat Med 21, 719–729, https://doi.org/10.1038/nm.3895 (2015).
Article CAS PubMed Google Scholar
Pistoia, V. In Encyclopedia of Immunobiology 525–533 (Academic Press, 2016).
Croxford, A. L., Kulig, P. & Becher, B. IL-12-and IL-23 in health and disease. Cytokine & Growth Factor Reviews 25, 415–421, https://doi.org/10.1016/j.cytogfr.2014.07.017 (2014).
Article CAS Google Scholar
Marks, D. J., Rahman, F. Z., Sewell, G. W. & Segal, A. W. Crohn’s disease: an immune deficiency state. Clin Rev Allergy Immunol 38, 20–31, https://doi.org/10.1007/s12016-009-8133-2 (2010).
Article CAS PubMed PubMed Central Google Scholar
Casanova, J. L. & Abel, L. Revisiting Crohn’s disease as a primary immunodeficiency of macrophages. J Exp Med 206, 1839–1843, https://doi.org/10.1084/jem.20091683 (2009).
Article CAS PubMed PubMed Central Google Scholar
Montgomery, D. C. Design and analysis of experiments. (2013).
Cock, P. J. et al. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics 25, 1422–1423, https://doi.org/10.1093/bioinformatics/btp163 (2009).
Article CAS PubMed PubMed Central Google Scholar
Hebditch, M., Carballo-Amador, M. A., Charonis, S., Curtis, R. & Warwicker, J. Protein-Sol: a web tool for predicting protein solubility from sequence. Bioinformatics 33, 3098–3100, https://doi.org/10.1093/bioinformatics/btx345 (2017).
Article CAS PubMed PubMed Central Google Scholar
Swalley, S. E., Fulghum, J. R. & Chambers, S. P. Screening factors effecting a response in soluble protein expression: formalized approach using design of experiments. Anal Biochem 351, 122–127, https://doi.org/10.1016/j.ab.2005.11.046 (2006).
Article CAS PubMed Google Scholar
Yamaguchi, S., Yamamoto, E., Mannen, T., Nagamune, T. & Nagamune, T. Protein refolding using chemical refolding additives. Biotechnol J 8, 17–31, https://doi.org/10.1002/biot.201200025 (2013).
Article CAS PubMed Google Scholar
de Marco, A. Strategies for successful recombinant expression of disulfide bond-dependent proteins in Escherichia coli. Microb Cell Fact 8, 26, https://doi.org/10.1186/1475-2859-8-26 (2009).
Article CAS PubMed PubMed Central Google Scholar
Makowska-Grzyska, M. et al. Protein production for structural genomics using E. coli expression. Methods Mol Biol 1140, 89–105, https://doi.org/10.1007/978-1-4939-0354-2_7 (2014).
Article CAS PubMed PubMed Central Google Scholar
Larentis, A. L. et al. Cloning and optimization of induction conditions for mature PsaA (pneumococcal surface adhesin A) expression in Escherichia coli and recombinant protein stability during long-term storage. Protein Expr Purif 78, 38–47, https://doi.org/10.1016/j.pep.2011.02.013 (2011).
Article CAS PubMed Google Scholar
Papaneophytou, C. P. & Kontopidis, G. A. Optimization of TNF-alpha overexpression in Escherichia coli using response surface methodology: Purification of the protein and oligomerization studies. Protein Expr Purif 86, 35–44, https://doi.org/10.1016/j.pep.2012.09.002 (2012).
Article CAS PubMed Google Scholar
Papaneophytou, C. & Kontopidis, G. A comparison of statistical approaches used for the optimization of soluble protein expression in Escherichia coli. Protein Expr Purif 120, 126–137, https://doi.org/10.1016/j.pep.2015.12.014 (2016).
Article CAS PubMed Google Scholar
Tian, J. et al. Predicting synonymous codon usage and optimizing the heterologous gene for expression in E. coli. Sci Rep 7, 9926, https://doi.org/10.1038/s41598-017-10546-0 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Kwon, K. et al. Recombinant expression and functional analysis of proteases from Streptococcus pneumoniae, Bacillus anthracis, and Yersinia pestis. BMC Biochem 12, 17, https://doi.org/10.1186/1471-2091-12-17 (2011).
Article CAS PubMed PubMed Central Google Scholar
Gabrielczyk, J., Kluitmann, J., Dammeyer, T. & Jordening, H. J. Effects of ionic strength on inclusion body refolding at high concentration. Protein Expr Purif 130, 100–106, https://doi.org/10.1016/j.pep.2016.10.004 (2017).
Article CAS PubMed Google Scholar
Yamaguchi, H. & Miyazaki, M. Refolding techniques for recovering biologically active recombinant proteins from inclusion bodies. Biomolecules 4, 235–251, https://doi.org/10.3390/biom4010235 (2014).
Article CAS PubMed PubMed Central Google Scholar
Glockshuber, R., Schmidt, T. & Pluckthun, A. The disulfide bonds in antibody variable domains: effects on stability, folding in vitro, and functional expression in Escherichia coli. Biochemistry 31, 1270–1279 (1992).
Article CAS PubMed Google Scholar
R Core Team R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, www.R-project.org/ (2013).
RcmdrPlugin.DoE: R Commander Plugin for (industrial) Design of Experiments v. R package version 0.12-3 (2014).
Gavlovsky, P. J., Tonnerre, P., Guitton, C. & Charreau, B. Expression of MHC class I-related molecules MICA, HLA-E and EPCR shape endothelial cells with unique functions in innate and adaptive immunity. Hum Immunol 77, 1084–1091, https://doi.org/10.1016/j.humimm.2016.02.007 (2016).
Article CAS PubMed Google Scholar
Lenth, R. V. Response-Surface Methods in R, Using rsm. 2009 32, 17, https://doi.org/10.18637/jss.v032.i07 (2009).
Article Google Scholar
Lê, S., Josse, J. & Husson, F. FactoMineR: An R Package for Multivariate Analysis. 2008 25, 18, https://doi.org/10.18637/jss.v025.i01 (2008).
Article Google Scholar

Download references

Acknowledgements

The authors thank Bastian Jerez for technical assistance. This work was funded by FONDEF-IDEA Grant ID16I10027 (Fondo de Fomento al Desarrollo Científico y Tecnológico), CSIC Iniciación a la Investigación (Universidad de la República Ortiental del Uruguay) and Enlace FONDECYT VID ENL013/17 (Universidad de Chile). The funders had no role in study design, data collection, analysis or preparation of manuscript.

Author information

Authors and Affiliations

Centro de Inmunobiotecnología, Programa Disciplinario de Inmunología, Instituto de Ciencias Biomédicas, Facultad de Medicina, Universidad de Chile, Santiago, Chile
Matías Gutiérrez-González, Camila Farías, Samantha Tello, Alfonso Romero, Roberto Zúñiga, Carolina H. Ribeiro & María Carmen Molina
Programa de Doctorado en Farmacología, Facultad de Ciencias Químicas y Farmacéuticas, Universidad de Chile, Santiago, Chile
Matías Gutiérrez-González
Área de Biotecnología, Instituto Polo Tecnológico de Pando, Facultad de Química, Universidad de la República Oriental del Uruguay, Montevideo, Uruguay
Diana Pérez-Etcheverry & Carmen Lorenzo-Ferreiro

Authors

Matías Gutiérrez-González
View author publications
You can also search for this author in PubMed Google Scholar
Camila Farías
View author publications
You can also search for this author in PubMed Google Scholar
Samantha Tello
View author publications
You can also search for this author in PubMed Google Scholar
Diana Pérez-Etcheverry
View author publications
You can also search for this author in PubMed Google Scholar
Alfonso Romero
View author publications
You can also search for this author in PubMed Google Scholar
Roberto Zúñiga
View author publications
You can also search for this author in PubMed Google Scholar
Carolina H. Ribeiro
View author publications
You can also search for this author in PubMed Google Scholar
Carmen Lorenzo-Ferreiro
View author publications
You can also search for this author in PubMed Google Scholar
María Carmen Molina
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.G.G., C.L. and M.C.M. designed the study. M.G.G., C.F., S.T., A.R. and D.P.E. carried out the experiments. M.G.G., C.F., S.T., D.P.E., R.Z., C.H.R., C.L. and M.C.M. contributed to the interpretations of the results. M.G.G. wrote the manuscript with the assistance of C.F., S.T., D.P.E., C.H.R., C.L. and M.C.M. All authors provided critical feedback and helped shape the research, analysis and manuscript.

Corresponding author

Correspondence to María Carmen Molina.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Figures

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gutiérrez-González, M., Farías, C., Tello, S. et al. Optimization of culture conditions for the expression of three different insoluble proteins in Escherichia coli. Sci Rep 9, 16850 (2019). https://doi.org/10.1038/s41598-019-53200-7

Download citation

Received: 17 May 2019
Accepted: 12 October 2019
Published: 14 November 2019
DOI: https://doi.org/10.1038/s41598-019-53200-7

This article is cited by

Optimizing recombinant production of L-asparaginase 1 from Saccharomyces cerevisiae using response surface methodology
- Vida Ebrahimi
- Atieh Hashemi
Folia Microbiologica (2024)
Characterization of a novel amidohydrolase with promiscuous esterase activity from a soil metagenomic library and its application in degradation of amide herbicides
- Shengwei Sun
- Wanqi Chen
- Jinju Chen
Environmental Science and Pollution Research (2024)
Efficient recombinant production of mouse-derived cryptdin family peptides by a novel facilitation strategy for inclusion body formation
- Yuchi Song
- Yi Wang
- Tomoyasu Aizawa
Microbial Cell Factories (2023)
Optimization of Process Parameters for Enhanced Production of Ranibizumab in Escherichia coli
- Rucha S. Patil
- Nidhi Upadhyay
- Anurag S. Rathore
Biotechnology and Bioprocess Engineering (2023)
Machine learning modeling for solubility prediction of recombinant antibody fragment in four different E. coli strains
- Atieh Hashemi
- Majid Basafa
- Aidin Behravan
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

anti-MICA scFv, MICA and IL-23p19 are expressed as inclusion bodies in E. coli

Optimization of anti-MICA scFv, MICA and IL-23p19 production in inclusion bodies in E. coli using response surface methodology

Multivariate analysis of soluble and insoluble proteins

Optimization of refolding conditions

Discussion

Materials and Methods

Design of experiments

Protein production

Protein renaturation

Model generation

Multivariate analysis

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links