Improvement in the production of the human recombinant enzyme N-acetylgalactosamine-6-sulfatase (rhGALNS) in Escherichia coli using synthetic biology approaches

Previously, we demonstrated production of an active recombinant human N-acetylgalactosamine-6-sulfatase (rhGALNS) enzyme in Escherichia coli as a potential therapeutic alternative for mucopolysaccharidosis IVA. However, most of the rhGALNS produced was present as protein aggregates. Here, several methods were investigated to improve production and activity of rhGALNS. These methods involved the use of physiologically-regulated promoters and alternatives to improve protein folding including global stress responses (osmotic shock), overexpression of native chaperones, and enhancement of cytoplasmic disulfide bond formation. Increase of rhGALNS activity was obtained when a promoter regulated under σ s was implemented. Additionally, improvements were observed when osmotic shock was applied. Noteworthy, overexpression of chaperones did not have any effect on rhGALNS activity, suggesting that the effect of osmotic shock was probably due to a general stress response and not to the action of an individual chaperone. Finally, it was observed that high concentrations of sucrose in conjunction with the physiological-regulated promoter proU mod significantly increased the rhGALNS production and activity. Together, these results describe advances in the current knowledge on the production of human recombinant enzymes in a prokaryotic system such as E. coli, and could have a significant impact on the development of enzyme replacement therapies for lysosomal storage diseases.

The human GALNS complementary DNA is composed by 1569 bp, encoding a 522 amino acids peptide (herein known as precursor peptide). This protein undergoes several posttranslational modifications by the trafficking through the endoplasmic reticulum, Golgi apparatus and lysosome. These posttranslational modifications include the removal of the signal peptide (first 26 amino acids −3 kDa), the active-site activation by the formylglycine-generating enzyme, the addition of two N-glycosylations, and the proteolytic processing to obtain a ∼58 kDa mature enzyme formed by two polypeptides of 40 kDa and 18 kDa 8, 9 . In the case of the recombinant GALNS produced in Escherichia coli and Pichia pastoris, it was reported a similar processing to that observed for the enzyme produced in mammalian cells 9,10 .
E. coli has been extensively used as the prokaryotic model organism for production of proteins with therapeutic and industrial interest [11][12][13] . The key problem in the production of human recombinant proteins in this bacterial host is related with the lack of post-translational modifications, as glycosylations, and poor protein folding, leading to loss of enzyme activity and the formation of insoluble protein aggregates 14 .
Previously, we demonstrated the production of a recombinant human GALNS enzyme (rhGALNS) in E. coli BL21(DE3) as a potential therapeutic alternative for MPS IVA 9,15 . However, most of the produced protein was present as protein aggregates. Culture conditions were optimized, including inductor concentrations and temperature shifts, which maximized rhGALNS activity, but most of its production was present in the insoluble protein fraction 16 . Here we explored different approaches to increase the production and activity of rhGALNS in E. coli. These approaches included the use of physiologically-regulated promoters to modulate gene expression, induction of osmoprotectants as helpers in protein folding, the overexpression of chaperone proteins, the improvement in the formation of disulfide bonds, and the combination of the different approaches to explore additive effects.

Results and Discussion
Effect of physiologically-regulated promoters on rhGALNS activity. There are several factors contributing to the formation of inclusion bodies in prokaryotic systems at the transcriptional level, which are generally attributed to a high transcriptional rate 17 and transcription leakiness due to poor promoter repression 18 . The staple prokaryotic promoter used in recombinant protein production in E. coli is the lac promoter, which has been widely utilized due to its ability to be straightforwardly controlled with isopropyl β-D-1-thiogalactopyranoside (IPTG) and tightly repressed via lacIq 19 . Other promoters commonly used in recombinant protein production are thermal or chemically induced 20 . Previous studies demonstrated the production of active rhGALNS in E. coli BL21(DE3) using the tac promoter, a synthetic promoter created from the combination of promoters from the trp and lac operons, which is inducible by IPTG 21 . However, most of the rhGALNS was recovered from protein aggregates 9,16 . We hypothesized that the expression of rhGALNS is either controlled poorly at the transcriptional level or it is highly overexpressed, which leads to the accumulation of rhGALNS as protein aggregates.
To facilitate the production process (i.e. avoiding the use of inductors) and considering a necessary strong regulation over rhGALNS expression, the promoters osmY and proU mod were selected for gene expression. These two promoters have been described to be tightly controlled under σ s , and to induce the gene transcription at the late-exponential and at the onset of stationary growth phases 22,23 . The promoters osmY and proU mod were cloned to obtain the plasmids pGEXosmY and pGEXproUmod (Table 1 and Supplementary Data). Subsequently, the plasmids were transformed into E. coli BL21(DE3) for rhGALNS production as described in the Materials and Methods section. The cells were growth in M9 minimal media, and E. coli BL21(DE3)/pGEX-5X-GALNSopt was used as control.
As shown in Fig. 1, we did not observe a statistically significant increment in rhGALNS activity, in comparison to the levels observed using the strain BL21(DE3)/pGEX-5X-GALNSopt, when gene expression was driven by the osmY promotor, either intra-or extracellularly. This promoter has been reported as a weak promoter strongly regulated by σ s 23 , in correspondence to the low enzyme activities obtained. On the other hand, rhGALNS activity increased significantly when the proU mod promoter was used. The enzymatic activities obtained at the intracellular level ( Fig. 1C and D) were 0.08 U/(mg total protein) and 0.16 U/mL, corresponding to increments of 695% and 245%, respectively, in comparison to the levels observed using the strain E. coli BL21(DE3)/pGEX-5X-GALNSopt. At the extracellular fraction, the enzyme activities, as shown in Fig. 1A and B, were 0.67 U/(mg total protein) and 0.86 U/mL of rhGALNS, equivalent to increments of 751% and 309%, respectively, in comparison to the levels observed using pGEX-5X-GALNSopt.
It is important to highlight that secretion of rhGALNS is a process mediated by the human native GALNS signal peptide. Extracellular secretion of rhGALNS in E. coli BL21(DE3), associated to the presence of a native signal peptide, was previously observed 16 . In the study, by using a similar plasmid construction to the one used in this report (i.e. GALNS with a native signal peptide downstream of the glutathione S-transferase [GST]), it was observed that removal of the signal peptide completely abolished secretion of the recombinant enzyme. Those results suggested that GALNS signal peptide is recognized by E. coli secretion machinery, even if it is located downstream of the GST peptide. In addition, removal of signal peptide severely affected the activation process of rhGALNS 16 .
In order to evaluate the presence of rhGALNS in different protein fractions, we performed a Western-blot using an anti-GALNS antibody, produced against the N-terminal region of the enzyme 16 . As expected, the antibody reacted with the high molecular mass rhGALNS polypeptide (~60 kDa) but it did not detect the small rhGALNS subunit (~18 kDa) (Fig. 1E). Although the sequence encoding for the GST tag was present in the recombinant plasmids and in-frame with GALNS sequence (see plasmids maps in the Supplementary Data), the expected 26 kDa size shift corresponding to this tag was not observed. In this sense, these results might suggest that either the fusion protein GST-rhGALNS was not produced or a post-translational processing of the recombinant enzyme might eliminate the GST tag. The absence of the fusion protein was validated using an affinity chromatography (see Supplementary Data). The results showed that GST tag was produced but it was not fused to recombinant enzyme. Although, further experiments are necessary to elucidate the reasons of the production of a protein lacking the GST tag, the Western-blot analysis, in addition to the enzyme activity results, confirm the production of the rhGALNS in E. coli BL21(DE3). Furthermore, the Western-blot analysis showed a lower rhGALNS production in all the protein fractions studied by using the proU mod promoter, opposite to the production under the tac promoter (Fig. 1E). In addition, the ∼60 kDa rhGALNS precursor was observed in extra and intracellular fractions, while the precursor and the processed polypeptides were observed in the protein aggregates. These results might suggest an improvement in protein folding, since rhGALNS activity increases using the promoter proU mod even when the corresponding enzyme amounts decrease. A densitometry analysis of the Western-blot for Fig. 1E was performed using the software ImageJ 24 and allowed us to roughly estimate increments in protein folding of approximately 20X and 15X in the intracellular and extracellular fractions, respectively.
The dynamics of rhGALNS expression under the promoter proU mod was determined as shown in Fig. 2. Different parameters were monitored in 100 ml culture during the first 24 h of cultivation including the intracellular and extracellular total soluble protein, the intracellular and extracellular rhGALNS activity, amounts of rhGALNS mRNA, and the protein amount of rhGALNS in the different protein fractions. After the first 6 h of cultivation the cells decelerate their growth, entering in stationary phase, as evidenced in the steady amounts of intracellular soluble protein ( Fig. 2). At this point, the expression of rhGALNS increased significantly, supported by the elevation of the rhGALNS transcripts. The expression profile is an expected gene expression dynamics profile of promoters governed by σ s regulation, as demonstrated in the study of rpoS-dependent gene promoters expressing the gene lacZ 22 and the study of stationary-phase promoters using GFP 25 . This increment in rhGALNS transcription leads to an increase in the corresponding protein levels, as seen in the Western-blot (it increased in an accelerated manner in the first few hours after entering stationary phase), and consequently an increment in the rhGALNS activity. An increasing concentration of rhGALNS in the solubilized protein aggregates was observed, indicating that reduced expression is not sufficient to ensure proper protein folding (Fig. 2). Analysis of the extracellular fraction exhibits a small variation of protein along with an increasing activity of rhGALNS, suggesting a higher level of protein folding. However, these findings are currently under further study. Additionally, even after the reduction of rhGALNS gene expression under the control of proU mod , we still observed a different expression pattern in protein aggregates (Fig. 2C). This phenomenon could be due to poor protein processing or traffic saturation as shown in other studies [26][27][28] .
These results demonstrate the importance of controlling gene expression in the production of rhGALNS, not only in terms of promoter strength, but also regarding transcriptional dynamics and regulation. These results also support the versatility of using physiologically-controlled promoters for protein expression in the sense that it will facilitate the controllability of production processes and reduce the process costs since no inducer is required.
Induction of osmoprotectants. Two strategies to improve the amount of soluble recombinant proteins were used in this study, which involve increasing the concentration of osmolytes and the overexpression of chaperones 14,29 . Overproduction of bacterial chaperones, some of which actively drive folding processes whereas others prevent protein aggregation, can be obtained by different extracellular stresses such as heat-shock 30 or osmotic shock 31 .
We implemented osmotic shock as a method for inducing proper protein folding, exposing the bacterial cultures of E. coli BL21(DE3)/pGEX-5X-GALNSopt to high concentrations of sucrose. Two concentrations of sucrose were tested for this purpose, 5% and 10% (w/v). As shown in Fig. 3, we observed a statistically significant increase in rhGALNS production when the two conditions were tested, although the highest response was observed using a concentration of 5% (w/v) sucrose. The lower effect observed with 10% (w/v) sucrose, in comparison with 5% (w/v) sucrose, agrees with Barth, et al. 32 report, who suggested that an excessive osmotic stress in the absence of cytoplasmic compatible solutes to rescue cell grow from inhibitory conditions (e.g. organic osmolytes as betaine, carnitine, trehalose, proline, mannitol, and small peptides 33 ), could hinder recombinant protein folding. The largest increments in rhGALNS activity were obtained at the intracellular level, with enzyme activity levels of rhGALNS of 0.06 U/(mg total protein) and 0.14 U/ml (increments of 507% and 219%, respectively, in comparison to an unexposed culture) when cells were grown in presence of 5% (w/v) sucrose. This result is consistent with the effect observed in the Western-blot (Fig. 3E). We did not observe a large increment in the amount of rhGALNS at the intracellular fraction, even if the intracellular activity increased when sucrose was added, suggesting that osmotic stress increases the proper folding of proteins at the cytoplasmic space. However, we estimated a 3-fold increment in rhGALNS recovered from the insoluble fraction, as shown in the Western-blot. On the other hand, an interesting effect occurred with the protein secreted to the culture medium. As shown in Fig. 3A and B, although a statistically significant increment in specific activity after 24 h of induction was not observed, there was a significant increase in the volumetric activity. The same tendency was evidenced when 10% (w/v) sucrose was used. Elevated amounts of secreted rhGALNS were quantified using an indirect ELISA, as shown in Supplementary Data, where approximately 300% more rhGALNS was found in the medium when the culture was exposed to osmotic stress. In this sense, these results indicates an elevation in the amount of secreted protein but not in its folding, as well as that osmotic stress does not increase the proper folding of proteins at the periplasmic space. This effect can be explained since bacterial cells, under osmotic stress, accumulate small molecules known as osmolytes in the cytoplasm, acting as chemical chaperones 34,35 .
Overexpression studies were carried out to determine the effect of individual chaperones genes on the enzyme activity of rhGALNS, or on the contrary, if the observed benefit via osmotic stress was due to a global stress response. Several chaperones were selected for these overexpression studies including the chaperonin system GroESL (genes groS and groL) 36 ; the chaperones DnaK, DnaJ and GrpE capable of repairing heat-inducing protein damage 37 . heat-shock proteins IbpA and IbpB involved in the binding of protein aggregates in E. coli 38 ; the complex DsbA-DsbB involved in the formation of disulfide bonds in the periplasmic space 39 and the chaperone ClpB 40 . The role of these chaperones on protein folding is schematize in Fig. 4A. The plasmids for the overexpression of the chaperones proteins are summarized in Table 1 and Supplementary Data. The E. coli strain BL21(DE3) was co-transformed with the plasmids pGEX-5X-GALNSopt and a recombinant pACYCDuet ™ -1 plasmid for overexpression of the different chaperone genes ( Table 1). The clones were screened based on the double antibiotic selection and tested for rhGALNS production in 100 mL as described in the Materials and Methods section. We did not observe a benefit neither in specific nor in volumetric rhGALNS activity when the chaperones were co-expressed alongside rhGALNS (Fig. 4). Even though chaperones are cataloged as folding modulators in the production of recombinant proteins 41,42 , their co-production has been shown to reduce the yield and quality of several recombinant proteins produced in E. coli, as observed in the production of horseradish peroxidase 43 , guinea pig liver transglutaminase 44 , fibroblast growth factor 45 , cyclodextrin glycosyltransferase 46 and different antibodies fragments 47,48 . In addition, overexpression of chaperones can contribute to metabolic burden, thereby leading to growth rate reduction as well as decreased final biomass yields 49 . It remains to be determined what genes are involved in the increment of rhGALNS activity once the cells are exposed to high concentrations of sucrose, but we hypothesize based on the findings described above, that this is the effect of a global stress response and not to the action of individual chaperones. Improving formation of disulfide bonds. The disulfide bond is the most common link between amino acids after the peptide bond 50 and around 15% of human produced proteins are predicted to have disulfide bonds 51 . In the instance of the human GALNS, it contains three disulfide bonds per monomer, increasing its stability and activity 52 . The formation of disulfide bonds in all organisms are compartmentalized in the extra-cytoplasmic sections such as the endoplasmic reticulum in eukaryotes, or the periplasmic space of gram-negative bacteria, such as E. coli 53,54 . This phenomenon is due to the presence of enzymes devoted to the reduction of disulfide bonds in the cytoplasm. In E. coli, the oxidative environment in the periplasm allows the formation of disulfide bonds, crucial for the activity and stability of several proteins, promoting resistance against proteases and harsh environments 50 . Therefore, proteins requiring disulfide bonds for their folding and stability, such as the rhGALNS, are prone to be misfolded and not active when expressed in the cytoplasm of E. coli. We hypothesized that the lower activity of the intracellular rhGALNS, in comparison to its extracellular counterpart, could be associated with inefficient protein folding due to the reducing cytoplasmic environment. For this reason, we tested the E. coli strain SHuffle ® T7. This strain is an engineered version of E. coli BL21, modified to allow the formation of stable disulfide bonded proteins within the cytoplasm 55 . This modification was generated by the deletion of the thioredoxin reductase (trxB) and glutathione reductase (gor) genes, the presence of the mutant peroxidase AhpC* to restore reducing power to the bacteria, and the overexpression of the chaperone DsbC lacking the signal sequence to be co-expressed in the cytoplasm 55 . Volumetric activity in the intracellular fraction. Enzyme activity was assessed 24 hours after induction. A Student's t-test was performed to assess significance, where (*) corresponds to a p-value < 0.05 and (****) p-value < 0.0001. Specific activities are expressed as U rhGALNS/ (mg total protein) and volumetric activities as U/ml. (E) Western-blot of rhGALNS levels in the intracellular and insoluble protein fractions using high osmotic stress, with production controlled by the promoter tac using optimal (0.5 mM) IPTG concentration. The results showed a significant increase of rhGALNS protein in both, intracellular and solubilized protein aggregates fraction under tac control supplemented with 5% of sucrose. In addition, increased expression was observed in both processed and mature protein.
Scientific RepoRts | 7: 5844 | DOI:10.1038/s41598-017-06367-w We transformed the E. coli strain SHuffle ® T7 either with the plasmids pGEX-5X-GALNSopt or pGEXproUmod. As observed in Fig. 5A and B, there were not statistically significant differences in the extracellular specific and volumetric activities of rhGALNS in E. coli strain SHuffle ® T7 when the gene expression was controlled by the tac promoter, in comparison to the results observed in E. coli BL21(DE3). When the promoter proU mod was used, the outcome was similar to the previously described in regard of the specific activity, but we observed a reduced volumetric activity of rhGALNS. This effect can be attributed to the lower biomass reached by the strain E. coli SHuffle ® T7 (1.8 g/L) than the one obtained using E. coli BL21(DE3) (3.2 g/L), which could be associated with the numerous genetic modifications in the strain E. coli SHuffle ® T7. Nevertheless, by using the proU mod promoter, a large improvement occurred at the cytoplasm with an increase of the specific activity of 1,283% [0.14 U/(mg total protein)] (Fig. 5C and D). We evaluated the increments of protein folding by using a Western-blot (Fig. 5E). The Western-blot analysis indicates a reduction of the intracellular levels of the precursor rhGALNS using E. coli SHuffle ® T7, in comparison with the levels observed by using E. coli BL21(DE3). Through a densitometric analysis of the Western-blot results, we estimated the concentrations of rhGALNS present in the evaluated samples, in order to assess the specific enzyme activity (U rhGALNS/mg rhGALNS), thus determining an approximate increment in enzyme activity of 7X at the intracellular fraction when either plasmids (pGEX-5X-GALNSopt or pGEXproUmod) were used in conjunction to the strain E. coli SHuffle ® T7. However, we did not observe important variations at the extracellular fraction. This result is in accordance with the expected effect using E. coli SHuffle ® T7, since all the genetic modifications in this strain are designed to improve intracellular protein folding due to an oxidative cytoplasmic conditions, but not at the periplasmic space where the disulfide forming environment is already present. Similar results were reported in the production of human herpesvirus type-6, with production increments up to 5-fold in comparison to the control strain E. coli BL21(DE3) 56   Enzyme activity was assessed 24 hours after induction using optimal (0.5 mM) IPTG concentration. A Student's t-test was performed to assess significance, where (**) corresponds to a p-value < 0.01. Specific activities are expressed as U rhGALNS/(mg total protein) and volumetric activities as U/ml. As noticed, the overexpression of individual chaperone proteins did not aid in the increase of rhGALNS activity. The names shown in this panels B to E correspond to the overexpressed chaperones. Full names of the plasmids used can be found in Table 1.
Combinatorial effect. Lastly, we tested the combination of the studied approaches to determine possible additive effects. The largest increment of rhGALNS activity in the extracellular fraction occurred with the use of proU mod promoter and 5% (w/v) sucrose, obtaining activities of 1.23 U/(mg total protein) and 1.20 U/ml, which represent an improvement of 1,463% and 470%, respectively, in comparison to the results obtained with E. coli BL21(DE3)/pGEX-5X-GALNSopt. These substantial improvements also occurred at the intracellular level as seen in Fig. 6 and Table 2, with activities of 0.16 U/(mg total protein) and 0.20 U/ml, which represent an increments of 1,475% and 335%, respectively in comparison to BL21(DE3)/pGEX-5X-GALNSopt.
We tested if the use of a different osmotic agent, in a similar osmolarity as implemented with sucrose, leads to the same effect on rhGALNS activities. As seen in Fig. 6, using sodium chloride at 0.05 N, in conjunction with the use of the promoter proU mod , allows the production of similar enzymatic activities to the observed using sucrose in combination to the same promoter. This result suggests that the osmotic effect on rhGALNS activity is not due to the nature of the osmoreagent. Aspedon et al., using a transcriptomic analysis in Pseudomonas aeruginosa under osmotic stress with sodium chloride and sucrose indicated that stress response to osmotic shock is not dependent of a specific compound or its ionic nature, but it is rather a general stress response process 58 .
Finally, we observed that the highest intracellular specific activity was obtained by combining all the approaches studied in this work (i.e., 5% (w/v) of sucrose, the strain E. coli SHuffle ® T7 and the promoter proUmod ). This improvement corresponds to 2,095% of the activity observed under the initial culture conditions [0.22 U/(mg total protein)]. This arrangement permitted statistically significant improvements of the other studied variables, but it was not as high as with other combinations.

Conclusions
In summary, several approaches were used to increase the production of the human recombinant enzyme GALNS produced in E. coli. These approaches included the control of rhGALNS expression using a promoter regulated under σ s , the induction of osmoprotectans production through osmotic shock by using sucrose or sodium chloride, the improvement in the formation of disulfide bonds in the cytoplasmic space, and the combination of different angles to explore additive effects. The use of the promoter proU mod permitted an improvement in protein folding, with estimated increments of rhGALNS specific activities of 20X and 15X in the intracellular and extracellular fractions, respectively. This improvement could be attributed to the reduction of gene expression, as evidenced in Fig. 1E, permitting the protein folding machinery of the bacteria to cope with the folding of the recombinant protein. The nature of proU mod eliminates the use of inducers, due to the regulation of the promoter by the sigma factor σ s , thus facilitating the protein production bioprocess. We also utilized 5% (w/v) sucrose to expose bacterial cultures to osmotic shock, favoring a proper protein folding. We found important improvements at the intracellular level (Table 2), possibly due to a global stress response. This hypothesis was supported when several chaperones were co-expressed alongside rhGALNS. In this instance, none of the tested chaperones increased the rhGALNS activity, which agrees with the idea that the increase in the rhGALNS activity after osmotic shock induction was probably due to a general stress response and not to the action of a particular chaperone protein.
On the other hand, as an approximation to increase proper folding of the recombinant protein by boosting the formation of disulfide bonds, we used the strain E. coli SHuffle ® T7 as a host for protein production. We observed important improvements in protein activity in the intracellular fraction when rhGALNS was driven by either the promoter tac or proU mod . However, the low biomass yield hinders the use of this strain in the production of large amounts of rhGALNS. Lastly, we reported that high concentrations of sucrose in conjunction with the physiological regulated promoter proU mod significantly increased the rhGALNS production and activity. Taken together, these results represent valuable information for the production of human lysosomal enzymes in E. coli, and could have a significant impact on the development of enzyme replacement therapies for lysosomal storage diseases.

Materials and Methods
Bacterial strains and plasmids construction. The E. coli strains, BL21(DE3) (B F-ompT gl dcm lon hsdSB
Previously, we showed that recombinant GALNS produced in E. coli BL21(DE3) had a lower enzyme activity in the crude extract (i.e. production) than that reported using CHO cells 9 . In an attempt to increase the enzyme production, GALNS cDNA sequence (GenBank accession number NM_000512.4) was optimized by adapting the codon usage to the bias of E. coli, as well as by removing any negative cis-acting sites (i.e. splice sites, TATA-boxes, etc.). Optimized GALNS cDNA (GALNSopt) was synthesized by InvitrogenTM GeneArt ® (Invitrogen, Carlsbad, CA, USA). The synthetic gene was inserted between the EcoRI and XhoI sites of pGEX-5X-3 (GE Healthcare, Piscataway, NJ, USA) to generate pGEX-5X-GALNSopt plasmid (6.4 kb) where rhGALNS expression is driven by the tac promoter. This plasmid was used as control of rhGALNS expression in most of the experiments carried out in this study.
The pGEX-5X-GALNSopt plasmid was used as backbone for the expression of rhGALNS under the control of the physiologically regulated promoters proU CCTATAAT (5′-GGG GCC GCC TCA GAT TCT CAG TAT GTT ATA ATA GAA AA-3′) 22 and osmY (5′-TAT CCC GAG CGG TTT CAA AAT TGT GAT CTA TAT TTA ACA AA-3′) 23 . The ribosomal binding sites were designed to maximize the translational initiation rates, using the RBS Calculator from Salis Lab 59 . The RBS designed for the promoters osmY and proU mod (proU CCTATAAT ) were 5′-CGA AAT CAA CAA AAG CGG TTA CTA AC-3′ and 5′-GCG AAC GGA AAT CTA CGG TTA ACA T-3′, respectively. All primers used in this study are summarized in Table 3. For the construction of the plasmid pGEXosmY, the backbone was amplified via PCR with high-fidelity Bestaq ™ DNA Polymerase (Applied Biological Materials, Richmond, BC, Canada), using the primers pGEX-5X-f and osmY-r and the rhGALNS gene was amplified using the primers osmY-f and pGEX-5X-r. For the generation of these amplicons, the plasmid pGEX-5X-GALNSopt was used as template for PCR. The "scarless" cloning technique, Sequence and Ligation Independent Cloning (SLIC) 60 was used to create the DNA assembles RBS/promoter/backbone. In short, equimolar amounts of the amplicons (total 200 ng) were mixed with 1 µL of 1/5 diluted (in water) 100X bovine serum albumin (New England Biolabs) and 1 µL 1/5 diluted T4 DNA Polymerase [1 uL NEBuffer2, 7 uL H 2 0, 2 µL T4-Polymerase (3 U/ml) (New England Biolabs)]. The total reaction volume was 20 µL. The mixture was incubated at 22 °C for 5 min, heated up to 70 °C for 20 minutes, followed by 30 minutes at 37 °C using a 10% ramp, and then kept at 4 °C for 18 hours. The reaction product was transformed via electroporation into E. coli DH5α and grown onto solid LB agar supplemented with 100 ng/µL of ampicillin. Transformants were screened with restriction endonucleases and the correct construct was transformed in E. coli BL21(DE3) using heat shock. A similar protocol was implemented for the construction of the plasmid pGEXproUmod, where the primers pGEX-5X-f and proU-r, and proU-f and pGEX-5X-r were used to obtain the amplicons corresponding to the backbone and rhGALNS, assembled together using SLIC as previously described.
All the corresponding genes for the chaperones in study were amplified via PCR from genomic DNA of E. coli BL21(DE3) using the high-fidelity Bestaq ™ DNA Polymerase (Applied Biological Materials) and cloned into the bicistronic plasmid pACYCDuet ™ -1 (Novagen, Merck Millipore, Billerica, MA, USA) by restriction digest. All the constructs are summarized in Table 1  Quantification of GALNS activity. rhGALNS activity was assessed using the fluorescent substrate 4-methylumbelliferyl-β-D-galactopyranoside-6-sulfate (Toronto Chemicals Research, North York, ON, Canada) as described elsewhere 62 . One unit (U) was defined as the amount of rhGALNS catalyzing 1 nmol substrate per hour, and rhGALNS activity was expressed as U/(mg total protein) (total protein determined by Lowry protein assay 63 ) or as U/mL. Enzyme activity was assayed in the soluble fraction (intracellular) and growth media (extracellular).
Western blotting. The proteins from different fractions were homogenized in lysis buffer and equivalent amounts of total protein extracts were loaded and processed by SDS-PAGE gels and electroblotted onto nitrocellulose membranes (Amershan Protan TM 0.45 µm, GE Healthcare, Little Chalfont, UK). The membranes were blocked with a blocking buffer containing TTBS [0.3% Tween 20, and 5% BSA (Sigma-Aldrich, St. Louis, MO, USA)] at room temperature for 1 hour. Membranes containing resuspended protein aggregates, extracellular and intracellular proteins were incubated overnight at 4 °C with a specific primary antibody rabbit anti human GALNS (1:1000 in blocking buffer) 9 . The specificity of this antibody was previously assessed, and no band was recognized in samples from non-induced and plasmid-free strains 9 . A peroxidase conjugated goat anti-rabbit (Sigma-Aldrich) was added (1:2000 in blocking buffer) for 1 hour at room temperature. The specific bands were visualized using enhanced chemiluminescence (SuperSignal ™ West Pico Chemiluminescent Substrate, Thermo Fisher Scientific ® , Waltham, MA, USA).
qRT-PCR experiments. Frozen stocks of the strain BL21(DE3)/pGEXproU mod (0.5 mL) were transferred into 10 mL of fresh LB media supplemented with 100 ng/mL of ampicillin and incubated for 18 h at 37 °C and 180 rpm. Production of rhGALNS was carried out in 100 mL cultures using M9 minimum medium supplemented with 20 g/L of D-glucose under aerobic conditions, inoculated with 10 mL of overnight culture, kept at 37 °C and 180 rpm. Samples (10 mL) were taken 2, 4, 7, 12 and 24 hours after inoculation of all three biological replicas. The samples were pelleted by centrifugation at 3000 x g and 4 °C during 10 minutes. The supernatant was discarded and the pellets were immediately placed at −80 °C until further processing. The samples were processed in a period lower than 12 hours after pelleting. Total RNA was extracted using the ZR Fungal/Bacterial RNA Miniprep TM Kit (Zymo Research, Irvine, CA, USA), following manufacturer's indications and immediately stored at −80 °C. The cDNA was generated using the RevertAid First Strand cDNA Synthesis Kit (Thermo Fisher Scientific) using 100 ng of total RNA and following the random hexamer primed synthesis manufacturer's protocol. Two primers (GALNS-f and GALNS-r) ( Table 3)) were designed for the quantification of rhGALNS transcripts using NCBI Primer Design Tool and they were used for the qPCR experiments. Primer amplification efficiency was assessed using the plasmid pGEXproUmod (purified using the Wizard SV Gel and PCR Clean-up System (Promega, Madison, WI, USA)) as template. The amplification efficiency of these primers was estimated as 94%.
To generate a standard curve for absolute rhGALNS copy number quantification, the plasmid pGEXproUmod was serially diluted to obtain different plasmid copies ranging from 300,000 to 30. Triplicates of each data-point were used to ensure reliability of the data. All qPCR experiments were carried out using Luna ® Universal qPCR Master Mix (New England Biolabs) following manufacturer's indications and read in a QuantStudio 3 Real-Time PCR System (Thermo Fisher Scientific). The results as reported as rhGALNS copy number per ng of total RNA. Statistical analysis. Statistical significance was evaluated using a Student's t-test, employing the software GraphPad Prism version 6 for Windows (GraphPad Software, San Diego, California, USA).