Determining the N-terminal orientations of recombinant transmembrane proteins in the Escherichia coli plasma membrane

Lee, Chien-Hsien; Chou, Chia-Cheng; Hsu, Min-Feng; Wang, Andrew H.-J.

doi:10.1038/srep15086

Download PDF

Article
Open access
Published: 14 October 2015

Determining the N-terminal orientations of recombinant transmembrane proteins in the Escherichia coli plasma membrane

Chien-Hsien Lee^1,2,
Chia-Cheng Chou^1,2,
Min-Feng Hsu^1,2 &
…
Andrew H.-J. Wang^1,2

Scientific Reports volume 5, Article number: 15086 (2015) Cite this article

5630 Accesses
4 Citations
2 Altmetric
Metrics details

Subjects

Abstract

In silico algorithms have been the common approach for transmembrane (TM) protein topology prediction. However, computational tools may produce questionable results and experimental validation has proven difficult. Although biochemical strategies are available to determine the C-terminal orientation of TM proteins, experimental strategies to determine the N-terminal orientation are still limited but needed because the N-terminal end is essential for membrane targeting. Here, we describe a new and easy method to effectively determine the N-terminal orientation of the target TM proteins in Escherichia coli plasma membrane environment. D94N, the mutant of bacteriorhodopsin from Haloarcula marismortui, can be a fusion partner to increase the production of the target TM proteins if their N-termini are in cytoplasm (N_in orientation). To create a suitable linker for orientating the target TM proteins with the periplasmic N-termini (N_out orientation) correctly, we designed a three-TM-helix linker fused at the C-terminus of D94N fusion partner (termed D94N-3TM) and found that D94N-3TM can specifically improve the production of the N_out target TM proteins. In conclusion, D94N and D94N-3TM fusion partners can be applied to determine the N-terminal end of the target TM proteins oriented either N_in or N_out by evaluating the net expression of the fusion proteins.

Structure of the ER membrane complex, a transmembrane-domain insertase

Article 03 June 2020

Covalently modified carboxyl side chains on cell surface leads to a novel method toward topology analysis of transmembrane proteins

Article Open access 31 October 2019

Stabilization and structure determination of integral membrane proteins by termini restraining

Article 17 January 2022

Introduction

Transmembrane (TM) proteins are essential for cell function and viability¹. However, they must be correctly embedded in the appropriate membrane for proper folding, activity and stability. The genes encoding these proteins, e.g., transporters, receptors and channels, comprise ~20–30% of all coding sequences in a typical genome^1,2. Moreover, membrane proteins currently represent ~50% of the drug targets³.

How TM proteins generate their correct topologies and how they fold into functional structures in membrane environments are important questions needing answers if we are to understand TM protein biogenesis^4,5. For polytopic α-helical TM proteins, their topologies can be defined by the number of helices and how these helices are oriented in the lipid bilayer, i.e., with their termini positioned in the cytosol (in) or on the opposite side of the membrane (out). However, orientation information on TM proteins is lost during X-ray crystallographic studies because the proteins are first extracted from the membrane and solubilized in an isotopic medium, i.e., a detergent. To predict the topology and structure of TM proteins in vivo, computational tools are available^4,6,7 and some experimental biochemical methods exist^8,9. However, in silico simulation and prediction studies still require experimental validation, a task that has proven difficult to date^10,11. Because TM protein topology depends on multiple determinants, and, in certain cases, insufficient data are considered in predictive algorithms resulting in misaligned and/or misoriented helices^4,7. In addition, only a few TM protein structures have been deposited into the Protein Database that might be used for in silico comparisons and statistical analyses (http://blanco.biomol.uci.edu/mpstruc/)¹². Moreover, although some soluble protein tags, e.g. β-lactamase (BlaM) and green fluorescent protein (GFP), can be used to determine the orientation of C-terminal TM protein residues^8,9,13, methods for determining the orientation of the N-terminus are limited.

During TM protein biogenesis in Escherichia coli, usually the N-terminal region is the first to be inserted into the plasma membrane via a coordinated effort involving the ribosome-nascent polypeptide complex and the translocon (reviewed in Luirink et al.¹⁴) and, therefore, is critical for both this process and the proper orientation of a protein in the membrane^4,9,15. Mutation or modification of the N-terminus of a TM protein may have a negative effect on membrane targeting and the native state of the protein^9,16. Consequently, only limited experimental method exists to assess the N-terminus location of a mature TM protein. To address this deficiency, we have developed a quick and easy method to obtain N-terminal topological information concerning the recombinant TM proteins inserted into the E. coli plasma membrane.

Previously, we developed an expression vector for TM proteins¹⁷ that encodes a mutant (D94 → N) of the TM protein bacteriorhodopsin from Haloarcula marismortui (HmBRI/D94N, denoted hereafter as D94N) with a cytoplasm-oriented C-terminus (C_in) when embedded in a membrane. This design increases the expression of a downstream targeted TM protein with the same N-terminus orientation (N_in). However, this design is not suitable for N_out targets. We surmised that by adding a linker with odd number TM helices between D94N and the N_out targets will serve the purpose.

In a separate project in our laboratory, we have designed computationally a four-helical bundle membrane protein (called 4TM) derived from the soluble cytochrome b₅₆₂. We tested this newly designed 4TM and its subsystems of 1TM, 2TM and 3TM as possible linkers. We found that 3TM is the most effective one. By coupling D94N with 3TM, we created a D94N-related TM protein construct with a C-terminus exposed to the E. coli periplasmic face (C_out) which improves the expression of N_out TM proteins. Therefore, using our two fusion partners, i.e., D94N and D94N-3TM, the N-terminal orientation of the target TM protein can be deduced efficiently by assessing the net expression of the protein product. Fusing the target TM protein downstream to the compatible fusion partner can produce a correct topology, whereas fusing the target TM protein to the incompatible one might results in a unstable protein due to the interrupted topology. The unstable TM proteins are presumably proteolytically removed by in vivo quality control system^18,19.

We assessed our method using three TM proteins with known N_out orientations, i.e., NADH dehydrogenase subunit A (NuoA) and subunit J (NuoJ)²⁰, nitrate reductase 1 gamma chain (NarI)²¹ and another one with a proposed N_in orientation, undecaprenyl pyrophosphate phosphatase (UppP)²² from E. coli. The results are consistent with our expectations. We next used our method to determine the orientations of the N-termini of seven additional TM proteins to assess the practicality of this strategy. Our method is quick and easy to perform and does not rely on mutagenesis, structural determination, or chemical labeling to determine the topology of the N-terminal residue in a TM protein. By combining the protein tags for C-terminal orientation determination with D94N-related fusion partners, a more comprehensive topological information of a target TM protein can be obtained.

Results

Computational design of a TM protein

Previously, we generated D94N to improve the expression levels of some membrane proteins^17,22. Because the C-terminus of D94N is oriented in the membrane as C_in (http://opm.phar.umich.edu/)¹², fusing a TM protein with a naturally occurring N_in orientation downstream to D94N should potentially increase its expression. Therefore the increased expression could suggest that the fused TM target protein has a N_in orientation. In order to produce the target TM proteins with N_out termini, a different D94N-related fusion partner need to be generated. We used a four helical bundle TM protein derived from E. coli cytochrome b₅₆₂ as the starting point. We tested this newly designed 4TM and its subsystems of 1TM, 2TM and 3TM as possible linkers. The computational design strategy is described in the Method section (Fig. 1A).

The successful expression of D94N-4TM-EGFP in E. coli C41(DE3) was confirmed by Western blotting (WB) using an anti-His antibody (Fig. 1D). A clear band was detected at the anticipated molecular mass of D94N-4TM-EGFP and migrated more rapidly than expected for TM proteins (theoretical molecular mass, 73.4 kDa). At the bottom position a band with the molecular mass corresponding to D94N/His-tag fragment was detected, suggesting the adventitious proteolysis of the fusion protein. The product does not have the red color of heme protein, indicating that 4TM is no longer a heme-binding protein.

To see if the 4TM is able to form a stable conformation, 4TM-EGFP was purified and released from D94N-4TM-EGFP by TEV protease and then characterized by fluorescence-detection size-exclusion chromatography²³ in the buffer (50 mM Tris-HCl, pH 7.5, 500 mM NaCl, 0.01% n-dodecyl-β-d-maltopyranoside). The result showed that 4TM-EGFP has a very symmetrically peak shape, suggesting a single conformation (Supplementary Fig. S2). Taken together, it appears that 4TM is stable in the E. coli plasma membrane, which permits us to test if the odd-helix derivatives of 4TM can be a linker with a C_out orientation.

Verifying that the predicted TM sequences of 4TM are inserted into the membrane

To determine whether 4TM is embedded in the membrane and the C-terminal orientation of each TM, we individually fused BlaM or EGFP at R121 of 4TM and derivatives of 4TM that contained one, two, or three predicted TM helices (from the N- to C-termini of 4TM and denoted 1TM, 2TM and 3TM, respectively; see Fig. 1C and Fig. 2A for the truncation sites). BlaM and EGFP have been routinely used to determine the C-terminal orientations of TM proteins^8,9. BlaM is a periplasmic reporter and protects cells from β-lactam antibiotics only if it translocates to periplasm. Τhe C-terminal codon of each construct was fused to the mature BlaM DNA sequence and cloned into pET-42b(+). To confirm that BlaM was well expressed, folded and functional, patch tests were performed²⁴. All clones were viable at 10 μg/mL ampicillin, indicating that functional BlaM is expressed in each clone (Supplementary Fig. S3A). The relative migration patterns of the fusion proteins detected by WB for whole cells and the membrane fractions presented as a ladder for increasing molecular mass (Fig. 2B), also demonstrating that all constructs are fully and stably expressed. Meanwhile, spot test showed that only clones expressing D94N-1TM-BlaM or D94N-3TM-BlaM formed colonies at 50 and 100 μg/mL ampicillin (Fig. 2B and Supplementary Fig. S3B), whereas the systems expressing D94N-2TM-BlaM and D94N-4TM-BlaM did not. The results suggest that TM1 and TM3 are oriented as N_in-C_out and that TM2 and TM4 are oriented as N_out-C_in. The periodic relationship between the four 4TM constructs and antibiotic resistance indicates that 4TM contains four membrane-spanning segments.

To provide additional evidence, we replaced BlaM with the cytosolic reporter, EGFP. Fluorescence of GFP can be detected when GFP localized to cytoplasm, whereas it is inhibited if GFP translocated to periplasm where GFP is improperly folded and degraded²⁵. The fluorescence of intact cells that expressed D94N-1TM-EGFP was taken as the cut-off value for its C_out orientation deduced by the result from BlaM tag (Fig. 2C). The calculated fluorescence intensities of cells expressing D94N-2TM-EGFP and D94N-4TM-EGFP were above the cut-off value (Fig. 2C). That the whole cell fluorescent intensity of D94N-2TM-EGFP construct is 2.4-fold to that of D94N-4TM-EGFP may come from the net expression levels of them (Fig. 2C). These results are consistent with those of the BlaM-fusion tag experiments in that both types of experiments suggest that TM2 and TM4 have C_in orientations and TM1 and TM3 have C_out orientations.

Taken together, the results suggested that 4TM is a multi-spanning protein and it possesses a topology consistent with our predicted model (Fig. 2A). Moreover, 1TM and 3TM linked after D94N can successfully modify the C-terminal orientation to C_out.

Development of an expression fusion partner specific for TM proteins with an N_out orientation

D94N fusion partner has a C_in orientation¹⁷ (Fig. 3A, left), therefore it should only allow for the proper insertion of the TM proteins possessing N_in orientations. Notably, the D94N-1TM-BlaM and D94N-3TM-BlaM constructs should have C_out orientations allowing BlaM in the periplasm space. Because D94N-3TM-BlaM has a higher net expression level than does D94N-1TM-BlaM (Fig. 2B), we assessed if D94N-3TM could be a possible fusion partner for a N_out TM protein (Fig. 3A, right). Even though the designed protein folds as a four helix in the membrane cannot guarantee the 3TM as a stable linker, but a higher net expression of D94N-3TM than that of D94N-4TM indicated that D94N-3TM is also a stable complex. We thus individually fused three N_out TM proteins, which are NADH dehydrogenase subunit A (NuoA) and subunit J (NuoJ)²⁰ and nitrate reductase 1 gamma subunit (NarI)²¹ from E. coli BL21(DE3) to D94N-3TM and D94N and assessed their net expression levels. WB of each expressed DN94N-3TM constructed, e.g., D94N-3TM-NarI, showed a clear band at its anticipated position (Fig. 3B, +), providing evidence that D94N-3TM can be used for expression of TM proteins with an N_out orientation.

To validate that the observed levels of NarI, NuoA and NuoJ expression require D94N-3TM, the expression level of each construct containing only D94N fused upstream of each protein was assessed (Fig. 3B, −). In each case, the WB signal was much weaker when the 3TM sequence was not included in the constructs, indicating that the fusion of only D94N cannot increase the net expression of NarI, NuoA and NuoJ. Given these results D94N-3TM appears to be a useful fusion partner for N_out TM proteins expression. The protein fused downstream to D94N-3TM is functional, as in the case of BlaM (Fig. 2B and Supplementary Fig. S3), further suggesting that D94N-3TM can be applied to protein expression.

The three N_out TM proteins, NuoA, NuoJ and NarI, are better expressed when fused with D94N-3TM than with D94N, likely because the C-terminal of D94N-3TM and the N-terminal of each N_out TM protein are both located in the periplasm (Fig. 3B, right panel). To validate this argument, we assessed the expression of UppP from E. coli, which has been predicted to have an N_in orientation²², fused to D94N-3TM or D94N. By visualizing the protein product by WB, it is apparent that expression of UppP is increased when fused to D94N than to D94N-3TM (Fig. 3C), a finding which supports our expectation that UppP has an N_in orientation.

By surveying the practicability of D94N-related fusion partners on expression of the four TM proteins, of which the N-terminal orientations have been known and proposed, D94N-3TM fusion partner, as our expectations, can only be applied on the expression of NuoA, NuoJ and NarI, while D94N fusion partner can only be used to express UppP. Therefore, the D94N-3TM fusion vector should identify N_out TM proteins and the D94N fusion vector should identify N_in TM proteins. When expressing a target TM protein using both expression fusion partners, its status as an N_in or N_out could be independently confirmed.

Extracting N-terminal topological information using the two expression vectors

We then addressed the orientation issue by examining the compatibility of our expression fusion partners with the expression of various TM proteins. We fused D94N-3TM or D94N with the individual seven additional TM proteins, which are cobalamin synthase (CobS), multidrug efflux protein (EmrE), multiple antibiotic resistance-related protein (MarC), predicted Mg²⁺ transport ATPase (MgtC), respiratory nitrate reductase 2 gamma chain (NarV), multidrug efflux system protein (SugE), conserved inner membrane protein (YgjV) (Fig. 4B,C). These randomly selected TM proteins¹³ were subjected to the protocol diagrammed in Fig. 4A. We also predicted their orientations using TMHMM (http://www.cbs.dtu.dk/services/TMHMM/)², TOPCONS (http://topcons.cbr.su.se/)²⁶ and Phobius (http://phobius.sbc.su.se/)²⁷ (Table 1). To minimize the number of steps required, the genes were first cloned into the D93N-3TM fusion vector (see Supplementary Fig. S4 for the cloning strategy) and then a portion of each preparation was treated with BamHI to remove the 3TM sequence. The BamHI sites are in-frame and the corresponding codons (Gly-Ser) would not be involved in the topological determination. The restriction enzyme cutting sites can be strategically changed if the target genes contain BamHI site(s) during one’s cloning procedure. The N-terminal orientation of each TM protein was assessed by evaluating its expression after insertion into D94N-3TM and D94N fusion vectors by WB analysis, where the fusion proteins were all at the anticipated molecular weight (Fig. 4B,C). Expression of CobS, EmrE, MarC and SugE is more compatible with D94N fusion vector, which indicates that they have an N_in orientation (Fig. 4B). Conversely, expression of MgtC, NarV and YgjV is better when the D94N-3TM fusion vector is used, suggesting that they possess an N_out orientation (Fig. 4C).

Table 1 Topology results from prediction by TMHMM², TOPCONS²⁶, and Phobius²⁷, and the experimental data.

Full size table

We thus try to assess the C-terminal orientation of CobS and MarC fused to D94N and MgtC, NarV and YgjV fused to D94N-3TM by BlaM and EGFP protein tags (Supplementary Fig. S5). Only the cells expressing D94N-CobS-BlaM showed resistance to 50 μg/mL ampicillin, suggesting the C-terminus orientation of CobS is C_out. The fluorescent intensity of the cells expressing D94N-CobS-EGFP was used as cutoff value to assign the C-terminus orientation of the other target TM proteins fused with EGFP. The C-termini of D94N-3TM-MgtC, D94N-3TM-NarV and D94N-3TM-YgjV are accordingly assigned to C_in for the significant fluorescent intensities (Supplementary Fig. S5). Taken these together, except MarC, both N- and C-terminal orientations of CobS, MgtC, NarV, YgjV can be obtained and they are N_in/C_out, N_out/C_in, N_out/C_in and N_out/C_in, where the orientations of CobS and MgtC cannot be determined before¹³.

Discussions

A fundamental question related to TM protein biogenesis still needs to be fully answered. How does a TM protein assume its proper topology and fold into its functional state in the lipid environment? For most membrane proteins, the first hydrophobic segment is responsible for membrane targeting through the translocon and initiates topogenesis at the site^14,28,29. It has been difficult to determine the N-terminal orientation of a TM protein experimentally⁹ because the membrane anchor sequence is involved in membrane targeting and the early stage of TM protein biogenesis^1,14,29. Bioinformatics software have been introduced to address topogenesis^{2,4,6,7,26,27}. However, these programs sometimes assign different topologies to a given TM protein⁷ (Table 1). Misidentification of the orientation can result in an incorrect prediction model of a TM protein⁷. Furthermore, it is increasingly apparent that TM protein biogenesis and lipid-protein interaction also complicate TM protein topogenesis determinants^5,14 and the orientation of TM proteins is governed by multiple factors^5,9,29. Consequently the prediction algorithms may not contain sufficient information. Therefore, it is necessary to discriminate the prediction topology models of a TM protein by experimental data. The strategy^30,31, involving fluorescein 5-maleimide labeled at N-terminus and the reporter proteins fused at C-terminus, had been performed to discriminate the models of secondary transporters generated by prediction algorithms.

In our designed system, by expressing a target TM protein using D94N-3TM and D94N fusion vectors without the need for site-directed mutation and/or chemical labeling, its N-terminal orientation which has been difficult to experimentally determine, can be easily accomplished. GFP may also work at N-terminus, however, fusing GFP at N-terminus of the N_out TM proteins can impede its expression¹⁶. The fusion partners apparently can be a orientation effector for determining the topology of the downstream TM segments. As in the cases of NarI, NuoA and NuoJ (Fig. 3B), the target TM proteins can only be properly expressed when fused to a topologically compatible fusion partner, i.e., D94N-3TM in the case of NarI. In contrast, a topologically incompatible fusion partner would disturb the topology and, consequently, the folding of the targeted protein. Changing the topological signal of a TM protein may disrupt its topology integrity^15,32, which is a decisive prerequisite for stable TM protein folding in the lipid environment⁵. Quality control of membrane protein expression and folding removes mistranslated, misfolded, unstable, or malfunctioning TM proteins from Gram-negative bacterial inner membranes by a process that involves assessing the stability of the protein via a membrane protease-dependent mechanism^14,18,19. Cross-linking and co-purification data have indicated that a membrane-bound protease FtsH involved in quality control forms a complex with a membrane-bound chaperone. This suggests that proteolytic quality control may function during TM protein biogenesis³³. If a TM protein is fused improperly to D94N-3TM or D94N, the integrity of its topology is probably disturbed, which may subsequently disturb proper folding of the protein in the membrane. A TM protein that does not possess properly oriented TM segments in the membrane cannot be able to form the correct tertiary structure⁵ and may be removed by the quality control system. This scenario may explain the different expression levels of target proteins when fused downstream of D94N-3TM or D94N (Figs 3B,C and 4B,C). In all cases the TM proteins (Figs 3 and 4) were well expressed when fused to the correct expression vector, demonstrating the effectiveness of our method in determining their N-terminal orientation.

EmrE and SugE, which belong to the small multidrug resistance protein family, can present in either orientation in the membrane (dual-topology)^4,5, are included in our study. EmrE is known to form and function in an antiparallel homodimer^34,35,36,37, but the formation of antiparallel homodimer is still a question. However, EmrE and SugE showed only a N_in orientation by our method which might indicate that EmrE and SugE should be initiated from cytoplasm during topogenesis. This result suggests a topological reorientation scenario⁴, i.e., the TM protein first inserts into the membrane in one orientation and then switches to the final state. Seppälä S et al.³⁶ had demonstrated that the topology of TM proteins remains uncertain till the last reside has been translated. Initiation at the other site, as the case of D94N-3TM-EmrE, might be adverse to topogenesis due to the asymmetry character of lipid bilayer^5,38. Although a cellular system directly involved in topological reorientation has not been found, posttranslational reorientation has been reported for eukaryotic, bacterial and viral TM proteins^4,5. The bioinformatics analysis indicates that dual orientation of small multidrug resistance proteins may be associated with the lipid composition³⁹ which is known to regulate reorientation of TM proteins^5,40,41. Perhaps the reorientation of small multidrug resistance proteins is induced by a lipid composition change in membrane microdomains driven by specific cellular conditions, e.g., stress^18,38. Since some data had suggested that the dimerization of small multidrug resistance transporters can stabilize the complex³⁷, we cannot exclude the possibility that the instability of D94N-3TM-EmrE and -SugE is due to a possible impeded dimerization with the endogenous EmrE and SugE.

In addition, we could not detect recombinant CobS in E. coli system (Supplementary Fig. S6), and, therefore, its orientation would have been impossible to investigate biochemically¹³. However, by fusing CobS to D94N, it was expressed at a detectable level and its N-terminal orientation was, therefore, easily determined (Fig. 4C). Our D94N-based expression vector had been shown to increase the amounts of some TM proteins expressed in E. coli¹⁷, which is useful when a TM protein is otherwise poorly expressed²² and increase the possibility to obtain the C-terminal orientation of the poorly expressed TM proteins, as the case of CobS (Supplementary Fig. S5). In addition, the C-terminal orientation of UppP also determined by the same strategy had been published²².

In summary, we present herein an easy method that provides experimental evidence for the orientation of the N-terminal end of a TM protein in a membrane environment and thereby eliminates the need for predictive algorithms that may not incorporate the complexity needed to accurately determine TM protein topology. We validated our method using three TM proteins with known N_out orientations and one protein with a proposed N_in orientation²² (Fig. 3A,B). We also applied our method to seven other TM proteins (Fig. 4B,C). Combining our method with other biochemical and computational tools will allow a more comprehensive assessment of the topology of TM proteins.

Methods

Protein and DNA sequence design

The crystal structure of soluble cytochrome b562 from E. coli (PDB code: 3DE9)⁴² was selected as the initial model, while the cytochrome b of membrane-bound cytochrome bc1 complex was selected as the reference for amino acid substitution (PDB code: 3H1J; chain C)⁴³. In cytochrome b there are two heme group in the four-helical core (TM1-TM4). The environment of the protruding heme group of cytochrome b is similar to that of cytochrome b562 and used as the base for superimposing their four-helical structures (Fig. 1A). The exterior residues of cytochrome b562 were replaced by the corresponding ones on the cytochrome b (Fig. S1). The residues that cannot be well aligned were changed to the most frequent residues found in TM segments⁴⁴ based on the reasonable interaction of which to milieu. Two helical turns, ILTGLLLAM and VQYGWLI, were captured from the cytochrome b and inserted at the C-terminus of TM1 and the N-terminus of TM2 to achieve the thickness of a membrane (Fig. 1C, underlined; Fig. S1). The structural modeling and refinement was done by PyMol⁴⁵ and the geometry was optimized by the structural idealization function in RefMac⁴⁶. The prediction of ΔG for transmembrane helix insertion was done by the ΔG prediction server (http://dgpred.cbr.su.se/index.php? p = TMpred)⁴⁷ while the calculation of hydrophobic layer was completed by PPM server (http://opm.phar.umich.edu/server.php)⁴⁸. All four helices are predicted to have a negative value of ΔG_app, which suggests that the sequence is recognized as a TM segment by the Sec translocon. Moreover, the computational result of PPM server (http://opm.phar.umich.edu/server.php)⁴⁸ indicates the depth of the hydrophobic layer is around 32 Å in the model, which agrees with the thickness of a membrane.

Gene cloning and protein expression

The DNA sequence of 4TM was generated by the “Codon Optimization Tool” at the website of Integrated DNA Technologies Inc. (http://sg.idtdna.com/CodonOpt), which optimizes the DNA sequence for increasing the possibility of the high-yield protein expression in E. coli. This sequence was inserted downstream to the gene of D94N/His-tag¹⁷ and upstream of enhanced green fluorescent protein (EGFP) in the expression vector, pET21-b(+). A stop codon TAA was inserted at the end of EGFP DNA sequence. The DNA sequence was confirmed by using the “Rare Codon Tool” from the website of GeneScript Inc. (http://www.genscript.com), where the codon adaptation index (CAI) was suitable for the E.coli (CAI = 0.72). The primer sequences used to clone the genes of target membrane proteins from BL21(DE3) were obtained from the European Nucleotide Archive (http://www.ebi.ac.uk/ena). All needed oligonucleotides were synthesized by Genomics BioSci & Tech (Taiwan). The DNA sequences were cloned from BL21(DE3) genome by PCR and then inserted into D94N-3TM and D94N¹⁷ fusion vectors. The inserted sequences were confirmed by sequencing. The plasmids were transformed into E. coli C41(DE3) and the cells were incubated at 37 °C in Terrific broth⁴⁹ containing 100 μg/mL ampicillin. When OD₆₀₀ of medium reached 0.6, 0.4 mM isopropyl-β-D-thiogalactoside and 5 mM all-trans retinal (Sigma) were added and the cells were incubated for another 16 h at 20 °C. After spun down at 4000 × g for 10 min, the cells were resuspended in the buffer, 50 mM Tris-HCl pH 7.5, 500 mM NaCl, benzonase 2 U/ml, lysozyme 200 μg/ml and protease cocktail (Roche). The cells were disrupted by Constant Cell Disruption Systems (Constant Systems Ltd) and centrifuged at 4000 × g for 10 min. Each supernatant was centrifuged at 84,000 × g for 35 min to harvest the membrane fraction. The precipitates were washed with 10-fold volumes of storage buffer, 50 mM Tris-HCl pH 7.5, 500 mM NaCl, 10% (w/v) glycerol, before centrifugation at 84,000 × g for 1 h. Finally, the membrane fraction was resuspended in the storage buffer and stored at −80 °C for later use. Protein concentrations were measured by the Bio-Rad Protein Assay.

Western blotting analysis

The total protein of 50 μL induced cell culture (OD₆₀₀ of 1.0) or 20 μg membrane fraction was resolved on 12% SDS-PAGE. The protein was transferred onto nitrocellulose membrane and visualized by following the standard protocol of SuperSignal^TM West HisProbe Kit (Thermo Fisher Scientific).

Spot test and patch test

By the method described in the article²⁴ with minor modifications, the transformed cells were firstly incubated in Luria-Bertani broth containing 50 μg/mL kanamycin till OD₆₀₀ was 1.0 and then induced by 0.4 mM isopropyl-thio-β-D-galactopyranoside for 5 h at 37 °C. For spot assay, the culture mediums were diluted 1000-fold and 1 μl of the dilutions was spread onto the culture plate containing 50 and 100 μg/mL ampicillin and 0.2 mM isopropyl-thio-β-D-galactopyranoside. For patch assay, 1μl of each cultured medium was spread onto the culture plate containing 10 μg/mL ampicillin and 0.2 mM isopropyl-thio-β-D-galactopyranoside. The plates were incubated for 16 h at 37 °C to see the consequences of colony growth.

Whole cell fluorescence

The whole cell fluorescence estimation followed the protocol described by Daley et al.¹³. After washed by the buffer containing 50 mM Tris-HCl pH 8.0, 200 mM NaCl and 15 mM EDTA, the cell pellet was resuspended in the same buffer and incubated at room temperature for 1h and assayed for EGFP fluorescence. Fluorescence was measured in Synergy™ H4 Hybrid Multi-Mode Microplate Reader (BioTEK) with an excitation wavelength of 485 nm and an emission wavelength of 520 nm.

Additional Information

How to cite this article: Lee, C.-H. et al. Determining the N-terminal orientations of recombinant transmembrane proteins in the Escherichia coli plasma membrane. Sci. Rep. 5, 15086; doi: 10.1038/srep15086 (2015).

References

Dalbey, R. E., Wang, P. & Kuhn, A. Assembly of bacterial inner membrane proteins. Annu. Rev. Biochem. 80, 161–187, 10.1146/annurev-biochem-060409-092524 (2011).
Article CAS PubMed Google Scholar
Krogh, A., Larsson, B., von Heijne, G. & Sonnhammer, E. L. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J. Mol. Biol. 305, 567–580, 10.1006/jmbi.2000.4315 (2001).
Article CAS PubMed Google Scholar
Overington, J. P., Al-Lazikani, B. & Hopkins, A. L. How many drug targets are there? Nat Rev. Drug. Discov. 5, 993–996, 10.1038/nrd2199 (2006).
Article CAS PubMed Google Scholar
von Heijne, G. Membrane-protein topology. Nat. Rev. Mol. Cell Biol. 7, 909–918, 10.1038/nrm2063 (2006).
Article CAS PubMed Google Scholar
Bogdanov, M., Dowhan, W. & Vitrac, H. Lipids and topological rules governing membrane protein assembly. Biochim. Biophys. Acta 1843, 1475–1488, 10.1016/j.bbamcr.2013.12.007 (2014).
Liang, J., Naveed, H., Jimenez-Morales, D., Adamian, L. & Lin, M. Computational studies of membrane proteins: models and predictions for biological understanding. Biochim. Biophys. Acta 1818, 927–941, 10.1016/j.bbamem.2011.09.026 (2012).
Ott, C. M. & Lingappa, V. R. Integral membrane protein biosynthesis: why topology is hard to predict. J. Cell Sci. 115, 2003–2009 (2002).
CAS PubMed Google Scholar
van Geest, M. & Lolkema, J. S. Membrane Topology and Insertion of Membrane Proteins: Search for Topogenic Signals. Microbiol. Mol. Biol. Rev. 64, 13–33, 10.1128/mmbr.64.1.13-33.2000 (2000).
Article CAS PubMed PubMed Central Google Scholar
Lee, H. & Kim, H. Membrane topology of transmembrane proteins: determinants and experimental tools. Biochem. Biophys. Res. Commun. 453, 268–276, 10.1016/j.bbrc.2014.05.111 (2014).
Article CAS PubMed Google Scholar
Nugent, T. & Jones, D. T. Membrane protein orientation and refinement using a knowledge-based statistical potential. BMC Bioinformatics 14, 276, 10.1186/1471-2105-14-276 (2013).
Article CAS PubMed PubMed Central Google Scholar
Kimmett, T. et al. ProBLM web server: protein and membrane placement and orientation package. Comput. Math. Methods Med. 2014, 838259, 10.1155/2014/838259 (2014).
Article PubMed PubMed Central Google Scholar
Lomize, M. A., Lomize, A. L., Pogozheva, I. D. & Mosberg, H. I. OPM: orientations of proteins in membranes database. Bioinformatics 22, 623–625, 10.1093/bioinformatics/btk023 (2006).
Article CAS Google Scholar
Daley, D. O. et al. Global topology analysis of the Escherichia coli inner membrane proteome. Science 308, 1321–1323, 10.1126/science.1109730 (2005).
Article CAS Google Scholar
Luirink, J., Yu, Z., Wagner, S. & de Gier, J. W. Biogenesis of inner membrane proteins in Escherichia coli. Biochim. Biophys. Acta 1817, 965–976, 10.1016/j.bbabio.2011.12.006 (2012).
Article CAS PubMed Google Scholar
Sato, M., Hresko, R. & Mueckler, M. Testing the charge difference hypothesis for the assembly of a eucaryotic multispanning membrane protein. J. Biol. Chem. 273, 25203–25208 (1998).
Article CAS Google Scholar
Wagner, S., Bader, M. L., Drew, D. & de Gier, J. W. Rationalizing membrane protein overexpression. Trends Biotechnol. 24, 364–371, 10.1016/j.tibtech.2006.06.008 (2006).
Article CAS PubMed Google Scholar
Hsu, M. F. et al. Using Haloarcula marismortui bacteriorhodopsin as a fusion tag for enhancing and visible expression of integral membrane proteins in Escherichia coli. PLoS One 8, e56363, 10.1371/journal.pone.0056363 (2013).
Article CAS ADS PubMed PubMed Central Google Scholar
Akiyama, Y. Quality control of cytoplasmic membrane proteins in Escherichia coli. J. Biochem. 146, 449–454, 10.1093/jb/mvp071 (2009).
Article CAS PubMed Google Scholar
Dalbey, R. E., Wang, P. & van Dijl, J. M. Membrane proteases in the bacterial protein secretion and quality control pathway. Microbiol. Mol. Biol. Rev. 76, 311–330, 10.1128/MMBR.05019-11 (2012).
Article CAS PubMed PubMed Central Google Scholar
Efremov, R. G. & Sazanov, L. A. Structure of the membrane domain of respiratory complex I. Nature 476, 414–420, 10.1038/nature10330 (2011).
Article CAS ADS PubMed Google Scholar
Bertero, M. G. et al. Insights into the respiratory electron transfer pathway from the structure of nitrate reductase A. Nat. Struct. Biol. 10, 681–687, 10.1038/nsb969 (2003).
Article CAS PubMed Google Scholar
Chang, H. Y., Chou, C. C., Hsu, M. F. & Wang, A. H. Proposed carrier lipid-binding site of undecaprenyl pyrophosphate phosphatase from Escherichia coli. J. Biol. Chem. 289, 18719–18735, 10.1074/jbc.M114.575076 (2014).
Article CAS PubMed PubMed Central Google Scholar
Kawate, T. & Gouaux, E. Fluorescence-detection size-exclusion chromatography for precrystallization screening of integral membrane proteins. Structure 14, 673–681, 10.1016/j.str.2006.01.013 (2006).
Article CAS PubMed Google Scholar
Broome-Smith, J. K. & Spratt, B. G. A vector for the construction of translational fusions to TEM beta-lactamase and the analysis of protein export signals and membrane protein topology. Gene 49, 341–349 (1986).
Article CAS Google Scholar
Feilmeier, B. J., Iseminger, G., Schroeder, D., Webber, H. & Phillips, G. J. Green fluorescent protein functions as a reporter for protein localization in Escherichia coli. J. Bacteriol. 182, 4068–4076 (2000).
Article CAS Google Scholar
Tsirigos, K. D., Peters, C., Shu, N., Kall, L. & Elofsson, A. The TOPCONS web server for consensus prediction of membrane protein topology and signal peptides. Nucleic Acids Res. 43, W401–W407 10.1093/nar/gkv485 (2015).
Article CAS PubMed PubMed Central Google Scholar
Käll, L., Krogh, A. & Sonnhammer, E. L. L. Advantages of combined transmembrane topology and signal peptide prediction—the Phobius web server. Nucleic Acids Res. 35, W429–W432, 10.1093/nar/gkm256 (2007).
Article PubMed PubMed Central Google Scholar
Higy, M., Junne, T. & Spiess, M. Topogenesis of membrane proteins at the endoplasmic reticulum. Biochemistry 43, 12716–12722, 10.1021/bi048368m (2004).
Article CAS PubMed Google Scholar
Devaraneni, P. K. et al. Stepwise insertion and inversion of a type II signal anchor sequence in the ribosome-Sec61 translocon complex. Cell 146, 134–147, 10.1016/j.cell.2011.06.004 (2011).
Article CAS PubMed PubMed Central Google Scholar
Ter Horst, R. & Lolkema, J. S. Rapid screening of membrane topology of secondary transport proteins. Biochim. Biophys. Acta 1798, 672–680, 10.1016/j.bbamem.2009.11.010 (2010).
Article CAS PubMed Google Scholar
ter Horst, R. & Lolkema, J. S. Membrane topology screen of secondary transport proteins in structural class ST[3] of the MemGen classification. Confirmation and structural diversity. Biochim. Biophys. Acta 1810, 72–81, 10.1016/j.bbamem.2011.09.021 (2012).
Article CAS Google Scholar
Sato, M. & Mueckler, M. A conserved amino acid motif (R-X-G-R-R) in the Glut1 glucose transporter is an important determinant of membrane topology. J. Biol. Chem. 274, 24721–24725 (1999).
Article CAS Google Scholar
van Bloois, E. et al. Detection of cross-links between FtsH, YidC, HflK/C suggests a linked role for these proteins in quality control upon insertion of bacterial inner membrane proteins. FEBS Lett. 582, 1419–1424, 10.1016/j.febslet.2008.02.082 (2008).
Article CAS PubMed Google Scholar
Chen, Y. J. et al. X-ray structure of EmrE supports dual topology model. Proc. Natl. Acad. Sci. USA 104, 18999–19004, 10.1073/pnas.0709387104 (2007).
Article ADS PubMed Google Scholar
Rapp, M., Seppala, S., Granseth, E. & von Heijne, G. Emulating membrane protein evolution by rational design. Science 315, 1282–1284, 10.1126/science.1135406 (2007).
Article CAS ADS PubMed Google Scholar
Seppälä, S., Slusky, J. S., Lloris-Garcera, P., Rapp, M. & von Heijne, G. Control of membrane protein topology by a single C-terminal residue. Science 328, 1698–1700, 10.1126/science.1188950 (2010).
Article CAS ADS PubMed Google Scholar
Kolbusz, M. A., ter Horst, R., Slotboom, D. J. & Lolkema, J. S. Orientation of small multidrug resistance transporter subunits in the membrane: correlation with the positive-inside rule. J. Mol. Biol. 402, 127–138, 10.1016/j.jmb.2010.07.019 (2010).
Article CAS PubMed Google Scholar
Vigh, L. et al. The significance of lipid composition for membrane activity: new concepts and ways of assessing function. Prog. Lipid Res. 44, 303–344, 10.1016/j.plipres.2005.08.001 (2005).
Article CAS PubMed Google Scholar
Bay, D. C. & Turner, R. J. Membrane composition influences the topology bias of bacterial integral membrane proteins. Biochim. Biophys. Acta 1828, 260–270, 10.1016/j.bbamem.2012.09.003 (2013).
Article CAS PubMed Google Scholar
Bogdanov, M. & Dowhan, W. Lipid-dependent generation of dual topology for a membrane protein. J. Biol. Chem. 287, 37939–37948, 10.1074/jbc.M112.404103 (2012).
Article CAS PubMed PubMed Central Google Scholar
Vitrac, H., Bogdanov, M. & Dowhan, W. In vitro reconstitution of lipid-dependent dual topology and postassembly topological switching of a membrane protein. Proc. Natl. Acad. Sci. USA 110, 9338–9343, 10.1073/pnas.1304375110 (2013).
Article ADS PubMed Google Scholar
Salgado, E. N., Lewis, R. A., Mossin, S., Rheingold, A. L. & Tezcan, F. A. Control of protein oligomerization symmetry by metal coordination: C2 and C3 symmetrical assemblies through Cu(II) and Ni(II) coordination. Inorg. Chem. 48, 2726–2728, 10.1021/ic9001237 (2009).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Z. et al. Electron transfer by domain movement in cytochrome bc1. Nature 392, 677–684, 10.1038/33612 (1998).
Article CAS ADS PubMed Google Scholar
Senes, A. Computational design of membrane proteins. Curr. Opin. Struct. Biol. 21, 460–466, 10.1016/j.sbi.2011.06.004 (2011).
Article CAS PubMed Google Scholar
Schrodinger, L. L. C. The PyMOL Molecular Graphics System. Version 1.2r2 (2009).
Vagin, A. A. et al. REFMAC5 dictionary: organization of prior chemical knowledge and guidelines for its use. Acta Crystallogr., Sect. D: Biol. Crystallogr. 60, 2184–2195, 10.1107/S0907444904023510 (2004).
Article CAS Google Scholar
Hessa, T. et al. Molecular code for transmembrane-helix recognition by the Sec61 translocon. Nature 450, 1026–1030, 10.1038/nature06387 (2007).
Article CAS ADS PubMed Google Scholar
Lomize, M. A., Pogozheva, I. D., Joo, H., Mosberg, H. I. & Lomize, A. L. OPM database and PPM web server: resources for positioning of proteins in membranes. Nucleic Acids Res. 40, D370–376, 10.1093/nar/gkr703 (2012).
Article CAS PubMed Google Scholar
Tartoff, K. D. & Hobbs, C. A. Improved Media for Growing Plasmid and Cosmid Clones. Bethesda research laboratory focus 9, 12 (1987).
Google Scholar

Download references

Acknowledgements

We thank the support from National Research Program for Biopharmaceuticals (Grant NSC 10102325-B-492-001).

Author information

Authors and Affiliations

The Institute of Biological Chemistry, Academia Sinica, Taipei, Taiwan
Chien-Hsien Lee, Chia-Cheng Chou, Min-Feng Hsu & Andrew H.-J. Wang
Core Facilities for Protein Structural Analysis, Academia Sinica, Taipei, Taiwan
Chien-Hsien Lee, Chia-Cheng Chou, Min-Feng Hsu & Andrew H.-J. Wang

Authors

Chien-Hsien Lee
View author publications
You can also search for this author in PubMed Google Scholar
Chia-Cheng Chou
View author publications
You can also search for this author in PubMed Google Scholar
Min-Feng Hsu
View author publications
You can also search for this author in PubMed Google Scholar
Andrew H.-J. Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.H.L. and C.C.C. designed the research and wrote the manuscript; C.H.L. performed the experiment; M.F.H. prepared the constructs and the expression system; A.H.J.W. directed the work and revised the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Lee, CH., Chou, CC., Hsu, MF. et al. Determining the N-terminal orientations of recombinant transmembrane proteins in the Escherichia coli plasma membrane. Sci Rep 5, 15086 (2015). https://doi.org/10.1038/srep15086

Download citation

Received: 09 June 2015
Accepted: 11 September 2015
Published: 14 October 2015
DOI: https://doi.org/10.1038/srep15086

This article is cited by

The de novo design of a biocompatible and functional integral membrane protein using minimal sequence complexity
- Christophe J. Lalaurie
- Virginie Dufour
- Paul Curnow
Scientific Reports (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.