Synthesis runs counter to directional folding of a nascent protein domain

Chen, Xiuqi; Rajasekaran, Nandakumar; Liu, Kaixian; Kaiser, Christian M.

doi:10.1038/s41467-020-18921-8

Download PDF

Article
Open access
Published: 09 October 2020

Synthesis runs counter to directional folding of a nascent protein domain

Nature Communications volume 11, Article number: 5096 (2020) Cite this article

4954 Accesses
13 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Folding of individual domains in large proteins during translation helps to avoid otherwise prevalent inter-domain misfolding. How folding intermediates observed in vitro for the majority of proteins relate to co-translational folding remains unclear. Combining in vivo and single-molecule experiments, we followed the co-translational folding of the G-domain, encompassing the first 293 amino acids of elongation factor G. Surprisingly, the domain remains unfolded until it is fully synthesized, without collapsing into molten globule-like states or forming stable intermediates. Upon fully emerging from the ribosome, the G-domain transitions to its stable native structure via folding intermediates. Our results suggest a strictly sequential folding pathway initiating from the C-terminus. Folding and synthesis thus proceed in opposite directions. The folding mechanism is likely imposed by the final structure and might have evolved to ensure efficient, timely folding of a highly abundant and essential protein.

A short translational ramp determines the efficiency of protein synthesis

Article Open access 18 December 2019

Structural basis of early translocation events on the ribosome

Article Open access 07 July 2021

Co-translational assembly orchestrates competing biogenesis pathways

Article Open access 09 March 2022

Introduction

How proteins fold into the native structures that enable their cellular functions remains a central question in biology. Folding of small proteins or single domains often follows single-exponential kinetics, suggesting a highly cooperative process¹. However, experimental and computational studies have detected partially folded intermediates for a multitude of proteins, and proteins larger than ~100 amino acids are generally thought to populate folding intermediates before reaching their native structure^2,3.

While the presence of intermediates in most protein folding pathways is universally recognized, their functional roles are less clear. Partially structured states can represent on-pathway intermediates along a multistep folding pathway, or off-pathway misfolded states that must dissolve before productive folding can be achieved^2,4. Distinguishing these two scenarios experimentally is often challenging, although single-molecule approaches have demonstrated their potential to provide this information^5,6.

How intermediates are linked together into a folding pathway remains a subject of debate^7,8. The foldon hypothesis posits that small, cooperative units acquire structure in a strictly sequential order, resulting in a single folding pathway⁹. Considerations based on energy landscape theory propose that a folding protein can reach its native structure through several accessible routes¹⁰, although specific pathways may be energetically favored. Either theory is supported by evidence from both simulations and experiments. It thus seems possible that proteins might fall into distinct categories based on their folding mechanism. Determining the relationship between amino acid sequence, folding pathway and final structure is important for designing sequences that can adopt novel structures and for understanding how natural proteins robustly reach their functional conformations.

Folding pathways are not only shaped by protein-specific properties, but also depend strongly on environmental conditions, many of which differ greatly between in vitro refolding experiments and folding in living cells¹¹. The vast majority of mechanistic folding studies has been carried out with isolated proteins or domains¹². In the cell, proteins begin to fold while they gradually emerge from the ribosome as it translates the information in the messenger RNA into a polypeptide sequence. Biophysical experiments have shown that interactions of the ribosome with the proximal part of the nascent polypeptide can reduce the stability of native domains^13,14, stabilize secondary structure inside the ribosome exit tunnel^15,16,17, modulate nascent chain folding kinetics¹⁸, and prevent misfolding^18,19,20. Co-translational folding is particularly important for large multi-domain proteins, because it prevents the accumulation of extended unfolded regions during synthesis that have a high propensity for misfolding²¹. Several examples show that the ribosome does not necessarily change the folding pathway^19,22,23, but the connection between vectorial emergence and folding of the nascent chain remains poorly defined.

Here, we have investigated folding of the N-terminal G-domain of Escherichia coli elongation factor G (EF-G). Following nascent chain folding in bacterial cells with an arrest peptide-based reporter assay²⁴, we find that productive folding is not initiated until the full domain has emerged from the ribosome. Force-spectroscopy experiments with optical tweezers confirmed the absence of stable structure at shorter chain lengths. These single-molecule measurements also showed that folding of the full domain proceeds through productive folding intermediates both on the ribosome and in isolation. Our studies thus reveal a strictly ordered folding pathway in which the first step requires the extreme C-terminus of the domain. Folding and synthesis of the domain therefore proceed in opposite directions.

Results

Detection of co-translational folding in vivo with a luciferase reporter

To investigate folding of the G-domain from EF-G, we took advantage of the 17 amino acid arrest peptide from the E. coli SecM protein²⁵ (termed SecM17 here). While the residue at position 17 is important for arrest function, it is not incorporated into the nascent polypeptide, and SecM17 causes elongation arrest when its N-terminal 16 amino acids have been synthesized²⁶. Folding-mediated release of elongation arrest results in translation of the coding sequence downstream of SecM17 and production of the full-length encoded protein²⁴. The arrest peptide can thus be utilized to detect nascent chain folding that occurs in close proximity to the ribosome (Fig. 1). The assay has mainly been used to study folding in in vitro translation systems, quantifying arrest release by autoradiography^{23,27,28,29,30,31,32,33}. To monitor folding in living cells, we utilized a genetically encoded luciferase reporter, NanoLuc³⁴ (Fig. 1).

**Fig. 1: NanoLuc reporter assay for monitoring nascent chain folding in vivo.**

To validate the in vivo folding assay, we generated reporter plasmids in which the G-domain is connected to SecM17 through 4, 17, or 30 residues of the subsequent domain II, resulting in constructs termed G+20, G+33, and G+46 (Fig. 2a). The numbers represent the sum of the domain II and SecM17 residues that separate the G-domain from the peptidyl-transferase center (PTC) at the arrest point. The G-domain (residues 1–293 of EF-G) has been shown to stably fold upon emerging from the ribosome exit tunnel^18,35, which sequesters ~30–40 residues of the nascent chain inside the large subunit of the ribosome.

**Fig. 2: The G-domain folds upon emerging from the ribosome during EF-G synthesis.**

The G+33 construct, which places the G-domain close to the tunnel exit, yielded a high luminescence signal upon expression in E. coli cells (Fig. 2b, bar diagram). At this length, the stably folded domain abuts the ribosome, generating a pulling force that destabilizes SecM arrest²⁴. Consequently, translation resumes and the NanoLuc reporter is synthesized. As expected, the other two constructs yielded lower levels of reporter expression (Fig. 2b, bar diagram). In G+20, sequestration of the C-terminal ~15 amino acids in the tunnel destabilizes the G-domain; in G+46, unfolded domain II polypeptide separates the folded G-domain from the ribosome, preventing the generation of tension on the nascent chain (Fig. 2b, cartoons).

To verify that the luminescence readouts in our experiments reflect increased NanoLuc accumulation, rather than reduced specific activity of NanoLuc caused by fusion to poorly folded polypeptides³⁶, we visualized arrested and full-length translation products by Western blotting. We observed similar levels of arrested protein for all three constructs (magenta arrowhead in Fig. 2b). After arrest release, ribosomes continue to elongate and synthesize the luciferase reporter, resulting in accumulation of full-length protein over time. The significantly higher amount of full-length product for G+33 compared to the control constructs (blue arrowhead in Fig. 2b) therefore reflects an increased arrest release rate. This result indicates that elevated luminescence indeed reports on folding-mediated release of SecM arrest.

Colony luminescence identifies arrest-releasing candidates

The folding of some small proteins or domains is well described by a two-state model, while proteins of the size of the G-domain usually populate intermediates during folding^2,3. To determine whether intermediates are formed co-translationally, we generated 75 individual reporter constructs with EF-G inserts ranging in length from 72 to 368 amino acids (Fig. 3a). When the ribosome stalls at the SecM sequence, the separation of the N-terminal G-domain residue from the PTC of the ribosome, referred to here as length L (Fig. 3b), ranges from 88 to 384 aa (16 residues from SecM plus 72 and 368 EF-G residues, respectively).

**Fig. 3: On-plate screening identifies nascent chain folding in a pool of constructs.**

We transformed E. coli cells with a mixture of all plasmids at similar concentrations and grew colonies on inducing agar. We applied the luciferase substrate to the plate and imaged the resulting colony luminescence (Fig. 3c). Analyzing the light emission of individual colonies allowed us to distinguish highly luminescent colonies from nonluminescent (dark) colonies (Supplementary Fig. 1). Sequencing revealed that constructs yielding highly luminescent colonies all encoded candidate proteins with lengths ranging from 308 to 328 amino acids (Fig. 3d, cyan dots). It thus appears that colony luminescence reliably identifies arrest-releasing constructs in the region where full folding of the G-domain is expected. Nonluminescent colonies, selected from a small area of the plate that also contained highly luminescent colonies (Fig. 3c, blowup), all had candidate lengths outside this region (Fig. 3d, gray dots). This result suggests that, surprisingly, no stable intermediates are formed co-translationally, and that only full G-domain folding constitutes a major folding waypoint during EF-G synthesis.

Full in vivo folding profile of the G-domain

The on-plate assay provides a convenient format for analyzing a pool of candidates in a simple experiment. However, colony luminescence only provides a binary readout, and subtle differences in signal are not resolved. To compare arrest release rates more quantitatively, we assayed the 75 EF-G truncation constructs individually by growing them separately in liquid cultures. After normalizing by cell density, culture luminescence intensity shows a clear peak with the maximum at L = 332 (Fig. 4). Visualization of the arrested and released product in the region of 280 ≤ L ≤ 340 by Western blotting (Supplementary Fig. 2) confirmed that luciferase activity reports on arrest release. Strong arrest release is therefore detected when the C-terminal G-domain residue (aa 293) is separated from the PTC by 39 aa, a value similar to that observed for other relatively large domains³¹.

**Fig. 4: Folding profile of EF-G nascent chains.**

Notably, a signal increase to ~25% of the maximum intensity coincides with completion of G-domain synthesis (L = 309). At this length, offset from the maximum of the peak by 23 amino acids (Fig. 4, vertical bar), all of the G-domain except for the C-terminal helix has been extruded from the ribosome. The slightly elevated arrest release rates in this length range could suggest the formation of meta-stable structures outside the ribosome^29,32, or formation of secondary structure inside the exit tunnel^16,31,37. All chain lengths shorter than 308 amino acids exhibit only basal luciferase activity (below 20% of the peak value; gray box in Fig. 4), suggesting that the G-domain does not form stably folded intermediates. Given that the G-domain is large, this finding is unexpected.

Stably structured states are not formed until synthesis is complete

Several folding scenarios can explain why most of the G-domain truncations do not exhibit luminescence above the baseline level. If partially folded structures are destabilized by the ribosome, they may become stably folded only after an intervening spacer has been synthesized, preventing detection of these intermediates in the in vivo reporter assay. Alternatively, the G-domain may only begin to form stable structures once the complete domain has been synthesized and is mostly exposed to the outside of the ribosome.

To distinguish between these scenarios, we probed the structure of nascent EF-G polypeptides directly using single-molecule force spectroscopy with optical tweezers (Fig. 5). We generated terminally stalled ribosome-nascent chain complexes (RNCs) by in vitro translation of nonstop messenger RNAs¹⁸. Tag sequences at the N-terminus of the nascent chain and on protein L17 in the large ribosomal subunit allowed us to tether these complexes for mechanical manipulation with optical tweezers (Fig. 5a). Mechanical force acts as a denaturant that destabilizes folded structures. In force ramp experiments, a continuously increasing tension is applied to the tethered molecule. In the resulting force-extension curves, unfolding of nascent chain structure results in rips, sudden increases in molecular extension as the polypeptide transitions from a compact folded to an extended unfolded state. The method is thus suitable to detect tertiary structure in individual protein molecules.

**Fig. 5: Nascent chain structure probed with single-molecule force spectroscopy.**

In RNCs with a length of L = 328 (termed 328_RNC here), unfolding of the native G-domain results in a characteristic transition (Fig. 5b), as observed previously^18,35. As expected, a very short nascent chain (44_RNC) that exposes only a few EF-G residues outside the exit tunnel does not exhibit any transitions in these experiments. Surprisingly, however, substantially longer nascent chains (252_RNC and 316_RNC) do not exhibit defined unfolding transitions, either (Fig. 5b). Occasionally, we detect heterogeneous transitions at these chain lengths (Supplementary Fig. 3). Their distribution is distinct from that expected for unfolding of well-defined states and might be reminiscent of misfolded states that have been observed with other proteins^19,38. Regardless of what these heterogeneous transitions represent, the results from our single-molecule experiments indicate that the nascent G-domain does not form stable folding intermediates or collapsed states, even when almost all of its sequence has emerged from the ribosome.

Multistate folding of the full domain follows a strict order

To follow folding of the ribosome-bound G-domain, we carried out optical tweezers experiments with 328_RNC in force clamp mode. In these experiments, the force is held at a constant value while changes in molecular extension are recorded. After fully unfolding the G-domain, we jumped the force to 3.5 pN to initiate refolding. The molecule transitions repeatedly between the unfolded and partially folded states (Fig. 6a), reflecting the complexity in folding that is expected for a large domain. Transitions cease upon forming a fully structured stable state (Fig. 6a, open arrowhead). The partially structured states exhibit several distinct extensions that may represent productive on-pathway folding intermediates or misfolded off-pathway species. Folding to the native state occurs from a partially folded structure (Fig. 6a, gray arrowhead), demonstrating the presence of at least one on-pathway intermediate.

**Fig. 6: Refolding of the G-domain on and off the ribosome.**

A complex pattern of folding intermediates is also observed in force clamp experiments with the isolated G-domain in the absence of the ribosome (Fig. 6b). Previous work¹⁸ showed that the overall folding rate of 328_RNC is reduced compared to that of the isolated G-domain, implying interactions of the nascent chain with the ribosome prior to folding. The ribosome may therefore affect the stability of folding intermediates or transitions between them. Nevertheless, the intermediates exhibit similar extensions in the isolated G-domain and the ribosome-bound nascent chain, suggesting similar folding pathways in both scenarios. As observed for 328_RNC, the final folding step to the stably structured domain initiates from a partially structured state (Fig. 6b, gray arrowhead).

The on-pathway folding intermediate is ~10 nm more extended than the natively folded G-domain (Fig. 6a, b, gray vs. open arrowheads). At the refolding force of 3.5 pN, this length corresponds to ~90 amino acids of unfolded polypeptide, as calculated using a worm-like chain model³⁹ with a persistence length of 0.65 nm and a contour length of 0.36 nm per amino acid. The intermediate would therefore be composed of ~200 structured G-domain residues. Transitions consistent with unfolding of such a structure are observed in some of the force ramp recordings obtained with 328_RNC (Supplementary Fig. 3, dashed gray line). The absence of similar transitions in shorter nascent chains (252_RNC and 316_RNC, Fig. 5b and Supplementary Fig. 3) suggests that amino acids near the C-terminus of the G-domain are required for the formation of the obligatory on-pathway intermediate. Consistent with this observation, constant force measurements with the 316_RNC nascent chain do not exhibit compaction into a partially structured state (Fig. 6c and Supplementary Fig. 4). It thus appears that the G-domain remains largely unfolded during synthesis until its C-terminus, encompassing the last alpha-helix of the domain, has been extruded from the exit tunnel. Taken together, our measurements are consistent with a strictly ordered folding pathway that begins at the extreme C-terminus of the G-domain.

Discussion

We have defined the folding pathway of the nascent G-domain from EF-G using a combination of in vivo (Figs. 2–4) and single-molecule measurements (Figs. 5 and 6). Notably, no stable folding intermediates are detected in vivo or in vitro until the complete domain of 293 amino acids is synthesized. Previously, the small src SH3 domain (64 amino acids) was found to fold only upon reaching the exit of the ribosome tunnel²². However, polypeptides composed of more than 100 amino acids are commonly assumed to populate folding intermediates^2,3. It is thus surprising that the N-terminal ~250 amino acids of the G-domain do not appear to acquire stable structures.

Consistent with previous results^18,35, the G-domain forms a stable structure after fully emerging from the ribosome (L = 328), which manifests as a large peak in the SecM reporter assay (Fig. 4). Nonrandom signal fluctuations at shorter chain lengths (48 < L < 320) might represent secondary structure formations, especially α-helices, which have previously been shown to accelerate SecM arrest release^29,31,40. The identity of nascent chain sequences inside the ribosome exit tunnel has been described to also affect arrest release^29,31,32. Interactions of nascent chain residues with the ribosome could prevent formation of the SecM secondary structure that is required to cause stalling and could thus in principle account for the observed signal. Regardless of their origin, the amplitudes of these fluctuations are small compared to the main peak, suggesting the absence of stably folded structures until the G-domain is fully synthesized.

Single-molecule optical tweezers experiments confirm that folding begins only after synthesis is complete. We do not detect well-defined stable structures in nascent chains at lengths up to L = 316 (Fig. 5). Small, heterogeneous transitions that are occasionally observed for the incomplete G-domain (Supplementary Fig. 3) suggest misfolded states that are likely suppressed in vivo by molecular chaperones. Optical tweezers measurements readily detect partially folded (e.g., in alpha-synuclein³⁸ and T4 lysozyme¹⁹) or collapsed, molten globule-like structures (e.g., in ribonuclease H⁴¹ and apomyoglobin⁴²). The contact order⁴³ of the G-domain is moderately low (relative contact order: 0.07), and its overall hydrophobicity⁴⁴ is similar to that of other proteins (Supplementary Fig. 5). These factors therefore do not appear to account for the observed lack of early structure formation or collapse during synthesis. Its sequence properties might help to keep the G-domain extended and prevent (premature) collapse into kinetically trapped states⁴⁵. Ribosome interactions, previously shown to destabilize native^13,46 and non-native^19,35 nascent chain structures, might further contribute to keeping the nascent domain unfolded until its synthesis is complete.

Interestingly, our measurements suggest a relatively sharp transition in the propensity to form compact structures as the nascent chain is elongated. At L = 316, the nascent chain has properties of an intrinsically disordered protein, whereas the addition of just 12 amino acids to L = 328 results in the formation of collapsed or partially structured states and subsequent folding to the native structure (Figs. 5 and 6). Theoretical studies concluded that the formation of compact states is an evolved property of natural proteins⁴⁷. The G-domain may be an attractive model to investigate how this property is related to protein sequence and structure.

Our studies provide an example of folding occurring in the direction opposite to that of synthesis and contrast with previous findings of gradual compaction and folding concomitant with protein elongation⁴⁸. Decoupling of folding and synthesis has previously been observed. Folding of the N-terminal regions of the low-density lipoprotein receptor (LDL-R) in the endoplasmic reticulum is delayed by the formation of intermediates that are stabilized by non-native disulfide bonds, which slowly rearrange into the native configuration⁴⁹. Thus, the protein completes its folding post-translationally. LDL-R folds in the oxidizing environment of the endoplasmic reticulum, whereas the G-domain emerges from the ribosome into the cytosol, remaining unfolded. Once the full G-domain has emerged from the ribosome, folding occurs in several steps that appear similar on the ribosome and in isolation (Fig. 6). The ribosome therefore does not seem to change the folding pathway. The late onset of folding and the detection of defined intermediates suggest that folding proceeds along a sequential pathway. This scenario is consistent with the foldon hypothesis, in which well-defined states are formed in a prescribed order⁵⁰.

The strict sequentiality of G-domain folding might be dictated by the final structure. The region containing the C-terminal helix is largely buried in the folded structure (Fig. 6d, e). Perhaps folding must occur with the helix serving as a central nucleus around which the remainder of the structure is formed subsequently, rather than by inserting the helix into preformed intermediates. Interestingly, part of this enclosure around the C-terminal helix is formed by the G′ domain, an insertion present in some, but not all G-domain containing elongation factors⁵¹ (Supplementary Fig. 6). In future studies with homologous G-domains (such as those shown in Supplementary Fig. 6), it will be interesting to examine whether lack of the G′ insertion allows intermediate formation during synthesis and relaxes the strict order of folding that we observe here for EF-G. Nascent G-domains appear as attractive models for investigating how folding pathways co-evolve with structures that enable crucial cellular functions.

EF-G is a highly abundant protein (top 1% in the E. coli proteome⁵²) that fulfills an essential cellular function. The efficiency of EF-G synthesis and folding may have been under evolutionary pressure. The coding sequence contains very few rare codons (Supplementary Fig. 7), suggesting that it is translated without major pauses⁵³. Collapsed states and intermediates can kinetically trap folding proteins in non-native states⁴. Structure acquisition through a strictly ordered sequential pathway upon completion of synthesis might have evolved as a mechanism to ensure timely folding of EF-G.

Methods

Bacterial strains, plasmids, and reagents

In vivo folding experiments were carried out in E. coli strain Lemo21(DE3) (New England Biolabs, NEB, C2528J). All plasmids used in this study are based on a backbone with a pUC origin, Lac-operator-controlled T7 promoter, and Ampicillin resistance gene³⁵. The NanoLuc coding sequence was obtained as a synthetic DNA fragment (Integrated DNA Technologies). The SecM coding sequence was introduced through synthetic DNA fragments that served as primers for PCR amplification. Vector backbone and PCR products were assembled with Gibson Assembly Master Mix (NEB E2611), yielding plasmid pWP3. All primers used for cloning are listed with their sequences in Supplementary Information (Tables 1–3). The amino acid sequence of the arrest peptide used in this study is FSTPVWISQAQGIRAGP. All commercially available enzymes were purchased from NEB unless stated otherwise. PCR reactions were carried out with Phusion high-fidelity DNA polymerase (Thermo Scientific, F530S). Chemicals were purchased from Sigma-Aldrich unless stated otherwise. Streptavidin-HRP for Western blot detection was from SouthernBiotech™ (#7100–05).

Construction of EF-G truncation library

EF-G fragments of defined lengths were individually PCR-amplified with a NheI tailed universal forward primer (WP3-EF-G-uni-fwd) and reverse primers at designated positions along the EF-G open reading frame (WP3-EF-G-44 to WP3-EF-G-424; see Supplementary Table 2 for all primer sequences). The backbone containing the SecM-NanoLuc sequence was amplified from pWP3 with a SpeI tailed forward primer (WP3-bb-fw) at the AviTag and reverse primer at the SecM (WP3-bb-rv). After PCR clean-up, backbone and insert were mixed at a 1 to 3 ratio (40 fmol in total) in CutSmart^® Buffer (NEB) supplemented with 3-mM ATP and 10-mM DTT. The reaction was provided with 4U of each NheI, SpeI, T4PNK and T4 DNA ligase at 10 μl final volume. After 2 h at 37 and 16 °C overnight, the product was transformed and verified by colony PCR with Taq DNA polymerase. Plasmid DNA from colonies showing the correct insert sizes was isolated, and its correct sequences were verified by Sanger sequencing.

On-plate NanoLuc assay

Cells were transformed with indicated plasmids per manufacturer instructions and spread on a LB agar supplemented with ampicillin, chloramphenicol, 500 μM L-rhamnose and 500 μM IPTG on a 9 cm diameter round plate. Colonies were grown at 37 °C for 12–16 h. We reconstituted 500 μl Nano-Glo^® Live Cell Reagent (Promega N2011) for each plate and sprayed evenly onto the plate with an airbrush (Neo Iwata HP-CN N4500). Images were taken in a shaded home-made imaging box equipped with a Canon Rebel T3 camera, operated in raw image acquisition mode to avoid complications from camera-internal image processing. Camera settings were at Neutral between 5 and 15 s exposure and ISO800 for optimal contrast. Images were recorded with 8-bit color depth. To identify nascent chains that fold into stable structures, colony luminescence was quantified using custom Matlab scripts for image analysis. Circular areas of identical size (shown in Fig. 3c) were defined around colonies of interest, and integrated intensities were obtained by summing the intensities values of all pixels within these circles (see Supplementary Fig. 1). Colonies with an integrated intensity above 20,000 were designated as luminescent, colonies with integrated intensities below 10,000 were designated as nonluminescent (dark). For sequence analysis, we picked colonies that were well separated on the plate to avoid cross contamination between colonies. Dark colonies chosen for analysis were from an area of the plate that also contained highly luminescent colonies to rule out that uneven coating of the plate with luciferase substrate accounted to the lack of luminescence. DNA was amplified from selected colonies (circles in Fig. 3c) by PCR, and the resulting PCR products were analyzed by Sanger sequencing.

Liquid culture NanoLuc assay

Cells transformed with individual plasmids were spread on LB agar with antibiotics and allowed to form colonies overnight. LB supplemented with ampicillin and chloramphenicol was inoculated with a single colony to grow an overnight culture. Overnight cultures were diluted into fresh LB containing antibiotics to OD₆₀₀ = 0.01 and incubated in a 37 °C shaker at 220 rpm. Cell densities were monitored with a plate reader (ThermoMax Microplate Reader, Molecular Devices). At OD₆₀₀ = 0.2–0.4, cultures were induced with 500 μM L-rhamnose and 500 μM IPTG for 1 h. Cell densities were measured and 100 μl cultures were put onto a white round-bottom 96-well plate for luminescence measurement. All NanoLuc assays in this study were carried out with Nano-Glo^® Live Cell Assay from Promega (N2011) according to manufacturer instructions. Luminescence was measured on a GloMax^® Navigator Microplate Luminometer (Promega). Signals were linearly normalized to OD₆₀₀ = 0.4. Each data set was acquired using identical instrument settings to allow comparison between samples. The integration time chosen such that the highest signal did not exceed 1E9 RLU to avoid saturating the detector. For visualization of translation products by Western blotting, Streptavidin-HRP was used to detect biotinylated proteins in whole-cell lysates after ribonuclease treatment. Uncropped images of the Western blots shown in Fig. 2 and Supplementary Fig. 2 are provided in the Source Data file.

Single-molecule force spectroscopy with RNCs

Stalled RNCs were generated as described previously^18,35. Nonstop mRNA templates were generated by in vitro transcription of PCR products (see Supplementary Table 3 for primer sequences). Stalled RNCs were produced by in vitro translation, isolated by ultracentrifugation, and dissolved in HKMβ buffer (20 mM HEPES, 100 mM KCl, 5 mM MgCl_2, 5 mM β-mercaptoethanol, pH 7.4), and stored in small aliquots after flash-freezing. Single-molecule experiments were carried out using a custom home-built instrument⁵⁴. The experiments were carried out in HKMβ buffer. For force ramp experiments, the trap was moved at a constant speed of 100 nm/s to apply continuously increasing forces on the nascent chain in the range from 2 to 50 pN. The force was ramped down at the same loading rate, and the molecule was held at 2 pN for 10 s before being pulled again. Data were collected at a sampling frequency of 1000 Hz and averaged to 33 Hz for plotting. The extension changes were determined using custom MATLAB scripts as described in detail previously¹⁸. For constant force experiments, the molecule was first subjected to force ramp cycles to ensure it exhibited the characteristic unfolding transitions. After fully unfolding the molecule at 30 pN, the force was reduced to 3.5 pN to initiate refolding. The change in molecular extension was recorded at a sampling frequency of 1000 Hz. The measurement does not yield the absolute extension of the molecule. For measurements with 328_RNC and with the isolated G-domain, the extension of the folded state was defined as 0, all other extensions are relative to this reference state. For measurements with the unfolded 316_RNC, the relative extension is reported.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

PDB Accession Codes for data sets used in Fig. 6 and Supplementary Fig. 6 are 4V9P (EF-G), 6EZE (EF-Tu), 2H5E (RF3), and 3CB4 (LepA). All data are available from the corresponding author upon reasonable request. Source data are provided with this paper.

Code availability

The custom Matlab code that was used for data analysis in this study is available from the corresponding author upon reasonable request.

References

Sosnick, T. R. & Barrick, D. The folding of single domain proteins—have we reached a consensus? Curr. Opin. Struct. Biol. 21, 12–24 (2011).
Article CAS PubMed Google Scholar
Brockwell, D. J. & Radford, S. E. Intermediates: ubiquitous species on folding energy landscapes? Curr. Opin. Struct. Biol. 17, 30–37 (2007).
Article CAS PubMed PubMed Central Google Scholar
Malhotra, P. & Udgaonkar, J. B. How cooperative are protein folding and unfolding transitions? Protein Sci. 25, 1924–1941 (2016).
Article CAS PubMed PubMed Central Google Scholar
Sosnick, T. R., Mayne, L., Hiller, R. & Englander, S. W. The barriers in protein folding. Nat. Struct. Biol. 1, 149–156 (1994).
Article CAS PubMed Google Scholar
Stigler, J., Ziegler, F., Gieseke, A., Gebhardt, J. C. & Rief, M. The complex folding network of single calmodulin molecules. Science 334, 512–516 (2011).
Article ADS CAS PubMed Google Scholar
Bustamante, C., Alexander, L., Maciuba, K. & Kaiser, C. M. Single-molecule studies of protein folding with optical tweezers. Annu Rev. Biochem. 89, 443–470 (2020).
Article CAS PubMed PubMed Central Google Scholar
Englander, S. W. & Mayne, L. The case for defined protein folding pathways. Proc. Natl Acad. Sci. USA 114, 8253–8258 (2017).
Article CAS PubMed PubMed Central Google Scholar
Eaton, W. A. & Wolynes, P. G. Theory, simulations, and experiments show that proteins fold by multiple pathways. Proc. Natl Acad. Sci. USA 114, E9759–E9760 (2017).
Article CAS PubMed PubMed Central Google Scholar
Englander, S. W., Mayne, L., Kan, Z. Y. & Hu, W. Protein folding-how and why: by hydrogen exchange, fragment separation, and mass spectrometry. Annu. Rev. Biophys. 45, 135–152 (2016).
Article CAS PubMed PubMed Central Google Scholar
Onuchic, J. N. & Wolynes, P. G. Theory of protein folding. Curr. Opin. Struct. Biol. 14, 70–75 (2004).
Article CAS PubMed Google Scholar
Gruebele, M., Dave, K. & Sukenik, S. Globular protein folding in vitro and in vivo. Annu. Rev. Biophys. 45, 233–251 (2016).
Article CAS PubMed Google Scholar
Braselmann, E., Chaney, J. L. & Clark, P. L. Folding the proteome. Trends Biochem. Sci. 38, 337–344 (2013).
Article CAS PubMed PubMed Central Google Scholar
Cabrita, L. D. et al. A structural ensemble of a ribosome-nascent chain complex during cotranslational protein folding. Nat. Struct. Mol. Biol. 23, 278–285 (2016).
Article CAS PubMed PubMed Central Google Scholar
Samelson, A. J., Jensen, M. K., Soto, R. A., Cate, J. H. & Marqusee, S. Quantitative determination of ribosome nascent chain stability. Proc. Natl Acad. Sci. USA 113, 13402–13407 (2016).
Article CAS PubMed PubMed Central Google Scholar
Lu, J. & Deutsch, C. Folding zones inside the ribosomal exit tunnel. Nat. Struct. Mol. Biol. 12, 1123–1129 (2005).
Article CAS PubMed Google Scholar
Bhushan, S. et al. Alpha-Helical nascent polypeptide chains visualized within distinct regions of the ribosomal exit tunnel. Nat. Struct. Mol. Biol. 17, 313–317 (2010).
Article CAS PubMed Google Scholar
Holtkamp, W. et al. Cotranslational protein folding on the ribosome monitored in real time. Science 350, 1104–1107 (2015).
Article ADS CAS PubMed Google Scholar
Liu, K., Maciuba, K. & Kaiser, C. M. The ribosome cooperates with a chaperone to guide multi-domain protein folding. Mol. Cell 74, 310–319.e7 (2019).
Article CAS PubMed PubMed Central Google Scholar
Kaiser, C. M., Goldman, D. H., Chodera, J. D., Tinoco, I. Jr & Bustamante, C. The ribosome modulates nascent protein folding. Science 334, 1723–1727 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Alexander, L. M., Goldman, D. H., Wee, L. M. & Bustamante, C. Non-equilibrium dynamics of a nascent polypeptide during translation suppress its misfolding. Nat. Commun. 10, 2709 (2019).
Article ADS PubMed PubMed Central CAS Google Scholar
Kaiser, C. M. & Liu, K. Folding up and moving on—nascent protein folding on the ribosome. J. Mol. Biol. 430, 4580–4591 (2018).
Article CAS PubMed PubMed Central Google Scholar
Guinn, E. J., Tian, P., Shin, M., Best, R. B. & Marqusee, S. A small single-domain protein folds through the same pathway on and off the ribosome. Proc. Natl Acad. Sci. USA 115, 12206–12211 (2018).
Article CAS PubMed PubMed Central Google Scholar
Tian, P. et al. Folding pathway of an Ig domain is conserved on and off the ribosome. Proc. Natl Acad. Sci. USA 115, E11284–E11293 (2018).
Article CAS PubMed PubMed Central Google Scholar
Goldman, D. H. et al. Ribosome. Mechanical force releases nascent chain-mediated ribosome arrest in vitro and in vivo. Science 348, 457–460 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Nakatogawa, H. & Ito, K. The ribosomal exit tunnel functions as a discriminating gate. Cell 108, 629–636 (2002).
Article CAS PubMed Google Scholar
Muto, H., Nakatogawa, H. & Ito, K. Genetically encoded but nonpolypeptide prolyl-tRNA functions in the A site for SecM-mediated ribosomal stall. Mol. Cell 22, 545–552 (2006).
Article CAS PubMed Google Scholar
Nilsson, O. B. et al. Cotranslational protein folding inside the ribosome exit tunnel. Cell Rep. 12, 1533–1540 (2015).
Article CAS PubMed PubMed Central Google Scholar
Nilsson, O. B. et al. Cotranslational folding of spectrin domains via partially structured states. Nat. Struct. Mol. Biol. 24, 221–225 (2017).
Article CAS PubMed Google Scholar
Notari, L., Martinez-Carranza, M., Farias-Rico, J. A., Stenmark, P. & von Heijne, G. Cotranslational folding of a pentarepeat beta-helix protein. J. Mol. Biol. 430, 5196–5206 (2018).
Article CAS PubMed Google Scholar
Kudva, R. et al. The shape of the bacterial ribosome exit tunnel affects cotranslational protein folding. Elife 7, e36326 (2018).
Article PubMed PubMed Central Google Scholar
Farias-Rico, J. A., Ruud Selin, F., Myronidi, I., Fruhauf, M. & von Heijne, G. Effects of protein size, thermodynamic stability, and net charge on cotranslational folding on the ribosome. Proc. Natl Acad. Sci. USA 115, E9280–E9287 (2018).
Article CAS PubMed PubMed Central Google Scholar
Kemp, G., Kudva, R., de la Rosa, A. & von Heijne, G. Force-profile analysis of the cotranslational folding of HemK and Filamin domains: comparison of biochemical and biophysical folding assays. J. Mol. Biol. 431, 1308–1314 (2019).
Article CAS PubMed Google Scholar
Marino, J., von Heijne, G. & Beckmann, R. Small protein domains fold inside the ribosome exit tunnel. FEBS Lett. 590, 655–660 (2016).
Article CAS PubMed Google Scholar
Hall, M. P. et al. Engineered luciferase reporter from a deep sea shrimp utilizing a novel imidazopyrazinone substrate. ACS Chem. Biol. 7, 1848–1857 (2012).
Article CAS PubMed PubMed Central Google Scholar
Liu, K., Rehfus, J. E., Mattson, E. & Kaiser, C. M. The ribosome destabilizes native and non-native structures in a nascent multidomain protein. Protein Sci. 26, 1439–1451 (2017).
Article CAS PubMed PubMed Central Google Scholar
Waldo, G. S., Standish, B. M., Berendzen, J. & Terwilliger, T. C. Rapid protein-folding assay using green fluorescent protein. Nat. Biotechnol. 17, 691–695 (1999).
Article CAS PubMed Google Scholar
Ziv, G., Haran, G. & Thirumalai, D. Ribosome exit tunnel can entropically stabilize alpha-helices. Proc. Natl Acad. Sci. USA 102, 18956–18961 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Neupane, K., Solanki, A., Sosova, I., Belov, M. & Woodside, M. T. Diverse metastable structures formed by small oligomers of alpha-synuclein probed by force spectroscopy. PLoS One 9, e86495 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Bustamante, C., Marko, J. F., Siggia, E. D. & Smith, S. Entropic elasticity of lambda-phage DNA. Science 265, 1599–1600 (1994).
Article ADS CAS PubMed Google Scholar
Marsden, A. P. et al. Investigating the effect of chain connectivity on the folding of a beta-sheet protein on and off the ribosome. J. Mol. Biol. 430, 5207–5216 (2018).
Article CAS PubMed PubMed Central Google Scholar
Cecconi, C., Shank, E. A., Bustamante, C. & Marqusee, S. Direct observation of the three-state folding of a single protein molecule. Science 309, 2057–2060 (2005).
Article ADS CAS PubMed Google Scholar
Elms, P. J., Chodera, J. D., Bustamante, C. & Marqusee, S. The molten globule state is unusually deformable under mechanical force. Proc. Natl Acad. Sci. USA 109, 3796–3801 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Plaxco, K. W., Simons, K. T. & Baker, D. Contact order, transition state placement and the refolding rates of single domain proteins. J. Mol. Biol. 277, 985–994 (1998).
Article CAS PubMed Google Scholar
Roseman, M. A. Hydrophilicity of polar amino acid side-chains is markedly reduced by flanking peptide bonds. J. Mol. Biol. 200, 513–522 (1988).
Article CAS PubMed Google Scholar
Clark, P. L., Plaxco, K. W. & Sosnick, T. R. Water as a good solvent for unfolded proteins: folding and collapse are fundamentally different. J. Mol. Biol. 432, 2882–2889 (2020).
Article CAS PubMed PubMed Central Google Scholar
Waudby, C. A. et al. Systematic mapping of free energy landscapes of a growing filamin domain during biosynthesis. Proc. Natl Acad. Sci. USA 115, 9744–9749 (2018).
Article CAS PubMed PubMed Central Google Scholar
Thirumalai, D., Samanta, H. S., Maity, H. & Reddy, G. Universal nature of collapsibility in the context of protein folding and evolution. Trends Biochem. Sci. 44, 675–687 (2019).
Article CAS PubMed Google Scholar
Wruck, F., Katranidis, A., Nierhaus, K. H., Buldt, G. & Hegner, M. Translation and folding of single proteins in real time. Proc. Natl Acad. Sci. USA. 114, E4399–E4407 (2017).
Article CAS PubMed PubMed Central Google Scholar
Jansens, A., van Duijn, E. & Braakman, I. Coordinated nonvectorial folding in a newly synthesized multidomain protein. Science 298, 2401–2403 (2002).
Article ADS CAS PubMed Google Scholar
Englander, S. W. & Mayne, L. The nature of protein folding pathways. Proc. Natl Acad. Sci. USA. 111, 15873–15880 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Wittinghofer, A. & Vetter, I. R. Structure-function relationships of the G domain, a canonical switch motif. Annu. Rev. Biochem. 80, 943–971 (2011).
Article CAS PubMed Google Scholar
Schmidt, A. et al. The quantitative and condition-dependent Escherichia coli proteome. Nat. Biotechnol. 34, 104–110 (2016).
Article CAS PubMed Google Scholar
Clarke, T. F. T. & Clark, P. L. Rare codons cluster. PLoS One 3, e3412 (2008).
Article ADS PubMed PubMed Central CAS Google Scholar
Smith, S. B., Cui, Y. & Bustamante, C. Optical-trap force transducer that operates by direct measurement of light momentum. Methods Enzymol. 361, 134–162 (2003).
Article CAS PubMed Google Scholar
Pettersen, E. F. et al. UCSF Chimera–a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

C.M.K. acknowledges support from the National Institutes of Health (5R01GM121567) and from the Pew Biomedical Scholars Program.

Author information

Kaixian Liu
Present address: Molecular Biology Program, Sloan Kettering Institute, New York, NY, USA

Authors and Affiliations

CMDB Graduate Program, Johns Hopkins University, Baltimore, MD, USA
Xiuqi Chen, Nandakumar Rajasekaran & Kaixian Liu
Department of Biology, Johns Hopkins University, Baltimore, MD, USA
Christian M. Kaiser
Department of Biophysics, Johns Hopkins University, Baltimore, MD, USA
Christian M. Kaiser

Authors

Xiuqi Chen
View author publications
You can also search for this author in PubMed Google Scholar
Nandakumar Rajasekaran
View author publications
You can also search for this author in PubMed Google Scholar
Kaixian Liu
View author publications
You can also search for this author in PubMed Google Scholar
Christian M. Kaiser
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.M.K. and X.C. designed research, X.C. and N.R. generated reagents, X.C., N.R., and K.L. performed experiments, all authors analyzed data, C.M.K., X.C., and N.R. wrote the manuscript, all authors edited the manuscript.

Corresponding author

Correspondence to Christian M. Kaiser.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Tae-Young Yoon, Yongli Zhang, and other, anonymous, reviewers for their contributions to the peer review of this work. Peer review reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chen, X., Rajasekaran, N., Liu, K. et al. Synthesis runs counter to directional folding of a nascent protein domain. Nat Commun 11, 5096 (2020). https://doi.org/10.1038/s41467-020-18921-8

Download citation

Received: 06 May 2020
Accepted: 18 September 2020
Published: 09 October 2020
DOI: https://doi.org/10.1038/s41467-020-18921-8

This article is cited by

Chp1 is a dedicated chaperone at the ribosome that safeguards eEF1A biogenesis
- Melania Minoia
- Jany Quintana-Cordero
- Claes Andréasson
Nature Communications (2024)
Resolving chaperone-assisted protein folding on the ribosome at the peptide level
- Thomas E. Wales
- Aleksandra Pajak
- David Balchin
Nature Structural & Molecular Biology (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.