Non-cooperative 4E-BP2 folding with exchange between eIF4E-binding and binding-incompatible states tunes cap-dependent translation inhibition

Dawson, Jennifer E.; Bah, Alaji; Zhang, Zhenfu; Vernon, Robert M.; Lin, Hong; Chong, P. Andrew; Vanama, Manasvi; Sonenberg, Nahum; Gradinaru, Claudiu C.; Forman-Kay, Julie D.

doi:10.1038/s41467-020-16783-8

Download PDF

Article
Open access
Published: 19 June 2020

Non-cooperative 4E-BP2 folding with exchange between eIF4E-binding and binding-incompatible states tunes cap-dependent translation inhibition

Nature Communications volume 11, Article number: 3146 (2020) Cite this article

3514 Accesses
14 Citations
10 Altmetric
Metrics details

Subjects

Abstract

Phosphorylation of intrinsically disordered eIF4E binding proteins (4E-BPs) regulates cap-dependent translation by weakening their ability to compete with eIF4G for eIF4E binding within the translation initiation complex. We previously showed that phosphorylation of T37 and T46 in 4E-BP2 induces folding of a four-stranded beta-fold domain, partially sequestering the canonical eIF4E-binding helix. The C-terminal intrinsically disordered region (C-IDR), remaining disordered after phosphorylation, contains the secondary eIF4E-binding site and three other phospho-sites, whose mechanisms in inhibiting binding are not understood. Here we report that the domain is non-cooperatively folded, with exchange between beta strands and helical conformations. C-IDR phosphorylation shifts the conformational equilibrium, controlling access to eIF4E binding sites. The hairpin turns formed by pT37/pT46 are remarkably stable and function as transplantable units for phospho-regulation of stability. These results demonstrate how non-cooperative folding and conformational exchange leads to graded inhibition of 4E-BP2:eIF4E binding, shifting 4E-BP2 into an eIF4E binding-incompatible conformation and regulating translation initiation.

The structure of a human translation initiation complex reveals two independent roles for the helicase eIF4A

Article Open access 29 January 2024

Overlapping regions of Caf20 mediate its interactions with the mRNA-5′cap-binding protein eIF4E and with ribosomes

Article Open access 29 June 2021

Structural basis for the transition from translation initiation to elongation by an 80S-eIF5B complex

Article Open access 06 October 2020

Introduction

Tuning the binding affinities of the eukaryotic translation initiation factor 4E (eIF4E) for its intrinsically disordered binding proteins (4E-BPs) is central to regulation of cap-dependent translation initiation. Initiation is the rate-limiting step in translation, in which the ribosome is recruited to the mRNA by the eIF4F complex^1,2,3. eIF4E, along with the RNA helicase eIF4A and the scaffolding protein eIF4G, forms the eIF4F complex. eIF4E directly interacts with the 7-methyl guanosine cap structure at the mRNA 5′ end. Interaction of eIF4E with 4E-BPs^4,5 inhibits cap-dependent translation initiation by competing for an overlapping eIF4E surface with eIF4G^6,7,8,9. 4E-BPs have multiple phospho-sites¹⁰ (four or more depending on isoform or species), which are hierarchically phosphorylated^3,11. Non-phospho 4E-BPs (np-4E-BPs) bind eIF4E tightly and inhibit eIF4G binding^10,11. The mammalian target of rapamycin (mTOR) phosphorylates the first two sites, T37 and T46¹², resulting in the hypo-phosphorylated state, which binds eIF4E more weakly, but still inhibits eIF4G binding. Hyper-phosphorylated 4E-BP is modified at all sites, including S65 and T70, and has further weakened eIF4E affinity, allowing eIF4G to compete for eIF4E and translation initiation to proceed¹⁰. The hyper-phosphorylated state has a potential fifth site at S83 that is conserved in the 4E-BP1 and 4E-BP2 isoforms, but is less conserved in 4E-BP3 and in invertebrates¹³. Since dysregulation of eIF4E function is involved in many diseases including cancer and autism spectrum disorders^1,2 and ubiquitin-mediated degradation of 4E-BPs is dependent on its phosphorylation status¹⁴, understanding the stepped binding affinities and the link with protein stability is critical.

4E-BPs, including the predominantly neuronal isoform 4E-BP2^4,5, are intrinsically disordered proteins (IDPs)^15,16,17, yet contain significant transient secondary structure distributed throughout the protein^8,11. The canonical binding helix is conserved between eIF4G and the 4E-BPs (⁵⁴YDRKFLLDRR⁶³ for 4E-BP2) and binds to the same interface on the convex surface of eIF4E structures of the 4E-BP:eIF4E complexes^{7,9,13,18,19,20}. The residues after the helix are less conserved and more dynamic, and contain a linker region and secondary binding site (centered at ⁷⁸IPGTV⁸² in 4E-BP2), which winds along the lateral surface of eIF4E^7,8,9,13. Determining structural effects of phosphorylation is complicated by available structures of 4E-BP:eIF4E complexes being for np-4E-BP fragments with few phospho-sites present in the observed structured regions, and only one site, S65, being in direct contact with the eIF4E surface. Proximity of S65 to E70 on eIF4E suggested the potential for electrostatic repulsion between the binding partners^18,21, but the S65 sidechain is oriented differently relative to the eIF4E surface in different crystal structures, pointing to binding being controlled by other mechanisms^11,22.

Although a low-resolution model based on SAXS data exists²³, no atomic structure of eIF4E in complex with any full-length 4E-BP has been observed, despite intense structural studies since 1998¹⁷. Interestingly, only upon truncating both the N-terminal and C-terminal disordered regions (residues 44–87²¹ and residues 50–83¹³) or abrogating the secondary binding site in the context of full-length protein⁸ (⁷⁸IPGTV⁸² deleted or mutated to ⁷⁸AAAAA⁸²), did NMR spectroscopy studies and X-ray crystallographic structures begin to reveal detailed structural and dynamic interactions between eIF4E and the extended bipartite 4E-BP-binding site¹⁷. Together, these data demonstrated that full-length 4E-BP:eIF4E interactions are dynamic, fuzzy complexes¹⁷.

We have previously demonstrated that the phosphorylation of T37 and T46 in apo 4E-BP2 induces folding of residues P18–R62 into a four β-stranded folded domain that partially sequesters the canonical eIF4E-binding helix, with pT37 and pT46 being part of conserved pTPGGT motifs that form hairpin turns central to the structure¹¹. The C-terminal intrinsically disordered region (C-IDR, residues 63–120) contains the secondary eIF4E-binding site, the linker region between the canonical and secondary binding sites, and three additional phospho-sites (S65, T70, and S83). The C-IDR is not required for the folded domain formation¹¹. However, phosphorylation at T37 and T46 alone only reduces 4E-BP2’s eIF4E affinity by ~100-fold compared to np-4E-BP2 (from K_D = 3.2 ± 0.6 to 267 ± 32 nM). 4E-BP2 phosphorylated at all five sites has ~4000-fold weaker affinity (K_D = 12,320 ± 200 nM), allowing eIF4G to out-compete 4E-BP2, indicating that the three C-IDR phospho-sites also play important roles in binding¹¹. We previously found that five-phospho 4E-BP2 (5p-4E-BP2) mutants that lacked folded domains have affinities similar to that of np-4E-BP2, establishing that the C-IDR sites must act in concert with the folded domain¹¹. This, together with evidence that the folded domain is marginally stable and is in exchange with partially unfolded states¹¹, led to the hypothesis that C-IDR phosphorylation stabilizes the folded domain and that increased stability lowers 4E-BP2:eIF4E binding by decreasing availability of the eIF4E-binding helix.

Here, we define in mechanistic detail how the C-IDR phospho-sites modulate the folded domain stability, with hierarchical phosphorylation controlling eIF4E binding by tuning access to eIF4E-binding sites via changes in conformational equilibria. Using NMR spectroscopy, single-molecule fluorescence and calorimetry, we show that stability of the two pTPGGT hairpin turns engenders non-cooperative folding of the phosphorylated domain that controls access to the canonical binding site, while C-IDR phospho-sites tune the domain’s stability. The biophysical stability of the pTPGGT turns, combined with biochemical and bioinformatic results, suggests a more general relationship of pTPGGT-like motifs to cellular degradation and stability. Overall, our data explain the underlying mechanism of graded inhibition of eIF4E binding by hierarchical phosphorylation of 4E-BP2, which regulates the protein’s dynamic conformational equilibria, gradually converting it into an eIF4E binding-incompatible conformation.

Results

Binding largely unaffected by phosphate electrostatics

Previously, we found that the presence of the folded domain was indispensable for weakening eIF4E:4E-BP2 binding. Mutants that disrupt the folded domain of 5p-4E-BP2 bind eIF4E with similar affinities to that of np-4E-BP2, arguing against a dominant effect of overall electrostatics¹¹. However, structures of eIF4E bound to closely related 4E-BP1 fragments show the close proximity of phospho-site S65 to a negative charge on eIF4E, suggesting that phosphorylation of S65 may weaken binding by electrostatic repulsion. Analysis of the energy minimized bound-state structure (PDB ID 4UED), with and without phosphate, revealed no detectable energetic difference (Supplementary Fig. 1b, c). Thus, modeling coulombic and steric interactions due to phosphorylation at S65 in bound state structures does not capture the effect of phosphorylation.

Phospho 4E-BP2 folded domain unfolds non-cooperatively

Understanding binding energetics requires consideration of the equilibrium between all possible states, not only analysis of the bound state. We tested our primary hypothesis that C-IDR phosphorylation affects binding by modulating the 4E-BP2 folded domain stability. We used NMR spectroscopy and differential scanning calorimetry (DSC) to measure the stability of the fold under different denaturing conditions and phosphorylation states. Despite the folded domain’s small size, ~40 residues, we found that it is not a simple two-state folder. The presence of non-cooperative folding was readily observed in NMR ¹⁵N-¹H HSQC experiments on 5p-4E-BP2 (Fig. 1). Matching previous observations¹¹, chemical shifts for residues in the folded domain fall within ranges associated with secondary structure, clear evidence that the domain is not in a random coil state. The peaks from the hairpin residues G39 and G48 are shifted significantly downfield, reflecting stable backbone NH hydrogen bonds to the phosphate groups. After adding chemical denaturants, guanidinium chloride (GdmCl) or urea (Fig. 1a, b), we observed heterogeneous behavior with most folded domain peaks moving significantly toward random coil values (ω_1H ~ 8.2 ppm). In contrast, the hairpin G39 and G48 peaks (Fig. 1a, b insets) remain much farther from random coil values, indicating that the phosphorylation-induced turns are not unfolding with the rest of the domain.

**Fig. 1: Chemical, acid, and thermal denaturation of five-phospho 4E-BP2.**

Since deviations from random coil values are still observed for many domain residues even at high denaturant concentration (1.6 M GdmCl or ~8 M urea), we used acid and heat denaturation, yielding peaks much closer to random coil values under acidic (pH 1.0) and high temperature (70 °C) conditions (Fig. 1c). We also observed non-cooperative unfolding 4E-BP2 by acid and thermal denaturation, with unfolding of the hairpin turns lagging behind the changes in the rest of the domain (Fig. 1c). The chemical shift behavior of the peaks was used to distinguish between residues affected by acid denaturation of the folded domain and the direct effect of temperature and buffer conditions on solvent-exposed residues. Residues that shift toward random coil are likely to be affected by denaturation (Fig. 1c, Supplementary Fig. 2a). In contrast, the peaks for the np-4E-BP2, which does not contain any folded domain, show a uniform shift upfield with increasing temperature, consistent with expectation for non-hydrogen-bonded amides and no conformational changes^24,25,26 (Supplementary Fig. 2b). To quantify this non-cooperative unfolding for 5p-4E-BP2, we fitted the chemical shift changes at 20 °C as a function of pH to obtain residue-specific apparent pK_a values for the unfolding transition (Supplementary Table 1). Chemical shift changes for residues affected by both the denaturation and the protonation of neighboring solvent-exposed residues were fitted to the three-state equation (see “Methods” for further details). If acid denaturation occurred in a two-state cooperative event, a single apparent pK_a for the transition would be found across the domain²⁷. Instead, there is a variation in apparent pK_a values between 3.62 and 5.40 (Fig. 2a, Supplementary Table 1), with higher apparent pK_a values indicative of less structural stability. This non-cooperative behavior extends beyond the hairpins, with differences in stability distributed across the domain.

**Fig. 2: Non-cooperative folding of five-phospho 4E-BP2 folded domain.**

NMR data could only be collected up to 70 °C or 1.6 M GdmCl due to NMR probe engineering limits, so we used DSC to measure overall melting transition (T_m) values of 84 °C at pH 7.4 and 49 °C at pH 1.0 (Supplementary Fig. 3a). We also developed a method to indirectly estimate the thermal denaturation free energies from NMR thermal melting data (between 5 and 70 °C), collected at six different pH values between pH 2.6 and 7.4. Our method, based on the temperature-dependence of the pK_a values (see “Methods” section, Supplementary Fig. 3b–e), reveals that residues near the hairpin hydrogen bonds between the pT37 phosphate and G39 NH and the pT46 phosphate and G48 NH display pH-dependent thermal stability (Fig. 2b, Supplementary Fig. 3e). Other residues in the folded domain do not show this behavior, demonstrating the hairpins’ independence as miniature folding units. The pH dependence of hairpin stability (Fig. 2c) suggests that protonation of the phosphate weakens the critical hydrogen bond to the first glycine of the hairpin motif (see “Methods” section). We reported earlier that the folding of the hairpins upon T37 and T46 phosphorylation was a prerequisite for folded domain formation¹¹. But this was not sufficient for its stability, and the hairpins themselves can be populated even when the domain core is disrupted by mutations¹¹. The new data show that the two independently folded hairpins are extremely stable, requiring very low pH to denature, acting as stabilizing molecular staples.

C-IDR phosphorylation perturbs eIF4E-binding states

To probe how C-IDR phosphorylation modulates the stability of the folded domain, we systematically replaced the C-IDR sites with alanine substitutions (combinations of S65A, T70A, and S83A). By comparing the NMR spectra of these mutants and 5p-4E-BP2, we observe backbone NH chemical shift differences, Δω, near the C-IDR phospho-sites, but we also observed differences in the long loop connecting strands 1 and 2 of the folded domain, and in the canonical and secondary eIF4E-binding motifs (Fig. 3, Supplementary Fig. 4a, b). Lambda protein phosphatase dephosphorylated S65, T70, and S83 of 5p-4E-BP2 (Fig. 3b, Supplementary Fig. 4c, d), confirming that chemical shift differences are due to phosphorylation status and not due to unintended consequences of the Ala mutations. Removing the S65 site caused the largest chemical shift perturbation of all the modifications, particularly near the canonical binding site residues (Fig. 3). To examine the conformational changes caused by phosphorylation in more detail, we calculated secondary structural propensity (SSP) values²⁸ derived from backbone and sidechain chemical shifts (¹HN, ¹⁵N, ¹Hα, ¹³CO, ¹³Cα and ¹³Cβ). This metric estimates the fractional α-helical (SSP > 0) or β-strand (SSP < 0) population in a dynamic state and is used to gauge the effect of phosphorylation on conformational equilibria. Apo np-4E-BP2 is an intrinsically disordered protein that transiently samples several α-helices along its length, most notably between residues 49 and 67 (Fig. 4a, black bars). These residues encompass the canonical binding motif, partially pre-ordering a region that becomes a stable α-helix when bound to eIF4E^8,9. Np-4E-BP1 in complex with eIF4E also has a helical half-turn elbow loop between residues 64 and 67¹³. Based on our NMR SSP data, much of this binding-ready conformation is abolished in 5p-4E-BP2 (Fig. 4a, red bars). The folded domain β-strands can be observed as four regions with the most strongly negative SSP values between residues P18 and R62, consistent with their being part of the stable folded domain structure. The formation of the folded domain incorporates residues 52–57 of the helical canonical binding motif into its β-sheet structure¹¹, while the elbow loop region switches from favoring helical to extended or β-strand-like character (i.e., from moderately positive SSP in non-phospho to near zero or negative SSP in 5p-4E-BP2).

**Fig. 3: Effects of the C-IDR and its phosphorylation status on the 4E-BP2 folded domain.**

**Fig. 4: Effects of the C-IDR phosphorylation on 4E-BP2 folded domain secondary structure.**

The folded domain of 5p-4E-BP2 is most stable at neutral pH with its four β-strands featuring prominently in the SSP data and the rest of the protein tending toward β-strand character. Due to the non-cooperativity of 4E-BP2 folding, parts of the folded domain structure can be altered without undermining the entire domain. Under acid denaturing conditions at pH 3, the β-strand propensities of residues within the folded domain decrease (SSP → 0), especially near the first β-strand, β1, indicating that the fold is destabilized but still partially populated at low pH (Fig. 4b). Some α-helical character has emerged for residues immediately C-terminal to β4, re-forming part of the canonical binding helix or elbow loop, consistent with denaturation yielding a more eIF4E-binding ready conformation for the canonical binding motif. Thus, acid denaturation of phospho 4E-BP2 leads to conformational states closer to those populated by np-4E-BP2 than 5p-4E-BP2.

Partial C-IDR phosphorylation is associated with a range of conformational effects in the folded domain, with helical character adjacent to the canonical binding site decreasing as C-IDR phosphorylation increases (Fig. 4c–e). Matching the chemical shift perturbations, we find that the C-IDR phospho-site at S65 has the largest effect on conformation. Transient helices within the C-IDR are also partially present for the two-phospho, three-phospho, and four-phospho (2p-, 3p-, and 4p-) constructs, indicating that the secondary eIF4E-binding site is also affected. These findings demonstrate a dynamic continuum of conformational states with transient population of eIF4E-binding helices in np-4E-BP2, a conformational equilibrium between these helices and the folded β-sheet structure in partially phosphorylated states and more stable β-sheet structure in the full five-phospho state. Thus, the non-cooperatively folded nature of the folded domain of 4E-BP2 enables its conformational ensemble to be dramatically affected by C-IDR phosphorylation state, controlling the population of eIF4E-binding competent states.

Phosphorylation affects C-IDR:folded domain interactions

To further understand how C-IDR phosphorylation could affect the folded domain stability, we monitored transient interactions between the folded domain and the C-IDR using single-molecule fluorescence resonance energy transfer (smFRET) experiments (Fig. 5a) on np-, 2p-, and 5p-4E-BP2. The proteins were labeled with a donor–acceptor dye pair in the long loop of the folded domain (residue 32) and in the C-IDR (residue 91). The average FRET efficiency, 〈E〉, is inversely proportional to the sixth power of the donor–acceptor distances²⁹. In parallel, we recorded NMR paramagnetic relaxation enhancement (PRE) data on 2p-4E-BP2 and 5p-4E-BP2, with a paramagnetic nitroxide label at residue 32 that leads to decreased peak intensity for residues close in space due to resonance broadening³⁰ (Fig. 5b). Both smFRET and NMR PRE data clearly demonstrate transient contacts that could mediate stabilizing effects of the C-IDR on the folded domain.

**Fig. 5: C-IDR phosphorylation and pH affect interactions between folded domain and C-IDR.**

smFRET data for np-4E-BP2, 2p-4E-BP2, and 5p-4E-BP2 all show a broad distribution of 〈E〉 values that shift towards lower values during GdmCl denaturation, indicating an ensemble of conformations expanding and undergoing non-cooperative unfolding³¹ (Supplementary Fig. 5). This is in contrast with the decrease in a high-FRET (folded) state accompanied by the increase in a low-FRET (unfolded) state typically observed for cooperative protein unfolding³². Together, these data are consistent with our NMR results demonstrating the non-cooperative unfolding of 4E-BP2 for both phospho states. A similar dependency, i.e. gradual shift of 〈E〉 to lower values (larger distances), is also observed for np-4E-BP2, indicative of unfolding of transient helical structure within the disordered np-4E-BP2 ensemble⁸.

In the absence of denaturant at pH 7.4, the FRET 〈E〉 shifted from 0.57 ± 0.01 for np-4E-BP2, to 0.44 ± 0.01 for 2p-4E-BP2, and to 0.30 ± 0.01 for 5p-4E-BP2 (Fig. 5a, center row). These data indicate that the average distance between residue 32 in the folded domain region and residue 91 in the C-IDR increases with phosphorylation. The mechanism behind this expansion is different than that observed during chemical denaturation. The change in average distance is consistent with the conformations observed previously with NMR. The helical conformations sampled in np-4E-BP2 bring dyes at residues 32 and 91 into close average proximities. While phosphorylation induces folded domain formation, it also causes expansion of the C-IDR. Phosphorylation to 2p-4E-BP2, then to 5p-4E-BP2, shifts the conformational equilibrium towards more extended, β-strand-like conformations (Fig. 4), and leads to an increased separation between the folded domain and the C-IDR. In agreement, NMR PRE data show that residue 32 within the folded domain region and the stretch of residues between 65 and 90 of the C-IDR are farther apart from each other in 5p-4E-BP2 than in 2p-4E-BP2 (Fig. 5b). 5p-4E-BP2 has greater 〈E〉 at pH 3 than at pH 7.4 (Fig. 5a, right column), with a value, 0.74 ± 0.01, that is similar for all the phospho-states at acidic pH (Fig. 5a, bottom row). This observation is consistent with (i) destabilization (by phosphate group protonation) of all the phosphorylation-induced extended β strands in the folded β-structure and in the C-IDR, and (ii) the subsequent enhanced sampling of helical conformations as detected by NMR spectroscopy⁸ (Fig. 4). Together, these two effects enable closer approach of the dyes at low pH for all the different phospho states. Interestingly, for each phospho state, 〈E〉 values for the dominant peak at pH 10 and pH 7.4 are similar (Fig. 5a, first vs. second row), reflecting little or no change in the protonation status of the phosphates (or other residues). Consequently, there are no significant structural changes, which is consistent with previous observations¹¹. Furthermore, we used fluorescence correlation spectroscopy (FCS) to measure the hydrodynamic radius (R_H) of 4E-BP2 under different phosphorylation states and pH conditions (Supplementary Fig. 6). At pH 3, R_H = 27.6 ± 0.6 Å, independent of phosphorylation state, which is slightly smaller than R_H measured at pH 7.4 (R_H = 29.0 ± 0.2 Å) for np-4E-BP2 and R_H = 31.1 ± 0.2 Å for 5p-4E-BP2. The FCS data rule out protein aggregation as a cause for high FRET at acidic pH³³. These data also confirm the observed higher smFRET 〈E〉 values (i.e. the close distance between residues 32 and 91) for all phospho states of 4E-BP2 at low pH, reflecting the increase in helical population observed in Fig. 4. The longer R_H value for 5p-4E-BP2 at pH 7.4 also agrees with the lower smFRET 〈E〉 and NMR data showing that folding and stabilization by increased phosphorylation at neutral and high pH reflect extended conformations.

Together, the smFRET, FCS, and NMR PRE results are consistent with the NMR chemical shift-derived SSP data (Fig. 4). A more compact helical structure is observed in np-4E-BP2⁸ and in phosphorylated 4E-BP2 states under acidic conditions. A more extended β-character, with stabilization of the fold, is found at neutral pH or after C-IDR phosphorylation. Therefore, phosphorylation of C-IDR modulates the structural ensemble of the entire protein, affecting the folded domain stability, transient helical structure within both the folded domain region and the C-IDR, and transient contacts between the folded domain region and C-IDR.

C-IDR phosphorylation weakens binding via conformational change

To understand how conformational effects of phosphorylation affect eIF4E-binding affinities, we used isothermal titration calorimetry (ITC) (Fig. 6a, Supplementary Figs. 7 and 8). Our previous work¹¹ showed that 4E-BP2 mutants with disrupted folded domains bound eIF4E with K_D values of ~2–36 nM, close to the affinity of np-4E-BP2 (K_D = 3.2 ± 0.6 nM)¹¹ (Supplementary Fig. 7), relatively independent of C-IDR phosphorylation. T37/T46 phosphorylation in 2p-4E-BP2 induces the folded domain and attenuates eIF4E binding to 267 ± 32 nM. Phosphorylating the three C-IDR sites in the presence of the folded domain weakens eIF4E binding to K_D = 15,000 ± 4000 nM (ref. ¹¹ and new measurements), demonstrating that the folded domain must be present for phosphorylation of the C-IDR to regulate eIF4E binding. With our new data, we obtained K_D values for eIF4E binding to 4E-BP2 containing the folded domain plus one of the eight possible combinations of C-IDR phosphorylation. 2p-4E-BP2 with no C-IDR phosphorylation and 5p-4E-BP2 with all three C-IDR sites phosphorylated were investigated previously¹¹. The other variants (pT37/pT46/pS83, pT37/pT46/pT70, pT37/pT46/pS65, pT37/pT46/pT70/pS83, pT37/pT46/pS65/pS83, pT37/pT46/pS65/pT70) eliminate either one or two C-IDR sites with Ala mutations. All but one of these six 4E-BP2 variants bind eIF4E with K_D values intermediate between that for 2p-4E-BP2 and 5p-4E-BP2 (K_D ~ 1800–5700 nM, Supplementary Fig. 7). The pT37/pT46/pS65/pS83 variant binds eIF4E with an affinity that is statistically similar to that of 5p-4E-BP2. Note that this state is not present in the cell due to hierarchical phosphorylation requiring phosphorylation of T70 before that on S65; it is possible that S65 phosphorylation, with the most dramatic impact on conformational equilibria, requires destabilization from T70 phosphorylation for accessibility of the kinase.

**Fig. 6: 4E-BP2 phosphorylation tunes eIF4E binding.**

Comparison of the binding affinities with the conformational properties from NMR chemical shifts and SSP calculations provides strong evidence that the effects of phosphorylation on structure and binding are related, consistent with a model in which stabilizing the folded domain disrupts binding and the non-cooperative folding provides a mechanism for titratable stabilization of the folded domain. This structural link between the degree of phosphorylation and binding affinity leads to a tuned structural basis of phospho-regulation of 4E-BP2:eIF4E binding (Fig. 6b). Apo np-4E-BP2 is pre-ordered, albeit transiently, in eIF4E-binding helical form¹¹. Hierarchical phosphorylation of pT37 and pT46 shifts the part of the binding motif away from helical conformation toward β-strand and triggers the formation of the folded domain. Some of the binding residues are tucked into the β-sheet as β4, with β1 between it and the edge of the β-sheet. This partially sequesters the canonical binding motif, accounting for the weaker affinity of eIF4E for 2p-4E-BP2. However, some helical propensity remains in the C-terminal end of the binding motif. Previously, we found that the effects of C-IDR phosphorylation (pS65, pT70, pS83) required the presence of the folded domain¹¹. Here, we have defined in detail the mechanism behind the three-stage attenuation of eIF4E binding that is the core of the phospho-regulation of 4E-BP2 inhibition. C-IDR phosphorylation further decreases the helical population of the canonical and secondary eIF4E-binding sites. β1 and β4 are lengthened and have increased β-strand propensity. The canonical binding site residues are increasingly in the wrong conformation for binding, sequestered away from its binding partner, eIF4E.

Phospho TPGGT hairpins turns are independently stable

4E-BP2 has a conserved stretch of residues near the S83 phospho-site, ⁸³SPGT, that is similar to the TPGGT hairpin-forming motif, except that Ser replaces Thr and one of the glycine residues is missing (Supplementary Fig. 1a). There is no stable hairpin formed after S83 phosphorylation, based on the lack of additional downfield ¹HN chemical shifts indicative of a phosphate hydrogen bond to the G85 amide proton. To test the independent stability of the phosphate-induced turn, we made a mutant with a glycine insertion between G85 and T86, and observed a third downfield ¹HN chemical shift indicative of a phosphate hydrogen bond to the G85 amide proton. This new resonance confirms formation of a third hairpin within the disordered C-IDR upon phosphorylation (⁸³pSPGGT) (Fig. 7a) and the ability of pTPGGT/pSPGGT (or p[TS]PGGT) motifs to form stable hairpins independently of the 4E-BP2-folded domain.

**Fig. 7: Independent stability of the pTPGGT motif.**

The pTPGGT motif is conserved across species in 4E-BP1, 4E-BP2, and 4E-BP3 (Supplementary Fig. 1a)^11,23. Given that 4E-BP2 folding requires hairpin formation, we tested the sequence specificity of hairpin formation by their effects on folding. Folding can be monitored in NMR spectra by the presence of the extremely downfield resonances diagnostic of hairpin formation and the other normally dispersed resonances reflecting stable structure. We find that despite the absolute conservation of phospho-threonine in the two hairpins, phospho-serine is also capable of forming a hairpin, both in the independent ⁸³SPGGT motif introduced in the C-IDR as well as in a ³⁷pSPGGT/⁴⁶pSPGGT 4E-BP2 double mutant (Supplementary Fig. 9a). However, the characteristic ^39/48Gly peaks are not as down-field shifted in the double mutant as in the WT, suggesting that the latter is more stable. The pTPGGT is also conserved in the single isoform of 4E-BP found in invertebrates^34,35 with a notable exception of ticks, which have a TPGGS motif at the second hairpin (Supplementary Fig. 1a). Similarly, we find that the domain can fold in the context of select mutations to the Thr in the fifth position, with both ³⁷pTPGGR and ³⁷pTPGGA showing little disruption to the domain (Supplementary Fig. 9b, c). Mutations to the two glycine residues of the motif (the third and fourth positions) become increasingly more disruptive to the domain as the bulk of the substituted residue sidechain increases. ³⁷pTPSGT and ³⁷TPGLT 4E-BP2 show some peaks associated with the folded domain, but are missing many others (Supplementary Fig. 9d, e). Larger or more rigid β-branched residues, such as ³⁷pTPIGT, ³⁷pTPGPT, ³⁷pTPGWT, ³⁷pTPGVT (Supplementary Fig. 9f–i) or ³⁷pTPVGT/⁴⁶pTPVGT (ref. ¹¹), are more disruptive to the domain. Even though these hairpin mutants destabilize the folded domain, most show evidence of some degree of hairpin formation based on downfield glycine chemical shifts indicative of the turn.

Sequence requirements and frequency of TPGGT-like motifs

Since a range of sequences can display the phospho-dependent hairpin structural behavior of 4E-BP’s TPGGT motifs, we searched for the sequences compatible with the hairpin structure through bioinformatic assessment of similar hairpins observed in the PDB (described in “Methods” section). The primary and defining sequence feature associated with this structure is the glycine at the i + 3 position (TPGGT), while other residues are selective but can include a variety of residues at each position (Fig. 8a, b). Confirming the validity of these statistics, hairpin mutations, which we tested in 4E-BP2, exhibited effects corresponding to their prevalence in similar turns within the PDB. Fold-compatible mutants, ³⁷pTPGGR and ³⁷TPGGA, are found in similar turns and fold-destabilizing mutants, ³⁷pTPIGT, ³⁷pTPGPT, ³⁷pTPGWT, and ³⁷pTPGVT, are excluded.

**Fig. 8: Bioinformatic analysis of hairpin sequence motifs.**

Beyond the enrichment of glycine at i + 3, we observe that T or S at i and P at i + 1 are also associated with an increased hairpin propensity, with 2640 unique [TS]P-containing hairpins found in the PDB. This suggests that the structurally similar turns found in constitutively folded proteins can include a proline-directed phospho-site, and that the stability of these folds could be phospho-regulated. We also note that there is a bias towards polar residues at the i + 4 position of the turn, whose sidechain would come in direct contact with any phosphate present at position i, and this includes enrichment for negatively charged residues which we would expect to be repelled by phosphate.

To test the importance of the i + 4 position and whether phospho-regulation of hairpin stability can occur in folded proteins, we first devised a position-specific scoring matrix (PSSM)-based scoring function for assessing the overall hairpin propensity of a given 11-mer sequence, which we made based on the amino acid frequencies observed in hairpin structures identified in the DSSP secondary structure database³⁶ (Fig. 8c and see “Methods” section). Next, we used the scoring function to score the 9764 unique 11-mer sequences with [TS]PxG (where x is any residue) found in the portion of the human proteome covered by the PhosphoSitePlus database³⁷. For each individual protein, we then took the lowest scoring [TS]PxG containing 11-mer (lowest predicted hairpin propensity) and for the 6619 proteins containing at least one of these 11-mer sequences, we identified 418 proteins where the lowest [TS]PxG hairpin propensity observed scored as highly as 4E-BP2, and 324 proteins with higher predicted propensity than any human 4E-BP sequence. We selected an example from the highest scoring 324 proteins that satisfies two criteria: (i) containing a negatively charged residue at position i + 4, and (ii) forming a constitutive hairpin as part of a folded domain. We identified the ubiquitin-like domain of Fas-associated factor 1 (FAF1, residues 570–650) (Fig. 7b) based on the hairpin turn score of its ⁵⁸⁰TPSGE sequence (it was in the top 269 out of 6619 human proteins in PhosphoSitePlus³⁷, when ranked by the lowest motif score observed per protein (Fig. 8a, b)). We then transplanted the TPGGT sequence into FAF1 and tested the effect on stability of phosphorylating both wildtype and TPGGT-modified domains.

The non-phospho ⁵⁸⁰TPSGE FAF1 domain has a single two-state transition with a T_m of ~58 °C observed via DSC (Fig. 7c). Phosphorylating T580 results in a loss of protein stability with most of the protein precipitated by ~50 °C, most likely due to the electrostatic repulsion between the negative charges of the phospho-group and the E584 carboxylate. Replacing the FAF1 ⁵⁸⁰TPSGE sequence with ⁵⁸⁰TPGGT, however, converts the effect of phosphorylation from destabilization to stabilization. The non-phospho TPGGT FAF1 domain has a T_m of ~52 °C and phospho TPGGT FAF1 domain has a T_m of ~59 °C, a 7 °C increase. Phosphorylation is a destabilization event for the wildtype FAF1 domain, with glutamic acid at the i + 4 position, but transplanting the TPGGT motif to FAF1 leads to stabilization upon phosphorylation. The pTPGGT FAF1 domain also has similar thermal denaturation behavior observed under NMR spectroscopy. At pH 7.4, the hairpin glycine residue is resistant to thermal denaturation, even up to 65 °C, in a strikingly similar manner to phosphorylated 4E-BP2 (Fig. 7d).

Potential phospho-regulatory roles of TPGGT hairpin turns

The observation that proline-directed phosphorylation of TPGGT-like hairpins can both stabilize and destabilize folded domains suggests a more general role for modifying the stability of proteins in which these motifs are found. Phosphodegrons are short amino acid sequences that, when phosphorylated, route a protein for degradation³⁸. Ubiquitin ligase Fbw7 recognizes two motifs that are similar to our TPGGT sequence: [LIVMP]x(0,2)TPxxE, which is singly phosphorylated at Thr, and [LIVMP]x(0,2)TPxx[ST], which is doubly phosphorylated^37,39,40 (‘x(0.2)’ indicates 0, 1, or 2 open spaces in the motif). To test the broader relevance of hairpin stability to degradation, we explored phosphodegron motif overlap and ubiquitination site annotation frequency as a function of our hairpin propensity score for the 7380 mouse and 9764 human [TS]PxG motifs found in the PhosphoSitePlus database³⁷. We found that hairpin turn propensity correlates with three markers related to ubiquitination (Fig. 8d–i), including (i) the presence of local lysine residues (Fig. 8d, g), (ii) the likelihood of matching one of the aforementioned phosphodegron motifs (Fig. 8e, h) and (iii) the likelihood of ubiquitination being directly observed in the local sequence (Fig. 8f, i). These observations and results suggest that TPGGT-like hairpins could have a more general role in regulating degradation by modifying protein stability.

Discussion

Why most native single domain proteins fold cooperatively is not clear; it could be a by-product of marginal stability, where there is no fitness benefit for any segment of a protein to be more stable than the whole, but it could also be adaptive, for example by making it easier to recognize misfolded proteins and target them for degradation. Here we demonstrate that stepwise, non-cooperative folding can have regulatory function, enabling the occupancy of a fold to be fine-tuned to regulate competition of processes involving either the folded or unfolded states. We previously demonstrated that C-IDR phosphorylation of 4E-BP2 reduces eIF4E-binding affinity by orders of magnitude, and our data strongly suggested that it does so by modulating the stability of the folded domain¹¹. In this paper, we provide a detailed structural mechanism for the impact of the C-IDR phospho-sites on the folded domain to explain the full extent of phospho-regulation of translation initiation. The canonical eIF4E-binding region, which significantly samples helical structure in the non-phosphorylated state, is only partially converted to eIF4E-binding incompatible β-structure in the pT37/pT46 two-phospho state. C-IDR phosphorylation shifts the equilibrium away from helical to β-structure with full stabilization of the β-folded domain only with 5p-4E-BP2.

The previous model, in which phosphorylation leads to electrostatic repulsion between the 4E-BP and negative eIF4E-binding surfaces, is based on the proximity between S65 of 4E-BP1 and E70 eIF4E in crystal structures of eIF4E in complex with np-4E-BP1 fragments^18,21. The pS65 site is positioned at the C-terminal end of the canonical binding helix, destabilizing the helix at physiological pH^10,22. Consistent with this electrostatic effect on the helix dipole, isolated 4E-BP1 peptides containing the canonical eIF4E-binding site (residues 51–67) have a modest decrease in binding affinity upon S65 phosphorylation⁴¹. Our data argue that phosphorylation regulates 4E-BP2 binding by controlling its conformation, contrary to the electrostatic repulsion model. S65 is in a loop/turn region and the orientation of its sidechain varies between different complex structures^13,21. Solution NMR on full-length 4E-BP2 reports on μs–ms motions of the C-IDR over a more extensive eIF4E-binding site than is apparent to X-ray crystallography on truncated 4E-BP proteins^8,17, a dynamic interface supported by SAXS²³. The dynamic nature of the complex complicates interpretation of binding energies from structural data, but our simplistic in silico modeling also supports the view that electrostatic repulsion is not the dominant mechanism by which S65 phosphorylation impacts binding. Disrupting the 5p-4E-BP2-folded domain (with a G39V/G48V double mutant) yields a protein that behaves more like tight-binding WT np-4E-BP2 than weaker-binding WT 5p-4E-BP2 (K_D = 36.1 ± 3.5, 3.20 ± 0.6, and 12320 ± 200 nM, for five-phospho G39V/G48V, non-phospho WT, and five-phospho WT 4E-BP2, respectively)¹¹. Together with the relatively tight (K_D = 11.3 ± 2.9 nM) binding of pS65/pT70/pS83 3p-4E-BP2, also lacking a folded domain¹¹, these data provide very strong evidence that phosphorylation acts primarily by modulation of the fold.

The canonical and secondary eIF4E-binding motifs of 4E-BP2 are accessible to eIF4E binding, but the degree of accessibility depends on the particular 4E-BP2 phosphorylation state. Forming the folded domain after pT37/pT46 phosphorylation restricts access to the N-terminal part of the canonical-binding site, but the restriction is incomplete due to the non-cooperative folding of the domain, which offers some access to the motif. Phosphorylation at the C-IDR sites reinforces the β-sheet conformation of the domain, shifting it, along with the rest of the canonical site, further away from eIF4E-binding-competent conformations.

The two conserved pTPGGT motifs of 4E-BP2 are phosphorylated before the C-IDR sites, likely due to protection from various regulatory kinases when 4E-BP2 is in complex with eIF4E. However, in the absence of eIF4E, we show that the C-IDR sites are more susceptible to de-phosphorylation by lambda protein phosphatase. 2p-4E-BP1 is protected from ubiquitination and degradation only in complex with eIF4E, while in the absence of eIF4E, 2p-4E-BP2 is vulnerable to ubiquitination at K57 in the canonical-binding site¹⁴, consistent with K57 existing in a dynamic equilibrium between helical and extended conformers in the two-phospho state. Interestingly, the free 5p-4E-BP1 is not ubiquitinated¹⁴, consistent with the full stabilization of the folded domain incorporating the putative ubiquitination site K57 in a protected structure. Our data, including de-phosphorylation kinetics, in conjunction with the literature¹⁴, suggest that the relative amounts of free 2p-4E-BP2 and 5p-4E-BP2 in cells are regulated by de-phosphorylation and ubiquitination based on the differential stabilities and protection afforded by the various phospho-states.

The TPGG[ST] sequences are conserved features of all three 4E-BP isoforms, with high positional sequence conservation throughout the proteins (Supplementary Fig. 1a), fairly unusual for IDPs⁴². This sequence conservation is constrained by (i) the structural requirements of the folded domain, (ii) the need to maintain specific eIF4E:4E-BP versus eIF4E:eIF4G binding and (iii) binding to kinases, phosphatases, and ubiquitin ligases. The sequence gives rise to a protein for which stepped phosphorylation enables tuned stabilization of a non-cooperatively folded domain, controlling exposure of the canonical eIF4E-binding helix and ultimately competition for eIF4E binding to eIF4G, required for translation initiation. This knowledge of the underlying mechanism of stability of the various phospho-states of 4E-BPs and other similar hairpin-containing sequences, including their vulnerabilities to ubiquitin-dependent degradation, may provide valuable insights for controlling a variety of biological processes.

Methods

Construct and sample generation

ACGT Corp (Toronto, ON) synthesized mutagenesis primers (see Supplementary Methods) and performed DNA sequence analysis for the wild type and mutants used in this study. Site-directed mutagenesis was performed in-house using standard Quikchange protocols (Agilent) or out-sourced to either Genscript Inc or ACGT Corp. Invitrogen GeneArt (Lifetechnologies) synthesized a codon-optimized cDNA of the ubiquitin-like domain of FAF1 (residues 570–650) and subcloned it into a pET SUMO-vector, generating an expression vector similar to those of the eIF4E and 4E-BP2 constructs. The proteins were expressed in BL21-CodonPlus (DE3)-RIPL E. coli cells (Agilent Technologies) in Luria Broth for unlabeled samples and uniformly ¹⁵N or ¹⁵N/¹³C-labeled M9 for NMR samples. The cells were cultured at 37 °C until OD₆₀₀~0.6–0.8, induced with IPTG, and overexpressed at 16 °C for 16–20 h. Cell lysates were purified using a nickel–nitrilotriacetic acid (Ni–NTA) column. The SUMO solubility tag was cleaved by 30-min incubation with Ulp1 and then removed by the Ni–NTA column. The proteins were purified to homogeneity using S75 HiLoad gel filtration columns (GE Healthcare). Activated His-tagged Erk2 for phosphorylation was expressed and purified using a protocol and plasmid co-expressing Erk2 and MEK1 obtained from Attila Remenyi at Eötvös Loránd University. 4E-BP2 and FAF1 were phosphorylated using Erk2 using a dialysis technique. Each ~50 ml phosphorylation reaction was made up of phosphorylation buffer (50 mM Tris pH 7.5 at 25 °C, 1 mM EGTA, 2 mM DTT, 20 mM MgCl₂, and 10 mM ATP) containing ∼20 μM 4E-BP2 and ∼5 μM Erk2 in a dialysis bag. The dialysis bag was placed in 1 l of phosphorylation buffer and phosphorylation proceeded for 1–3 days with stirring, then the reaction was stopped by removing the kinase with a Ni–NTA column. Flow-through and wash fractions of phosphorylated protein were purified via gel filtration.

Alignment of 4E-BP sequences

The 4E-BP sequences were extracted from the Uniprot Database⁴³ using the database’s BLASTP server⁴⁴. The sequences were aligned with CLUSTAL OMEGA⁴⁵.

In silico-bound state energy minimization

Structural studies of eIF4E bound to 4E-BP fragments provide a binding-energy hypothesis for the mechanism of hierarchical phosphorylation, where phosphosite S65’s close bound-state proximity to a glutamine sidechain suggests that phosphorylation of S65 may weaken binding by electrostatic repulsion^18,21. We tested whether the structural context of S65 confirms electrostatic repulsion in silico by doing energy minimization analysis of the bound structure with and without phosphate and found no detectable energetic difference between states (Supplementary Fig. 1c), demonstrating that, while electrostatic change could potentially have some effect on binding, the effect is not captured by coulombic (electrostatic) interactions modeled in the bound state. The electrostatic effects of phosphorylation on bound state energy were modeled using Rosetta version 3.10⁴⁶ and its energy function⁴⁷ using the default command line for the relax application (relax -in:file:s 4ued.pdb). A phosphate group was added to S65 by modifying the input file, changing the amino acid code for S65 from SER to SEP (phospho-serine) which triggers Rosetta to build in the missing phosphate atoms. Two different net charges were tested for the phosphate group: the default for Rosetta 3.10 (−1.17) and one closer to the expected charge under physiological conditions (−1.88). This second charge was defined by replacing the partial charge values in Rosetta database file “/database/chemical/residue_type_sets/fa_standard/patches/ser_phosphorylated.txt” with values taken from Steinbrecher et al.⁴⁸, with oxygen O1P, O2P, and O3P partial charges changed from −0.78 to −0.98 and phosphate partial charge changed from 1.5 to 1.39.

Rosetta score minimization against the energy function⁴⁷ was used to gauge the potential for electrostatic repulsion in bound state structures of S65 phosphorylation. Five thousand minimized structures were generated for each state starting from atom coordinates in the human eIF4E:4E-BP1 structure (PDB ID 4UED). Non-phospho S65 uses the default minimization protocol and the phosphorylation states use the same protocol but with the addition of phosphate groups, with net charge tested at around both −1 and −2. Supplementary Fig. 1b shows an overlay of the best scoring structure from each set superimposed to the PDB structure, with eIF4E in gray for all sets, and 4E-BP colored in gray for PDB:4UED, cyan for the best scoring non-phospho model, magenta for phosphate charge −1, and yellow for phosphate charge −2, demonstrating that phosphorylation does not require perturbation of the overall structure, and Supplementary Fig. 1c shows that the minimization energy distribution is indistinguishable between states, even with different net charges used on the phosphate. The boxes in Supplementary Fig. 1c show the interquartile range of energy scores, with a white dot labeling the median.

Differential scanning calorimetry

All experiments were performed on a Nano DSC equipped with an Autosampler (TA Instruments) with a heating rate of 1 K/min using protein concentrations ranging from 1 to 10 mg/mL. A Cys-less (C35S/C73S) 4E-BP2 construct was used to optimize the DSC signal. Data were collected and analyzed using the DSCrun and NanoAnalyze programs provided with the instrument. N = 3 data sets were collected at each condition using different samples, some from the same preparation.

NMR chemical shift analysis

NMR experiments were collected on 500, 600, and 800 MHz Varian NMR spectrometers (the field for each experiment is specified in the below sections). The data were processed with nmrPipe⁴⁹, and Sparky⁵⁰ was used for spectra viewing, overlays, and assignment transfer. Sensitivity-enhanced ¹⁵N–¹H HSQC spectra⁵¹ were collected for chemical shift analysis of denaturation, phosphorylation, and mutational data. When analyzing NMR data, peaks that were overlapped with other peaks in the NMR spectra were eliminated from analysis, as were peaks near the background noise level of the spectra. Peaks that shifted too far from their original positions to be assigned with confidence (due to mutation or de-phosphorylation) are also excluded from explicit analysis, though their location was noted on the 4E-BP2 protein structure.

Changes in chemical shifts, Δω, due to denaturation, mutation, or other modifications were calculated using Eq. (1).

$${\mathrm{\Delta }}\omega = \sqrt {{\mathrm{\Delta }}\omega _{1{\mathrm{H}}}^2 + 0.154^2 *{\mathrm{\Delta }}\omega _{15{\mathrm{N}}}^2}$$

(1)

Chemical denaturation of 5p-4E-BP2 was done using GdmCl and urea. Both sets of denaturation experiments were performed at 20 °C and pH 6.0 with samples in phosphate buffer (30 mM Na₂HPO₄, 100 mM NaCl, 2 mM DTT, 1 mM EDTA, pH 6.0) with various concentration of denaturant added. The pH was adjusted back to pH 6.0 with HCl and NaOH after the addition of GdmCl. The urea concentrations shown are only approximate, due to water and degradation products. ¹⁵N–¹H NMR sensitivity enhanced HSQC spectra⁵¹ were collected at 500 MHz and the chemical shifts are referenced to water.

To investigate the pH-dependence of the stability of the folded domain of 5p-4E-BP2, samples with pH values between 2.2 and 8.0 were made with McIlvaine (phosphate–citrate) buffer⁵² with 2 mM DTT. The pH 1.0 samples were in 30 mM trichloroacetic acid (TCA) with 2 mM DTT (pH adjusted), since this pH is out of McIlvaine buffer’s buffering range. Only the data for samples in McIlvaine buffer were used to fit the pH titration data, and their chemical shifts were referenced to DSS. At the pH values investigated, titration of His, phosphoThr (pThr), Asp, and Glu residues in the phospho domain is expected to be observed. Supplementary Fig. 2a shows a group of folded domain residues shifted toward random coil values (~8.4 ppm). Many show curvatures indicating multistate transitions. To find the pK_a values and pH-dependent free energies of unfolding, we fit the two-state and three-state protonation models using bootstrap minimization with a 1000 iterations implemented with python and numpy. The fitted parameters are given as mean ± standard deviation.

Two-state protonation model:

$$\begin{array}{l}{\mathrm{A}}^{\mathrm{ - }} + {\mathrm{ H}}^{\mathrm{ + }} \rightleftharpoons {\mathrm{BH}}\\ \omega _{{\mathrm{obs}}} = \frac{{\omega _1 + K \ast \omega _2}}{{1 + K}}\\ K = 10^{ - \left( {{\mathrm{{pH}}} - {{\mathrm{p}}K_{\mathrm{a}}}} \right)}\end{array}$$

(2)

$$\Delta G_{{\mathrm{pH}}}^{\mathrm{{unfold}}} = - RT\,{\mathrm{ln}}\left( {10^{ - \left( {{\mathrm{pH}} - {{\mathrm{p}}K_{\mathrm{a}}}} \right)}} \right) = + RT\,{\mathrm{ln}}\left( {10} \right)\left( {{\mathrm{pH}} - {{\mathrm{p}}K_{\mathrm{a}}}} \right)$$

(3)

Three state protonation model: A²⁻ + H⁺ ⇌ BH⁻ + H⁺ ⇌ CH₂

$$\begin{array}{l}{\hskip 15pt}\omega _{{\mathrm{obs}}} = \frac{{\omega _1 + K1 \ast \omega _2 + K1 \ast K2 \ast \omega _3}}{{1 + K1 + K1 \ast K2}}\\ K1 = 10^{ - ({\mathrm{pH}} - {\mathrm{p}}K_{\mathrm{a1}})}\\ K2 = 10^{ - ({\mathrm{pH}} - {\mathrm{p}}K_{\mathrm{a2}})}\end{array}$$

(4)

$$\begin{array}{l}\Delta G1_{{\mathrm{pH}}}^{{\mathrm{unfold}}} = + RT\,{\mathrm{ln}}(10)\left( {{\mathrm{pH}} - {\mathrm{p}}K_{\mathrm{a1}}} \right)\\ \Delta G2_{{\mathrm{{pH}}}}^{{\mathrm{unfold}}} = + RT\,{\mathrm{ln}}(10)\left( {{\mathrm{pH}} - {\mathrm{p}}K_{\mathrm{a2}}} \right)\end{array}$$

(5)

For the pH titration data, the weighted sum of the ¹H and ¹⁵N chemical shifts (ω_obs = ω¹H + 0.154*ω¹⁵N) was used to reduce noise.

The chemical shift behavior was used to distinguish between residues affected by acid denaturation of the folded domain and solvent-exposed residues affected by buffer conditions. Residues that shift toward random coil are likely to be affected by denaturation (Supplementary Fig. 2a). The pK_a values for the residues affected by acid denaturation, referred to in the text as apparent pK_a, varied between 3.62 and 5.40, consistent with heterogeneity in pH-dependent unfolding free energy (Fig. 2a, Supplementary Table 1). Sidechains that titrate independently of denaturation are also observed for some residues. For example, pH titration data for residue Q29 fit to a three-state model. The chemical shift changes at higher pH values move toward random coil and probably reflect denaturation (Supplementary Fig. 2a). The lower pH titration, however, moves in the opposite direction and has pK_a2 = 2.3 ± 0.4. The direction is consistent with proximity to a residue that is becoming less negative at lower pH, probably D26 or D33. H32 has two titrations with pK_a values at pH 6.0 ± 0.2 and 2.6 ± 0.1, most likely due to the H32 and D33 sidechains. The lower pH data points for D33 were excluded from analysis due to peak overlap. However, H32 titrates with a pK_a = 5.9 ± 0.1. Y34 was also affected by H32 with a pK_a1 = 6.0 ± 0.3. T45 fits the three-state model with the lower pH titration moving away from random coil, probably also corresponding to the titration of a nearby acidic residue.

Analysis of NMR thermal denaturation data

¹⁵N–¹H HSQC thermal denaturation spectra were collected for 5p-4E-BP2 samples in pH 1.0, 2.6, 4.0, 5.0, 6.0, 7.0, and 7.4 buffers at temperatures between 5 and 70 °C. 70 °C was the operational temperature limit for the 500 MHz spectrometer and its probe. As with the pH titration data, samples at pH 1.0 were in TCA buffer. The rest were in McIlvaine buffer and the data from only these samples were used in the analysis of thermal denaturation. As a comparison, HSQC spectra of disordered np-4E-BP2 in pH 5.0 McIlvaine buffer were collected at temperatures between 5 and 70 °C.

For the 5p-4E-BP2 data, many of the folded domain peaks do not reach random coil values, even at pH 1.0 and 70 °C (Figs. 1c and 2c), so we could not obtain a complete thermal denaturation curve. We did, however, develop a way to indirectly estimate the thermal denaturation energies. The folded domain is less stable at low pH and is more sensitive to thermal denaturation. Based on this observation, we propose a three-state model, in which protonation is required for thermal denaturation.

Three state thermal model:

$$\begin{array}{l}{\mathrm{A}}^{\mathrm{ - }} + {\mathrm{ H}}^{\mathrm{ + }} \rightleftharpoons {\mathrm{BH}} \rightleftharpoons {\mathrm{CH}}\\ \omega _{{\mathrm{{obs}}}} = \frac{{\omega _1 + K_{{\mathrm{{pH}}}} \ast \omega _2 + K_{{\mathrm{{pH}}}} \ast K_{{\mathrm{{thermal}}}} \ast \omega _3}}{{1 + K_{{\mathrm{{pH}}}} + K_{{\mathrm{{pH}}}} \ast K_{{\mathrm{thermal}}}}}\end{array}$$

(6)

$$\begin{array}{ccccc}{\hskip 12pt}K_{{\mathrm{{pH}}}} = 10^{ - \left( {{\mathrm{{pH}}} - {\mathrm{p}}K_{\mathrm{a}}} \right)}\hfill \\ K_{{\mathrm{thermal}}} = {\mathrm{{e}}}^{ - \Delta G(T)/RT}\hfill\\ \Delta G\left( T \right) = \Delta G^\circ - \Delta S^\circ \ast \left( {T - T^\circ } \right) + \Delta C_{\mathrm{{p}}} \ast \left( {T - T^\circ - T \ast \ln \left( {\frac{T}{{T^\circ }}} \right)} \right)\\ \end{array}$$

(7)

Equation (7), specifically the expression for ΔG(T), has been used previously to model the thermodynamics of protein folding⁵³. As for acid denaturation, we take the pH-independent free energy to be the unfolding energy. For this analysis, the reference temperature for the standard free energy, entropy, and change in heat capacity (denoted with °) is 20 °C, matching the pH titration data.

Since we could not obtain a complete thermal denaturation curve, we instead calculated apparent pK_a′ values at different temperatures. In order to find the apparent pK_a′ at each available temperature, we recast Eq. (6) in two-state form.

Generic, target two-state model:

$$\omega ^\prime = \frac{{\omega _1^\prime + K_{{\mathrm{{pH}}}}^\prime \ast \omega _2^\prime }}{{1 + K_{{\mathrm{{pH}}}}^\prime }}$$

(8)

Converting Eq. (6) into two-state form:

$$\omega _{{\mathrm{{obs}}}} = \frac{{\omega _1 + K_{{\mathrm{{pH}}}} \ast \left( {\omega _2 + K_{{\mathrm{{thermal}}}} \ast \omega _3} \right)}}{{1 + K_{{\mathrm{{pH}}}} \ast \left( {1 + K_{{\mathrm{{thermal}}}}} \right)}}$$

Assuming ω₂ ~ ω₃ = ω₂′ and casting this into the two-state form:

$$\frac{{\omega _1 + K_{{\mathrm{{pH}}}} \ast \left( {1 + K_{{\mathrm{{thermal}}}}} \right) \ast \omega _2^\prime }}{{1 + K_{{\mathrm{{pH}}}} \ast (1 + K_{{\mathrm{{thermal}}}})}} = \frac{{\omega _1^\prime + K_{{\mathrm{{pH}}}}^\prime \ast \omega _2^\prime }}{{1 + K_{{\mathrm{{pH}}}}^\prime }}$$

(9)

We now have a relation between pK_a′, the apparent pK_a that is affected by thermal denaturation, and the actual pK_a:

$$\begin{array}{l}{K}_{{\mathrm{pH}}}^\prime = {K}_{{\mathrm{pH}}} \ast \left( {{\mathrm{1 + }}K_{{\mathrm{thermal}}}} \right)\\ 10^{ - \left( {{\mathrm{{pH}}} - {\mathrm{p}}K_{\mathrm{a}}^\prime } \right)} = 10^{ - \left( {{\mathrm{{pH}}} - {\mathrm{p}}K_{\mathrm{a}}} \right)} \ast \left( {1 + K_{{\mathrm{{thermal}}}}} \right)\\ 10^{ + \left( {{\mathrm{p}}K_{\mathrm{a}}^\prime - {\mathrm{p}}K_{\mathrm{a}}} \right)} - 1 = K_{{\mathrm{{thermal}}}} = {\mathrm{{e}}}^{ - \Delta G(T)/RT}\end{array}$$

(10)

For this model, since K_thermal ≥ 0, then pK_a′ ≥ pK_a. If pK_a is known (see below), then ΔG(T) can be calculated using pK_a′(T) for that temperature. The ΔG(T) values can then be fitted to Eq. (7) for the standard free energy, energy, and change of heat capacity. The errors for parameters extracted from curve fitting were estimated using bootstrap analysis with a 1000 iterations or by propagation of error. The fitted parameters are given as mean ± standard deviation.

To illustrate the method, here we focus on the analysis of G39 thermal denaturation data. The pH titrations for hairpin glycines G39 and G48 are shown in Supplementary Fig. 3b. Each of the different colors represent a pH titration at a specific temperature. In order to find the thermal denaturation energies, ΔG(T), we first estimate the true pK_a. To increase the number of data points to fit, we fit the ¹H and ¹⁵N peaks simultaneously to Eq. (9).

$$\omega _{{\mathrm{{obs}}}}^{1{\mathrm{{H}}}} = \frac{{\omega _1^{1{\mathrm{{H}}}\prime } + K_{{\mathrm{{pH}}}}^\prime \omega _2^{1{\mathrm{{H}}}\prime }}}{{1 + K_{{\mathrm{{pH}}}}^\prime }}$$

$$\omega _{{\mathrm{{obs}}}}^{15{\mathrm{{N}}}} = \frac{{\omega _1^{15{\mathrm{{N}}}\prime } + K_{{\mathrm{{pH}}}}^\prime \omega _2^{15{\mathrm{{N}}}\prime }}}{{1 + K_{{\mathrm{{pH}}}}^\prime }}$$

Suppmentary Fig. 3c and Supplementary Data 1 shows the fitted apparent pK_a′ value at each temperature for G39. For G39, the value of pK_a′ plateaued at its minimum at the three lowest temperatures. The average and standard deviation of these three values were used to estimate the true pK_a and its error: 5.19 ± 0.02. Then we use Eq. (10) to calculate K_thermal, then ΔG(T). After ΔG(T) values are calculated, Eq. (7) is fitted to calculate ΔG⁰, ΔS⁰, and ΔCp (Supplementary Fig. 3d).

In general, pK_a is estimated for other residues by averaging the pK_a′ points in the low temperature plateau. If this method does not work, we fit pK_a′ vs. temperature to linear, cubic, and quadratic curves to roughly estimate the T = 0 °C intercept for each and then average the intercepts. This second method results in much larger uncertainties than the first method. Most folded domain residues have minimal pH-independent free energy at 20 °C, i.e. ΔG° (Supplementary Fig. 3e, Supplementary Table 2). The exceptions are residues near or at the hairpins (pT37, G39, pT46, and G48), which have significantly more unfolding free energy. Theses residues, while not as obviously affected by three-state pH titration events like Q29 and H32, do have chemical shift change trajectories that deviate from the straight line expected for two-state pH titrations (Supplementary Fig. 2a). Each hairpin has a hydrogen bond between a phosphoThr phosphate group and the (i + 2) glycine’s amide group (pT37/G39, pT46/G48)¹¹. This strongly implies that the pH-independent energy estimated here characterizes the strength of the hydrogen bonds in the hairpins. The presence of the additional thermal transition also explains why these residues, while not as obviously affected by three-state pH titrations such as Q29 and H32, have chemical shift trajectories that deviate from the straight line expected for two-state pH titrations.

The observed apparent pK_a′ values increase with temperature (e.g. Supplementary Fig. 3c, Supplementary Data 1). Another possible cause of this phenomenon is the temperature dependence of the buffer pK_a. While this possibility cannot be completely eliminated, it should be noted that the temperature dependence of the pK_a values for phosphate buffers, such as the ones used here, is negative: dpK_a/dT = −0.0028⁵⁴. The hairpin residues associated with pT37 and pT46 have significant thermal denaturation free energies and their apparent pK_a′ increased with increasing temperature, the opposite direction expected for the pK_a shift of a phosphate buffer.

NMR assignment transfer and secondary structure propensities

The H_N, N, H_alpha, C′, C_alpha, and C_beta peak assignments were found using standard triple resonance experiments and experiments specialized for proline-enriched/IDP proteins^51,55,56,57 for pH 3.0 and 7.4 5p-4E-BP2, as well as 3p-4E-BP2 and 4p-4E-BP2 at pH 7.4, aided by the previously published pH 6.0 5p-4E-BP2 assignments¹¹. The secondary structure propensity program SSP²⁸ and each set of H_alpha, C′, C_alpha, and C_beta chemical shifts were used to quantify secondary structure propensities.

NMR paramagnetic resonance enhancement (PRE) data

PRE samples for 2p-4E-BP2 (S65A/T70A/S83A) and 5p-4E-BP2 were constructed using Cys-less versions of both proteins (C35S/C73S). The H32C mutation was made to both constructs with standard Quikchange protocols (Agilent). The proteins were purified as before, labeled with TEMPO-maleimide spin label (Toronto Research Chemicals), phosphorylated (as before), and then given a final purification with gel filtration chromatography. Two PRE samples were made for each protein, one with the spin label oxidized and the other with the tag reduced. To oxidize the samples, five-fold excess of TEMPOL (Toronto Research Chemicals) was mixed with the sample. The reduced samples were made by mixing 1 mM ascorbic acid and the sample. Both the oxidized and reduced samples were incubated overnight with shaking, then dialyzed three times into NMR buffer. PRE data were collected using sensitivity-enhanced HSQC spectra⁵¹ with a recycle delay (d1) of 5 s. For denatured or disordered residues, this method is considered more sensitive to PRE attenuation than the T2 PRE measurement techniques⁵⁸. Separate HSQC spectra were collected for the oxidized and reduced samples at 500 MHz and 20 °C. The signal attenuation for each residue was estimated using the ratio of peak volume for the oxidized sample to that of the reduced sample, V(ox)/V(red). The ratio is proportional to the distance separating the TEMPO tag from the backbone amide group. Therefore, signal attenuation (V(ox)/V(red) < 1) indicates that the TEMPO tag is proximal to a residue. The uncertainty in the signal attenuation is based on the propagation of the fit heights of the peak volumes.

Hairpin motif mutant 4E-BP2 HSQC spectra

¹⁵N–¹H HSQC spectra of hairpin motif mutants were collected at 20 or 25 °C on either a 500 or 600 MHz NMR spectrometer. The wildtype 5p-4E-BP2 spectra shown in Supplementary Fig. 9 were recorded under matching temperature, pH, and magnetic field conditions.

Single-molecule FRET

Single-molecule FRET (smFRET) experiments used two constructs, H32C/S91C with the native cysteines mutated to serine (i.e. C35S/C73S) on the 5p-4E-BP2 and the 2p-4E-BP2 (pT37/pT46 S65A/T70A/S83A) background. The smFRET samples were labeled with thiol-reactive dyes, i.e., Alexa 488 (A488) as donor and Alexa 647 (A647) as acceptor (ThermoFisher Scientific, Canada). For the labeling reaction, the dyes were added to a 50 μL solution of 100 μM protein at a A488:A647:protein molar ratio of 1.3:3:1 in the presence of tris(2-carboxyethyl)phosphine (TCEP) at a 10× molar excess to the protein. All the maleimide–cysteine coupling reactions were performed in PBS buffer at pH 7.4. Oxygen was removed by flushing the sample with argon gas in a desiccator for 5 min. The vial was capped tightly and shaken gently for 3 h at room temperature. The excess dye was removed by size-exclusion chromatography using Sephadex G-50 gels (G5080, Sigma Aldrich) in a BioLogic LP system (731-8300, Bio-Rad).

In order to estimate the intensity correction factor (γ) for smFRET analysis⁵⁹, donor-only and acceptor-only labeled protein samples were prepared using a similar protocol. Using the free dyes as reference, the fluorescence quantum yield (QY) of A488 attached to 4E-BP2 was estimated to be 0.76, 0.81, and 0.83 for non-phospho, two-phospho, and five-phospho conditions, respectively. Under similar conditions, the 4E-BP2-bound A647 had QY values of 0.32, 0.34 and 0.35, respectively. The error margins of the estimated QY values are around 5–10%.

All samples were diluted to concentrations of 20–50 pM for smFRET burst experiments. For a typical experiment, a sample solution of ~30 μL was dropped on the surface of plasma-cleaned glass coverslip. To prevent protein adsorption, the coverslip was coated with bovine serum albumin (BSA) (15260-037, ThermoFisher Scientific) as described previously^60,61, and 0.005% (v/v) Tween-20 (P2287, Sigma-Aldrich) was added to the solution. All experiments were performed at 20 °C.

smFRET measurements were performed on a custom-built multiparameter fluorescence microscope that was described in detail elsewhere^60,61. The sample is excited at 480 nm by femtosecond pulses and the fluorescence signal is filtered and then recorded by sensitive avalanche photodiodes, which are fed into a time-correlated counting module (PicoHarp300, PicoQuant). A custom-written Matlab code identified and analyzed dual-color fluorescence bursts, and displayed the results as FRET efficiency histograms.

The FRET efficiency (E) was calculated based on the number of detected photons in both donor (I_D) and acceptor (I_A) channels in each single-molecule intensity burst⁵⁹

$$E = \frac{{{I}_{\mathrm{A}}}}{{{I}_{\mathrm{A}} + \gamma {I}_{\mathrm{D}}}},$$

(11)

where γ is the correction factor for differences in detection efficiencies of donor and acceptor channels and quantum yields of the dyes. In addition, corrections were applied on both I_D and I_A to subtract background and spectral crosstalk. The dyes retained considerable rotational freedom under all conditions investigated and therefore the reported FRET E values are good measures of proximity distances and can be directly compared among various conditions tested^33,62. E was calculated for each detected burst, and all values obtained from a sample were used to build a histogram. Gaussian fits to the data were used to indicate the locations of the peak FRET efficiency for the non-zero Gaussian. The average FRET values have an error margin of ±0.01 (the fitting error).

Fluorescence correlation spectroscopy

FCS experiments were performed on a custom-built instrument following a protocol described in detail elsewhere⁶³. FCS experiments used constructs in the Cys-less (C35S/C73S) 4E-BP2 background with one of six Cys mutations: C0 (inserted before first residue), S14C, C35, C73, S91C, and C121 (inserted after sequence). Each single cysteine protein was labeled with A488 by adding the A488 fluorophores at a dye:protein molar ratio of 3:1 to a 50 μl solution of 100 μM protein. The rest of the preparation proceeded as described above for the smFRET samples. Non-phospho and five-phospho samples were diluted to concentrations of 1–10 nM in pH 3 and pH 7.4 buffers and FCS data was acquired for all four conditions (Supplementary Fig. 6). The coverslips were prepared as before and all experiments were performed at 20 °C. The laser excites the sample at intensities of ~5 kW/cm² in FCS measurements. The experimental correlation decay curves were fit to a to the typical model of molecular diffusion and dye photophysics, as given by Eq. (12):⁶¹

$$G\left( \tau \right) = \frac{1}{{N_{{\mathrm{{eff}}}}}}\left( {1 + \frac{\tau }{{\tau _{\mathrm{{d}}}}}} \right)^{ - 1}\left( {1 + \frac{\tau }{{s^2\tau _{\mathrm{{d}}}}}} \right)^{ - 0.5}\mathop {\sum }\limits_i \left( {1 + a_i{\mathrm{{e}}}^{ - \tau /\tau _{\mathrm{{d}}}}} \right)$$

(12)

In Eq. (12), N_eff is the average number of molecules in the confocal detection volume; s is the ratio between the axial and the lateral radii of the detection ellipsoid (z₀/w₀); τ_d is the diffusion time through the confocal volume, which is related to the diffusion coefficient (w₀² = 4Dτ_d) and the hydrodynamic radius, R_H, of the molecule via the Stokes–Einstein equation⁶⁴:

$$R_{\mathrm{{H}}} = \frac{{k_{\mathrm{{B}}}T}}{{6\pi \eta D}}$$

(13)

In addition, a_t and τ_t are the amplitude and the lifetime of the triplet (dark) state of the fluorophore, which also causes fluorescence intensity fluctuations⁶⁵. Prior to each set of FCS measurements, a sample of Rhodamine 110 dye was used to characterize the geometrical parameters of the confocal detection volume⁶⁴.

ITC affinity measurements for phospho 4E-BP2:eIF4E binding

The ITC instrument (MicroCal ITC200) and data collection methods (using Auto-ITC200, an Origin software package from MicroCal) are detailed previously¹¹. The ITC data for each construct is shown in Supplementary Fig. 8. The K_D values and their uncertainties are from this work or Bah et al.¹¹ (see Supplementary Fig. 7). Repeat data collections (n = 3 for 5p-4E-BP2; n = 2 for pT37/pT46/pT70, pT37/pT46/pS65, pT37/pT46/pT70/pS83, pT37/pT46/pS65/pS83, pT37/pT46/pS65/pT70; n = 1 for pT37/pT46/pS83 due to sample and data quality) were taken from different samples, some from the same preparation and others from different preparations. The K_d values and their uncertainties were estimated as the mean and standard deviations. The WT 5p-4E-BP2 K_d value found for this article is comparable to that from Bah et al.¹¹.

Bioinformatic assessment of pTPGGT structure and sequences

In order to identify sequences compatible with the hairpin structure found in the folded domain of 4E-BP2 (PDB ID 2m × 4), we first defined a secondary structure and hydrogen bonding-based definition of the hairpin that could be categorized using the DSSP secondary structure assignments³⁶. Potential turns were identified using an 11-mer DSSP secondary structure assignment of ‘?’-‘E’-‘E’-‘ ‘-‘T’-‘T’-‘ ‘-‘ ‘-‘E’-‘E’-‘?’ to identify potential turns, where ‘E’ is extended conformation, ‘T’ is a turn, ‘?’ means any class, and a blank (‘ ‘) means any class but E, T, H (alpha-helix), or G (3₁₀ helix). The following set of hydrogen bonds were then used to confirm that they match the hairpins observed in the 4E-BP2 structure: 2-O to 10-NH, 2-NH to 10-0, 4-O to 7-NH, and 4-NH to 8-O.

We then downloaded a comprehensive set of 153,773 DSSP files from the DSSP database (rsync://rsync.cmbi.umcn.nl/dssp). Since many structures in the PDB are redundant, we chose to use this redundancy to prioritize hairpins that are consistently observed even when solved multiple times, first identifying 49,938 non-redundant protein chains by using the PISCES PDB culling server⁶⁶ (resolution threshold of 3 Å, r-factor threshold of 1.0, and a sequence identity threshold of 90%) and then by clustering all of the culled redundant chains with their corresponding non-redundant exemplar. For each one of these sets we then collated the complete list of unique 11-mer sequences observed, and only accepted hairpins when they satisfy the DSSP criteria in at least 50% of the chains they are observed in over the cluster. 8917 cluster-unique hairpin observations were considered this way and 1136 were rejected for being inconsistent across multiple protein structures, leaving N = 7781 confident hairpin-containing 11-mers identified over a set of 5611 nonredundant protein chains.

To make a score function for hairpin prediction we generated a PSSM^67,68 for these 11-residue windows by taking the log2 probability of each amino acid at each position divided by the overall amino acid probability, with probabilities factoring in a small pseudocount of 0.1. For scoring 11-mers we then take the sum of log scores over each position. Positional frequencies used in generating PSSMs are diagrammed in Fig. 8a, b.

In order to assess the ability of this PSSM score to predict hairpins in the [TS]PxG motifs being studied we then went back through our protein clusters and identified a set of 5015 11-mers which match the sequence filter “xxx[TS]PxGxxxx”, 858 of which were found to also contain consistent hairpins structures, for a negative/positive ratio of ~4.8 to 1. Unlike the negative data, the positive data overlap with the sequences used to define the PSSM. This is because the positive data come from 570 of the 5611 nonredundant protein clusters identified previously. To assess the method’s ability to score hairpins that it has not been trained on, we used five-fold cross-validation repeated 100 times, with training and testing sets defined by splitting the list of nonredundant [TS]PxG containing PDB chains used, and then by removing test set hairpin sequences from the pool used to generate the PSSM. ROC-AUC for these tests averaged to 0.83 ± 0.03 (Fig. 8c), which is reasonable for the purpose of predicting enrichment.

Correlations between hairpin propensity scores for [TS]PxG motifs and proteomic data were tested by scoring unique xxx[TS]PxGxxxx motifs containing 11-mer sequences found in the PhosphoSitePlus³⁷ database, independently for both the human and mouse proteomes. To minimize the influence of highly repetitive sequences, 11-mers which show up multiple times in the same protein were reduced to the most C-terminal example, leaving 8015 human 11-mers from 6619 proteins and 6826 mouse 11-mers from 5084 proteins. We assessed three properties related to ubiquitination using the following definitions: (1) direct ubiquitination, measured by the observation of ubiquitination events in the PhosphoSitePlus database that in the primary sequence are within 30 residues of the 11-mer in question, (2) presence of local lysine residues, measured by the occurrence of lysine in the 30 residues preceding the 11-mer sequence or the 30 residues after it (representing local regions which are not scored directly by the PSSM score), and (3) overlap with phosphodegron motifs [LIVMP]x(0.2)TPxxE and [LIVMP]x(0.2)TPxx[ST]^37,39,40.

Enrichment was measured by comparing log2(O/E) for the frequency of observed features in high scoring sets of sequences divided by the frequency over the full set. Significance of enrichment was tested using bootstrap analysis, sampling the 6619 human and 5084 mouse proteins (with replacement) 10,000 times to calculate the 99% confidence intervals. Doing this across multiple PSSM score thresholds shows that there is significant enrichment (p < 0.01) for the top 20% by PSSM score over all three properties for both human and mouse (Fig. 8d–i), and for 5 out of 6 of the top 10%.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The source data (NMR chemical shift and peak intensity data) underlying Figs. 2a, b, 3, 4, 5, Supplementary Figs. 1a, 3, and 4, as well as Supplementary Tables 1 and 2 are provided in the source data file. The Hα, Cα, Cβ, CO, Hn, and N chemical shift values used to calculate secondary structural propensities are also found in the BMRB with the ID 50253. All other data are available from the corresponding author on reasonable request. Source data are provided with this paper.

Code availability

Data analysis scripts are available from GitHub repositories. Python scripts for image generation and NMR chemical shift analysis are found in the directory jdawson_scripts at https://github.com/jenniferdawson/4EBP2_nmr/. Python scripts for bioinformatics analysis are available from ‘https://github.com/jenniferdawson/Robert_Vernon_Bioinformatics_HairpinScoring’. The scripts for fluorescence data analysis are located in https://github.com/ccgradi72/singlemolecule2. All scripts are also available in the Source Data file. Source data are provided with this paper.

References

Sonenberg, N. & Hinnebusch, A. G. Regulation of translation initiation in eukaryotes: mechanisms and biological targets. Cell 136, 731–745 (2009).
Article CAS PubMed PubMed Central Google Scholar
Siddiqui, N. & Sonenberg, N. Signalling to eIF4E in cancer. Biochem. Soc. Trans. 43, 763–772 (2015).
Article CAS PubMed PubMed Central Google Scholar
Gingras, A.-C. et al. Regulation of 4E-BP1 phosphorylation: a novel two-step mechanism. Genes Dev. 13, 1422–1437 (1999).
Article CAS PubMed PubMed Central Google Scholar
Pause, A. et al. Insulin-dependent stimulation of protein synthesis by phosphorylation of a regulator of 5’-cap function. Nature 371, 762–767 (1994).
Article ADS CAS PubMed Google Scholar
Poulin, F., A.C.G., Olsen, H., Chevalier, S. & Sonenberg., N. 4E-BP3, a new member of the eukaryotic initiation factor 4E-binding protein family. J. Biol. Chem. 273, 14002–14007 (1998).
Article CAS PubMed Google Scholar
Salvi, N., Papadopoulos, E., Blackledge, M. & Wagner, G. The role of dynamics and allostery in the inhibition of the eIF4E/eIF4G translation initiation factor complex. Angew. Chem. Int. Ed. Engl. 55, 7176–7179 (2016).
Article CAS PubMed PubMed Central Google Scholar
Gruner, S. et al. The structures of eIF4E–eIF4G complexes reveal an extended interface to regulate translation initiation. Mol. Cell 64, 467–479 (2016).
Article PubMed CAS Google Scholar
Lukhele, S., Bah, A., Lin, H., Sonenberg, N. & Forman-Kay, J. D. Interaction of the eukaryotic initiation factor 4E with 4E-BP2 at a dynamic bipartite interface. Structure 21, 2186–2196 (2013).
Article CAS PubMed Google Scholar
Igreja, C., Peter, D., Weiler, C. & Izaurralde, E. 4E-BPs require non-canonical 4E-binding motifs and a lateral surface of eIF4E to repress translation. Nat. Commun. 5, 4790 (2014).
Article ADS CAS PubMed Google Scholar
Gingras, A. C. et al. Hierarchical phosphorylation of the translation inhibitor 4E-BP1. Genes Dev. 15, 2852–2864 (2001).
Article CAS PubMed PubMed Central Google Scholar
Bah, A. et al. Folding of an intrinsically disordered protein by phosphorylation as a regulatory switch. Nature 519, 106–109 (2015).
Article ADS CAS PubMed Google Scholar
Brunn, G. J. et al. Phosphorylation of the translational repressor PHAS-I by the mammalian target of rapamycin. Science 277, 99–101 (1997).
Article CAS PubMed Google Scholar
Peter, D. et al. Molecular architecture of 4E-BP translational inhibitors bound to eIF4E. Mol. Cell 57, 1074–1087 (2015).
Article CAS PubMed Google Scholar
Yanagiya, A. et al. Translational homeostasis via the mRNA cap-binding protein, eIF4E. Mol. Cell 46, 847–858 (2012).
Article CAS PubMed PubMed Central Google Scholar
Wright, P. E. & Dyson, H. J. Intrinsically disordered proteins in cellular signalling and regulation. Nat. Rev. Mol. Cell Biol. 16, 18–29 (2015).
Article CAS PubMed PubMed Central Google Scholar
Uversky, V. N. Unusual biophysics of intrinsically disordered proteins. Biochim. Biophys. Acta 1834, 932–951 (2013).
Article CAS PubMed Google Scholar
Fletcher, C. M. et al. 4E binding proteins inhibit the translation factor eIF4E without folded structure. Biochemistry 37, 9–15 (1998).
Article CAS PubMed Google Scholar
Marcotrigiano, J., A.C.G., Sonenberg, N. & Burley, S. K. Cap-dependent translation initiation in eukaryotes is regulated by a molecular mimic of eIF4G. Mol. Cell 3, 707–716 (1999).
Article CAS PubMed Google Scholar
Fukuyo, A., In, Y., Ishida, T. & Tomoo, K. Structural scaffold for eIF4E binding selectivity of 4E-BP isoforms: crystal structure of eIF4E binding region of 4E-BP2 and its comparison with that of 4E-BP1. J. Pept. Sci. 17, 650–657 (2011).
Article CAS PubMed Google Scholar
Matsuo, H. et al. Structure of translation factor eIF4E bound to m7GDP and interaction with 4E-binding protein. Nat. Struct. Biol. 4, 717–724 (1997).
Article CAS PubMed Google Scholar
Sekiyama, N. et al. Molecular mechanism of the dual activity of 4EGI-1: dissociating eIF4G from eIF4E but stabilizing the binding of unphosphorylated 4E-BP1. Proc. Natl Acad. Sci. USA 112, E4036–E4045 (2015).
Article CAS PubMed PubMed Central Google Scholar
Tait, S. et al. Local control of a disorder-order transition in 4E-BP1 underpins regulation of translation via eIF4E. Proc. Natl Acad. Sci. USA 107, 17627–17632 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Gosselin, P. et al. The translational repressor 4E-BP called to order by eIF4E: new structural insights by SAXS. Nucleic Acids Res. 39, 3496–3503 (2011).
Article CAS PubMed Google Scholar
Tomlinson, J. H. & M. P., W. Amide temperature coefficients in the protein G B1 domain. J. Biomol. NMR 52, 57–64 (2012).
Article CAS PubMed Google Scholar
Baxter, N. J. & Williamson, M.P. Temperature dependence of 1H chemical shifts in proteins. J. Biomol. NMR 9, 359–369 (1997).
Article CAS PubMed Google Scholar
Cierpicki, T. & Otlewski, J. Amide proton temperature coefficients as hydrogen bond indicators in proteins. J. Biomol. NMR 21, 249–261 (2001).
Article CAS PubMed Google Scholar
Tollinger, M., Kay, L. E. & Forman-Kay, J. D. Measuring pKa values in protein folding transition state ensembles by NMR spectroscopy. J. Am. Chem. Soc. 127, 8904–8905 (2005).
Article CAS PubMed Google Scholar
Marsh, J. A., Singh, V. K., Jia, Z. & Forman-Kay, J. D. Sensitivity of secondary structure propensities to sequence differences between alpha- and gamma-synuclein: implications for fibrillation. Protein Sci. 15, 2795–2804 (2006).
Article CAS PubMed PubMed Central Google Scholar
Gomes, G. N. & Gradinaru, C. C. Insights into the conformations and dynamics of intrinsically disordered proteins using single-molecule fluorescence. Biochim. Biophys. Acta Proteins Proteom. 1865, 1696–1706 (2017).
Article CAS PubMed Google Scholar
Clore, G. M. & Iwahara, J. Theory, practice, and applications of paramagnetic relaxation enhancement for the characterization of transient low-population states of biological macromolecules and their complexes. Chem. Rev. 109, 4108–4139 (2009).
Article CAS PubMed PubMed Central Google Scholar
Mukhopadhyay, S., Krishnan, R., Lemke, E. A., Lindquist, S. & Deniz, A. A. A natively unfolded yeast prion monomer adopts an ensemble of collapsed and rapidly fluctuating structures. Proc. Natl Acad. Sci. USA 104, 2649–2654 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Rhoades, E., Cohen, M., Schuler, B. & Haran, G. Two-state folding observed in individual protein molecules. J. Am. Chem. Soc. 126, 14686–14687 (2004).
Article CAS PubMed Google Scholar
Zhang, Z. Single-Molecule Spectroscopy of Disordered States and Dynamics in Proteins. (University of Toronto, Toronto, 2017).
Google Scholar
Kume, A. et al. RNAi of the translation inhibition gene 4E-BP identified from the hard tick, Haemaphysalis longicornis, affects lipid storage during the off-host starvation period of ticks. Parasitol. Res. 111, 889–896 (2012).
Article PubMed Google Scholar
Lasko, P. The drosophila melanogaster genome: translation factors and RNA binding proteins. J. Cell Biol. 150, F51–F56 (2000).
Article CAS PubMed Google Scholar
Touw, W. G. et al. A series of PDB-related databanks for everyday needs. Nucleic Acids Res. 43, D364–D368 (2015).
Article CAS PubMed Google Scholar
Hornbeck, P. V. et al. PhosphoSitePlus, 2014: mutations, PTMs and recalibrations. Nucleic Acids Res. 43, D512–D520 (2015).
Article CAS PubMed Google Scholar
Ravid, T. & Hochstrasser, M. Diversity of degradation signals in the ubiquitin–proteasome system. Nat. Rev. Mol. Cell Biol. 9, 679–690 (2008).
Article CAS PubMed PubMed Central Google Scholar
Dinkel, H. et al. ELM 2016–data update and new functionality of the eukaryotic linear motif resource. Nucleic Acids Res. 44, D294–D300 (2016).
Article ADS CAS PubMed Google Scholar
Welcker, M. & Clurman, B. E. FBW7 ubiquitin ligase: a tumour suppressor at the crossroads of cell division, growth and differentiation. Nat. Rev. Cancer 8, 83–93 (2008).
Article CAS PubMed Google Scholar
Andrew, C. D., Warwicker, J., Jones, G. R. & Doig, A. J. Effect of phosphorylation on alpha-helix stability as a function of position. Biochemistry 41, 1897–1905 (2002).
Article CAS PubMed Google Scholar
Bellay, J. et al. Bringing order to protein disorder through comparative genomics and genetic interactions. Genome Biol. 12, R14 (2011).
Article CAS PubMed PubMed Central Google Scholar
The UniProt Consortium. UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res. 47, D506–D515 (2019).
Madden, T. The BLAST sequence analysis tool. In The NCBI Handbook (eds McEntyre, J. & Ostell, J.) (National Center for Biotechnology Information (US), 2002).
Sievers, F. et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 7, 539 (2011).
Article PubMed PubMed Central Google Scholar
Leaver-Fay, A. et al. ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. Methods Enzymol. 487, 545–574 (2011).
Article CAS PubMed PubMed Central Google Scholar
Alford, R. F. et al. The Rosetta all-atom energy function for macromolecular modeling and design. J. Chem. Theory Comput. 13, 3031–3048 (2017).
Article CAS PubMed PubMed Central Google Scholar
Steinbrecher, T., Latzer, J. & Case, D. A. Revised AMBER parameters for bioorganic phosphates. J. Chem. Theory Comput. 8, 4405–4412 (2012).
Article CAS PubMed PubMed Central Google Scholar
Delaglio, F. et al. NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR 6, 277–293 (1995).
Article CAS PubMed Google Scholar
Lee, W., Tonelli, M. & Markley, J. L. NMRFAM-SPARKY: enhanced software for biomolecular NMR spectroscopy. Bioinformatics 31, 1325–1327 (2015).
Article PubMed Google Scholar
Kay, L., Keifer, P. & Saarinen, T. Pure absorption gradient enhanced heteronuclear single quantum correlation spectroscopy with improved sensitivity. J. Am. Chem. Soc. 114, 10663–10665 (1992).
Article CAS Google Scholar
McIlvaine, T. C. A buffer solution for colorimetric comparison. J. Biol. Chem. 49, 183–186 (1921).
CAS Google Scholar
Hawley, S. A. Reversible pressure–temperature denaturation of chymotrypsinogen. Biochemistry 10, 2436–2442 (1971).
Article CAS PubMed Google Scholar
AppliChem. Biological Buffers. A Brochure Providing Information on Biological Buffers. www.applichem.com (2008).
Redfield, C. Assignment of Protein NMR Spectra Using Heteronuclear NMR — A Tutorial. Protein NMR: Modern Techniques and Biomedical Applications 32, 1–42 (2015).
Article CAS Google Scholar
Panchal, S. C., Bhavesh, N. S. & Hosur, R. V. Improved 3D triple resonance experiments, HNN and HN(C)N, for HN and ¹⁵N sequential correlations in (¹³C, ¹⁵N) labeled proteins: Application to unfolded proteins. J. Biomol. NMR 20, 135–147 (2001).
Article CAS PubMed Google Scholar
Kanelis, V., Donaldson, L., Muhandiram, R., Rotin, D., Forman-Kay, J. & Kay, L. Sequential assignment of proline-rich regions in proteins: Application to modular binding domain complexes. J. Biomol. NMR 16, 253–259 (2000).
Article CAS PubMed Google Scholar
Xue, Y. et al. Paramagnetic relaxation enhancements in unfolded proteins: theory and application to drkN SH3 domain. Protein Sci. 18, 1401–1424 (2009).
Article CAS PubMed PubMed Central Google Scholar
Gopich, I. V. & Szabo, A. Theory of the energy transfer efficiency and fluorescence lifetime distribution in single-molecule FRET. Proc. Natl Acad. Sci. USA 109, 7747–7752 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Liu, B. et al. The effect of intrachain electrostatic repulsion on conformational disorder and dynamics of the Sic1 protein. J. Phys. Chem. B 118, 4088–4097 (2014).
Article CAS PubMed Google Scholar
Mazouchi, A. et al. Conformations of a metastable SH3 domain characterized by smFRET and an excluded-volume polymer model. Biophys. J. 110, 1510–1522 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Badali, D. & Gradinaru, C. C. The effect of Brownian motion of fluorescent probes on measuring nanoscale distances by Förster resonance energy transfer. J. Chem. Phys. 134, 225102 (2011).
Article ADS PubMed CAS Google Scholar
Zhang, Z., Yomo, D. & Gradinaru, C. Choosing the right fluorophore for single-molecule fluorescence studies in a lipid environment. Biochim. Biophys. Acta Biomembr. 1859, 1242–1253 (2017).
Article CAS PubMed Google Scholar
Mazouchi, A., Liu, B., Bahram, A. & Gradinaru, C. On the performance of bioanalytical fluorescence correlation spectroscopy measurements in a multiparameter photon-counting microscope. Anal. Chim. Acta 688, 61–69 (2011).
Article CAS PubMed Google Scholar
Haustein, E. & Schwille, P. Fluorescence correlation spectroscopy: novel variations of an established technique. Annu. Rev. Biophys. Biomol. Struct. 36, 151–169 (2007).
Article CAS PubMed Google Scholar
Wang, G. & Dunbrack, R. L. Jr. PISCES: a protein sequence culling server. Bioinformatics 19, 1589–1591 (2003).
Article CAS PubMed Google Scholar
Altschul, S. F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997).
Article CAS PubMed PubMed Central Google Scholar
Yaffe, M. B. et al. A motif-based profile scanning approach forgenome-wide prediction of signaling pathways. Nat. Biotechnol. 19, 348–353 (2001).
Article CAS PubMed Google Scholar
Crooks, G. E., Hon, G., Chandonia, J. M. & Brenner, S. E. WebLogo: a sequence logo generator. Genome Res. 14, 1188–1190 (2004).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank and acknowledge Mickael Krzeminski and Rhea Hudson for useful discussions during the execution of this project and Lewis Kay and Ranjith Muhandiram for their expert help with NMR spectroscopy. All ITC and DSC experiments were performed at The Hospital for Sick Children’s Structural & Biophysical Core Facility with the assistance of Greg Wasney. The phosphorylation status of each 4E-BP2 mutant was confirmed at the AIMS Mass Spectrometry Laboratory, Department of Chemistry, University of Toronto. This work was funded by the Canadian Institutes of Health Research (CIHR, MOP-119579, to J.D.F.-K.) and the Natural Sciences and Engineering Research Council of Canada (NSERC, RGPIN 2017—06030 to C.G). A.B. was partly supported by a CIHR post-doctoral fellowship, and Z.Z. was supported by a doctoral CIHR Training Grant.

Author information

These authors contributed equally: Jennifer E. Dawson, Alaji Bah.

Authors and Affiliations

Program in Molecular Medicine, The Hospital for Sick Children, Toronto, ON, M5G 0A4, Canada
Jennifer E. Dawson, Alaji Bah, Robert M. Vernon, Hong Lin, P. Andrew Chong, Manasvi Vanama & Julie D. Forman-Kay
Department of Biochemistry and Molecular Biology, SUNY Upstate Medical University, Syracuse, NY, 13210, USA
Alaji Bah
Department of Chemical and Physical Sciences, University of Toronto, Mississauga, ON, L5L 1C6, Canada
Zhenfu Zhang & Claudiu C. Gradinaru
Department of Biochemistry, McGill University, Montreal, QC, H3G 1Y6, Canada
Nahum Sonenberg
Goodman Cancer Research Centre, McGill University, Montreal, QC, H3A 1A3, Canada
Nahum Sonenberg
Department of Physics, University of Toronto, Toronto, ON, M5S 1A7, Canada
Claudiu C. Gradinaru
Department of Biochemistry, University of Toronto, Toronto, ON, M5S 1A8, Canada
Julie D. Forman-Kay

Authors

Jennifer E. Dawson
View author publications
You can also search for this author in PubMed Google Scholar
Alaji Bah
View author publications
You can also search for this author in PubMed Google Scholar
Zhenfu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Robert M. Vernon
View author publications
You can also search for this author in PubMed Google Scholar
Hong Lin
View author publications
You can also search for this author in PubMed Google Scholar
P. Andrew Chong
View author publications
You can also search for this author in PubMed Google Scholar
Manasvi Vanama
View author publications
You can also search for this author in PubMed Google Scholar
Nahum Sonenberg
View author publications
You can also search for this author in PubMed Google Scholar
Claudiu C. Gradinaru
View author publications
You can also search for this author in PubMed Google Scholar
Julie D. Forman-Kay
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.E.D., A.B., and J.D.F.-K. conceived the project and designed and analyzed the NMR, ITC, and DSC experiments, with contributions from N.S. C.G., and Z.Z. designed and analyzed the smFRET and FCS experiments. J.E.D., A.B., H.L., and M.V. prepared reagents. J.E.D., A.B., Z.Z. and P.A.C. performed experiments. R.M.V. conducted the bioinformatic analysis. J.E.D. wrote the manuscript with contributions from A.B., R.M.V. and Z.Z. J.E.D., A.B., R.M.V., N.S., C.G. and J.D.F.-K. edited the manuscript.

Corresponding author

Correspondence to Julie D. Forman-Kay.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Brandon M. Invergo, and other, anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dawson, J.E., Bah, A., Zhang, Z. et al. Non-cooperative 4E-BP2 folding with exchange between eIF4E-binding and binding-incompatible states tunes cap-dependent translation inhibition. Nat Commun 11, 3146 (2020). https://doi.org/10.1038/s41467-020-16783-8

Download citation

Received: 31 October 2019
Accepted: 15 May 2020
Published: 19 June 2020
DOI: https://doi.org/10.1038/s41467-020-16783-8

This article is cited by

Control of the eIF4E activity: structural insights and pharmacological implications
- Alice Romagnoli
- Mattia D’Agostino
- Daniele Di Marino
Cellular and Molecular Life Sciences (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.