# Methionine in a protein hydrophobic core drives tight interactions required for assembly of spider silk

## Abstract

Web spiders connect silk proteins, so-called spidroins, into fibers of extraordinary toughness. The spidroin N-terminal domain (NTD) plays a pivotal role in this process: it polymerizes spidroins through a complex mechanism of dimerization. Here we analyze sequences of spidroin NTDs and find an unusually high content of the amino acid methionine. We simultaneously mutate all methionines present in the hydrophobic core of a spidroin NTD from a nursery web spider’s dragline silk to leucine. The mutated NTD is strongly stabilized and folds at the theoretical speed limit. The structure of the mutant is preserved, yet its ability to dimerize is substantially impaired. We find that side chains of core methionines serve to mobilize the fold, which can thereby access various conformations and adapt the association interface for tight binding. Methionine in a hydrophobic core equips a protein with the capacity to dynamically change shape and thus to optimize its function.

## Introduction

Over hundreds of millions of years spiders evolved the construction of silk webs of different geometries tailored for various purposes including prey capture, reproduction and shelter1. They use up to seven glands specialized for these purposes. In each of them soluble silk proteins, so-called spidroins, are being tightly connected during their passage through the spinning duct and brought out of solution in a controlled fashion to form a fiber2. Spiders evolved mechanisms that carefully control the ordered phase and structural transitions in order to assure fabrication of high-quality material and to avoid early and lethal fibrillation within the gland. Dragline silk, which is used as a lifeline and to build the web frame, is formed by spidroins from the major ampullate gland (MaSp). The dragline represents the toughest known biological thread and is thus a current focus of biomimetic material sciences3,4.

To date, the molecular mechanisms that underlie phase and structural transitions of spidroins are only understood in parts. At the beginning of the process, spidroins are stored in soluble form in the ampulla of a spinning gland, which is located in the spider’s abdomen. On demand, they pass through the tapering duct where they experience mechanical and chemical stimuli that transform them into silk2,3,4. The N-terminal and C-terminal domains of spidroins (NTD and CTD) fulfill critical functions during storage and assembly5. Both domains are highly conserved five-helix bundles that provide water-solubility and connectivity5. The CTD covalently connects two spidroins through formation of a homo-dimer6,7. The NTD, on the other hand, contains a relay that triggers dimerization upon a change of solution conditions within the spinning duct4,8. A decrease of pH along the duct leads to tight self-association of the NTDs, which connects and polymerizes spidroins9,10,11,12. The mechanism of NTD dimerization is conserved across glands and species and involves site-specific protonation of surface charges and conformational change4,8,9,10,11,12,13,14,15,16,17. While the contribution of surface charges and their protonation to dimerization has been investigated intensely9,10,12,13,16, the mechanism of conformational change and its role in dimerization remains largely unexplored. Conformational change involves motion of helices that are part of the association interface, which tilt upon dimerization and adopt perfectly self-complementary surfaces14,15. Helix rearrangement is associated with motion of a single, conserved tryptophan (Trp) from buried to solvent-exposed position4,14,17. The driving force underlying these conformational changes and their energetic contribution to spidroin association, and hence spider silk formation, are unknown.

Here we find that side chains of the amino acid methionine (Met), which are present in unusually high numbers in the core of the NTD, are responsible for conformational changes of the domain. We simultaneously mutate all core Met in the NTD of MaSp1 from the nursery web spider Euprosthenops australis to leucine (Leu) and make surprising observations. The mutant gained a substantial amount of stability compared to the wild-type protein and its structure is fully preserved. Its ability to dimerize is considerably impaired. Conformational dynamics of the mutant are stalled, in contrast to the wild-type protein. Our results show that Met side chains in the NTD core facilitate structural plasticity, which tightens dimerization through shape-optimization of a mobile binding interface.

## Results

### Unusual amino acid composition of MaSp NTDs

In search for the mechanism of conformational change we analyzed the amino acid composition of aligned sequences of MaSp NTDs, published previously8. The amino acid composition shows striking peculiarities. Alanine and serine are most abundant with a combined content of ~30%. Only very few charged side chains are present and the domain is unusually rich in Met. We focussed specifically on the NTD of MaSp1 from the nursery web spider E. australis because this representative has been intensely investigated in the past8,11,12,13,14,17. The sequence contains 12.4% alanine, which is more than the average 8.1% alanine found in proteins from all three domains of life (Bacteria, Archaea and Eukaryota)18. Alanine is known to stabilize helical secondary structure19 and is presumably important for the structural integrity of the five-helix bundle. The NTD contains only 9% charged side chains, which is little compared to the average content of 25% charged side chains typically found in proteins18. Water-solubility of proteins is commonly provided by charged and hydrophilic side chains located on a surface. Water-solubility of the NTD appears to be facilitated by its unusually high content of 16.1% serine, which is a hydrophilic amino acid that can compensate the lack of charges. The average content of serine in bacterial, archaeal and eukaryotic proteins, by contrast, is only 6.2%18.

Met is commonly an infrequent amino acid with an average content of only 2.5% in proteins from across all phyla of life18. The content of Met in dragline silk is even lower (0.2–0.4%)20. It therefore took us by surprise to find that NTDs from MaSps contain a substantial number of Met residues (content of 7.4% Met in the sequence of MaSp1 NTD from E. australis, Fig. 1a). Met is unevenly distributed along a spidroin sequence and accumulates in the NTD. We analyzed the location of Met side chains in the NTD using sequence alignment and homology of available structural data8,14,15,16. We found that the majority of Met residues are buried in the hydrophobic core and/or involved in tertiary interactions of helices. Only few Met residues were solvent-exposed (Fig. 1a). The high abundance of Met in the domain core indicated an unknown structural and/or functional role of this side chain.

### Solution structure of a Met-depleted spidroin NTD

We investigated the structural role of Met in the NTD of MaSp1 from E. australis. Visual inspection of the structure (PDB IDs: 2LPJ and 3LR2)8,14 showed that six of a total of ten Met side chains of the 137-residue domain are located in the core and/or are involved in tertiary interactions. The branched aliphatic side chain of Leu has a similar hydrophobicity and size as Met and can therefore serve as a suitable replacement21,22. To investigate the potential structural, dynamic and energetic effects of the six core Met we replaced them simultaneously by Leu, yielding a construct that we termed L6-NTD. Even though mutations in protein cores are generally not well tolerated and often act destabilizing23, we found that L6-NTD expressed well and at high yield. Far-UV circular dichroism (CD) spectroscopy showed that the helical secondary structure of the wild-type NTD (WT-NTD) was retained in the mutant (Fig. 1b). Using NMR spectroscopy, we determined the atomic-resolution solution structure of L6-NTD at pH 7.0, i.e., under conditions where the domain is a monomer (Fig. 1c, Supplementary Fig. 1, Supplementary Table 1, PDB ID: 6QJY). The solution structure showed that L6-NTD is a well-folded, five-helix bundle with no significant deviations from the structure of the WT-NTD. The helices of L6-NTD and WT-NTD have the same respective lengths and form the same tertiary interactions. The all-atom alignment of residues 9–129 (omitting the flexible, unstructured C-terminal and N-terminal tails) of the lowest-energy conformer of L6-NTD with that of the WT-NTD14 yielded a heavy-atom RMSD of 1.2 Å. The backbone heavy-atom RMSD was 1.2 Å. Helix orientations and positions of mutated side chains superimposed well. The conformation of the conserved Trp, which is wedged in the center of the helix bundle14, was also preserved. We concluded that replacement of the six core Met by Leu had no impact on the structure of the NTD.

### L6-NTD folds at the speed limit

In order to investigate the influence of the Met-to-Leu mutations on the mechanism and energetics of folding, we performed chemical and thermal denaturation experiments of L6-NTD under pH 7.0 solution conditions where the NTD is a monomer. Equilibrium denaturation data revealed a dramatic increase of stability of L6-NTD compared to WT-NTD (Fig. 2a–c, Table 1). The free energy of folding increased from 5.6 ± 0.1 kcal/mol of WT-NTD to 8.0 ± 0.3 kcal/mol of L6-NTD (mean value ± s.d. of denaturation data measured by CD and fluorescence). The equilibrium m-value (mD-N) of L6-NTD was slightly reduced compared to the WT value (Table 1). Changes in m-value are known to correlate with changes in accessible surface area between native and denatured states24. Since the structures of native WT-NTD and L6-NTD overlay (Fig. 1c), the slightly lower m-value measured for L6-NTD indicates that its denatured state is slightly more compact than that of WT-NTD. The melting temperature Tm increased from originally 61.0 ± 0.1 °C of WT-NTD to 81.0 ± 0.2 °C of L6-NTD (±s.e. from regression analysis). To test whether stabilization resulted from a single mutation or was an additive effect25 we generated six cumulative point mutants that build up L6-NTD (L1: M20L; L2:M20L/M24L; L3: M20L/M24L/M41L; L4: M20L/M24L/M41L/M48L; L5: M20L/M24L/M41L/M48L/M77L; L6: M20L/M24L/M41L/M48L/M77L/M101L) and analyzed their thermal stabilities. We found a cumulative increase of melting temperature, which showed that the observed effect of stabilization was additive (Fig. 2c). Mutation M48L (L4) had little effect on stability of L3 and thus behaved differently (Fig. 2c, inset). The structure shows that the side chain of residue M48 is squeezed between helix 2 and helix 3, and is located close to the protein surface. M48 appears to form a less well consolidated van-der-Waals interaction network than the other Met side chains probed, which may explain the lack of effect upon mutating this residue. On the other hand, Met is reported26 to interact specifically with Trp side chains, which may cause a stabilizing effect. Removing this stabilizing interaction through mutation may be balanced by the added stability increment from substitution by Leu.

To gain insight into the origin of the enhanced stability of L6-NTD, we measured its kinetics of folding using stopped-flow Trp fluorescence spectroscopy (Fig. 2d, e). Our previous kinetic experiments performed on a set of homologous spidroin NTDs shows that WT-NTDs fold rapidly on a time scale of ~100 µs through a barrier-limited two-state transition11. Here we found that L6-NTD folded 40 times faster than the WT-NTD (Table 1). The rate constant of folding of L6-NTD, extrapolated to zero denaturant, was kf = 519,000 ± 213,000 s−1 (±s.e. from regression analysis). Synergy of theory and experiment predicts that the speed of folding of a generic N-residue single-domain protein is limited to τf = N/100 µs, which is referred to as the speed limit of folding27. Following this theory, the speed limit of folding of the 137-residue NTD is ~1.4 µs. For L6-NTD we determined τf = 1/kf = 1.9 ± 0.8 µs, which corresponds to the theoretical speed limit. Besides the observed increase of kf we found that L6-NTD was stabilized by a 15-fold decrease of unfolding rate constant compared to the WT-NTD (Table 1). From kf and ku of WT-NTD and L6-NTD we estimated a decrease of the free energy barrier between the denatured and the transition state by ~2.2 kcal/mol, and an increase of the free energy barrier between the native and the transition state by ~1.6 kcal/mol, of L6-NTD compared to WT-NTD (Fig. 2f).

### Met-depletion of the NTD core impairs dimerization

Conservation of structure and increase of stability suggested that the L6 mutant should retain functionality. Functionality of NTDs can be tested by measuring their ability to undergo pH-induced dimerization, in analogy to what happens in a spider’s spinning duct4,9,12,13,28. NTDs at pH 6.0 are tight dimers that exhibit dissociation constants (Kd) in the low nanomolar range11. We characterized the equilibrium dimerization of L6-NTD and its response to changing solution conditions using size-exclusion chromatography in combination with multi-angle light scattering spectroscopy (SEC-MALS). At pH 7.0 and at 200 mM ionic strength, L6-NTD eluted homogenously as a monomer, as expected, similar to WT-NTD (expected molecular weight, Mw = 14 kDa; measured Mw = 14 kDa). At pH 6.0 and 60 mM ionic strength, however, the average molecular mass of L6-NTD measured by SEC-MALS was between that of a monomer and a dimer (expected Mw = 28 kDa; measured Mw = 20 kDa; Fig. 3a). The detected intermediate mass can be interpreted as that of a system in rapid, dynamic equilibrium between a monomer and a dimer. The intermediate MW indicated an elevated Kd of L6-NTD compared to WT-NTD, which fell in the µM sample concentration range applied in this experiment. WT-NTD measured under the same conditions showed a molecular mass close to the value expected for a dimer (expected Mw: 28 kDa; measured Mw: 26 kDa, Fig. 3a). The measured value was only 7% below the expected value for a dimer. The discrepancy was little above the precision of the measurement (±3–5%) and may be explained by small amounts of salt (60 mM ionic strength) present in SEC experiments, which were required to reduce sticking of protein material to the column. Ions in solution shield electrostatics and induce dissociation of the WT-NTD9,10,12. High concentrations of salt lead to Debye–Hückel29 screening of attractive electrostatic forces in the dimerization interface. This explains residual population of monomer in the dimer elution band and consequently a lower detected molecular mass. In order to strengthen dimerization we reduced the ionic strength of the pH 6.0 buffer to 8 mM. Under these conditions, the measured molecular mass of L6-NTD was closer to that of a dimer (Fig. 3b). Dimerization of L6-NTD was thus apparently stabilized by electrostatic forces that are screened at high solution ionic strength, similar as previously observed for WT-NTD9,10,12. Upon progressive reduction of concentration of L6-NTD samples in SEC-MALS experiments we measured a decrease of molecular weight and an increase of elution volume (VE), confirming that, despite low ionic strength, dimerization was still weak. The Kd of L6-NTD was in the concentration range probed during measurement (i.e., Kd in the µM range) (Fig. 3b). However, we were not able to obtain a full binding isotherm required to quantify Kd because MALS detection was not sensitive enough to probe sub-µM protein concentrations. We therefore measured VE of dilute L6-NTD samples in pH 6.0 buffer using high-resolution SEC in combination with fluorescence detection, which had sub-µM sensitivity. An increase of L6-NTD concentration shifted the VE to lower values indicating formation of dimers (Fig. 3c). From concentration-dependent data we obtained a binding isotherm that fitted well to a thermodynamic model of dimerization, yielding a Kd of 1.1 ± 0.2 µM (±s.e. from regression analysis) (Fig. 3d). This Kd was three orders of magnitude higher than the value reported for WT-NTD (Kd = 1.1 ± 0.1 nM)11. VE of WT-NTD at pH 6.0 did not change significantly over the probed protein concentration range because it was far above Kd11 (Fig. 3d). However, there was a slight increase of VE of WT-NTD probed in the 10-nM concentration range, which may indicate the onset of dimer dissociation. The VE of the WT-NTD dimer was significantly higher than the value of the L6-NTD dimer, which indicated that the L6-NTD dimer had larger dimensions. Expansion of the L6-NTD dimer can be explained by loose association of subunits. To test if high-resolution SEC data reported reliably on Kd of the dimer, we conducted thermophoresis experiments as an alternative probe for dimerization (Fig. 3e). Using thermophoresis we obtained a Kd of 3.6 ± 1.8 µM, which was in reasonable agreement with the value obtained by high-resolution SEC.

Next, we investigated the influence of individual Met side chains on size and stability of the NTD dimer by measuring dimerization of six cumulative Met-to-Leu point mutants using high-resolution SEC (mutations are detailed above in denaturation experiments). The obtained isotherms are shown in Fig. 4a–e. The Kd increased progressively with increasing number of mutations. At the same time, the VE of the dimer decreased with increasing number of mutations (Fig. 4f). This showed that the dimensions of the dimer increased progressively with increasing number of mutations, which indicated progressive loosening of the complex.

### Met facilitates conformational changes upon dimerization

Dimerization of the NTD is known to be associated with conformational changes. Helices that form the dimerization interface tilt and the conserved Trp (W10) moves out of its binding pocket to solvent-exposed position14. This is evident from the quenching and red-shift of Trp fluorescence emission, effects that are conserved across NTDs from various glands and species8,9,11,13,14. We found that the L6-NTD lacked these fluorescence characteristics. There was no red-shift and only minor quenching of Trp fluorescence emission upon dimerization (Fig. 5a). The observation indicated that in L6-NTD Trp W10 remained in buried position both in the monomeric and in the dimeric state. Fluorescence characteristics changed gradually with increasing number of Met-to-Leu mutations (Fig. 5b), similar as observed for the change of stability and of Kd (Figs. 2c and 4f). Full conformational change of Trp thus required the presence of several core Met.

We measured and analyzed 1H, 15N HSQC NMR spectra of L6-NTD and WT-NTD recorded in both the monomeric and the dimeric states. The full 1H, 15N HSQC NMR data set is shown in Supplementary Fig. 2. In agreement with our results from fluorescence spectroscopy, 1H, 15N-Trp chemical shifts of L6-NTD measured by NMR spectroscopy showed only minute perturbations upon change of solution pH from 7.0 to 6.0, which was in stark contrast to what we observed for the WT-NTD (Fig. 5c). The NMR data showed that the environment of W10 in L6-NTD remained essentially unchanged upon dimerization, contrasting the substantial changes of the environment of W10 in WT-NTD. Figure 5d shows the respective differences in chemical shifts of all assigned residues in WT-NTD and L6-NTD between pH 7.0 and pH 6.0 (full HSQC spectra are provided as Supplementary Fig. 2). Residues of helices H1, H4, and H5 of L6-NTD could be reliably assigned at pH 6.0. However, assignment of helices H2 and H3 of L6-NTD was complicated by line broadening caused by dynamic exchange on the intermediate time scale, in agreement with the reduced dimer affinity. The observation supports our finding that mutation of Met to Leu in the domain core has profound consequences both for dynamics and function of the protein and prevents the dimer interface to adopt a conformation suitable for high-affinity dimerization. A loose L6-NTD dimer was found in the high-resolution SEC experiments described above (Fig. 4f). The loose dimer presumably shows rapid inter-molecular interaction dynamics between subunits that may enhance NMR line broadening. However, analysis of the many assigned residues showed that chemical shift changes upon dimerization covered the entire sequence and were in general substantially larger for WT-NTD compared to L6-NTD (Fig. 5d). The result was in agreement with strong dimer interactions and more extensive conformational changes in WT-NTD, which are apparently blocked in L6-NTD. In L6-NTD, the majority of chemical shift changes of residues that are part of the dimerization interface resulted from weak dimer interactions. Interestingly, we found pronounced chemical shift changes of resonances of glycine residues located in loops connecting helices of WT-NTD upon dimerization. These chemical shift changes were absent in L6-NTD (Fig. 5e). Since the glycine-rich NTD loops are not involved in dimerization, their significant chemical shift changes upon dimerization in WT but not in L6-NTD indicated that Met promotes conformational changes that are remote from the core and the dimerization interface.

### Met drives native-state dynamics of the NTD monomer

Having established that Met enables structural changes required for dimerization, we set out to investigate the influence of core Met on native-state conformational dynamics of the NTD monomer. For insights into per-residue dynamics and changes in solvent-accessibility of residues in the L6-NTD structure in comparison to WT-NTD, we performed NMR-based hydrogen/deuterium exchange experiments (Fig. 6a). We found that proton-to-deuterium exchange in L6-NTD was on average ~10-fold slower compared to WT-NTD, i.e., L6-NTD was overall less dynamic and the exchange-competent conformations were less frequently accessible than in WT-NTD. For L6-NTD, H/D exchange was significantly slower not just at the positions of mutation but throughout all five helices, i.e., through residues 18–28, 41–51, 72–81, 98–107, and 118–128 (Fig. 6a, b). In contrast, the local backbone dynamics on the fast nanosecond-to-picosecond time scale were not influenced by the mutations, as indicated by heteronuclear NOE measurements (Fig. 6c, d). Protein dynamics on the nanosecond-to-picosecond time scale generally report on local events, while dynamics on time scales of microseconds (µs) and slower reflect collective conformational motions30. Met residues in the core of the NTD thus appeared to facilitate collective motions rather than local, uncoupled flexibility.

To probe conformational motions on the µs time scale we applied photoinduced electron transfer (PET) in combination with fluorescence correlation spectroscopy (PET-FCS), a technique that measures conformational dynamics from single-molecule fluorescence fluctuations17,31,32. We probed dynamics of the conserved Trp residue (W10), which moves in and out of the five-helix bundle17. To this end, we modified the side-chain thiol of an engineered cysteine introduced at the N-terminus, vicinal to W10 (mutant G3C), using the thiol-reactive fluorophore AttoOxa11 (Fig. 6e). Motion of W10 from buried to solvent-exposed position lead to contact-induced quenching of the fluorescence label via PET. PET-FCS autocorrelation functions recorded from AttoOxa11-modified WT-NTD and L6-NTD showed three decays. One decay was on the millisecond (ms) time scale and caused by molecular diffusion of the NTD through the detection focus. Two additional decays on the sub-ms time scale were caused by PET fluorescence fluctuations that reported on Trp conformational change (Fig. 6f). The sub-ms decays were dominated by a single-exponential kinetic phase with a relaxation time constant τ1 = 240 ± 17 µs and an amplitude a1 = 0.64 ± 0.03 measured for WT-NTD, and τ1 = 529 ± 19 µs and a1 = 0.35 ± 0.01 measured for L6-NTD (±s.e. from regression analysis). The second kinetic phase had τ2 = 2 ± 1 µs and a2 = 0.19 ± 0.04 measured for WT-NTD; and τ2 = 4 ± 1 µs and a2 = 0.05 ± 0.01 measured for L6-NTD. The observed reduction of amplitude and increase of relaxation time constant of the main kinetic phase indicated that the mobility of W10 was reduced in L6-NTD compared to WT-NTD. From τ1 and a1 we estimated33 the microscopic time constants of Trp conformational change (Fig. 6g). The time constant of W10 moving out of the binding pocket, as happens upon dimerization, was substantially larger for L6-NTD compared to WT-NTD (τoutWT = 0.61 ± 0.06 ms, τoutL6 = 2.1 ± 0.1 ms; Fig. 6g). Thus, core Met in WT-NTD accelerated the release of Trp W10 from buried to solvent-exposed position.

## Discussion

Synthesis of silk fibers by web spiders involves dimerization of the spidroin NTD, which polymerizes protein building blocks4,9,10. The seemingly simple process underlies a complex multi-step mechanism4,12,13 triggered by a gradual change of pH along the spinning duct28. A gradual mechanism makes physiological sense because soluble spidroins need to orient and align during their passage through the spinning duct before inter-molecular interactions consolidate. Sequential protonation gives rise to early, attractive long-range electrostatic forces between pairs of NTDs. In the early complex, tight dimerization is prevented by a mismatch of shape of the dimerization interface4,14, i.e., the subunits have the wrong conformation. Late rearrangement of helices adapts the interface and tightens binding. What is the mechanism behind these conformational changes and what is their energetic contribution to dimerization? Our study provides answers to these questions.

The shape of a protein surface is determined by the shape of the protein hydrophobic core23. Side chains in core position are commonly tightly packed and essentially immobile. Their interaction network resembles a jigsaw puzzle34 where side chains are in extensive van-der-Waals contact and in low-energy conformation23. Structures of NTDs show a common core that is densely packed with hydrophobic side chains8,14,15,16. But we found that the rare18,20 amino acid Met was unusually overrepresented (Fig. 1a). This accumulation of Met may not be seen as a surprise because the amino acid has similar hydrophobicity and size compared to other aliphatic side chains frequently present in the core region of proteins21,22. However, we made surprising observations concerning the role of Met in structure and stability of the domain and revealed an intriguing functional role of its side chain, as we discuss below.

In protein engineering experiments the replacement of core side chains is usually avoided because it often leads to considerable destabilization of the fold23,35. Yet, we found that the simultaneous replacement of six core Met in the NTD yielded a well-folded protein that was substantially more stable than the wild type (~50% increase of ΔG; Table 1). Extensive modification of the domain core and the resulting change of stability would typically suggest a change of structure. Surprisingly, the fold of L6-NTD was fully preserved with no noticeable deviations from the wild type (Fig. 3c). Stabilization and conservation of structure can be expected to preserve function. But dimerization of L6-NTD was dramatically impaired: its Kd was ~1000-fold higher than the value of the wild type. The NTD thus emerges as yet another example of a protein where residues at positions of functional importance impair stability36. But what exactly does Met do to the NTD to enhance dimerization at the expense of stability?

We found that side chains of core Met facilitate motion of secondary and tertiary structure. This was evident from NMR hydrogen-exchange and PET-FCS experiments, which revealed that dynamics of the Met-depleted domain were substantially slower than dynamics of the Met-rich domain (Fig. 6). Models of protein-protein association suggest that native-state dynamics transiently populate activated states, which are competent to bind37. The NTD likely gains access to such activated states through Met-driven conformational changes, which prime the domain for tight dimerization. But how does Met induce such dynamics?

Met is unique among all 20 natural amino acids in that its side chain is highly flexible38. This unusual flexibility originates from a low energy barrier to rotation around the thioether bond38. Met positioned on a protein surface can make the surface ductile and thereby facilitate promiscuous binding38,39,40. Recently, reversible oxidation of solvent-exposed Met side chains has been reported to promote protein phase transition41. Oxidation of the thioether sulfur changes its polarity, but presumably also stiffens the side chain. Our results show that Met side chains in the core of the NTD transfer their flexibility on large parts of the structure and thereby malleablize it (Fig. 6). The lack of Met in L6-NTD blocked conformational changes required for tight dimerization. However, the electrostatics of the domain remained unchanged. L6-NTD can thus serve as a model for the elusive intermediate state formed along the pathway of multi-step dimerization13, in which early changes of surface electrostatics through protonation precede late conformational change. We can dissect the energetic contributions from electrostatics and conformational change. The total free energy of dimerization, which contains both contributions, can be calculated from the equilibrium dissociation constant of WT-NTD (Kd = 1.1 ± 0.1 nM)11: ΔGWT = -RTln(Kd) = −12.3 ± 0.1 kcal/mol (±propagated s.e. from regression analysis). The electrostatic contribution can be calculated from the Kd of L6-NTD (Kd = 1.1 ± 0.2 µM) where conformational changes are blocked and the domains are loosely held together by electrostatic forces: ΔGelec = -RTln(Kd) = −8.2 ± 0.1 kcal/mol. The portion of the free energy resulting from conformational change is thus: ΔGconf = ΔGWT − ΔGelec = −4.1 ± 0.1 kcal/mol, which represents a substantial fraction of the total free energy of dimerization.

We found that Met-to-Leu mutations accelerated folding and strongly stabilized the NTD monomer. Folding of L6-NTD was ultrafast and at the theoretical speed limit27 (Table 1). Its folding time constant was close to one µs, which places L6-NTD amongst the fastest folding protein domains of this size27. Acceleration of folding is explained by changes of structure and/or interactions in the denatured or transition state of folding. Those effects are generally hard to analyze because both states are elusive to experimental observation42. It has been suggested that the large polarizability of the sulfur atom of Met renders the side chain more sticky at van der Waals contact compared to other hydrophobic side chains38. Met may thus induce stronger interactions in the denatured state and thereby reduce its free energy. Mutation of Met abolishes such interactions and, in turn, increases the free energy of the denatured state, which brings it closer to the transition state. A reduced free energy gap between denatured and transition state will increase the rate constant of folding (Fig. 2f). On the other hand, replacement of Met by Leu slowed unfolding (Table 1), which shows that these mutations stabilized the native state. Leu has a branched aliphatic side chain that perhaps forms more interlocked and extensive van der Waals interactions in the native state compared to the linear Met side chain, which can explain the observed stabilization.

Stable proteins are more capable to evolve new or improved functions compared to unstable ones because they can better tolerate mutations43. Reconstructed ancestors are consequently more stable than their modern descendants44. We hypothesize that the stable, Leu-rich core of L6-NTD resembles an early ancestor of the highly evolved Met-rich core. Phylogenetic studies show that Met is among the most frequently gained amino acids in protein evolution18. DNA codons of Met, Leu and isoleucine (Ile) differ by only one nucleobase. Evolutionary transformation of Leu or Ile to Met requires thus only single nucleotide exchange and has higher probability than transformation to other amino acids. Alignment of homologous sequences16 shows that Met is overrepresented in NTDs of major and minor ampullate, as well as of aciniform spidroins. Interestingly, Met, Leu and Ile often share the same sequence positions (Fig. 7). In fact, the sum of Met, Leu and Ile is conserved in each NTD and the lack of Met in NTDs from tubuliform or cylindrical silk is compensated by Leu or Ile (Fig. 8). This supports our hypothesis that a Leu/Ile-rich NTD is an early ancestor of the Met-rich NTD. It is tempting to speculate about the influence of Met in the spidroin NTD on the mechanical properties of the different silk types they constitute. Silks containing Met-rich, high-affinity NTDs should exhibit higher strength compared to silks containing Met-depleted, low-affinity NTDs. Major and minor ampullate, as well as aciniform silks, which are used by web spiders to build a web and for wrapping prey, are reported to exhibit high toughness45. Indeed, their spidroin NTDs show high content of Met (Fig. 8). Tubuliform silk, on the other hand, is a flocculent silk used to build the egg case, same as cylindrical silk. It is less tough45 and its spidroin NTDs contain no Met (Fig. 8).

We learned from web spiders that the accumulation of Met in a protein hydrophobic core induces structural plasticity and enables dynamical excursions to conformational states, which, in the case of the spidroin NTD, facilitate tight binding. Mobilization of a hydrophobic core can modulate a protein surface where function is defined and optimize it. We may thus be able to improve or change the functionality of proteins by engineering hydrophobic cores to become more dynamic.

## Methods

### Protein expression, mutagenesis and modification

The synthetic gene (GeneArt, Thermo Fisher Scientific) of the MaSp1 NTD from E. australis, was cloned into a pRSETA vector (Invitrogen, Thermo Fisher Scientific) containing an N-terminal His6-tag followed by a thrombin cleavage site for proteolytic removal of the tag11. For site-directed mutagenesis experiments the QuikChange mutagenesis protocol (Stratagene) was used. We generated L6-NTD through cumulative single-point Met-to-Leu mutants. Met sequence positions mutated to Leu were 20, 24, 41, 48, 77, and 101. NTDs and mutants thereof were overexpressed in Escherichia coli C41 (DE3) bacterial cells and isolated from clarified cell lysate using affinity chromatography on a Ni-Sepharose 6 Fast-Flow column (GE Healthcare), followed by proteolytic cleavage of the His6-tag using thrombin from bovine plasma (Sigma-Aldrich)11. Proteins were purified using size exclusion chromatography (SEC) on a Superdex 75 column (GE Healthcare) in 200 mM ammonium bicarbonate, pH 8.0. Purity of isolated protein was confirmed using SDS-PAGE. Pooled protein fractions from SEC were lyophilized. For thermophoresis and PET-FCS experiments, the single-point cysteine mutants Q50C and G3C were modified with the thiol-reactive maleimide derivative of the oxazine fluorophore AttoOxa11 (Atto-Tec). A 5-fold molar excess of fluorophore was added to the Cys mutant dissolved in 50 mM 3-(N-morpholino)propanesulfonic acid (MOPS), pH 7.5, containing 6 M guanidinium chloride and a 10-fold molar excess of tris(2-carboxyethyl)phosphine (TCEP) to prevent thiol oxidation. The labeling reaction was carried out for 2.5 h at 298 K. Labeled protein was isolated using SEC on a Sephadex G-25 column (GE Healthcare).

For NMR studies, cells were grown in minimal media supplemented with 15N ammonium chloride and 13C glucose as the sole nitrogen and carbon sources46. Isotope labeled protein was purified as described above.

### Far-UV CD spectroscopy

Far-UV CD spectroscopy was carried out using a Jasco J-815 CD spectrometer equipped with a Peltier thermocouple. Sample temperature was set to 298 K throughout all experiments, except for thermal denaturation experiments where a temperature ramp of 1 K/min was applied. 10 μM protein samples were measured in a 1 mm path-length cuvette (Hellma). Chemical and thermal denaturation was monitored at 222 nm, i.e., the wavelength of maximal amplitude of α-helix secondary structure. Chemical denaturation was performed by manual titration between 0 and 8 M Urea in 50 mM MOPS, pH 7.0, with the ionic strength adjusted to 200 mM using potassium chloride. Thermal denaturation was carried out in 50 mM phosphate buffer, pH 7.0, with the ionic strength adjusted to 200 mM using potassium chloride.

Trp fluorescence spectroscopy was carried out using a Jasco FP-6500 spectrometer. In chemical denaturation experiments Trp fluorescence emission intensities were recorded from 10 μM protein samples measured in a 10 mm path-length cuvette (Hellma). Spectra were recorded from 100 µM protein samples in either 50 mM phosphate, pH 7.0, with the ionic strength adjusted to 200 mM using potassium chloride (monomer conditions) or in 20 mM MES, pH 6.0 and 8 mM ionic strength (dimer conditions). The high protein concentration was applied to ensure dimerization of Met-depleted NTD mutants. Sample temperature was controlled using a Peltier thermocouple set to 298 K throughout all experiments. Chemical denaturation experiments were conducted under the same solution conditions as described above for CD spectroscopy.

### Stopped-flow fluorescence spectroscopy

Kinetics of folding and unfolding were measured at 298 K using a SFM-2000 stopped-flow fluorescence spectrometer (BioLogic) equipped with a 280-nm diode as excitation source, monitoring Trp fluorescence emission of the NTD. The temperature was adjusted using a circulating water bath. One hundred micromolar protein samples were prepared in 50 mM MOPS, pH 7.0, with the ionic strength adjusted to 200 mM using potassium chloride, containing either zero or six molar urea. All samples were filtered through 0.2 µm syringe filters before measurement and rapidly mixed into urea solutions of varying concentration applying a volumetric mixing ratio of 1:10 using the stopped-flow machine.

### SEC-MALS and high-resolution SEC experiments

SEC-MALS was carried out using a Superdex-75 HR10/300 or on a Superdex-200 HR10/300 analytical gel filtration column (GE Healthcare) run at 0.5 ml min−1 in either 50 mM phosphate, pH 7.0, with the ionic strength adjusted to 200 mM using KCl (monomer conditions) or in 20 mM MES, pH 6.0, with the ionic strength adjusted to 60 mM using KCl or in 20 mM MES, pH 6.0 at 8 mM ionic strength (dimer conditions). The sample loading concentration reduced by 10-fold in the detection flow-path because of sample dilution on the column. Stated concentrations represent the maximal level during the experiment. At the edge of the peaks concentrations will be lower than this. The elution was followed in a standard SEC-MALS format using SEC UV absorbance, light scattering intensity measured with a Wyatt Heleos II 18 angle instrument, and finally excess refractive index using a Wyatt Optilab rEX instrument. The Heleos detector 12 was replaced with a Wyatt’s QELS detector for dynamic light scattering measurements. Protein concentration was determined from the excess differential refractive index based on 0.186 RI increment for 1 g/ml protein solution. Concentrations and observed scattered intensities at each point in the chromatograms were used to calculate the absolute molecular mass from the intercept of the Debye plot, using Zimm’s formalism as implemented in Wyatt’s ASTRA software.

High-resolution SEC experiments were conducted using a Jasco HPLC work station with fluorescence detection equipped with a Superdex-75 Increase 10/300 column (GE Healthcare). SEC was run at a flow rate of 0.8 ml/min and in 20 mM MES, pH 6.0, with the ionic strength adjusted to 20 mM using potassium chloride. VE of NTD constructs, which were applied at various concentrations, was determined from the peaks of elution bands detected measuring the native Trp fluorescence signal. Native Trp fluorescence was excited at 280 nm and emission intensities were recorded at 330 nm.

### Thermophoresis experiments

Microscale thermophoresis was measured using a Nanotemper Monolith instrument equipped for fluorescence excitation in the far-red spectral range. Forty nanomolar samples of labeled AttoOxa11-modified L6-NTD mutant Q50C were incubated with increasing concentrations of unlabeled L6-NTD and loaded into standard, untreated capillaries (Nanotemper). After localization of capillaries, thermophoresis was performed at 50% heating power and at 50% illumination intensity. The data were expressed as the ratio of fluorescence intensities before and at the end of 30 s of heating when the new, stable equilibrium intensities had been achieved.

### PET-FCS

PET-FCS was carried out on a confocal fluorescence microscope setup17 (Zeiss Axiovert 100 TV) equipped with a diode laser emitting at 637 nm (Coherent Cube) and a high numerical aperture oil-immersion objective lens (Zeiss Plan Apochromat, 63×, NA 1.4). The average laser power was 400 µW before entering the back aperture of the microscope lens. The fluorescence signal was shared by two fiber-coupled avalanche photodiode detectors (APDs; Perkin Elmer, SPCM-AQRH-15-FC). Signals of APDs were recorded in the fast cross-correlation mode using a digital hard-ware correlator device (ALV 5000/60 × 0 multiple tau digital real correlator). One nanomolar sample of fluorescently modified NTDs were prepared in 50 mM phosphate buffer, pH 7.0, with the ionic strength adjusted to 200 mM using potassium chloride, containing 0.3 mg/ml bovine serum albumin (BSA) and 0.05% Tween-20 as additives to suppress glass surface interactions. Samples were filtered through a 0.2 µm syringe filter before measurement. The sample temperature was set to 298 K using an objective heater. For each sample, three individual ACFs were recorded of 10 min measurement time each.

### NMR spectroscopy

NMR measurements were carried out on Bruker AVANCE 600, 700, 800, 900, and 950 MHz spectrometers equipped with cryogenic triple resonance probes. Experiments were carried out at 298 K and a protein sample concentration of 300 µM. The proton chemical shifts of 13C, 15N-labeled L6-NTD were referenced to 2.2-dimethyl-2-silapentane-5-sulfonic acid (DSS). The heteronuclear 13C and 15N chemical shifts were indirectly referenced with the appropriate conversion factors47. All spectra were processed using Bruker TopSpinTM 2.1 or 3.2 and analyzed using the programs CARA48 and CcpNmr Analysis 2.249.

Backbone resonance assignments of 13C, 15N-L6-NTD at pH 7 at a protein concentration of 300 µM were obtained from HNCO, HN(CA)CO, HNCA, and HNCACB spectra, whereas side chain assignments were obtained from (H)C(CO)NH, H(CCO)NH, HCCH-TOCSY, CBCACONH, HBHACONH spectra. All experiments were recorded with standard Bruker pulse sequences including water suppression with WATERGATE50. Side chains were additionally assigned with the help of 13C-NOESY-HSQC (mixing time 150 ms, aliphatic carbons) and 15N-NOESY-HSQC (mixing time 150 ms) experiments51.

Ninety-six percent of the protein backbone could be assigned, including proline residues, whereas aliphatic side chain protons and carbons were assigned with 85% and 84%, respectively. The aromatic protons were assigned with 33%.

Partial backbone assignments of 13C, 15N-L6-NTD at pH 6 could be obtained at a protein concentration of 600 µM where the protein is mostly dimeric using an HNCA spectrum.

For the WT NTD at pH 7, the published backbone assignments of WT at pH 7.2 (BMRB Entry: 18262) could be directly transferred to our spectra, while published backbone assignments of WT at pH 5.5 (BMRB Entry 18480) were used to assign WT at pH 6.

For structure calculation of L6-NTD, peak picking and NOE assignment was performed with the ATNOS/CANDID module in UNIO52 in combination with CYANA53 using the 3D NOESY spectra listed above. Peak lists were reviewed manually to correct for artefacts. Distance restrains were obtained using the CYANA based automated NOE assignment and structure calculation protocol53. Torsion angle restraints were derived from backbone H, N, Cα, Cβ chemical shifts using TALOS+54.

The final set of torsion angle and distance restraints was used to calculate 100 conformers with CYANA. Twenty structures with the lowest target function were submitted to a restrained energy refinement with OPALp55 and the AMBER94 force field56. The structure was validated with the Protein Structure Validation Software suite 1.557 restricted to residues with hetNOE values > 0.6 (Supplementary Table 1).

{1H},15N-heteronuclear nuclear Overhauser effect (hetNOE) experiments for 15N-labeled WT-NTD and L6-NTD were recorded using Bruker standard pulse sequences. Experiments were run in an interleaved fashion with and without proton saturation during the recovery delay. Peak integrals were obtained using Bruker TopSpin 3.2. The {1H},15N-hetNOE data were recorded on a Bruker 600 MHz spectrometer in an interleaved manner with a 1H saturation period of 5 s duration on-resonant or 10,000 Hz off-resonant for the cross-experiment and reference experiment, respectively. The relaxation delay was set to 3 s. Two consecutive {1H},15N-hetNOE experiments of both WT and L6-NTD were carried out and their average values are shown in Fig. 6c.

For H/D exchange experiments, WT-NTD and L6-NTD was lyophilized in an NMR tube, respectively, to remove H2O. The dried protein was dissolved in an equivalent amount of D2O and immediately placed into the NMR spectrometer to record a series of 1H,15N-HSQC spectra with 12 min intervals.

### Data analysis

Equilibrium denaturation data were fitted using the thermodynamic model for two-state folding11. The spectroscopic signal S was expressed as a function of the perturbation P58:

$$S\left( P \right)\;=\;\frac{{\alpha _{\mathrm{N}} + \beta _{\mathrm{N}} \cdot P + \left( {\alpha _{\mathrm{D}} + \beta _{\mathrm{D}} \cdot P} \right) \cdot {\mathrm{exp}}\;(- \Delta G_{{\mathrm{D}} - {\mathrm{N}}}(P)/RT)}}{{1\;+\;{\mathrm{exp}}\;(- \Delta G_{{\mathrm{D}} - {\mathrm{N}}}(P)/RT)}}$$
(1)

where the P was either heat (T, thermal denaturation) or concentration of urea ([urea], chemical denaturation), αN, βN, αD, and βD characterized the linearly sloping baselines of native (N) and denatured (D) states, R was the gas constant, and ΔGD−N the difference in free energy between D and N.

ΔGD−N as a function of denaturant is described by the linear-free energy relationship59:

$$\Delta G_{{\mathrm{D}} - {\mathrm{N}}}\left( {\left[ {{\mathrm{urea}}} \right]} \right)\;=\;\Delta G_{{\mathrm{D}} - {\mathrm{N}}} - m_{{\mathrm{D}} - {\mathrm{N}}}[{\mathrm{urea}}]$$
(2)

where mD−N is the equilibrium m-value that describes the sensitivity of the folding equilibrium to denaturant. Experimental errors of ΔGD–N were determined from propagated errors (s.e.) of fitted values of mD−N and mid-point concentrations of urea ([urea]50%).

Analysis of thermal denaturation data to determine Tm was performed using Eq. 1 in combination with the Gibbs–Helmholtz formalism11.

Kinetic transients of folding/unfolding measured using stopped-flow spectroscopy were fitted to a single exponential function containing a linear baseline drift:

$$S\left(t \right) = a\;\exp \left({- k_{{\mathrm{obs}}}t} \right) + bt + c$$
(3)

S(t) was the fluorescence signal as function of time, a was the amplitude and kobs the observed rate constant. The parameters b and c described the linear drift of the baseline11. kobs is the sum of the microscopic rate constants for folding and unfolding (kf and ku). The change of kobs as a function of denaturant concentration was analyzed by fitting the data to the chevron model for a barrier-limited two-state transition that follows the linear-free-energy relationship60:

$$\begin{array}{l}\log k_{{\mathrm{obs}}}\left( {\left[ {{\mathrm{urea}}} \right]} \right) = \\ \log \left[ {k_f\;\exp \left( { - \frac{{m_{{\mathrm{TS}} - {\mathrm{D}}}\left[ {{\mathrm{urea}}} \right]}}{{RT}}} \right) + k_u\;\exp \left( {\frac{{m_{{\mathrm{TS}} - {\mathrm{N}}}\left[ {{\mathrm{urea}}} \right]}}{{RT}}} \right)} \right]\end{array}$$
(4)

mTS-D and mTS-N were the kinetic m-values of folding and unfolding, respectively, where TS denoted the transition state of folding. kf and ku were the microscopic rate constants of folding and unfolding, respectively, under standard solvent conditions and in the absence of denaturant.

Dimerization (binding isotherms) was analyzed using two different thermodynamic models. In thermophoresis experiments we used fluorescently modified NTD that bound non-modified NTD present at excess concentration. This pseudo-dimerization equilibrium could be described using the model for a bi-molecular binding isotherm (A + B = AB). The concentration of the complex AB was described as61:

$$\left[ {{\mathrm{AB}}} \right] = \frac{{[{\mathrm{A}}]_t[{\mathrm{B}}]}}{{\left[{\mathrm{B}} \right] + K_{\mathrm{d}}}},$$
(5)

where [AB] was the concentration of the complex (pseudo-dimer), [A]t was the total concentration of fluorescently modified NTD, [B] was the concentration of non-modified NTD and Kd was the equilibrium dissociation constant. The fluorescence signal and thermophoresis ratio was modeled as:

$$\frac{{S - S_{\mathrm{u}}}}{{S_{\mathrm{b}} - S_{\mathrm{u}}}} = \frac{{[{\mathrm{B}}]}}{{\left[ {\mathrm{B}} \right] + K_{\mathrm{d}}}},$$
(6)

where S was the observed signal, Su was the signal of the unbound state and Sb was the signal of the bound state.

Analysis of high-resolution SEC data was carried out by plotting VE versus concentration of NTD, which yielded a binding isotherm of dimerization. The isotherm was fitted using a thermodynamic model for a monomer/dimer equilibrium (2N = N2), where N and N2 denoted for NTD in the monomeric and dimeric form, respectively. Kd was described by the law of mass action:

$$K_{\mathrm{d}} = \frac{{[{\mathrm{N}}]^2}}{{[{\mathrm{N}}_2]}}$$
(7)

Assuming reversible dimerization, measured values of VE are composed of:

$$V_{\mathrm{E}} = V_{\mathrm{E}}^{\mathrm{N}} + F_{{\mathrm{N2}}}\left( {V_{\mathrm{E}}^{{\mathrm{N2}}} - V_{\mathrm{E}}^{\mathrm{N}}} \right)$$
(8)

Where VEN and VEN2 were the elution volumes of monomer and dimer, and FN2 is the fraction of dimer, which was described as:

$$F_{{\mathrm{N2}}} = \frac{{[{\mathrm{N}}_2]}}{{c_{\mathrm{t}} - [{\mathrm{N}}_2]}}$$
(9)

Where ct is the total concentration of NTD in terms of monomer. [N2] was described as:

$$\left[ {{\mathrm{N}}_2} \right] = \frac{{K_{\mathrm{d}}}}{8}\left[ {1 - \left[ {\left( {1 + \frac{{4c_{\mathrm{t}}}}{{K_{\mathrm{d}}}}} \right)^2 - \frac{{16c_{\mathrm{t}}^2}}{{K_{\mathrm{d}}^2}}} \right]^{0.5}} \right] + \frac{{c_{\mathrm{t}}}}{2}$$
(10)

In PET-FCS experiments ACFs, G(τ), were fitted using an analytical model for translational diffusion of a globule that exhibited two independent, single-exponential relaxations17:

$$G\left( \tau \right) = \frac{1}{{N\left( {1 + \frac{\tau }{{\tau _{\mathrm{D}}}}} \right)}}\left( {1 + a_1{\mathrm{exp}}\left( { - \frac{\tau }{{\tau _1}}} \right) + a_2{\mathrm{exp}}\left( { - \frac{\tau }{{\tau _2}}} \right)} \right)$$
(11)

τ was the lag time, N was the average number of molecules in the detection focus, τD was the diffusion time constant, a1 and a2 were the amplitudes of the first and second relaxation, and τ1 and τ2 were the corresponding time constants. The application of a model for diffusion in two dimensions was of sufficient accuracy because the two horizontal dimensions (x, y) of the detection focus were much smaller than the lateral dimension (z) in the applied setup17. With τi = 1/ki, microscopic time constants (τout and τin) were calculated from the observed amplitudes and time constants, a1 and τ133:

$$k_1 = k_{out} + k_{in} ; a_1 = k_{out}/k_{in}$$
(12)

Errors are s.e. from regression analysis and propagated s.e.

## Data availability

The protein data bank (PDB) accession code for the solution NMR structure of L6-NTD is 6QJY (https://www.rcsb.org/). The data that support the findings of this study are available from the corresponding author upon reasonable request. The source data underlying Figs. 1b, 2a–e, 3, 4, 5a, b, 6, and 8 are provided as a Source Data file.

## References

1. 1.

Vollrath, F. & Selden, P. The role of behavior in the evolution of spiders, silks, and webs. Annu Rev. Ecol. Evol. S 38, 819–846 (2007).

2. 2.

Vollrath, F. & Knight, D. P. Liquid crystalline spinning of spider silk. Nature 410, 541–548 (2001).

3. 3.

Heim, M., Keerl, D. & Scheibel, T. Spider silk: from soluble protein to extraordinary fiber. Angew. Chem. Int. Ed. Engl. 48, 3584–3596 (2009).

4. 4.

Rising, A. & Johansson, J. Toward spinning artificial spider silk. Nat. Chem. Biol. 11, 309–315 (2015).

5. 5.

Eisoldt, L., Thamm, C. & Scheibel, T. The role of terminal domains during storage and assembly of spider silk proteins. Biopolymers 97, 355–361 (2011).

6. 6.

Hagn, F. et al. A conserved spider silk domain acts as a molecular switch that controls fibre assembly. Nature 465, 239–242 (2010).

7. 7.

Rat, C., Heiby, J. C., Bunz, J. P. & Neuweiler, H. Two-step self-assembly of a spider silk molecular clamp. Nat. Commun. 9, 4779 (2018).

8. 8.

Askarieh, G. et al. Self-assembly of spider silk proteins is controlled by a pH-sensitive relay. Nature 465, 236–238 (2010).

9. 9.

Gaines, W. A., Sehorn, M. G. & Marcotte, W. R. Jr. Spidroin N-terminal domain promotes a pH-dependent association of silk proteins during self-assembly. J. Biol. Chem. 285, 40745–40753 (2010).

10. 10.

Hagn, F., Thamm, C., Scheibel, T. & Kessler, H. pH-dependent dimerization and salt-dependent stabilization of the N-terminal domain of spider dragline silk-implications for fiber formation. Angew. Chem. Int. Ed. Engl. 50, 310–313 (2011).

11. 11.

Heiby, J. C., Rajab, S., Rat, C., Johnson, C. M. & Neuweiler, H. Conservation of folding and association within a family of spidroin N-terminal domains. Sci. Rep. 7, 16789 (2017).

12. 12.

Schwarze, S., Zwettler, F. U., Johnson, C. M. & Neuweiler, H. The N-terminal domains of spider silk proteins assemble ultrafast and protected from charge screening. Nat. Commun. 4, 2815 (2013).

13. 13.

Kronqvist, N. et al. Sequential pH-driven dimerization and stabilization of the N-terminal domain enables rapid spider silk formation. Nat. Commun. 5, 3254 (2014).

14. 14.

Jaudzems, K. et al. pH-dependent dimerization of spider silk N-terminal domain requires relocation of a wedged tryptophan side chain. J. Mol. Biol. 422, 477–487 (2012).

15. 15.

Atkison, J. H., Parnham, S., Marcotte, W. R. Jr. & Olsen, S. K. Crystal structure of the nephila clavipes major ampullate spidroin 1A N-terminal domain reveals plasticity at the dimer interface. J. Biol. Chem. 291, 19006–19017 (2016).

16. 16.

Otikovs, M. et al. Diversified structural basis of a conserved molecular mechanism for pH-dependent dimerization in spider silk N-terminal domains. Chembiochem 16, 1720–1724 (2015).

17. 17.

Ries, J., Schwarze, S., Johnson, C. M. & Neuweiler, H. Microsecond folding and domain motions of a spider silk protein structural switch. J. Am. Chem. Soc. 136, 17136–17144 (2014).

18. 18.

Jordan, I. K. et al. A universal trend of amino acid gain and loss in protein evolution. Nature 433, 633–638 (2005).

19. 19.

Pace, C. N. & Scholtz, J. M. A helix propensity scale based on experimental studies of peptides and proteins. Biophys. J. 75, 422–427 (1998).

20. 20.

Lombardi, S. J. & Kaplan, D. L. The amino-acid-composition of major ampullate gland silk (Dragline) of Nephila-Clavipes (Araneae, Tetragnathidae). J. Arachnol. 18, 297–306 (1990).

21. 21.

Chothia, C. Principles that determine the structure of proteins. Annu. Rev. Biochem 53, 537–572 (1984).

22. 22.

Rose, G. D., Geselowitz, A. R., Lesser, G. J., Lee, R. H. & Zehfus, M. H. Hydrophobicity of amino acid residues in globular proteins. Science 229, 834–838 (1985).

23. 23.

Bowie, J. U., Reidhaar-Olson, J. F., Lim, W. A. & Sauer, R. T. Deciphering the message in protein sequences: tolerance to amino acid substitutions. Science 247, 1306–1310 (1990).

24. 24.

Myers, J. K., Pace, C. N. & Scholtz, J. M. Denaturant m values and heat capacity changes: relation to changes in accessible surface areas of protein unfolding. Protein Sci. 4, 2138–2148 (1995).

25. 25.

Wang, Q., Buckle, A. M., Foster, N. W., Johnson, C. M. & Fersht, A. R. Design of highly stable functional GroEL minichaperones. Protein Sci. 8, 2186–2193 (1999).

26. 26.

Pal, D. & Chakrabarti, P. Non-hydrogen bond interactions involving the methionine sulfur atom. J. Biomol. Struct. Dyn. 19, 115–128 (2001).

27. 27.

Kubelka, J., Hofrichter, J. & Eaton, W. A. The protein folding ‘speed limit’. Curr. Opin. Struct. Biol. 14, 76–88 (2004).

28. 28.

Andersson, M. et al. Carbonic anhydrase generates CO2 and H+ that drive spider silk formation via opposite effects on the terminal domains. PLoS Biol. 12, e1001921 (2014).

29. 29.

Debye, P. & Hückel, E. The interionic attraction theory and deviations from ideal behavior in solution. Phys. Z. 24, 185–206 (1923).

30. 30.

Henzler-Wildman, K. & Kern, D. Dynamic personalities of proteins. Nature 450, 964–972 (2007).

31. 31.

Sauer, M. & Neuweiler, H. PET-FCS: probing rapid structural fluctuations of proteins and nucleic acids by single-molecule fluorescence quenching. Methods Mol. Biol. 1076, 597–615 (2014).

32. 32.

Neuweiler, H., Johnson, C. M. & Fersht, A. R. Direct observation of ultrafast folding and denatured state dynamics in single protein molecules. Proc. Natl Acad. Sci. USA 106, 18569–18574 (2009).

33. 33.

Schulze, A. et al. Cooperation of local motions in the Hsp90 molecular chaperone ATPase mechanism. Nat. Chem. Biol. 12, 628–635 (2016).

34. 34.

Baldwin, E. P. & Matthews, B. W. Core-packing constraints, hydrophobicity and protein design. Curr. Opin. Biotechnol. 5, 396–402 (1994).

35. 35.

Lim, W. A., Hodel, A., Sauer, R. T. & Richards, F. M. The crystal structure of a mutant protein with altered but improved hydrophobic core packing. Proc. Natl Acad. Sci. USA 91, 423–427 (1994).

36. 36.

Shoichet, B. K., Baase, W. A., Kuroki, R. & Matthews, B. W. A relationship between protein stability and protein function. Proc. Natl Acad. Sci. USA 92, 452–456 (1995).

37. 37.

Goh, C. S., Milburn, D. & Gerstein, M. Conformational changes associated with protein-protein interactions. Curr. Opin. Struct. Biol. 14, 104–109 (2004).

38. 38.

Gellman, S. H. On the role of methionine residues in the sequence-independent recognition of nonpolar protein surfaces. Biochemistry 30, 6633–6636 (1991).

39. 39.

Bernstein, H. D. et al. Model for signal sequence recognition from amino-acid sequence of 54K subunit of signal recognition particle. Nature 340, 482–486 (1989).

40. 40.

Oneil, K. T. & Degrado, W. F. How calmodulin binds its targets-sequence independent recognition of amphiphilic alpha-helices. Trends Biochem Sci. 15, 59–64 (1990).

41. 41.

Kato, M. et al. Redox state controls phase separation of the Yeast Ataxin-2 protein via reversible oxidation of its methionine-rich low-complexity domain. Cell 177, 711–721 (2019). e718.

42. 42.

Kazlauskas, R. Engineering more stable proteins. Chem. Soc. Rev. 47, 9026–9045 (2018).

43. 43.

Bloom, J. D., Labthavikul, S. T., Otey, C. R. & Arnold, F. H. Protein stability promotes evolvability. Proc. Natl Acad. Sci. USA 103, 5869–5874 (2006).

44. 44.

Trudeau, D. L., Kaltenbach, M. & Tawfik, D. S. On the potential origins of the high stability of reconstructed ancestral proteins. Mol. Biol. Evol. 33, 2633–2641 (2016).

45. 45.

Blackledge, T. A. & Hayashi, C. Y. Silken toolkits: biomechanics of silk fibers spun by the orb web spider Argiope argentata (Fabricius 1775). J. Exp. Biol. 209, 2452–2461 (2006).

46. 46.

Muchmore, D. C., McIntosh, L. P., Russell, C. B., Anderson, D. E. & Dahlquist, F. W. Expression and nitrogen-15 labeling of proteins for proton and nitrogen-15 nuclear magnetic resonance. Methods Enzymol. 177, 44–73 (1989).

47. 47.

Markley, J. L. et al. Recommendations for the presentation of NMR structures of proteins and nucleic acids. IUPAC-IUBMB-IUPAB Inter-Union Task Group on the Standardization of Data Bases of Protein and Nucleic Acid Structures Determined by NMR Spectroscopy. J. Biomol. NMR 12, 1–23 (1998).

48. 48.

Keller, R. in The Computer Aided Resonance Tutorial. (CANTINA, Goldau, 2004).

49. 49.

Vranken, W. F. et al. The CCPN data model for NMR spectroscopy: development of a software pipeline. Proteins 59, 687–696 (2005).

50. 50.

Grzesiek, S. & Bax, A. The importance of not saturating H2O in protein NMR-application to sensitivity enhancement and Noe measurements. J. Am. Chem. Soc. 115, 12593–12594 (1993).

51. 51.

Sattler, M., Schleucher, J. & Griesinger, C. Heteronuclear multidimensional NMR experiments for the structure determination of proteins in solution employing pulsed field gradients. Prog. Nucl. Mag. Res. Sp. 34, 93–158 (1999).

52. 52.

Herrmann, T., Guntert, P. & Wuthrich, K. Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA. J. Mol. Biol. 319, 209–227 (2002).

53. 53.

Wurz, J. M., Kazemi, S., Schmidt, E., Bagaria, A. & Guntert, P. NMR-based automated protein structure determination. Arch. Biochem. Biophys. 628, 24–32 (2017).

54. 54.

Shen, Y., Delaglio, F., Cornilescu, G. & Bax, A. TALOS+: a hybrid method for predicting protein backbone torsion angles from NMR chemical shifts. J. Biomol. NMR 44, 213–223 (2009).

55. 55.

Koradi, R., Billeter, M. & Guntert, P. Point-centered domain decomposition for parallel molecular dynamics simulation. Comput. Phys. Commun. 124, 139–147 (2000).

56. 56.

Ponder, J. W. & Case, D. A. Force fields for protein simulations. Protein Simul. 66, 27 (2003).

57. 57.

Bhattacharya, A., Tejero, R. & Montelione, G. T. Evaluating protein structures determined by structural genomics consortia. Proteins 66, 778–795 (2007).

58. 58.

Santoro, M. M. & Bolen, D. W. Unfolding free energy changes determined by the linear extrapolation method. 1. Unfolding of phenylmethanesulfonyl. alpha.-chymotrypsin using different denaturants. Biochemistry 27, 8063–8068 (1988).

59. 59.

Tanford, C. Protein denaturation. Adv. Protein Chem. 23, 121–282 (1968).

60. 60.

Jackson, S. E. & Fersht, A. R. Folding of chymotrypsin inhibitor 2. 1 Evidence for a two-state transition. Biochemistry 30, 10428–10435 (1991).

61. 61.

Hulme, E. C. & Trevethick, M. A. Ligand binding assays at equilibrium: validation and interpretation. Br. J. Pharm. 161, 1219–1237 (2010).

## Acknowledgements

We thank Carolin Hacker for stimulating discussions. The authors are grateful to the U.S. Army Research Office for financial support (grant number W911NF-17-1-0336) to H.N. B.G. acknowledges a PhD fellowship from the Max Planck Graduate Center (MPGC). U.A.H. acknowledges support by the Carl Zeiss Foundation and the Center of Biomolecular Magnetic Resonance (BMRZ), Goethe University Frankfurt, funded by the state of Hesse.

## Author information

Authors

### Contributions

J.C.H. designed experiments, synthesized protein material, performed denaturation experiments, far-UV CD, Trp fluorescence, stopped-flow fluorescence spectroscopy, PET-FCS, thermophoresis, analytical SEC, analyzed data, and created figures. B.G. performed NMR experiments, NMR structure calculation, analyzed NMR data, and created figures. C.M.J. performed SEC-MALS and thermophoresis experiments, and analyzed data. U.A.H. designed and performed NMR experiments, analyzed NMR data, and wrote the paper. H.N. conceptually designed the research, designed experiments, analyzed data, created figures, and wrote the paper.

### Corresponding authors

Correspondence to Ute A. Hellmich or Hannes Neuweiler.

## Ethics declarations

### Competing interests

The authors declare no competing interests.

Peer review information Nature Communications thanks Jan Johansson and other, anonymous, reviewers for their contributions to the peer review of this work. Peer review reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

Reprints and Permissions

Heiby, J.C., Goretzki, B., Johnson, C.M. et al. Methionine in a protein hydrophobic core drives tight interactions required for assembly of spider silk. Nat Commun 10, 4378 (2019). https://doi.org/10.1038/s41467-019-12365-5

• Accepted:

• Published:

• ### Methionine-Rich Loop of Multicopper Oxidase McoA Follows Open-to-Close Transitions with a Role in Enzyme Catalysis

• Patrícia T. Borges
• , Vânia Brissos
• , Guillem Hernandez
• , Laura Masgrau
• , Maria Fátima Lucas
• , Emanuele Monza
• , Carlos Frazão
• , Tiago N. Cordeiro
•  & Lígia O. Martins

ACS Catalysis (2020)

• ### NMR assignments of a dynamically perturbed and dimerization inhibited N-terminal domain variant of a spider silk protein from E. australis

• Benedikt Goretzki
• , Julia C. Heiby
• , Carolin Hacker
• , Hannes Neuweiler
•  & Ute A. Hellmich

Biomolecular NMR Assignments (2020)

• ### High intracellular stability of the spidroin N‐terminal domain in spite of abundant amyloidogenic segments revealed by in‐cell hydrogen/deuterium exchange mass spectrometry

• Margit Kaldmäe
• , Axel Leppert
• , Gefei Chen
• , Medoune Sarr
• , Cagla Sahin
• , Kerstin Nordling
• , Nina Kronqvist
• , Marta Gonzalvo‐Ulla
• , Nicolas Fritz
• , Axel Abelein
• , Sonia Laίn
• , Henrik Biverstål
• , Hans Jörnvall
• , David P. Lane
• , Anna Rising
• , Jan Johansson
•  & Michael Landreh

The FEBS Journal (2019)