Introduction

Hydrophobins are a group of small, globular proteins produced and secreted by filamentous fungi that self-assemble into larger, amphipathic structures at hydrophilic-hydrophobic interfaces1,2,3,4,5,6,7. These assemblies create durable, protective, and water repellent coatings that fulfill various biological roles for fungi such as decreasing the surface tension of water, allowing hyphae to grow through the water’s surface8,9, and acting as water repellent coatings on fungal fruiting bodies and spores10. Hydrophobins also function as adhesion points for fungi by assembling between the cell wall and hydrophobic surfaces, which plays an important role in the pathogenicity of rice blast fungus (Magnaporthe grisea)11,12.

Due to their unusual properties, there is considerable interest in using hydrophobins as emulsifiers, surface modifiers, and protective coatings2,13,14,15,16,17. This includes applications such as increasing the bioavailability of water-insoluble drugs, modifying surfaces of medical implants, and as foam stabilizers to improve shelf life and food texture7. In order to develop hydrophobins for particular applications, studies are required to determine what influences the properties of hydrophobin assemblies and which regions of hydrophobins are necessary for function.

The amino acid sequence and molecular structure of hydrophobins are diverse but there are some characteristic features, including eight cysteine residues that form four conserved disulphide bonds that are separated by three loops (L1–L3, also referred to as the C3–C4, C4–C5, and C7–C8 intercysteine loops)18. Based on the spacing between cysteine residues, the pattern of hydrophobic patches, and the properties of their assemblies6,19 hydrophobins are broadly divided into two classes (class I and class II). Class I hydrophobins assemble into extremely stable amyloid-like structures called rodlets that resist denaturation by detergent, heat, and acid2. They are produced by Ascomycota and Basidiomycota fungi, with constituent hydrophobins having little sequence or inter-cysteine sequence length conservation6. Conversely, class II hydrophobins assemble into less stable films, do not form rodlets, are only secreted by Ascomycota fungi, and share a more conserved sequence. Based on phylogenetic analysis of hydrophobin sequences, class I sequences can be further separated into those originating from the Ascomycota (class IA) and Basidiomycota (class IB) fungi6,20.

Class IB hydrophobins have a greater degree of sequence conservation than class IA hydrophobins, suggesting that they may follow a common assembly mechanism. Current models of rodlet assembly have been developed by studying the class IA hydrophobin EAS (from Neurospora crassa) and involve L3 undergoing a structural transition from a disordered loop to a β-strand, which then oligomerizes with L3 of other EAS molecules into amyloid-like rodlets21. However, this model is not compatible with the nuclear magnetic resonance (NMR)-derived structure of the class IB hydrophobin SC16 (from Schizophyllum commune), since in SC16 L3 only consists of four residues which are structured as a β-turn. This makes it unlikely that L3 of SC16 can undergo large conformational changes during rodlet assembly, meaning that rodlet assembly likely involves other regions of SC16.

To investigate possible assembly mechanisms of SC16 we used X-ray crystallography to determine its atomic-resolution crystal structure, with the rationale that crystal contacts may provide insight into how SC16 monomers may associate with each other. We noted a large crystal contact was formed that involved the N-terminal tail of SC16. We then used NMR spectroscopy to investigate the dynamics of this tail, and functional assays to investigate how this region of SC16 influences rodlet assembly.

Results and discussion

Crystallization of SC16 identifies a possible biological contact

Protein crystallography inherently relies on protein association, and proteins will often crystallize in biologically meaningful ways because of favourable energetics22. Therefore, we set out to crystallize SC16 in order to gain insight into which regions of the molecule may form extensive intermolecular contacts and hence may be involved in the self-association that occurs during rodlet self-assembly. After optimizing initial conditions, SC16 crystallized as a cluster of plates over 1–2 weeks (Supplemental Fig. 1). Virtually indistinguishable atomic-resolution structures were determined for two crystals of SC16 that existed in two different space groups: C2221 (Fig. 1 light blue; 2.0 Å resolution) and P21212 (Fig. 1b dark blue; 2.2 Å resolution). These structures superpose to a root mean squared deviation (RMSD) of 0.1 Å and had refinement statistics consistent with good quality models (Table 1). The SC16 crystal structure consists of four disulphide bonds which are characteristic of hydrophobins18, a β-barrel core consisting of four β-strands (β1: 39Tyr-46Asp, β2: 69Leu-76Pro, β3: 92Val-98Tyr, β4: 102Leu-107Cys), and three loops (L1-L3) that connect the β-strands. An α-helix was present in L1 from 50Lys-59Leu. Residues near the N-terminus and L2 (C2221: 79Val-85Asn, P21212: 78Ser-85Asn) did not have interpretable electron density, consistent with these regions being flexible. Overall, the structure of SC16 obtained from crystallization is consistent with the NMR-derived ensemble of SC16 structures20 (Protein Data Bank, PDB ID:2NBH, RMSD of 1.39 Å and 1.37 Å for the C2221 and P21212 crystal forms, respectively, Fig. 1b).

Figure 1
figure 1

X-ray crystal structure of SC16. (a) A ribbon representation of SC16 crystallized in the C2221 space group (PDB ID: 7S86). Two different orientations are shown and the beta strands (β1 − β4), and loops (L1 − L3) are indicated. Conserved disulphide bonds are highlighted as orange sticks. (b) A superposition of SC16 crystalized in the C2221 (light blue) and P21212 (PDB ID: 7S7S; dark blue) space groups with the NMR-derived structure of SC1620 (PDB ID: 2NBH; grey). (c) Ribbon representation of 4 symmetrically related SC16 molecules (coloured differently) in the C2221 SC16 crystal. The inset highlights a putative biological contact spanning 606.80 Å2 that involves portions of L1, L2, and β-barrel structure from one molecule (coloured cyan and orange) and the N- and C-termini of a second molecule (coloured light blue and red). Residues flagged by EPPIC as participating in close contacts are highlighted orange or red, labelled, and drawn as sticks. The second inset illustrates the association between the N-terminal tail of one SC16 molecule with the β1 strand of another. Putative hydrogen bonds are indicated with yellow lines.

Table 1 X-ray diffraction data collection and refinement statistics.

We then used the EPPIC (Evolutionary Protein–Protein Interface Classification) webserver23 to probe for large protein–protein interfaces in the crystal lattice that may represent biologically relevant associations of SC16. EPPIC detected a large intermolecular interface that was present in both SC16 crystals and had a surface area of 607 Å2 and 589 Å2 for the C2221 and P21212 space groups, respectively (Fig. 1c). This surface involves both the N- and C-termini of SC16 interacting with L1, L2, and the β-barrel of a second symmetry-related SC16 molecule, with 44Thr, 72Leu, 89Ala, and 110Ile being flagged by EPPIC as participating in close contacts.

In particular, the N-terminal tail adopts an extended structure from 27Gly-32Ser with partial β-strand character that associates with the β1 strand over residues 42Asn-46Asp of a symmetry related SC16 molecule (Fig. 1c). Although the buried surface area (~ 600 Å2) is smaller than the 800 Å2 expected for a definitive biological contact24, it is larger than a typical crystal contact, involves a very small protein, and is found in two different space groups. Repeating units of this interface also result in a linear association of SC16 monomers that suggests a possible assembly pattern within rodlets.

NMR spectroscopy identifies the termini and L2 of SC16 as dynamic

In both crystal structures of SC16, sections of the N-terminus and L2 were poorly defined. To assess the flexibility of SC16 in solution we used NMR spectroscopy to determine which regions of the protein are dynamic. Through analysis of triple resonance NMR spectra and with the aid of deposited NMR assignments for SC16 (BioMagRes Bank accession 25976)20 we were able to assign 89% of the amide resonances of SC16. To assess the dynamics of SC16 on the ns-ps timescale under native conditions, T1 (longitudinal) relaxation, T2 (transverse) relaxation, and heteronuclear nuclear Overhauser effect (NOE) experiments were collected and analyzed using the relaxation module of CcpNmr Analysis (Fig. 2). Values of T1 were constant over the entire protein with an average value of 808 ± 66 ms, while T2 and NOE values both plateaued over the ranges of 31Lys–79Val and 89Ala–112Ile, with values of 82 ± 18 ms and 0.748 ± 0.014, respectively. T2 values increased and NOE values decreased for the N- and C-termini and L2 (80Ile–88Ser), identifying these as dynamic regions that may not be resolvable by X-ray diffraction. Importantly, this suggests that 27Gly-32Ser are dynamic enough to undergo conformational changes and be important for rodlet assembly of SC16. Based on the dynamic nature of the N-terminus and its involvement in a large, possibly biological, crystal contact, we generated a variant of SC16 that lacked the N-terminus up to 31Lys (SC16ΔN). SC16ΔN was expressed and purified in the same manner as SC16 (Supplemental Figs. 2 and 3).

Figure 2
figure 2

T1, T2, and NOE relaxation measurements identify the N- and C-termini and L2 of SC16 as dynamic. T1, T2, and NOE measurements were collected for 15N13C-labelled SC16 and are plotted on a per-residue basis. Residues that have elevated T2 and decreased NOE values relative to the rest of the molecule are indicated with blue shading.

The N-terminal tail of SC16 is dispensable for rodlet assembly

Thioflavin T (ThT) is a fluorescent dye commonly used to quantify amyloid formation and hydrophobin assembly, since fluorescence emission intensity is linearly related to fibril concentration25. Thus, ThT fluorescence assays were used to quantitatively measure the assembly of both SC16 and SC16ΔN when agitated to create an air:water interface. SC16ΔN assembly was monitored in buffers of pH 5.5–8.5 containing 10 mM–1 M NaCl (Supplemental Fig. 4). Among these conditions, pH 8.5 and 1 M NaCl generated maximal ThT fluorescence upon SC16ΔN agitation. In this buffer condition, the assembly kinetics of SC16 and SC16ΔN were indistinguishable (Fig. 3), suggesting that residues 18Thr-30Pro do not play a significant role in the rate of SC16 self-assembly.

Figure 3
figure 3

SC16ΔN self-assembles into β-sheet containing rodlets. Thioflavin T fluorescence assays were used to monitor the assembly of SC16 (dashed line) and SC16ΔN (solid line) during agitation. ThT assays were carried out at pH 8.5 with 1 M NaCl. Error bars represent standard deviation from 6 replicates.

We further characterized the morphology of SC16ΔN rodlets with atomic force microscopy. SC16ΔN was deposited onto freshly cleaved highly oriented pyrolytic graphite (HOPG) and allowed to assemble for either 20 min or overnight (Fig. 4 with additional fields shown in Supplemental Fig. 5). Rodlets of a uniform width (~ 10 Å) were observed and were often paired together. The rodlets could either remain dispersed (Fig. 4, top) or form laterally associated bundles (Fig. 4, bottom), which is consistent with what has been observed previously for SC1620 and rodlets from other hydrophobins21,26. Interestingly, the longest rodlets often form from multiple shorter rodlets closely organized end-to-end.

Figure 4
figure 4

SC16ΔN self-assembles into rodlets. A 50 μL drop of SC16 ΔN in dH2O (left: 10 μg/mL; right: 5 μg/mL) was incubated on freshly cleaved highly oriented pyrolytic graphite at room temperature for (left) 20 min before wicking excess solution and drying overnight or (right) allowing to dry overnight. Rodlets were then imaged using atomic force microscopy and a representative image from two samples for each condition is shown.

Understanding the mechanism of and the sequences responsible for hydrophobin assembly is important for developing commercial applications of these proteins. For example, mutation of residues important for assembly could tailor hydrophobin assembly to different buffer conditions. Conversely, regions that are not important for self-assembly can be modified to add new functionality to hydrophobins. The mechanistic origins of hydrophobin rodlet assembly are unclear and they seem to vary depending on the hydrophobin studied. For example the class IA hydrophobin EAS requires sequences in L3 for assembly21, while sequences in L1 are dispensable27. Conversely in the hydrophobin DewA (from Aspergillus nidulans), L2 is predicted to be required for self-assembly and rodlet formation26.

In this study we used X-ray crystallography to determine the structure of SC16 and investigated a potential mode of its self-assembly. We found that although the N-terminal tail of SC16 associates with neighbouring molecules in the crystalline state, it is dynamic in solution and does not significantly influence the self-assembly rate or morphology of SC16 rodlets. This suggests that other regions of SC16 (such as L1 and L2) are likely essential for rodlet formation. These results outline an opportunity for hydrophobin functionalization, since modification of the N-terminal tail of SC16 should be a viable strategy to create functionalized SC16 variants that are still able to self-assemble onto surfaces.

Methods

Plasmid construction

pET21-derived plasmids coding for SC16 with the N-terminal signal sequence removed (18Thr–114Leu of the hyd1 gene of S. commune; NCBI ID: EFI94929) with upstream sequences coding for a hexahistidine tag, the B1 domain of protein G, and a thrombin protease recognition sequence (H6-GB1-SC16) are the same as those used previously28. A second expression plasmid was constructed using restriction enzyme-based procedures that coded for SC16 with 13 residues from the N-terminal tail removed (residues 31Lys-114Leu of SC16; H6-GB1-SC16ΔN). The validity of all plasmids was verified by sequencing (Eurofins Genomics).

Protein expression and purification

SC16 and SC16ΔN were isolated as previously described for SC1628. Briefly, Shuffle T7 express E. coli (New England Biolabs) were transformed with expression plasmids. After overnight expression at 20 °C, cells were lysed via resuspending in lysis buffer (20 mM Tris pH 8, 250 mM NaCl) and heating to 80 °C for 15 min. Hydrophobins were isolated via Ni2+ affinity chromatography and then cleaved overnight with thrombin while being dialyzed against 20 mM Tris pH 8, 50 mM NaCl, 1 mM CaCl2. After cleavage SC16 and SC16ΔN were isolated by Ni2+ affinity chromatography, ion exchange chromatography, and finally by size exclusion chromatography in 20 mM Tris pH 8, 50 mM NaCl. For NMR studies of SC16, 15N- or 13C/15N-labelled SC16 was produced by growing cells in 15N or 15N/13C-enriched M9 minimal media (adapted from Studier)29, and purified as above. After purification samples were either used immediately or frozen at − 20 °C for long-term storage.

Crystallography

SC16 was used for crystallization trials immediately after purification. Initial crystallization screens were performed using a sitting drop setup and the MCSG1-4 crystallization screens (Anatrace). After optimization, crystals were grown by the hanging drop vapour diffusion method at 22 °C. Drops were 4 μL and contained 1:1 protein (14.15 mg/mL) to reservoir (0.1 M citric acid pH 3.5 with either 30.5% or 29% w/v PEG3350) solution. Crystals grew as plates over 1–2 weeks after which individual crystals were cryoprotected with reservoir solution containing 25% ethylene glycol. Data for the crystal grown in 30.5% PEG3350 was collected at 100 K at the Canadian Light Source (08B1-1 beamline)30,31. Data for the crystal grown in 29% PEG 3350 was collected at 100 K using a D8 Quest diffractometer with Cu microfocus source (Bruker AXS). Molecular replacement was used to phase both datasets using the Phenix software package and the NMR-derived structure of SC16 as the search model (PBD ID: 2NBH)20,32. Further model refinement was completed using Phenix32 and Coot33 and final structures are deposited in the Protein Data Bank34 (PDB IDs 7S86 and 7S7S for SC16 in the C2221 and P21212 space groups, respectively). The Pymol Molecular Graphics System (v2.1, Schrödinger) was used for model visualization and preparing figures. The EPPIC (Evolutionary Protein–Protein Interface Classification) webserver was used to probe for large protein–protein interfaces in the crystal lattice23.

Nuclear magnetic resonance spectroscopy

A 300 μM sample of 13C/15N-labelled SC16 in 20 mM MES pH 6.5, 50 mM NaCl, 5% D2O was prepared. 1H–15N-heteronuclear single quantum coherence (HSQC), HNCA, HNCACB, HN(CO)CA, and CBCA(CO)NH experiments were collected using a 700 MHz Avance III spectrometer (National Research Council Institute for Marine Biosciences, Halifax, NS) and used to assign backbone resonances of SC1635. T1 and T2 relaxation, as well as heteronuclear nuclear Overhauser effect (NOE) experiments were collected using an 800 MHz Bruker Avance III HD console and Ascend magnet with TCI cryoprobe (the Québec/Eastern Canada High Field NMR Facility (QANUC)) and analyzed using the relaxation module of CcpNmr Analysis36.

Thioflavin T assay

SC16ΔN and SC16 were dialysed against dH2O over night at 4 °C before the assay. Reactions for SC16ΔN were set up in a matrix of buffer conditions consisting of a multicomponent buffer (20 mM each of MES, Tris, phosphate) set to varying pH (5.5, 6.5, 7.5, 8.5). Assembly at each pH was measured in the presence of 10 mM, 250 mM, and 1 M NaCl. SC16 reactions were set up only for pH 8.5, 1 M NaCl. The total reaction volumes were 100 μL and all conditions contained 50 μg/mL protein, 20 μM filtered ThT. Each condition was replicated 6 times and measured in a sealed 96-well black-bottomed plate. A SpecraMax M3 plate reader was used to monitor the ThT fluorescence (λex = 430 nm, λcutoff = 455 nm, and λem = 480 nm) while shaking the reactions for 30 s every 2 min.

Atomic force microscopy

SC16ΔN was dialysed against dH2O overnight at 4 °C. Hydrophobin drops (50 µL at 5 µg/mL) were placed onto freshly cleaved 3 mm HOPG discs and allowed to dry overnight. Alternatively, 50 µL drops containing 10 μg/mL of hydrophobins were allowed to sit on the HOPG for 20 min. Excess solution was wicked away and HOPG was allowed to dry overnight at room temperature. Two samples were produced for each condition and at least 3 fields of view were collected for each sample. Images were collected on a NanoWizard II Ultra from JPK using Tap300AI-G tips (Budget Sensors).