Biological context

The origin of the eukaryotic cell remains an unsettled scientific question and several hypotheses have been put forward to explain the complex evolutionary history of the eukaryotic cell1,2,3,4. The Woese hypothesis proposes three domains of life—Archaea, Bacteria and Eukaryota, with independent evolutionary trajectories5. The eocyte hypothesis suggests the existence of only two domains—bacteria and archaea, and that eukaryotes emerged from the symbiotic relationship of an unknown archaeal host with an alphaproteobacterium6. Recently, environmental metagenomic sampling led to the discovery of the Asgard superphylum. Comparative genomic analysis of Asgard archaea and eukaryotes appears to support the eocyte hypothesis7. The genomes of Asgardarchaea are enriched with proteins previously considered eukaryote-specific, so called eukaryotic signature proteins (ESPs), and phylogenetic analysis placed the Asgardarchaea in a monophyletic group with eukaryotes8.

Actin plays a crucial part of the eukaryotic cytoskeleton and is essential to many processes, including cellular motility, cell division, endocytosis, intracellular cargo transport, amongst many other9. Because of the central role actin plays in the eukaryotic cell, the sequence of actin remains highly conserved among eukaryotes. While actin homologues are widespread throughout all domains of life, the dynamic actin cytoskeleton and the regulatory actin-binding proteins are a hallmark of eukaryotic life.

The Asgard genomes contain close actin homologues and several actin-binding proteins; including profilin, gelsolin, Arp2/3 complex subunit 4 and a large family of small GTPases that regulate the actin cytoskeleton in eukarotes8. This posits the question; do these archaea possess an actin cytoskeleton with complex regulation analogous with the eukaryotic cytoskeleton? While metagenomic analysis has identified these proteins, their cellular function is still poorly understood. Laboratory culturing of these organisms is still in early development which makes in vivo comparison with eukaryotic homologs difficult10. Currently, protein production in heterologous expression systems and reconstitution of the purified complexes in vitro represents one of the best approaches in characterizing their function. Profilin is expressed in most, if not all, eukaryotic cells and is one of the most important proteins in regulating actin cytoskeletal dynamics11. Eukaryotic profilin (eprofilin) is a small protein (approximately 14–19 kDa) which sequesters monomeric G-actin from the cytoplasmic pool, thus controlling polymerization12. Despite significant divergence at the sequence level, the eprofilin tertiary structure is well-conserved and folds into 3D structures constituting 7 β-strands and 4 α-helices13. eprofilin promotes the elongation of actin filament assembly at the barbed end by acting as a nucleotide exchange factor, and by interacting with elongation factors such as Ena/Vasp, Formins, and Wasp14,15,16. These nucleation factors bind eprofilin through a polyproline motif at a domain physically separate from the actin binding-site. Moreover, eprofilin can also bind to phosphatidylinositol 4,5-bisphosphate (PIP2)17 at the plasma membrane which results in a reduced affinity towards polyproline and actin17. eprofilin also competes with phospholipase C for PIP2 binding which leads to interference with the PI3K/AKT signaling pathway18.

Recently, it has been shown that profilins encoded in several lineages of the Asgardarchaea not only share structural similarity with eukaryotic orthologues but are able to regulate the function of eukaryotic actin. This implies that profilin from Asgardarchaea have the potential of complex regulation of the hypothetical actin cytoskeleton as well19. In contrast to human profilin I, a previous study showed that the Asgard profilins (Loki 1 and 2, Thor, Odin and Heimdall) did not show polyproline binding. This led the authors to suggest that Asgard profilins do not bind polyproline, and that polyproline directed actin assembly is a later addition in eukaryotic evolution19. However, PIP2 was shown to modulate the affinity of Asgard profilin towards rabbit actin in a functional assay19. Nevertheless, some of the Asgard genomes are incomplete and the structural and functional relationships of representative profilins from different Asgard lineages are still poorly understood. It might therefore be too early to assume that Asgard profilins do not bind polyproline. In addition, the crystal structures of various profilins combined with functional data do not only reveal structural similarity between Asgard profilins, but also highlights some subtle differences at the species level19. Within the Asgard superphylum, the Heimdallarchaeota appears to currently be the closest relative of eukaryotes7. Here we present the NMR backbone assignment and dynamics of the Heimdallarchaeota profilin (heimProfilin) as a first step towards characterizing it structurally. These NMR amino acid specific assignments and dynamics provide for the first time an atomic snapshot of heimProfilin as well as providing further evidence for the idea that the Asgard encoded proteins possess similar structural elements and are likely to perform similar roles as those in eukaryotes.

Methods and experiments

Protein expression and purification

Heimdallarchaeota profilin (GenBank: OLS22855.1) was cloned into the pSUMO-YHRC vector, kindly provided by Claes Andréasson (Addgene Plasmid #54,336; RRID: Addgene_54336) with an N-terminal 6xHistidine-tag and a SUMO-tag (cleavable with Ulp1 protease). The vector was transformed and expressed in E. coli Rosetta DE3 cells. Initially, the cells were grown in 2 × TY media at 37 °C until the optical density of the culture was 0.8 at 600 nm. Cells were then harvested by centrifugation at 4,000 × g for 15 min at 4 °C and washed twice with M9 medium. The cells were then transferred into M9 media supplemented with 1 g/L 15N-ammonium chloride and 1 g/L 13C-glucose and grown for 1 h at 30 °C. Protein expression was induced by 0.5 mM IPTG. For Deuterium (2H) labelling, the M9 medium was prepared with 100% or 50% D2O and cells were grown overnight at 30 °C. Post-induction, the cells were harvested by centrifugation and resuspended in the binding buffer (50 mM Tris–HCl pH 7.5, 0.3 M NaCl, 1 mM TCEP, 10 mM imidazole, 10% glycerol). The cells were then lysed by sonication and the cell lysate was clarified by centrifugation at 25,000 × g for 45 min at 4 °C and finally filtered through a 0.2 µm syringe filter (Sarstedt). The supernatant was loaded onto a His GraviTrap column (1 mL, GE healthcare) and the bound protein was eluted with binding buffer containing 250 mM imidazole. The protein was incubated with Ulp1 protease overnight at 4 °C to cleave the SUMO-tag including the Histidine-tag. The protein was desalted using a PD10 column (GE Healthcare) and loaded onto a His GraviTrap column again to remove the tag and the Ulp1 protease. The protein was concentrated using a 10,000 NMWL cutoff centrifugal filter (Merck Millipore) and further purified on a Superdex 75 10/300 GL (GE Healthcare) size exclusion column, equilibrated with 25 mM Tris–HCl, 50 mM NaCl, 5% Glycerol, 1 mM TCEP at pH 7.5. Protein concentration was determined using the molar absorption coefficient at 280 nm (29,450/M/cm).

NMR spectroscopy

Double labeled 15N, 13C, or triple labeled 15N, 13C, 2H samples were prepared to a concentration of 20 mg/mL in 25 mM Tris–HCl, 50 mM NaCl, 5% Glycerol, 1 mM TCEP at pH 7.5 and thereafter supplemented with 3% D2O and 0.03% sodium azide. The NMR assignment experiments were performed at 308 K on a triple-resonance Bruker 900, 700 or 600 MHz spectrometers equipped with a cryogenic probe. NMR relaxation experiments were performed on a 600 MHz spectrometer at 298 K. Backbone sequence-specific assignments were carried out using the following experiments: 2D 1H-15N-TROSY, 3D TROSY-HNCACCB, 3D TROSY-HNCA, 3D TROSY-HN(CO)CACB and 3D TROSY-HN(CO)CA. For side-chain assignments, 2D 1H-13C CT-HSQC, 3D HBHA(CO)NH and 3D HCCH-TOCSY spectra were utilized. For assignment and fold verification 3D NOESY as well as 3JHNHα for secondary structure verification were measured. For Backbone R1, R2 rates and hetero-nuclear NOEs were determined in an interleaved manner with the experiments from the Bruker pulse program library. For R1 and R2 rates, the relaxation delay was sampled for 9 and 8 delay-durations which were pseudo-randomized, respectively (R1: 20, 60, 100, 200, 400, 600, 800 and 1,000 ms and R2: 16, 33, 67, 136, 170, 203, 237 and 271 ms). The pulse program20 used for determining R2 contained CPMG elements that quenched slow motions partially, that is motion slower than 5.4 ms but slower than the total correlation time (τc). The relaxation delay time was up to 1.5 s for R1 and 1 s for R2. The [1H]15 N-hetNOE experiment and a reference spectra were recorded with a total 2 s 1H saturation time for the NOE experiment and the same recovery time for the reference experiment. The order parameter S2 and the internal correlation time were calculated with the program dynamic center. The rotational diffusion tensor was estimated from the ratio of the relaxation rates (R1 and R2). TALOS and CYANA were employed to predict secondary structure, using 1HN, 15N, and 13Cα chemical shifts. All other data were processed with topspin and analyzed using CCPNMR21 and CYANA22.

Results and discussions

Assignment and data deposition

The expressed and purified heimProfilin corresponds to the full length as was generated from metagenomics data8. It consists of 148 amino acids which was purified with a cleavable tag that leaves no additional N-terminal amino acids (see methods). This profilin possesses a 20-amino acid extension compared with the previously characterized eprofilins or those from Loki I and II and Odin. We obtained up to 88% of all backbone and up to 80% of all side-chain assignments. 135 of the 148 non-proline amide residues were assigned in the 1H-15N TROSY (Fig. 1). The following amides were not possible to assign: M1, K2, D3, I6, K11, K14, I19, S25, E27, N62, S85 and N89. The missing amides could be due to motional broadening or fast solvent exchange. We obtained 92% of the Cα and Cβ resonance assignments. Hβ and Hα proton shifts were completed to 97% and 96%, respectively. These assignments were further verified by 15N/13C 3D NOESY spectra. Backbone and side-chain chemical shifts assignments have been deposited to the Biological Magnetic Resonance Data Bank (BMRB) with the Accession Number 50190.

Figure 1
figure 1

1H-15N TROSY correlation spectrum of Heimdallarchaeota profilin. All 1H-15N pairs that were assigned in this study. Side-chains of Glutamine and Asparagine are not assigned or shown.

Secondary structure analysis

The structures of eukaryotic and Asgard profilin from Loki (1 and 2) and Odin have been determined by X-ray crystallography19. However, no structural information is available from the heimProfilin which appears to be the closest relative to the eukaryotes. With the completed assignments, it was now possible to analyze the secondary structure characteristics of this profilin to see if it adopts similar secondary structural elements. Analysis of sequential and medium range NOEs revealed stretches of dNN, dNN(i, i + 2), dαβ(i, i + 3), dαN(i, i + 3). Residues 29–33, 64–68, 124–127 and 125–144 continual revealed dαN (i, i + 4) NOEs, indicating the presence of helices in this region. This is supported by the 3JHNHα coupling constants for these residues which display small values typical of alpha helices (Fig. 2). 13Cα and Cβ shifts are frequently used to predict secondary structure propensities. Cα shifts generally tend to shift upfield in a beta-sheet and extended strands relative to the random coil values. In alpha helices, these Cα shifts tend to shift downfield23. For Cβ values the opposite is true, they shift downfield for beta-sheets and extended strands and upfield for alpha helices. The Cα and Cβ values relative to random coil values are shown in Fig. 2. Examination of these plots indicates clear helical regions covering residues 29–34, 64–68, 124–127 and 135–144. The helical region between residues 64–68 has not been observed in previous profilin structures. The region of beta strands also agrees with NOEs values and slightly increased 3JHNHαvalues. This analysis indicates that the overall secondary structural elements are preserved from archaea to eukaryotes albeit with some slight differences in their lengths. In addition, we observed an additional helix between residues 124–127 which was not present in the previously determined profilin structures. This might be important for modulating profilin-actin interaction and other physiological roles.

Figure 2
figure 2

Secondary structure characterization of Heimdallarchaeota profilin. (a) Sequence-specific 13Cα secondary chemical shifts (δΔ13Cα) along the amino acid sequence of heimProfilin. (b) 3JHNHα couplings plotted as a function of amino acid sequence. (c) TALOS secondary structure prediction based on 1H, 15N and 13Cα shifts plotted as function of amino acid sequence. All three suggest the presence of helical and extended strands in similar regions. The presence of helices between residues 29–34, 64–68, 124–127 and 135–144 are clearly visible. 3JHNHα couplings are generally lower for helices (2–4 Hz) and higher for beta strands and extended regions (2–8 Hz). Very few 3JHNHα couplings were obtained for residues 1–20. However, the 13Cα shifts and the TALOS prediction clearly shows that this region is extended. Indicated above the figure are the positions of the secondary structural elements.

Backbone dynamics

R1 and R2 rates in addition to [1H]-15N hetNOE are frequently used to estimate the flexibility of proteins24. Deviation of R1 and R2 rates for [1H]-15N moieties from the average value often indicate a change in motional property. R1 values that are larger than the average typically indicates the presence of additional flexibility on the timescale faster than nanoseconds (ps-ns time range). On the other hand, R2 rates with higher values than the average indicates regions of slow conformational exchange in the µs-ms time scale.[1H]-15N hetNOE with negative or near zero values indicate regions of high flexibility with motions faster than approximately 1 ns. We measured and plotted the longitudinal R1 and transverse R2 rates as well the [1H]-15N hetNOE versus amino acids sequence (Fig. 3). Overall, the results from these values indicate a highly rigid protein between residues 25–148 (Fig. 3). However, N-terminal residues 1–24 show a high degree of flexibility, which is reflected in the very low [1H]-15N hetNOE values (Fig. 3). We also back calculate order parameter S2 and internal correlation time (τe). A plot of the calculated order parameter S2 and internal correlation time (τe) is shown in Fig. 3d. As shown in the plot, only the N-terminal 1–24 amino acids show some degree of flexibility with very low order parameter and high degree of internal motion. A few residues along the protein sequence indicate some degree of flexibility. We determined the total correlation time (τc) of 11.3 ns. This value is slightly higher for a protein of this size probably due to the extended N-terminal loop not completely structured. 1H-15N TROSY-HSQC spectra of profilin with and without the N-terminal extension show small or no structural changes (Fig. 4). This indicates that the N-terminal extension does not influence the overall fold of the full length profilin.

Figure 3
figure 3

Dynamic characterization of the backbone based on [1H-]15N. (a) Longitudinal 15N R1 relaxation rates plotted as a function of amino acid sequence. (b) Transverse 15N R2 relaxation rates versus the amino acid sequence. (c) [1H]15N-hetero-nuclear NOE data (hetNOE) along the amino acid sequence. (d) Calculated S2 order parameter (left axis) and internal motion (τe) (right axis) plotted as a function of the amino acid sequence. All parameters indicate a very rigid molecule structure apart from the N-terminal 20 amino acids which show some degree of ps-ns motion based on the elevated R1, lower hetNOE and lower S2 order parameter. Indicated above the figure are the positions of the secondary structural elements.

Figure 4
figure 4

Comparison between full length heimProfilin and mutant. (a) Overlay of 1H-15N TROSY-HSQC spectra of full length heimProfilin and N-terminal deletion mutant (1–20 amino acids). Shown in blue is the full length heimProfilin and in red mutant heimProfilin. Extra residues (in blue) corresponds to the N-terminal residues. Note, very small changes in shifts are observed between the full length and the mutant. This indicates that deletion of the N-terminal 20 amino acids do not influence the overall fold of heimProfilin. (b) Structure of human profilin (PDB code: 1fil), showing the different structural elements. The corresponding position of the additional helix observed in heimProfilin is also shown.

Conclusions

In this study, we have determined the NMR backbone assignment and dynamic data of a profilin from Heimdallarchaeota in the Asgard superphylum. Our secondary structure analysis indicates that this profilin possesses similar structural elements to eukaryotic homologues, albeit at varied lengths. Our data also indicates that the heimProfilin appears rigid apart from N-terminal residues 1–24 which are not present in previously characterized eukaryotic profilins. We observed the helix between residue 64–68 which lies in the interface of the actin binding site when compared to eukaryotic profilin, and likely plays a role in modulating acting polymerization. In fact, corresponding residues 50–53 (GVLVG) which are involved in actin binding25 in eukaryotic profilin, form part of helix 2 and appeared to align with the helix between residues 64–68 (GSEVL). These amino acids show some sequence similarity and thus support the notion of a role in actin binding. Finally, we observed that heimProfilin contains an 8 strand beta sandwich between the helices and a sequence alignment indicate that these structural elements occupy similar positions as their eukaryotic counterpart.