Abstract
To understand the origin and early evolution of life it is crucial to establish characteristics of the primordial environment that facilitated the emergence and evolution of life. One important environmental factor is the pH of the primordial environment. Here, we assessed the pH-dependent thermal stabilities of previously reconstructed ancestral nucleoside diphosphate kinases and ribosomal protein uS8s. The selected proteins were likely to be present in ancient organisms such as the last common ancestor of bacteria and that of archaea. We also assessed the thermal stability of homologous proteins from extant acidophilic, neutralophilic, and alkaliphilic microorganisms as a function of pH. Our results indicate that the reconstructed ancestral proteins are more akin to those of extant alkaliphilic bacteria, which display greater stability under alkaline conditions. These findings suggest that the common ancestors of bacterial and archaeal species thrived in an alkaline environment. Moreover, we demonstrate the reconstruction method employed in this study is a valuable technique for generating alkali-tolerant proteins that can be used in a variety of biotechnological and environmental applications.
Similar content being viewed by others
Introduction
Ancestral sequence reconstruction (ASR) is a method used to deduce plausible protein sequences that may have been present in extinct organisms1,2,3,4,5,6. Fossil records offer an invaluable source of information about extinct organisms as well as the likely conditions on early Earth. However, no fossil records of ancient organisms that lived more than 3,500 million years ago have been found. The microfossil discovered in the Pilbara region of Western Australia in 1993 has, so far, been recognized as the oldest known fossil on Earth7. Initially it was unclear whether these microstructures could have formed via abiotic processes, but more recent evidence supports their biogenicity8. However, approaches to study even earlier lifeforms cannot solely rely on the discovery of still older microfossils. It is now possible to approximate the sequences of ancient genes and proteins by comparing those found in modern genomes. This is because the sequence information of the genes and proteins that were present in ancient species has been inherited throughout the evolutionary process. The concept of resurrecting ancient protein sequences was originally proposed by Pauling and Zuckerkandl more than 50 years ago9. At that time, the technology required for accurate reconstruction was unavailable, and their assertion was highly speculative. However, this concept has been refined over time, and with the expansion of genome projects it has become possible to infer ancient sequences with a greater degree of confidence10,11. Many studies have used ASR to characterize ancient proteins, obtaining information on their physical properties, including thermal stability12,13,14,15 and substrate specificity16,17,18,19,20,21, as well as to deduce the likely environment for early life22,23,24,25.
ASR is not only a powerful tool for inferring ancient protein sequences but also for protein engineering, particularly in designing thermally stable proteins. The inferred ancestral amino acid sequence at the root of a phylogenetic tree of 3-isopropyl malate dehydrogenases was introduced into an extant hyperthermophilic enzyme as an amino acid substitution, which resulted in a further thermostabilized enzyme26. ASR often yields thermostable proteins, possibly because the ancestral organisms lived in high-temperature environments and their proteins were highly thermostable27,28,29. However, the inferred ancestral sequences tend to converge to the consensus amino acid in many positions, which also contributes to improving the thermal stability of proteins30,31,32. ASR has been criticized because it is biased toward high stability amino acid sequences that are often more thermostable than the consensus sequences12. In principle, ASR can predict an ancestral/consensus sequence irrespective of the possible bias present in the input data of extant homologous sequences compared to the consensus method33.
Here, we investigated the pH stability of the ancestral nucleoside diphosphate kinase (NDK) and ribosomal protein uS8. NDK plays a crucial role in the regulation of nucleotide metabolism in cells. The enzyme is responsible for catalyzing the transfer of the γ-phosphate group from a nucleoside triphosphate to a nucleoside diphosphate, thereby producing a new nucleoside triphosphate and a new nucleoside diphosphate. This reaction is important for maintaining the overall nucleotide balance, which is necessary for a wide range of cellular processes such as DNA replication, RNA transcription, and energy metabolism. NDK is found in almost all existing organisms and is highly conserved across different species, suggesting its fundamental role in cellular function. The ribosomal protein, uS8, is a small RNA-binding protein and plays a crucial role in the assembly and stability of the ribosome34. Molecular phylogenetic analysis together with inference of the ancestral amino acid sequence for NDK and uS8 were previously carried out, which revealed their highly thermostable ancestral forms at near neutral pH12,35,36.
In this study, we aimed to investigate the pH profiles of the unfolding temperatures of the ancestral proteins. Intriguingly, ancestral NDKs and uS8s retain their thermostability even under alkaline conditions. Finally, we discuss the potential pH conditions of the ancestral biosphere and the utility of designing alkali-tolerant proteins using ASR for biotechnological or bioremediation applications.
Results
Thermal stabilities of extant NDKs as a function of pH
Initially, the pH-dependent thermal stability of NDKs from extant organisms was analyzed. NDKs from seven organisms were prepared: specifically, Sulfolobus tokodaii, Methanocaldococcus jannaschii, Archaeoglobus fulgidus, Thermus thermophilus, Bacillus halodurans, B. pseudofirmus and Desulfonatronovibrio hydrogenovorans. The acidophilic and hyperthermophilic archaeon, S. tokodaii, grows optimally at pH 2–337. M. jannaschii and A. fulgidus are hyperthermophilic archaea and their optimum pH for growth are 6.0 and 6.9, respectively38,39. T. thermophilus is an extreme thermophilic bacterium and its optimum pH for growth is also nearly 7.040. B. halodurans, B. pseudofirmus and D. hydrogenovorans are alkaliphilic bacteria and grow optimally at pH 10.0–10.5, pH 10.0–11.0 and pH 9.6, respectively41,42,43. The protein from A. fulgidus exhibited an unconventional unfolding curve at pH 7.0, indicating a slight increase in the content of helical structure rather than denaturing into a random coil at temperatures between 100 and 106 °C (Fig. S1). Therefore, further thermal unfolding experiments were not performed for A. fulgidus NDK. The other extant NDKs were subjected to temperature-induced unfolding experiments at pH values of 5.0, 7.0, and 9.0. To generate the unfolding curves, ellipticity at 222 nm was monitored as a function of temperature. The normalized unfolding curves are shown in Fig. 1. Table 1 lists the unfolding midpoint temperatures (Tms) at which the relative signal change reaches 0.5 during the unfolding process. These Tm values were used to compare the thermal stabilities of proteins. The common characteristic shared by acidophilic and neutralophilic NDKs is that their highest Tms were observed at pH 7.0. In addition, the acidophilic and neutralophilic NDKs have a higher Tm at pH 5.0 than at pH 9.0.
The extant alkaliphilic NDKs were subjected to temperature-induced unfolding experiments under conditions of both low (50 mM KCl) and high (500 mM NaCl) salt because their host alkaliphilic bacteria also exhibit halophilic characteristics. It was noted that the protein from D. hydrogenovorans exhibited a conventional two-state unfolding curve only under high salt conditions (500 mM NaCl). Under the low salt condition, the two alkaliphilic NDKs from Bacillus sp. displayed the highest Tm at pH 7.0 and their Tms at pH 9.0 were higher than those at pH 5.0. Moreover, in the presence of 500 mM NaCl, the Tms of the three alkaliphilic NDKs at pH 9.0 were similar to those at pH 7.0 and their Tms at pH 5.0 were 14–23 °C lower than those at pH 9.0.
Thermal stabilities of ancestral NDKs as a function of pH
We previously constructed phylogenetic trees comparing the amino acid sequences of NDKs in currently existing archaea and bacteria12. This enabled the reconstruction of various ancestral NDKs representing proteins in the common ancestor of archaea and that of bacteria. All the reconstructed NDKs appeared to be highly thermostable at near neutral pH. In this study, some of the resurrected NDKs (Arc3, Arc4, Arc5, Bac3, Bac4, Bac5) were subjected to temperature-induced unfolding experiments at pH values of 5.0, 7.0, and 9.0 (Fig. 1, Table 1). The six ancestral NDKs showed the highest Tm values at pH 7.0. Moreover, the Tm values of the ancestral NDKs at pH 9.0 were similar to those at pH 7.0. However, while Arc4 and Bac4 still exhibited high Tm values at pH 5.0, the Tm values of Arc3, Arc5, Bac3, and Bac5 at pH 5.0 were significantly lower than those observed at pH 7.0 and pH 9.0. Thus, the ancestral NDKs maintain greater thermostability in alkaline conditions, while their thermostabilities tend to decrease in acidic conditions. We conclude that the pH-dependencies of thermal stability for the reconstructed ancestral NDKs are more akin to those of NDKs from extant alkaliphilic bacteria rather than those from acidophilic or neutralophilic microorganisms.
The amino acid composition, ratio of acidic to basic amino acids and isoelectric points of the extant and ancestral NDKs are summarized in Tables S1 and S2. It has been reported that alkali-tolerant proteins often have increased proportions of arginine and histidine residues and a paucity of glutamate residues44. However, such a trend was not found in the extant alkaliphilic and ancestral NDKs. It is important to note that this observation specifically applies to the ancestral and alkaliphilic NDKs analyzed in this study. To ascertain the universality of this trend across all NDKs, additional analysis encompassing a more diverse set of NDKs is required. Alkaliphilic proteins are also known to have higher isoelectric points than their neutralophilic counterparts45. However, this was not the case for the alkaliphilic and ancestral NDKs because they have a greater ratio of acidic amino acids to basic amino acids. Consequently, the isoelectric point of alkaliphilic and ancestral NDKs is more acidic than the neutralophilic and acidophilic NDKs. Accordingly, the alkali-tolerance displayed by alkaliphilic NDKs and ancestral NDKs may involve a unique mechanism. Further mutational and structural analyses, including X-ray crystallography and molecular dynamics simulations, will be required to elucidate the precise mechanism of alkali tolerance of NDKs.
Specific activities of ancestral and extant NDKs as a function of pH
The enzymatic activity of ancestral NDKs was measured by monitoring the transfer of phosphate from GTP to ADP to produce GDP and ATP, respectively. Figure 2 shows the pH dependence of the specific activities of the ancestral NDKs, as well as those of NDKs from extant acidophilic archaeon S. tokodaii and neutralophilic archaeon A. fulgidus for comparison. Specific activity increased with pH for NDKs from both S. tokodaii and A. fulgidus. However, the optimal pH for growth of S. tokodaii and A. fulgidus is pH 2–3 and approximately 7, respectively. Similarly, the ancestral NDKs functioned optimally in an alkaline solution. Thus, the pH profile of the specific activity of NDKs that originate from microorganisms inhabiting different pH environments did not correlate with the pH environments of the host microorganisms. In other words, the reaction catalyzed by NDK, transferring the γ-phosphate of GTP to ADP to generate GDP and ATP, showed its maximum efficiency at pH 8.0–8.5 independent of the optimal growth pH of the host microorganism. The observation of the highest reaction rate under alkaline conditions may be attributed more to the characteristics of the phosphotransfer reaction between a nucleoside triphosphate and a nucleoside diphosphate than to the nature of the catalyzing enzyme.
Thermal stabilities of extant uS8s as a function of pH
The pH-dependent thermal stabilities of extant and ancestral uS8s were measured (Fig. 3; Table 2). For extant uS8s, we chose to analyze four proteins in all: namely, two from extant neutralophilic bacteria and two from alkaliphilic bacteria. For neutralophilic uS8s, we chose to analyze proteins from T. thermophilus and Thermotoga maritima. T. maritima is a hyperthermophilic bacterium that optimally grows at pH 746. For both uS8s, the Tm values at pH 5.0 and pH 7.0 were similar or the same but the Tm values decreased at pH 9.0. Therefore, the extant uS8s from neutralophilic bacteria are less stable in alkali conditions compared to acidic and neutral conditions. For alkaliphilic uS8s, we chose to analyze proteins from Alkaliphilus transvaalensis and B. halodurans. A. transvaalensis grows in highly alkaline environments, with a pH range of 8.5 to 12.5 and optimally at pH 10.047. In contrast to the neutralophilic uS8s, proteins from alkaliphilic bacteria are adapted to alkali conditions. Indeed, their Tm values at pH 9.0 were the same or even higher than those at pH 7.0. Moreover, their Tm values at pH 5.0 were significantly lower than those at pH 7.0 and pH 9.0.
Thermal stabilities of ancestral uS8s as a function of pH
Previously, we built a phylogenetic tree containing the sequences of extant bacterial and archaeal uS8s, which was then used to infer the two last bacterial common ancestral sequences of uS8 (P_Bac and I_Bac)36. In this study, we inferred the last archaeal common ancestral sequences of uS8 using the same phylogenetic tree with either IQ-TREE48 or CodeML in PAML49. The archaeal sequence inferred using IQ-TREE was named I_Arc and that predicted by CodeML was named P_Arc (Supplementary Data 1). Genes encoding the two inferred sequences were synthesized and then used to prepare the ancestral uS8 of the last archaeal ancestor. However, I_Arc was not used in subsequent analyses as it precipitated during the purification process.
Temperature-induced unfolding experiments were performed at pHs 5.0, 7.0, and 9.0 for P_Arc, as well as for P_Bac and I_Bac. The unfolding curves are given in Fig. 3 and their Tms are listed in Table 2. P_Arc showed the highest Tm value at pH 7.0 and its Tm at pH 9.0 was higher than that at pH 5.0. Moreover, Tm values at pH 9.0 for P_Bac and I_Bac were similar to those at pH 7.0. However, the Tm values at pH 5.0 were significantly lower than those at pH 7.0 and pH 9.0. Therefore, akin to the results for ancestral NDKs, the structural stabilities of the ancestral uS8s are adapted to alkaline rather than acidic conditions. Consequently, the pH profiles of thermal stability for ancestral uS8s from the last bacterial common ancestor are similar to those of uS8s from extant alkaliphilic bacteria.
The amino acid composition, ratio of acidic to basic amino acids and isoelectric points of the extant and ancestral uS8s are summarized in Tables S3 and S4. Extant alkaliphilic uS8s have a smaller ratio of basic residues to acidic residues by comparison to neutralophilic uS8s. By contrast, ancestral uS8s have a larger ratio of basic residues. However, neutralophilic, alkaliphilic and ancestral uS8s all possess similar isoelectric points. As with NDK, further structural analysis will be required to reveal the molecular mechanism of alkali-tolerance in uS8s.
Discussion
The potential of hydrogen or pH, which represents the hydrogen ion concentration in an aqueous solution, is a fundamental property of Earth's oceans and plays a crucial role in understanding water biogeochemistry. The primordial pH environment is especially significant as it played a vital role in the development of early life on Earth by providing the necessary conditions for the formation of organic compounds, including amino acids and nucleotides. Figure 4 illustrates candidate primordial biosphere environments and their estimated pH values. Several studies have attempted to estimate the pH of the early ocean. Rouchon and Orberger suggested that the early ocean was an acidic environment, with pH values ranging from 5.5 to 6.5 due to the dissolution of carbon dioxide from the atmosphere into the ocean50. Meanwhile, another study proposed that the early Earth's ocean may have had a more neutral pH environment, with pH values ranging from ~ 6.5 to 7.051. This conclusion was reached using a statistical model of seawater pH as a function of atmospheric carbon dioxide pressure. Local environments that may have been the site of the origin and early evolution of life include submarine hydrothermal vents. Some of the submarine hydrothermal vents spew alkaline hydrothermal water, generating an environment with an extremely high pH value that could have potentially supported early life on primitive Earth52,53.
Aside from the ocean, terrestrial hot springs are another possible biosphere for primordial organisms. Some researchers have proposed that the origin of life relied on a land-based volcanic hot spring pool54. Koonin and colleagues predicted that primitive hot springs were neutral55. Deep groundwater could also provide a possible habitat for primitive life. Water that springs from the ultramafic rocks and serpentinite soils at The Cedars was once thought to be too harsh for life due to its highly alkaline nature (pH = ~ 12) and highly reducing conditions. However, Suzuki et al. discovered that this seemingly harsh and challenging environment supports a surprisingly diverse range of microbes with specialized metabolic strategies56. Such springs, which rely on deep-seated water as a source, might have provided a habitat for primitive biotic communities.
The technique of ASR allows us to reconstruct proteins from ancient organisms2,6,57,58. In the present study, this method was employed to obtain insights into the likely pH of primitive habitats. Each protein has an optimal pH range in which it is most stable and functional. This is because the amino acid side chains within the protein have different pKa values, which determine their ionization state at different pHs. When the pH of the environment does not correspond to the optimal range for the protein, it can alter the charged state of these amino acid residues and destabilize the protein fold. The pH range that promotes protein stability is thought to reflect the intracellular pH of the organism hosting the protein. It is known that certain acidophilic microorganisms possess acidic cytosols, necessitating the functionality of the proteome at low pH levels59. Alternatively, other acidophilic microorganisms maintain a cytoplasmic pH close to neutrality by balancing protons and electrons through the use of transporters and channels, thereby obviating the necessity for the proteome to function at low pH60. Conversely, organisms living in alkaline environments have higher intracellular pH, although the intracellular pH values are usually more neutral than the external environment61. Therefore, microorganisms inhabiting alkaline environments need to possess proteins that are sufficiently stable and functional at high pH.
Most of the reconstructed ancestral proteins examined in this study were found to be more stable in alkaline conditions, displaying properties similar to those from contemporary alkaliphilic bacteria. Specifically, the ancestral NDKs, Arc4 and Bac4, and the bacterial common ancestral uS8s, P_Bac, and I_Bac, exhibited comparable thermal stability at both alkaline and neutral pHs (Figs. 1 and 3; Tables 1 and 2). By contrast, microbial proteins derived from acidophilic and neutralophilic organisms displayed higher unfolding temperatures at pH 5.0 than at pH 9.0 (Figs. 1 and 3; Tables 1 and 2). The findings suggest that the bacterial and archaeal common ancestors may have thrived in an alkaline environment such as submarine alkaline hydrothermal vents or alkaline springs originating from deep groundwater.
In the presence of NaCl, protein stability is known to vary—either improving, decreasing, or remaining unchanged—depending on the specific protein62,63,64. However, our understanding of salinity in the primordial environment is limited. Blaber and colleagues demonstrated the necessity of a halophilic environment for the folding of proteins designed with a bias for prebiotic amino acids, presumed to have been available in the early Earth's environment65. Their study emphasizes the significance of a halophilic environment in the context of early protein synthesis and, consequently, in abiotic events.
Our temperature-induced unfolding experiment using alkaliphilic NDKs demonstrated a stabilizing effect of 500 mM NaCl at pH 9.0 (Fig. 1; Table 1). Similarly, it is plausible that salinity levels played a crucial role in maintaining the stability of the cellular machinery of ancestral microorganisms. However, an unfolding experiment conducted on the ancestral NDK, Arc3, in the presence of 500 mM NaCl at pH 9.0 yielded an atypical thermal unfolding curve (Fig. S2), and thus the unfolding temperature could not be estimated. Therefore, thermal unfolding experiments under the presence of 500 mM NaCl for other ancestral NDKs and acidophilic/neutralophilic NDKs were not performed. Additionally, for uS8, the unfolding temperature was consistently measured under the presence of 250 mM NaCl, as analysis at lower salt concentrations was not feasible due to precipitation. The salinity levels in the primordial biosphere environment need further investigation using other proteins in future studies.
ASR relies on a phylogenetic tree of homologous amino acid sequences. However, obtaining an accurate tree and faithfully reconstructing ancestral sequences is technically challenging. Although advancements in ASR techniques over the past decade have greatly improved the accuracy, implications drawn from the tree and ancestral sequences are difficult to verify. Several studies have used ASR to estimate the temperature of the ancient biosphere, which suggested the thermophilic ancestry of life based on the high thermostability of most reconstructed ancestral proteins12,23. However, Williams et al. proposed that inaccurately reconstructed sequences could result in an overestimation of thermostability66. Tawfik and colleagues also suggested that high thermodynamic stability of ancestral proteins may have been driven by other factors, such as high oxidative pressure and radiation levels, absence of cellular osmolytes or chaperones, or the low fidelity of transcription–translation machinery67. ASR has also been used to connect ancient proteins to early environments. One such study was conducted by Kaçar and colleagues, who used ASR to resurrect an ancestral RuBisCO gene from extant cyanobacteria68. When cultured in conditions corresponding to the potential Precambrian environment the carbon isotope signatures of the engineered cyanobacteria were within modern ranges. Although this finding has important implications for understanding the uniformitarian assumptions of carbon isotope signatures over geologic time, they also noted that the modern organism and its proteins may have influenced the ancestral RuBisCO phenotype. The same group also warned of the potential pitfalls of interpreting paleophenotype models and data25. Therefore, despite the proven usefulness of ASR for linking ancient proteins to early environments, it is crucial to consider the limitations of ASR, such as the potential for errors in building phylogenetic trees and inferring ancestral sequences.
Prior to this study, it has been widely accepted that ASR provides a powerful approach for generating thermally stable proteins. Our present results expand upon this knowledge by demonstrating that ASR is also an effective method for producing alkali-tolerant proteins. Such proteins have significant potential for use in diverse fields, such as industrial applications and bioremediation, due to their ability to remain stable at high pHs and function even in the presence of detergents44. Furthermore, they are valuable in instances where the pH-enhanced solubility of substrates and ligands is anticipated44. In the realm of environmental science, alkali-tolerant enzymes may, in principle, be useful tools for the remediation of alkaline soils and wastewater. Given that ASR can produce proteins that are stable against both heat and alkali, this method is poised to become a valuable technique for generating proteins that can be used in a variety of contexts.
Methods
Preparation of ancestral and extant NDKs
The expression plasmids used for ancestral NDKs (Arc3, Arc4, Arc5, Bac3, Bac4, Bac5) and extant NDKs from T. thermophilus and A. fulgidus were constructed previously12. The genes encoding extant NDKs from M. jannaschii and S. tokodaii were synthesized by Eurofins Genomics (Tokyo, Japan) and then cloned into the NdeI-BamHI restriction recognition sites of plasmid pET23a(+) (Merck, Tokyo, Japan). The genes encoding extant NDKs from alkaliphilic microorganisms, B. halodurans, B. pseudofirmus and D. hydrogenovorans were also synthesized by Eurofins Genomics and then cloned into the NdeI-BamHI restriction recognition sites of plasmid pET15b (Merck) so that the resulting recombinant proteins include an N-terminal polyhistidine tag. To prepare and purify each of the ancestral and extant NDKs, one of the expression plasmids were used to transform Escherichia coli Rosetta 2(DE3) (Merck) and the resulting transformant was cultured in Luria–Bertani (LB) medium supplemented with 150 μg/ml ampicillin. Heterologous gene expression was induced using the Overnight Express Autoinduction system I (Merck). After overnight culture at 37 °C, cells were harvested and disrupted by sonication. The soluble fraction was recovered after centrifugation at 60,000 × g for 20 min. To purify the ancestral and extant thermophilic (T. thermophilus, A. fulgidus, M. jannaschii, S. tokodaii) NDKs, the supernatants were heated at 80 °C for 20 min and centrifuged at 60,000 × g for 20 min to precipitate the denatured E. coli proteins. The supernatants were then subjected to chromatography using a HiTrap Q and Resource Q column (Cytiva, Tokyo, Japan). Alkaliphilic NDKs were purified by chromatography using a HisTrap HP and Resource Q column (Cytiva). All the resulting protein preparations appeared to be homogeneous as judged by sodium dodecyl sulfate–polyacrylamide gel electrophoresis followed by Coomassie blue staining.
Thermal stability measurements of NDKs at various pHs
The A280 values of the protein solutions were used to determine their concentrations. The molar absorption coefficient at 280 nm for each protein was determined using a modified method of Pace and colleagues69 based on the procedure described by Gill and von Hippel70. The thermal denaturation profile for each protein was monitored by observing the change in ellipticity at 222 nm using a J-1100 spectropolarimeter (Jasco, Tokyo, Japan) equipped with a programmable temperature controller and a pressure-proof cell compartment to prevent bubbling and evaporation at high temperatures. Protein solutions were diluted to a final concentration of 20 μM with 20 mM potassium citrate (pH 5.0), potassium phosphate (pH 7.0) or potassium borate (pH 9.0), supplemented with 50 mM KCl and 1 mM EDTA. Thermal denaturation of the extant alkaliphilic NDKs was also measured in the solution supplemented with 500 mM NaCl. A cell with a pathlength of 0.1 cm was used, and the temperature was increased at a rate of 1.0 °C/min.
Specific activity measurements of NDKs at various pHs
Enzymatic activity was determined at 70 °C by measuring the increase in the amount of product ATP using the Kinase-Glo Luminescent Kinase Assay kit (Promega, Tokyo, Japan). The assay solution contained 50 mM MOPS (pH 5.5–6.5) or HEPES (pH 6.5–8.5), 25 mM KCl, 10 mM (NH4)2SO4, 2.0 mM Mg(CH3COO)2, 1.0 mM dithiothreitol, 5.0 mM ADP, and 5.0 mM GTP. One unit of enzyme activity was defined as the formation of 1 μmol of ATP per minute.
Ancestral sequence inference of the last archaeal common ancestor of uS8
We previously inferred two amino acid sequences of the last bacterial common ancestral uS8 based on a phylogenetic tree consisting of extant archaeal and bacterial uS8 sequences36. In this study, two amino acid sequences of the last archaeal common ancestor were inferred using the same phylogenetic tree and either IQ-TREE48 or CodeML in PAML49. GASP71 was used to estimate the location of gaps in the ancestral sequences. The resulting amino acid sequences were named I_Arc and P_Arc, which are available in FASTA format (Supplementary Data 1).
Construction of expression plasmids for ancestral and extant uS8
The genes encoding archaeal ancestral uS8s were synthesized by Eurofins Genomics. The genes encoding extant uS8 from T. thermophilus, T. maritima, A. transvaalensis and B. halodurans were also synthesized by Eurofins Genomics. To construct I_Arc and P_Arc carrying a GST-tag at its N-terminus, the respective genes were ligated into pGEX-6p-1 (Cytiva). The genes encoding extant alkaliphilic uS8s from A. transvaalensis and B. halodurans were also cloned into pGEX-6p-1. The gene encoding uS8s from extant neutralophilic thermophiles, T. thermophilus and T. maritima, were cloned into pET23a(+) (Merck).
Purification of uS8s
The bacterial ancestral uS8s were overexpressed in E. coli and subsequently purified as described previously36. The archaeal ancestral and the extant alkaliphilic uS8s were expressed and purified as an N-terminally GST-tagged form. For their production, E. coli Rosetta 2(DE3) was transformed with the respective expression plasmids and then cultured in LB medium supplemented with 150 μg/ml ampicillin. Overexpression was induced with Overnight Express Autoinduction System 1. After overnight cultivation, cells were harvested by centrifugation and disrupted by sonication. The precipitate was removed by centrifugation at 60,000 × g for 20 min and the soluble proteins were subjected to GSTrap HP affinity column chromatography (Cytiva). The N-terminal GST-tag (excepting residues G-4P-3L-2G-1) was then removed by digestion with PreScission Protease (Cytiva). The PreScission Protease and newly liberated GST-tag were subsequently removed by gel filtration chromatography on a Superdex 75 column (Cytiva). For preparation of T. thermophilus and T. maritima uS8s, E. coli Rosetta 2(DE3) carrying the expression plasmid was cultivated in LB medium supplemented with 150 μg/ml ampicillin. Expression was induced with Overnight Express Autoinduction System 1 (Merck). After overnight culture, cells were harvested by centrifugation and disrupted by sonication. The supernatants were then subjected to cation exchange chromatography on a HiTrap SP-FF column (Cytiva). All the purified proteins appeared to be homogeneous as judged by sodium dodecyl sulfate–polyacrylamide gel electrophoresis followed by Coomassie blue staining.
Thermal stability measurements of uS8s at various pH values
The A280 value of each purified protein was measured, and the concentration was then calculated using the respective extinction coefficient. To record the thermal denaturation profile of each protein, the change in ellipticity at 222 nm was monitored using a J-1100 spectropolarimeter (Jasco), which was equipped with a programmable temperature controller and a pressure-proof cell compartment. The protein solutions were diluted with 20 mM potassium citrate (pH 5.0), potassium phosphate (pH 7.0), or potassium borate (pH 9.0), supplemented with 250 mM NaCl and 1 mM EDTA, to achieve a final concentration of 20 μM. A 0.1 cm pathlength cell was used, and the temperature was increased at a rate of 1.0 °C/min.
Data availability
All data is included in the manuscript and/or supplementary materials.
References
Wheeler, L. C., Lim, S. A., Marqusee, S. & Harms, M. J. The thermostability and specificity of ancient proteins. Curr. Opin. Struct. Biol. 38, 37–43. https://doi.org/10.1016/j.sbi.2016.05.015 (2016).
Merkl, R. & Sterner, R. Ancestral protein reconstruction: Techniques and applications. Biol. Chem. 397, 1–21. https://doi.org/10.1515/hsz-2015-0158 (2016).
Rouet, R. et al. Structural reconstruction of protein ancestry. Proc. Natl. Acad. Sci. U S A 114, 3897–3902. https://doi.org/10.1073/pnas.1613477114 (2017).
Akanuma, S. Characterization of reconstructed ancestral proteins suggests a change in temperature of the ancient biosphere. Life https://doi.org/10.3390/life7030033 (2017).
Gaucher, E. A., Kratzer, J. T. & Randall, R. N. Deep phylogeny–how a tree can help characterize early life on Earth. Cold Spring Harb. Perspect. Biol. 2, a002238. https://doi.org/10.1101/cshperspect.a002238 (2010).
Thornton, J. W. Resurrecting ancient genes: Experimental analysis of extinct molecules. Nat. Rev. Genet. 5, 366–375. https://doi.org/10.1038/nrg1324 (2004).
Schopf, J. W. Microfossils of the early archean apex chert: New evidence of the antiquity of life. Science 260, 640–646 (1993).
Tice, M. M. & Lowe, D. R. Photosynthetic microbial mats in the 3,416-Myr-old ocean. Nature 431, 549–552. https://doi.org/10.1038/nature02888 (2004).
Pauling, L. & Zuckerkandl, E. Chemical paleogenetics: Molecular restoration studies of extinct forms of life. Acta Chem. Scand. 17, S9–S16. https://doi.org/10.3891/acta.chem.scand.17s-0009 (1963).
Bielawski, J. P. & Yang, Z. Maximum likelihood methods for detecting adaptive evolution after gene duplication. J. Struct. Funct. Genomics 3, 201–212 (2003).
Messier, W. & Stewart, C. B. Episodic adaptive evolution of primate lysozymes. Nature 385, 151–154. https://doi.org/10.1038/385151a0 (1997).
Akanuma, S. et al. Experimental evidence for the thermophilicity of ancestral life. Proc. Natl. Acad. Sci. U S A 110, 11067–11072. https://doi.org/10.1073/pnas.1308215110 (2013).
Butzin, N. C. et al. Reconstructed ancestral Myo-inositol-3-phosphate synthases indicate that ancestors of the Thermococcales and Thermotoga species were more thermophilic than their descendants. PLoS One 8, e84300. https://doi.org/10.1371/journal.pone.0084300 (2013).
Hart, K. M. et al. Thermodynamic system drift in protein evolution. PLoS Biol. 12, e1001994. https://doi.org/10.1371/journal.pbio.1001994 (2014).
Gaucher, E. A., Govindarajan, S. & Ganesh, O. K. Palaeotemperature trend for Precambrian life inferred from resurrected proteins. Nature 451, 704–707. https://doi.org/10.1038/nature06510 (2008).
Schulz, L. et al. Evolution of increased complexity and specificity at the dawn of form I Rubiscos. Science 378, 155–160. https://doi.org/10.1126/science.abq1416 (2022).
Schriever, K. et al. Engineering of ancestors as a tool to elucidate structure, mechanism, and specificity of extant terpene cyclase. J. Am. Chem. Soc. 143, 3794–3807. https://doi.org/10.1021/jacs.0c10214 (2021).
Harms, M. J. & Thornton, J. W. Evolutionary biochemistry: revealing the historical and physical causes of protein properties. Nat. Rev. Genet. 14, 559–571. https://doi.org/10.1038/nrg3540 (2013).
Bridgham, J. T., Ortlund, E. A. & Thornton, J. W. An epistatic ratchet constrains the direction of glucocorticoid receptor evolution. Nature 461, 515–519. https://doi.org/10.1038/nature08249 (2009).
Siddiq, M. A., Hochberg, G. K. & Thornton, J. W. Evolution of protein specificity: insights from ancestral protein reconstruction. Curr. Opin. Struct. Biol. 47, 113–122. https://doi.org/10.1016/j.sbi.2017.07.003 (2017).
Hochberg, G. K. A. & Thornton, J. W. Reconstructing ancient proteins to Uunderstand the causes of structure and function. Annu. Rev. Biophys. 46, 247–269. https://doi.org/10.1146/annurev-biophys-070816-033631 (2017).
Boussau, B. & Gouy, M. What genomes have to say about the evolution of the Earth. Gondwana Research 21, 483–494. https://doi.org/10.1016/j.gr.2011.08.002 (2012).
Gaucher, E. A., Thomson, J. M., Burgan, M. F. & Benner, S. A. Inferring the palaeoenvironment of ancient bacteria on the basis of resurrected proteins. Nature 425, 285–288. https://doi.org/10.1038/nature01977 (2003).
Garcia, A. K., Schopf, J. W., Yokobori, S. I., Akanuma, S. & Yamagishi, A. Reconstructed ancestral enzymes suggest long-term cooling of Earth’s photic zone since the Archean. Proc. Natl. Acad. Sci. U S A. 114, 4619–4624. https://doi.org/10.1073/pnas.1702729114 (2017).
Garcia, A. K. & Kaçar, B. How to resurrect ancestral proteins as proxies for ancient biogeochemistry. Free Radic. Biol. Med. 140, 260–269. https://doi.org/10.1016/j.freeradbiomed.2019.03.033 (2019).
Miyazaki, J. et al. Ancestral residues stabilizing 3-isopropylmalate dehydrogenase of an extreme thermophile: Experimental evidence supporting the thermophilic common ancestor hypothesis. J. Biochem. 129, 777–782 (2001).
Thomas, A., Cutlan, R., Finnigan, W., van der Giezen, M. & Harmer, N. Highly thermostable carboxylic acid reductases generated by ancestral sequence reconstruction. Commun. Biol. 2, 429. https://doi.org/10.1038/s42003-019-0677-y (2019).
Thomson, R. E. S., Carrera-Pacheco, S. E. & Gillam, E. M. J. Engineering functional thermostable proteins using ancestral sequence reconstruction. J. Biol. Chem. 298, 102435. https://doi.org/10.1016/j.jbc.2022.102435 (2022).
Akanuma, S. & Yamagishi, A. A strategy for designing thermostable enzymes by reconstructing ancestral sequences possessed by ancient life. In Biotechnology of extremophiles: Advances and challenges (ed. Rampelotto, P. H.) 581–596 (Springer, 2016). https://doi.org/10.1007/978-3-319-13521-2_20.
Sternke, M., Tripp, K. W. & Barrick, D. The use of consensus sequence information to engineer stability and activity in proteins. Methods Enzymol. 643, 149–179. https://doi.org/10.1016/bs.mie.2020.06.001 (2020).
Steipe, B. Consensus-based engineering of protein stability: from intrabodies to thermostable enzymes. Methods Enzymol. 388, 176–186. https://doi.org/10.1016/s0076-6879(04)88016-9 (2004).
Steipe, B., Schiller, B., Pluckthun, A. & Steinbacher, S. Sequence statistics reliably predict stabilizing mutations in a protein domain. J. Mol. Biol. 240, 188–192. https://doi.org/10.1006/jmbi.1994.1434 (1994).
Akanuma, S. et al. Phylogeny-based design of a B-subunit of DNA gyrase and its ATPase domain using a small set of homologous amino acid sequences. J. Mol. Biol. 412, 212–225. https://doi.org/10.1016/j.jmb.2011.07.042 (2011).
Shimojo, M. et al. In vitro reconstitution of functional small ribosomal subunit assembly for comprehensive analysis of ribosomal elements in E. coli. Commun. Biol. 3, 142. https://doi.org/10.1038/s42003-020-0874-8 (2020).
Akanuma, S., Yokobori, S., Nakajima, Y., Bessho, M. & Yamagishi, A. Robustness of predictions of extremely thermally stable proteins in ancient organisms. Evolution 69, 2954–2962. https://doi.org/10.1111/evo.12779 (2015).
Zhao, F. & Akanuma, S. Ancestral sequence reconstruction of the ribosomal protein uS8 and reduction of amino acid usage to a smaller alphabet. J. Mol. Evol. 91, 10–23. https://doi.org/10.1007/s00239-022-10078-w (2023).
Suzuki, T. et al. Sulfolobus tokodaii sp. nov. (f. Sulfolobus sp. strain 7), a new member of the genus Sulfolobus isolated from Beppu Hot Springs, Japan. Extremophiles 6, 39–44. https://doi.org/10.1007/s007920100221 (2002).
Jones, W. J., Leigh, J. A., Mayer, F., Woese, C. R. & Wolfe, R. S. Methanococcus jannaschii sp. nov., an extremely thermophilic methanogen from a submarine hydrothermal vent. Arch. Microbiol. 136, 254–261. https://doi.org/10.1007/BF00425213 (1983).
Beeder, J., Nilsen, R. K., Rosnes, J. T., Torsvik, T. & Lien, T. Archaeoglobus fulgidus isolated from Hot North Sea oil field waters. Appl. Environ. Microbiol. 60, 1227–1231. https://doi.org/10.1128/aem.60.4.1227-1231.1994 (1994).
Oshima, T. & Imahori, K. Description of Thermus thermophilus (Yoshida and Oshima) comb, novl., a nonsporulating thermophilic bacterium from a Japanese thermal spa. Intern. J. System. Bacteriol. 24, 102–112 (1974).
Zhilina, T. N. et al. Desulfonatronovibrio hydrogenovorans gen. nov., sp. nov., an alkaliphilic, sulfate-reducing bacterium. Int. J. Syst. Bacteriol. 47, 144–149. https://doi.org/10.1099/00207713-47-1-144 (1997).
Wei, Y. et al. Mutational loss of a K+ and NH4+ transporter affects the growth and endospore formation of alkaliphilic Bacillus pseudofirmus OF4. J. Bacteriol. 185, 5133–5147. https://doi.org/10.1128/jb.185.17.5133-5147.2003 (2003).
Kim, S. K., Min, Y. H. & Jin, H. J. Characteristics of the ErmK protein of Bacillus halodurans C-125. Microbiol. Spectr. 11, e0259822. https://doi.org/10.1128/spectrum.02598-22 (2023).
Fujinami, S. & Fujisawa, M. Industrial applications of alkaliphiles and their enzymes–past, present and future. Environ. Technol. 31, 845–856. https://doi.org/10.1080/09593331003762807 (2010).
Mesbah, N. M. Industrial biotechnology based on enzymes from extreme environments. Front. Bioeng. Biotechnol. 10, 870083. https://doi.org/10.3389/fbioe.2022.870083 (2022).
Zhang, C. L. et al. Carbon isotopic fractionations associated with thermophilic bacteria Thermotoga maritima and Persephonella marina. Environ. Microbiol. 4, 58–64. https://doi.org/10.1046/j.1462-2920.2002.00266.x (2002).
Takai, K. et al. Alkaliphilus transvaalensis gen. nov., sp. nov., an extremely alkaliphilic bacterium isolated from a deep South African gold mine. Int. J. Syst. Evol. Microbiol. 51, 1245–1256. https://doi.org/10.1099/00207713-51-4-1245 (2001).
Nguyen, L. T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274. https://doi.org/10.1093/molbev/msu300 (2015).
Yang, Z. PAML 4: Phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591. https://doi.org/10.1093/molbev/msm088 (2007).
Rouchon, V. & Orberger, B. Origin and mechanisms of K-Si-metasomatism of ca. 3.4–3.3Ga volcaniclastic deposits and implications for Archean seawater evolution: Examples from cherts of Kittys Gap (Pilbara craton, Australia) and Msauli (Barberton Greenstone Belt, South Africa). Precambr. Res. 165, 169–189. https://doi.org/10.1016/j.precamres.2008.06.003 (2008).
Halevy, I. & Bachan, A. The geologic history of seawater pH. Science 355, 1069–1071. https://doi.org/10.1126/science.aal4151 (2017).
Martin, W., Baross, J., Kelley, D. & Russell, M. J. Hydrothermal vents and the origin of life. Nat. Rev. Microbiol. 6, 805–814. https://doi.org/10.1038/nrmicro1991 (2008).
Shibuya, T., Komiya, T., Nakamura, K., Takai, K. & Maruyama, S. Highly alkaline, high-temperature hydrothermal fluids in the early Archean ocean. Precambr. Res. 182, 230–238. https://doi.org/10.1016/j.precamres.2010.08.011 (2010).
Damer, B. & Deamer, D. The hot spring hypothesis for an origin of life. Astrobiology 20, 429–452. https://doi.org/10.1089/ast.2019.2045 (2020).
Mulkidjanian, A. Y., Bychkov, A. Y., Dibrova, D. V., Galperin, M. Y. & Koonin, E. V. Origin of first cells at terrestrial, anoxic geothermal fields. Proc. Natl. Acad. Sci. U S A 109, E821-830. https://doi.org/10.1073/pnas.1117774109 (2012).
Suzuki, S. et al. Unusual metabolic diversity of hyperalkaliphilic microbial communities associated with subterranean serpentinization at The Cedars. ISME J. 11, 2584–2598. https://doi.org/10.1038/ismej.2017.111 (2017).
Gaucher, E. A. Experimental resurrection of ancient biomolecules: gene synthesis, heterologous protein expression, and functional assays. In Ancestral sequence reconstruction (ed. Liberles, D. A.) 153–163 (Oxford University Press, 2007).
Arenas, M. Methodologies for microbial ancestral sequence reconstruction. Methods Mol. Biol. 2569, 283–303. https://doi.org/10.1007/978-1-0716-2691-7_14 (2022).
Golyshina, O. V., Golyshin, P. N., Timmis, K. N. & Ferrer, M. The “pH optimum anomaly” of intracellular enzymes of Ferroplasma acidiphilum. Environ. Microbiol. 8, 416–425. https://doi.org/10.1111/j.1462-2920.2005.00907.x (2006).
Baker-Austin, C. & Dopson, M. Life in acid: pH homeostasis in acidophiles. Trends Microbiol 15, 165–171. https://doi.org/10.1016/j.tim.2007.02.005 (2007).
Slonczewski, J. L., Fujisawa, M., Dopson, M. & Krulwich, T. A. Cytoplasmic pH measurement and homeostasis in bacteria and archaea. Adv. Microb. Physiol. 55(1–79), 317. https://doi.org/10.1016/s0065-2911(09)05501-5 (2009).
Vonhippel, P. H. & Wong, K. Y. Neutral salts: the generality of their effects on the stability of macromolecular conformations. Science 145, 577–580. https://doi.org/10.1126/science.145.3632.577 (1964).
Inouye, K., Kuzuya, K. & Tonomura, B. Sodium chloride enhances markedly the thermal stability of thermolysin as well as its catalytic activity. Biochim. Biophys. Acta 1388, 209–214. https://doi.org/10.1016/s0167-4838(98)00189-7 (1998).
Bágel’ová, J., Fedunová, D., Gazová, Z., Fabian, M. & Antalík, M. Influence of NaCl and sorbitol on the stability of conformations of cytochrome c. Biophys. Chem. 135, 110–115. https://doi.org/10.1016/j.bpc.2008.03.010 (2008).
Longo, L. M., Lee, J. & Blaber, M. Simplified protein design biased for prebiotic amino acids yields a foldable, halophilic protein. Proc. Natl. Acad. Sci. USA 110, 2135–2139. https://doi.org/10.1073/pnas.1219530110 (2013).
Williams, P. D., Pollock, D. D., Blackburne, B. P. & Goldstein, R. A. Assessing the accuracy of ancestral protein reconstruction methods. PLoS Comput. Biol. 2, e69. https://doi.org/10.1371/journal.pcbi.0020069 (2006).
Trudeau, D. L., Kaltenbach, M. & Tawfik, D. S. On the potential origins of the high stability of reconstructed ancestral proteins. Mol. Biol. Evol. 33, 2633–2641. https://doi.org/10.1093/molbev/msw138 (2016).
Kędzior, M. et al. Resurrected Rubisco suggests uniform carbon isotope signatures over geologic time. Cell Rep. 39, 110726. https://doi.org/10.1016/j.celrep.2022.110726 (2022).
Pace, C. N., Vajdos, F., Fee, L., Grimsley, G. & Gray, T. How to measure and predict the molar absorption coefficient of a protein. Protein Sci. 4, 2411–2423. https://doi.org/10.1002/pro.5560041120 (1995).
Gill, S. C. & von Hippel, P. H. Calculation of protein extinction coefficients from amino acid sequence data. Anal. Biochem. 182, 319–326 (1989).
Edwards, R. J. & Shields, D. C. GASP: Gapped ancestral sequence prediction for proteins. BMC Bioinf. 5, 123. https://doi.org/10.1186/1471-2105-5-123 (2004).
Acknowledgements
This work was supported by JSPS KAKENHI (Grant numbers 19K21903, and 21H01200) to SA; and by the Astrobiology Center Program of the National Institutes of Natural Sciences (Grant Number AB022003) to SA.
Author information
Authors and Affiliations
Contributions
A.Y. and S.A. designed research; T.F., T.S., F.Z., and S.A. performed research; T.F., T.S., and S.A. analyzed data; A.Y. and S.A. wrote the manuscript; and all authors discussed the results and reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Fujikawa, T., Sasamoto, T., Zhao, F. et al. Comparative analysis of reconstructed ancestral proteins with their extant counterparts suggests primitive life had an alkaline habitat. Sci Rep 14, 398 (2024). https://doi.org/10.1038/s41598-023-50828-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-50828-4
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.