Evolution of a key enzyme of aerobic metabolism reveals Proterozoic functional subunit duplication events and an ancient origin of animals

Bezerra, Bruno Santos; Belato, Flavia Ariany; Mello, Beatriz; Brown, Federico; Coates, Christopher J.; de Moraes Leme, Juliana; Trindade, Ricardo I. F.; Costa-Paiva, Elisa Maria

doi:10.1038/s41598-021-95094-4

Download PDF

Article
Open access
Published: 03 August 2021

Evolution of a key enzyme of aerobic metabolism reveals Proterozoic functional subunit duplication events and an ancient origin of animals

Bruno Santos Bezerra¹,
Flavia Ariany Belato¹,
Beatriz Mello²,
Federico Brown¹,
Christopher J. Coates³,
Juliana de Moraes Leme⁴,
Ricardo I. F. Trindade⁵ &
…
Elisa Maria Costa-Paiva^1,5

Scientific Reports volume 11, Article number: 15744 (2021) Cite this article

3442 Accesses
5 Citations
Metrics details

Subjects

Abstract

The biological toolkits for aerobic respiration were critical for the rise and diversification of early animals. Aerobic life forms generate ATP through the oxidation of organic molecules in a process known as Krebs’ Cycle, where the enzyme isocitrate dehydrogenase (IDH) regulates the cycle's turnover rate. Evolutionary reconstructions and molecular dating of proteins related to oxidative metabolism, such as IDH, can therefore provide an estimate of when the diversification of major taxa occurred, and their coevolution with the oxidative state of oceans and atmosphere. To establish the evolutionary history and divergence time of NAD-dependent IDH, we examined transcriptomic data from 195 eukaryotes (mostly animals). We demonstrate that two duplication events occurred in the evolutionary history of NAD-IDH, one in the ancestor of eukaryotes approximately at 1967 Ma, and another at 1629 Ma, both in the Paleoproterozoic Era. Moreover, NAD-IDH regulatory subunits β and γ are exclusive to metazoans, arising in the Mesoproterozoic. Our results therefore support the concept of an ‘‘earlier-than-Tonian’’ diversification of eukaryotes and the pre-Cryogenian emergence of a metazoan IDH enzyme.

Heterotrophic euglenid Rhabdomonas costata resembles its phototrophic relatives in many aspects of molecular and cell biology

Article Open access 22 June 2021

Common origin of ornithine–urea cycle in opisthokonts and stramenopiles

Article Open access 07 October 2020

Recovery of Lutacidiplasmatales archaeal order genomes suggests convergent evolution in Thermoplasmatota

Article Open access 15 July 2022

Introduction

All living aerobic organisms need energy for their maintenance and growth—yielding ATP through the oxidation of organic molecules and using oxygen as the terminal electron acceptor¹. The Tricarboxylic Acid (TCA) Cycle, also known as the Citric Acid Cycle or Krebs’ Cycle² undertakes sequential oxidation. Considered a molecular furnace, the TCA does not simply play a role in ATP generation through catabolism of macromolecules, but it is inextricably linked to cellular functionality and homeostasis. The TCA (1) plays a unique role as an NAD and NADP coenzyme reducer for ATP production in the respiratory chain, (2) acts as a core metabolic integration point for catabolic processes of different macromolecules including carbohydrates, lipids, and proteins, and (3) produces intermediate components of anabolic processes, e.g., citrate that is used for lipogenesis^1,2. The turnover of the TCA cycle and the accumulation of citrate is influenced by the enzyme isocitrate dehydrogenase (IDH), which catalyzes the often irreversible oxidative decarboxylation of isocitrate to alpha-ketoglutarate (and CO₂), concurrent to coenzyme (NAD⁺, NADP) reduction¹.

Across the evolution of life, IDH regulation became more complex and the proteins diversified in terms of subunit number, size, or coenzyme binding³. The TCA cycle in eukaryotes uses the hetero-oligomeric NAD-IDH that exclusively locates into the mitochondrial matrix, whereas prokaryotes use mostly the homodimeric enzyme NADP-IDH⁴. Eukaryotes also have homodimeric NADP-IDHs that do not function in the TCA cycle, but aid lipid metabolism and counter oxidative damage in the cytosol and mitochondria⁵. Overall, eukaryotic NAD-IDH and NADP-IDH show more complex allosteric regulation processes than the prokaryotic phosphorylation-mediated regulation of NADP-IDH^3,5,6,7. Such differences in the eukaryotic and prokaryotic NADP-IDH allostery seem to have originated from two different mutation events in the NAD-dependent ancestor⁸. Eukaryotic NADP-IDHs regulate their activity by substrate induced conformational changes in the allosteric site in the presence of the substrate^5,7. Conversely, prokaryotic NADP-IDHs loose activity when phosphate groups are added to specific serine residues in the presence of acetate, alpha-ketoglutarate, and NADPH⁴. Generally, the eukaryotic IDHs have more complex regulation processes than single-step phosphorylation/de-phosphorylation switches³.

Among lineages, we find distinct numbers of IDH subunits that have evolved to act as heteromeric protein assemblies. Fungal and green algal IDHs are formed by two different subunits: IDH1 associated with allosteric regulation of the enzyme, and IDH2 responsible for substrate catalysis^6,9. IDH1 is also the name used for the cytosolic NADP-isocitrate dehydrogenase, whereas IDH2 is used for mitochondrial NADP-IDH (widely studied in humans). In the present study, we refer to IDH1 and IDH2 as the two subunits of the fungal NAD-dependent IDH. Metazoans, a lineage of opisthokonts (taxon composed by holozoans, including animals and fungi), present an exclusively octameric IDH with three different subunits: α, β and γ in a stoichiometric ratio 2:1:1¹⁰. The α subunit from mammals is structurally similar to the fungal IDH2 subunit, indicating that it has a catalytic function, whereas the β and γ subunits resemble the structure of the fungal IDH1 subunit, which serves as evidence of its regulatory function^11,12.

Regarding animals, the need for oxygen is almost exclusively associated with aerobic respiration. Previous investigators have suggested a causal link between the diversification of animals and the rise of atmospheric dioxygen levels in the late Neoproterozoic (1.000–541 Ma)^13,14. In fact, recent studies suggest that the earliest animals were likely small, soft-bodied, and collagen-poor—evolving under low oxygen levels that restricted their use of oxygen to high-priority physiological functions^15,16, including basal cellular aerobic respiration. Moreover, much of the molecular toolkit required for animal development originated deep in eukaryote evolutionary history during low oxygen periods¹⁷.

The evolutionary history of animals could be traced by considering single and multicellular eukaryotes as well as animal phylogenies, and the divergence times among their major lineages. Even for the metazoan lineage, which is one of the best-studied eukaryotic lineages, a consensus does not exist regarding its basal relationships (e.g., ctenophore vs porifera controversy). Moreover, reconstructions of the last common ancestor for animals are contentious^18,19,20,21, partly because early putative animal fossil records are problematic^22,23. While molecular dating for crown Metazoa range between 1298 and 615 Ma, the earliest metazoan fossils date to the late Ediacaran ~ 580 Ma^24,25,26. Fossils older than 600 Ma remain controversial and are limited to a few small sponges^27,28,29. Regarding eukaryotes, molecular clock estimates for their last common ancestor span ~ 1 billion years^30,31,32,33. Some differences can be attributed to the use of the fossil Bangiomorpha pubescens—a presumed bangiophyte red alga—as a calibration point. Time estimates obtained without employing B. pubescens as a calibration point are 200–300 Ma younger than those that do³⁴. Recently, the development of refined molecular clock methodologies has helped to reduce the disparities among molecular dating and the fossil evidence used to estimate clade age minima^26,35.

To elucidate the evolutionary processes that led to the diversification of major taxa during the geological history of the Earth, molecular estimates of divergence times are now used as a standard technique to infer chronograms³⁶ and have improved dramatically in recent years for deep time studies^37,38,39,40. The method consists of estimating the age of internal nodes based on the rates of nucleotide or amino acid substitutions among lineage sequences³⁶. Understanding species phylogeny is a prerequisite for studying protein function and adaptations at the molecular level. In fact, the proteins of an organism share the same phylogenetic history, and changes in protein sequences can be used to date the origin of various physiological adaptations⁴¹. Therefore, molecular dating of specific genes can provide new insights into well-established topics, such as the diversification of eukaryotes and animals^26,42. Although most deep time studies use hundreds of genes in order to estimate divergence times of species lineages^26,35, molecular dating of particular proteins can recover the evolutionary history of these proteins against a background of the evolution of major taxa that are intrinsically linked to these proteins. Molecular dating of deep divergences may be challenging, mostly because of issues such as sequence saturation, which can affect analyses by biasing the estimated genetic distances^43,44,45,46. However, estimated divergence times based on amino acid sequences that are more conserved compared to nucleotides sequences can alleviate the problem of saturation.

Because animals depend on the oxidation of organic molecules for cellular respiration, the evolution of proteins related to oxidative metabolism can allow us to trace their emergence time⁴⁷. The crucial role of NAD-IDH for cellular respiration and Krebs’ Cycle⁷ makes its evolutionary history a compelling source of information to study the evolution of early animals and eukaryotes. Previous works have presented the phylogenetic relationships of the IDH protein family in Prokaryotes and Eukaryotes⁸, however the evolutionary history of NAD-IDHs in animals, as well as molecular dating estimates of protein divergence, are still unclear. Previously, authors described lineage-specific duplication events that gave rise to three IDH subunits (α, β and γ) in metazoans only¹⁰, and these events can be used to estimate the time of animal diversification²⁶. We present the evolutionary history of eukaryotic NAD-IDHs and molecular dating estimates for the emergence of animal IDH and subsequent duplication events.

Results

Evolutionary history

Our final empirical dataset comprised 193 IDH sequences from eukaryotes and two IDH sequences from bacteria, used here as the outgroup (Supplementary Table S1; Supplementary Figure S2). Following alignment and trimming, the dataset included 353 residues (alignment file available at https://doi.org/10.6084/m9.figshare.13158116). Bayesian inference and maximum likelihood analyses recovered the same topology for the main eukaryotic lineages with high support values, and also poorly resolved nodes that are often observed in gene genealogies⁴⁸ (Fig. 1; IQ-tree tree file available at https://doi.org/10.6084/m9.figshare.13158101 and MrBayes tree file available at https://doi.org/10.6084/m9.figshare.13158110). Phylogenetic reconstruction performed with a mixture model, presented high concordance with Bayesian and maximum likelihood phylogenies based on regular substitution models (PhyloBayes tree file available at https://doi.org/10.6084/m9.figshare.14633157). In order to check if our heterogeneous taxonomic ensemble could bias the phylogenetic reconstructions, we tested the effect of taxonomic sampling by running a maximum likelihood analysis with only one representative for each taxon per subunit. The topology recovered was congruent with those obtained for our complete dataset.

Gene genealogy clearly evidenced multiple duplication events of eukaryotic NAD-IDH subunits that resulted in the three metazoan subunits, α, β and γ. IDHα was already present in the ancestor of metazoans, whereas IDHβ and IDHγ duplicated and diverged within the metazoan lineage (arrows, Fig. 1). IDHβ + IDHγ were sister groups to an unnamed IDH group—including Fungi, Capsaspora, and Choanoflagellate IDHs—forming a monophyletic opisthokonta clade. The opisthokonta IDHs showed a sister group relationship to another ciliate specific IDH that comprises the molecule in animals and the two subunits present in other opisthokonts (Fig. 1). This ciliate specific IDH likely originated from an earlier duplication event that pre-dated the opisthokonta ancestor.

The topology of the NAD-IDH gene tree corroborated the consensus of recent phylogenomic studies of Eukarya^49,50. We found two highly supported clades (PP. = 1; BP = 100%): one comprised exclusively of IDH sequences from euglenozoans (Discoba) and the second contained all remaining sequences. The latter clade was formed by two sister-groups and displayed the first duplication event present in the eukaryotic NAD-IDH evolutionary history. These two well-supported clades were clade I containing catalytic subunits IDH2 and animal subunit α (Fig. 1; pink clade; PP = 1; BP = 100%) and clade II containing regulatory subunits IDH1 and animal subunits β + γ (Fig. 1; blue and green clades; PP > 0.9; BP > 95%).

The clade containing the catalytic NAD-IDH subunits (Fig. 1; pink clade) revealed the divergence between a clade comprising the ciliate IDH sequences (Fig. 1; PP. = 1; BP = 100%) and another composed of Amorphea (Opisthokonta + Amoebozoa) IDH sequences (Fig. 1; PP. > 0.5; BP > 80%). The latter is further divided into two sister-groups: amoebozoans (Fig. 1; PP. = 1; BP = 100%) and opisthokonts IDHs (Fig. 1; PP = 1; BP = 100%). The opisthokont clade is further subdivided into a single IDH2 fungal sequence and a holozoan clade (Fig. 1; PP = 1; BP = 100%). In the holozoan clade, we found two IDH sequences, one from a Capsaspora species and one from a choanoflagellate species, which was the sister clade to the metazoans (Fig. 1; PP = 1; BP = 100%). The genealogy of the catalytic subunit(s) recapitulated the recent phylogenetic relationships recovered for these organisms, suggesting a common evolutionary history of the gene and the events of lineage divergence in the opisthokonts.

Clade II—composed of the regulatory subunits—showed evidence for a second eukaryotic gene duplication event that gave rise to animal subunits β and γ (Fig. 1; blue and green clades; PP = 1.0; BP = 100%). Prior to this duplication event, another separation of two gene lineages occurred: one formed by a ciliate specific gene clade and another formed by IDH1 genes of choanoflagellates, capsasporans, and fungi (Fig. 1; PP > 0.9; BP > 80%). The second duplication event in the clade was associated exclusively with metazoan IDH sequences (Fig. 1; blue and green clades; PP > 0.9; BP > 95%). All metazoan lineages used in this study were represented in both the β subunit (Fig. 1; blue clade; PP > 0.9; BP > 95%) and γ subunit (Fig. 1; green clade; PP = 1.0; BP = 100%) clades.

Molecular dating

The topology estimated by BEAST differed from the maximum-likelihood and Bayesian inferences. For instance, the euglenozoans NAD-IDH sequences formed a monophyletic clade within the catalytic subunit clade and only after the first eukaryotic duplication event. Moreover, a significant difference was observed in the position of choanoflagellate sequences in relation to animal subunits β and γ. In our former analysis (Fig. 1), IDH1 sequences of choanoflagellate, capsasporans, and fungi were the sister-group to animal subunits β and γ, prior to the second duplication event. In contrast, the BEAST analyses showed that the single choanoflagellate gene sequence appeared as a sister-group to animal subunit γ. With the exception of very deep divergences, node ages and credibility intervals estimated based on a mixture model in PhyloBayes generally matched the ones inferred by BEAST for the major clades (Supplementary Figure S3 and PhyloBayes timescale file available at https://doi.org/10.6084/m9.figshare.14633271). Because of this, we centered our results and discussion around BEAST’s timescale.

The age estimated for the first duplication event of eukaryotic NAD-IDH giving rise to catalytic subunits (IDH2 and α) and regulatory subunits (IDH1, β and γ) was approximately 1967 Ma (1641–2329 Ma) in the Paleoproterozoic (Fig. 2). Regarding the catalytic subunit, the split between the euglenozoan subunit and other eukaryotes occurred at ~ 1494 Ma (1186–1838 Ma). The split of amoebozoan subunits and opisthokont catalytic subunit was dated at ~ 1224 Ma (1016–1447 Ma), whereas amoebozoan catalytic NAD-IDH was dated at ~ 816 Ma (538–1120 Ma) (Fig. 2). The estimated divergence date between choanoflagellates and capsasporan NAD-IDH and animal α NAD-IDH occurred at approximately 943 Ma (807–1089 Ma). Finally, the emergence of subunit α for animal NAD-IDH was dated at ~ 866 Ma (753–991 Ma), in the Neoproterozoic (Fig. 2).

The gene lineage that gave rise to eukaryotic regulatory subunits IDH1 and metazoan regulatory subunits β and γ diverged at approximately 1747 Ma (1431–2023 Ma), before the second eukaryotic gene duplication event. This divergence gave rise to a fungal and capsasporan IDH1 clade dated at ~ 1254 Ma (854–1651 Ma) and a metazoan IDHβ + IDHγ clade dated ~ 1629 Ma (1334–1884 Ma). The latter clade, composed of metazoan subunits β and γ, located the second gene duplication event. The origin of subunit β was estimated at ~ 1249 Ma (1020–1490 Ma), while the subunit γ was dated at ~ 1531 Ma (1221–1769 Ma). Although the subunit β is exclusive to animals, this analysis recovered a choanoflagellate IDH sequence as a sister group to animal subunit γ. Thus, the estimated age for the origin of animal subunit γ (without considering the choanoflagellate IDH sequence) was approximately 1319 Ma (1100–1558 Ma; Fig. 2).

Discussion

We interrogated the emergence of metazoan NAD-IDH lineages in an attempt to resolve the relationships and evolutionary history of this protein among eukaryotic NAD-IDHs. Our results demonstrate that two gene duplication events occurred in the evolutionary history of NAD-IDH, one in the ancestor of eukaryotes approximately at 1967 Ma, and another at ~ 1629 Ma (both in the Paleoproterozoic Era). Additionally, we reveal that NAD-IDH regulatory subunits β and γ are exclusive to metazoans, and arose in the Mesoproterozoic. Previous studies have investigated the relationships between IDHs, focusing on NAD-IDH and NADP-IDH types^8,51. Although the first members of NAD-IDHs were likely to be present in the eukaryotic ancestor, the quaternary structure of these enzymes remain unresolved. Early IDHs may have functioned as a homodimer as observed in living prokaryotes⁴, an octameric structure as observed in extant eukaryotes³, or as intermediate heterodimers. Previous evolutionary analyses indicated eubacterial IDHs first evolved from an NAD-dependent precursor about 3.5 Ga⁸. In the evolutionary history of this protein, our findings suggest that there was a tendency to increase regulatory complexity in eukaryotes, which concurs with other studies³ (Fig. 3). In bacteria, the regulation of IDH is achieved by simple phosphorylation events⁴, whereas eukaryotic IDHs contain at least two different subunits that show allosteric regulation by distinct factors³. Animals are the only eukaryotic group with three different subunits; more binding motifs that can trigger protein activity have been identified (e.g., ATP and Ca2⁺)^52,53.

The ancestor of eukaryotic NAD-IDH arose ~ 1967 Ma (1641–2329 Ma) in the Paleoproterozoic. At that time, Earth was colonized mainly by microbial lifeforms, and representative species of only a few microbial clades are preserved in the fossil record. Moreover, the interpretation of these early eukaryote fossils is challenging because their key distinguishing characteristics, such as organelles and nuclei, do not preserve well⁵⁴. Our findings for the emergence of eukaryotic NAD-IDH during the Paleoproterozoic is in agreement with contemporary molecular clock estimates for eukaryotes emergence^32,33 and for the canonical view that stem and crown-group eukaryotes may have emerged early in the Proterozoic, keeping low diversity levels in restricted environments until a late Mesoproterozoic diversification^55,56,57. Previous studies suggested that the last common eukaryotic ancestor lived between 1866 and 1679 Ma^32,58, which is consistent with the earliest unambiguous microfossils interpreted as eukaryotic microfossils found in latest Paleoproterozoic rocks (ca. 1650 Ma)^59,60,61. Additionally, the early development of the eukaryotic molecular toolkit originating from a stem–line of descent—a propagating series of pluripotent cellular entities—at approximately 2000 Ma continues to gather support in the literature⁶². Using molecular clock estimates of protein folds, the emergence of eukaryotes in the Paleoproterozoic is also corroborated^63,64. Nevertheless, recent evidence suggest a late Mesoproterozoic origin of the eukaryotic crown group based mainly on eukaryotic sterols, which are presumed to have been present in the eukaryotic crown-ancestor and is absent from rock records until the early Neoproterozoic³⁴. Although our results may support a deep Paleoproterozoic origin of eukaryotes with a late Mesoproterozoic origin of the crown group, they do not support a late emergence of aerobic respiration (ca. 800 Ma; as suggested by Porter et al.³⁴).

The catalytic NAD-IDH subunit arose ca. 1967 Ma in the Paleoproterozoic as a result of the first gene duplication event in eukaryotes. This duplication event generated two NAD-IDHs, a subunit with catalytic function and another with regulatory function. According to our results, animal catalytic subunit α is closer to the fungal catalytic IDH2 sequences than to the other metazoan subunits β and γ, suggesting an orthologous relationship among animal α and fungal IDH2^11,12. The emergence of the opisthokont NAD-IDH catalytic subunit during the Mesoproterozoic (ca. 1224 Ma) corroborates recent paleontological studies from rocks dated at 1700–800 Ma and suggests a higher eukaryotic diversity during this earlier interval than previously known^65,66,67,68. The emergence of the amoebozoan NAD-IDH catalytic subunit was estimated to have occurred as recently as ~ 816 Ma in the Neoproterozoic, i.e., almost 700 Ma later than previous molecular dating estimates³². Amoebozoa is the eukaryotic supergroup sister to Obazoa, the group containing animals, fungi, and several microbial eukaryotic lineages^69,70 (Fig. 3).

The emergence of the NAD-IDH regulatory subunit was dated at ~ 1747 Ma, giving rise to two distinct gene lineages: a fungi + capsasporan NAD-IDH and a metazoan NAD-IDH β + γ. The emergence of the fungi/capsasporan NAD-IDH regulatory subunit occurred in the Mesoproterozoic and was dated at ~ 1254 Ma, corroborating previous estimates for the last common ancestor of extant Opisthokonta from 1389 to 1240 Ma³². The regulatory subunit IDH1 in fungi is closely related to subunits β and γ in animals, indicating that a second duplication event occurred. This result corroborates previously identified physiochemical properties between fungi and mammalian NAD-IDH sequences, such as, comparable molecular weights of subunits and active site architecture^12,53.

The NAD-IDH regulatory subunit originated from a gene duplication event in the Paleoproterozoic (ca. 1967 Ma). However, the emergence of the lineage that gave rise to metazoan subunits β + γ occurred later in the Mesoproterozoic (ca. 1629 Ma). Dates of emergence of animal-specific NAD-IDH subunit β (~ 1249 Ma) and NAD-IDH subunit γ (~ 1532 Ma) provide support for an earlier emergence of metazoans compared to previous estimates that date their origin to ~ 800 Ma^71,72. If animal divergence began in the Mesoproterozoic or even later in the Neoproterozoic, as suggested by recent molecular estimates^26,35,73, hypoxic conditions should have prevailed in most environmental settings for more than half of animal evolutionary history⁷⁴. An alternative scenario could be the existence of oxygenic niches present at the microscale since the rise of oxidative photosynthesis about 3.0 Ga^74,75. In spite of the strong debate surrounding those conditions^76,77,78,79, the fact is that hypoxic niches may have prevailed for most of Earth’s history. Thus, a long-lasting hypoxic period of animal evolution may have resulted in particular adaptations to meet the metabolic demands of oxygen, which could explain an evolutionary trend towards an increase of complexity in the NAD-IDH molecule. Beyond IDHs, tracing the presence/absence and evolutionary history of oxygen reductases (e.g., complex IV and cytochrome bd) would offer further insight into the prototypical pathways of aerobic metabolism in early metazoans when oxygen availability was a limiting factor.

Overall, our results support the hypothesis of an ‘‘earlier-than-Tonian’’ diversification of eukaryotes, as well as a pre-Cryogenian emergence of metazoans under low-oxygen conditions. This implies there has been an evolutionary trend towards increasing complexity in the regulatory subunit of animal NAD-IDH, which includes a complex of three subunits (one catalytic and two regulatory subunits). Our study also highlights the importance of molecular dating estimates of protein families for enhancing our understanding of evolution in deep time.

Methods

Sequence retrieval

The dataset consisted of 195 protein sequences obtained from the National Centre for Biotechnological Information (NCBI) database (Supplementary Table S1; Supplementary Figure S2). Considering that the accuracy of a phylogeny protocol is strongly dependent on the taxonomic sampling, number of sequences and the sequence length, we bolstered our dataset by searching for as many sequences from all metazoan phyla available at NCBI. Sequences functionally annotated as “isocitrate dehydrogenase” in the protein database were retrieved with the following criteria used to filter the appropriate sequences: 1) for taxa with more than 15 sequences available in NCBI, those sequences that were denoted as “Putative”, “Hypothetical”, “Low-quality”, and “partial” were removed; 2) sequences with fewer than 350 amino acid residues were also removed. To confirm the protein domain identity of the sequences, a local search using HMMER 3.3 was performed using the Pfam database with an e-value cutoff of 1e-5⁸⁰. Sequences remaining after those validation steps were incorporated into the final dataset for analysis.

Alignments and phylogenetic reconstructions

In order to infer homology between the sequences, we aligned the dataset using MAFFT software⁸¹ with the accurate algorithm “G-INS-i”. To obtain a biologically correct alignment, we performed a visual inspection and a manual curation to remove spuriously aligned sequences, based on similarity to the protein alignment as a whole. In order to eliminate poorly aligned regions, the alignment was trimmed using the software trimAl⁸² with a 75% gap threshold. To eliminate redundancy from the dataset, sequences with 100% similarity to each other after trimming were removed. The resulted alignment was used for downstream analysis.

Phylogenetic reconstructions rely on the assumption of empirical models of substitution and are therefore dependent on the correct choice of those models. Thus, the best-fit model of protein evolution for the dataset was selected using ModelFinder, implemented in the IQ-TREE software⁸³, which uses Akaike and Bayesian Information Criteria methods. IQ-TREE was also used to perform a maximum likelihood inference⁸⁴. The branch supports and the robustness of the analyses were obtained by using an ultrafast bootstrap approximation with 1000 replicates⁸⁵. Moreover, we also performed Bayesian inference with MrBayes 3.2.1⁸⁶ using two independent runs, each with four Metropolis-coupled chains for 10⁷ generations, sampling from the posterior distribution every 500 generations. To confirm whether chains achieved stationary and to determine an appropriate burn-in, we evaluated trace plots of all MrBayes parameter outputs in Tracer v1.6⁸⁷. The first 25% of samples were discarded as burn-in and a majority rule consensus tree was generated. The software FigTree 1.4.3⁸⁸ was used to summarize and root the resulting phylogenetic trees using two sequences of isocitrate dehydrogenase from bacteria as an outgroup. Because of the evolutionary depth of our dataset, the phylogenetic reconstruction was also performed with a mixture model, which alleviates the issues related to saturation and long branch attraction by accommodating site-specific features of protein evolution^89,90. This was done in PhyloBayes⁸⁹ by using the CAT-GTR model with a gamma distribution (Γ4) of site-rate heterogeneity and multiple chains to check for convergence.

Divergence time estimates were obtained using the software BEAST 2.4.7⁹¹ with the uncorrelated lognormal relaxed clock⁹² using the LG substitution model⁹³, and a birth–death tree prior with default settings. To calibrate divergence times, only speciation nodes were considered because calibration information derived from fossil data provides information regarding the split times between biological lineages (i.e., speciation events). Therefore, divergences classified as speciation nodes that reflected robust biological clades and were free of duplication events were chosen for calibration. These were the crown nodes of Pancrustacea and Gnathostomata. For both taxa, the aforementioned requirements were met three times (i.e., speciation nodes that included only Pancrustacea or Gnathostomata NAD-IDH sequences were recovered three times in the estimated phylogeny). Because of that, six speciation nodes were calibrated with uniform distributions with lower and upper boundaries based on estimates from Benton et al.⁹⁴ and dos Reis et al.²⁶. We used time ranges of 514–531.22 Ma and 420.7 to 468.4 Ma to calibrate the tMRCA (time to the Most Recent Common Ancestor) of pancrustaceans and gnathostomes, respectively. Our calibration points were determined using coherent criteria according to Parham et al.⁹⁵. It is worth mentioning that calibrated nodes were constrained to monophyletic, while other phylogenetic relationships were estimated in BEAST. MCMC (Markov Chain Monte Carlo) models were run for 200 million generations with a sampling frequency of 10,000 and a discarded burn-in period of 10% (20 million generations). To access convergence of chains, two independent MCMC runs were performed. In both runs, effective sample size values were higher than 200 after discarding the burn-in period. We also estimated divergence times in PhyloBayes by using the UGAM relaxed clock model and the phylogeny estimated previously in PhyloBayes by running multiple chains to check for convergence. To reduce computational burden, the CAT-Poisson mixture model was used with a gamma distribution (Γ4) of site-rate heterogeneity. As in BEAST, calibrations were provided as uniform distributions. Because PhyloBayes requires a root calibration, we used a loose gamma distribution (mean = 1520, SD = 240) to calibrate the divergence between Euglenozoa and the remaining eukaryotes, which was based on estimated times retrieved from the TimeTree database⁹⁶.

Data availability

The datasets generated and analyzed during the current study are available in the FigShare repository. IQ-tree tree file available at https://doi.org/10.6084/m9.figshare.13158101, MrBayes tree file available at https://doi.org/10.6084/m9.figshare.13158110 and alignment file available at https://doi.org/10.6084/m9.figshare.13158116. PhyloBayes tree file available at https://doi.org/10.6084/m9.figshare.14633157 and PhyloBayes timescale file available at https://doi.org/10.6084/m9.figshare.14633271.

References

Nelson, D. L., Lehninger, A. L. & Cox, M. M. Lehninger Principles of Biochemistry 6th edn. (W. H. Freeman, 2013).
Google Scholar
Nunes-Nesi, A., Araújo, W. L., Obata, T. & Fernie, A. R. Regulation of the mitochondrial tricarboxylic acid cycle. Curr. Opin. Plant Biol. 16, 335–343 (2013).
Article CAS PubMed Google Scholar
Taylor, A. B., Hu, G., Hart, P. J. & McAlister-Henn, L. Allosteric motions in structures of yeast NAD + -specific isocitrate dehydrogenase. J. Biol. Chem. 283, 10872–10880 (2008).
Article CAS PubMed PubMed Central Google Scholar
Hurley, J. H. et al. Structure of a bacterial enzyme regulated by phosphorylation, isocitrate dehydrogenase. Proc. Natl. Acad. Sci. 86, 8635–8639 (1989).
Article ADS CAS PubMed PubMed Central Google Scholar
Sun, P. et al. Molecular basis for the function of the αβ heterodimer of human NAD-dependent isocitrate dehydrogenase. J. Biol. Chem. 294, 16214–16227 (2019).
Article CAS PubMed PubMed Central Google Scholar
Martínez-Rivas, J. & Vega, J. M. Purification and characterization of NAD-isocitrate dehydrogenase from Chlamydomonas reinhardtii. Plant Physiol. 118, 249–255 (1998).
Article PubMed PubMed Central Google Scholar
Mailloux, R. J. et al. The tricarboxylic acid cycle, an ancient metabolic network with a novel twist. PLoS ONE 2, 1–10 (2007).
Article CAS Google Scholar
Dean, A. M. & Golding, G. B. Protein engineering reveals ancient adaptive replacements in isocitrate dehydrogenase. Proc. Natl. Acad. Sci. 94, 3104–3109 (1997).
Article ADS CAS PubMed PubMed Central Google Scholar
Cupp, J. R. & McAlister-Henn, L. NAD (+)-dependent isocitrate dehydrogenase. Cloning, nucleotide sequence, and disruption of the IDH2 gene from Saccharomyces cerevisiae. J. Biol. Chem. 266, 22199–22205 (1991).
Article CAS PubMed Google Scholar
Ramachandran, N. & Colman, R. F. Chemical characterization of distinct subunits of pig heart DPN-specific isocitrate dehydrogenase. J. Biol. Chem. 255, 8859–8864 (1980).
Article CAS PubMed Google Scholar
Kim, Y. O. et al. Characterization of a cDNA clone for human NAD+-specific isocitrate dehydrogenase α-subunit and structural comparison with its isoenzymes from different species. Biochem. J. 308, 63–68 (1995).
Article CAS PubMed PubMed Central Google Scholar
Nichols, B. J., Perry, A. C. F., Hall, L. & Denton, R. M. Molecular cloning and deduced amino acid sequences of the α- and β- subunits of mammalian NAD+-isocitrate dehydrogenase. Biochem. J. 310, 917–922 (1995).
Article CAS PubMed PubMed Central Google Scholar
Nursall, J. R. Oxygen as a prerequisite to the origin of the Metazoa. Nature 183, 1170–1172 (1959).
Article ADS Google Scholar
Canfield, D. E., Poulton, S. W. & Narbonne, G. M. Late-Neoproterozoic deep-ocean oxygenation and the rise of animal life. Science 315, 92–95 (2007).
Article ADS CAS PubMed Google Scholar
Mills, D. B. & Canfield, D. E. Oxygen and animal evolution: did a rise of atmospheric oxygen “trigger” the origin of animals?. BioEssays 36, 1145–1155 (2014).
Article CAS PubMed Google Scholar
Mills, D. B. et al. Oxygen requirements of the earliest animals. Proc. Natl. Acad. Sci. 111, 4168–4172 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Sebé-Pedrós, A., de Mendoza, A., Lang, B. F., Degnan, B. M. & Ruiz-Trillo, I. Unexpected repertoire of metazoan transcription factors in the unicellular holozoan Capsaspora owczarzaki. Mol. Biol. Evol. 28, 1241–1254 (2011).
Article PubMed CAS Google Scholar
Whelan, N. V., Kocot, K. M., Moroz, L. L. & Halanych, K. M. Error, signal, and the placement of Ctenophora sister to all other animals. Proc. Natl. Acad. Sci. 112, 5773–5778 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Pisani, D. et al. Genomic data do not support comb jellies as the sister group to all other animals. Proc. Natl. Acad. Sci. 112, 15402–15407 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Halanych, K. M. How our view of animal phylogeny was reshaped by molecular approaches: lessons learned. Org. Divers. Evol. 16, 319–328 (2016).
Article Google Scholar
Giribet, G. New animal phylogeny: future challenges for animal phylogeny in the age of phylogenomics. Org. Divers. Evol. 16, 419–426 (2016).
Article Google Scholar
Budd, G. E. & Jensen, S. A critical reappraisal of the fossil record of the bilaterian phyla. Biol. Rev. Camb. Philos. Soc. 75, 253–295 (2000).
Article CAS PubMed Google Scholar
Budd, G. E. The earliest fossil record of the animals and its significance. Philos. Trans. R. Soc. B Biol. Sci. 363, 1425–1434 (2008).
Article Google Scholar
Hedges, S. B., Blair, J. E., Venturi, M. L. & Shoe, J. L. A molecular timescale of eukaryote evolution and the rise of complex multicellular life. BMC Evol. Biol. 4, 2 (2004).
Article PubMed PubMed Central Google Scholar
Peterson, K. J. et al. Estimating metazoan divergence times with a molecular clock. Proc. Natl. Acad. Sci. 101, 6536–6541 (2004).
Article ADS CAS PubMed PubMed Central Google Scholar
dos Reis, M. et al. Uncertainty in the timing of origin of animals and the limits of precision in molecular timescales. Curr. Biol. 25, 2939–2950 (2015).
Article PubMed PubMed Central CAS Google Scholar
Brain, C. K. et al. The first animals: ca. 760-million-year-old sponge-like fossils from Namibia. S. Afr. J. Sci 108, 1–8 (2012).
Article Google Scholar
Antcliffe, J. B., Callow, R. H. T. & Brasier, M. D. Giving the early fossil record of sponges a squeeze. Biol. Rev. 89, 972–1004 (2014).
Article PubMed Google Scholar
Cordani, U. G., Fairchild, T. R., Ganade, C. E., Babinski, M. & Leme, J. de M. Dawn of metazoans: to what extent was this influenced by the onset of “modern-type plate tectonics”? Brazilian J. Geol. 50, e20190095 (2020).
Berney, C. & Pawlowski, J. A molecular time-scale for eukaryote evolution recalibrated with the continuous microfossil record. Proc. R. Soc. B Biol. Sci. 273, 1867–1872 (2006).
Article CAS Google Scholar
Chernikova, D., Motamedi, S., Csürös, M., Koonin, E. V. & Rogozin, I. B. A late origin of the extant eukaryotic diversity: divergence time estimates using rare genomic changes. Biol. Direct 6, 26 (2011).
Article CAS PubMed PubMed Central Google Scholar
Parfrey, L. W., Lahr, D. J. G., Knoll, A. H. & Katz, L. A. Estimating the timing of early eukaryotic diversification with multigene molecular clocks. Proc. Natl. Acad. Sci. 108, 13624–13629 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Eme, L., Sharpe, S. C., Brown, M. W. & Roger, A. J. On the age of Eukaryotes: Evaluating evidence from fossils and molecular clocks. Cold Spring Harb. Perspect. Biol. 6, a016139–a016139 (2014).
Article PubMed PubMed Central Google Scholar
Porter, S. M. Insights into eukaryogenesis from the fossil record. Interface Focus 10, 20190105 (2020).
Article PubMed PubMed Central Google Scholar
Dohrmann, M. & Wörheide, G. Dating early animal evolution using phylogenomic data. Sci. Rep. 7, 3599 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Mello, B. Estimating timetrees with MEGA and the TimeTree resource. Mol. Biol. Evol. 35, 2334–2342 (2018).
Article CAS PubMed Google Scholar
Misof, B. et al. Phylogenomics resolves the timing and pattern of insect evolution. Science 346, 763–767 (2014).
Article ADS CAS PubMed Google Scholar
Irisarri, I. et al. Phylotranscriptomic consolidation of the jawed vertebrate timetree. Nat. Ecol. Evol. 1, 1370–1378 (2017).
Article PubMed PubMed Central Google Scholar
Delsuc, F. et al. A phylogenomic framework and timescale for comparative studies of tunicates. BMC Biol. 16, 39 (2018).
Article PubMed PubMed Central CAS Google Scholar
Wolfe, J. M. et al. A phylogenomic framework, evolutionary timeline and genomic resources for comparative studies of decapod crustaceans. Proc. R. Soc. B Biol. Sci. 286, 20190079 (2019).
Article Google Scholar
Burmester, T. Origin and evolution of arthropod hemocyanins and related proteins. J. Comp. Physiol. B Biochem. Syst. Environ. Physiol. 172, 95–107 (2002).
Article CAS Google Scholar
Zhang, T. et al. Evolution of the cholesterol biosynthesis pathway in animals. Mol. Biol. Evol. 36, 2548–2556 (2019).
Article CAS Google Scholar
Wilke, T., Schultheiß, R. & Albrecht, C. As time goes by: a simple fool’s guide to molecular clock approaches in invertebrates. Am. Malacol. Bull. 27, 25–45 (2009).
Article Google Scholar
Schwartz, R. S. & Mueller, R. L. Branch length estimation and divergence dating: estimates of error in Bayesian and maximum likelihood frameworks. BMC Evol. Biol. 10, 5 (2010).
Article PubMed PubMed Central Google Scholar
Zheng, Y., Peng, R., Kuro-o, M. & Zeng, X. Exploring patterns and extent of bias in estimating divergence time from mitochondrial DNA sequence data in a particular lineage: a case study of salamanders (Order Caudata). Mol. Biol. Evol. 28, 2521–2535 (2011).
Article CAS PubMed Google Scholar
Magallón, S., Hilu, K. W. & Quandt, D. Land plant evolutionary timeline: Gene effects are secondary to fossil constraints in relaxed clock estimation of age and substitution rates. Am. J. Bot. 100, 556–573 (2013).
Article PubMed CAS Google Scholar
Decker, H. & van Holde, K. E. Oxygen and the Evolution of Life (Springer, 2011).
Book Google Scholar
DeSalle, R. Can single protein and protein family phylogenies be resolved better?. J. Phylogenet. Evol. Biol. 03, e116 (2015).
Article Google Scholar
Adl, S. M. et al. Revisions to the classification, nomenclature, and diversity of eukaryotes. J. Eukaryot. Microbiol. 66, 4–119 (2019).
Article PubMed PubMed Central Google Scholar
Burki, F., Roger, A. J., Brown, M. W. & Simpson, A. G. B. The new tree of eukaryotes. Trends Ecol. Evol. 35, 43–55 (2020).
Article PubMed Google Scholar
Wang, P., Lv, C. & Zhu, G. Novel type II and monomeric NAD+ specific isocitrate dehydrogenases: phylogenetic affinity, enzymatic characterization and evolutionary implication. Sci. Rep. 5, 9150 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Qi, F., Chen, X. & Beard, D. A. Detailed kinetics and regulation of mammalian NAD-linked isocitrate dehydrogenase. Biochim. Biophys. Acta Proteins Proteomics 1784, 1641–1651 (2008).
Article CAS Google Scholar
Nichols, B. J., Hall, L., Perry, A. C. F. & Denton, R. M. Molecular cloning and deduced amino acid sequences of the γ-subunits of rat and monkey NAD+-isocitrate dehydrogenases. Biochem. J. 295, 347–350 (1993).
Article CAS PubMed PubMed Central Google Scholar
Wood, R., Donoghue, P. C. J., Lenton, T. M., Liu, A. G. & Poulton, S. W. The origin and rise of complex life: progress requires interdisciplinary integration and hypothesis testing. Interface Focus 10, 20200024 (2020).
Article PubMed Central Google Scholar
Brocks, J. J. The transition from a cyanobacterial to algal world and the emergence of animals. Emerg. Top. Life Sci. 2, 181–190 (2018).
Article CAS PubMed Google Scholar
Jarrett, A. J. M. et al. Microbial assemblage and palaeoenvironmental reconstruction of the 1.38 Ga Velkerri Formation, McArthur Basin, northern Australia. Geobiology 17, 360–380 (2019).
Article CAS PubMed PubMed Central Google Scholar
Nguyen, K. et al. Absence of biomarker evidence for early eukaryotic life from the Mesoproterozoic Roper Group: Searching across a marine redox gradient in mid-Proterozoic habitability. Geobiology 17, 247–260 (2019).
Article CAS PubMed Google Scholar
Betts, H. C. et al. Integrated genomic and fossil evidence illuminates life’s early evolution and eukaryote origin. Nat. Ecol. Evol. 2, 1556–1562 (2018).
Article PubMed PubMed Central Google Scholar
Javaux, E. J., Knoll, A. H. & Walter, M. Recognizing and interpreting the fossils of early eukaryotes. Orig. life Evol. Biosph. 33, 75–94 (2003).
Article ADS CAS PubMed Google Scholar
Knoll, A. H. Paleobiological perspectives on early eukaryotic evolution. Cold Spring Harb. Perspect. Biol. 6, a016121–a016121 (2014).
Article PubMed PubMed Central CAS Google Scholar
Miao, L., Moczydłowska, M., Zhu, S. & Zhu, M. New record of organic-walled, morphologically distinct microfossils from the late Paleoproterozoic Changcheng Group in the Yanshan Range, North China. Precambrian Res. 321, 172–198 (2019).
Article ADS CAS Google Scholar
Caetano-Anollés, G., Mittenthal, J. E., Caetano-Anollés, D. & Kim, K. M. A calibrated chronology of biochemistry reveals a stem line of descent responsible for planetary biodiversity. Front. Genet. 5, 306 (2014).
PubMed PubMed Central Google Scholar
Wang, M. et al. A universal molecular clock of protein folds and its power in tracing the early history of aerobic metabolism and planet oxygenation. Mol. Biol. Evol. 28, 567–582 (2011).
Article CAS PubMed Google Scholar
Caetano-Anollés, G. RubisCO and the search for biomolecular culprits of planetary change. Bioessays 39, 1–3 (2017).
Article Google Scholar
Baludikay, B. K., Storme, J.-Y., François, C., Baudet, D. & Javaux, E. J. A diverse and exquisitely preserved organic-walled microfossil assemblage from the Meso-Neoproterozoic Mbuji-Mayi Supergroup (Democratic Republic of Congo) and implications for Proterozoic biostratigraphy. Precambrian Res. 281, 166–184 (2016).
Article ADS CAS Google Scholar
Agić, H., Moczydłowska, M. & Yin, L. Diversity of organic-walled microfossils from the early Mesoproterozoic Ruyang Group, North China Craton—a window into the early eukaryote evolution. Precambrian Res. 297, 101–130 (2017).
Article ADS CAS Google Scholar
Beghin, J. et al. Microfossils from the late mesoproterozoic—early neoproterozoic Atar/El Mreïti Group, Taoudeni Basin, Mauritania, northwestern Africa. Precambrian Res. 291, 63–82 (2017).
Article ADS CAS Google Scholar
Loron, C. C., Rainbird, R. H., Turner, E. C., Greenman, J. W. & Javaux, E. J. Organic-walled microfossils from the late Mesoproterozoic to early neoproterozoic lower Shaler Supergroup (Arctic Canada): diversity and biostratigraphic significance. Precambrian Res. 321, 349–374 (2019).
Article ADS CAS Google Scholar
Kang, S. et al. Between a pod and a hard test: the deep evolution of Amoebae. Mol. Biol. Evol. 34, 2258–2270 (2017).
Article CAS PubMed PubMed Central Google Scholar
Lahr, D. J. G. et al. Phylogenomics and morphological reconstruction of Arcellinida testate amoebae highlight diversity of microbial eukaryotes in the Neoproterozoic. Curr. Biol. 29, 991–1001 (2019).
Article CAS PubMed Google Scholar
Mills, D. B. et al. The last common ancestor of animals lacked the HIF pathway and respired in low-oxygen environments. Elife 7, e31176 (2018).
Article PubMed PubMed Central Google Scholar
Sperling, E. A. & Stockey, R. G. The temporal and environmental context of early animal evolution: considering all the ingredients of an “explosion”. Integr. Comp. Biol. 58, 605–622 (2018).
Article CAS PubMed Google Scholar
Cartwright, P. & Collins, A. Fossils and phylogenies: integrating multiple lines of evidence to investigate the origin of early major metazoan lineages. Integr. Comp. Biol. 47, 744–751 (2007).
Article PubMed Google Scholar
Hammarlund, E. U. Harnessing hypoxia as an evolutionary driver of complex multicellularity. Interface Focus 10, 20190101 (2020).
Article PubMed PubMed Central Google Scholar
Crowe, S. A. et al. Atmospheric oxygenation three billion years ago. Nature 501, 535–538 (2013).
Article ADS CAS PubMed Google Scholar
Planavsky, N. J. et al. No evidence for high atmospheric oxygen levels 1,400 million years ago. Proc. Natl. Acad. Sci. 113, E2550–E2551 (2016).
Article CAS PubMed PubMed Central Google Scholar
Zhang, S. et al. Reply to Planavsky et al.: Strong evidence for high atmospheric oxygen levels 1400 million years ago. Proc. Natl. Acad. Sci. 113, 2552–2553 (2016).
ADS CAS Google Scholar
Diamond, C. W., Planavsky, N. J., Wang, C. & Lyons, T. W. What the ~14 Ga Xiamaling formation can and cannot tell us about the mid-Proterozoic ocean. Geobiology 16, 219–236 (2018).
Article CAS PubMed Google Scholar
Zhang, S. et al. Paleoenvironmental proxies and what the Xiamaling Formation tells us about the mid-Proterozoic ocean. Geobiology 17, 225–246 (2019).
Article CAS PubMed Google Scholar
Potter, S. C. et al. HMMER web server: 2018 update. Nucleic Acids Res. 46, W200–W204 (2018).
Article CAS PubMed PubMed Central Google Scholar
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Article CAS PubMed PubMed Central Google Scholar
Capella-Gutierrez, S., Silla-Martinez, J. M. & Gabaldon, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009).
Article CAS PubMed PubMed Central Google Scholar
Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K. F., von Haeseler, A. & Jermiin, L. S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat. Methods 14, 587–589 (2017).
Article CAS PubMed PubMed Central Google Scholar
Nguyen, L.-T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Article CAS PubMed Google Scholar
Hoang, D. T., Chernomor, O., von Haeseler, A., Minh, B. Q. & Vinh, L. S. UFBoot2: Improving the ultrafast bootstrap approximation. Mol. Biol. Evol. 35, 518–522 (2018).
Article CAS PubMed Google Scholar
Ronquist, F. & Huelsenbeck, J. P. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19, 1572–1574 (2003).
Article CAS PubMed Google Scholar
Rambaut, A., Suchard, M. A., Xie, D. & Drummond, A. J. Tracer v1.6. http://beast.community/tracer (2014).
Rambaut, A. FigTree. Tree figure drawing tool. http://tree.bio.ed.ac.uk/software/figtree/ (2007).
Lartillot, N. & Philippe, H. A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process. Mol. Biol. Evol. 21, 1095–1109 (2004).
Article CAS PubMed Google Scholar
Lartillot, N., Brinkmann, H. & Philippe, H. Suppression of long-branch attraction artefacts in the animal phylogeny using a site-heterogeneous model. BMC Evol. Biol. 7, S4 (2007).
Article PubMed PubMed Central CAS Google Scholar
Bouckaert, R. et al. BEAST 2: A Software platform for Bayesian evolutionary analysis. PLoS Comput. Biol. 10, e1003537 (2014).
Article PubMed PubMed Central CAS Google Scholar
Drummond, A. J., Ho, S. Y. W., Phillips, M. J. & Rambaut, A. Relaxed phylogenetics and dating with confidence. PLoS Biol. 4, 88 (2006).
Article CAS Google Scholar
Whelan, S. & Goldman, N. A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol. Biol. Evol. 18, 691–699 (2001).
Article CAS PubMed Google Scholar
Benton, M. J. et al. Constraints on the timescale of animal evolutionary history. Palaeontol. Electron. 18, 1–106 (2015).
Google Scholar
Parham, J. F. et al. Best practices for justifying fossil calibrations. Syst. Biol. 61, 346–359 (2012).
Article PubMed Google Scholar
Kumar, S. et al. TimeTree: a resource for timelines, timetrees, and divergence times. Mol. Biol. Evol. 34(7), 1812–1819 (2017).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The study was supported by FAPESP, by thematic project (Proc. 2016/06114-6), coordinated by R.I.T. A fellowship to E.M.C-P. was provided by FAPESP (Fundação de Amparo à Pesquisa do Estado de São Paulo, Brazil—2018/20268-1). F.A.B. was supported by CNPq and FAPESP (Proc. 2019/18051-7). C.J.C.’s contributions are facilitated by start-up funds from the College of Science, Swansea University.

Author information

Authors and Affiliations

Institute of Biosciences, Department of Zoology, University of Sao Paulo, São Paulo, Brazil
Bruno Santos Bezerra, Flavia Ariany Belato, Federico Brown & Elisa Maria Costa-Paiva
Genetics Department, Biology Institute, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
Beatriz Mello
Department of Biosciences, Faculty of Science and Engineering, Swansea University, Swansea, UK
Christopher J. Coates
Geoscience Institute, University of Sao Paulo, São Paulo, Brazil
Juliana de Moraes Leme
Institute of Astronomy, Geophysics and Atmospheric Sciences, University of Sao Paulo, São Paulo, Brazil
Ricardo I. F. Trindade & Elisa Maria Costa-Paiva

Authors

Bruno Santos Bezerra
View author publications
You can also search for this author in PubMed Google Scholar
Flavia Ariany Belato
View author publications
You can also search for this author in PubMed Google Scholar
Beatriz Mello
View author publications
You can also search for this author in PubMed Google Scholar
Federico Brown
View author publications
You can also search for this author in PubMed Google Scholar
Christopher J. Coates
View author publications
You can also search for this author in PubMed Google Scholar
Juliana de Moraes Leme
View author publications
You can also search for this author in PubMed Google Scholar
Ricardo I. F. Trindade
View author publications
You can also search for this author in PubMed Google Scholar
Elisa Maria Costa-Paiva
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

B.S.B., R.I.F.T., and E.M.C.-P. conceived this study. B.S.B., F.A.B., B.M., and E.M.C.-P. performed the in silico analysis and interpreted the data. F.B., C.J.C., J.M.L., and R.I.F.T. participated in additional discussion. The manuscript was written by B.S.B. and E.M.C.-P. and critically reviewed by F.A.B., B.M., F.B., C.J.C., J.L.M., and R.I.F.T. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Elisa Maria Costa-Paiva.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Informations

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bezerra, B.S., Belato, F.A., Mello, B. et al. Evolution of a key enzyme of aerobic metabolism reveals Proterozoic functional subunit duplication events and an ancient origin of animals. Sci Rep 11, 15744 (2021). https://doi.org/10.1038/s41598-021-95094-4

Download citation

Received: 30 October 2020
Accepted: 16 July 2021
Published: 03 August 2021
DOI: https://doi.org/10.1038/s41598-021-95094-4

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.