Abstract
A longstanding question in evolutionary biology is how natural selection and environmental pressures shape the mitochondrial genomic architectures of organisms. Mitochondria play a pivotal role in cellular respiration and aerobic metabolism, making their genomes functionally highly constrained. Evaluating selective pressures on mitochondrial genes can provide functional and ecological insights into the evolution of organisms. Collembola (springtails) are an ancient hexapod group that includes the oldest terrestrial arthropods in the fossil record, and that are closely associated with soil environments. Of interest is the diversity of habitat stratification preferences (life forms) exhibited by different species within the group. To understand whether signals of positive selection are linked to the evolution of life forms, we analysed 32 published Collembola mitogenomes in a phylomitogenomic framework. We found no evidence that signatures of selection are correlated with the evolution of novel life forms, but rather that mutations have accumulated as a function of time. Our results highlight the importance of nuclear-mitochondrial interactions in the evolution of collembolan life forms and that mitochondrial genomic data should be interpreted with caution, as complex selection signals may complicate evolutionary inferences.
Similar content being viewed by others
Introduction
Understanding the interplay between genomes, morphology, functional ecology, and biogeography in driving organismal evolution has become central to answering fundamental questions in biology, both from historical (e.g. past speciation) and contemporary (e.g. evolutionary responses to climate change) viewpoints. Making sense of the immense complexity of life has proven a recalcitrant challenge to biologists. Additional lines of evidence pertaining to taxonomy (the fossil record in conjunction with morphological, behavioural, physiological, and molecular data) enhance our ability to piece together the diversification of life on Earth1,2,3,4. A central question in molecular evolution revolves around how natural selection influences the genomic architectures of organisms5,6,7,8,9,10. Environmental pressures drive functional and morphological changes, which are underpinned by genetic variation and, therefore, will manifest as signatures of selection on the genome that can be used to retrace the evolutionary history of organisms and better predict future adaptive evolutionary trajectories.
The mitochondrial genome comprises 13 protein-coding genes (PCGs), which, together with many nuclear genes, encode the proteins that make up the subunits of the four complexes involved in the oxidative phosphorylation chain11,12. Due to the crucial role of PCGs in the respiratory chain, these genes are functionally highly constrained5. An evaluation of signals of past selection on these genes, therefore, provides insights into the adaptive evolution and metabolic requirements of organisms, given that they are responsible for 95% of the energy metabolism and heat production in cells11,13,14. Mitochondrial genes are some of the most commonly used genetic markers, partly because they provide unique perspectives on population genetic variation and structure due to maternal inheritance and a general lack of recombination15,16,17, and are frequently used to answer questions related to historical processes and geographical structure13,14,18,19. However, these inferences often assume that the evolution of this genome is neutral20,21, an idea that has been challenged22,23,24,25,26. It is now widely accepted that the mitogenome is under selection due to its pivotal role in respiration and energy production, and the evolutionary constraints resulting from the co-evolution of nuclear and mitochondrial genes that are involved in these processes11,12,15,16,17,25. Additionally, positive selective sweeps, linked to thermal and aerobic respiratory adaptations, are well documented for mitochondrial genes across various taxa11,13,14,27,28,29,30,31,32. However, the investigation of positive selection on moderately conservative genomic regions, without an assessment of the functional consequence of amino acid mutations, is not adequate to consider the mitochondrial genome in its entirety to be under positive selection linked to adaptive processes13. Therefore, we identify both signals of positive selection and the functional and adaptive significance of amino acid mutations on the mitochondrial genomes of ancient soil-dwelling invertebrate taxa.
Given the influence of heterogeneous soil environments on soil biota community assemblages33,34,35, and that positive selection on the mitogenome is linked to thermal and aerobic respiratory adaptations, we sought to explore the effects of signatures of past selection on the mitogenome of one of the most ancient soil-dwelling invertebrate taxonomic groups, Collembola. Commonly referred to as springtails, this group of primitive, wingless, and largely terrestrial hexapods is regarded as among the first aquatic arthropods to have colonised land36, with evolutionary roots that predate the diversification of insects37,38,39,40. Indeed, the earliest known hexapod fossil (from the early Devonian period, ca. 400 Ma) is a collembolan41,42. This group's biology and ancient evolutionary history provide unique opportunities to understand the environmental and physiological mechanisms governing the evolution of early terrestrial life. Collembolans currently exploit a myriad of niches across all continents, including Antarctica40,43,44. They typically reside within, or are closely associated with soils, and are considered essential components of soil ecosystems due to their influence on soil microbial communities, their role in nutrient recycling, and their importance as prey to a wide range of predatory arthropods45,46,47,48. Of particular interest is the preference of various springtails for different habitats. Springtail species broadly fall into six different ecological life form categories, each displaying a unique stratification preference. These are atmobiotic (large species that inhabit macrophytes (grasses, bushes, tree trunks and branches)); epiedaphic (species found in vegetated habitats or the upper litter layer above the soil); hemiedaphic (which includes species that occupy decomposed litter or rotten wood on the soil surface); euedaphic (‘true’ soil living species that occupy the upper mineral-rich layers of the soil); myrmecophilous (describes species found in ant nests), and hydrophilous (hereafter referred to as aquatic for simplicity; species that live on the surfaces of, or are closely associated with, freshwater bodies)34,43,47,49,50.
Strong correlations exist between the life forms of springtails and their morphologies, suggesting that life forms are subject to selective forces exerted by their environment43,45,46,47,51,52,53. Additionally, springtails also exhibit highly specific habitat preferences for different soil types, soil chemistries, habitat types, and stratification within a given habitat47,49,52. It is remarkable that these morphological designs have probably persisted unchanged for millions of years in many instances. Indeed, springtail fossils dating back as far as the Eocene (ca 40–50 Ma) can be assigned to extant genera and, in some cases, even extant species54. This extreme morphological conservatism suggests that current morphospecies inventories underestimate true collembolan diversity. Not surprisingly, genetic evidence has shown that cryptic diversity is pervasive in this group, with current collembolan species richness thought to underestimate the actual number of species by at least an order of magnitude40,55,56. Therefore, the low number of known springtail species is not an inherent characteristic of the group, but rather the result of under sampling, outdated taxonomy, and a lack of taxonomic expertise56. While Collembola may be dwarfed by other hexapod groups, particularly insects, it still harbours immense diversity40.
Given that Collembola constitutes ecologically important taxa within soil environments48, which exhibit distinct microhabitat stratification that is driven by feeding strategies, food availability, and soil type and chemistry47,49,50,52,57, we sought to identify mitochondrial genes that are under positive selection for adaptations to these microhabitats. Additionally, given that metabolic activity and oxygen consumption differ depending on ecological life form58,59, we investigated whether signals of selection on the mitochondrial genome are associated with the evolution of life form traits (habitat preferences), in the context of environmental change (oxygen levels/metabolic requirements and vegetation alterations) that emerged during the group's evolutionary history. If this is the case, we expect substantial signals of selection to have accumulated along phylogenetic branches representing shifts in life forms compared to sister branches where the original life form was maintained.
Results
Dated phylogeny and ancestral state reconstruction
Our dated mitogenomic phylogeny demonstrates that Collembola first diverged sometime in the early Devonian (ca. 410 Ma), with most diversification taking place in the Triassic to Cretaceous periods (ca. 250–60 Ma) (Fig. S1).
To identify whether a link exists between signals of mitochondrial DNA selection and collembolan life form (see Dataset S1), we assessed phylogenetic relationships between 32 collembolan and three outgroup taxa (see Table S1). We assigned a life form trait (i.e. aquatic, epiedaphic, euedaphic, hemiedaphic, and myrmecophilous) to the ecology (i.e. habitat preference) of each springtail species included in this study (see Dataset S1 for more detail). We reconstructed the ancestral ecological life form state for the collembolan clade using three different methods (parsimony, maximum likelihood, and Bayesian inference), all of which identified hemiedaphic as the ancestral character (see Fig. 1). Across the collembolan phylogeny, five independent life form shifts occurred from the hemiedaphic to alternative life forms (see Fig. 1). For the terminal nodes, the most common ecological life form was hemiedaphic (22 taxa), followed by epiedaphic (five taxa), euedaphic (three taxa), myrmecophilous (one taxon), and aquatic (one taxon). To assess phylogenetic conservatism and homoplasy across the phylogeny, we calculated the retention index (RI)60 for each life form trait, which was 0.75 for epiedaphic, 0.36 for hemiedaphic, 0.33 for euedaphic, and 0 for both myrmecophilous and aquatic.
Selection analysis
To assess signals and direction of selection at different loci (i.e. positive vs purifying selection), we used the codon-based maximum likelihood (CodeML) algorithm to determine the ratio of nonsynonymous to synonymous mutations (ω = dN/dS), and the number of nonsynonymous mutations as a proportion of the total number of mutations (dN/(dN + dS))61,62,63,64. Site model selection analyses indicated overall positive selection across the phylogeny, shown by significant likelihood ratio tests (LRT) for the comparative site models, however, with no nucleotide positions under positive selection (Bayes Empirical Bayes score (BEB) < 0.9562) (Table S2). The comparison between the numbers of synonymous to nonsynonymous mutations across the terminal nodes of the phylogeny was significantly different (paired t test: t415 = − 9.48, p < 0.05). The number of nonsynonymous mutations as a proportion of the total number of mutations revealed no strong evidence for a link between life form transitions and the number of nonsynonymous mutations. Departures from the expectation that synonymous mutations are more common than nonsynonymous mutations65 were found in several taxa (e.g. Bilobella aurantiaca, Neelus murinus, Oncopodura yosiiana, Podura aquatica, and Tullbergia bisetosa) (see black and white pie charts in Fig. 1 and Table S3 for proportion values > 0.50). However, only the life histories of Podura aquatica and Tullbergia bisetosa deviated from the hemiedaphic ancestral state. The large proportions of nonsynonymous mutations are likely due to the older ages of these branches and illustrate that the proportion of nonsynonymous mutations increases as a function of time for the taxa with the ancestral state compared to those that represent life form shifts (i.e. the alternative state) (ancestral state: R2 = 0.52; alternative state: R2 = 0.82; also see Fig. 2). The 95% confidence intervals of these regressions broadly overlap, and the two models were not significantly different (p > 0.05). This indicates that no significant difference was found in the proportion of nonsynonymous mutations in relation to branch length for taxa that retained the ancestral state and for those whose ecological life form shifted.
The relationships (represented as coloured pie charts in Fig. 1) between the taxa that represent the life form transitions to alternative life forms and the respective sister taxon/taxa (ancestral state, i.e. hemiedaphic), again revealed that the longer branches have the greatest number of nonsynonymous mutations in comparison to the sister clades (see coloured pie charts A and D). No significant difference was found within pie charts B, C, and E, which show that the proportion of mutations for the branches that represent the alternative life form vs. ancestral state (51% for pie charts B and C, and 50% for pie chart E; paired t test: t4 = 0.23, p > 0.05) (Fig. 1).
Evidence for purifying (ω < 1) or diversifying selection (ω > 1) was determined for each of the terminal branches, and a total of 19 taxa had mitogenomes with signatures of purifying selection, while the mitogenomes of 11 taxa showed signs of diversifying selection, and two taxa had mitogenomes that satisfied the assumptions of neutrality (see Table S3). The proportion of changes per gene and per taxon reveal that atp8 exhibits the highest number of changes for the majority of the taxa (Figs. S2, S3; Table S4). Following atp8, all genes from complex I (i.e. nad1–nad6) had a large proportion of nonsynonymous mutations. Of the 13 PCGs, eight genes were identified as experiencing purifying selection, while five genes were identified as experiencing diversifying selection, with atp8 being under particularly strong diversifying selection (ω > 3.0) (Table S4). At a complex level, complex V (ATPase) revealed the highest percentage of nonsynonymous mutations, followed by complex I (NDs), complex IV (COXs), and complex III (cytb) (Fig. S3), which follows the estimated mutation rate of these complexes (i.e. ATPase > ND > COX > CYTB)66. The average value of ω for all genes across all terminal nodes was 0.96, which indicates that, overall, Collembola mitogenomes are experiencing slight purifying selection or no selection (i.e. selective neutrality).
Radical changes in physicochemical properties of amino acids
The impact of positive selection on the physical and chemical properties of 31 amino acids was evaluated for each of the 13 PCGs by detecting observed changes inferred by the phylogenetic tree under neutral conditions. Both positive and negative significant z-scores on the physicochemical properties of amino acids were found across the tree. Most physicochemical properties indicated that the radical changes were under purifying selection, with occasional radical changes under positive selection (Fig. S4). We detected two physicochemical properties—equilibrium constant (ionisation of COOH) and solvent accessible reduction ratio—with significant radical changes (z > 3.09; p < 0.001), indicating positive/diversifying selection for all 13 PCGs, while a third physicochemical property, buriedness, is under positive/diversifying selection for 10 PCGs (except cytb, nad3, and nad4l), with atp8 having the highest z-score.
Discussion
Using a comprehensive mitogenome dataset for springtails (32 species from all four orders), we determined the ancestral ecological life form state for Collembola. We then used this dataset to assess whether a correlation exists between signatures of mitogenome selection and the evolution of novel life forms. We show that the ancestral ecological life form state of Collembola is hemiedaphic, with epiedaphic life forms first emerging in the early Permian, roughly 290 Ma. Even though several previous studies have identified links between Collembolan mitogenomes and ecology, morphology, and physiology50,53,67,68,69,70,71, we found no strong evidence for a link between signatures of mitochondrial genome selection and life form evolution in Collembola.
Our mitogenomic phylogeny is broadly consistent with those presented in earlier studies37,38,39,72,73,74,75. The first phylogenetic split within Collembola occurred during the early Devonian (ca. 410 Ma), with most diversification taking place in the Triassic to Cretaceous periods (ca. 250–60 Ma). It is not surprising, therefore, that Collembola fossils preserved in amber and dating back to the Eocene (50-40 Ma) can be assigned to extant taxa with high certainty54, demonstrating the ultraconservative nature of life forms and basic body plans of the group.
Ancestral state reconstructions showed the ancestral ecological life form of collembolans to be hemiedaphic. Our finding contrasts with the assumption of a euedaphic origin of springtails proposed by D’Haese36,76. Additionally, we found that the epiedaphic life form evolved at least twice, first during the origin of Symphypleona (Fig. S1), approximately 290 Ma, and then again in the mid-Cretaceous with the emergence of the evolutionary lineage represented by Salina celebensis, roughly 115 Ma. During the Permian period, roughly 300 Ma, seed-producing gymnosperms such as gnetophytes, Pinaceae, and cupressophytes diverged from cycads and conifers77. Symphypleona show a habitat preference for Mediterranean-type shrublands, pine plantations, mixed forests, bushes, grasslands, and leaf litter (see Supplementary Information Dataset S1). This suggests that the emergence of major plant lineages, such as gymnosperms, created new ecological niches for the epiedaphic Symphypleona to exploit and diversify. The extant Bourletiella arvalis and Sminthurus viridis, however, show a habitat preference for grasslands, although these habitats would have only become available much later in the Cenozoic78,79,80. It is, therefore, reasonable to assume that the evolutionary lineage represented by these species would have occupied gymnosperm ecological analogs until their current niches became available. Unsurprisingly, the morphological features of these taxa are characteristic of the epiedaphic life form47,49,81. Additionally, our analyses illustrate that the euedaphic life form evolved at least twice, first when Tullbergia bisetosa evolved, roughly 287 Ma, and secondly during the origin of Onychiurus orientalis and Orthonychiurus folsomi (approximately 192 Ma). A shift from hemiedaphic to the myrmecophilous and aquatic life forms was also identified in the phylogeny. The origin of Cyphoderus albinus and the myrmecophilous life form occurred approximately 170 Ma, coinciding with the evolution of ants (Formicidae) around 168 Ma82,83. The aquatic life form first emerged about 225 Ma with the divergence of the lineage represented by Podura aquatica. The hypothesis of a semi-aquatic or aquatic life form as the ancestral state of Collembola was formulated and tested by D’Haese36,76, who found that the evolution of terrestrial to aquatic habitats, in a physiological context, is indeed more plausible and that an aquatic life form can be viewed as a “rare secondary acquisition”36,43,76.
We deem the five life forms we have identified to be somewhat homoplastic (see Yu et al.53) best illustrated by the epiedaphic and euedaphic forms that independently originated twice within the group. Additionally, the myrmecophilous and aquatic forms are homoplastic or autapomorphic, as these traits are unique to single taxa. There is also a great deal of ecological conservatism for the hemiedaphic trait where most extant taxa have retained the ancestral state.
While there appeared to be a general trend for taxa on longer phylogenetic branches to have a greater overall proportion of nonsynonymous mutations, we found no consistent relationship between life form shifts and the proportion of nonsynonymous mutations on these branches. Furthermore, even when comparing individual genes, no correlations were found between the proportion of nonsynonymous changes and the evolution of new life forms (Fig. S2). This may seem surprising given the pivotal role that mitochondria play in aerobic metabolism, and given that different life forms are constrained in different ways metabolically. It should be noted, however, that for a group as ancient as Collembola, the many adaptive nuclear changes that produced the conspicuous ecological changes upon which life forms are classified may have obscured mitochondrial signatures of selection related to the evolution of life forms. While our expectations pertaining to selection were premised on non-overlapping habitat stratification preferences, it should also be noted that interactions between collembolans and their environments are complex, and apparently divergent life forms may impose similar functional constraints on the organism, thereby confounding signatures of selection84.
Even though mitochondrial protein coding genes are typically believed to have faster mutation rates than nuclear genes16, most of these mutations are probably synonymous85,86, and thus functionally inconsequential (but see Lawrie et al.87). Our results support this idea, with more taxa and more genes under purifying selection than under diversifying selection (Tables S3 and S4). However, selection on the mitochondrial genome appears to be largely neutral due to the random fixation of selectively neutral mutations21. Given the crucial role of mitochondria in aerobic metabolism, functional requirements likely constrain the molecular evolution of mitogenomes, and this is perhaps why many studies have identified purifying selection88,89,90. It makes sense that strong functional constraints would mean that any deleterious mutation that arises would be rapidly removed from the population via purifying selection. Additionally, genetic drift plays a major role in structuring genetic diversity, and its effects on mitochondrial genomes are stronger than on nuclear genomes due to differences in effective population sizes between them and the uniparental inheritance and lack of recombination within the mitochondrial genome16,17,21.
When radical amino acid changes are favoured by selection, local directional shifts in the structure and function of proteins evolve. The physicochemical properties under positive selection for all PCGs were equilibrium constant (ionisation of COOH) and solvent accessible reduction ratio, while buriedness was under positive selection for 10 of the 13 PCGs. Positive selection for equilibrium constant is suggestive of adaptations to increased oxygen levels and metabolic requirements66, while positive selection for solvent accessible reduction ratio may lead to bulkier proteins, which increases the space for active site formation66. Buriedness increases protein stability, and thus optimal protein function, in the context of hydrophobicity, whereby some residues are buried on the interior of the globular protein91,92. These results illustrate the importance of these properties in the context of environmental conditions (oxygen levels and metabolic requirements).
The mitochondrial genome has long been considered to be selectively neutral22,23,24,25, but a plethora of studies have illustrated that it may be under strong positive selection as a result of thermal and aerobic respiratory adaptations11,13,14,27,28,29,30,31,32. Here, we investigated whether a link exists between mitochondrial genome selection and the evolution of ecological life forms in Collembola. The expected outcome of such a link would be evidence for strong signals of selection at branches representing the evolution of new life forms, even if mitochondrial genes themselves are not the primary drivers of ecophysiological adaptation. Our selection results reveal that, overall, positive selection is present across the collembolan phylogeny, best illustrated by the presence of positive selection on three amino acid physicochemical properties which likely play instrumental roles in adaptive responses to environmental conditions (oxygen levels and metabolic requirements). However, the only major trend identified here was that the proportion of nonsynonymous mutations increased with time, with longer branches (i.e. older taxa) having a greater proportion of nonsynonymous mutations than shorter branches. In addition to investigating selection associated with life form evolution, we assessed whether there was a link between signatures of selection across a latitudinal gradient from where each species can be found. This was done in two ways; first by selecting species across all latitudes but where their ranges overlapped, and secondly by selecting species that had no overlapping distributions. As was the case for life form shifts, we found no clear correlation between the number of nonsynonymous mutations and latitudinal gradients for either analysis.
The lack of a correlation between life form shifts and mitogenome selection may hint at nuclear selection, and suggests that nuclear-mitochondrial interactions are of greater importance in the evolution of life form shifts than selection pressures on the mitochondrial genome alone. Our findings raise some precautionary flags in that selection, where present, is more complex than generally assumed, given the complex interactions of the mitochondrial and nuclear genomes. Importantly, evidence for strong selection on the mitochondrial genome cannot simply be extrapolated to reflect adaptation to changing environmental conditions.
Methods
Phylomitogenomic analysis
Phylomitogenomic relationships between Collembola taxa based on available data on GenBank (May 2020) were reconstructed using the protein-coding genes (PCGs) of 32 partial or complete Collembola mitogenomes, as well as three outgroup taxa (see Supplementary Information Table S1). Each PCG was aligned using the MUSCLE algorithm in MEGA X93, followed by concatenating the alignments into a single dataset. Substitution saturation was assessed on each codon position using DAMBE v7.3.594, which revealed high substitution saturation, likely due to highly divergent taxa95,96,97. As a result, we took a conservative approach and used the amino acid sequences of each PCG for our phylogenetic reconstructions. Each PCG was independently translated into amino acid sequences using the invertebrate mitochondrial genetic code, and were then aligned in MEGA X using the MUSCLE algorithm, followed by manual inspection. Bayesian phylogenetic inferences were executed in BEAST v2.6.398 by linking site and clock models as well as trees. This was done because of the largely uniparental inheritance of mitochondrial genomes and a lack of recombination15,16,17,27. A discrete trait was added to define the ecological life form of each species (see Supplementary Information Dataset S1 for more detail). Given the role of mitochondria in different environments, and rather than linking life forms to morphological characteristics, we linked the life forms (i.e. aquatic, myrmecophilous, euedaphic, hemiedaphic, and epiedaphic) to the ecology (i.e. habitat preferences) of each springtail species included in this study however, with the exception of three taxa; Onychiurus orientalis, Orthonychiurus folsomi, and Tullbergia bisetosa, which were assigned based on morphology. These species, although ecologically hemiedaphic47,49,50 (see Dataset S1 for ecological life forms), differ from other hemiedaphic taxa by having all three major euedaphic morphological characteristics (no eyes, no pigment, and no furca), and are thus categorised as “euedaphomorphic” sensu lato99. As such, these taxa were assigned to the euedaphic life form which aligns to a peculiar ecology within litter (i.e. lower dispersal ability and associated traits) that differs from the other hemiedaphic taxa. The mitochondrial REV + Г model was selected, with an uncorrelated relaxed lognormal clock and a birth–death tree prior. Monophyly was constrained for several clades based on a consensus tree constructed using the NCBI taxonomy database in PhyloT v2 (https://phylot.biobyte.de/), and visualised in iTOL100. These were (a) all taxa (to calibrate the root of the tree, as described below), (b) the Hexapoda clade (all taxa excluding the distantly related Daphnia pulex) to calibrate the split between Collembola and other basal hexapod taxa (Japyx solifugus and Trigoniophthalmus alternatus) and Collembola as described below, and (c) a clade that excluded all non-Collembola taxa. Monophyly was not constrained for each collembolan order (Poduromorpha, Entomobryomorpha, Neelipleona, and Symphypleona) since monophyly is not always supported for these groups101,102,103.
Considering that some collembolan fossils have been assigned to contemporary species despite being more than 20 Ma old, dating recent nodes using fossil data can be challenging for these taxa. We therefore used secondary calibration points from Leo et al.104 and included calibration points that matched the two deepest fossil-calibrated nodes. To ensure that our calibration points matched the 95% highest probability density (HPD) confidence intervals of Leo et al.104 for the same nodes, we followed their protocol in setting the priors for these nodes. Accordingly, we set the prior at the root of the tree to have a normal distribution and a mean of 510 Ma with a standard deviation of 7 Ma. The second prior was placed at the split between the higher insects (Japyx solifugus and Trigoniophthalmus alternatus) and Collembola, with a normal distribution and a mean of 485 Ma with a standard deviation of 6 Ma (for the rationale behind this prior assignment, see Leo et al.104).
A Bayesian consensus phylogenetic tree was calculated using BEAST submitted through the CIPRES Science Gateway105 from three independent chains at different starting seeds for a chain length of 280 million generations, a thinning interval of 10,000, and 25% burn-in to ensure that stationarity was reached. Convergence was assessed using Tracer v1.7106, trees were summarised using TreeAnnotator v2.6.298 (20% burn-in), and the resulting topology was visualised in FigTree v1.4.3 (http://tree.bio.ed.ac.uk/software/figtree/).
Ancestral state reconstruction
A quantitative phyletic approach was used to assess the evolutionary relationships between taxa based on this biological information. Ancestral state reconstructions were performed using both parsimony and maximum likelihood methods in Mesquite v3.61107; the discrete trait of each taxon (from Supplementary Information Dataset S1) was provided as an input for the analyses. The likelihood ratio test of the maximum likelihood method was used for the Markov k-state one-parameter (Mk1) and the asymmetrical Markov k-state 2 parameter (AsymmMk) models107,108, while the rate of character evolution was estimated under the Mk1 model. The Bayesian consensus phylogenetic tree obtained from the 28,000 trees (after the burn-in) sampled from BEAST was used for the ancestral state reconstruction. The retention index (RI)60 was evaluated for each life form trait to assess phylogenetic conservatism and homoplasy across the phylogeny. High RI values represent little to no homoplasy (typically above 0.85, see Yu et al.53), while low RI values indicate homoplasy and low phylogenetic signal in a given dataset.
To account for uncertainties with phylogenetic mapping and the occurrence of topographical incongruence across sampled trees, a Bayesian MCMC method implemented in BayesTraits v3.0.2109, was used for ancestral state reconstructions. The AddTag and AddMRCA options were implemented to find the ancestral state associated with the node of collembolan clade. These options were used to overcome uncertainty when estimating ancestral states109. A reversible-jump (RJ) hyper-prior was implemented to reduce uncertainty and arbitrariness when selecting priors109. The hyper-prior draws up values from a uniform distribution between 0 and 30 which is used to seed the exponential distribution for the MCMC analyses. Three independent MCMC chains were run for 30 million iterations with a 25% burn-in. These analyses were repeated with a gamma reversible-jump hyper-prior (mean: 0, 30; variance 0, 30), but no appreciable difference was identified between the posterior distributions between these analyses when visualised in Tracer. MCMC convergence was assessed in Tracer by ensuring that the effective sample size (ESS) ≥ 200 for all estimated parameters. The proportion of each state was calculated from the mean values from approximately 20,000 trees randomly chosen by BayesTraits from the 28,000 trees sampled from BEAST. The states that were reconstructed to be equivocal (probability of 0.2 for each state) were discarded and the probability that the collembolan clade was in a given state was determined. The average of the estimated probability for the three independent runs was calculated, and any character state with a proportion of > 0.7 was considered to be supported110.
Selection analysis
The ratio of nonsynonymous, (dN; i.e. amino acid-altering) to synonymous (dS; i.e. silent) substitution rates, ω = dN/dS, is used to understand the evolutionary dynamics of genes61,62,63,64. We computed ω using the codon-based maximum likelihood (CodeML) algorithm in EasyCodeML v1.2162,111. To evaluate the influence of selection on the PCG sequence alignments, we implemented the preset (nested models) running mode with the site model within the framework of the topology of the phylogeny retrieved by the phylogenetic analyses above112. The codon substitution models described by Geo et al.62, and the significance of their respective dN/dS ratios, were evaluated using the likelihood ratio test (LRT). To identify whether a link between signatures of mitochondrial selection and a shift in ecological life form exists, we determined the number of nonsynonymous mutations per gene as a proportion of the total mutations (i.e. dN/(dN + dS)) for all terminal branches, and then averaged these proportions per branch. Moreover, we compared the proportion of nonsynonymous mutations for each life form shift (aquatic, myrmecophilous, euedaphic, and epiedaphic) vs. the respective sister branches/taxa that represented the ancestral state (hemiedaphic) to ascertain whether a link between the number nonsynonymous mutations to the ecological life form transition exists. Where multiple branches were present (clades), we accounted for the number of taxa by averaging the proportion of mutations for each terminal branch, and then followed the path from the terminal branch back until reaching the node where life form divergence (i.e. nearest common ancestor) took place. Statistical analyses were performed using a paired t test to compare the proportion of synonymous to nonsynonymous mutations across the phylogeny as well as where each branch is paired with its sister branch that represents a life form shift. Plots of regression analyses were performed using the R package ggplot2113 to assess the correlation between the proportions of nonsynonymous changes vs. branch length for the taxa that retained the ancestral state and for those who represent a life form shift (i.e. the alternative state). Univariate general linear models of the regressions were performed in SPSS 27 (Inc., Chicago, IL, USA). Furthermore, and following the same method as described above, we assessed whether there was a link between selection and species that originated from different latitudinal gradients. This was done twice, first by identifying species across latitudinal gradients where their ranges overlapped, and secondly by selecting species that had no overlapping distributions. A phylogenetic tree was reconstructed for both analyses, and selection analyses were conducted, and the number of nonsynonymous mutations were determined for each terminal branch/species.
Additionally, we sought to determine the evolutionary dynamics in the context of purifying or diversifying selection for each terminal branch based on the count of nonsynonymous and synonymous mutations (i.e. ω) for each gene and complex. To represent these results to a gene or complex level (in the form of a percentage), the number of changes per gene/complex were standardised based on gene size (i.e. total number of nonsynonymous changes/(gene size × total number of branches) × 100).
Radical changes in physicochemical properties of amino acids
The impact of positive selection on physical and chemical properties of 31 amino acids was evaluated for each of the 13 PCGs using TreeSAAP v3.2 (Selection on Amino Acid Properties)114 by detecting observed changes inferred by the phylogenetic tree under neutral conditions. The impact of 20 physicochemical properties (excluding 11 amino acids with < 85% accuracy following McClellan and Ellison115) were evaluated. Under the assumption of neutrality, a significant positive z-score as calculated by TreeSAAP (z-score > 3.09, p < 0.001) for a physicochemical property is indicative of the prevalence of nonsynonymous substitutions, while a significant negative z-score (z-score < − 3.09, p < 0.001) indicates the prevalence of synonymous substitutions. A sliding window of 15 codons was utilized, based on a categorical scale of 1 (most conservative) to 8 (most radical). Only the most radical changes (category scores of 7 and 8, p ≤ 0.001) were considered, and for the physicochemical properties that were significant for both category 7 and 8, only the z-scores that were significant for category 8 were included.
Data availability
The authors confirm that all relevant data used for this manuscript are available online from the GenBank NCBI online repository (https://www.ncbi.nlm.nih.gov/genbank/). All sequence accession numbers are list in the Supplementary Information of this manuscript (Table S1).
References
Zachos, F. E. (New) Species concepts, species delimitation and the inherent limitations of taxonomy. J. Genet. 97, 811–815. https://doi.org/10.1007/s12041-018-0965-1 (2018).
Zachos, F. E. et al. Species inflation and taxonomic artefacts—A critical comment on recent trends in mammalian classification. Mamm. Biol. 78, 1–6. https://doi.org/10.1016/j.mambio.2012.07.083 (2013).
De Queiroz, K. Species concepts and species delimitation. Syst. Bot. 56, 879–886. https://doi.org/10.1080/10635150701701083 (2007).
Morard, R. et al. Nomenclature for the nameless: A proposal for an integrative molecular taxonomy of cryptic diversity exemplified by planktonic Foraminifera. Syst. Biol. 65, 925–940. https://doi.org/10.1093/sysbio/syw031 (2016).
Shtolz, N. & Mishmar, D. The mitochondrial genome—On selective constraints and signatures at the organism, cell, and single mitochondrion levels. Front. Ecol. Evol. 7, 342. https://doi.org/10.3389/fevo.2019.00342 (2019).
Pease, J. B., Haak, D. C., Hahn, M. W. & Moyle, L. C. Phylogenomics reveals three sources of adaptive variation during a rapid radiation. PLoS Biol. 14, e1002379. https://doi.org/10.1371/journal.pbio.1002379 (2016).
Rosenblum, E. B., Römpler, H., Schöneberg, T. & Hoekstra, H. E. Molecular and functional basis of phenotypic convergence in white lizards at White Sands. Proc. Natl. Acad. Sci. U.S.A. 107, 2113–2117. https://doi.org/10.1073/pnas.0911042107 (2010).
Rosenblum, E. B., Parent, C. E. & Brandt, E. E. The molecular basis of phenotypic convergence. Annu. Ecol. Evol. Syst. 45, 203–226. https://doi.org/10.1146/annurev-ecolsys-120213-091851 (2014).
Brawand, D. et al. The genomic substrate for adaptive radiation in African cichlid fish. Nature 513, 375–381. https://doi.org/10.1038/nature13726 (2014).
Dunn, C. W. & Munro, C. Comparative genomics and the diversity of life. Zool. Scr. 45, 5–13. https://doi.org/10.1111/zsc.12211 (2016).
Finch, T. M., Zhao, N., Korkin, D., Frederick, K. H. & Eggert, L. S. Evidence of positive selection in mitochondrial complexes I and V of the African elephant. PLoS One 9, e92587. https://doi.org/10.1371/journal.pone.0092587 (2014).
van den Heuvel, L. P. & Smeitink, J. A. M. Nuclear DNA and oxidative phosphorylation. In Oxidative Phosphorylation in Health and Disease (eds Smeitink, J. A. M. et al.) 117–129 (Springer, 2004). https://doi.org/10.1007/0-387-26992-4_7.
da Fonseca, R. R., Johnson, W. E., O’Brien, S. J., Ramos, M. J. & Antunes, A. The adaptive evolution of the mammalian mitochondrial genome. BMC Genomics 9, 1–22. https://doi.org/10.1186/1471-2164-9-119 (2008).
Teacher, A. G. F., André, C., Merilä, J. & Wheat, C. W. Whole mitochondrial genome scan for population structure and selection in the Atlantic herring. BMC Evol. Biol. 12, 1–14. https://doi.org/10.1186/1471-2148-12-248 (2012).
Pesole, G., Gissi, C., De Chirico, A. & Saccone, C. Nucleotide substitution rate of mammalian mitochondrial genomes. J. Mol. Evol. 48, 427–434. https://doi.org/10.1007/pl00006487 (1999).
Ballard, J. W. O. & Whitlock, M. C. The incomplete natural history of mitochondria. Mol. Ecol. 13, 729–744. https://doi.org/10.1046/j.1365-294x.2003.02063.x (2004).
Galtier, N., Nabholz, B., Glemin, S. & Hurst, G. D. Mitochondrial DNA as a marker of molecular diversity: A reappraisal. Mol. Ecol. 18, 4541–4550. https://doi.org/10.1111/j.1365-294x.2009.04380.x (2009).
Moritz, C. Applications of mitochondrial DNA analysis in conservation: A critical review. Mol. Ecol. 3, 401–411. https://doi.org/10.1111/j.1365-294x.1994.tb00080.x (1994).
Avise, J. C. & Ellis, D. Mitochondrial DNA and the evolutionary genetics of higher animals. Philos. Trans. R. Soc. B Biol. Sci. 312, 325–342. https://doi.org/10.1098/rstb.1986.0011 (1986).
Kimura, M. The Neutral Theory of Molecular Evolution (Cambridge University Press, 1983).
Kimura, M. The neutral theory of molecular evolution: A review of recent evidence. J. Jpn. Genet. 66, 367–386. https://doi.org/10.1266/jjg.66.367 (1991).
Ballard, J. W. O. & Kreitman, M. Is mitochondrial DNA a strictly neutral marker?. Trends Ecol. Evol. 10, 485–488. https://doi.org/10.1016/s0169-5347(00)89195-8 (1995).
Rand, D. M. The units of selection on mitchondrial DNA. Annu. Rev. Ecol. Syst. 32, 415–448. https://doi.org/10.1146/annurev.ecolsys.32.081501.114109 (2001).
Ballard, J. W. O. & Kreitman, M. Unraveling selection in the mitochondrial genome of Drosophila. Genetics 138, 757–772. https://doi.org/10.1093/genetics/138.3.757 (1994).
Ballard, J. W. O. & Rand, D. M. The population biology of mitochondrial DNA and its phylogenetic implications. Annu. Ecol. Evol. Syst. 36, 621–642. https://doi.org/10.1146/annurev.ecolsys.36.091704.175513 (2005).
Teske, P. R. et al. Mitochondrial DNA is unsuitable to test for isolation by distance. Sci. Rep. 8, 1–9. https://doi.org/10.1038/s41598-018-25138-9 (2018).
Shen, Y.-Y. et al. Adaptive evolution of energy metabolism genes and the origin of flight in bats. Proc. Natl. Acad. Sci. 107, 8666–8671. https://doi.org/10.1073/pnas.0912613107 (2010).
Yu, L., Wang, X., Ting, N. & Zhang, Y. Mitogenomic analysis of Chinese snub-nosed monkeys: Evidence of positive selection in NADH dehydrogenase genes in high-altitude adaptation. Mitochondrion 11, 497–503. https://doi.org/10.1016/j.mito.2011.01.004 (2011).
Silva, G., Lima, F. P., Martel, P. & Castilho, R. Thermal adaptation and clinal mitochondrial DNA variation of European anchovy. Proc. R. Soc. B 281, 20141093. https://doi.org/10.1098/rspb.2014.1093 (2014).
Consuegra, S., John, E., Verspoor, E. & de Leaniz, C. G. Patterns of natural selection acting on the mitochondrial genome of a locally adapted fish species. Genet. Sel. Evol. 47, 1–10. https://doi.org/10.1186/s12711-015-0138-0 (2015).
Li, X. et al. Positive selection drove the adaptation of mitochondrial genes to the demands of flight and high-altitude environments in grasshoppers. Front. Genet. 9, 605. https://doi.org/10.3389/fgene.2018.00605 (2018).
Lajbner, Z., Pnini, R., Camus, M. F., Miller, J. & Dowling, D. K. Experimental evidence that thermal selection shapes mitochondrial genome evolution. Sci. Rep. 8, 1–12. https://doi.org/10.1038/s41598-018-27805-3 (2018).
Briones, M. J. I. The serendipitous value of soil fauna in ecosystem functioning: The unexplained explained. Front. Environ. Sci. 6, 149. https://doi.org/10.3389/fenvs.2018.00149 (2018).
Orgiazzi, A. et al. Geographical and temporal distribution. In Global Soil Biodiversity Atlas (eds Orgiazzi, A. et al.) 66–91 (European Commission, Publications Office of the European Union, 2016). https://doi.org/10.2788/2613.
Bardgett, R. D., Yeates, G. W. & Anderson, J. M. Patterns and determinants of soil biological diversity. In Biological Diversity and Functions in Soils (eds Bardgett, R. D. et al.) 100–118 (Cambridge University Press, 2005). https://doi.org/10.1017/cbo9780511541926.007.
D’Haese, C. A. Were the first springtails semi-aquatic? A phylogenetic approach by means of 28S rDNA and optimization alignment. Proc. R. Soc. Lond. B 269, 1143–1151. https://doi.org/10.1098/rspb.2002.1981 (2002).
Kristensen, N. P. Phylogeny of insect orders. Annu. Rev. Entomol. 26, 135–157. https://doi.org/10.1146/annurev.en.26.010181.001031 (1981).
Carapelli, A., Nardi, F., Dallai, R. & Frati, F. A review of molecular data for the phylogeny of basal hexapods. Pediobiologia 50, 191–204. https://doi.org/10.1016/j.pedobi.2006.01.001 (2006).
Nardi, F. et al. Hexapod origins: Monophyletic or paraphyletic?. Science 299, 1887–1889. https://doi.org/10.1126/science.1078607 (2003).
Cicconardi, F., Fanciulli, P. P. & Emerson, B. C. Collembola, the biological species concept and the underestimation of global species richness. Mol. Ecol. 22, 5382–5396. https://doi.org/10.1111/mec.12472 (2013).
Whalley, P. & Jarzembowski, E. A. A new assessment of Rhyniella, the earliest known insect, from the Devonian of Rhynie, Scotland. Nature 291, 317. https://doi.org/10.1038/291317a0 (1981).
Hirst, S. & Maulik, S. On some arthropod remains from the Rhynie Chert (old red sandstone). Geol. Mag. 63, 69–71. https://doi.org/10.1017/s0016756800083692 (1926).
Deharveng, L., D’Haese, C. A. & Bedos, A. Global diversity of springtails (Collembola; Hexapoda) in freshwater. Hydrobiologia 595, 329–338. https://doi.org/10.1007/s10750-007-9116-z (2008).
Hopkin, S. P. Biology of the Springtails (Insecta: Collembola) (Oxford University Press, 1997).
Ponge, J., Dubs, F., Gillet, S., Sousa, J. P. & Lavelle, P. Decreased biodiversity in soil springtail communities: The importance of dispersal and landuse history in heterogeneous landscapes. Soil Biol. Biochem. 38, 1158–1161. https://doi.org/10.1016/j.soilbio.2005.09.004 (2006).
Auclerc, A., Ponge, J. F., Barot, S. & Dubs, F. Experimental assessment of habitat preference and dispersal ability of soil springtails. Soil Biol. Biochem. 41, 1596–1604. https://doi.org/10.1016/j.soilbio.2009.04.017 (2009).
Salmon, S. et al. Linking species, traits and habitat characteristics of Collembola at European scale. Soil Biol. Biochem. 75, 73–85. https://doi.org/10.1016/j.soilbio.2014.04.002 (2014).
Rusek, J. Biodiversity of Collembola and their functional role in the ecosystem. Biodivers. Conserv. 7, 1207–1219. https://doi.org/10.1023/a:1008887817883 (1998).
Potapov, A. A., Semenina, E. E., Korotkevich, A. Y., Kuznetsova, N. A. & Tiunov, A. V. Connecting taxonomy and ecology: Trophic niches of collembolans as related to taxonomic identity and life forms. Soil Biol. Biochem. 101, 20–31. https://doi.org/10.1016/j.soilbio.2016.07.002 (2016).
Malcicka, M., Berg, M. P. & Ellers, J. Ecomorphological adaptations in Collembola in relation to feeding strategies and microhabitat. Eur. J. Soil Biol. 78, 82–91. https://doi.org/10.1016/j.ejsobi.2016.12.004 (2017).
Parisi, V., Menta, C., Gardi, C., Jacomini, C. & Mozzanica, E. Microarthropod communities as a tool to assess soil quality and biodiversity: A new approach in Italy. Agric. Ecosyst. Environ. 105, 323–333. https://doi.org/10.1016/j.agee.2004.02.002 (2005).
Martins da Silva, P. et al. Traits of collembolan life-form indicate land use types and soil properties across an European transect. Appl. Soil Ecol. 97, 69–77. https://doi.org/10.1016/j.apsoil.2015.07.018 (2016).
Yu, D. et al. Molecular phylogeny and trait evolution in an ancient terrestrial arthropod lineage: Systematic revision and implications for ecological divergence (Collembola, Tomocerinae). Mol. Phylogenet. Evol. 154, 106995. https://doi.org/10.1016/j.ympev.2020.106995 (2020).
Robin, N., D’Haese, C. & Barden, P. Fossil amber reveals springtails’ longstanding dispersal by social insects. BMC Evol. Biol. 19, 1–12. https://doi.org/10.1186/s12862-019-1529-6 (2019).
Cicconardi, F., Nardi, F., Emerson, B. C., Frati, F. & Fanciulli, P. P. Deep phylogeographic divisions and long-term persistence of forest invertebrates (Hexapoda: Collembola) in the North-Western Mediterranean basin. Mol. Ecol. 19, 386–400. https://doi.org/10.1111/j.1365-294x.2009.04457.x (2010).
Deharveng, L. Recent advances in Collembola systematics. Pediobiologia 48, 415–433. https://doi.org/10.1016/j.pedobi.2004.08.001 (2004).
Berg, M. P., Kniese, J., Bedaux, J. & Verhoef, H. A. Dynamics and stratification of functional groups of micro- and mesoarthropods in the organic layer of a Scots pine forest. Biol. Fertil. Soils 26, 268–284. https://doi.org/10.1007/s003740050378 (1998).
Petersen, H. Population dynamic and metabolic characterization of Collembola species in a beech forest ecosystem. In Soil Biology as Related to Land Use Practices (ed. Dindal, D. L.) 806–833 (Office of Pesticide and Toxic Substance, 1980).
Coulibaly, S. F. M. et al. Functional assemblages of Collembola determine soil microbial communities and associated functions. Front. Environ. Sci. 7, 52. https://doi.org/10.3389/fenvs.2019.00052 (2019).
Farris, S. The retention index and the rescaled consistency index. Cladistics 5, 417–419. https://doi.org/10.1111/j.1096-0031.1989.tb00573.x (1989).
Yang, Z. & Bielawski, J. P. Statistical methods for detecting molecular adaptation. Trends Ecol. Evol. 15, 496–503. https://doi.org/10.1016/s0169-5347(00)01994-7 (2000).
Gao, F. et al. EasyCodeML: A visual tool for analysis of selection using CodeML. Ecol. Evol. 9, 3891–3898. https://doi.org/10.1002/ece3.5015 (2019).
Ina, Y. Pattern of synonymous and nonsynonymous substitutions: An indicator of mechanisms of molecular evolution. J. Genet. 75, 91–115. https://doi.org/10.1007/bf02931754 (1996).
Yang, Z. & Nielsen, R. Synonymous and nonsynonymous rate variation in nuclear genes of mammals. J. Mol. Evol. 46, 409–418. https://doi.org/10.1007/pl00006320 (1998).
Choudhuri, S. Fundamentals of molecular evolution. In Bioinformatics for Beginners: Genes, Genomes, Molecular Evolution, Databases and Analytical Tools (ed. Choudhuri, S.) 27–53 (Academic Press, 2014). https://doi.org/10.1016/b978-0-12-410471-6.00002-5.
George, D. & Blieck, A. Rise of the earliest tetrapods: An early Devonian origin from marine environment. PLoS One 6, e22136. https://doi.org/10.1371/journal.pone.0022136 (2011).
Zhang, F. & Deharveng, L. Systematic revision of Entomobryidae (Collembola) by integrating molecular and new morphological evidence. Zool. Scr. 44, 298–311. https://doi.org/10.1111/zsc.12100 (2014).
Zhang, F., Sun, D., Yu, D. & Wang, B. Molecular phylogeny supports S-chaetae as a key character better than jumping organs and body scales in classification of Entomobryoidea (Collembola). Sci. Rep. 5, 12471. https://doi.org/10.1038/srep12471 (2015).
Chen, T., Sandmann, P., Schaefer, I. & Scheu, S. Neutral lipid fatty acid composition as trait and constraint in Collembola evolution. Ecol. Evol. 7, 9624–9638. https://doi.org/10.1002/ece3.3472 (2017).
Ding, Y., Yu, D., Guo, W., Li, J. & Zhang, F. Molecular phylogeny of Entomobrya (Collembola: Entomobryidae) from China: Colour pattern groups and multiple origins. Insect Sci. 26, 587–597. https://doi.org/10.1111/1744-7917.12559 (2018).
Zhang, F. et al. Molecular phylogeny reveals independent origins of body scales in Entomobryidae (Hexapoda: Collembola). Mol. Phylogenet. Evol. 70, 231–239. https://doi.org/10.1016/j.ympev.2013.09.024 (2014).
Timmermans, M. J. T. N., Roelofs, D., Mariën, J. & van Straalen, N. M. Revealing pancrustacean relationships: Phylogenetic analysis of ribosomal protein genes places Collembola (springtails) in a monophyletic Hexapoda and reinforces the discrepancy between mitochondrial and nuclear DNA markers. BMC Evol. Biol. 8, 83. https://doi.org/10.1186/1471-2148-8-83 (2008).
Misof, B. et al. Phylogenomics resolves the timing and pattern of insect evolution. Science 346, 763–767. https://doi.org/10.1126/science.1257570 (2014).
Beutel, R. G., Yavorskaya, M. I., Mashimo, Y. & Fukui, M. The phylogeny of Hexapoda (Arthropoda) and the evolution of megadiversity. Proc. Arthropodan Embryol. Soc. Jpn. 51, 1–15 (2017).
Carapelli, A., Liò, P., Nardi, F., Van der Wath, E. & Frati, F. Phylogenetic analysis of mitochondrial protein coding genes confirms the reciprocal paraphyly of Hexapoda and Crustacea. BMC Evol. Biol. 7, 1–13. https://doi.org/10.1186/1471-2148-7-s2-s8 (2007).
D’Haese, C. A. Morphological appraisal of Collembola phylogeny with special emphasis on Poduromorpha and a test of the aquatic. Zool. Scr. 32, 563–586. https://doi.org/10.1046/j.1463-6409.2003.00134.x (2003).
Magallón, S. A review of the effect of relaxed clock method, long branches, genes, and calibrations in the estimation of angiosperm age. Bot. Sci. 92, 1–22. https://doi.org/10.17129/botsci.37 (2014).
Pimentel, M., Escudero, M., Sahuquillo, E., Minaya, M. Á. & Catalán, P. Are diversification rates and chromosome evolution in the temperate grasses (Pooideae) associated with major environmental changes in the Oligocene-Miocene?. PeerJ 5, 1–29. https://doi.org/10.7717/peerj.3815 (2017).
Estep, M. C. et al. Allopolyploidy, diversification, and the Miocene grassland expansion. PNAS 111, 15149–15154. https://doi.org/10.1073/pnas.1404177111 (2014).
Strömberg, C. Decoupled taxonomic radiation and ecological expansion of open-habitat grasses in the Cenozoic of North America. PNAS 102, 11980–11984. https://doi.org/10.1073/pnas.0505700102 (2005).
Fjellberg, A. The Collembola of Fennoscandia and Denmark, Part II: Entomobryomorpha and Symphypleona (Koninklijke Brill NV, 2007).
Moreau, C. S., Bell, C., Vila, R., Archibald, S. & Pierce, N. Phylogeny of the ants: Diversification in the age of angiosperms. Science 312, 101–104. https://doi.org/10.1126/science.1124891 (2006).
Peters, R. S. et al. Evolutionary history of the Hymenoptera. Curr. Biol. 27, 1013–1018. https://doi.org/10.1016/j.cub.2017.01.027 (2017).
Prinzing, A. & Woas, S. Habitat use and stratification of Collembola and oribatid mites. In Arthropods of Tropical Forests: Spatio-temporal Dynamics and Resource Use in the Canopy (eds Basset, Y. et al.) 271–281 (Cambridge University Press, 2003).
Satta, Y., Ishiwa, H. & Chigusa, S. I. Analysis of nucleotide substitutions of mitochondrial DNAs in Drosophila melanogaster and its sibling species. Mol. Biol. Evol. 4, 638–650. https://doi.org/10.1093/oxfordjournals.molbev.a040464 (1987).
Moriyama, E. N. & Powell, J. R. Synonymous substitution rates in Drosophila: Mitochondrial versus nuclear genes. J. Mol. Evol. 45, 378–391. https://doi.org/10.1007/pl00006243 (1997).
Lawrie, D. S., Messer, P. W., Hershberg, R. & Petrov, D. A. Strong purifying selection at synonymous sites in D. melanogaster. PLoS Genet. 9, 33–40. https://doi.org/10.1371/journal.pgen.1003527 (2013).
Nachman, M. W. Deleterious mutations in animal mitochondrial DNA. Genetica 102(103), 61–69. https://doi.org/10.1007/978-94-011-5210-5_6 (1998).
Meiklejohn, C. D., Montooth, K. L. & Rand, D. M. Positive and negative selection on the mitochondrial genome. Trends Genet. 23, 259–263. https://doi.org/10.1016/j.tig.2007.03.008 (2007).
Stewart, J. B. et al. Strong purifying selection in transmission of mammalian mitochondrial DNA. PLoS Biol. 6, 63–71. https://doi.org/10.1371/journal.pbio.0060010 (2008).
Ponnuswamy, P. K., Prabhakaran, M. & Manavalan, P. Hydrophobic packing and spatial arrangement of amino acid residues in globular proteins. Biochim. Biophys. Acta 623, 301–316. https://doi.org/10.1016/0005-2795(80)90258-5 (1980).
Gromiha, M. M., Oobatake, M., Kono, H., Uedaira, H. & Sarai, A. Relationship between amino acid properties and protein stability: Buried mutations. J. Protein Chem. 18, 565–578. https://doi.org/10.1023/a:1020603401001 (1999).
Kumar, S., Stecher, G., Li, M., Knyaz, C. & Tamura, K. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 35, 1547–1549. https://doi.org/10.1093/molbev/msy096 (2018).
Xia, X. DAMBE7: New and improved tools for data analysis in molecular biology and evolution. Mol. Biol. Evol. 35, 1550–1552. https://doi.org/10.1093/molbev/msy073 (2018).
Castoe, T. A., Jiang, Z. J., Gu, W., Wang, Z. O. & Pollock, D. D. Adaptive evolution and functional redesign of core metabolic proteins in snakes. PLoS One 3, e2201. https://doi.org/10.1371/journal.pone.0002201 (2008).
Xia, X. & Lemey, P. Assesing substitution saturation with DAMBE. In Phylogenetic Handbook: A Practical Approach to Phylogenetic Analysis and Hypothesis Testing (eds Lemey, P. et al.) 611–626 (Cambridge University Press, 2009).
Xia, X., Hafner, M. S. & Sudman, P. D. On transition bias in mitochondrial genes of pocket gophers. J. Mol. Evol. 43, 32–40. https://doi.org/10.1007/bf02352297 (1996).
Bouckaert, R. et al. BEAST 2: A software platform for Bayesian evolutionary analysis. PLoS Comput. Biol. 10, e1003537. https://doi.org/10.1371/journal.pcbi.1003537 (2014).
Deharveng, L. & Bedos, A. Diversity of terrestrial invertebrates in subterranean habitats. In Cave Ecology, Ecological Studies (eds Moldovan, O. T. et al.) 107–172 (Springer International Publishing, 2018).
Letunic, I. & Bork, P. Interactive Tree Of Life (iTOL): An online tool for phylogenetic tree display and annotation. Bioinformatics 23, 127–128. https://doi.org/10.1093/bioinformatics/btl529 (2007).
Xiong, Y., Gao, Y., Yin, W. & Luan, Y. Molecular phylogeny of Collembola inferred from ribosomal RNA genes. Mol. Phylogenet. Evol. 49, 728–735. https://doi.org/10.1016/j.ympev.2008.09.007 (2008).
Sun, X. et al. Phylomitogenomic analyses on collembolan higher taxa with enhanced taxon sampling and discussion on method selection. PLoS One 15, e0230827. https://doi.org/10.1371/journal.pone.0230827 (2020).
Cucini, C. et al. Re-evaluating the internal phylogenetic relationships of Collembola by means of mitogenome data. Genes 12, 44. https://doi.org/10.3390/genes12010044 (2021).
Leo, C., Carapelli, A., Cicconardi, F., Frati, F. & Nardi, F. Mitochondrial genome diversity in Collembola: Phylogeny, dating and gene order. Diversity 11, 169. https://doi.org/10.3390/d11090169 (2019).
Miller, M. A. et al. A RESTful API for access to phylogenetic tools via the CIPRES Science Gateway. Evol. Bioinform. 11, 43–48. https://doi.org/10.4137/ebo.s21501 (2015).
Rambaut, A., Drummond, A., Xie, D., Baele, G. & Suchard, M. A. Posterior summarization in bayesian phylogenetics using Tracer 1.7. Syst. Biol. 67, 901–904. https://doi.org/10.1093/sysbio/syy032 (2018).
Maddison, W. P. & Maddison, D. R. Mesquite: A modular system for evolutionary analysis. http://www.mesquiteproject.org (2019).
Lewis, P. O. A likelihood approach to estimating phylogeny from discrete morphological character data. Syst. Biol. 50, 913–925. https://doi.org/10.1080/106351501753462876 (2001).
Pagel, M., Meade, A. & Barker, D. Bayesian estimation of ancestral character states on phylogenies. Syst. Biol. 53, 673–684. https://doi.org/10.1080/10635150490522232 (2004).
Jordan, F. M., Gray, R. D., Greenhill, S. J. & Mace, R. Matrilocal residence is ancestral in Austronesian societies. Proc. R. Soc. B 276, 1957–1964. https://doi.org/10.1098/rspb.2009.0088 (2009).
Yang, Z. PAML 4: Phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591. https://doi.org/10.1093/molbev/msm088 (2007).
Yang, Z. & Nielsen, R. Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages. Mol. Biol. Evol. 19, 908–917. https://doi.org/10.1093/oxfordjournals.molbev.a004148 (2002).
Wickham, H. ggplot2: Elegant Graphics for Data Analysis (Springer, 2016).
Woolley, S., Johnson, J., Smith, M. J., Crandall, K. A. & Mcclellan, D. A. TreeSAAP: Selection on amino acid properties using phylogenetic trees. Bioinform. Appl. Note 19, 671–672. https://doi.org/10.1093/bioinformatics/btg043 (2003).
McClellan, D. A. & Ellison, D. D. Assessing and improving the accuracy of detecting protein adaptation with the TreeSAAP analytical software. Int. J. Bioinform. Res. Appl. 6, 120–133. https://doi.org/10.1504/ijbra.2010.032116 (2010).
Acknowledgements
The authors thank the National Research Foundation (NRF) Competitive Programme for Rated Researcher grant to B.v.V (NRF CPRR: grant number 138016), the University of Johannesburg funding to B.v.V. and GES 4.0 fellowship awarded to D.M.M. and D.C.M., and the South African (NRF)/Russia (RFBR) Joint Science and Technology Research Collaboration project no. 19-516-60002 and no. 118904 (NRF) awarded to M.P. and C.JS., respectively. The authors wish to thank the analyses platform and bioinformatics support provided by the Centre for High Performance Computing (CHPC) in Cape Town, South Africa. We appreciate the constructive criticism received by the two anonymous reviewers which improved the manuscript.
Author information
Authors and Affiliations
Contributions
D.M.M., A.E.-K., and B.v.V. designed the research; D.M.M. and D.C.M. analysed data; D.M.M., C.J.-S., L.D., A.B., M.P., and S.P.P. compiled and corrected the ecological life form Supplementary Table; all authors contributed to the writing of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Monsanto, D.M., Main, D.C., Janion-Scheepers, C. et al. Mitogenome selection in the evolution of key ecological strategies in the ancient hexapod class Collembola. Sci Rep 12, 14810 (2022). https://doi.org/10.1038/s41598-022-18407-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-022-18407-1
This article is cited by
-
A computational study on mitogenome-encoded proteins of Pavo cristatus and Pavo muticus identifies key genetic variations with functional implications
Journal of Genetic Engineering and Biotechnology (2023)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.