Divergent vertebral formulae shape the evolution of axial complexity in mammals

Li, Yimeng; Brinkworth, Andrew; Green, Emily; Oyston, Jack; Wills, Matthew; Ruta, Marcello

doi:10.1038/s41559-023-01982-5

Download PDF

Article
Open access
Published: 06 March 2023

Divergent vertebral formulae shape the evolution of axial complexity in mammals

Nature Ecology & Evolution volume 7, pages 367–381 (2023)Cite this article

6600 Accesses
4 Citations
82 Altmetric
Metrics details

Subjects

Abstract

Complexity, defined as the number of parts and their degree of differentiation, is a poorly explored aspect of macroevolutionary dynamics. The maximum anatomical complexity of organisms has undoubtedly increased through evolutionary time. However, it is unclear whether this increase is a purely diffusive process or whether it is at least partly driven, occurring in parallel in most or many lineages and with increases in the minima as well as the means. Highly differentiated and serially repeated structures, such as vertebrae, are useful systems with which to investigate these patterns. We focus on the serial differentiation of the vertebral column in 1,136 extant mammal species, using two indices that quantify complexity as the numerical richness and proportional distribution of vertebrae across presacral regions and a third expressing the ratio between thoracic and lumbar vertebrae. We address three questions. First, we ask whether the distribution of complexity values in major mammal groups is similar or whether clades have specific signatures associated with their ecology. Second, we ask whether changes in complexity throughout the phylogeny are biased towards increases and whether there is evidence of driven trends. Third, we ask whether evolutionary shifts in complexity depart from a uniform Brownian motion model. Vertebral counts, but not complexity indices, differ significantly between major groups and exhibit greater within-group variation than recognized hitherto. We find strong evidence of a trend towards increasing complexity, where higher values propagate further increases in descendant lineages. Several increases are inferred to have coincided with major ecological or environmental shifts. We find support for multiple-rate models of evolution for all complexity metrics, suggesting that increases in complexity occurred in stepwise shifts, with evidence for widespread episodes of recent rapid divergence. Different subclades evolve more complex vertebral columns in different configurations and probably under different selective pressures and constraints, with widespread convergence on the same formulae. Further work should therefore focus on the ecological relevance of differences in complexity and a more detailed understanding of historical patterns.

Stepwise shifts underlie evolutionary trends in morphological complexity of the mammalian vertebral column

Article Open access 07 November 2019

Katrina E. Jones, Kenneth D. Angielczyk & Stephanie E. Pierce

Serial disparity in the carnivoran backbone unveils a complex adaptive role in metameric evolution

Article Open access 15 July 2021

Borja Figueirido, Alberto Martín-Serra, … Roger J. Benson

Regionalization of the axial skeleton predates functional adaptation in the forerunners of mammals

Article 03 February 2020

Katrina E. Jones, Sarah Gonzalez, … Stephanie E. Pierce

Main

Biological complexity¹, taxonomic diversity^2,3 and morphological disparity^4,5 are three fundamental components of macroevolutionary dynamics. Increasingly, diversity and disparity have been examined alongside each other, especially in analyses of extinct organisms^6,7,8,9,10. In contrast, biological complexity remains remarkably understudied. Measuring complexity is a more challenging prospect than quantifying either diversity or disparity¹¹ but has vast potential for illuminating the origin of body plans^12,13, the imbalances in species richness across groups¹⁴ and the temporal and group-specific patterns of morphological diversification^15,16,17.

It has long been recognized that biological complexity can be indexed at various levels and that changes across levels are often decoupled, such that one is a poor predictor of the others¹⁸. In its simplest formulation, complexity is defined as the number of constituent parts, or types of parts, in an organism (for example, genes, cells, tissues and organs). This definition is the most common in empirical studies of complexity, partly because of its immediacy but also because it translates into simple indices^19,20,21,22. The hierarchical organization of biological systems offers additional proxies for complexity, such as the length and interconnectedness of biochemical pathways and gene regulatory networks or the degree of integration and modularity in the form and function of organismal parts²³. One key aspect of anatomical complexity is the proliferation of, and the differentiation between, serial homologues^{23,24,25,26,27,28,29,30}. The complexity of serial structures can be quantified as the number of elements forming a series (for example, 24 presacral vertebrae), the number of element types (for example, three types of presacral vertebrae: cervical, thoracic and lumbar) and/or the number of elements of each type (for example, 7 cervicals, 12 thoracics and 5 lumbars). Furthermore, these numbers can be synthesized as summative indices of the relative abundance and distribution of element types, analogous to the diversity of species across communities or individuals within species in an ecological sample^21,25,26. Lastly, the morphological complexity of serial structures can be quantified, inter alia, as the total range of variation among elements, the sum of differences between sequentially adjacent elements and the direction and magnitude of gradients and slopes along a series^24,28,29.

Throughout the history of life, there has been an undisputable net increase in diversity³¹, disparity³² and maximum complexity³³. However, as with other empirical evolutionary rules (for example, Cope–Depéret Rule of increasing body size over time^34,35,36,37), the dynamics of increases in complexity are unclear and evidence for their generality is equivocal³⁸. At one extreme, the ‘zero-force evolutionary law’ (ZFEL) states that in the absence of drivers or constraints, mean complexity tends to increase over time^22,39. This could happen because increasing numbers of serial homologues or gene copies increase the number of degrees of freedom available as the substrate for differentiation. Unless otherwise constrained by anatomical, functional, genetic or developmental links, these components may passively differentiate with time. Natural selection may facilitate or prevent this divergence but will operate either in addition or in opposition to the ZFEL^{23,24,25,40,41,42,43}. Increases in mean and maximum complexity can result from the balance between three, mutually non-exclusive processes often operating simultaneously and potentially in opposite directions across subclades^35,44,45. (1) Passive processes (‘diffusive’ evolution) imply that evolving lineages undergo random walks but the extent of decreases is limited by the existence of lower bounds, such as biomechanical or physiological constraints. (2) Clade sorting involves the preferential radiation of clades with higher intrinsic complexity and the extinction of those with less. (3) Driven processes point toward more frequent increases than decreases, such that minimum, mean and maximum values move progressively farther away from the initial lower bound^{1,18,22,25,44,45}. Driven trends have a particular conceptual importance because individual lineages can be viewed as independent statistical replicates of the evolutionary process, such that parallel increases in multiple lineages can be taken as evidence for an underlying evolutionary tendency^22,25,44,45. So far, support for a driven trend in increasing complexity through time has only been found in an overarching study of crustacean tagmosis²⁵, where complexity was indexed as a function of the number of paired appendages in different morphofunctional categories. However, no analyses with a comparable taxonomic reach exist for other groups.

Here, we use a similar methodological and conceptual approach to examine complexity in another exemplary model system, the vertebral column of mammals^28,29,30, although other serial structures (for example, body segments, paired appendages and teeth) would be suitable in other groups (for example, annelids, arthropods and vertebrates)^24,25,26,27. The plasticity of the mammalian column results from developmental^{46,47,48,49,50,51}, ecological^47,52,53,54, functional^51,55,56,57 and evolutionary^{48,49,53,54,58,59} factors, which also explain differences in vertebral numbers. For example, thoracic and lumbar counts are relatively conserved in some groups (for example, Marsupialia, ‘Artiodactyla’ and Felidae) but vary in others (for example, Afrotheria, Cetacea, Chiroptera, Primates and Xenarthra)^{46,47,48,49,52,53,60,61,62,63,64,65}. Recent studies have examined axial regionalization near the origin of crown mammals and in several extant clades^{28,29,53,54,55,56,57,58,59,66}. Furthermore, inferred ancestral conditions for vertebral counts have been mapped onto phylogenies to reconstruct major shifts in homoeotic domains^{48,49,52,53,56,67}.

In this article, we test for trends in the evolution of presacral complexity and investigate whether any such trends arose by passive or driven processes. We address three integrated hypotheses: H₀1, vertebral counts do not differ significantly between major mammal groups; H₀2, changes in complexity are non-directional; and H₀3, rates of change do not exhibit departures (shifts) from an initial ‘background’ rate (namely, the evolutionary rate inherited by all branches of a phylogeny under a Brownian motion model of trait evolution). To address these hypotheses, we calculate several indices of complexity from the presacral formulae of a large and diverse sample of extant species and estimate index values at the internal nodes of a time-scaled phylogeny⁶⁸. We use these estimates to quantify the magnitude and directionality of changes along ancestor–descendant lineages and to analyse temporal and group-specific patterns of axial ‘complexification’^22,35,44,66. We measure the relative incidence of passive versus driven processes on trends by examining how the total skewness of each index is partitioned across groups⁴⁵. Lastly, we model the evolution of complexity under a relaxed-clock Brownian motion^69,70 process to detect branch-specific shifts in rates of change.

Complexity indices and reference phylogeny

Two complexity indices derived from information theory and adapted from metrics used in ecology and archaeology²¹ are applied to the presacral and thoracolumbar sections of the vertebral column (Fig. 1) of 1,136 extant species (Supplementary Data 1 and 2). As the number of cervical vertebrae is nearly constant in mammals, we discuss primarily the thoracolumbar section, unless specified otherwise. The Brillouin index of numerical diversity (H_R) and the evenness index of proportional distribution (H´_R)^{25,26,27,71,72,73,74,75} quantify complexity in terms of the relative numerical richness of vertebrae in each column region and the degree to which they are apportioned uniformly across regions, respectively. Furthermore, we use presacral (CTL), thoracolumbar (TL), thoracic (T) and lumbar (L) counts, as well as thoracic to lumbar (T:L) ratios (both unstandardized and logit-transformed), as additional indices that describe region-specific elongation and the relative sizes of the thoracic and lumbar homoeotic domains^48,53,76. We use a taxonomically pruned version of a recent phylogeny⁶⁸ (Fig. 2 and Supplementary Data 3), with species assigned to ten groups: Afrotheria, Cetartiodactyla, Chiroptera, Eulipotyphla, Glires, Monotremata + Marsupialia, Perissodactyla, Ferae, Euarchonta and Xenarthra. Except for Monotremata + Marsupialia (a paraphyletic group to accommodate the small number of monotremes in our taxon sample), all other groups are clades.

**Fig. 1: Examples of mammalian vertebral columns and calculations of three complexity indices.**

**Fig. 2: Time-calibrated phylogeny of 1,136 extant mammal species used in this study.**

Results

Vertebral counts differ significantly between groups

For each of the CTL, TL, T and L counts, Poisson regression analyses⁷⁷ find significant differences in vertebral numbers between groups. Goodness-of-fit tests (analyses of deviance) indicate that all regression models depart significantly from a null model (H₀ = predicted group-specific counts identical to observed counts) (Supplementary Table 1). Out of 45 pair-wise group comparisons, post hoc Tukey tests retrieve 18 significant differences between group-specific counts with TL, 12 with CTL, 16 with T and 15 with L (Supplementary Table 1). Cetartiodactyla, Perissodactyla, Afrotheria, Chiroptera, Glires and Xenarthra are the most frequently occurring clades in the total number of significant pair-wise comparisons across all counts. All 12 significant comparisons with CTL are also retrieved with TL, 10 are shared between TL and T, 7 between TL and L and 2 between T and L. Afrotheria and Perissodactyla are the most widely represented clades in significant pair-wise comparisons of T counts, while Xenarthra and Cetartiodactyla feature prominently in comparisons of L counts. Finally, the only two significant comparisons in common to T and L involve Cetartiodactyla and each of Perissodactyla and Afrotheria (Supplementary Table 1 and Supplementary Information).

Complexity is distributed unevenly across groups

For the CTL and TL Brillouin and evenness and the untransformed and logit T:L, the total range of values is apportioned unevenly across groups, with varying degrees of overlap between group-specific index values and with differences in the number and location of distribution modes (Fig. 3 and Supplementary Data 1). Spearman’s rank-order correlation tests show that the group-specific index ranges do not correlate significantly with group size (Supplementary Data 1) and phylogenetic analyses of variance⁷⁸ return no significant differences in mean index values between groups (Supplementary Table 2). Monotremata + Marsupialia, Afrotheria and Xenarthra encompass the largest proportions of the total range of values for CTL Brillouin, CTL and TL evenness and T:L. Monotremata + Marsupialia and Afrotheria, along with Cetartiodactyla, also span much of the total range for TL Brillouin, while Monotremata + Marsupialia, Cetartiodactyla and Glires occupy a large proportion of the total range for logit T:L. In contrast, Ferae and Perissodactyla include the smallest proportions of the total range for most indices (Supplementary Data 1). In most groups, the indices are either bimodally or multimodally distributed (Hartigans’ dip tests; H₀, unimodal distributions)⁷⁹, suggesting distinct evolutionary trajectories (for example, trends towards separate optima) but in three early diverging groups—Monotremata + Marsupialia, Afrotheria and Xenarthra—they are unimodal (Fig. 3 and Supplementary Data 1).

Changes in complexity are concentrated in younger branches

We visualize changes in complexity indices across the phylogeny using continuous trait mapping⁸⁰ (Fig. 4 and Extended Data Fig. 1), with colour-coded maximum likelihood estimates at internal nodes and interpolated values along branches. We focus on TL (Fig. 4a), TL Brillouin and evenness (Fig. 4b,c) and untransformed T:L (Fig. 4d). The distributions of CTL Brillouin and evenness (Extended Data Fig. 1a,b) and logit T:L (Extended Data Fig. 1c), resemble their counterparts in Fig. 4 and are not discussed.

**Fig. 4: Continuous mapping of complexity indices across the phylogeny.**

In agreement with previous findings⁴⁹, a TL count of 19 is estimated at the deepest nodes of the phylogeny (for example, separation between Marsupialia and Placentalia; emergence of major placental cohorts) and near the roots of most extant orders. Small deviations from this plesiomorphic value characterize Chiroptera with an estimated ancestral count of 18 (19 in ref. ⁴⁹), Carnivora with 20 (see also ref. ⁴⁶) and Perissodactyla and Afrotheria with 22 each (22–24 in Perissodactyla and 21–30 in Afrotheria in ref. ⁴⁶). Within Afrotheria, 22 is also the most likely ancestral estimate for the ecologically diverse Paenungulata (Sirenia, Proboscidea and Hyracoidea) (23 in ref. ⁴⁹). A TL count of 19 is also probably plesiomorphic for primates, despite conspicuous differences in vertebral counts in this group^49,81. At less deep nodes, TL estimates are somewhat variable. This is a function of remarkable differences in vertebral counts in small groups of closely related species and in large subclades within some orders (for example, Chiroptera and Cetacea; Fig. 4a and Supplementary Data 1).

Along the deepest branches, changes in complexity near the roots of sister clades reveal consistent patterns for all information theory indices. The branches subtending Monotremata and Theria (Marsupialia + Placentalia) feature a decrease and an increase, respectively. Increases occur along the branches leading to Marsupialia and Placentalia. Within Placentalia, Atlantogenata (Xenarthra + Afrotheria) show a decrease, while their more speciose sister clade, Boreoeutheria, features an increase. Increases also occur near the roots of the two major boreoeutherian clades—Laurasiatheria and Euarchontoglires—as well as along the basal branches of Euarchonta and Glires. Within Laurasiatheria, Eulipotyphla show a decrease, while other laurasiatherians (Scrotifera) experience an increase. At the next deepest node, Chiroptera show a decrease whereas remaining Scrotifera (Ferae + Perissodactyla + Cetartiodactyla) exhibit an increase (for an account of group-specific patterns, see Supplementary Information).

TL counts correlate negatively with T:L ratios

A phylogenetically independent contrasts⁸² analysis yields a significant negative correlation between TL counts and T:L ratios across all sampled species (Supplementary Table 3). As with other indices, the ratios reveal a markedly heteroskedastic distribution (Supplementary Fig. 1), with a little over 2% of variance in the data explained by the regression model. A locally estimated scatterplot smoothing (LOESS) curve fitted to the T:L versus TL scatterplot (Fig. 5a) shows a steep negative slope for TL up to 19, a negligible rise to 20 and a gentle negative slope throughout higher TL counts (≥20). The dispersion of T:L values increases rapidly at TL ≥16 but reduces drastically at TL ≥29, with the long tail at the right-hand side of the distribution reflecting the large number of lumbar vertebrae in many Cetacea^57,62 (Fig. 5b,c and Supplementary Data 1; for the correlation between T:L and each of T and L, see Supplementary Information).

**Fig. 5: Patterns of thoracolumbar differentiation across major groups.**

TL counts correlate positively with complexity

TL counts are positively correlated with each of the TL Brillouin and evenness indices, although significantly so only for TL Brillouin (Supplementary Figs. 2 and 3 and Supplementary Table 3; see Supplementary Information and Supplementary Figs. 4 and 5 for patterns associated with CTL). This suggests that mammals with increasingly more elongate postcervical regions show a tendency for thoracic and lumbar elements to be distributed equitably. This pattern is rendered more elaborate by variations in T and L, whereby mammals with lower T and/or higher L also reveal greater relative numerical richness (Supplementary Information).

Thoracolumbar complexity and domains

Phylogenetic generalized least square regressions⁸³ support a strong, negative and highly significant correlation between T:L ratios and each of the TL evenness and Brillouin indices (Supplementary Figs. 6 and 7 and Supplementary Table 3). The seemingly linear decrease in TL Brillouin and evenness values with increasing T:L ratios is not consistent among groups and reversals (especially for TL Brillouin) occur in Afrotheria, Cetartiodactyla, Euarchonta and Glires (Supplementary Figs. 6 and 7).

Changes in complexity are both directional and sustained

To establish whether changes in complexity conform to trends and whether any such trends are predominantly passive or driven, we implement descendant–ancestor tests^84,85,86 and subclade tests^{45,84,87,88,89,90}. For the descendant–ancestor tests, we first examine temporal patterns of complexity change by regressing maximum likelihood estimates of the complexity indices at the internal nodes of the phylogeny (ancestral node values) against node ages. Subsequently, we test whether high initial levels of complexity bias downstream changes towards increases in daughter lineages by regressing the differences between successive node estimates (descendant value minus ancestor value for each branch) against ancestral node values. We model all correlations using robust linear regression, which is relatively insensitive to outliers and heteroskedasticity⁹¹.

Heteroskedasticity-robust Wald F-tests⁹² show that the slopes and intercepts of most regression models differ significantly from zero (Supplementary Table 4). The ancestral values of the CTL and TL Brillouin and evenness indices correlate positively (but not strongly) with node ages, both across phylogeny (Fig. 6a,c, Extended Data Fig. 2a,c and Supplementary Table 4) and in most individual groups (Supplementary Figs. 8–11 and Supplementary Table 4). The only clades exhibiting a temporal trend of decreasing complexity are Afrotheria (CTL Brillouin; CTL and TL evenness), Xenarthra (TL Brillouin) and Perissodactyla (CTL and TL Brillouin and evenness) but in none of them is the regression slope significant for any index (Supplementary Table 4). For the unstandardized and logit T:L, node estimates and node ages are mostly negatively correlated (except in Perissodactyla for unstandardized and logit T:L and Afrotheria for logit T:L), indicating a tendency for ratios to decrease towards the present (that is, thoracic and lumbar vertebrae tend to be apportioned uniformly) (Fig. 6e,g, Supplementary Figs. 12 and 13 and Supplementary Table 4). This pattern is mainly associated with increasingly larger numbers of lumbar elements, primarily within Cetacea and in several Afrotheria and primates.

**Fig. 6: Bivariate plots of node estimates of complexity indices versus node ages and descendant–ancestor differences versus node estimates.**

For all indices, the correlations between ancestral values and descendant–ancestor differences, both across the entire taxon set and in most major groups, are positive, although never strong. Except in Eulipotyphla (for all indices), Afrotheria and Xenarthra (for logit T:L) and Afrotheria (for TL Brillouin), the slopes of the regression models are mostly significant (Fig. 6b,d,f,h, Extended Data Fig. 2b,d, Supplementary Figs. 14–19 and Supplementary Table 4), suggesting that, as complexity values at nodes become larger, so do the differences between adjacent nodes⁹³. More broadly, for all information theory indices, increases significantly outnumber decreases (two-tailed sign tests) across phylogeny. Furthermore, the mean magnitude of combined increases and decreases is positive and significantly different from zero (two-tailed, one-sample Wilcoxon tests), with the mean magnitude of increases greatly exceeding the mean absolute magnitude of decreases (two-tailed, unpaired two-sample Wilcoxon tests) (Supplementary Table 4). For the unstandardized T:L, decreases outnumber increases, indicating a tendency for thoracic and lumbar elements to attain similar proportions.

In addition to the tests above, we quantify subclade skewness as a partial test of a driven trend⁴⁵. The skewness of a continuous trait (in particular, a right-skewed trait) within a parent clade suggests a hard bound in the opposite direction to the skew, with diffusion away from this bound. Where subclades within the parent clade are also skewed in a similar direction, this suggests a replicated and driven trend (note: the mean values of the subclades may be distributed symmetrically around the parent clade’s mean or may occur further in the direction of the overall skew⁴⁵). Formally, large contributions of within-group skewness (SCW) indicate that trends are predominantly shaped by driven processes, while large contributions of between-group (SCB) and heteroskedasticity-related (SCH) skewness suggest the prevalence of passive processes⁴⁵. In the case of the unstandardized T:L, a subclade test shows that slightly over 32% of its total skewness is accounted for by SCW, a little over 50% by SCH and the rest by SCB (Supplementary Table 5). For other indices, the proportion of SCW varies (32% for TL Brillouin, 29.9% for TL evenness, 26.5% for CTL Brillouin and 24.6% for CTL evenness) and accounts for the second largest proportion of total skewness after SCH (Supplementary Figs. 20–24 and Supplementary Table 5). At the broad taxonomic scale used here, these results indicate that passive processes are primarily responsible for trends in axial complexity but that driven components of change are also apparent. The limited contribution of SCW may be attributed to various factors⁴⁵. For instance, the variance of a trait may differ across subgroups or correlate negatively with their size. In addition, a trait may appear skewed in opposite directions in different subgroups and its distribution in one or more of these may fall outside the right-hand tail of the parent clade’s distribution. Lastly, large subgroups may obliterate smaller-scale patterns in the distribution of total skewness⁴⁵. In the Supplementary Information, we comment briefly on the influence of group size on the distribution of skewness using the TL Brillouin and evenness indices as examples (Extended Data Fig. 3).

Rate shifts reveal divergent patterns between clades

We find support for a multiple-rate model of complexity change. All complexity indices are characterized by widespread rate shifts with broadly congruent branch locations, directions, magnitudes and posterior probabilities. We discuss rate distribution mostly in relation to TL Brillouin (Extended Data Fig. 4a) and evenness (Extended Data Fig. 4b) (for patterns associated with other indices, see Extended Data Fig. 5). Shifts are absent or negligible in most deep branches, indicating few departures from background rates. In contrast, many of the more apical branches reveal shifts with large posterior probabilities, mostly representing rate decreases. Although increases are similarly widespread, they feature mainly at, or near, the tips of the phylogeny and are linked to divergent vertebral formulae occurring in single species and/or small subclades within the more speciose groups. Most of the branches subtending major mammal cohorts and orders are not underpinned by rate shifts, except for Atlantogenata (Afrotheria + Xenarthra) and Scandentia (tree shrews) which feature, respectively, an increase (for both indices) and a decrease (for TL evenness) with large posterior probabilities. For both indices, two conspicuous decreases occur in Marsupialia, one near the base of a diverse clade of small to mid-sized omnivores and carnivores (Tasmanian wolf, quolls, numbats, bilbies and bandicoots), the other near the base of macropods (bettongs, kangaroos, potoroos and wallabies). Additional decreases, usually with large posterior probabilities and typically located near the terminal branches, feature in all other groups. Notable examples of such decreases (involving TL Brillouin and/or evenness) characterize Chiroptera (mouse-eared bats; TL evenness), Ferae (weasels, ferrets, mink and their allies, felids, canids and pinnipeds), Cetacea (various mesoplodont beaked whales; TL evenness), ‘Artiodactyla’ (Old World deer and several bovid clades, such as Caprini and Antelopini), primates (bamboo, ruffed and true lemurs, leaf-eating monkeys and macaques), Lagomorpha (rabbits and hares) and Rodentia (early diverging groups, such as several flying, ground and tree squirrels within sciurids and many clades in more deeply nested rodent suborders) (Extended Data Fig. 4).

In the case of CTL evenness (Extended Data Fig. 5b), as well as the unstandardized and logit T:L (Extended Data Fig. 5c,d), complex patterns of rate shifts occur in Carnivora, Cetartiodactyla and Chiroptera. For those three indices, Carnivora show internested increases and decreases with low to moderate posterior probabilities along the basal branches separating the major clades of Caniformia and (for T:L only) Feliformia. Internested increases and decreases in T:L with moderate to high posterior probabilities also occur in Chiroptera (evening bats). Uniquely among all tested indices, T:L shows small internested increases within Perissodactyla⁹⁴ (one at the base of the entire clade, the other at the base of Rhinocerotidae) (Extended Data Fig. 5c). In Cetacea, interspersed increases and decreases characterize CTL Brillouin and evenness (Extended Data Fig. 5a,b), whereas widespread increases are associated with the unstandardized and logit T:L (Extended Data Fig. 5c,d). Increases with large posterior probabilities feature predominantly among baleen whales (Mysticeti) and dolphins, whereas decreases occur near the roots of toothed whales (Odontoceti).

Discussion

The strikingly divergent thoracolumbar patterns of five clades—Cetacea, Afrotheria, Xenarthra, Carnivora and Chiroptera^{95,96,97,98,99,100,101,102,103,104}—have broad evolutionary and ecological relevance to our understanding of axial regionalization and are therefore discussed in some detail. With the largest variance in L counts of all groups and the third largest variance in T counts after those of Xenarthra and Afrotheria (Supplementary Data 1), Cetacea rank among the mammals with the most complex thoracolumbar regions (TL Brillouin, 0.5775–0.6298; TL evenness, 0.8911–0.997). In Cetacea, postcervical elongation varies across lineages and according to habitats and ecologies^57,62,63. Among toothed whales (Odontoceti), beaked whales commonly have 9 or 10 (rarely, 11) thoracic vertebrae, whereas in other odontocete families, as well as in baleen whales (Mysticeti), T counts ≥12 are common. Appreciably more striking are the variations in L counts. For example, the stout-bodied South American river dolphin and the pygmy right whale have three lumbar vertebrae, while the slender and elongate right whale dolphins have >30. Freshwater cetaceans generally exhibit lower TL counts and their vertebral centra tend to be elongate and spool-like. In marine species, two divergent patterns emerge⁶³. Higher TL counts, coupled with abbreviated and disc-like centra, are frequently observed in small taxa adapted to fast swimming and active hunting (for example, oceanic dolphins and porpoises). Biomechanically, such features both afford rigidity and stability in the trunk and enable a powerful tail beat, allowing these animals to chase swift and agile prey. In contrast, lower TL counts usually occur in mid-sized to gigantic taxa (for example, baleen whales, killer whales, sperm whales, narwhals, belugas and beaked whales), in which variably elongate to isodimensional centra provide flexibility and manoeuvrability. Such characteristics assist these species with their vast repertoire of feeding habits, including foraging on the seabed or in discontinuous, high-density food patches in the water column⁶³. During cetacean evolution, the vertebral column was released from the functional requirements associated with stance, gait and weight-bearing, all of which limit the range of vertebral numerical variation in terrestrial species^46,52,61 and became less functionally plastic and more morphologically homogeneous⁵⁷. Alongside a simplification in the shape of the individual vertebrae, an increase in vertebral number was a key step during the transition to a fully aquatic lifestyle⁶². This transition was marked by the progressive reduction and subsequent loss of the hind limbs and a functional shift from paddling to manoeuvring in the fore limbs. At the same time, thoracic and lumbar elongation conferred greater stability to the trunk. Selection for improved stability and acquisition of a streamlined body profile may explain why numerous cetaceans exhibit similar numbers of thoracic and lumbar elements. Thus, in 48 out of 82 species in our sample, T:L varies from ~0.8 to 1.25, with absolute differences between T and L counts ranging from 0 to 2.

Unlike Cetacea, Afrotheria and Xenarthra attain trunk elongation mostly through additions of thoracic vertebrae. Afrotheria also showcase major differences in L counts, with the second largest variance after Cetartiodactyla. The plasticity of the vertebral column in Afrotheria reflects multiple biomechanical and ecological adaptations^49,52,53, exemplified by graviportal elephants, swimming manatees, burrowing aardvarks and various small to mid-sized insectivorous taxa of semi-aquatic to terrestrial habits, such as tenrecs, golden moles, otter shrews and elephant shrews. Manatees and golden moles are remarkably convergent in possessing moderately elongate thoracic regions (15, 17 or 19 vertebrae) and extremely abbreviated lumbar regions (2–4 vertebrae). Whereas in manatees these characteristics enhance column stability during swimming⁹⁵, in various golden moles they act as shock-absorbing devices, rendering the body compact while these animals probe through sand with their fore limbs and muzzle⁹⁶. Hyraxes rank among the Afrotheria with the largest T (20–23) and L (7 and 8) counts. Their plump appearance and short limbs contrast with their elongate, strongly arched and flexible columns, allowing them to move rapidly on rock surfaces and trees⁹⁷. Like manatees, elephant shrews also display relatively low TL counts (20 and 21). Similar to other mammals adapted to running or leaping^52,61, elephant shrews show restricted variation in their presacral formulae and, alongside aardvarks, feature the most complex presacral columns of all Afrotheria (CTL Brillouin, 0.9359–0.8985; CTL evenness, 0.9641–0.9277).

Similarly varied in terms of column construction are Xenarthra, characterized by their unusual vertebral articulations and a wide range of specializations⁹⁸, including osteoderms and burrowing lifestyles in armadillos (9–12 thoracics and 3–5 lumbars), fossoriality and arboreality in anteaters (15–17 thoracics and 2 or 3 lumbars) and suspensoriality in sloths (15–24 thoracics and 3 or 4 lumbars). The southern long-nosed armadillo shows the highest vertebral complexity in the group despite its low presacral count (C7, T9, L5; H_R = 0.9174; H´_R = 0.9727), whereas the Hoffmann’s two-toed sloth features the lowest despite its higher presacral count (formula: C6, T23, L3; H_R = 0.6743; H´_R = 0.6862). Hence, there is some suggestion that within Xenarthra higher vertebral complexity is driven in part by greater trunk stability associated with the evolution of a defensive armour and curling behaviours in armadillos, while lower complexity is partly the result of an expansion of the thoracic region and a concurrent reduction in the number of lumbar elements in arboreal species (various sloths)^52,61.

Two placental clades—the Carnivora and Chiroptera—stand out because of their markedly discontinuous vertebral count distributions. In Carnivora, the posterior thoracic and lumbar vertebrae are generally characterized by greater individual disparity across species and lower serial differentiation than the anterior thoracic and cervical vertebrae⁵⁵. Such attributes are thought to facilitate the striking functional versatility of the posterior trunk region in this clade, as well as among mammals more broadly⁵⁹. Consistent with this hypothesis is the fact that, despite a substantial degree of conservatism in TL counts⁹⁹, several Carnivora showcase highly divergent presacral formulae (for example, grisons, hyaenas, otters, seals and weasels^48,49,53,55). The probability density distributions of their complexity indices (Fig. 3 and Supplementary Data 1) are strikingly multimodal. Both TL Brillouin and TL evenness show no fewer than four modes, each reflecting convergent patterns of axial regionalization in separate lineages. For both indices, felids, canids and herpestids (mongooses and their kin) cluster around an optimum at the right-hand side of the probability distributions. The first two groups consist of small- to large-sized, long-limbed digitigrade hunters built for strength and speed. In contrast, herpestids include small carnivore generalists with elongate bodies and tails and short limbs. Narrowly separated from this optimum is a second peak in the probability distribution, including large ambush predators, such as ursids, and the small, long-bodied and short-tailed mustelids (weasels and their kin), a diverse clade of terrestrial, arboreal, aquatic and fossorial taxa. Largely separate from the first two distribution modes is a third optimum almost exclusively dominated by pinnipeds, the thoracolumbar formulae of which differ from those of most other carnivores (T15, L5 in most species)¹⁰⁰. Finally, an inconspicuous mode at the left-hand side of the distribution includes several ursids, mustelids and some hyaenids.

As in Carnivora, the distribution of complexity modes in Chiroptera is decoupled from phylogenetic clustering. Both TL Brillouin and (to a lesser degree) TL evenness are bimodal (Fig. 3 and Supplementary Data 1), with species from phylogenetically separate families clustered around each mode. Unlike axial complexity, patterns of vertebral fusion in Chiroptera are mostly phylogenetically structured¹⁰¹. This paradox may be explained by considering the construction of the vertebral column in light of recent embryological data. Vertebral fusion is ubiquitous in bats, confers stability and rigidity to the column and is a key adaptation for sustained flight^64,65. As in other dorsostable groups (for example, suspensory and slow-climbing taxa), variations in TL counts are poorly constrained in bats^52,61. Uniquely among mammals, bats feature a delayed onset of column ossification, with substantial morphogenetic patterning taking place in prenatal developmental stages¹⁰². This patterning is responsible for the specialized traits observed in the adults of several species, including varying degrees of vertebral fusion, restructuring of vertebral bony processes and remodelling of cervical elements in response to roosting habits^64,65. A recent hypothesis for the origin of flight in bats¹⁰³ posits that the ancestral morphotype of Chiroptera was a nocturnal, insectivorous and arboreal placental with well-developed interdigital webbing. Subsequent stages in the evolution of bats included interdigital webbing expansion, the transition from arboreal to roosting habits via an intermediate suspensory phase¹⁰⁴ and shifts in prenatal sequence heterochrony. Such shifts may have relaxed constraints on thoracolumbar counts and altered the boundaries between the thoracic and lumbar domains, resulting in the remarkable proliferation of vertebral formulae in extant bats⁶⁵.

Methods

Data collection

Vertebral formulae (Supplementary Data 1) were obtained from published compendia^{46,47,48,49,52,53,60,61,65} (Supplementary Data 2) and supplemented by data in their accompanying bibliography and citations, Boolean searches in Google Scholar (combining ‘and/or’ operators with keywords such as ‘mammal(s)’, ‘vertebra(e)’, ‘formula(e)’, ‘thoracic’, ‘lumbar’ and the names of individual mammal orders), as well as synopses from Mammalian Species (https://academic.oup.com/mspecies). Species were sampled from as many families as possible within each order to provide adequate coverage of presacral variation. Where intraspecific variation was documented, we selected the most widely represented count (usually, ≥50% of all specimens listed in a publication; but see ref. ⁵²). We excluded the sacral and caudal regions because their vertebral counts were often difficult to obtain^48,49. All analyses, tests and graphs were produced in R v.4.2.0. R codes are provided in Supplementary Data 4.

Phylogeny construction

We used a recently assembled time-calibrated phylogeny of mammals⁶⁸ built from a Bayesian analysis of a 31-gene supermatrix coupled with fossil-based backbone relationships and divergence time estimates. We exported 1,000 randomly selected trees from the posterior distribution available at vertlife.org and stored them as a ‘multiphylo’ object in R. Subsequently, we chose the single tree from the random sample closest to the centroid of tree space and available in Supplementary Data 3. To this end, we first extracted pair-wise tree-to-tree distances in phangorn¹⁰⁵ (KF.dist function), using the branch score distance (BSD)¹⁰⁶ as the preferred distance metric. BSD is calculated as the square root of the sum of squared differences between the branch lengths of a given tree pair. It is preferred over the more widely used Robinson–Fould (RF) distance¹⁰⁷ because the latter often results in narrow value distributions, heavy index saturation and limited power to differentiate alternative tree topologies. For instance, transpositions of even a single taxon pair can yield maximal changes in RF¹⁰⁸. The matrix of pair-wise BSD distances was used to calculate total summed distances between each tree and all other trees in the random sample (colSums function; base R). The tree with the smallest sum of column-wise distances from all the others was chosen for all subsequent analyses and pruned to include the species in our sample (drop.tip function in ape¹⁰⁹). Before performing phylogenetic comparative analyses, we re-ordered all data tabulations to ensure that the order of taxon labels matched that of the tree (match.phylo.data function in picante¹¹⁰). The circular tree (Fig. 2) with branches coloured by group was obtained with the groupClade (colour coding) and ggtree (aesthetic mapping) functions in ggtree¹¹¹.

Complexity indices

Counts and ratios were derived from the tabulated vertebral formulae (Supplementary Data 1) using base R. The logit T:L ratios were obtained with the logit function in car¹¹². The logit values are a desirable alternative to log-transformations in that they both stabilize the variance of the ratio distribution and extend its left and right extremes, such that ratios that differ only slightly are more widely spread at the tail ends of the logit distribution. The information theory indices were calculated with the heterogeneity and evenness functions in tabula¹¹³. The formula for the Brillouin index (H_R) is:

$$H_\mathrm{R} = \frac{1}{N}\mathrm{log}\frac{{N!}}{{\mathop {\prod }\nolimits_{i = 1}^k n_i!}}$$

where N is the total number of vertebrae, n_i is the number of vertebrae in the ith region and k is the number of regions. The base of the log operator can take any value. The formula for the evenness index (H′_R) is:

$$H^\prime _\mathrm{R} = \frac{{N \times H_\mathrm{R}}}{{\mathrm{log}N! - \left( {k - d} \right) \times \mathrm{log}c! - d \times \mathrm{log}\left( {c + 1} \right)!}}$$

where H_R is the Brillouin index and c and d are the integer and modulus of N and k, respectively. The Brillouin index is appropriate for count data derived from a known collection rather than a random sample⁷³, accommodates variation associated with the least numerically diverse vertebral types⁷⁴ and offers an intuitive characterization of complexity, in that higher values indicate greater dissimilarity across column regions while accounting for the numerical richness of vertebral types in each. Similarly, the evenness index explains how uniformly spread the vertebrae are across regions. While related, the Brillouin and evenness indices describe numerical variation in subtly different ways and need not correlate. As an example, consider the following numerical sets, each consisting of two integers that add up to the same total count: a = {10, 1}, b = {9, 2}, c = {8, 3}, d = {7, 4} and e = {6, 5}. The Brillouin index increases from a to e (0.2179, 0.3643, 0.4641, 0.5271 and 0.5577), indicating increases in the relative numerical abundance of elements across the five sets. The evenness index also increases in the same order (0.3908, 0.6531, 0.8321, 0.9451 and 1), reaching its maximum value in e, where the two integers contribute most equitably to their sum (note: swapping the positions of the integers does not alter the values of either index). In this example, the two indices are positively correlated. However, the correlation becomes less predictable when the sum of integers varies across sets. For instance, in the two sets j = {14, 9} and k = {8, 6}, the corresponding sums are 23 and 14, the Brillouin indices are 0.5918 and 0.5719 and the evenness indices are 0.9643 and 0.9835. An inverse correlation between the total count and the Brillouin index is possible. For instance, w = {7, 3} (sum = 10) has a higher Brillouin index (0.4787) than z = {9, 3} (sum = 12; index = 0.4494). Following on from these examples and to assist the reader in interpreting the polarity of changes, it is appropriate to view increases in the two indices as complementary but distinct facets of column ‘complexification’. We produced graphic summaries of the index distributions across groups in the form of violin plots in vioplot¹¹⁴ (Fig. 3) and probability density distributions in ggplot2 (ref. ¹¹⁵) (Supplementary Data 1).

Poisson regressions and phylogenetic analyses of variance

To test for differences in each of the CTL, TL, T and L counts between groups, we performed Poisson regressions of counts versus groups (Supplementary Table 1) using the glm.nb function in MASS¹¹⁶ to build regression models. We used the Anova function in car to assess the degree and significance of parameter deviance from a null model. We used the nagelkerke function in rcompanion⁷⁷ to calculate pseudo-R² coefficients, measuring how well each of the Poisson regression models explains the data. In addition, we used emmeans¹¹⁷ for conducting post hoc tests of significant pair-wise differences between estimated means of group-specific counts. Lastly, we evaluated significant differences among the mean values of each index across groups⁷⁸ (Supplementary Table 2) using the phylANOVA function in phytools¹¹⁸.

Independent contrasts and phylogenetic generalized least squares analyses

We used phylogenetic independent contrasts (brunch function in caper⁸³) (Supplementary Table 3) to model the regression between TL counts and T:L ratios (Fig. 5a and Supplementary Fig. 1), as well as between each of the CTL and TL counts and their associated Brillouin and evenness indices (Supplementary Figs. 2–5). The ‘brunch’ regression model is suitable for predictors, such as counts, that take the form of ordered multinomial data. In addition, we used phylogenetic generalized least squares regression (pgls function in caper) (Supplementary Table 3) to correlate T:L ratios with each of the TL Brillouin and evenness indices (Supplementary Figs. 6 and 7). Comparative data objects for both brunch and pgls regressions used the phylogenetic variance–covariance matrix with the full array of individual branch lengths contributing to the shared lengths between any two tips of the phylogeny. For each regression, we output diagnostic plots using the same functions. Given the heteroskedastic distributions of all indices, local variations in the distribution of values for the predictor and response values are more appropriately visualized through LOESS regression curves¹¹⁹ in the bivariate scatterplots (note: log-transforming the variables, either each one separately or both together, does not remove the influence of heteroskedasticity).

Continuous trait mapping

We used the contMap function in phytools to produce colour-coded plots of the complexity indices onto the phylogeny (Fig. 4 and Extended Data Fig. 1). Index values at nodes represent maximum likelihood estimates⁸⁰, while those along the branches are interpolated following methods in ref. ¹²⁰.

Descendant–ancestor tests

For each index, we calculated estimated values at internal nodes under maximum likelihood with the fastAnc function in phytools. We adapted protocols expounded in ref. ⁹³ to correlate descendant–ancestor differences with ancestral node values across the entire tree (Fig. 6b,d,f,h and Extended Data Fig. 2b,d) and for each individual group (Supplementary Figs. 13–19) after correcting for the regression to the mean artefact. The regression to the mean is a statistical phenomenon whereby, due to imperfect correlations between variables (such as may be caused by sampling error and/or inadequately representative samples), the value of a variable tends to occur outside the ‘norm’ when first measured but it is likely to approach the population mean in subsequent measures¹²¹. As the regression to the mean is intrinsic to any imperfect correlation, we took it into account as integral to the regression procedure. The first step in this procedure involved testing for equality of variances associated with each of the sets of ancestor and descendant values. Depending upon outcome, one can use the Pearson’s correlation coefficient ‘r’ to adjust the descendant–ancestor differences when the equality of variances is not rejected or a modified correcting index when it is rejected. In the first scenario, let X and Y represent, respectively, vectors of ancestor and descendant values. The difference between X and Y can be modified as follows, to account for the regression to the mean artefact:

$${{D = r}}\, {\times} \left( {{{{\mathbf{X}-{\mathrm{mean}}}}}\left( {{{\mathbf{X}}}} \right)} \right){{{\mathrm{-}}}}\left( {{{{\mathbf{Y}-{\mathrm{mean}}}}}\left( {{{\mathbf{Y}}}} \right)} \right)$$

In the second scenario, let sX and sY be the standard deviations, respectively, of X and Y and varX and varY their respective variances. Following ref. ¹²², we introduce a correcting factor, termed ‘adj’, given by the following formula:

$${{{\mathrm{adj = }}}}\left( {{{2\times r\times {\mathrm{s}}{\mathbf{X}}\times {\mathrm{s}}{\mathbf{Y}}}}} \right){{{\mathrm{/}}}}\left( {\mathrm{var}}{\mathbf{X}} + {\mathrm{var}}{\mathbf{Y}} \right)$$

We then calculate adjusted differences as follows:

$${{{D1 = {\mathrm{adj}} \, \times }}}\left( {{{{\mathbf{X} - {\mathrm{mean}}}}}\left( {{{\mathbf{X}}}} \right)} \right){{{\mathrm{-}}}}\left( {{{{\mathbf{Y}-{\mathrm{mean}}}}}\left( {{{\mathbf{Y}}}} \right)} \right)$$

Note: the sign of these differences was subsequently reversed to ensure that negative and positive differences (descendant minus ancestor value) represent, respectively, decreases and increases. We tested for significant differences between increases and decreases (positive and negative D1 values, respectively) in three ways using base R functions. First, we established whether the number of increases and the number of decreases deviated from a 1:1 null proportional distribution with a binomial (sign test) of equal proportions. Second, we tested whether the mean of all D1 values differed from zero using a two-tailed, one-sample Wilcoxon signed rank test. Third, we compared the absolute mean of all negative D1s with the mean of all positive D1s through an unpaired, two-tailed, two-sample Wilcoxon rank sum test. Following these preliminary characterizations of the distributions of D1 values and to establish whether large or small ancestral values tend to be associated chiefly with large or small descendant–ancestor differences, we carried out robust linear regressions in MASS (Supplementary Table 4). The significance of the slope and intercept of all regression models was established with a heteroskedasticity-robust Wald F-test using sfsmisc. Robust linear regressions were further applied to individual groups. To provide direct tests of the overall direction of index variation over time, we undertook robust linear regressions of ancestor values against node ages in MASS for the entire taxon set (Fig. 6a,c,e,g and Extended Data Fig. 2a,c) and for individual groups (Supplementary Figs. 8–12). The node ages were obtained in paleotree¹²³ (dateNodes function).

Subclade tests

To establish whether trends conform to passive or driven processes, we applied a subclade test of skewness⁴⁵. The test operates by partitioning the total skewness of a trait in a parent group into three main components: skewness between subclades (SCB); skewness within subclades (SCW); and heteroskedasticity-related skewness (SCH; skewness caused by heterogeneity in trait variance in subclades) (Supplementary Figs. 20–24, Extended Data Fig. 3 and Supplementary Table 5). The test quantifies the combined effects of passive and driven processes in terms of the proportional contributions of SCB, SCW and SCH to the total skewness. Such contributions are based upon the normal versus non-normal distributions of mean subclade values relative to the mean value of the parent group and upon the degree of skewness in the subclades (that is, small versus large standard deviations of subclade values within the right tail of the total group). The prevalence of either SCB or SCH suggests passive trends, while the prevalence of SCW suggests driven trends. High proportions of SCW suggest that the overall skewness pattern is replicated in several constituent subclades. This is itself indicative of a driven trend, as a tendency towards higher or lower values will skew not only the ensemble distribution but also the distributions of most or all subclades^{45,84,88,89,90}. As an example, consider a parent group with a right-skewed trait (for example, complexity or size). Furthermore, suppose that each constituent subclade shows symmetrically distributed values around its own mean and that the means of the subclade distributions are right-skewed around the parent group’s mean. In this scenario, the right-skewed distribution of the parent group results from a passive trend, as SCB prevails. Now assume that the distribution of each subclade is also right-skewed. In this scenario, SCW prevails, pointing to a driven trend. This would also be the case if the means of the subclades were symmetrically distributed around the parent group’s mean. In a final scenario, suppose that each subclade has symmetrically distributed values and, further, that the variance increases in subclades at the right tail end of the parent group’s distribution (that is, those subclades exhibit a greater spread of values around their own means). In this case, the right-skewed distribution of the parent group is caused by heteroskedasticity (SCH) and, as in the case of SCB, it indicates a passive process⁴⁵. The test code builds probability density functions for the values of the parent group and those of its subclades and outputs a list of the percentage contributions of SCB, SCH and SCW to the skewness of the parent group. For each index, the total skewness was calculated in e1071. As analyses of skewness are predicated on right-skewed distributions^45,84 and because most indices are negatively skewed (except for unstandardized T:L), we transformed those indices by taking their negative logarithms⁸⁹ before subjecting them to the test. Subclade tests on selected major groups followed identical protocols.

Analyses of rates

We used reversible-jump Markov Chain Monte Carlo methods in geiger⁶⁹ to detect shifts in rates of complexity changes⁷⁰ (Extended Data Figs. 4 and 5). For each tested index, we ran the rjmcmc.bm function for 5 × 10⁷ generations under a relaxed-clock Brownian motion model of evolution, sampling every 500 generations. Under relaxed-clock Brownian motion, changes are permitted following two modalities: (1) unusually high or low rates occurring on single branches, resulting in a shift in global optimum along those branches and (2) shifts in rate of evolution inherited by all descendants of a given branch. After each run, we checked chain convergence towards equilibrium in Tracer¹²⁴ and visualized the posterior Bayesian rate estimates (median values) and rate shifts on the tree using plotting functions in geiger. Branches showing median rates that are higher or lower than the background rates are colour-coded with gradients of colour intensity (maroon for higher rates and steelblue for lower), where greater intensity signifies greater rates of change. The posterior probabilities of rate shifts are indicated by solid circles, the size of which is proportional to their posterior probabilities and with colour coding identical to that used for the branch rates (maroon, upturn shift; steelblue, downturn shift).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The data that support the findings of this study are available in Figshare and accessible at https://doi.org/10.6084/m9.figshare.21622284. Supplementary Tables 1–5 include outputs from the following analyses: Poisson regressions of counts versus groups; phylogenetic analyses of variance for the complexity indices; phylogenetically corrected correlations between various categories of vertebral counts and complexity indices; robust linear regressions between ancestral complexity values and descendant–ancestor differences, as well as between node ages and ancestral values; and subclade tests. Supplementary Data 1 lists taxa, their presacral counts and their complexity indices. It also reports univariate statistics and histogram distributions for the thoracic and lumbar counts alongside probability density distributions for various complexity indices. The literature sources on vertebral formulae are listed in Supplementary Data 2. The time-scaled phylogeny is available in Supplementary Data 3 as an object of class ‘phylo’. R code is reproduced in Supplementary Data 4 and is accompanied by templates for running analyses on individual data files extracted from Supplementary Data 1. Such data files are combined as separate tabs within individual spreadsheets and are available as Source Data for Figs. 3–6 and for Extended Data Figs. 1–5. These Source Data can be redeployed for building Supplementary Figs 1–24.

Code availability

All results and graphics are reproducible using the protocols detailed in the Methods and the examples accompanying the relevant functions of various packages. R code is deposited in Figshare and accessible at https://doi.org/10.6084/m9.figshare.21622284.

References

McShea, D. W. in Evolutionary Developmental Biology: A Reference Guide (eds de la Rosa, L. N. & Müller, G.) 169–179 (Springer, 2021).
Benton, M. J. The Red Queen and the Court Jester: species diversity and the role of biotic and abiotic factors through time. Science 323, 728–732 (2009).
Article CAS PubMed Google Scholar
Foote, M. The evolution of morphological diversity. Ann. Rev. Ecol. Syst. 28, 129–152 (1997).
Article Google Scholar
Guillerme, T. et al. Disparities in the analysis of morphological disparity. Biol. Lett. 16, 20200199 (2020).
Article PubMed PubMed Central Google Scholar
Fortey, R. A., Briggs, D. E. G. & Wills, M. A. The Cambrian evolutionary ‘explosion’: decoupling cladogenesis from morphological disparity. Biol. J. Linn. Soc. 57, 13–33 (1996).
Google Scholar
Ruta, M., Angielczyk, K. D., Fröbisch, J. & Benton, M. J. Decoupling of morphological disparity and taxic diversity during the adaptive radiation of anomodont therapsids. Proc. Biol. Sci. 280, 20131071 (2013).
PubMed PubMed Central Google Scholar
Hopkins, M. J. Decoupling of taxonomic diversity and morphological disparity during decline of the Cambrian trilobite family Pterocephaliidae. J. Evol. Biol. 26, 1665–1676 (2013).
Article CAS PubMed Google Scholar
Wan, J. et al. Decoupling of morphological disparity and taxonomic diversity during the end-Permian mass extinction. Paleobiology 47, 402–417 (2021).
Article Google Scholar
Cavin, L., Piuz, A., Ferrante, C. & Guinot, G. Giant Mesozoic coelacanths (Osteichthyes, Actinistia) reveal high body size disparity decoupled from taxic diversity. Sci. Rep. 11, 11812 (2021).
Article CAS PubMed PubMed Central Google Scholar
Bapst, D. W., Bullock, P. C., Melchin, M. J., Sheets, H. D., & Mitchell, C. E. Graptoloid diversity and disparity became decoupled during the Ordovician mass extinction. Proc. Natl Acad. Sci. USA 109, 3428–3433 (2012).
Article CAS PubMed PubMed Central Google Scholar
Adami, C. What is complexity? BioEssays 24, 1085–1094 (2002).
Article PubMed Google Scholar
Deline, B. et al. Evolution of metazoan morphological disparity. Proc. Natl Acad. Sci. USA 115, E8909–E8918 (2018).
Article CAS PubMed PubMed Central Google Scholar
Erwin, D. H. Developmental capacity and the early evolution of animals. J. Geol. Soc. Soc. 178, jgs2020–jgs2245 (2021).
Google Scholar
Benson, R. B., Butler, R., Close, R. A., Saupe, E. & Rabosky, D. L. Biodiversity across space and time in the fossil record. Curr. Biol. 31, 1225–1236 (2021).
Article Google Scholar
Puttick, M. N., Guillerme, T. & Wills, M. A. The complex effects of mass extinctions on morphological disparity. Evolution 74, 2207–2220 (2020).
Article PubMed Google Scholar
Wang, M., Lloyd, G. T., Zhang, C. & Zhou, Z. The patterns and modes of the evolution of disparity in Mesozoic birds. Proc. Biol. Sci. 288, 20203105 (2021).
PubMed PubMed Central Google Scholar
Ferron, H. G. et al. Categorical versus geometric morphometric approaches to characterizing the evolution of morphological disparity in Osteostraci (Vertebrata, stem Gnathostomata). Palaeontology 63, 717–732 (2020).
Article Google Scholar
Brandon, R. & McShea, D. W. The Missing Two-Thirds of Evolutionary Theory (Cambridge Univ. Press, 2020).
Leslie, A. B., Simpson, C. & Mander, L. Reproductive innovations and pulsed rise in plant complexity. Science 373, 1368–1372 (2021).
Article CAS PubMed Google Scholar
Carroll, S. B. Chance and necessity: the evolution of morphological complexity and diversity. Nature 409, 1102–1109 (2001).
Article CAS PubMed Google Scholar
Rebout, N. et al. Measuring complexity in organisms and organizations. R. Soc. Open Sci. 8, 200895 (2021).
Article PubMed PubMed Central Google Scholar
McShea, D. W. & Brandon, R. N. Biology’s First Law: The Tendency for Diversity and Complexity to Increase in Evolutionary Systems (Univ. of Chicago Press, 2010).
Klingenberg, C. P. Studying morphological integration and modularity at multiple levels: concepts and analysis. Philos. Trans. R. Soc. Lond. B Biol. Sci. 369, 20130249 (2014).
Article PubMed PubMed Central Google Scholar
Fusco, G. & Minelli, A. Measuring morphological complexity of segmented animals: centipedes as model systems. J. Evol. Biol. 13, 38–46 (2001).
Article Google Scholar
Adamowicz, S. J., Purvis, A. & Wills, M. A. Increasing morphological complexity in multiple parallel lineages of the Crustacea. Proc. Natl Acad. Sci. USA 105, 4786–4791 (2008).
Article CAS PubMed PubMed Central Google Scholar
Yang, J. et al. A superarmored lobopodian from the Cambrian of China and early disparity in the evolution of Onychophora. Proc. Natl Acad. Sci. USA 112, 8678–8683 (2015).
Article CAS PubMed PubMed Central Google Scholar
Lafuma, F., Corfe, I. J., Clavel, J. & Di-Poï, N. Multiple evolutionary origins and losses of tooth complexity in squamates. Nat. Commun. 12, 6001 (2021).
Article CAS PubMed PubMed Central Google Scholar
Jones, K. E., Gonzalez, S., Angielczyk, K. D. & Pierce, S. E. Regionalization of the axial skeleton predates functional adaptation in the forerunners of mammals. Nat. Ecol. Evol. 4, 470–478 (2020).
Article PubMed Google Scholar
Felice, R. N. & Angielczyk, K. D. in Early Evolutionary History of the Synapsida (eds Kammerer, C. F. et al.) 25–51 (Springer, 2014).
Figueirido, B. et al. Body-axis organization in tetrapods: a model-system to disentangle the developmental origins of convergent evolution in deep time. Biol. Lett. 18, 20220047.
Benton, M. J. & Emerson, B. C. How did life become so diverse? The dynamics of diversification according to the fossil record and molecular phylogenetics. Palaeontology 50, 23–40 (2007).
Article Google Scholar
Oyston, J. W., Hughes, M., Gerber, S. & Wills, M. A. Why should we investigate the morphological disparity of plant clades? Ann. Bot. 117, 859–879 (2016).
Article PubMed Google Scholar
Valentine, J. W., Collins, A. G. & Meyer, C. P. Morphological complexity increase in metazoans. Paleobiology 20, 131–142 (1994).
Article Google Scholar
Stanley, S. M. An explanation for Copeas rule. Evolution 27, 1–26 (1973).
Article PubMed Google Scholar
Alroy, J. Cope’s rule and the dynamics of body mass evolution in North American fossil mammals. Science 280, 731–734 (1998).
Article CAS PubMed Google Scholar
Bokma, F. et al. Testing for Depéret’s rule (body size increase) in mammals using combined extinct and extant data. Syst. Biol. 65, 98–108 (2016).
Article PubMed Google Scholar
Raia, P. & Fortelius, M. Cope’s law of the unspecialized, Cope’s rule, and weak directionality in evolution. Evol. Ecol. Res. 15, 747–756 (2013).
Google Scholar
Lineweaver, C. H., Davies, P. C. W. & Ruse, M (eds) Complexity and the Arrow of Time (Cambridge Univ. Press, 2013).
McShea, D. W., Wang, S. C. & Brandon, R. N. A quantitative formulation of biology’s first law. Evolution 73, 1101–1115 (2019).
Article PubMed Google Scholar
Szathmáry, E., Jordán, F. & Pál, C. Can genes explain biological complexity? Science 292, 1315–1316 (2001).
Article PubMed Google Scholar
Taft, R. J. & Mattick, J. S. Increasing biological complexity is positively correlated with the relative genome-wide expansion of non-protein-coding DNA sequences. Gen. Biol 5, P1 (2003).
Article Google Scholar
Duclos, K. K., Hendrikse, J. L. & Jamniczky, H. A. Investigating the evolution and development of biological complexity under the framework of epigenetics. Evol. Dev. 21, 276–293 (2019).
Article PubMed Central Google Scholar
Hagolani, P. F., Zimm, R., Vroomans, R. & Salazar-Ciudad, I. On the evolution and development of morphological complexity: a view from gene regulatory networks. PLoS Comput. Biol. 21, e1008570 (2021).
Article Google Scholar
McShea, D. W. Mechanisms of large‐scale evolutionary trends. Evolution 48, 1747–1763 (1994).
Article PubMed Google Scholar
Wang, S. C. Quantifying passive and driven large‐scale evolutionary trends. Evolution 55, 849–858 (2001).
Article CAS PubMed Google Scholar
Narita, Y. & Kuratani, S. Evolution of the vertebral formulae in mammals: a perspective on developmental constraints. J. Exp. Zool. B Mol. Dev. Evol 304, 91–106 (2005).
Article PubMed Google Scholar
Kawashima, T., Thorington, R. W. Jr, Bohaska, P. W. & Sato, F. Variability and constraint of vertebral formulae and proportions in colugos, tree shrews, and rodents, with special reference to vertebral modification by aerodynamic adaptation. Folia Morphol. 77, 44–56 (2018).
Article CAS Google Scholar
Müller, J. et al. Homeotic effects, somitogenesis and the evolution of vertebral numbers in recent and fossil amniotes. Proc. Natl Acad. Sci. USA 107, 2118–2123 (2010).
Article PubMed PubMed Central Google Scholar
Sánchez‐Villagra, M. R., Narita, Y. & Kuratani, S. Thoracolumbar vertebral number: the first skeletal synapomorphy for afrotherian mammals. Syst. Biodiv. 5, 1–7 (2007).
Article Google Scholar
Hautier, L., Weisbecker, V., Sánchez-Villagra, M. R., Goswami, A. & Asher, R. J. Skeletal development in sloths and the evolution of mammalian vertebral patterning. Proc. Natl Acad. Sci. USA 107, 18903–18908 (2010).
Article CAS PubMed PubMed Central Google Scholar
Arnold, P. Evolution of the mammalian neck from developmental, morpho-functional, and paleontological perspectives. J. Mammal. Evol. 28, 173–183 (2021).
Article Google Scholar
Williams, S. A. et al. Increased variation in numbers of presacral vertebrae in suspensory mammals. Nat. Ecol. Evol. 3, 949–956 (2019).
Article PubMed Google Scholar
Asher, R. J., Bennett, N. & Lehmann, T. The new framework for understanding placental mammal evolution. Bioessays 31, 853–864 (2009).
Article CAS PubMed Google Scholar
Jones, K. E., Angielczyk, K. D. & Pierce, S. E. Stepwise shifts underlie evolutionary trends in morphological complexity of the mammalian vertebral column. Nat. Comm. 10, 5071 (2019).
Figueirido, B. et al. Serial disparity in the carnivoran backbone unveils a complex adaptive role in metameric evolution. Comm. Biol. 4, 863 (2021).
McCollum, M. A., Rosenman, B. A., Suwa, G., Meindl, R. S. & Lovejoy, C. O. The vertebral formula of the last common ancestor of African apes and humans. J. Exp. Zool. B Mol. Dev. Evol 314, 123–134 (2010).
PubMed Google Scholar
Buchholtz, E. A. Vertebral osteology and swimming style in living and fossil whales (Order: Cetacea). J. Zool. 253, 175–190 (2001).
Article Google Scholar
Jones, K. E., Benitez, L., Angielczyk, K. D. & Pierce, S. E. Adaptation and constraint in the evolution of the mammalian backbone. BMC Evol. Biol. 18, 172 (2018).
Jones, K. E. et al. Fossils reveal the complex evolutionary history of the mammalian regionalized spine. Science 361, 1249–1252 (2018).
Article CAS PubMed Google Scholar
Owen, R. Descriptive Catalogue of the Osteological Series Contained in the Museum of the Royal College of Surgeons of England Vol. 2 (Taylor and Francis, 1853).
Galis, F. et al. Fast running restricts evolutionary change of the vertebral column in mammals. Proc. Natl Acad. Sci. USA 111, 11401–11406 (2014).
Article CAS PubMed PubMed Central Google Scholar
Buchholtz, E. A. Modular evolution of the cetacean vertebral column. Evol. Dev. 9, 278–289 (2007).
Article CAS PubMed Google Scholar
Gillet, A., Frédérich, B. & Parmentier, E. Divergent evolutionary morphology of the axial skeleton as a potential key innovation in modern cetaceans. Proc. Biol. Sci. 286, 20191771 (2019).
PubMed PubMed Central Google Scholar
Gaudioso, P. J., Díaz, M. M. & Barquez, R. M. Morphology of the axial skeleton of seven bat genera (Chiroptera: Phyllostomidae). An. Acad. Bras. Cienc. 89, 2341–2358 (2017).
Article PubMed Google Scholar
Walton, D. W. & Walton, G. M. in About Bats (eds Slaughter, B. H. & Walton, D. W.) 93–126 (Southern Methodist Univ. Press, 1970).
McShea, D. W. Evolutionary change in the morphological complexity of the mammalian vertebral column. Evolution 47, 730–740 (1993).
Article PubMed Google Scholar
Mao, F., Zhang, C., Liu, C. & Meng, J. Fossoriality and evolutionary development in two Cretaceous mammaliamorphs. Nature 592, 577–582 (2021).
Article CAS PubMed Google Scholar
Upham, N. S., Esselstyn, J. A. & Jetz, W. Inferring the mammal tree: species-level sets of phylogenies for questions in ecology, evolution, and conservation. PLoS Biol. 17, e3000494 (2019).
Article CAS PubMed PubMed Central Google Scholar
Pennell, M. et al. geiger v2.0: an expanded suite of methods for fitting macroevolutionary models to phylogenetic trees. Bioinformatics 30, 2216–2218 (2014).
Article CAS PubMed Google Scholar
Eastman, J. M., Alfaro, M. E., Joyce, P., Hipp, A. L. & Harmon, L. J. A novel comparative method for identifying shifts in the rate of character evolution on trees. Evolution 65, 3578–3589 (2011).
Article PubMed Google Scholar
Brillouin, L. Science and Information Theory (Academic Press, 1956).
Laxton, R. R. The measure of diversity. J. Theor. Biol. 70, 51–67 (1978).
Article CAS PubMed Google Scholar
Peet, R. K. Relative diversity indices. Ecology 56, 496–498 (1975).
Article Google Scholar
Pielou, E. C. Ecological Diversity (Wiley, 1975).
Magurran, A. E. Measuring Biological Diversity (Blackwell Publishing, 2004).
Soul, L. C. & Benson, R. J. Developmental mechanisms of macroevolutionary change in the tetrapod axis: a case study of Sauropterygia. Evolution 71, 1164–1177 (2017).
Article PubMed PubMed Central Google Scholar
Mangiafico, S. S. An R Companion for the Handbook of Biological Statistics Version 1.3.2 (2015); rcompanion.org/documents/RCompanionBioStatistics.pdf
Garland, T. Jr., Dickerman, A. W., Janis, C. M. & Jones, J. A. Phylogenetic analysis of covariance by computer simulation. Syst. Biol. 42, 265–292 (1993).
Article Google Scholar
Hartigan, J. A. & Hartigan, P. M. The dip test of unimodality.Ann. Stat 13, 70–84 (1985).
Article Google Scholar
Revell, L. J. Two new graphical methods for mapping trait evolution on phylogenies. Methods Ecol. Evol. 4, 754–759 (2013).
Article Google Scholar
Schultz, A. H. & Straus, W. L. Jr. The numbers of vertebrae in primates. Proc. Am. Philos. Soc. 89, 601–626 (1945).
CAS PubMed Google Scholar
Burt, A. Comparative methods using phylogenetically independent contrasts. Oxf. Surv. Evol. Biol. 6, 33–53 (1989).
Google Scholar
Orme, D. et al. The caper package: comparative analysis of phylogenetics and evolution in R. R package version 5 https://mirror.rcg.sfu.ca/mirror/CRAN/web/packages/caper/ (2013).
Hunt, G. & Carrano, M. T. Models and methods for analyzing phenotypic evolution in lineages and clades. Paleontol. Soc. Pap. 16, 245–269 (2010).
Article Google Scholar
Alroy, J. Understanding the dynamics of trends within evolving lineages. Paleobiology 26, 319–329 (2000).
Article Google Scholar
Marcot, J. D. & McShea, D. W. Increasing hierarchical complexity throughout the history of life: phylogenetic tests of trend mechanisms. Paleobiology 33, 182–200 (2007).
Article Google Scholar
McShea, D. W. Metazoan complexity and evolution: is there a trend? Evolution 50, 477–492 (1996).
PubMed Google Scholar
Diniz-Filho, J. A. F., Rangel, T. F. L., Bini, L. M. & Hawkins, B. A. Macroevolutionary dynamics in environmental space and the latitudinal diversity gradient in New World birds. Proc. Biol. Sci. 274, 43–52 (2007).
PubMed Google Scholar
Verdú, M. Tempo, mode and phylogenetic associations of relative embryo size evolution in angiosperms. J. Evol. Biol. 19, 625–634 (2006).
Article PubMed Google Scholar
Maurer, B. A. The evolution of body size in birds. I. Evidence of non-random diversification. Evol. Ecol. 12, 925–934 (1998).
Article Google Scholar
Fox, J. Applied Regression Analysis, Linear Models, and Related Models (Sage, 1997).
Ghosh, A., Mandal, A., Martín, N. & Pardo, L. Influence analysis of robust Wald-type tests. J. Multivar. Anal. 147, 102–126 (2016).
Article Google Scholar
Baker, J., Meade, A., Pagel, M. & Venditti, C. Adaptive evolution toward larger size in mammals. Proc. Natl Acad. Sci. USA 112, 5093–5098 (2015).
Article CAS PubMed PubMed Central Google Scholar
Jones, K. E. New insights on equid locomotory evolution from the lumbar region of fossil horses. Proc. Biol. Sci. 283, 20152947 (2016).
PubMed PubMed Central Google Scholar
Buchholtz, E. A., Booth, A. C. & Webbink, K. E. Vertebral anatomy in the Florida manatee, Trichechus manatus latirostris: a developmental and evolutionary analysis. Anat. Rec. 290, 624–637 (2007).
Article Google Scholar
Gasc, J. P., Jouffroy, F. K., Renous, S. & Blottnitz, F. V. Morphofunctional study of the digging system of the Namib Desert golden mole (Eremitalpa granti namibensis): cinefluorographical and anatomical analysis. J. Zool. 208, 9–35 (1986).
Article Google Scholar
Fischer, M. S. Crouched posture and high fulcrum, a principle in the locomotion of small mammals: the example of the rock hyrax (Procavia capensis) (Mammalia: Hyracoidea). J. Hum. Evol. 26, 501–524 (1994).
Article Google Scholar
Vizcaíno, S. F. & Loughry, W. J. Biology of the Xenarthra (Univ. Press of Florida, 2008).
Goswami, A. & Friscia, A. (eds) Carnivoran Evolution: New Views on Phylogeny, Form and Function Vol. 1 (Cambridge Univ. Press, 2010).
Pierce, S. E., Clack, J. A. & Hutchinson, J. R. Comparative axial morphology in pinnipeds and its correlation with aquatic locomotory behaviour. J. Anat. 219, 502–514.
Larkey, D. J., Datwyler, S. L. & Lancaster, W. C. in Evolutionary History of Bats (eds Gunnell, G. F. & Simmons, N. B.) 500–529 (Cambridge Univ. Press, 2012).
López-Aguirre, C., Hand, S. J., Koyabu, D., Son, N. T. & Wilson, L. A. Postcranial heterochrony, modularity, integration and disparity in the prenatal ossification in bats (Chiroptera). BMC Evol. Biol. 19, 75 (2019).
Anderson, S. C. & Ruxton, G. D. The evolution of flight in bats: a novel hypothesis. Mamm. Rev. 50, 426–439 (2020).
Article Google Scholar
Simmons, N. B., Seymour, K. L., Habersetzer, J. & Gunnell, G. F. Primitive Early Eocene bat from Wyoming and the evolution of flight and echolocation. Nature 451, 818–821 (2008).
Article CAS PubMed Google Scholar
Schliep, K. phangorn: phylogenetic analysis in R. Bioinformatics 27, 592–593 (2011).
Article CAS PubMed Google Scholar
Kuhner, M. K. & Felsenstein, J. A simulation comparison of phylogeny algorithms under equal and unequal evolutionary rates. Mol. Biol. Evol. 11, 459–468 (1994).
CAS PubMed Google Scholar
Robinson, D. F. & Foulds, L. R. Comparison of phylogenetic trees. Math. Biosci. 53, 131–147 (1981).
Article Google Scholar
Yu, L., Rajan, V. & Moret, B. M. E. A metric for phylogenetic trees based on matching. IEEE/ACM Trans. Comput. Biol. Bioinf. 9, 1014–1022 (2011).
Google Scholar
Paradis, E. & Schliep, K. ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics 35, 526–528 (2019).
Article CAS PubMed Google Scholar
Kembel, S. et al. Picante: R tools for integrating phylogenies and ecology. Bioinformatics 26, 1463–1464 (2010).
Article CAS PubMed Google Scholar
Yu, G., Smith, D., Zhu, H., Guan, Y. & Lam, T. T. ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Meth. Ecol. Evol. 8, 28–36 (2017).
Article Google Scholar
Fox, J. & Weisberg, S. An R Companion to Applied Regression 3rd edn (Sage, 2019).
Frerebeau, N. tabula: an R package for analysis, seriation, and visualization of archaeological count data. J. Open Source Soft. 4, 1821 (2019).
Article Google Scholar
Adler, D. & Kelly, S. T. vioplot: violin plot. R package version 0.3.6 https://cran.r-project.org/web/packages/vioplot/index.html (2021).
Wickham, H. ggplot2: Elegant Graphics for Data Analysis (Springer-Verlag, 2016).
Venables, W. N. & Ripley, B. D. Modern Applied Statistics with S 4th edn (Springer, 2002).
Searle, S. R., Speed, F. M. & Milliken, G. A. Population marginal means in the linear model: an alternative to least squares means. Am. Stat. 34, 216–221 (1980).
Google Scholar
Revell, L. J. phytools: an R package for phylogenetic comparative biology (and other things). Meth. Ecol. Evol. 3, 217–223 (2012).
Article Google Scholar
Brocklehurst, N. & Benson, R. J. Multiple paths to morphological diversification during the origin of amniotes. Nat. Ecol. Evol. 5, 1243–1249 (2021).
Article PubMed Google Scholar
Felsenstein, J. Phylogenies and the comparative method. Am. Nat. 125, 1–15 (1985).
Article Google Scholar
Galton, F. Regression towards mediocrity in hereditary stature. J. Anthropol. Inst. Gt Brit. Irel. 15, 246–263 (1886).
Google Scholar
Kelly, C. & Price, T. D. Correcting for regression to the mean in behavior and ecology. Am. Nat. 166, 700–707 (2005).
Article PubMed Google Scholar
Bapst, D. W. paleotree: an R package for paleontological and phylogenetic analyses of evolution. Meth. Ecol. Evol. 3, 803–807 (2012).
Article Google Scholar
Rambaut, A., Drummond, A. J., Xie, D., Baele, G. & Suchard, M. A. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst. Biol. 67, 901–904 (2018).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We are grateful to the John Templeton Foundation (grant no. 61408 to M.W. and M.R.), BBSRC (grant nos BB/K015702/1 and BB/K006754/1 to M.W.) and NERC (studentship 2276912 to A.B. and M.W.) for supporting this research. For the purpose of Open Access, the authors have applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission. We thank R. Benson (University of Oxford) for providing insightful remarks on an earlier draft of this work and for allowing us to use the three-dimensional reconstructions of the vertebral columns in Fig. 1. Such reconstructions were skilfully rendered from computerized tomography scans by E. Griffiths (University of Oxford). S. Wang (Swarthmore College) kindly supplied a copy of his skewness test. We extend our gratitude to L. Barber for her stalwart efforts in locating important literature sources and O. Wills for her beautiful rendition of Fig. 1. We also thank T. Michael Keesey for obtaining the icons in Fig. 2 from PhyloPic.

Author information

Authors and Affiliations

Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, UK
Yimeng Li, Andrew Brinkworth, Jack Oyston & Matthew Wills
Nanjing Institute of Geology and Palaeontology, CAS, Nanjing, China
Yimeng Li
Joseph Banks Laboratories, Department of Life Sciences, University of Lincoln, Lincoln, UK
Emily Green & Marcello Ruta

Authors

Yimeng Li
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Brinkworth
View author publications
You can also search for this author in PubMed Google Scholar
Emily Green
View author publications
You can also search for this author in PubMed Google Scholar
Jack Oyston
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Wills
View author publications
You can also search for this author in PubMed Google Scholar
Marcello Ruta
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.R., M.W. and Y.L. conceived the project. E.G., Y.L. and M.R. collected the data. A.B., E.G., Y.L., J.O., M.R. and M.W. conceptualized analytical and statistical protocols. A.B., J.O. and M.R. supplied codes. A.B., E.G., Y.L., J.O. and M.R. analysed the data. M.R. tabulated results, produced figures and tables and wrote an initial draft of the manuscript. All authors contributed equally to the final draft of the work.

Corresponding authors

Correspondence to Matthew Wills or Marcello Ruta.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Ecology & Evolution thanks Katrina Jones, Stephanie Pierce and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Continuous mapping of complexity indices across the phylogeny.

The values of three complexity indices are mapped onto the phylogeny using colour gradients. Index values at the internal nodes are estimated through maximum likelihood, and those along the branches are interpolated between the nodal estimates. For each index, the colour scales range from its minimum to its maximum value. The lengths of the scale bars are in millions of years. a, Presacral Brillouin index. b, Presacral evenness index. c, Logit thoracic:lumbar ratio.

Source data

Extended Data Fig. 2 Bivariate plots of node estimates of complexity indices vs. node ages and descendant–ancestor differences vs. node estimates.

Robust linear regressions between maximum likelihood node estimates of two complexity indices and node ages (a, c), and between descendant–ancestor differences (corrected for the regression to the mean) and node estimates (b, d). The brown solid lines are regression lines. The black dashed horizontal lines separate positive (increases) from negative (decreases) descendant–ancestor differences. Node ages are in millions of years, with time decreasing in the positive direction of the axis (that is, closer to the present). Mammal groups are identified by distinct colours and symbols. Black circles denote the deepest nodes of the phylogeny, corresponding to the separation between major mammal cohorts. a-b, Presacral Brillouin index. c-d, Presacral evenness index.

Source data

Extended Data Fig. 3 Subclade tests in selected mammal groups.

Results of skewness partitioning tests applied to the thoracolumbar Brillouin and evenness indices in various mammal groups. In each plot, the probability density distributions of the index values for each subclade are shown by colour-coded thin lines, whereas the index distribution for the entire group is shown by a thick black curve. The mean values of the individual subclades are represented by colour-coded circles, whereas the mean value of the entire distribution is marked by a black vertical bar. a-b, Euarchonta. c-d, Carnivora. e-f, Cetartiodactyla. g-h, Afrotheria. i-j, Chiroptera. Image credits for mammal silhouettes are as in Fig. 1 of the main text.

Source data

Extended Data Fig. 4 Shifts in rates of complexity change for the thoracolumbar region.

Colour-coded evolutionary rates and shifts mapped onto the phylogeny. Grey branches exhibit background rates. Maroon and steelblue branches exhibit rates that are, respectively, higher and lower than the background rates. Colour intensity is proportional to the rate values, with darker tones indicating a greater difference between background and non-background rates. The circles mark the locations of rate shifts. Circle sizes are drawn in proportion to the posterior Bayesian probability of shifts. Circle colours represent shift magnitude, with darker maroon (respectively, steelblue) tone indicating a shift of greater magnitude towards a rate increase (respectively, decrease) relative to the rates of adjacent branches. a, Thoracolumbar Brillouin index. b, Thoracolumbar evenness index. Image credits for mammal silhouettes are as in Fig. 1 of the main text.

Source data

Extended Data Fig. 5 Shifts in rates of complexity change for the presacral region and thoracolumbar domains.

Colour-coded evolutionary rates and shifts mapped onto the phylogeny. For explanations of colours and symbols, see caption of Extended Data Fig. 4. a, Presacral Brillouin index. b, Presacral evenness index. c, Unstandardized thoracic:lumbar ratio. d, Logit-transformed thoracic:lumbar ratio.

Source data

Supplementary information

Supplementary Information

Supplementary Results, Discussion and Figs. 1–24.

Reporting Summary

Peer Review File

Supplementary Tables

Table 1: results of Poisson regressions; Table 2: phylogenetic ANOVAs; Table 3: ‘brunch’ and PGLS analyses; Table 4: robust linear regressions; and Table 5: skewness tests.

Supplementary Data 1 and 2

Presacral counts, complexity indices and index distributions and statistics (1) and species reference list (2).

Supplementary Data 3

Time-calibrated phylogeny of 1,136 extant mammal species. The file is formatted as an object of class ‘phylo’ readable in R. It can be opened in WORDS and saved as a text file, after replacing the ‘.R’ suffix with ‘.nex’.

Supplementary Data 4

R codes.

Source data

Source Data Fig. 3

Tabulations of six complexity indices re-ordered by group. Each tabulation is reported in a separate Excel sheet.

Source Data Fig. 4

Tabulations of thoracolumbar counts and three thoracolumbar complexity indices re-ordered by group. Each tabulation is reported in a separate Excel sheet.

Source Data Fig. 5

Tabulations of thoracolumbar, thoracic and lumbar counts, as well as thoracic:lumbar ratios, re-ordered by group. Each tabulation is reported in a separate Excel sheet.

Source Data Fig. 6

Tabulations of ancestral values, ancestor–descendant differences and node ages for four thoracolumbar complexity indices re-ordered by group.

Source Data Extended Data Fig. 1

Tabulations of two presacral and one thoracolumbar complexity indices re-ordered by group. Each tabulation is reported in a separate Excel sheet.

Source Data Extended Data Fig. 2

Tabulations of ancestral values, ancestor–descendant differences and node ages for two presacral complexity indices re-ordered by group. Each tabulation is reported in a separate Excel sheet.

Source Data Extended Data Fig. 3

Tabulations of two thoracolumbar complexity indices in five major groups re-ordered by subclade. Each tabulation is reported in a separate Excel sheet.

Source Data Extended Data Fig. 4

Tabulations of two thoracolumbar complexity indices re-ordered by group. Each tabulation is reported in a separate Excel sheet.

Source Data Extended Data Fig. 5

Tabulations of two presacral and two thoracolumbar complexity indices re-ordered by group. Each tabulation is reported in a separate Excel sheet.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, Y., Brinkworth, A., Green, E. et al. Divergent vertebral formulae shape the evolution of axial complexity in mammals. Nat Ecol Evol 7, 367–381 (2023). https://doi.org/10.1038/s41559-023-01982-5

Download citation

Received: 31 January 2022
Accepted: 03 January 2023
Published: 06 March 2023
Issue Date: March 2023
DOI: https://doi.org/10.1038/s41559-023-01982-5

This article is cited by

The neck as a keystone structure in avian macroevolution and mosaicism
- Ryan D. Marek
- Ryan N. Felice
BMC Biology (2023)
Bird clades with less complex appendicular skeletons tend to have higher species richness
- Andrew Brinkworth
- Emily Green
- Matthew A. Wills
Nature Communications (2023)
How to weigh a fossil mammal? South American notoungulates as a case study for estimating body mass in extinct clades
- Allison Nelson
- Russell K. Engelman
- Darin A. Croft
Journal of Mammalian Evolution (2023)

Subjects

Abstract

Similar content being viewed by others

Main

Complexity indices and reference phylogeny

Results

Vertebral counts differ significantly between groups

Complexity is distributed unevenly across groups

Changes in complexity are concentrated in younger branches

TL counts correlate negatively with T:L ratios

TL counts correlate positively with complexity

Thoracolumbar complexity and domains

Changes in complexity are both directional and sustained

Rate shifts reveal divergent patterns between clades

Discussion

Methods

Data collection

Phylogeny construction

Complexity indices

Poisson regressions and phylogenetic analyses of variance

Independent contrasts and phylogenetic generalized least squares analyses

Continuous trait mapping

Descendant–ancestor tests

Subclade tests

Analyses of rates

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links