Reply to: ‘Reconstructed evolutionary patterns from crocodile-line archosaurs demonstrate the impact of failure to log-transform body size data’

Stockdale, Maximilian T.; Benton, Michael J.

doi:10.1038/s42003-022-03072-x

Download PDF

Matters Arising
Open access
Published: 25 February 2022

Reply to: ‘Reconstructed evolutionary patterns from crocodile-line archosaurs demonstrate the impact of failure to log-transform body size data’

Communications Biology volume 5, Article number: 170 (2022) Cite this article

1458 Accesses
5 Altmetric
Metrics details

Subjects

The Original Article was published on 25 February 2022

replying to R. Benson et al. Communications Biology https://doi.org/10.1038/s42003-022-03071-y (2022)

In our recent analysis of body size evolution in the Pseudosuchia¹, we concluded a variable rate model significantly outperforms a random walk, and that evolutionary rates show interactions between body size evolution and the environment. Benson et al.² express concern that these findings are inconsistent with previous work^3,4, which have found an Ornstein-Uhlenbeck (OU) model to be well supported. They attribute this to our not having used a log transformation, and propose that this undermines our findings due to the effects of relative scaling. Benson et al. raise some important points; however, there is insufficient evidence to accept the revised conclusions that they propose. In this revision of our analysis, we conclude that there are too few exceptionally large taxa to change the outcomes of our analysis, and that the strength of any Ornstein–Uhlenbeck process is negligible. Simulations replicate the findings of Benson et al using random data, suggesting log transformation may inflate and suppress some model likelihoods.

Our analysis has a number of features that distinguish it from previous publications. Our phylogenetic tree incorporates molecular data, causing substantive changes to the topology of the crocodile crown-group⁵. Models have also been fitted using a Bayesian model-fitting algorithm. This makes comparisons with previous work difficult. Previous publications have not been consistent in recovering support for the OU model, depending on what body size proxy has been used³. Therefore the claim by Benson et al. that our analysis is not consistent with previous publications is difficult to justify.

We concede that the high evolutionary rates observed in the largest taxa are likely to be a result of scaling bias, and this is a more likely explanation than was speculated in our manuscript. However, the range is not a meaningful measure of variance in a normal distribution, and common ancestors must also be included. In practice, the fraction of very large taxa is small. To demonstrate this we examined the distributions of two well-represented characters from our original dataset (Supplementary Data 1), skull width and the length from the posterior-most point of the supraoccipital to the anterior-most point of the frontal. We added estimated common ancestors using an ancestral state reconstruction, using the APE library for R⁶. The distributions of these characters reveal that 80% of skull width measures, and 83% of frontal-supraoccipital length measures, were within 10cm of the median. Therefore, the overall variance within the body size data is modest.

Figure 1a of Benson et al. shows a divergence of transformed and untransformed metrics from a linear relationship. However, much of this divergence is driven by a minority of extremely large taxa (Supplementary Information Document 1). Exceptionally large taxa are too few in number to significantly change the outcomes of our analysis. This can be demonstrated by replicating our analysis with the exceptionally large and small taxa removed. We created a new dataset with the largest 20% of taxa and the smallest 10% of taxa removed. Random walk and variable rates models were then fitted to this revised dataset using BayesTraits version 3⁷. These models were then compared using Bayes factors. The variable rate model yielded a Bayes factor of 41 when compared to Brownian motion, indicating strong support for the variable rate model. Therefore support for the variable rate model cannot be attributed to scaling bias in exceptionally large taxa. Plotting branch rates on a phylogenetic tree (Fig. 1) reveals a rate pattern similar to our original analysis¹, and our original observations still apply. In particular, evolutionary rate shifts do not appear to be associated with phylogenetic groups. Instead, there is a background of low evolutionary rates, punctuated by discrete increases in evolutionary rate. This fully supports our original conclusion that body size evolution follows a pattern of punctuated equilibrium.

**Fig. 1: Variable evolutionary rates plotted on a phylogenetic tree, derived from a dataset removed of exceptionally large and small taxa.**

The revised body size dataset was used to plot a time series of body size variance (Fig. 2), comparable with Fig. 3 from our original manuscript¹. This revised time series shows striking similarities with the original across all three approaches to curve estimation. Our manuscript describes periods where body size variance remains steady, punctuated by steps up and down in variance during the Late Triassic, Middle Jurassic, Palaeogene and Neogene. These features of our original curve are clearly visible in the revised curve shown in Fig. 2. Therefore the distribution of body size variance through time observed in our analysis is not an artefact caused by including exceptionally large taxa. The distribution of variance through time shown in Fig. 2c shows periods of relative stability, interrupted occasionally by sharp increases and decreases in body size disparity. This strongly supports the pattern of punctuated equilibrium proposed in our original manuscript¹.

**Fig. 2: Body size variance through time estimated using a dataset removed of exceptionally large taxa.**

Fig. 3: Demonstrating the effects of log transformations on support for the variable rates and OU models using random data. Centre lines of each series indicate the median; boxes in grey show upper and lower quartiles.

The comments by Benson et al. make no reference to published concerns about the OU model, in particular its propensity to give false positive results². A previous analysis⁸ has recommended that OU models should not be fitted to datasets with fewer than 200 taxa without a Bayesian model-fitting approach. Not all previous publications meet these criteria³. Even so, false-positive rates of 100% have been reported in trees with as many as 1000 taxa when there are high rates of measurement error⁸, even with a Bayesian model-fitting approach. Body size measurement error in fossil datasets can be assumed to be high due to the effects of digenesis and other factors such as imputation error, morphological variation, ontogeny and sexual dimorphism. The OU model includes a parameter to indicate the strength of selection towards an optimum trait value, known as alpha. The α-value can range from 0 to infinity⁸. We replicated the analysis as described by Benson et al., using log-transformed measurements and fitted an OU model using Bayestraits. The mean α-value across all iterations of the MCMC chain came to 0.016. It has been advised⁸ that the α-value should be expressed relative to the total height of the tree and expressed as the phylogenetic half-life⁹. An α-value of 0.016 is equal to a half-life of 43.3 million years. This is relatively short compared to the phylogenetic tree, superficially suggesting a realistic OU process. However, this is considerably longer than the average branch length, around 10 million years, and longer than 89% of all branches on the tree. Notwithstanding the risk of false-positive results, the contribution of an OU process is clearly extremely slow. Further, this half-life is as long, or considerably longer than major subclades within the Pseudosuchia, such as the Notosuchia and Thalattosuchia. These groups have highly distinctive ecomorphologies, and it is unrealistic to assume that OU parameters would be continuous between them.

Log-transformation changes the distribution and variance of data. Benson et al make an assumption that this transformation does not inherently promote or suppress support for particular evolutionary regimes. The effects of log transformations on model likelihoods can be explored using simulations. We simulated ten random phylogenetic trees with 200 tips each, and ten sets of random trait data with an arbitrary mean trait value of 200, and standard deviations ranging from 5 to 50 (Supplementary Data 2). These random trait data represent a hypothetical continuous character. They do not represent body size specifically, and these simulations do not make inferences about body size evolution. Log transformations were then applied to duplicates of these trait datasets. We fitted random walk, variable rates and OU models to each tree using BayesTraits, once using the untransformed trait data, and repeated using logged trait data. The performance of the variable rates and OU models relative to the random walk model were compared using Bayes factors (Fig. 3). The Bayes factor of the variable rate model using logged data is lower than that of the untransformed data. This suggests that log transformation can suppress support for the variable rate model. The Bayes factors of OU models fitted using untransformed data show only modest support relative to a random walk. By contrast, the Bayes factors of models fitted using logged data show very strong support for the OU model over a random walk. This is despite the data being random and not generated using an Ornstein–Uhlenbeck process. Therefore the lack of support for the variable rate model and increased support for the OU model described by Benson et al. seems likely to be a direct result of the log transformation. These simulations do not question the efficacy of log transformation in correcting scaling bias. Nor do they suggest that scaling should not be accounted for when fitting phylogenetic models. However, they do suggest that OU models in particular should be only be accepted with caution when using log-transformed traits.

The remarks by Benson et al. have highlighted the importance of scaling in the analysis of body size evolution. We concede that scaling was not sufficiently discussed in our analysis, and that this explains the disproportionately high rates observed in very large taxa. We also concede that our rationale for not log transforming the data was incomplete. However, there are too few very large taxa to significantly change our conclusions, and there is not sufficient evidence to accept the alternative conclusions proposed.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All the data used in this analysis are included in the supplementary data files included with this publication.

References

Stockdale, M. T. & Benton, M. J. Environmental drivers of body size evolution in crocodile-line archosaurs. Commun. Biol. 4, 38 (2021).
Article Google Scholar
Benson, R. B. J., Godoy, P., Bronzati, M., Butler, R. & Gearty, W. Reconstructed evolutionary patterns for crocodile-line archosaurs demonstrate impact of failure to log-transform body size data. Commun. Biol. in press (2022).
Godoy, P. L., Benson, R. B. J., Bronzati, M. & Butler, R. The multi–peak adaptive landscape of crocodylomorph body size evolution. BMC Evol. Biol. 19, 167 (2019).
Article Google Scholar
Gearty, W. & Payne, J. L. Physiological constraints on body size distributions in Crocodyliformes. Evolution 74, 245–255 (2020).
Article Google Scholar
Lee, M. S. Y. & Yates, A. M. Tip-dating and homoplasy: reconciling the shallow molecular divergences of modern gharials with their long fossil record. Proc. R. Soc. B. 285, 20181071 (2018).
Article Google Scholar
Paradis, E. & Schliep, K. 2018. Ape 5.0: an environment for modern phylogenetics and evolutionary analysis in R. Bioinformatics 35, 526–528 (2018).
Article Google Scholar
Pagel, M. & Meade, A. Bayesian analysis of correlated evolution of discrete characters by reversible–jump Markov chain Monte Carlo. Am. Nat. 167, 808–825 (2006).
Article Google Scholar
Cooper, N., Thomas, G. H., Venditti, C., Meade, A. & Freckleton, R. P. A cautionary note on the use of Ornstein Uhlenbeck models in macroevolutionary studies. Biol. J. Linn. Soc. 118, 64–77 (2016).
Article Google Scholar
Slater, G. J. Iterative adaptive radiations of fossil canids show no evidence for diversity-dependent trait evolution. Proc. Natl Acad. Sci. USA 112, 4897–4902 (2015).

Download references

Author information

Authors and Affiliations

School of Geographical Sciences, University of Bristol, Bristol, BS8 1RL, UK
Maximilian T. Stockdale
School of Earth Sciences, University of Bristol, Bristol, BS8 1RL, UK
Michael J. Benton

Authors

Maximilian T. Stockdale
View author publications
You can also search for this author in PubMed Google Scholar
Michael J. Benton
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.T.S. designed and implemented the analysis and wrote the manuscript. M.J.B. provided consultation and revised the MS.

Corresponding author

Correspondence to Maximilian T. Stockdale.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Biology thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editor: Caitlin Karniski.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Stockdale, M.T., Benton, M.J. Reply to: ‘Reconstructed evolutionary patterns from crocodile-line archosaurs demonstrate the impact of failure to log-transform body size data’. Commun Biol 5, 170 (2022). https://doi.org/10.1038/s42003-022-03072-x

Download citation

Received: 08 April 2021
Accepted: 25 January 2022
Published: 25 February 2022
DOI: https://doi.org/10.1038/s42003-022-03072-x

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.