Reply to: Available data do not rule out Ctenophora as the sister group to all other Metazoa

Redmond, Anthony K.; McLysaght, Aoife

doi:10.1038/s41467-023-36152-5

Download PDF

Matters Arising
Open access
Published: 10 February 2023

Reply to: Available data do not rule out Ctenophora as the sister group to all other Metazoa

Nature Communications volume 14, Article number: 710 (2023) Cite this article

1961 Accesses
1 Citations
2 Altmetric
Metrics details

Subjects

The Original Article was published on 10 February 2023

replying to N. V. Whelan & K. M. Halanych Nature Communications https://doi.org/10.1038/s41467-023-36151-6 (2023)

In Redmond and McLysaght¹ (R&M) we integrated precomputed site-heterogeneous amino acid substitution models and amino acid recoding into partitioned phylogenomics and showed that this improved model-fit and resistance to long-branch attraction (LBA; a common phylogenetic error). These better fitting modelling strategies revealed a shift in support away from Ctenophora (comb jellies), and towards Porifera (sponges), as sister to all other animals¹. Whelan and Halanych² (W&H) criticised our recoded analyses and interpretations on the placement of Porifera and Ctenophora in the animal phylogeny. Here we counter these criticisms, and bolster evidence for Porifera as sister to all other animals.

W&H claim we did not adequately consider well-accepted relationships (called ‘generally accepted topologies’ in R&M¹) when assessing the performance of new approaches in R&M². The site-heterogeneous models and recoding strategy that we employed have been tested and used elsewhere (see R&M¹), and we deemed further validation redundant. However, we did test the ability of partitioned site-heterogeneous models and recoding to combat specific and well-characterised LBA problems in real datasets^3,4 (see R&M¹), finding that both improved upon standard partitioned phylogenomics¹. Although we agree that new approaches should recover well-accepted clades, W&H’s argument against our approaches (particularly recoding), based on failure to recover Chordata (LEA[N/P] datasets¹) and Deuterostomia (WEA17 dataset and reduced support in BEA dataset¹), is weak. Whether deuterostomes are monophyletic is currently unresolved⁵, meaning deuterostome non-monophyly cannot convincingly cast doubt on our site-heterogeneous, recoded analyses of WEA17 (including recovery of Porifera-sister) or BEA. The LEA(N/P) datasets were not designed to assess chordate monophyly⁴, and Chordata was never recovered with strong support in any of our¹, or W&H’s², LEAN/LEAP analyses, but neither was any alternative¹. Interestingly, despite each being expected to improve phylogenetic inference, (i) using closer-related outgroups has also disrupted Chordata in some past analyses of these datasets⁴, and (ii) partitioning-by-gene outperforms W&H’s better fitting partition scheme in recovering Chordata (W&H Fig. 1 in ref. ²). Conspicuously, the problematic lineages, Ambulacraria and Cephalochordata, are represented by a single species each, and improved taxon sampling resolves this issue for a related dataset⁶. Thus, these datasets appear to harbour little signal either for or against Chordata, rather than recoding causing an inference problem. Given the numerous potential lineage-specific dataset and biological biases that can arise, we do not believe that data insufficient to recover one off-target clade with available modelling strategies cannot reliably be used to assess target relationships.

**Fig. 1: Unpartitioned recoding reanalyses and ancestral ctenophore branch lengths.**

As W&H noted, our site-heterogeneous recoded analyses showed poor resolution and inconsistencies between REA, WEA15 and WEA17, an issue they ascribe to recoding². Pertinently, our site-homogeneous recoded analyses (for which GTR-based models fit best) do not suffer from this issue, and thus, in R&M we proposed that combining partitioning and recoding had reduced the number of complex alignment site patterns per partition such that simple F81 exchangeabilities fit better than GTR exchangeabilities when using site-heterogeneous models¹. We test this here with unpartitioned SR4-recoded reanalyses of these datasets, as this might provide enough data for models that are both site-heterogeneous and GTR-based to fit best. As expected, topologies and branch supports are largely consistent with those recovered in our original RL2 analyses (R&M Fig. 3c in ref. ¹) under the best-fitting unpartitioned site-heterogeneous F81 models (top row in Fig. 1a). However, site-heterogeneous GTR-based models fit best under AIC for all three datasets, and under BIC for WEA17. As anticipated, these GTR-based reanalyses recover highly consistent relationships between the five major animal lineages across datasets (bottom row in Fig. 1a) and with previous studies^7,8,9, and all three support Porifera-sister (UFBOOT: REA = 100%, WEA15 = 83%, WEA17 = 99%; Fig. 1a). Together with Ctenophora-sister being recovered in R&M when using recoding with less well fitting site-homogeneous models¹, these new findings clearly negate W&H’s argument that inappropriate recoding drives Porifera-sister in R&M. The issues they note are instead ascribable to using site-heterogeneous models with simple F81 (rather than GTR) exchangeabilities in partitioned analyses of recoded data, accounting for which reinforces Porifera-sister.

W&H favour the WEA17 dataset as it includes more ctenophore species². While improved ingroup taxon sampling can undoubtedly improve phylogenetic inference, it is noteworthy that the extra species do not break the long branch leading to the Ctenophora clade—the very branch which is purported to cause LBA (see also⁸). This branch is longer in WEA17 than in REA or WEA15 (Fig. 1b). Regardless of whether this increased branch length is (i) an improvement resulting from better ctenophore taxon sampling and/or (ii) results from other factors (such as the different gene families, alignment sites, or errors in each dataset) it nonetheless indicates that if Ctenophora-sister arises from LBA, then it will be harder to overcome for WEA17. This is consistent with our findings in R&M¹, where the shift (as better-fitting models are applied) towards supporting Porifera sister is recovered more slowly for WEA17 (only apparent at the partition-specific level when not recoding), and most easily for WEA15 (which has the shortest ctenophore branch). This may explain why Porifera-sister is only recovered for WEA17 when the data are recoded^1,8. Thus, W&H’s emphasis on ctenophore sampling inadvertently prioritizes a dataset (WEA17) with increased potential for ctenophore LBA. This is a major concern given that Ctenophora-sister is far more likely than Porifera-sister to be erroneously recovered in simulations affected by LBA-inducing systematic error¹⁰.

W&H claim we were unfairly critical of previous studies². First, we disagree that we inappropriately dismissed Hernandez and Ryan’s¹¹ concerns about recoding given our above points on the recovery of Deuterostomia and Chordata in our test datasets and evidence that W&H’s issues with recoded analyses in R&M do not in fact derive from recoding. Other simulation studies have supported recoding¹², or are at least ambivalent¹³, and as we advocated in R&M, ‘a fuller understanding of the implications of recoding is needed’¹. Second, our claim that REA and WEA15 contain paralog contamination referenced other work⁷ and personal communication was limited to WEA17¹, which has now been shown to support Porifera-sister without recoding when orthogroups with poor orthologous signal (i.e., inability to recover major animal lineages at the gene tree level) are excluded¹⁴. We concede that personal communication was less than ideal, particularly as sorting orthologs from paralogs is at least somewhat dependent on the approach employed/investigator interpretation, but our comment also referred to the relatively high percentage of missing data for ctenophores in WEA17, which is directly observable in the dataset. W&H contend that if these datasets contain paralog contamination it would invalidate the findings in R&M², yet it also would invalidate the original findings favouring Ctenophora-sister, a hypothesis derived from phylogenomics. Furthermore, phylogenomic analyses with site-homogeneous models, from which the strongest evidence and support for Ctenophora-sister emerges¹, appear more easily misled by orthology errors than site-heterogeneous approaches¹⁵, with which we observe a shift towards support for Porifera-sister in R&M¹. Lastly, we disagree that discussing W&H’s simulations¹⁶ comparing the Phylobayes¹⁷ CAT¹⁸ model with partitioning is irrelevant, as although we did not directly use the CAT model, we did employ partitioning with previously defined, precomputed variants of CAT¹⁹ using IQ-tree²⁰ in R&M¹.

Our points above refute W&H’s arguments that our use of recoding was spurious and we strongly reject the notion that we do not accurately present results favouring Ctenophora-sister (branch and partition-specific support values were fully reported in R&M¹) and do not apply an ‘objective lens’ in our interpretations². Rather W&H’s assertion that ‘partitioning with linked branches and site-heterogeneous models recovered the Ctenophora-sister hypothesis’² disregards the clear pattern observed in R&M¹ for all datasets of increasing support for Porifera-sister (over Ctenophora-sister) as model fit increases, downplaying this as reduced support in ‘some’ datasets/analyses². In summary, our primary conclusions remain firmly intact.

Methods

Unpartitioned, site-heterogeneous, SR4-recoded animal phylogenomics

SR4²¹ recoded phylogenomic analyses were performed on the REA, WEA15, and WEA17 datasets from R&M¹ using IQ-tree version 1.6.12²⁰ and employing 1000 ultrafast bootstrap replicates²². All analyses performed here were unpartitioned in order to test whether poor resolution and inconsistent results between datasets are ascribable to (i) SR4 recoding (as contended by W&H²) or (ii) simple, flat F81 exchangeabilities (as proposed here and in R&M), which were always better-fitting than GTR exchangeabilities when combining partitioning, recoding and site-heterogeneous models in R&M¹. The logic behind this is that when a single model is applied to the entire phylogenomic dataset rather than separate models to each partition, then there may be enough data for site-heterogeneous models with GTR, rather than simpler F81, exchangeabilities to fit best when recoding is applied. ModelFinder²³ in IQ-tree was used to assess the best-fitting models under both the AIC and BIC. The site-homogeneous models F81 and GTR were tested, as well as pairings of each of these exchangeability matrices with SR4 recoded derivations (from R&M¹) of the site-heterogenous C10, C20, C30, C40, C50, and C60 precomputed CAT models¹⁹. All site-heterogeneous models also incorporated 4 discrete gamma categories to help accommodate rate heterogeneity across sites. For example, the model ‘SR4C60GTR’ (see naming as applied in Fig. 1a), is SR4 recoded, employs GTR exchangeabilities, the 60 site frequency categories from C60, and 4 discrete gamma categories for rate heterogeneity. Each dataset was analysed under the best fitting GTR-based model (GTR-based models always fit best under AIC), as well as under the best-fitting F81-based model (best-fitting under BIC for REA and WEA15), enabling comparison of the resultant maximum likelihood consensus trees and support values between GTR-based and F81-based analyses, as well as with previous F81-based partitioned analyses performed in R&M¹.

Ancestral Ctenophora branch length as an LBA severity measure

The long ancestral ctenophore branch (i.e., the internal branch leading to the extant ctenophores in the animal tree of life) has been suggested to cause LBA between Ctenophora and non-animal outgroups, producing tree topologies supporting Ctenophora as sister to all other animals^1,7,8,10,12. We contend that the longer this branch is (i.e., the more substitutions per site along this branch) in a given dataset (branch length may vary due to substitution model applied, differing gene and site content, variation in alignment and orthology errors, etc.) the more difficult it will be to overcome potential LBA of Ctenophora towards the root of the animal tree when analysing that dataset. We extracted this branch length value from the maximum likelihood consensus trees resulting from standard partitioned phylogenomics of the REA, WEA15, and WEA17 datasets performed in R&M (i.e., named analysis level ‘L1’ in R&M¹). Branch lengths were plotted in Fig. 1b for comparison across datasets, both in ‘Raw’ form (as directly extracted from the consensus trees) and in ‘Normalised’ form (raw branch length divided by total tree length for that dataset, then multiplied by the average total tree length of the three datasets).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Datasets and tree files from our reanalyses are available at https://doi.org/10.6084/m9.figshare.16856152. Derivations of the C10-C60 precomputed CAT models for SR4 recoding are from R&M¹ and available at https://doi.org/10.6084/m9.figshare.12746972.

References

Redmond, A. K. & McLysaght, A. Evidence for sponges as sister to all other animals from partitioned phylogenomics with mixture models and recoding. Nat. Commun. 12, 1783 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Whelan, N. V. & Halanych, K. M. Available data do not rule out Ctenophora as the sister group to all other Metazoa. Nat Commun. https://doi.org/10.1038/s41467-023-36151-6.
Brinkmann, H., Van Der Giezen, M., Zhou, Y., De Raucourt, G. P. & Philippe, H. An empirical assessment of long-branch attraction artefacts in deep eukaryotic phylogenomics. Syst. Biol. 54, 743–757 (2005).
Article PubMed Google Scholar
Lartillot, N., Brinkmann, H. & Philippe, H. Suppression of long-branch attraction artefacts in the animal phylogeny using a site-heterogeneous model. BMC Evol. Biol. 7, S4 (2007).
Article PubMed PubMed Central Google Scholar
Kapli, P. et al. Lack of support for Deuterostomia prompts reinterpretation of the first Bilateria. Sci. Adv. 7, eabe2741 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Delsuc, F., Tsagkogeorga, G., Lartillot, N. & Philippe, H. Additional molecular support for the new chordate phylogeny. Genesis 46, 592–604 (2008).
Article PubMed Google Scholar
Simion, P. et al. A large and consistent phylogenomic dataset supports sponges as the sister group to all other animals. Curr. Biol. 27, 958–967 (2017).
Article CAS PubMed Google Scholar
Feuda, R. et al. Improved modeling of compositional heterogeneity supports sponges as sister to all other animals. Curr. Biol. 27, 3864–3870 (2017).
Article CAS PubMed Google Scholar
Laumer, C. E. et al. Support for a clade of Placozoa and Cnidaria in genes with minimal compositional bias. Elife 7, e36278 (2018).
Article PubMed PubMed Central Google Scholar
Kapli, P. & Telford, M. J. Topology-dependent asymmetry in systematic errors affects phylogenetic placement of Ctenophora and Xenacoelomorpha. Sci. Adv. 6, eabc5162 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Hernandez, A. M. & Ryan, J. F. Six-state amino acid recoding is not an effective strategy to offset compositional heterogeneity and saturation in phylogenetic analyses. Syst. Biol. 70, 1200–1212 (2021).
Article CAS PubMed PubMed Central Google Scholar
Giacomelli, M., Rossi, M. E., Lozano-Fernandez, J., Feuda, R. & Pisani, D. Resolving tricky nodes in the tree of life through amino acid recoding. iScience. 25, 105594 (2022).
Foster, P. G. et al. Recoding amino acids to a reduced alphabet may increase or decrease phylogenetic accuracy. Syst. Biol. Syac042, https://doi.org/10.1093/sysbio/syac042 (2022).
McCarthy, C. G. P., Mulhair, P. O., Siu-Ting, K., Creevey, C. & O’Connell, M. J. Improving orthologous signal and model fit in datasets addressing the root of the animal phylogeny. Mol. Biol. Evol. 40, msac276 (2022).
Siu-Ting, K. et al. Inadvertent paralog inclusion drives artifactual topologies and timetree estimates in phylogenomics. Mol. Biol. Evol. 36, 1344–1356 (2019).
Article CAS PubMed PubMed Central Google Scholar
Whelan, N. V. & Halanych, K. M. Who let the CAT out of the bag? Accurately dealing with substitutional heterogeneity in phylogenomic analyses. Syst. Biol. 66, 232–255 (2017).
CAS PubMed Google Scholar
Lartillot, N., Lepage, T. & Blanquart, S. PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating. Bioinformatics 25, 2286–2288 (2009).
Article CAS PubMed Google Scholar
Lartillot, N. & Philippe, H. A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process. Mol. Biol. Evol. 21, 1095–1109 (2004).
Article CAS PubMed Google Scholar
Si Quang, L., Gascuel, O. & Lartillot, N. Empirical profile mixture models for phylogenetic reconstruction. Bioinformatics 24, 2317–2323 (2008).
Article Google Scholar
Nguyen, L.-T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Article CAS PubMed Google Scholar
Susko, E. & Roger, A. J. On reduced amino acid alphabets for phylogenetic inference. Mol. Biol. Evol. 24, 2139–2150 (2007).
Article CAS PubMed Google Scholar
Minh, B. Q., Nguyen, M. A. T. & von Haeseler, A. Ultrafast approximation for phylogenetic bootstrap. Mol. Biol. Evol. 30, 1188–1195 (2013).
Article CAS PubMed PubMed Central Google Scholar
Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K., Von Haeseler, A. & Jermiin, L. S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat. Methods 14, 587–589 (2017).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

Despite reaching a clear impasse on phylogenetic approaches and interpretations we would like to extend our genuine thanks to Nathan V. Whelan and Kenneth M. Halanych for further contributing to and engaging us in this important debate in phylogenetics and animal evolution. This work was supported by funding from Irish Research Council Government of Ireland Postdoctoral Fellowship grant GOIPD/2021/466 to A.K.R. and by European Research Council grant agreement 771419 to A.McL.

Author information

Authors and Affiliations

Smurfit Institute of Genetics, Trinity College Dublin, Dublin, Ireland
Anthony K. Redmond & Aoife McLysaght

Authors

Anthony K. Redmond
View author publications
You can also search for this author in PubMed Google Scholar
Aoife McLysaght
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.K.R. designed and conducted analyses. A.K.R. and A.McL. wrote the manuscript.

Corresponding authors

Correspondence to Anthony K. Redmond or Aoife McLysaght.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Redmond, A.K., McLysaght, A. Reply to: Available data do not rule out Ctenophora as the sister group to all other Metazoa. Nat Commun 14, 710 (2023). https://doi.org/10.1038/s41467-023-36152-5

Download citation

Received: 23 October 2021
Accepted: 18 January 2023
Published: 10 February 2023
DOI: https://doi.org/10.1038/s41467-023-36152-5

This article is cited by

The origin and evolution of Wnt signalling
- Michaela Holzem
- Michael Boutros
- Thomas W. Holstein
Nature Reviews Genetics (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Reply to: Available data do not rule out Ctenophora as the sister group to all other Metazoa

Subjects

Methods

Unpartitioned, site-heterogeneous, SR4-recoded animal phylogenomics

Ancestral Ctenophora branch length as an LBA severity measure

Reporting summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Reporting Summary

Rights and permissions

About this article

Cite this article

This article is cited by

The origin and evolution of Wnt signalling

Comments

Evidence for sponges as sister to all other animals from partitioned phylogenomics with mixture models and recoding

Search

Quick links

Subjects

Methods

Unpartitioned, site-heterogeneous, SR4-recoded animal phylogenomics

Ancestral Ctenophora branch length as an LBA severity measure

Reporting summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Reporting Summary

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

The origin and evolution of Wnt signalling

Comments

Search

Quick links