Available data do not rule out Ctenophora as the sister group to all other Metazoa

Whelan, Nathan V.; Halanych, Kenneth M.

doi:10.1038/s41467-023-36151-6

Download PDF

Matters Arising
Open access
Published: 10 February 2023

Available data do not rule out Ctenophora as the sister group to all other Metazoa

Nature Communications volume 14, Article number: 711 (2023) Cite this article

2565 Accesses
5 Citations
16 Altmetric
Metrics details

Subjects

Matters Arising to this article was published on 10 February 2023

The Original Article was published on 19 March 2021

arising from A. K. Redmond & A. McLysaght Nature Communications https://doi.org/10.1038/s41467-021-22074-7 (2021)

Redmond and McLysaght (RM)¹ conclude that the position of Ctenophora as sister to all other animals is unsupported. Here, we contend that this conclusion is not consistent with their analyses. Close inspection of RM’s results and reanalyses indicate that when assessing phylogenetic inference methods using “known” relationships, RM did not discuss all widely-accepted relationships, some of which were unusual and cast doubt on their conclusions about method performance. RM also state that decreased support in some analyses means that the hypothesis of Ctenophora as sister to all other animals is unsupported by the data. However, less than 100% support in some analyses is not zero support, and Ctenophora-sister is the best supported hypothesis in most analyses.

RM examined various approaches to phylogenetic inference on three datasets (BEA, LEAP, LEAN). Accuracy of each approach was based on how well each method recovered benchmark, or widely accepted, relationships. To put these results in fuller context, we reanalyzed RM’s datasets using the 20% relaxed clustering partition testing approach with IQTREE (1.6.12)² and also considered a broader set of widely accepted relationships. Analyses from RM on WEA 17 were not redone, but we more carefully inspected their results on WEA17 using a broader set of widely accepted relationships. RM conclude that only SR4 data recoding (sensu³) will result in accurate relationships on the LEAP and LEAN dataset. However, the fact that Chordata was recovered as paraphyletic, in contrast to abundant data⁴, is not discussed. Both SR4 analyses on the LEAP dataset and one SR4 analyses on the LEAN dataset resulted in a non-monophyletic Chordata (Fig. 1; Supplementary Figs. 5, 7 in ref. ¹), indicating inaccuracy of SR4 recoding. Although SR4 recoding results in the accepted relationship of Arthropoda + Platyhelminthes with LEAP and fully supported Nematoda + Arthropoda with LEAN, the failure to recover monophyletic Chordata trades one inaccurate relationship for another. Furthermore, when support values and all possible widely accepted animal relationships⁴ are taken as a whole, non-recoded partitioning with site-heterogeneous models performed better than SR4 recoding on the BEA dataset (Fig. 1). Thus, justifying the use of recoding by citing the method’s inference of select relationships, while not considering its failure to recover other widely accepted relationships, is arbitrary. The inability of any method to recover all accepted relationships on the LEAP and LEAN datasets should call into question the utility of those datasets for assessing method performance.

**Fig. 1: Inference of benchmark relationships following phylogenetic approaches used by Redmond and McLysaght¹ with partitioning by gene and partitioning using 20% relaxed clustering.**

In dismissing past work indicating problems with amino acid recoding⁵, RM claim that failure to recover some accepted relationships is expected with SR4 recoding. They state that other lines of evidence can be used to indicate which relationships inferred with SR recoding are the accurate relationships. RM then use that statement to claim that the Porifera-sister hypothesis is accurate. These claims are logically questionable for two reasons: 1) the first claim ignores poor methodological performance if at least one preferred relationship is recovered, and 2) most analyses have greater support for ctenophores sister at the partition-specific support level (Fig. 4 in ref. ¹), so even if claim 1 was accepted, RM lack other lines of evidence to support the assertion that SR4 recoding is accurately inferring sponges as the sister group to all other animals. Moreover, the inability of SR4 recoding with site-heterogeneous models to recover a monophyletic Chordata on LEAP and LEAN or monophyletic deuterostomes on WEA17 indicates problems with SR4 recoding. Accepting SR4 recoding as accurate requires one to overlook some parts of the tree while focusing only on particular relationships of interest. Ideally, a robust method should recover all relationships known to be well-supported, which is true of non-recoded analyses on the WEA17 dataset. A simpler explanation that does not require ignoring certain parts of the tree is that SR recoding is inaccurate because it removes information content, rather than reducing systematic error⁶.

All non-recoded analyses done in RM recover Ctenophora as the sister lineage of all other animals. Yet, RM claim that Ctenophora-sister is implausible. A primary line of evidence for this conclusion is decreased support values for Ctenophora-sister with more complex models. However, Ctenophora-sister is the best supported hypothesis in almost every analysis, including those that use the most complex models. Contrary to RM’s claims, decreased support for Ctenophora-sister in some analyses does not indicate unequivocal support for sponges-sister. Taxon sampling is also widely accepted as aiding accurate phylogenetic inference⁷. Notably, all non-recoded analyses with the most taxon-rich dataset, WEA17, strongly support Ctenophora-sister. Phylogenetic inference with SR4 recoding and site-heterogeneous models was the only analysis on WEA17 that did not recover Ctenophora-sister, but it failed to recover deuterostome monophyly (Supplementary Fig. 25 in ref. ¹). RM also indicated problems with overparameterization in their analysis of WEA17 with SR4 and site-heterogeneous models. We agree, and a recent study indicated that overparameterization can drive inference of sponges-sister⁶. Thus, the most reliable analyses of RM indicate Ctenophora-sister as the most likely hypothesis.

Careful reading indicates that numerous criticisms made by RM about other studies are unwarranted. For example, SR recoding failed to recover accepted relationships on most datasets (Fig. 1; Supplementary Information of ref. ¹), indicating that concerns raised by Hernandez and Ryan⁵ about data recoding were inappropriately dismissed. Hernandez and Ryan⁵ robustly tested amino acid recoding in simulation, whereas most other studies have relied on assumed relationships (e.g., Porifera sister to all other metazoans; see⁶). Given that empirical phylogenies cannot be known with absolute certainty, simulation approaches are strong evidence that recoding is problematic. Thus, dismissing Hernandez and Ryan⁵ and preferring results with SR4 recoding, even when recoding caused non-monophyly of accepted clades, is problematic.

RM also state that the REA, WEA15, and WEA17 datasets are filled with paralogs that negatively influence phylogenetic inference without providing evidence aside from unpublished “personal communication.” Given the nature and gravity of this debate, unpublished “personal communication” is unreliable evidence as it cannot be easily verified. Importantly, WEA15 and WEA17 were curated to control for paralogs using tree-based approaches⁸. Although we disagree with the premise that WEA15 and WEA17 are fundamentally flawed, if the datasets are as problematic as claimed, then they are unsuitable for inferring relationships, and RM’s conclusions would need to be rejected in favor of acknowledged uncertainty in the phylogenetic position of sponges and ctenophores. Finally, the criticisms of RM about Whelan and Halanych⁹, a study that compared the CAT models of PhyloBayes¹⁰ to partitioning, are without basis as RM did not perform analyses with the CAT models of PhyloBayes.

Although we support attempts to better model substitutional heterogeneity, methods must be robustly tested. We agree with RM and others that combining partitioning and site-heterogeneous models in a maximum likelihood framework may be a computational tractable and accurate approach for phylogenomic inference¹¹ (Fig. 1). Notably, non-recoded partitioning with linked branches and site-heterogeneous models recovered the Ctenophora-sister hypothesis¹ (Fig. 1). When testing approaches, an objective lens must be applied to assess support and rejection of alternative hypotheses. RM’s presentation does not accurately reflect the level of support in their analyses for the Ctenophora-sister hypothesis. Moreover, the Porifera-sister hypothesis is not viewed with the same critical lens. Researchers are actively generating data from more taxa and genes, which will hopefully shed light on this challenging phylogenetic issue.

Methods

Maximum likelihood trees were inferred with non-recoded and SR³ recoded datasets BEA, LEAP, and LEAN from RM¹. Following RM¹, six analyses were done on each dataset (i.e., L1, L2, L3, L4, RL1, RL2) to test for the influence of including site-heterogenous models in analyses. Best-fit partitions and substitution models were inferred with ModelFinder¹², as implemented in IQ-TREE 1.6.12², with 20% relaxed clustering; branch lengths were linked and each partition was allowed to have its own evolutionary rate. For model testing, L1 analyses included only standard site homogeneous protein subtitution models (e.g., Dayhoff, JTT), L2 analyses included all models from L1 analyses and multi-matrix models (e.g., EX2, LG4M), L3 analyses included all models from L1 and L2 analyses plus multi-profile models with Poisson exchangeabilities (e.g., C10, C20), and L4 analyses included all models from L1, L2, and L3 analyses plus multi-profile models with non-Poisson exchangeabilities (e.g., JTT + C10, LG-C30). Model testing for RL1 analyses included site-homogeneous nucleotide (i.e., 4-state) models. Model testing for RL2 analyses included all models from RL1 analyses plus multi-profile site-heterogenous models with either Poisson or GTR exchangeabilities. All analyses included testing models with a parameter for rate heterogeneity. Following RM¹, partition finding was done only on L1 analyses; model-testing for L2, L3, L4, RL1, and RL2 analyses was done with the best-fit partitions from L1 analyses on each dataset. Model testing, as described above, was consistent with what was done by RM on the same datasets but with IQ-TREE 1.6.12. Tree inference was done with IQ-TREE using best-fit paritions and models. Support was assessed with 1000 ultrafast boostrap replicates¹³.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Raw data were downloaded from the FigShare reposiroty of ref. ¹ (https://doi.org/10.6084/m9.figshare.12746972.v1). Tree files are available in Supplementary Data 1.

References

Redmond, A. K. & McLysaght, A. Evidence for sponges as sister to all other animals from partitioned phylogenomics with mixture models and recoding. Nat. Commun. 12, 1783 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Nguyen, L.-T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Article CAS PubMed Google Scholar
Susko, E. & Roger, A. J. On reduced amino acid alphabets for phylogenetic inference. Mol. Biol. Evol. 24, 2139–2150 (2007).
Article CAS PubMed Google Scholar
Dunn, C. W., Giribet, G., Edgecombe, G. D. & Hejnol, A. Animal phylogeny and its evolutionary implications. Annu. Rev. Ecol., Evolution, Syst. 45, 371–395 (2014).
Article Google Scholar
Hernandez, A. M. & Ryan, J. F. Six-state amino acid recoding is not an effective strategy to offset compositional heterogeneity and saturation in phylogenetic analyses. Syst. Biol. 70, 1200–1212 (2021).
Article CAS PubMed PubMed Central Google Scholar
Li, Y., Shen, X.-X., Evans, B., Dunn, C. W. & Rokas, A. Rooting the animal tree of life. Mol. Biol. Evol. 38, 4322–4333 (2021).
Article CAS PubMed PubMed Central Google Scholar
Hedtke, S. M., Townsend, T. M. & Hillis, D. M. Resolution of phylogenetic conflict in large datasets by increased taxon sampling. Syst. Biol. 55, 522–529 (2006).
Article PubMed Google Scholar
Kocot, K. M., Citarella, M. R., Moroz, L. L. & Halanych, K. M. PhyloTreePruner: A Phylogenetic Tree-Based Approach for Selection of Orthologous Sequences for Phylogenomics. Evol. Bioinform 9, 429–435 (2013).
Article CAS Google Scholar
Whelan, N. V. & Halanych, K. M. Who let the CAT out of the bag? Accurately dealing with subtitutional heterogeneity in phylogenomics analyses. Syst. Biol. 66, 232–255 (2017).
CAS PubMed Google Scholar
Lartillot, N. & Philippe, H. A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process. Mol. Biol. Evol. 21, 1095–1109 (2004).
Article CAS PubMed Google Scholar
Wang, H.-C., Susko, E. & Roger, A. J. The relative importance of modeling site pattern heterogeneity versus partition-wide heterotachy in phylogenomic inference. Syst. Biol. 68, 1003–1019 (2019).
Article PubMed Google Scholar
Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K. F., von Haeseler, A. & Jermiin, L. S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat. Methods 14, 587–589 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hoang, D. T., Chernomor, O., Von Haeseler, A., Minh, B. Q. & Vinh, L. S. UFBoot2: improving the ultrafast bootstrap approximation. Mol. Biol. Evol. 35, 518–522 (2018).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was made possible in part by a grant of high-performance computing resources and technical support from the Alabama Supercomputer Authority. The findings and conclusions in this study are those of the authors and do not necessarily represent the views of the United States Fish and Wildlife Service.

Author information

Authors and Affiliations

Southeast Conservation Genetics Lab, Warm Springs Fish Technology Center, US Fish and Wildlife Service, Auburn, AL, USA
Nathan V. Whelan
School of Fisheries, Aquaculture, and Aquatic Sciences, College of Agriculture, Auburn University, Auburn, AL, USA
Nathan V. Whelan
Center for Marine Science, University of North Carolina at Wilmington, Wilmington, NC, USA
Kenneth M. Halanych

Authors

Nathan V. Whelan
View author publications
You can also search for this author in PubMed Google Scholar
Kenneth M. Halanych
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

N.V.W. conceived the study and performed analyses. N.V.W. and K.M.H. wrote the manuscript.

Corresponding author

Correspondence to Nathan V. Whelan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Description of Additional Supplementary Files

Supplementary Data 1

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Whelan, N.V., Halanych, K.M. Available data do not rule out Ctenophora as the sister group to all other Metazoa. Nat Commun 14, 711 (2023). https://doi.org/10.1038/s41467-023-36151-6

Download citation

Received: 30 September 2021
Accepted: 18 January 2023
Published: 10 February 2023
DOI: https://doi.org/10.1038/s41467-023-36151-6

This article is cited by

The origin and evolution of Wnt signalling
- Michaela Holzem
- Michael Boutros
- Thomas W. Holstein
Nature Reviews Genetics (2024)
Reply to: Available data do not rule out Ctenophora as the sister group to all other Metazoa
- Anthony K. Redmond
- Aoife McLysaght
Nature Communications (2023)
Incongruence in the phylogenomics era
- Jacob L. Steenwyk
- Yuanning Li
- Antonis Rokas
Nature Reviews Genetics (2023)
Chemical cognition: chemoconnectomics and convergent evolution of integrative systems in animals
- Leonid L. Moroz
- Daria Y. Romanova
Animal Cognition (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Available data do not rule out Ctenophora as the sister group to all other Metazoa

Subjects

Methods

Reporting summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Description of Additional Supplementary Files

Supplementary Data 1

Reporting Summary

Rights and permissions

About this article

Cite this article

This article is cited by

The origin and evolution of Wnt signalling

Reply to: Available data do not rule out Ctenophora as the sister group to all other Metazoa

Incongruence in the phylogenomics era

Chemical cognition: chemoconnectomics and convergent evolution of integrative systems in animals

Comments

Search

Quick links

Subjects

Methods

Reporting summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Description of Additional Supplementary Files

Supplementary Data 1

Reporting Summary

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

The origin and evolution of Wnt signalling

Reply to: Available data do not rule out Ctenophora as the sister group to all other Metazoa

Incongruence in the phylogenomics era

Chemical cognition: chemoconnectomics and convergent evolution of integrative systems in animals

Comments

Search

Quick links