Rhodophyta (red algae) is one of three lineages of Archaeplastida1, a supergroup that is united by the primary endosymbiotic origin of plastids in eukaryotes2,3. Red algae are a diverse and species-rich group, members of which are typically photoautotrophic, but are united by a number of highly derived characteristics: they have relatively small intron-poor genomes, reduced metabolism and lack cytoskeletal structures that are associated with motility, flagella and centrioles. This suggests that marked gene loss occurred around their origin4; however, this is difficult to reconstruct because they differ so much from the other archaeplastid lineages, and the relationships between these lineages are unclear. Here we describe the novel eukaryotic phylum Rhodelphidia and, using phylogenomics, demonstrate that it is a closely related sister to red algae. However, the characteristics of the two Rhodelphis species described here are nearly opposite to those that define red algae: they are non-photosynthetic, flagellate predators with gene-rich genomes, along with a relic genome-lacking primary plastid that probably participates in haem synthesis. Overall, these findings alter our views of the origins of Rhodophyta, and Archaeplastida evolution as a whole, as they indicate that mixotrophic feeding—that is, a combination of predation and phototrophy—persisted well into the evolution of the group.
Access optionsAccess options
Subscribe to Journal
Get full journal access for 1 year
only $3.90 per issue
All prices are NET prices.
VAT will be added later in the checkout.
Rent or Buy article
Get time limited or full article access on ReadCube.
All prices are NET prices.
Raw transcriptome and genome reads from R. limneticus and R. marinus are deposited in GenBank (PRJNA544719), along with full SSU rRNA gene sequences for R. marinus (MK966712) and R. limneticus (MK966713). Assembled transcriptomes and genomes, along with raw light and electron-microscopy images, individual gene alignments, concatenated and trimmed alignments, single-gene trees, and maximum-likelihood and Bayesian tree files for the 151-taxon and 153-taxon datasets have been deposited in Dryad (https://doi.org/10.5061/dryad.tr6d8q2). The family Rhodelphidae (urn:lsid:zoobank.org:act:80B5C004-2954-4A57-A411-482BCD29E85D), genus Rhodelphis (urn:lsid:zoobank.org:act:6D09D4D9-D9FC-4D0C-8FB2-55FD9DDEAD53) and species Rhodelphis limneticus (urn:lsid:zoobank.org:act:695ACD0B-8151-4609-97FC-A044A312BE22) and Rhodelphis marinus (urn:lsid:zoobank.org:act:84233191-4710-43D1-A2DA-914B8E7B7E01) have been registered with the Zoobank database (http://zoobank.org/).
All unpublished code is available upon reasonable request from the corresponding authors.
Burki, F. The eukaryotic tree of life from a global phylogenomic perspective. Cold Spring Harb. Perspect. Biol. 6, a016147 (2014).
Archibald, J. M. The puzzle of plastid evolution. Curr. Biol. 19, R81–R88 (2009).
Keeling, P. J. The number, speed, and impact of plastid endosymbioses in eukaryotic evolution. Annu. Rev. Plant Biol. 64, 583–607 (2013).
Qiu, H., Price, D. C., Yang, E. C., Yoon, H. S. & Bhattacharya, D. Evidence of ancient genome reduction in red algae (Rhodophyta). J. Phycol. 51, 624–636 (2015).
Yoon, H. S. et al. Single-cell genomics reveals organismal interactions in uncultivated marine protists. Science 332, 714–717 (2011).
Salichos, L. & Rokas, A. Inferring ancient divergences requires genes with strong phylogenetic signals. Nature 497, 327–331 (2013).
Zhang, C., Rabiee, M., Sayyari, E. & Mirarab, S. ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees. BMC Bioinformatics 19, 153 (2018).
Spiegel, F. W. Contemplating the first Plantae. Science 335, 809–810 (2012).
Qiu, H., Yoon, H. S. & Bhattacharya, D. Red algal phylogenomics provides a robust framework for inferring evolution of key metabolic pathways. PLoS Curr. 8, https://doi.org/10.1371/currents.tol.7b037376e6d84a1be34af756a4d90846 (2016).
Pazour, G. J., Agrin, N., Leszyk, J. & Witman, G. B. Proteomic analysis of a eukaryotic cilium. J. Cell Biol. 170, 103–113 (2005).
Maruyama, S. & Kim, E. A modern descendant of early green algal phagotrophs. Curr. Biol. 23, 1081–1084 (2013).
Burns, J. A., Pittis, A. A. & Kim, E. Gene-based predictive models of trophic modes suggest Asgard archaea are not phagocytotic. Nat. Ecol. Evol. 2, 697–704 (2018).
Gornik, S. G. et al. Endosymbiosis undone by stepwise elimination of the plastid in a parasitic dinoflagellate. Proc. Natl Acad. Sci. USA 112, 5767–5772 (2015).
Xu, P. et al. The genome of Cryptosporidium hominis. Nature 431, 1107–1112 (2004).
Gould, S. B., Maier, U.-G. & Martin, W. F. Protein import and the origin of red complex plastids. Curr. Biol. 25, R515–R521 (2015).
Oborník, M. & Green, B. R. Mosaic origin of the heme biosynthesis pathway in photosynthetic eukaryotes. Mol. Biol. Evol. 22, 2343–2353 (2005).
Smith, D. R. & Lee, R. W. A plastid without a genome: evidence from the nonphotosynthetic green algal genus Polytomella. Plant Physiol. 164, 1812–1819 (2014).
Fernández Robledo, J. A. et al. The search for the missing link: a relic plastid in Perkinsus? Int. J. Parasitol. 41, 1217–1229 (2011).
Muñoz-Gómez, S. A. et al. The new red algal subphylum Proteorhodophytina comprises the largest and most divergent plastid genomes known. Curr. Biol. 27, 1677–1684 (2017).
Lartillot, N., Lepage, T. & Blanquart, S. PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating. Bioinformatics 25, 2286–2288 (2009).
Nguyen, L.-T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Tikhonenkov, D. V., Mazeĭ, IuA. & Embulaeva, E. A. [Degradation succession of heterotrophic flagellate communities in microcosms]. Zh. Obshch. Biol. 69, 57–64 (2008).
Tikhonenkov, D. V. et al. Description of Colponema vietnamica sp.n. and Acavomonas peruviana n. gen. n. sp., two new alveolate phyla (Colponemidia nom. nov. and Acavomonidia nom. nov.) and their contributions to reconstructing the ancestral state of alveolates and eukaryotes. PLoS ONE 9, e95467 (2014).
Luft, J. H. Improvements in epoxy resin embedding methods. J. Biophys. Biochem. Cytol. 9, 409–414 (1961).
Picelli, S. et al. Full-length RNA-seq from single cells using Smart-seq2. Nat. Protocols 9, 171–181 (2014).
Tikhonenkov, D. V., Janouškovec, J., Keeling, P. J. & Mylnikov, A. P. The morphology, ultrastructure and SSU rRNA gene sequence of a new freshwater flagellate, Neobodo borokensis n. sp. (Kinetoplastea, Excavata). J. Eukaryot. Microbiol. 63, 220–232 (2016).
Andrews, S. FastQC: a quality control tool for high throughput sequence data. version 0.10.1 https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (2010).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-seq data without a reference genome. Nat. Biotechnol. 29, 644–652 (2011).
Altschul, S. F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997).
Li, W. & Godzik, A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22, 1658–1659 (2006).
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Bankevich, A. et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 19, 455–477 (2012).
Miller, I. J. et al. Autometa: automated extraction of microbial genomes from individual shotgun metagenomes. Nucleic Acids Res. 47, e57 (2019).
Gurevich, A., Saveliev, V., Vyahhi, N. & Tesler, G. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29, 1072–1075 (2013).
Wu, T. D. & Watanabe, C. K. GMAP: a genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics 21, 1859–1875 (2005).
Crooks, G. E., Hon, G., Chandonia, J.-M. & Brenner, S. E. WebLogo: a sequence logo generator. Genome Res. 14, 1188–1190 (2004).
Ryan, J. F. Baa.pl: a tool to evaluate de novo genome assemblies with RNA transcripts. Preprint at https://arxiv.org/abs/1309.2087 (2013).
Gruber-Vodicka, H. R., Seah, B. K. B. & Pruesse, E. phyloFlash – rapid SSU rRNA profiling and targeted assembly from metagenomes. Preprint at https://www.biorxiv.org/content/10.1101/521922v1 (2019).
Burki, F. et al. Untangling the early diversification of eukaryotes: a phylogenomic study of the evolutionary origins of Centrohelida, Haptophyta and Cryptista. Proc. R. Soc. B 283, 20152802 (2016).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Criscuolo, A. & Gribaldo, S. BMGE (Block Mapping and Gathering with Entropy): a new software for selection of phylogenetic informative regions from multiple sequence alignments. BMC Evol. Biol. 10, 210 (2010).
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Roure, B., Rodriguez-Ezpeleta, N. & Philippe, H. SCaFoS: a tool for selection, concatenation and fusion of sequences for phylogenomics. BMC Evol. Biol. 7 (Suppl. 1), S2 (2007).
Maddison, W. P. & Maddison, D. R. Mesquite: a modular system for evolutionary analysis. Version 3.5 https://www.mesquiteproject.org/ (2018).
Emanuelsson, O., Nielsen, H., Brunak, S. & von Heijne, G. Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. J. Mol. Biol. 300, 1005–1016 (2000).
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009).
Price, M. N., Dehal, P. S. & Arkin, A. P. FastTree: computing large minimum evolution trees with profiles instead of a distance matrix. Mol. Biol. Evol. 26, 1641–1650 (2009).
Whelan, S., Irisarri, I. & Burki, F. PREQUAL: detecting non-homologous characters in sets of unaligned homologous sequences. Bioinformatics 34, 3929–3930 (2018).
Katoh, K. & Standley, D. M. A simple method to control over-alignment in the MAFFT multiple sequence alignment program. Bioinformatics 32, 1933–1942 (2016).
Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K. F., von Haeseler, A. & Jermiin, L. S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat. Methods 14, 587–589 (2017).
Hoang, D. T., Chernomor, O., von Haeseler, A., Minh, B. Q. & Vinh, L. S. UFBoot2: improving the ultrafast bootstrap approximation. Mol. Biol. Evol. 35, 518–522 (2018).
Moriya, Y., Itoh, M., Okuda, S., Yoshizawa, A. C. & Kanehisa, M. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 35, W182–W185 (2007).
Emms, D. M. & Kelly, S. OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol. 16, 157 (2015).
We thank L. Nguyen-Ngoc, H. Doan-Nhu, E. S. Gusev, Y. Dubrovsky and the staff of the Russian-Vietnam Tropical Centre, Coastal Branch for assistance with sample collection and trip management; S. A. Karpov for assistance with interpretation of transmission electron microscopy images; Compute/Calcul Canada for computational resources, especially the Orcinus (Westgrid) and Mammouth Parallèle II (Calcul Québec) clusters. This work was supported by a grant from the Natural Sciences and Engineering Research Council of Canada to P.J.K. (grant 227301). Field work in Vietnam is part of the project ‘Ecolan 3.2’ of the Russian-Vietnam Tropical Centre. R.M.R.G. was supported by a fellowship from the Canadian Institutes of Health Research and a grant from the Tula Foundation to the Centre for Microbial Diversity and Evolution. Cell isolation and culturing, generation of material for sequencing, light and electron microscopy and analysis were supported by the Russian Science Foundation to D.V.T. (grant 18-14-00239). F.H. is supported by an EMBO fellowship (ALTF 1260–2016).
The authors declare no competing interests.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Peer review information Nature thanks Geoffrey McFadden and the other anonymous reviewer(s) for their contribution to the peer review of this work.
Extended data figures and tables
Related to Fig. 1. a, b, Scanning electron microscopy images showing the flagella and mastigonemes on the posterior flagellum. c, Section of an anterior flagellum. d, Section of a posterior flagellum. e–g, Arrangement of the transitional zone of a flagellum with transverse plate and cylinder. h, Wide microtubular band 2 accompanies the posterior flagellum. i–l, Cell sections from anterior to posterior. m, Single microtubules inside the cytoplasm. n, A rhizoplast connects the basal body of the posterior flagellum to the nucleus. o, Area of cell with nucleus, rudiments of glycostyles inside the vesicles and smooth endoplasmic reticulum. p, Contractile vacuole. q, Osmiophilic body. r, s, Phagocytosis of eukaryotic prey and bacteria. t, Cell section showing food vacuole with several engulfed bacterial cells. bc, bacterium; cl, cylinder; cv, contractile vacuole; fv, food vacuole; gl, glycostyles; bb1, basal body of posterior flagellum; bb2, basal body of anterior flagellum; mn, mastigonemes; mrt, microtubule; n, nucleus; nmb, narrow microtubular band; ob, osmiophilic body; pf, posterior flagellum; pr, eukaryotic prey; rgl, rudiments of glycostyles; rp, rhizoplast; ser, sac of smooth endoplasmic reticulum; sm, single microtubule; ss, striated structure; st, satellite of basal body; tp, transverse plate; wmb1, wide microtubular band 1; wmb2, wide microtubular band 2. Scale bars, 5 μm (a, b), 0.5 μm (c, d, i–o), 0.2 μm (e–h, q), 1 μm (p) and 2 μm (r–t). These experiments were repeated three (a, b) and seven (c–t) times, with similar results.
a, b, Living cells, obtained by light microscopy. c, Longitudinal section of the cell, d, Region of the cell surface with glycostyles. e, Basal body of posterior flagellum with outgoing fibrils and striated structure. f, Section of the flagellum covered with glycostyles and dark granules. g, Emergence of a posterior flagellum. h, Basal body of the posterior flagellum and mitochondrion with tubular cristae. i, Formation of rudiments of glycostyles in perinuclear space. j, Nucleus, mitochondrion and reserve substance. k, Osmiophilic formation and microtubules. cr, cristae; dg, dark granules; fb, fibril; gl, glycostyles; bb1, basal body of posterior flagellum; bb2, basal body of anterior flagellum; mt, mitochondrion; mrt, microtubules; n, nucleus, of, osmiophilic (dark) formation; pf, posterior flagellum; rgl, rudiments of glycostyles; rs, reserve substance; ser, sac of smooth endoplasmic reticulum; ss, striated structure. Scale bars, 10 μm (a, b), 2 μm (c), 0.2 μm (d, e), 0.5 μm (f–i, k) and 1 μm (j). These experiments were repeated ten (a, b) and three (c–k) times, with similar results.
a, Bayesian tree using the CAT + GTR evolutionary model as implemented in PhyloBayes. b, Maximum-likelihood tree using the LG + C60 + F + G4 model as implemented in IQ-TREE. Black dots denote full statistical support (Bayesian posterior probability = 1.0, maximum-likelihood ultrafast bootstrap and SH-aLRT = 100%); support values <0.7/70% are not shown (indicated by ‘–’). c, Bootstrap support for maximum-likelihood trees (PROT + CAT + LG + F) after progressive removal of the fastest evolving amino acid sites shows that both the Rhodelphis and red algae and the picozoa, Rhodelphis and red algae relationships are relatively robust to data removal. Similar to the 151/253 dataset, support for Archaeplastida paraphyly (Cryptista, green plants and glaucophytes (green and glauc.)) decreases with data removal, whereas Archaeplastida monophyly support increases. Support for Opisthokonta monophyly serves as a control for the presence of sufficient information for phylogenomic inference.
a, Bayesian tree using the CAT + GTR evolutionary model as implemented in PhyloBayes. b, Maximum-likelihood tree using the LG + C60 + F + G4 model as implemented in IQ-TREE. Black dots denote full statistical support (Bayesian posterior probability = 1.0, maximum-likelihood ultrafast bootstrap and SH-aLRT = 100%); support values <0.7/70% are not shown.
Internode certainty and internode certainty-all scores were calculated with RAxML v.8.1.6 and mapped onto the 151/253 maximum-likelihood tree topology presented in Extended Data Fig. 4b. a, Internode certainty and internode certainty-all scores for 253 individual bootstrapped maximum-likelihood trees used to generate the concatenated alignment for Extended Data Fig. 4b. b, Internode certainty and internode certainty-all scores for the 50 trees among the 253-tree dataset that have the highest RTC scores, which are expected to improve the robustness of phylogenomic analyses. Internode certainty scores for the sister relationship of Rhodelphis and red algae are higher for the 50 best-supported trees, indicating that they also support this relationship, and with fewer conflicting signals.
Extended Data Fig. 6 Phylogenomic analysis based on concatenation of 50 single-gene datasets with highest RTC scores.
Maximum-likelihood tree using the LG + PMSF + G model as implemented in IQ-TREE (151 taxa, 50 proteins, 21,886 sites). Black dots denote full statistical support (maximum-likelihood ultrafast bootstrap and SH-aLRT = 100%); support values <0.7/70% are not shown. The sister relationship of Rhodelphis and red algae still receives full statistical support with a highly reduced, phylogenetically well-supported dataset.
Extended Data Fig. 7 A coalescence phylogenomic framework recovers Rhodelphis as sister to red algae based on individual gene trees.
Individual bootstrapped gene trees were generated with RAxML v.8.1.6 and used to generate a species tree with ASTRAL-III under default parameters and 100 bootstrap replicates. Support values <0.7/70% are not shown. a, Species trees were made from all 253 single-gene trees from the 151/253 dataset. b, Species trees were made from the 50 trees with the highest relative tree certainty. The sister relationship of Rhodelphis and red algae is recovered with both datasets, and is consistent with concatenated phylogenomic analyses.
a, b, Principal component (PC) plots of gene ontology (GO) category score from free-living phagocytes (a) and photosynthetic organisms (b). a, Rhodelphis associate with phagocytes, but not with photosynthetic eukaryotes. Dashed ellipses represent 95% confidence intervals based on training datasets using a model defined by free-living phagocytes (a; n = 86 GO categories, 474 proteins) and photosynthesis model (b; n = 37 GO categories, 243 proteins). c, Heat map of phagocyte GO terms showing (as in a) that Rhodelphis gene repertoires are similar to phagocytes. Analyses were performed using PredictTrophicMode_Tool.R.
Extended Data Fig. 9 Rhodelphis encode plastid-targeted proteins with N-terminal targeting sequences and homologues of the TIC/TOC plastid import system.
a, An alignment of plastid-type chaperonin 60 (related to Fig. 3b) shows that Rhodelphis nuclear genomes encode plastid-targeted proteins that have clear N-terminal extensions (cTP) relative to plastid-encoded red algal homologues (names in red) and cyanobacterial homologues (names in cyan), but lack a signal peptide (SP) characteristic of proteins targeted to complex secondary or tertiary plastids, as found in Plasmodium (orange). b–e, Rhodelphis nuclear genomes encode bona fide homologues of plastid protein import subunits TIC20, TIC32, TIC22 and TOC75, which are specific genetic markers for plastid presence.
Supplementary Table 1 Predicted Rhodelphis plastid-targeted proteins. Rhodelphis proteins with predicted N-terminal plastid transit peptides and/or plastid-type origin are presented. R. marinus and R. limneticus data are presented on separate tabs. Protein sequences are derived from transcriptome datasets, and augmented for R. limneticus by preliminary gene predictions from nuclear genomic data. Note that some protein sequences are incomplete.
About this article
Current Biology (2019)