Heterotrophic protists (unicellular eukaryotes) form a major link from bacteria and algae to higher trophic levels in the sunlit ocean. Their role on the deep seafloor, however, is only fragmentarily understood, despite their potential key function for global carbon cycling. Using the approach of combined DNA metabarcoding and cultivation-based surveys of 11 deep-sea regions, we show that protist communities, mostly overlooked in current deep-sea foodweb models, are highly specific, locally diverse and have little overlap with pelagic communities. Besides traditionally considered foraminiferans, tiny protists including diplonemids, kinetoplastids and ciliates were genetically highly diverse, considerably exceeding the diversity of metazoans. Deep-sea protists, including many parasitic species, thus represent one of the most diverse biodiversity compartments of the Earth system, forming an essential link to metazoans.
Although deep-sea sediment life and its extraordinary representatives have been studied for more than two centuries1,2, we still lack a firm understanding of diversity and ecological functions in the largest ecosystem of the biosphere owing to the difficulty of accessing it3. In the last two decades, the establishment of new tools for studying the molecular identity of microbial communities has revolutionized our understanding of the microbial world, and revealed a large and unique diversity of prokaryotes4 and previously unknown protistan lineages in surface waters and the deep sea5,6,7. In parallel, morphological and molecular studies of cultured species have widened our perception of poorly represented branches of the tree of eukaryotic life8. Despite the fundamental roles of protists in the food web of marine surface waters9,10,11, we know little about them on the deep-sea floor. Assessing the protist diversity of deep-sea sediments and its biogeographic distribution is crucial for understanding the ecosystem functions of eukaryotes in distinct basins, as well as the overall role of eukaryotes in global carbon cycling.
The important role of protists in energy transfer through aquatic food webs has been well established for shallow benthic and pelagic marine ecosystems12,13, where protists have developed a wide range of nutritional strategies14. Within the euphotic water column, marine photosynthetic plankton forms the base of ocean food webs, having a profound influence on the global carbon cycle. Protists are known as important grazers of bacteria and nutrient remineralizers in many aquatic ecosystems9,15,16. Delivery of fixed carbon to the deep sea via sinking detritus and carcasses provides a link between surface-associated and deep-sea detritus-based microbial food webs17,18. The sparse records on the functional diversity of naked and testate protists reported from the deep seafloor7 suggest that deep-sea microbial food webs might function in a similar way to those in surface waters. Barotolerant or barophilic nanoprotists (<20 µm) may live at high hydrostatic pressure and can feed on prokaryotes in porewater as well as on those attached to particles7. Omnivorous protists, such as many ciliates and some rhizopods and flagellates, consume a broad spectrum of food particles including other protists and detritus. Archaeal assemblages are known to play a major role in inorganic carbon fixation in deep benthic systems19 and, at least from surface water assemblages, it is known that they can form a suitable food source for protists.
Most benthic deep-sea studies have so far focused on assumed hot spots such as hydrothermal vents, cold seeps, or anoxic basins at bathyal depths ranging from 1000 to 3000 m20,21,22. Only a few studies have focused on protist communities inhabiting abyssal sediments (3000 to 6000 m depths), which cover more than half of the Earth's surface, and even fewer on hadal trenches ranging from 6000 to 11,000 m depths23,24,25. Global-scale comparisons, as they were made for the eukaryotic plankton community of the euphotic zone11 or the dark ocean26, are missing for benthic deep-sea protists.
Results and discussion
Deep-sea metabarcoding approach
To explore protistan diversity in different deep-sea basins, we collected sediment samples from 20 sampling sites (3 bathyal sites, 15 abyssal sites, 2 hadal sites) in 11 regions in the Pacific and Atlantic Ocean (Fig. 1a–c, Supplementary Data 1, map created with Ocean Data View27). Besides sampling on a large scale to compare different deep-sea regions, we also investigated protist communities on a small spatial scale (see Supplementary Data 1). We used the approach combining DNA metabarcoding of the hypervariable V9 region of the 18S rDNA11 and direct microscopic live observations (Fig. 1d) with cultivation of protists. Morphological and molecular characterizations of the cultures were obtained to verify results from DNA metabarcoding, and their potential of barotolerance was also investigated28,29. Strict bioinformatic quality control led to a final eukaryotic dataset of ~47,000 operational taxonomic units (OTUs) (~70 million reads), of which the majority (87%) could be taxonomically assigned to groups of heterotrophic protists (Supplementary Tables 1 and 2). Keeping in mind that the number of sampled stations was more than twice as high, the eukaryotic richness in the euphotic zone of marine waters was also more than twice as high (~110,000 OTUs, the majority belonged to heterotrophic protistan groups11) when compared to our deep-sea eukaryotic OTUs. Within the Malaspina expedition, targeting the eukaryotic life in the deep water column, ~42,000 OTUs associated with picoeukaryotes could be recovered30. Protist richness in other benthic environments was lower when compared with our benthic deep-sea dataset. In the neotropical rainforests protist richness was much lower (~26,000 protist OTUs31). Within marine coastal sediments, the protist diversity was found to be ~6000 OTUs32. 
Comparing the number of eukaryotic deep-sea OTUs with other environments shows that the diversity of deep-sea assemblages is higher than that of coastal sediment communities and is comparable in size to that of the marine pelagic communities. One should keep in mind that comparing our observed protist richness with studies from other environmental biomes is difficult because some of them used different target regions and filtering/clustering methods. Therefore, we compared the eukaryotic community of the deep seafloor with that of de Vargas et al.11 from the sunlit ocean, where similar filtering and clustering methods were used.
Taxonomic assignment and link to deep-sea cultivable protists
For the taxonomic assignment of sequences, we used a reference database called V9_DeepSea33 (Zenodo, Supplementary Fig. 1 and Data 2). Besides sequences from the Protist Ribosomal Reference database PR2 v4.11.1 (ref. 34), we included 102 in-house Sanger-sequenced strains (see Supplementary Data 2) of which the majority was isolated from deep-sea (57 strains) and marine surface waters (33 strains). We could recover 31 strains of these 102 cultivated marine protists (i.e. 21 deep-sea strains, 8 surface water strains) belonging to 20 species (19 OTUs, ~170,000 reads) with a V9 sequence similarity of 100% including Stramenopiles (bicosoecids, placidids), Discoba (kinetoplastids), Alveolata (ciliates), Obazoa (choanoflagellates), Rhizaria (cercozoans), and Cryptista (cryptophyceans). This highlights the importance of cultivation-based approaches for detailed molecular and morphological description of marine protists and the proper assignment of reads produced by NGS methods. Adding sequences from our strains increased the number of taxonomically assignable OTUs by 0.6% (273 OTUs, ~300,000 reads) with sequence similarities ranging from 80 to 100%. Overall, only 2.4% of our total protist OTUs were 100% identical to reference sequences (on average 90.4% similarity). This points to a specific and genetically distinct protist fauna in deep-sea sediments (Fig. 1d, e), which has previously been reported from studies targeting specific groups or using a smaller sampling size20,23,25.
High reference sequence similarity of diplonemids
The Discoba had a higher proportion of OTUs with an overall higher similarity to reference sequences as compared to the other deep-sea protistan groups within our dataset. From the 7111 Discoba OTUs (sequence similarity ≥94%) ~89% (6300 OTUs) were associated with diplonemids. Pelagic diplonemids are depth stratified and more abundant and diverse in the deep ocean35. The majority of the diplonemids are thought to have a parasitic lifestyle, and one possibility is that they might not be as host specific as is known for other protists (e.g. gregarines in insects36). Another possibility could be that their recovery in molecular surveys might be better than for other protist lineages, resulting in an over-representation in public databases. These explanations remain speculative, and further detailed studies of this important taxon are necessary37,38.
Sampling saturation and differences between depth zones
When assessing protist diversity and saturation in our sampling effort, we could recover 71% of the total estimated sampling saturation of deep-sea heterotrophic protist OTUs by using incidence-based estimators (Fig. 1f). When considering the read abundance, saturation was nearly reached (Supplementary Fig. 2). We found great differences in OTU richness between the bathyal, the abyssal, and the hadal regions with only a small proportion of shared OTUs (Fig. 1f, g). Over half of them could only be detected in abyssal sediments, a result that might be biased by the higher sampling number of abyssal sites (Fig. 1g).
Deep-sea eukaryotic life compared with diversity in the sunlit ocean
A comparison with the Tara Oceans metabarcoding survey of eukaryotic diversity in the world sunlit ocean11 revealed a fundamental difference with only a small proportion of shared OTUs with our benthic deep-sea dataset (Fig. 2 and Supplementary Fig. 3B). We found 11 hyperdiverse deep-sea protist lineages (containing ≥1000 OTUs, Fig. 2c), particularly within the Discoba (diplonemids, kinetoplastids), Rhizaria (foraminiferans), Alveolata (dinoflagellates, MALV II, MALV I, ciliates), and cryptophyceans, which accounted together for more than half of all OTUs (~56%), but only 19% of the reads. A much higher richness characterized the deep-sea diplonemid (~27.7% of the total OTUs, ~4.6% of the total reads) and kinetoplastid flagellates (~3.8% of the total OTUs, ~1.4% of the total reads), foraminiferans (~8.2% of the total OTUs, ~2.5% of the total reads), ciliates (~6.7% of the total OTUs, ~2% of the total reads), and cryptophyceans (~2.4% of the total OTUs, ~1.8% of the total reads), as compared to their surface water relatives (Fig. 2c, e). Richness was by far the highest in diplonemids, a feature that has also been observed in deep layers of the pelagic realm35 indicating their potential importance for deep ocean ecosystems not only in the pelagial (2.1% of the total read abundance), but also in deep-sea sediments (4.6% of the total read abundance). Local sedimentation of debris/marine snow as well as dark inorganic carbon fixation19,39 have challenged our understanding of organic carbon available for deep-sea microbial communities40,41,42. The high number of reads associated with phototrophic species within our deep-sea dataset, e.g., within the Archaeplastida (mainly green microalgae from the family of Chloropicophyceae) and the Cryptophyta (mainly Cryptomonadales) might be due to sinking cells from surface waters down to the deep sea. 
On the other hand, the majority of them had only a low sequence similarity of 80–85% to Archaeplastida and Cryptophyta in the reference database and might be associated with unknown taxonomic groups specifically adapted to deep-sea conditions. Several studies have reported the presence of phototrophic protists in deep waters, suggesting that mixotrophy could help them to thrive in the aphotic zone43. There is also the possibility for those species to enter an encysted state upon sinking44.
Cafeteria burkhardae as potential global player in the marine realm
Particularly striking was the extremely high read abundance of bicosoecids, including one OTU (~2.6 million reads) 100% identical to the species C. burkhardae (Fig. 2b). C. burkhardae was detected at all investigated deep-sea sites, matching our observation of the dominance of this species during cultivation-based approaches of deep-sea protists from several deep-sea expeditions45. One could argue that the occurrence of one OTU in all samples might be due to cross-sample contamination. However, sediment samples were collected during different expeditions and the sediment was processed and analyzed separately in the laboratory. Thus, cross-sample contamination seems unlikely. Interestingly, C. burkhardae also made up the majority of the bicosoecid reads from the Tara Oceans surface plankton metabarcoding dataset11,45 as well as within the Malaspina metabarcoding dataset46 targeting the water column from surface to bathypelagic waters. These occurrences in both pelagic and deep benthic ecosystems, together with recent experiments demonstrating survival at high hydrostatic pressures47, underline the cosmopolitan distribution of selected protist species in the world’s oceans across extreme environmental conditions.
Distributional patterns of deep-sea protist richness on small and large spatial scales
Each of the 27 sediment samples from the 11 investigated regions showed a highly distinct heterotrophic protist community (Fig. 3a) with the highest heterotrophic protist richness within the Alveolata, Discoba, and Rhizaria in each sediment sample (Fig. 3b), a pattern that has also been reported from previous bathyal and abyssal deep-sea floor studies20,24,25. However, diplonemids and dinoflagellates (mainly representatives of the marine alveolate (MALV) clusters) dominated the diversity at the deep seafloor (Fig. 3b). Stramenopiles (mainly bicosoecids) clearly dominated in terms of read abundances, followed by high read abundances within the Alveolata, Discoba, and Rhizaria (Supplementary Fig. 4). The relative proportion of reads per sampling site and division level showed subtle differences (Supplementary Figs. 4 and 5). While the three bathyal stations from the Pacific Ocean formed a highly supported cluster, the two hadal regions from the North Atlantic Ocean clustered together with abyssal stations from the Atlantic (winter expedition) and Pacific Ocean (Fig. 3a). Furthermore, we observed distinct protist communities on a much smaller spatial scale (stations NA4*, NA8*, NA9*) from sediment samples extracted just a few meters apart from each other (Fig. 3a and Supplementary Fig. 6). This could be explained by the sediment patchiness at the abyssal seafloor, which can be very high as indicated by metazoan grazing tracks, or falls of larger organic particles (e.g. debris of macrophytes, wood or dead organisms from the pelagial; Fig. 1c). The high number (~60% OTUs) of heterotrophic protists being unique to one sediment sample and the low percentage (0.6% OTUs) of heterotrophic protists shared between all samples point to the potential of highly endemic protist communities in deep-sea sediments (Fig. 3c). Such a pattern has also been found for benthic deep-sea prokaryotes in different deep-sea basins4 and deep-sea Foraminifera48.
Most “unique” heterotrophic protist OTUs had only a few reads, although several had 10–200 reads (Supplementary Fig. 7). The majority of the heterotrophic protist OTUs was represented by 16–64 reads (Supplementary Fig. 8). There was a high variation of unique protist OTUs and their taxonomic assignment per sampling site and depth (Supplementary Fig. 9). One could argue that this high dissimilarity and clustering could be the result of the high number of unique OTUs with low read abundances (Supplementary Fig. 7). However, even very conservative filtering steps (OTU abundances ≥50 or ≥100 reads) revealed a similar clustering of stations and still resulted in a great dissimilarity between protist communities on both small and large spatial scales (Supplementary Fig. 10).
Feeding modes of deep-sea protists
Abyssal plains are not flat or featureless, but rather strongly influenced both by the underlying plate geology and subsequent sedimentary processes49, which could explain why we did not observe a homogeneous deep-sea diversity pattern. The majority of taxa recorded from the different deep-sea regions belonged either to bacterivorous groups (e.g. discicristates, stramenopiles, most cercomonads, several ciliates, foraminiferans, lobose amoebae)9 or to forms parasitizing other eukaryotes (e.g. perkinseans, apicomplexans, and most MALV taxa among dinoflagellates). Deep-sea studies have observed protist grazing, indicating the potential of substantial reductions of the prokaryote standing stock due to protist grazing20. However, the quantification of protist grazing in the deep sea still needs to be investigated. Members of several groups are known to feed also on other protists (e.g. several ciliates28,29). Global and local differences in prokaryote diversity and abundance4 as a main food source and the endemicity of macrofauna10,50 as important hosts for putative parasites might, amongst many other environmental factors varying across deep-sea habitats50, shape deep-sea protist communities on small and large spatial scales. The impact of multiple processes and possible interactions, which might operate at the same time resulting in unique protist communities on the abyssal and hadal seafloor, still needs to be resolved.
Role of protists in the deep-sea food web
Our results provide a unique view on the genetic diversity and specificity of deep-sea protist communities and point to their very important though still underestimated role in shaping seafloor communities. The estimate of heterotrophic protist species richness (Fig. 1f) for the samples from the deep-sea floor was one order of magnitude higher than that of metazoans, a tendency also obtained from the pelagial (Fig. 2 and ref. 11). According to our data, protist communities comprise representatives of different trophic levels consisting of feeders on bacteria and archaea, on detritus, dissolved organic carbon, small eukaryotes as well as parasites of protists and metazoans (illustrated in Fig. 3d). Thus, a major part of organic carbon in deep-sea sediments is channeled not only via long known deep-sea inhabiting foraminiferans51 but also through an unsuspected and extensive variety of small naked heterotrophic protists with different functions. These deep-sea protists form an essential link to metazoans via several trophic levels of flagellated, amoeboid, and ciliated protists by providing biochemically enriched organic matter to metazoans40. In addition, due to the parasitic lifestyle of many deep-sea protists (e.g. diplonemids, MALV II6,35) they might act as important remineralizers of other protists and metazoans, channeling carbon back to prokaryotes8,14. Ammonia-oxidizing Archaea have been shown to dominate microbial communities in abyssal clay in the North Atlantic Ocean52. Due to their high abundances, Archaea should also be considered as a potential food source for deep-sea protists. A recent study showed that Cafeteria, probably the most common heterotrophic flagellate taxon, feeds on Archaea53. In addition, several protists from freshwater systems have been found to positively select Archaea as food source over Eubacteria54.
New techniques and large-scale studies, as well as long-term surveys/time series, may further elucidate the diverse composition of seafloor communities over both space and time, which is critical to our understanding of global biogeochemical cycles in the Earth’s largest habitat.
The highly diverse species composition of heterotrophic protists in the deep sea demanded a combination of culture-independent (metabarcoding) and culture-dependent methods55. Isolation and cultivation of deep-sea protists were carried out for 102 strains to create an extended reference database (see below). In addition, eco-physiological studies were conducted for most of the strains regarding their survival at deep-sea pressure to check for their potential to belong to an active deep-sea community28,29,45,47,56. During four different expeditions in the Pacific and Atlantic Ocean on board the research vessels R/V Sonne (SO237, SO223T) and R/V Meteor (M79/1, M139), sediment samples from 20 different stations (3 bathyal, 15 abyssal, 2 hadal) at 11 deep-sea basins/regions were collected using a Multi-Corer (MUC) (Supplementary Data 1). Temperature at the deep sea ranged between 2 and 4 °C; salinity was about 36 PSU. Detailed data on the conditions are available from published cruise reports of M139 (https://doi.org/10.2312/cr_m139), M79/1 (https://doi.org/10.2312/cr_m79_1), SO223T (urn:nbn:de:gbv:46-00102735-15), and SO237 (https://doi.org/10.3289/GEOMAR_REP_NS_23_2015). Subsamples of the MUC-system were taken from the upper 2 mm sediment layer by means of a sterile syringe. Only tubes with undisturbed sediment and overlying water were used for further analyses. For 17 stations (SA1–SA3, P1–P5, NA1–NA3, NA5–NA7, NA10–NA12) taken during expeditions SO237, SO223T, and M79/1, three replicate sediment samples from three MUCs (corresponds to one core per MUC) were taken in total per station (Supplementary Data 1). For the three stations (NA4*, NA8*, NA9*) from the expedition M139, two to four replicates from three MUCs (corresponds to one to two cores per MUC) per station were taken (Supplementary Data 1). Samples were either fixed with 70% molecular biology grade ethanol and stored at −80 °C or directly deep frozen at −80 °C.
DNA extraction, PCR amplification, and sequencing of 18S V9 rDNA metabarcodes
Ethanol-preserved sediments were treated in a speed vac for 45 min at 45 °C to evaporate the ethanol. For 17 stations (see above) taken during expeditions SO237, SO223T, and M79/1 the environmental DNA was extracted from 0.5 g sediment of each replicate sample (a total of 1.5 g per station) using the DNeasy Power Lyzer Power Soil DNA isolation kit (Qiagen, Hilden, Germany) according to the manufacturer’s protocol (Supplementary Data 1). For the three stations from the expedition M139 (see above) the environmental DNA was extracted from an adapted sample volume using the same kit (Supplementary Data 1). Prior to extraction with the kit, sediment samples were pre-washed with three washing solutions to improve the success of DNA amplification by PCR in marine sediments57. Total DNA was quantified using a Nanodrop Spectrophotometer. For sediment samples taken during the expeditions SO237, SO223T, and M79/1, DNA of the three replicates per station was pooled in equal concentrations prior to PCR amplifications. Sediment samples from the expedition M139 were separately PCR amplified without prior pooling of DNA per station to investigate small-scale patterns of deep-sea protist diversity. PCR amplifications of the hypervariable V9 region of the 18S rDNA gene were performed with the Phusion® High-Fidelity DNA Polymerase (ThermoFisher) and the forward/reverse primer-pair 1389F (5′-TTG TAC ACA CCG CCC-3′) and 1510R (5′-CCT TCY GCA GGT TCA CCT AC-3′)58. The PCR mixtures (25 µL final volume) contained 5 ng of total DNA template with 0.35 µM final concentration of each primer, 3% of DMSO, and 2× of GC buffer Phusion Master Mix (Finnzymes).
PCR amplifications (98 °C for 30 s; 25 cycles of 10 s at 98 °C, 30 s at 57 °C, 30 s at 72 °C; and 72 °C for 10 min) of all samples were carried out with a reduced number of cycles to avoid the formation of chimeras during the plateau phase of the reaction, and in triplicates (M139) or six replicates (SO237, SO223T, and M79/1) in order to smooth the intra-sample variance while obtaining sufficient amounts of amplicons for Illumina sequencing. PCR products were checked on a 1.5% agarose gel for amplicon lengths. Amplicons were then pooled and purified using the PCR Purification Kit (Jena Bioscience, Jena, Germany). Bridge amplification and paired-end (2 × 150 bp) sequencing of the amplified fragments were performed using an Illumina Genome Analyzers IIx system at the Cologne Center of Genomics (CCG).
Due to the lack of reference sequences for the V9 region in common public databases (e.g. NCBI, PR2), we generated a dataset consisting of the V9 region of 102 marine protist strains of our Heterotrophic Flagellate Collection Cologne (HFCC), of which several have not been published yet (Supplementary Data 2). Subsamples of a few milliliters of sediment suspension from the MUC samples (see above) were cultivated in 50 ml tissue-culture flasks (Sarstedt, Nümbrecht, Germany). Isolation was carried out using a micromanipulator or microtiter plates (liquid aliquot method59). All cultures were supplied with sterilized quinoa or wheat grains as an organic food source for autochthonous bacteria. After isolation, the strains were cultivated in 50 ml tissue-culture flasks (Sarstedt, Nümbrecht, Germany) filled with 30 ml Schmaltz-Pratt medium60 (35 PSU; per liter 28.15 g NaCl, 0.67 g KCl, 5.51 g MgCl2 × 6 H2O, 6.92 g MgSO4 × 7 H2O, 1.45 g CaCl2 × 2 H2O, 0.10 g KNO3, 0.01 g K2HPO4 × 3 H2O). The cultures were stored at 10 °C in the dark. Isolates were characterized morphologically using AVEC high-resolution video microscopy and electron microscopy. For molecular studies, protistan cultures were concentrated by centrifugation (4000 × g, 20 min at 4 °C, Megafuge 2.0R, Heraeus Instruments). Genomic DNA of each isolated protist strain was extracted using the Quick-gDNA™ Mini Prep Kit (Zymo Research, USA). We amplified a long sequence from the 18S rDNA to the 28S rDNA with the primers 18S-For (5′-AAC CTG GTT GAT CCT GCC AGT-3′, ref. 61) binding at the beginning of the 18S rDNA and either NLR1126/22 (5′-GCT ATC CTG AGG GAA ACT TCG G-3′, ref. 62) or NLR2098/24 (5′-AGC CAA TCC TTW TCC CGA AGT TAC-3′, ref. 62) binding in the 28S rDNA. PCR reactions were performed in 25 µl PCR reaction mixtures containing 5.5 µl ddH2O, 1.5 units TAQ (Mastermix, VWR Germany), 2 µl DNA and 2.5 µl of each primer (forward and reverse) at a final concentration of 1.6 nM.
The PCR conditions for amplifying the SSU–ITS–LSU region were as follows: pre-denaturation at 98 °C for 2 min, 35 cycles of 98 °C for 30 s, 55 °C for 45 s, and 72 °C for 4 min 30 s; final extension at 72 °C for 10 min. For bodonid strains, a different primer combination was used: 18SForBodo (5′-CTG GTT GAT TCT GCC AGT-3′, ref. 63) + NLR1126/22 (5′-GCT ATC CTG AGG GAA ACT TCG G-3′, ref. 62). Internal primers were used for sequencing (Supplementary Table 2). We established a new reference database for the V9 region by combining the Protist Ribosomal Reference database PR2 v4.11.1 (ref. 34) with the 102 sequences of marine protist strains of the Heterotrophic Flagellate Collection Cologne. Using Cutadapt64, the final in-house reference database, called V9_DeepSea33, was trimmed to the V9 region.
Downstream analyses and taxonomic assignment
Our bioinformatic pipeline (adapted from Frédéric Mahé, https://github.com/frederic-mahe/swarm/wiki/Fred’s-metabarcoding-pipeline) allowed filtering of high-quality V9 rDNA reads/amplicons and their clustering into OTUs (Supplementary Fig. 1). HiSeq sequencing resulted in ~223 million raw reads. Overlapping reads were assembled via VSEARCH v.2.13.4 (ref. 65) using --fastq_mergepairs with default parameters and --fastq_allowmergestagger, resulting in ~209 million assembled reads for all stations. Paired reads were retained for downstream analyses if they contained both forward and reverse primers and no ambiguously named nucleotides (Ns) using cutadapt and VSEARCH. Reads from all stations were combined in one file and de-replicated into strictly identical amplicons (metabarcodes) with VSEARCH while the information on their abundance was retained. Low abundance metabarcodes with a read abundance of one and two reads were removed from the dataset prior to OTU clustering in order to avoid potential biases associated with sequencing errors. Metabarcodes were clustered into biologically meaningful OTUs, using Swarm v2.1.5 (ref. 66), with the parameter d = 1 and the fastidious option on. OTUs were taxonomically assigned to our reference database V9_DeepSea33 using VSEARCH’s global pairwise alignment and --iddef 1 (matching columns/alignment length). Amplicons were assigned to their best hit, or co-best hits in the reference database, using a pipeline called Stampa67. The most abundant amplicon in each OTU was searched for chimeric sequences with the chimera search module of VSEARCH, and their OTUs were removed even if they occurred in multiple samples. Sequences with a quality value (min. expected error rate/sequence length) higher than 0.0002 were discarded. Reads shorter than 87 bp were removed from the dataset. Only OTUs with a pairwise identity of ≥80% to a reference sequence were used for downstream analyses.
In addition, OTUs were discarded, when a phylogenetic placement within the kingdom level was not possible. Furthermore, OTUs assigned to Metazoa, Fungi, Archaeplastida, and exclusively phototrophic organisms, including several classes of Ochrophyta (Eustigmatophyceae, Pelagophyceae, Phaeophyceae, Phaeothamniophyceae, Pinguiophyceae, Raphidophyceae, Synurophyceae, Xanthophyceae, Bacillariophyta, Chrysomerophyceae), Bacillariophytina, Filosa-Chlorarachnea within the cercozoans as well as the Cryptomonadales within the Cryptophyta, were removed (Supplementary Table 1), resulting in a final dataset of 40,623 heterotrophic protist OTUs and 55,283,811 reads. Except for Fig. 2, which compares the eukaryotic life of the deep sea with that of the euphotic zone, we used the final heterotrophic protist dataset for all graphs.
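The read-level filters described above can be illustrated with a short Python sketch. It is not the pipeline used in the study, which relied on VSEARCH, Cutadapt and Swarm; the function names, the in-memory data layout and the use of per-base Phred scores as input are assumptions made for illustration, and the order of the steps is simplified. Only the thresholds (0.0002 expected errors per base, 87 bp minimum length, removal of metabarcodes with one or two reads) are taken from the text.

```python
from collections import Counter

MAX_EE_PER_BASE = 0.0002  # expected errors / sequence length (from the text)
MIN_LENGTH = 87           # minimum read length in bp (from the text)
MIN_ABUNDANCE = 3         # metabarcodes with 1 or 2 reads are dropped


def expected_errors(phred_scores):
    """Sum of per-base error probabilities, p = 10^(-Q/10)."""
    return sum(10 ** (-q / 10) for q in phred_scores)


def passes_read_filters(sequence, phred_scores):
    """Hypothetical helper: True if a merged read survives all filters."""
    if "N" in sequence.upper():      # ambiguous nucleotides
        return False
    if len(sequence) < MIN_LENGTH:   # too short
        return False
    ee_per_base = expected_errors(phred_scores) / len(sequence)
    return ee_per_base <= MAX_EE_PER_BASE


def dereplicate(reads):
    """Collapse identical sequences into metabarcodes with abundances,
    then drop metabarcodes supported by fewer than MIN_ABUNDANCE reads."""
    counts = Counter(seq.upper() for seq in reads)
    return {seq: n for seq, n in counts.items() if n >= MIN_ABUNDANCE}
```

For a 100 bp read with a uniform Phred score of 40, the expected error per base is about 0.0001 and the read is kept; at a uniform Phred score of 30 it is about 0.001 and the read is discarded.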
Comparison of eukaryotic life in the sunlit ocean (Tara Ocean project) and the deep sea
For a comparative analysis of the total eukaryotic life in the sunlit ocean to our deep-sea NGS dataset, we downloaded the available “Database W4”11 containing the total V9 rDNA information organized at the metabarcode (unique sequences) level from the Tara Ocean project website (http://taraoceans.sb-roscoff.fr/EukDiv/#extraction). This table contained all the 1,521,174 metabarcodes from the 47 sampled stations and the abundance information per metabarcode (in total 568,976,385 reads). We extracted this information together with the V9 sequence metabarcode and pooled these Tara Ocean metabarcodes with our deep-sea metabarcodes of 20 stations together in one file. Dereplication, clustering of metabarcodes in OTUs using Swarm, assigning the taxonomy of the representative OTU sequence to the V9-DeepSea reference database, and filtering (see steps in downstream analyses and taxonomic assignment) led to a final dataset of 123,120 eukaryotic OTUs and 589,807,407 reads. Taxonomic groups with more than 1,000 OTUs were here defined as hyperdiverse (see Fig. 2), as conducted within the framework of Tara Ocean11.
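As a minimal sketch of the hyperdiversity criterion, the following Python function counts OTUs per taxonomic group and returns the groups that reach the threshold. The function name and the dictionary input format are hypothetical. Note that the text gives the threshold both as "≥1000 OTUs" and as "more than 1,000 OTUs"; the sketch assumes the inclusive form.

```python
from collections import Counter

HYPERDIVERSE_MIN_OTUS = 1000  # inclusive threshold assumed here


def hyperdiverse_groups(otu_to_group, threshold=HYPERDIVERSE_MIN_OTUS):
    """Return, sorted by name, the taxonomic groups whose OTU count
    reaches the threshold. `otu_to_group` maps OTU id -> group name."""
    otus_per_group = Counter(otu_to_group.values())
    return sorted(g for g, n in otus_per_group.items() if n >= threshold)
```

A call such as `hyperdiverse_groups(assignments)` would then list the lineages flagged as hyperdiverse for a given OTU table.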
Statistics and reproducibility
Stampa plots were applied to visualize our taxonomic coverage assessment to the reference database sequences. A high proportion of environmental reads assigned with a high similarity to references indicates a good coverage, while low similarity values indicate a lack of coverage67. Statistical analyses were conducted with R v.3.5.2 and graphs were created with the R package “ggplot2”68. The alpha diversity of each of the stations was assessed based on several different indices with regard to species (OTU) richness and their evenness of distribution (read abundance) including the Shannon–Wiener Index, effective number of species, Simpson’s Index, Pielou evenness, and Chao1 index (see Supplementary Table 2) implemented in the “fossil” package69. The total species richness and the species richness per depth region (bathyal, abyssal, hadal) were estimated with the incidence-based coverage estimator (ICE) using the “fossil” package. As we expected many rare species in deep-sea protist communities, we used ICE to appropriately estimate asymptotic species richness from datasets with many rare species32,70. Rarefaction curves were additionally used in order to investigate the degree of sample saturation by calling the function “rarefy” implemented in the “vegan” package71. We fit the Preston’s log-normal model to abundance (read) data by calling the function “prestonfit” within the “vegan” package, which groups species frequencies into doubling octave classes and fits Preston’s log-normal model. We used the function “veiledspec” to calculate the total extrapolated richness from the fitted Preston model, resulting in an extrapolated richness of 44,657 OTUs. Binary-Jaccard distances were used as a measure of beta-diversity by calling the function “vegdist” within the “vegan” package. The Jaccard distance values were then used for the unweighted pair-group method with arithmetic means (UPGMA) cluster analyses (“hclust” function).
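The alpha-diversity indices named above follow standard textbook definitions, which the following Python sketch spells out. It is not the R code used in the study; the function names are hypothetical. Two conventions are assumed and may differ from the implementation in the R packages: Simpson's index is given in its Gini-Simpson form (1 − Σp²), and Chao1 is given in its bias-corrected form.

```python
import math


def shannon(counts):
    """Shannon-Wiener index H' = -sum(p_i * ln p_i) over observed OTUs."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def effective_number_of_species(counts):
    """Exponential of the Shannon index (Hill number of order 1)."""
    return math.exp(shannon(counts))


def simpson(counts):
    """Gini-Simpson form, 1 - sum(p_i^2) (assumed convention)."""
    total = sum(counts)
    return 1 - sum((c / total) ** 2 for c in counts)


def pielou(counts):
    """Pielou evenness J = H' / ln(S), with S the observed richness."""
    richness = sum(1 for c in counts if c > 0)
    return shannon(counts) / math.log(richness) if richness > 1 else 0.0


def chao1(counts):
    """Bias-corrected Chao1: S_obs + f1 * (f1 - 1) / (2 * (f2 + 1)),
    where f1 and f2 are the numbers of singletons and doubletons."""
    s_obs = sum(1 for c in counts if c > 0)
    f1 = sum(1 for c in counts if c == 1)
    f2 = sum(1 for c in counts if c == 2)
    return s_obs + f1 * (f1 - 1) / (2 * (f2 + 1))
```

Each function takes the read counts of the OTUs found at one station; for a perfectly even community of four OTUs, the effective number of species is 4 and the Pielou evenness is 1.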
Results of the cluster analyses were visualized in dendrograms by using “ggplot2”. Bootstrap analyses of clusters were conducted by using the function “clusterboot” with 500,000 bootstrap replicates within the “fpc” package72. Venn diagrams73,74 were used to visualize the number of shared and unique OTUs between the three depth zones and between the three stations where we investigated the small-scale distribution. Heatmaps were created by using the package “pheatmap”75. Read abundances per division level were scaled with the same scaling as applied by the heatmap.2() function, i.e., (x − mean(x))/sd(x). Sample sizes and replicate details are described in the other parts of the Methods (see also refs. 76,77,78,79,80,81,82 and the supplementary tables).
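The scaling formula (x − mean(x))/sd(x) can be sketched in Python as follows. This is an illustration, not the authors' R code. It assumes the sample standard deviation (n − 1 denominator), which is what R's sd() computes, and it maps constant rows to zeros, which is a choice made for this sketch only. The function name and the toy read counts are hypothetical.

```python
from statistics import mean, stdev


def zscore_row(values):
    """Scale one row as (x - mean(x)) / sd(x), using the sample SD (n - 1).

    Requires at least two values, because the sample SD is undefined otherwise.
    """
    m = mean(values)
    s = stdev(values)
    if s == 0:
        # Constant row: no variation to scale; mapped to zeros (assumption).
        return [0.0 for _ in values]
    return [(v - m) / s for v in values]


# Toy read abundances per division across three stations (hypothetical values).
reads_per_division = {
    "Division A": [1, 2, 3],
    "Division B": [4, 4, 4],
}
scaled = {name: zscore_row(reads) for name, reads in reads_per_division.items()}
```

Scaling each division separately puts abundant and rare divisions on a common scale, so the heatmap colors reflect relative differences between stations rather than absolute read numbers.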
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
The data analyzed in this study are deposited at the Sequence Read Archive (SRA) under BioProject ID PRJNA635512, BioSamples SAMN15042370-SAMN15042370. The 18S rDNA sequences from 50 HFCC strains are deposited at GenBank under the accession numbers MT355104–MT355153. Accession numbers of all 102 strains within our V9_DeepSea reference database33 can be found in Supplementary Data 2. The deep-sea reference database V9_DeepSea33 can be downloaded from Zenodo.
Data collection: Protist Ribosomal Reference database PR2 v4.11.1 and Database W4 from the Tara Oceans project website (http://taraoceans.sb-roscoff.fr/EukDiv/#extraction). Data analysis: Downstream analysis of NGS raw data as described in https://github.com/frederic-mahe/swarm/wiki/Fred’s-metabarcoding-pipeline and in our Methods section. Statistical analyses were conducted with R v.3.5.2 (packages: fossil, ggplot2, vegan, fpc).
Danovaro, R., Snelgrove, P. V. R. & Tyler, P. Challenging the paradigms of deep-sea ecology. Trends Ecol. Evol. 29, 465–475 (2014).
Ebbe, B. et al. In Life in the World’s Oceans: Diversity, Distribution, and Abundance (ed. McIntyre, A. D.) 139–160 (Blackwell Publishing Ltd, 2010).
Edgcomb, V. Marine protist associations and environmental impacts across trophic levels in the twilight zone and below. Curr. Opin. Microbiol. 31, 169–175 (2016).
Bienhold, C., Zinger, L., Boetius, A. & Ramette, A. Diversity and biogeography of bathyal and abyssal seafloor bacteria. PLoS ONE 11, e0148016 (2016).
del Campo, J. & Massana, R. Emerging diversity within chrysophytes, choanoflagellates and bicosoecids based on molecular surveys. Protist 162, 435–448 (2011).
López-García, P., Rodríguez-Valera, F., Pedrós-Alió, C. & Moreira, D. Unexpected diversity of small eukaryotes in deep-sea Antarctic plankton. Nature 409, 603–607 (2001).
Gooday, A. J., Schoenle, A., Dolan, J. R. & Arndt, H. Protist diversity and function in the dark ocean—challenging the paradigms of deep-sea ecology with special emphasis on foraminiferans and naked protists. Eur. J. Protistol. 75, 125721 (2020).
Caron, D. A. et al. Probing the evolution, ecology and physiology of marine protists using transcriptomics. Nat. Rev. Microbiol. 15, 6–20 (2017).
Jürgens, K. & Massana, R. In Microbial Ecology of the Oceans (ed. Kirchman, D. L.) 383–441 (Wiley, 2008).
Moran, M. A. The global ocean microbiome. Science 350, aac8455 (2015).
de Vargas, C. et al. Eukaryotic plankton diversity in the sunlit ocean. Science 348, 1261605 (2015).
Azam, F. et al. The ecological role of water-column microbes in the sea. Mar. Ecol. Prog. Ser. 10, 257–263 (1983).
Patterson, D. J., Nygaard, K., Steinberg, G. & Turley, C. M. Heterotrophic flagellates and other protists associated with oceanic detritus throughout the water column in the mid North Atlantic. J. Mar. Biol. Assoc. UK 73, 67 (1993).
Worden, A. Z. et al. Rethinking the marine carbon cycle: factoring in the multifarious lifestyles of microbes. Science 347, 1257594 (2015).
Arndt, H. et al. In The Flagellates—Unity, Diversity and Evolution (eds. Leadbeater, B. S. & Green, J. C.) 240–268 (Taylor & Francis Ltd, 2000).
Boenigk, J. & Arndt, H. Bacterivory by heterotrophic flagellates: community structure and feeding strategies. Antonie van Leeuwenhoek 81, 465–480 (2002).
Caron, D. A., Davis, P. G., Madin, L. P. & Sieburth, J. M. Heterotrophic bacteria and bacterivorous protozoa in oceanic macroaggregates. Science 218, 795–797 (1982).
Gooday, A. J. Biological responses to seasonally varying fluxes of organic matter to the ocean floor: a review. J. Oceanogr. 58, 305–332 (2002).
Molari, M., Manini, E. & Dell’Anno, A. Dark inorganic carbon fixation sustains the functioning of benthic deep-sea ecosystems. Glob. Biogeochem. Cycles 27, 212–221 (2013).
Pasulka, A. et al. SSU-rRNA gene sequencing survey of benthic microbial eukaryotes from Guaymas Basin hydrothermal vent. J. Eukaryot. Microbiol. 66, 637–653 (2019).
Stoeck, T., Taylor, G. T. & Epstein, S. S. Novel eukaryotes from the permanently anoxic Cariaco Basin (Caribbean Sea). Appl. Environ. Microbiol. 69, 5656–5663 (2003).
Pachiadaki, M. G. et al. In situ grazing experiments apply new technology to gain insights into deep-sea microbial food webs. Deep Sea Res. Part II Top. Stud. Oceanogr. 129, 223–231 (2016).
Cordier, T., Barrenechea, I., Lejzerowicz, F., Reo, E. & Pawlowski, J. Benthic foraminiferal DNA metabarcodes significantly vary along a gradient from abyssal to hadal depths and between each side of the Kuril-Kamchatka trench. Prog. Oceanogr. 178, 102175 (2019).
Pawlowski, J. et al. Eukaryotic richness in the abyss: insights from pyrotag sequencing. PLoS ONE 6, e18169 (2011).
Scheckenbach, F., Hausmann, K., Wylezich, C., Weitere, M. & Arndt, H. Large-scale patterns in biodiversity of microbial eukaryotes from the abyssal sea floor. Proc. Natl Acad. Sci. USA 107, 115–120 (2010).
Pernice, M. C. et al. Large variability of bathypelagic microbial eukaryotic communities across the world’s oceans. ISME J. 10, 945–958 (2016).
Schlitzer, R. Ocean Data View (2012). http://odv.awi.de.
Schoenle, A., Nitsche, F., Werner, J. & Arndt, H. Deep-sea ciliates: recorded diversity and experimental studies on pressure tolerance. Deep Sea Res. Part I Oceanogr. Res. Pap. 128, 55–66 (2017).
Živaljić, S. et al. A barotolerant ciliate isolated from the abyssal deep sea of the North Atlantic: Euplotes dominicanus sp. n. (Ciliophora, Euplotia). Eur. J. Protistol. 73, 125664 (2020).
Logares, R. et al. Disentangling the mechanisms shaping the surface ocean microbiota. Microbiome 8, 55 (2020).
Mahé, F. et al. Parasites dominate hyperdiverse soil protist communities in Neotropical rainforests. Nat. Ecol. Evol. 1, 0091 (2017).
Forster, D. et al. Benthic protists: the under-charted majority. FEMS Microbiol. Ecol. 92, fiw120 (2016).
Schoenle, A., Hohlfeld, M., Hermanns, K. & Arndt, H. V9_DeepSea (Deep Sea Reference Database) [Data set]. Commun. Biol., Zenodo https://doi.org/10.5281/zenodo.4305675 (2021).
Guillou, L. et al. The Protist Ribosomal Reference database (PR2): a catalog of unicellular eukaryote small sub-unit rRNA sequences with curated taxonomy. Nucl. Acids Res. 41, D597–D604 (2013).
Flegontova, O. et al. Extreme diversity of diplonemid eukaryotes in the ocean. Curr. Biol. 26, 3060–3065 (2016).
Clopton, R. E., Janovy, J. & Percival, T. J. Host stadium specificity in the gregarine assemblage parasitizing Tenebrio molitor. J. Parasitol. 78, 334–337 (1992).
Leander, B. S. Marine gregarines: evolutionary prelude to the apicomplexan radiation? Trends Parasitol. 24, 60–67 (2008).
del Campo, J. et al. Assessing the diversity and distribution of apicomplexans in host and free-living environments using high-throughput amplicon data and a phylogenetically informed reference framework. Front. Microbiol. 10, 2373 (2019).
Herndl, G. J. & Reinthaler, T. Microbial control of the dark end of the biological pump. Nat. Geosci. 6, 718–724 (2013).
Baker, P. et al. Potential contribution of surface-dwelling Sargassum algae to deep-sea ecosystems in the southern North Atlantic. Deep Sea Res. Part II Top. Stud. Oceanogr. 148, 21–34 (2018).
Boeuf, D. et al. Biological composition and microbial dynamics of sinking particulate organic matter at abyssal depths in the oligotrophic open ocean. Proc. Natl Acad. Sci. USA 116, 11824–11832 (2019).
Krause-Jensen, D. & Duarte, C. M. Substantial role of macroalgae in marine carbon sequestration. Nat. Geosci. 9, 737–742 (2016).
Xu, D. et al. Pigmented microbial eukaryotes fuel the deep sea carbon pool in the tropical Western Pacific Ocean. Environ. Microbiol. 20, 3811–3824 (2018).
Agusti, S. et al. Ubiquitous healthy diatoms in the deep sea confirm deep carbon injection by the biological pump. Nat. Commun. 6, 7608 (2015).
Schoenle, A. et al. Global comparison of bicosoecid Cafeteria-like flagellates from the deep ocean and surface waters, with reorganization of the family Cafeteriaceae. Eur. J. Protistol. 73, 125665 (2020).
Massana, R. et al. Gene expression during bacterivorous growth of a widespread marine heterotrophic flagellate. ISME J. 15, 154–167 (2021).
Živaljić, S. et al. Survival of marine heterotrophic flagellates isolated from the surface and the deep sea at high hydrostatic pressure: literature review and own experiments. Deep Sea Res. Part II Top. Stud. Oceanogr. 148, 251–259 (2018).
Lecroq, B. et al. Ultra-deep sequencing of foraminiferal microbarcodes unveils hidden richness of early monothalamous lineages in deep-sea sediments. Proc. Natl Acad. Sci. USA 108, 13177–13182 (2011).
Devey, C. W. et al. Habitat characterization of the Vema Fracture Zone and Puerto Rico Trench. Deep Sea Res. Part II Top. Stud. Oceanogr. 148, 7–20 (2018).
Levin, L. A. & Sibuet, M. Understanding continental margin biodiversity: a new imperative. Annu. Rev. Mar. Sci. 4, 79–112 (2012).
Gooday, A. J. In Encyclopedia of Ocean Science (eds. Cochran, J. et al.) 684–705 (Elsevier, 2019).
Vuillemin, A. et al. Archaea dominate oxic subseafloor communities over multimillion-year time scales. Sci. Adv. 5, eaaw4108 (2019).
De Corte, D., Paredes, G., Yokokawa, T., Sintes, E. & Herndl, G. J. Differential response of Cafeteria roenbergensis to different bacterial and archaeal prey characteristics. Microb. Ecol. 78, 1–5 (2019).
Ballen-Segura, M., Felip, M. & Catalan, J. Some mixotrophic flagellate species selectively graze on Archaea. Appl. Environ. Microbiol. 83, e02317–16 (2017).
Schoenle, A. et al. Methodological studies on estimates of abundance and diversity of heterotrophic flagellates from the deep-sea floor. J. Mar. Sci. Eng. 4, 22 (2016).
Schoenle, A. et al. New phagotrophic euglenids from deep sea and surface waters of the Atlantic Ocean (Keelungia nitschei, Petalomonas acorensis, Ploeotia costaversata). Eur. J. Protistol. 69, 102–116 (2019).
Danovaro, R. Methods for the Study of Deep-sea Sediments, their Functioning and Biodiversity (ed. Danovaro, R.) 181–196 (CRC Press, 2010).
Amaral-Zettler, L. A., McCliment, E. A., Ducklow, H. W. & Huse, S. M. A method for studying protistan diversity using massively parallel sequencing of V9 hypervariable regions of small-subunit ribosomal RNA genes. PLoS ONE 4, e6372 (2009).
Butler, H. & Rogerson, A. Temporal and spatial abundance of naked amoebae (gymnamoebae) in marine benthic sediments of the Clyde Sea area, Scotland. J. Eukaryot. Microbiol. 42, 724–730 (1995).
Goryatcheva, N. V. The cultivation of colourless marine flagellate Bodo marina. Biol. Inland Waters Bull. 11, 25–28 (1971).
Medlin, L., Elwood, H. J., Stickel, S. & Sogin, M. L. The characterization of enzymatically amplified eukaryotic 16S-like rRNA-coding regions. Gene 71, 491–499 (1988).
Van der Auwera, G., Chapelle, S. & De Wächter, R. Structure of the large ribosomal subunit RNA of Phytophthora megasperma, and phylogeny of the oomycetes. FEBS Lett. 338, 133–136 (1994).
Hillis, D. M. & Dixon, M. T. Ribosomal DNA: molecular evolution and phylogenetic inference. Q. Rev. Biol. 66, 411–453 (1991).
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet 17, 10–12 (2011).
Rognes, T., Flouri, T., Nichols, B., Quince, C. & Mahé, F. VSEARCH: a versatile open source tool for metagenomics. PeerJ 4, e2584 (2016).
Mahé, F., Rognes, T., Quince, C., de Vargas, C. & Dunthorn, M. Swarm v2: highly-scalable and high-resolution amplicon clustering. PeerJ 3, e1420 (2015).
Mahé, F. Stampa: sequence taxonomic assignment by massive pairwise alignments. https://github.com/frederic-mahe/stampa (2018).
Wickham, H. ggplot2: Elegant Graphics for Data Analysis (Springer, 2009).
Vavrek, M. J. Fossil: palaeoecological and palaeogeographical analysis tools. Palaeontol. Electron. 14, 1T (2011).
Colwell, R. K. et al. Models and estimators linking individual-based and sample-based rarefaction, extrapolation and comparison of assemblages. J. Plant Ecol. 5, 3–21 (2012).
Oksanen, J. et al. vegan: Community Ecology Package. The R Project for Statistical Computing. https://cran.r-project.org, https://github.com/vegandevs/vegan (2019).
Hennig, C. fpc: Flexible Procedures for Clustering. The R Project for Statistical Computing. https://www.unibo.it/sitoweb/christian.hennig/en/ (2019).
Chen, H. VennDiagram: Generate High-Resolution Venn and Euler Plots. The R Project for Statistical Computing. https://rdrr.io/cran/VennDiagram/ (2018).
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47 (2015).
Kolde, R. pheatmap: Pretty Heatmaps. The R Project for Statistical Computing. https://CRAN.R-project.org/package=pheatmap (2019).
Archibald, J. M., Simpson, A. G. B. & Slamovits, C. H. Handbook of the Protists. (eds. Archibald, J. M. et al.) 1–1657 (Springer, 2017).
Okamura, T. & Kondo, R. Suigetsumonas clinomigrationis gen. et sp. nov., a novel facultative anaerobic nanoflagellate isolated from the meromictic Lake Suigetsu, Japan. Protist 166, 409–421 (2015).
Rybarski, A. et al. Revision of the phylogeny of Placididea (Stramenopiles): molecular and morphological diversity of novel placidid protists from extreme aquatic environments. Eur. J. Protistol. (in press).
Scheckenbach, F., Wylezich, C., Weitere, M., Hausmann, K. & Arndt, H. Molecular identity of strains of heterotrophic flagellates isolated from surface waters and deep-sea sediments of the South Atlantic based on SSU rDNA. Aquat. Microb. Ecol. 38, 239–247 (2005).
Park, J. S. & Simpson, A. G. B. Characterization of halotolerant Bicosoecida and Placididea (Stramenopila) that are distinct from marine forms, and the phylogenetic pattern of salinity preference in heterotrophic stramenopiles: novel halotolerant heterotrophic stramenopiles. Environ. Microbiol. 12, 1173–1184 (2010).
Moriya, M., Nakayama, T. & Inouye, I. Ultrastructure and 18S rDNA sequence analysis of Wobblia lunata gen. et sp. nov., a new heterotrophic flagellate (Stramenopiles, Incertae Sedis). Protist 151, 41–55 (2000).
Živaljić, S. et al. Influence of hydrostatic pressure on the behaviour of three ciliate species isolated from the deep sea. Mar. Biol. 167, 63 (2020).
We are very grateful to Capt. Oliver Meyer, Uwe Pahl, Rainer Hammacher, and the scientific and technical crews for valuable help during sampling and the excellent support during the expeditions SO223T, SO237, M79/1, and M139. We thank Rosita Bieg, Brigitte Gräfe, and Bärbel Jendral (University of Cologne, Germany) for valuable technical support. This work was supported by grants from the Federal Ministry of Education and Research (BMBF; ProtAbyss 03G0237B and 02WRM1364) and by the German Research Foundation (DFG; AR 288/5, 10, 15, 23; MerMet 17-97; MerMet 17-11; CRC 1211 B02/03 268236062) to H.A.; C.d.V. was supported by the French Government “Investissements d’Avenir” program OCEANOMICS (ANR-11-BTBR-0008); F.N. was supported by the German Research Foundation (FN 1097/3).
Open Access funding enabled and organized by Projekt DEAL.
The authors declare no competing interests.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Schoenle, A., Hohlfeld, M., Hermanns, K. et al. High and specific diversity of protists in the deep-sea basins dominated by diplonemids, kinetoplastids, ciliates and foraminiferans. Commun Biol 4, 501 (2021). https://doi.org/10.1038/s42003-021-02012-5