Improved 18S and 28S rDNA primer sets for NGS-based parasite detection

Kounosu, Asuka; Murase, Kazunori; Yoshida, Akemi; Maruyama, Haruhiko; Kikuchi, Taisei

doi:10.1038/s41598-019-52422-z

Download PDF

Article
Open access
Published: 31 October 2019

Improved 18S and 28S rDNA primer sets for NGS-based parasite detection

Asuka Kounosu¹,
Kazunori Murase¹,
Akemi Yoshida¹,
Haruhiko Maruyama¹ &
…
Taisei Kikuchi ORCID: orcid.org/0000-0003-2759-9167¹

Scientific Reports volume 9, Article number: 15789 (2019) Cite this article

21k Accesses
31 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The development and application of next-generation sequencing (NGS) have enabled comprehensive analyses of the microbial community through extensive parallel sequencing. Current analyses of the eukaryotic microbial community are primarily based on polymerase chain reaction amplification of 18S rRNA gene (rDNA) fragments. We found that widely-used 18S rDNA primers can amplify numerous stretches of the bacterial 16S rRNA gene, preventing the high-throughput detection of rare eukaryotic species, particularly in bacteria-rich samples such as faecal material. In this study, we employed in silico and NGS-based analyses of faecal samples to evaluated the existing primers targeting eukaryotic 18S and 28S rDNA in terms of avoiding bacterial read contamination and improving taxonomic coverage for eukaryotes, with a particular emphasis on parasite taxa. Our findings revealed that newly selected primer sets could achieve these objectives, representing an alternative strategy for NGS.

Single-cell RNA-seq of the rare virosphere reveals the native hosts of giant viruses in the marine environment

Article 11 April 2024

Amir Fromm, Gur Hevroni, … Assaf Vardi

Evaluation of 16S rRNA gene sequencing for species and strain-level microbiome analysis

Article Open access 06 November 2019

Jethro S. Johnson, Daniel J. Spakowicz, … George M. Weinstock

Lineage dynamics of the endosymbiotic cell type in the soft coral Xenia

Article Open access 17 June 2020

Minjie Hu, Xiaobin Zheng, … Yixian Zheng

Introduction

Next-generation sequencing (NGS) using the 16S rRNA gene (16S rDNA) has been widely used to examine bacterial diversity^1,2. In our previous study, we applied NGS using the 18S rRNA gene (18S rDNA) to analyse eukaryotic parasite diversity^3,4. Compared with conventional methods which rely on host dissections and/or microscopic observations, NGS-based methods are easy and sufficiently sensitive for high-throughput analyses^3,4.

18S rDNA has been widely used for the identification and diversity analyses of eukaryotes because it is well conserved among species and it contains variable regions^5,6. Within 18S rDNA, hypervariable regions 4 (V4) and 9 (V9) are currently the popular options for NGS-based analyses^5,6,7,8. The Earth Microbiome Project (EMP), which aims to construct a global catalogue of the uncultured microbial diversity on the Earth⁹, recommends the use of primers that amplify a short fragment (approximately 150 bp) containing the V9 region of 18S rDNA for eukaryote analyses. As with those studies using PCR, primer selection is a critical factor for successful NGS-based analyses because non-universal primers are subject to taxonomic biases. Along with the elongation of the read length by the Illumina sequencer, some recent studies seeking to develop improved primer sets which amplify longer fragments have compared 18S rDNA among all eukaryotes^5,6 or specific taxa⁸ via in silico sequence analysis and identified conserved regions best suited for amplifying the hypervariable regions. However, although those primer sets were designed to amplify eukaryotic 18S rDNA fragments, several bacterial 16S rDNA fragments were also amplified^10,11,12, indicating their poor specificity. Specifically, for bacteria-rich samples, such as faecal material, bacterial read contamination represents a critical drawback, preventing the detection of rare eukaryotic species. In addition, refined classification of the detected reads to the genus or species level is often difficult using 18S rDNA primers because the amplicon sequence does not represent sufficient sequence diversity to distinguish closely related genera or species. Other genomic regions, such as the large subunit (LSU) of rDNA, which varies from 25S to 28S in size depending on the species (in this article we use “28S rDNA” to refer eukaryotic LSU), or the ITS regions of rDNA, which show higher diversity than 18S rDNA, represent alternative targets for PCR amplification^13,14.

In this study, we sought to identity primer sets that provide high taxonomic resolution and less bacterial read contamination to investigate eukaryotic microbial diversity with a particular emphasis on parasitic taxa. Primer screening was performed using 18S and 28S rDNA via in silico sequence analyses, and selected primers were further evaluated for sensitivity, specificity, taxonomic discrimination capacity, amplification efficiency and reproducibility via quantitative PCR (qPCR) and NGS analyses of faecal samples obtained from parasite-infected animals.

Results

In silico screening

For NGS-based analyses of eukaryote diversity, previous studies have mostly used 1391F/EukBr, as recommended by EMP^10,15,16,17, or 563F/1132R^{18,19,20,21,22}, which targets the V9 or V4-5 regions of 18S rDNA, respectively. In this article, these two primer sets are referred to as ‘conventional primer sets’ and used as comparators. To identify primer pairs that can efficiently detect a wide variety of parasites while avoiding bacterial DNA amplification for use in NGS-based parasite detection, we screened all possible 18S and 28S rDNA primers. Some previous studies have extensively tested 18S rDNA primers in silico to design universal eukaryotic primers to be used as standards for NGS-based analyses of eukaryote diversity^5,6,13. Therefore, we used these recommended 18S rDNA primer sets to re-evaluate for detection of parasitic taxa groups (Table 1; Supplementary Table S1). For 28S rDNA, we retrieved the possible universal primers (n = 52) (Supplementary Table S1) from previous reports^13,23,24 and screened them based on their melting temperatures (Tm) and amplicon sizes (Materials and Methods), yielding 13 primer pairs (Table 2).

Table 1 List of primer sets targeting the 18S rRNA gene and their coverage in 14 taxonomic groups.

Full size table

Table 2 List of primer sets targeting the 28S rRNA gene and their coverages in 14 taxonomic groups.

Full size table

We then evaluated those primer pairs based on their sequence identity with eukaryotic parasites, fungi and bacteria using the SILVA non-redundant sequence dataset (Tables 1 and 2). The EMP primer set (1391F/EukBr) can be used to detect a wide variety of parasitic taxa, exhibiting 43.4–66.7% coverage in majority of the tested taxa, excluding Nematoda, Platyhelminthes, Longamoebia and Fungi. The other conventional primer set 563F/1132R exhibited higher coverage than the EMP primer set (≥87.4% excluding Haemosporodia). However, these two primer sets showed similarity with bacterial 16S rDNA sequences (13.0% and 89.9%, respectively). Although the other 18S primer sets demonstrated lower taxonomic coverage for eukaryotes than 563F/1132R, they appeared to amplify less bacterial rDNA. For instance, 574F/952R showed low coverage for Nematoda, Fornicata and Parabasalia. Moreover, 574F/952R, 574*F/952R and 1183F/1631R showed low coverage for Acanthocephala, while 1183F/1631R showed low coverage for Parabasalia, Haemosporidia and Entamoebida, despite their low similarities to bacterial sequences (coverage <0.1%). These tendencies were also observed when we used strict or mild parameters for taxonomic coverage evaluations (Tables S2 and S3). Based on these results, 616*F/1132R and 1183F/1631R were selected for further evaluation as the best primer sets for the 18S V4-5 and V7-8 regions, respectively.

Data for the 28S rDNA primers are summarised in Tables 2, S2 and S3. All 28S primer sets showed low similarity with bacterial sequences. The taxonomic coverage for some eukaryotic parasites was variable, especially for some protozoan groups including Haemosporidia, Fornicata, Parabasalia and Entamoebida. On the other hand, the coverage for the other parasitic taxa were not very different, although the primer sets designed for the D3–D5 regions showed low coverage for Nematoda, Platyhelminthes and Discicristata. Among the five primer sets designed based on the D8–D9 regions, GA20F/RM8R, RM7F/RM9R and GA20F/RM9R exhibited wide taxonomic coverage except for Entamoebida. Based on these results, we selected seven 28S primer sets, namely DM568F/RM2R, RM2F/RM3R, RM3F/RM4R, GA12F/RM4R, GA20F/RM7R, GA20F/RM8R and GA20F/RM9R, for further evaluation.

All the 18S and 28S primers tested showed high coverages (>60%) for Euteleostomi which includes their possible hosts. In particular, human and mouse DNA are likely to be amplified by all primers tested (Tables 1 and 2).

qPCR

To confirm whether the selected primer sets could efficiently amplify eukaryotic rDNA without dimer or hairpin structure generation, we performed qPCR using C. elegans DNA as representative eukaryote DNA because all selected primers displayed 100% sequence similarity with C. elegans rRNA. 18S rDNA from 0.1 ng of C. elegans genomic DNA (final concentration, 0.01 ng/µl), which corresponds to ~200,000 copies of rRNA, was amplified at ~21 cycles (mean Ct ± SD = 21.28 ± 0.71) using the EMP primer set (1391F/EukBr), corresponding to the amplification efficiency of 80–87%. 1183F/1631R and 563F/1132R exhibited similar PCR efficiencies, whereas 616*F/1132R showed lower efficiency (Supplementary Fig. S1A). To assess the avoidance of bacterial DNA amplification, we used a bacterial DNA mixture. Amplification from 0.1 ng of bacterial DNA (final concentration of 0.01 ng/µl), which corresponds to ~20000 copies of rRNA, required ~27 cycles (mean Ct ± SD = 27.10 ± 0.81) using the EMP primer set. Ct difference between eukaryotic and bacterial DNA was the largest for 1183F/1631R, followed by 616*F/1132R, whereas this difference was the smallest for the EMP primer set. The results for 28S rDNA primer sets are shown in Supplementary Fig. S1B. Amplification efficiencies of the 28S primer sets for C. elegans DNA exceeded 70% except for that of RM3F/RM4R, which required 12–16 additional cycles for amplification. All 28S primers demonstrated lower sensitivity to bacterial DNA and required more than 10 additional PCR cycles compared with the EMP primer set.

The detection limits of C. elegans DNA using the primer sets were 0.2–2 pg, corresponding to 1–10 C. elegans cells. Products were detected for no-template negative controls at approximately 30 cycles using RM2F/RM3R compared with approximately 35 cycles using the EMP primer sets. No non-specific amplification was detected with 40 cycles using the other primer sets.

Based on these results, we selected one primer set for each variable region of 28S rDNA, namely DM568F/RM2R for D3-4, RM2F/RM3R for D4-5, GA12F/RM4R for D5-6 and GA20F/RM9R for D8–9.

Deep sequencing

Next, we performed MiSeq analysis of 18S or 28S rDNA amplicons using the two conventional and six newly selected primer sets. We used DNA extracted from the faeces of wild rats and a domesticated bovid as templates, which were anticipated to be highly rich in bacteria. Our previous morphological observations have revealed that five rats (i.e., WR4–8) were heavily infected with parasitic nematodes, while one rat (ZR4) was infected with tapeworms⁴. In contrast, the bovine sample (MB1) was rich in protozoan parasites. In total, 1,311,788 high-quality reads, with a mean of 23,425 reads per test (samples × primers), were obtained via Illumina MiSeq (Table S4).

Taxonomic classification of the sequence reads revealed that EMP primer set (1391F/EukBr) amplicons contained numerous bacterial reads, with the highest observed in ZR4 (approximately 53%) and the lowest in WR5 (approximately 10%) (Fig. 1; Supplementary Table S5). 563F/1132R amplicons contained more bacterial reads than the EMP primer set amplicons for all samples. In particular, MB1 contained approximately 97% bacterial reads. 616F*/1132R amplicons contained fewer bacterial reads (<10%) than EMP primer set amplicons for all samples. The other primer set amplicons (i.e. 1183F/1631R, DM568F/RM2R, RM2F/RM3R, GA12F/RM4R and GA20F/RM9R) contained none or only a few bacterial reads (0–0.15%).

Archaea reads were detected in EMP, 616*F/1132R and GA20/RM9R amplicons, although few (<5%), except for MB1 amplicons of the EMP primer set which contained >30% Archaea reads. Relatively more unassigned reads (displaying no similarity with sequences in the database) were detected in MB1 amplicons of EMP, 616*F/1132R and 1183F/1631R primer sets (8.5%, 9.0% and 10.7%, respectively). Other combinations of primer sets and DNA samples revealed few unassigned reads (<2.2%).

Numbers of operational taxonomic units (OTUs) detected using the EMP primer set ranged from 150 to 400 (Table S4). However, approximately half of the OTUs were assigned to either Bacteria or Archaea. Although the number of OTUs detected using 563F/1132R was the highest for each DNA sample (>1200 OTUs), after removing the bacteria and archaea reads, this number became the lowest among all primer sets. Other primer sets detected few or no bacterial OTUs, and the eukaryotic OTUs ranged from 40 to 500. Among these, RM2F/RM3R detected the lowest number of OTUs in the six rat samples, whereas GA12/RM4R detected the lowest numbers in the bovine sample. Overall, the six newly selected primer sets more readily avoided bacterial DNA amplification than the conventional primer sets. Finer classifications after removing the bacteria and archaea reads are shown in Figs 2–4.

Nematode-infected samples

At the phylum level (SILVA level 7) classification, all primer sets exhibited similar taxon distribution patterns in WR4, although small proportional differences were noted (Fig. 2). Many reads (46–91%) were assigned to the phylum Nematoda and some were assigned to the phylum Chordata and sub-phylum Saccharomycotina using all primer sets. At SILVA level 10, Nematoda reads were further classified to the family or genus level. Using the three 18S primer sets (i.e. EMP, 563F/1132R and 616*F/1132R), many (>85%) Nematoda reads was assigned to ‘Rhabditida; Ambiguous’. Conversely, using 1183F/1631R, the proportion of ambiguous taxa became smaller and more reads were assigned to genera such as Strongyloides and Ancylostoma. Using the 28S primer sets, no ‘Rhabditida; Ambiguous’ reads were detected and all Nematoda reads were subdivided into genera, including Heligmosomoides, Nippostrongylus and Strongyloides; this trend was similar to the nematode taxon distribution observed in our previous morphological identification⁴. Similar results were noted in other WR samples (i.e. WR5, WR7 and WR8), although differences in minor taxon distributions were observed (Supplementary Tables S7 and S8). In WR6, Eimeriorina was detected in addition to Nematoda, Chordata and Saccharomycotina using all primer sets (Fig. 2). At SILVA level 10, the Eimeriorina reads were further classified into Eimeria or ‘Eimeriorina; Ambiguous’ taxa.

Tapeworm-infected samples

ZR4 harboured Hymenolepis tapeworms in its intestine. At SILVA level 7, all 18S primer sets detected high proportions of Platyhelminthes as well as Saccharomycotina and Trichomonas (or ‘Trichomonas; Ambiguous’) (Fig. 3; Supplementary Table S7). The three 28S primer sets did not detect Trichomonas reads, while GA20F/RM9R detected few Trichomonas reads (approximately 0.1%); therefore, Saccharomycotina and Platyhelminthes occupied higher proportions of the total reads using the 28S primer sets than using the 18S primer sets. Chordata reads were detected by all 18S primers and two 28S primer sets (i.e., DM568F/RM2R and RM2F/RM3R). Mastotermes was detected only by the GA20F/RM9R primer set. At SILVA level 10, Platyhelminthes reads were further classified to the order Cyclophyllidea using the 18S primer sets and to the genus Hymenolepis using the 28S primer sets. Saccharomycotina was further classified to the order Saccharomycetales using the 18S primer sets and to the genera Saccharomyces and Kazachstania using the 28S primer sets.

Protozoa-rich samples

Various protozoa occur in the bovine gastrointestinal tract; thus, protozoal cysts are frequently detected in faecal samples. Most of these protozoa form a part of the normal ruminal microflora called ciliated protozoa^25,26; however, some of these, such as Eimeria (Coccidia), Cryptosporidium, Giardia, Entamoeba and Trichomonas, are pathogenic and thus possess clinical significance²⁷. In this study, we used bovine faeces as prototypical protozoa-rich samples.

At SILVA level 4, the EMP primer set (1391F/EukBr) detected reads assigned in descending order to Retaria, Parabasalia, Fungi, Stramenopiles, Apicomplexa and Metazoa (Fig. 4; Supplementary Table S6). Concomitantly, approximately 30% of the reads did not share similarities with known rDNA sequences in the database (unassigned). The other 18S primer sets produced similar patterns as the EMP primer sets. Approximately 10–40% of the reads were unassigned, and the remaining reads were primarily assigned to Parabasalia, Fungi, Apicomplexa and Stramenopiles. The largest differences from the EMP primer sets were Retaria and Entamoebida, which were detected only by the EMP and only by the other three 18S primer sets, respectively. The 28S primer sets detected lower proportions of unassigned reads than the 18S primer sets. The main detected taxa were similar between the 28S and 18S primer sets. However, Parabasalia reads were not detected by DM568F/RM2R, RM2F/RM3R and GA12F/RM4R, whereas Entamoebida reads were not detected by GA12F/RM4R and GA20F/RM9R. Instead of ‘Stramenopiles; Incertae sedis’, four 28S primer sets detected Stramenopiles; Blastocystis, although the proportion with RM2F/RM3R was minute.

At SILVA level 7, Parabasalia detected by the 18S primer sets were further classified into Trichomonadea taxa, including Trichomonas, Ditrichomonas, Tetratrichomonas, Pentatrichomonas and Simplicimonas (Fig. 4; Supplementary Table S7). Apicomplexa were further classified to Eimeriorina using all primer sets. Fungi were subdivided into the orders Neocallimastigomycetena, Saccharomycotina and Pezizomycotina, albeit without noticeable differences among the primer sets.

Beta diversity analyses

A technical replicate experiment was performed from PCR amplification to MiSeq independently from the first experiment using the newly selected primer sets (Dataset 2; Supplementary Tables S9–S12). The dendrograms of cluster analysis based on the Bray–Curtis dissimilarity of taxon abundance from the two replicate experiments are shown in Fig. 5A. All replicates (Datasets 1–2) in MB1 and ZR4 were clustered together in the dendrogram, suggesting high reproducibility of the methods using these primer sets (Fig. 5A). For WR samples, although the replicates were largely clustered together, some technical replicates were nested within the other DNA samples (e.g. WR4 with WR7 and WR5 with WR8), perhaps because those samples showed very similar taxonomic compositions. Principal coordinates analysis (PCoA) plots were generated for ZR4, MB1 and WR samples (Fig. 5B–D, respectively). In the three plots, PCoA1 separated the samples based on the PCR target regions (18S or 28S), although the separation in the WR plot, which contained five DNA samples, was not as obvious as that in the other plots. Among the 28S primer sets, RM2F/RM3R and GA12F/RM4R were clustered together in all the plots, whereas DM568F/RM2R and GA20F/RM4R were clustered together in the ZR and WR plots but not in the MB plot. Among the 18S primer sets, 563F/1132R and 616*/F1132R were clustered together in all the plots. These results correspond to the target regions of 18S or 28S (Tables 1 and 2).

Discussion

Bacterial read contamination of PCR amplicons often poses a critical problem in NGS-based analyses of eukaryotic diversity or diagnoses. In extreme cases, as with MiSeq of a bovine faecal sample in the present study, over 95% of the total sequence reads can be derived from bacterial DNA, making it difficult to detect rare eukaryotes in the samples. Increasing data acquisition may resolve this issue; however, presence of raw data with only one or two orders of magnitude than non-contaminated cases is inefficient and therefore prevents high-throughput analyses. The primer sets newly screened in this study, which can efficiently amplify rDNA from a wide range of eukaryotes without bacterial DNA amplification, are anticipated to be suitable tools for diversity analyses of eukaryotic microbes, including parasites.

At the same time, we noted that each primer set could not detect a specific taxonomy groups. According to our deep sequencing analysis, two of the 28S primer sets could not to detect Trichomonas species, which were detected by all other 18S primer sets. Spironucleus reads were detected from rat faeces using only one 28S primer set (GA20F/RM9R). Entamoeba could not be detected using the EMP and GA20F/RM9R primer sets. In addition, the results of in silico analysis suggested that one primer set is unlikely to cover all taxonomic groups of parasites. For instances, Plasmodium spp., one of the most medically important parasites, was difficult to detect using any of the tested 18S rDNA primer sets, although it may be detected using the two 28S rDNA primer sets. Trypanosoma and Leishmania, two other important parasitic genera, could be detected only using 563F/1132R and 1183F/1631R among the tested primers. Collectively, these results suggest the importance of selecting primer sets according to the study objective.

To achieve fine taxonomic resolution, long sequences containing sufficient diversity to distinguish closely related species are essential. Although sequencing technologies capable of producing long sequences, such as PacBio and NanoPore, are available^28,29, these remain impractical for rDNA-based microbiome analyses because of their higher error rates and lower throughputs than those of Illumina sequencing. Therefore, many studies have used Illumina sequencing, for which the maximum length is 600 bp (300-bp paired-end). Although we used variable regions of 18S rDNA with fragment lengths ranging from 150 to 570 for taxonomic classification, we were unable to further assign the reads to the genus or species level in most cases. On the contrary, reads of 28S rDNA, which has higher sequence diversity than 18S rDNA¹³, could sometimes be further assigned to the genus level, suggesting that 28S rDNA represents a good option for studies in which finer classification is necessary. One of the challenges in 28S rDNA-based population analyses is the enlargement of the database because database sizes affect fine taxonomic classification. The current database (SILVA r132) contains 198,843 28S rDNA sequences compared with 695,171 18S rDNA sequences (https://www.arb-silva.de/). In addition, we discarded primer sets with amplicon sizes that were out of range even though they demonstrated good taxonomic coverages (Supplementary Table S13). These primers can be used as alternates if they are capable of amplifying sequences to meet the length requirement.

Host DNA contamination did not hamper analyses in this study. Small proportions of mammalian (Chordata) reads were detected with any combination of samples and primers. This is probably because the faecal samples used in this study were collected from wild animal and contained high number of eukaryotic microbes. However, our in-silico analysis revealed that all the tested primer sets theoretically cannot avoid amplification of host DNA. Therefore, when samples are expected to have small amounts of eukaryotic microbes, such as clinical samples from human or samples from well-kept pets, PCR blockers may be required, which prevent host DNA amplification^30,31,32. Applying taxon-specific primers is an alternative option to avoid amplification of host DNA. Recently, Cannon et al.³³ proposed a high-throughput method to detect a wide range of parasites by a combination of multiple taxon-specific primers. We tested those primers using our evaluation criteria and confirmed that those primer sets amplify each targeted taxa and can avoid host and bacterial DNA amplification (Table S14). Although this strategy requires optimisation for multiplex PCR (amplification of multiple targets in a single PCR) for high throughput studies and may require a reasonable normalisation method for amplification bias by each primer set for a reliable estimation of taxa distribution in a sample, the assay still has an advantages in customizability to easily include additional targeted taxa³³. Therefore, the primer sets selected in this study can be added to the multiplex assay, which could achieve more comprehensive “parasitome” analyses.

The benefits and drawbacks of the newly selected primer sets and conventional primers are summarised in Table 3. First, the newly selected primer sets could avoid bacterial DNA amplification. However, taxonomic coverage differed with each primer set. Ultimately, the primer sets should be selected according to the study objectives, taking the parasites that need to be covered and the required resolution into account. However, we recommend the use of 616*/F1132R for 18S rDNA or DM568F/RM2R for 28S rDNA, or a combination of those, as new standard primer sets for parasite detection because these provide wide taxonomic coverage of parasitic eukaryotes with minimal bacterial DNA contamination.

Table 3 A summary of 18S and 28S rDNA primer set evaluation.

Full size table

Methods

SSU and LSU primer screening

Potential universal primer sequences targeting eukaryote rDNA were obtained from previous studies^5,6,13,23,24. The primers were filtered to select primer pairs suitable for Illumina MiSeq analysis under the following criteria: Tm in the range of 55 °C–70 °C, a difference in Tm between the two primers of <5 °C and an amplicon size of 200–580 bp. These primer pairs were further evaluated for similarities with eukaryote and bacterial rDNA sequences using TestPrime 1.0 and the SILVA 132 database under the following parameters: maximum number of mismatch = 4 bp and the length of 0-mismatch zone at the 3′ end = 3 bp). We used the non-redundant reference dataset (Ref NR) build by a dereplication of the full reference set using a 99% identity criterion and were suggested by SILVA to be used as a representative dataset for classification, phylogenetic analysis and probe design.

DNA samples

A bacterial DNA mixture was prepared by combining 70 ng DNA extracted from pure cultures of seven bacterial species (Escherichia coli, Enterobacter sp., Serratia sp., Bacillus subtilis, Klebsiella pneumoniae, Group A Streptococcus and Staphylococcus epidermidis) using a QIAmp DNA Mini Kit (Qiagen). C. elegans DNA was extracted from approximately 10,000 worms using the same kit.

For MiSeq analyses, DNA extracted from the faeces of rats caught in the Miyazaki City Phoenix Zoo (ZR, Rattus rattus) or in Miyazaki downtown (WR, Rattus norvegicus) in our previous study⁴ were used. Faecal samples from a domesticated bovid (MB, Bos taurus) were provided by the veterinary parasitology lab of the University of Miyazaki, and DNA was extracted using a Maxwell RSC Purefood GMO Kit (Promega), as described previously³⁴.

qPCR

qPCR was performed to test the amplification efficiency of each primer set using C. elegans DNA or the bacterial DNA mixture as a template. Reactions were performed in triplicates using a StepOnePlus Real-Time PCR System (Applied Biosystems) under the following conditions: 95 °C for 10 min, followed by 40 cycles of 95 °C for 15 s, 50 °C for 30 s and 60 °C for 1 min (for 18S rDNA amplification), or 95 °C for 10 min, followed by 40 cycles of 95 °C for 15 s and 60 °C for 1 min (for 28S rDNA amplification). The reaction volume was 10 μl, including 5 μl of the Power SYBR Green PCR Master Mix (2x), 0.9 μM of each primer and 1 μl of DNA solution. To calculate the PCR efficiencies and detection limits, serial 10-fold dilutions of C. elegans DNA (1 ng to 0.01 pg) were used as templates.

MiSeq sequencing

PCR was performed using Tks Gflex DNA Polymerase (Takara), and a 30-µl reaction mixture containing 1 µl of template DNA (1–3 ng of DNA), 15 µl of 2 × Gflex buffer, 0.5 µl each of the forward/reverse primers with the Illumina MiSeq Adapter (10 µM final concentration), 0.5 µl (100 U) of DNA polymerase and 13 µl of nuclease-free H₂O. Reactions were performed using Veriti Thermal Cycler (Applied Biosystems) under the following conditions: 95 °C for 1 min, followed by 35 cycles of 95 °C for 15 s, 60 °C (for 28S rDNA amplification) or 50 °C (18S rDNA amplification) for 1 min and 68 °C for 1 min. Duplicate PCRs were performed independently, and the produced materials were then mixed. The PCR products were confirmed via agarose gel electrophoresis and purified using AMpure XP beads (Beckman Coulter). Index PCR was performed to attach dual indices and Illumina sequencing adapters to the first PCR products using the Nextera XT Index Kit (Illumina) and KAPA HiFi HotStart Ready Mix (Kapa Biosystems) under the following conditions: 95 °C for 3 min, followed by 8 cycles of 95 °C for 30 s, 55 °C for 30 s and 72 °C for 30 s, and the final extension at 72 °C for 5 min. The PCR product was cleaned using AMpure XP beads, pooled at equal concentrations and then sequenced using the MiSeq Reagent Nano Kit v3 (600 cycles) according to the manufacturer’s protocol (http://icom.illumina.com/) to produce 300-bp paired-end reads.

Bioinformatic analysis

Illumina sequence data were processed using QIIME version 1.9.1³⁵. Paired-end reads were joined using the ‘fastq-join’ method (join_paired_ends.py). After QIIME quality filtering (split_libraries_fastq.py: -store_qual_scores -q 9 -max_barcode_errors 2 -sequence_max_n 1 -max_bad_run_length 2 -p 0.5 –r 3), chimeric sequences were detected using the UCHIME algorithm, which is included in the free version of USEARCH61, and eliminated from further analyses. Cleaned reads were clustered and assigned to OTUs using the open-reference OTU-picking protocol with the SILVA 128 database³⁶ at 97% identity with ‘blast’ (pick_open_reference_otus.py).

Similarity in taxa composition and the relative abundance were analysed via PCoA and hierarchical cluster analyses using the Bray–Curtis similarity index with R vegan package³⁷.

Data availability

The sequencing data have been deposited to the DNA Data Bank of Japan Sequence Read Archive under the BioProject PRJDB3050.

References

Langille, M. G. et al. Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences. Nature biotechnology 31, 814 (2013).
Article CAS Google Scholar
Turnbaugh, P. J. et al. The effect of diet on the human gut microbiome: a metagenomic analysis in humanized gnotobiotic mice. Science translational medicine 1, 6ra14, https://doi.org/10.1126/scitranslmed.3000322 (2009).
Article CAS PubMed PubMed Central Google Scholar
Hino, A., Maruyama, H. & Kikuchi, T. A novel method to assess the biodiversity of parasites using 18S rDNA Illumina sequencing; parasitome analysis method. Parasitology international 65, 572–575, https://doi.org/10.1016/j.parint.2016.01.009 (2016).
Article CAS PubMed Google Scholar
Tanaka, R. et al. Assessment of helminth biodiversity in wild rats using 18S rDNA based metagenomics. PloS one 9, e110769, https://doi.org/10.1371/journal.pone.0110769 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Hadziavdic, K. et al. Characterization of the 18S rRNA gene for designing universal eukaryote specific primers. PloS one 9, e87624, https://doi.org/10.1371/journal.pone.0087624 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Hugerth, L. W. et al. Systematic design of 18S rRNA gene primers for determining eukaryotic diversity in microbial consortia. PloS one 9, e95567 (2014).
Article ADS Google Scholar
Stoeck, T. et al. Multiple marker parallel tag environmental DNA sequencing reveals a highly complex eukaryotic community in marine anoxic water. Molecular ecology 19(Suppl 1), 21–31, https://doi.org/10.1111/j.1365-294X.2009.04480.x (2010).
Article CAS PubMed Google Scholar
Bradley, I. M., Pinto, A. J. & Guest, J. S. Design and Evaluation of Illumina MiSeq-Compatible, 18S rRNA Gene-Specific Primers for Improved Characterization of Mixed Phototrophic Communities. Appl Environ Microbiol 82, 5878–5891, https://doi.org/10.1128/aem.01630-16 (2016).
Article CAS PubMed PubMed Central Google Scholar
Gilbert, J. A., Jansson, J. K. & Knight, R. Earth Microbiome Project and Global Systems Biology. mSystems 3, https://doi.org/10.1128/mSystems.00217-17 (2018).
Maritz, J. M. et al. An 18S rRNA Workflow for Characterizing Protists in Sewage, with a Focus on Zoonotic Trichomonads. Microbial ecology 74, 923–936 (2017).
Article CAS Google Scholar
Amaral-Zettler, L. A., McCliment, E. A., Ducklow, H. W. & Huse, S. M. A Method for Studying Protistan Diversity Using Massively Parallel Sequencing of V9 Hypervariable Regions of Small-Subunit Ribosomal RNA Genes. PLOS ONE 4, e6372, https://doi.org/10.1371/journal.pone.0006372 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Pawlowski, J. et al. Eukaryotic Richness in the Abyss: Insights from Pyrotag Sequencing. PLOS ONE 6, e18169, https://doi.org/10.1371/journal.pone.0018169 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Machida, R. J. & Knowlton, N. PCR primers for metazoan nuclear 18S and 28S ribosomal DNA sequences. PloS one 7, e46180, https://doi.org/10.1371/journal.pone.0046180 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Schoch, C. L. et al. Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for Fungi. Proceedings of the National Academy of Sciences of the United States of America 109, 6241–6246, https://doi.org/10.1073/pnas.1117018109 (2012).
Article ADS PubMed PubMed Central Google Scholar
Korajkic, A. et al. Changes in bacterial and eukaryotic communities during sewage decomposition in Mississippi river water. Water research 69, 30–39 (2015).
Article CAS Google Scholar
Porazinska, D. L. et al. Plant diversity and density predict belowground diversity and function in an early successional alpine ecosystem. Ecology 99, 1942–1952 (2018).
Article Google Scholar
Watts, M. P., Spurr, L. P., Gan, H. M. & Moreau, J. W. Characterization of an autotrophic bioreactor microbial consortium degrading thiocyanate. Applied microbiology and biotechnology 101, 5889–5901 (2017).
Article CAS Google Scholar
Huang, T. et al. Dalangtan Playa (Qaidam Basin, NW China): Its microbial life and physicochemical characteristics and their astrobiological implications. PloS one 13, e0200949, https://doi.org/10.1371/journal.pone.0200949 (2018).
Article CAS PubMed PubMed Central Google Scholar
Benitez, E. et al. Bottom-up effects on herbivore-induced plant defences: a case study based on compositional patterns of rhizosphere microbial communities. Scientific reports 7, 6251, https://doi.org/10.1038/s41598-017-06714-x (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Braithwaite, K. Innovative approaches to identifying the cause of chlorotic streak and new management strategies: final report 2013/357 (2017).
Mullins, M. T. Sample Collection and DNA Extraction Methods for Environmental DNA Metabarcoding in Headwater Streams, Eastern Kentucky University (2017).
Wylezich, C., Herlemann, D. P. & Jürgens, K. Improved 18S rDNA amplification protocol for assessing protist diversity in oxygen-deficient marine systems. Aquatic Microbial Ecology 81, 83–94 (2018).
Article Google Scholar
Van der Auwera, G., Chapelle, S. & De Wächter, R. Structure of the large ribosomal subunit RNA of Phytophthora megasperma, and phylogeny of the oomycetes. FEBS letters 338, 133–136 (1994).
Article Google Scholar
Moreira, D. et al. Global eukaryote phylogeny: combined small-and large-subunit ribosomal DNA trees support monophyly of Rhizaria, Retaria and Excavata. Molecular phylogenetics and evolution 44, 255–266 (2007).
Article CAS Google Scholar
Williams, A. G. Rumen holotrich ciliate protozoa. Microbiological reviews 50, 25–49 (1986).
CAS PubMed PubMed Central Google Scholar
Corliss, J. O. On the evolution and systematics of ciliated protozoa. Systematic Zoology 5, 68–91 (1956).
Article Google Scholar
Taylor, M. Protozoal disease in cattle and sheep. In practice-london-british veterinary association 22, 604–626 (2000).
Google Scholar
Branton, D. et al. In Nanoscience And Technology: A Collection of Reviews from Nature Journals 261–268 (World Scientific, 2010).
Buermans, H. & Den Dunnen, J. Next generation sequencing technology: advances and applications. Biochimica et Biophysica Acta (BBA)-Molecular Basis of Disease 1842, 1932–1941 (2014).
Article CAS Google Scholar
Belda, E. et al. Preferential suppression of Anopheles gambiae host sequences allows detection of the mosquito eukaryotic microbiome. Scientific reports 7, 3241, https://doi.org/10.1038/s41598-017-03487-1 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Comeau, A. M., Douglas, G. M. & Langille, M. G. Microbiome helper: a custom and streamlined workflow for microbiome research. mSystems 2, e00127–00116 (2017).
Article Google Scholar
Vestheim, H., Deagle, B. E. & Jarman, S. N. Application of blocking oligonucleotides to improve signal-to-noise ratio in a PCR. Methods in molecular biology (Clifton, N.J.) 687, 265–274, https://doi.org/10.1007/978-1-60761-944-4_19 (2011).
Article CAS Google Scholar
Cannon, M. V. et al. A high-throughput sequencing assay to comprehensively detect and characterize unicellular eukaryotes and helminths from biological and environmental samples. Microbiome 6, 195, https://doi.org/10.1186/s40168-018-0581-6 (2018).
Article PubMed PubMed Central Google Scholar
Afrin, T., Kounosu, A., Billah, M.-M., Murase, K. & Kikuchi, T. Evaluation of magnetic cellulose bead-based DNA extraction from faecal materials for high-throughput bacterial community analyses. Applied Entomology and Zoology 53, 281–286 (2018).
Article CAS Google Scholar
Caporaso, J. G. et al. QIIME allows analysis of high-throughput community sequencing data. Nature methods 7, 335–336, https://doi.org/10.1038/nmeth.f.303 (2010).
Article CAS PubMed PubMed Central Google Scholar
Quast, C. et al. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic acids research 41, D590–596, https://doi.org/10.1093/nar/gks1219 (2013).
Article CAS PubMed Google Scholar
Oksanen, J. et al. Vegan: community ecology package. R package version 1.17-4, http://cran.r-project.org. Acesso em 23, 2010 (2010).

Download references

Acknowledgements

We thank Akina Hino, Tanzila Afrin, Ryusei Tanaka, Vicky Hunt, Mark Bligh, Mana Abe and Yasunobu Maeda for helpful comments and support and Aya Adachi for technical assistance.

Author information

Authors and Affiliations

Division of Parasitology, Faculty of Medicine, University of Miyazaki, Miyazaki, 889-1692, Japan
Asuka Kounosu, Kazunori Murase, Akemi Yoshida, Haruhiko Maruyama & Taisei Kikuchi

Authors

Asuka Kounosu
View author publications
You can also search for this author in PubMed Google Scholar
Kazunori Murase
View author publications
You can also search for this author in PubMed Google Scholar
Akemi Yoshida
View author publications
You can also search for this author in PubMed Google Scholar
Haruhiko Maruyama
View author publications
You can also search for this author in PubMed Google Scholar
Taisei Kikuchi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.M. and T.K. conceived and designed the study. A.K. and A.Y. performed experiments. A.K., K.M. and T.K. analysed data. K.A. and T.K. wrote the manuscript from input from K.M. All authors reviewed and approved the manuscript.

Corresponding author

Correspondence to Taisei Kikuchi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kounosu, A., Murase, K., Yoshida, A. et al. Improved 18S and 28S rDNA primer sets for NGS-based parasite detection. Sci Rep 9, 15789 (2019). https://doi.org/10.1038/s41598-019-52422-z

Download citation

Received: 04 February 2019
Accepted: 07 October 2019
Published: 31 October 2019
DOI: https://doi.org/10.1038/s41598-019-52422-z

This article is cited by

A combined amplicon approach to nematode polyparasitism occurring in captive wild animals in southern China
- Hongyi Li
- Zhengjiu Ren
- Dongjuan Yuan
Parasites & Vectors (2024)
VESPA: an optimized protocol for accurate metabarcoding-based characterization of vertebrate eukaryotic endosymbiont and parasite assemblages
- Leah A. Owens
- Sagan Friant
- Tony L. Goldberg
Nature Communications (2024)
Unveiling microbial guilds and symbiotic relationships in Antarctic sponge microbiomes
- Mario Moreno-Pino
- Maria F. Manrique-de-la-Cuba
- Nicole Trefault
Scientific Reports (2024)
Genetic diversity and haplotype analysis of Leishmania tropica identified in sand fly vectors of the genera Phlebotomus and Sergentomyia using next-generation sequencing technology
- Amer Al-Jawabreh
- Suheir Ereqat
- Abedelmajeed Nasereddin
Parasitology Research (2023)
Worms and bugs of the gut: the search for diagnostic signatures using barcoding, and metagenomics–metabolomics
- Marina Papaiakovou
- D. Timothy J. Littlewood
- Cinzia Cantacessi
Parasites & Vectors (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

In silico screening

qPCR

Deep sequencing

Nematode-infected samples

Tapeworm-infected samples

Protozoa-rich samples

Beta diversity analyses

Discussion

Methods

SSU and LSU primer screening

DNA samples

qPCR

MiSeq sequencing

Bioinformatic analysis

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links