Phylogenomic analyses uncover origin and spread of the Wolbachia pandemic

Gerth, Michael; Gansauge, Marie-Theres; Weigert, Anne; Bleidorn, Christoph

doi:10.1038/ncomms6117

Article
Published: 06 October 2014

Phylogenomic analyses uncover origin and spread of the Wolbachia pandemic

Michael Gerth¹,
Marie-Theres Gansauge²,
Anne Weigert¹ &
…
Christoph Bleidorn^1,3

Nature Communications volume 5, Article number: 5117 (2014) Cite this article

6905 Accesses
90 Citations
10 Altmetric
Metrics details

Subjects

Abstract

Of all obligate intracellular bacteria, Wolbachia is probably the most common. In general, Wolbachia are either widespread, opportunistic reproductive parasites of arthropods or essential mutualists in a single group of filarial nematodes, including many species of medical significance. To date, a robust phylogenetic backbone of Wolbachia is lacking and consequently, many Wolbachia-related phenomena cannot be discussed in a broader evolutionary context. Here we present the first comprehensive phylogenomic analysis of Wolbachia supergroup relationships based on new whole-genome-shotgun data. Our results suggest that Wolbachia has switched between its two major host groups at least twice. The ability of some arthropod-infecting Wolbachia to universally infect and to adapt to a broad range of hosts quickly is restricted to a single monophyletic lineage (containing supergroups A and B). Thus, the currently observable pandemic has likely a single evolutionary origin and is unique within the radiation of Wolbachia strains.

You have full access to this article via your institution.

Download PDF

Complexity of avian evolution revealed by family-level genomes

Article 01 April 2024

Josefin Stiller, Shaohong Feng, … Guojie Zhang

Single-cell RNA-seq of the rare virosphere reveals the native hosts of giant viruses in the marine environment

Article 11 April 2024

Amir Fromm, Gur Hevroni, … Assaf Vardi

Evaluation of 16S rRNA gene sequencing for species and strain-level microbiome analysis

Article Open access 06 November 2019

Jethro S. Johnson, Daniel J. Spakowicz, … George M. Weinstock

Introduction

Like all members of the order Rickettsiales (Alphaproteobacteria), Wolbachia are obligate intracellular symbionts. Main evolutionary Wolbachia lineages are termed ‘supergroups’¹ and differ markedly in their host distribution and biology. Supergroup A and B Wolbachia strains are found in many groups of terrestrial arthropods, making it one of the most common endosymbionts worldwide. An estimated 40% of all species are infected². In many arthropod hosts, Wolbachia enhance their spread by inducing reproductive alterations such as cytoplasmic incompatibility (CI), parthenogenesis, male-killing and feminization³. Although Wolbachia is generally transmitted vertically (from mother to offspring), regular horizontal transmissions between arthropod hosts as well as recurrent gains and losses are evident from a lack of co-cladogenesis of Wolbachia with its hosts^4,5.

In stark contrast, Wolbachia of supergroups C and D are found exclusively in some filarial nematodes and their long-lasting intimate association has led to various mutual dependencies⁶. Other distinct Wolbachia strain groups are known only from a small number of hosts: supergroup E is found in springtails (Hexapoda, Collembola), supergroup H in termites (Hexapoda, Isoptera) and further, so far unclassified strains were detected in Ctenocephalides felis (Hexapoda, Siphonaptera), Dipetalonema gracile (Nematoda, Filarioidea), Bryobia sp. (Arachnida, Acari) and Cordylochernes scorpioides (Arachnida, Pseudoscorpiones)^{7,8,9,10,11,12}. The nature of the symbiosis in all of these cases is only superficially understood. Interestingly, supergroup F Wolbachia may infect both arthropods and nematodes, and strains of this supergroup may act as a mutualist and can induce CI^13,14,15. Although found in many higher ranked arthropod taxa (for example, insect orders), supergroup F Wolbachia are generally rare¹¹.

Given the diverging lifestyles of Wolbachia supergroups, the question arises whether Wolbachia from arthropods and nematodes represent distinct, monophyletic evolutionary lineages and, if so, which phylogenetic position can be attributed to supergroup F that is not constrained to a single host group. An intriguing hypothesis suggests that this group is a basal branching lineage that might represent Wolbachia’s ancestral lifestyle¹⁶. While phylogenetic analyses of Wolbachia strains based on a single or a few genes usually enable correct supergroup assignments, relationships between supergroups remain poorly resolved and consequently, partially conflicting phylogenetic hypotheses were proposed^{11,17,18,19,20}. Furthermore, these data sets are especially prone to artefacts caused by recombination between Wolbachia strains²¹. Owing to the fact that hitherto, whole-genome data from supergroups other than A, B, C and D are lacking, phylogenomic analyses (albeit providing well-resolved trees) were restricted to a limited sampling of Wolbachia strains^16,22. In addition, a large evolutionary distance to its closest relatives has hampered an unequivocal rooting of the Wolbachia tree²³. However, a well-resolved rooted tree is needed to interpret the direction of major lifestyle transitions in Wolbachia’s evolutionary history.

In the present study, we aim to address the major challenges in reconstructing Wolbachia’s evolutionary history by enhancing taxon and gene sampling. To this end, we created new whole-genome-shotgun (WGS) data of so far unsampled supergroup E from the springtail Folsomia candida, supergroup H from the termite Zootermopsis nevadensis and supergroup F from the solitary bee Osmia caerulescens. A data set of 90 carefully selected single-copy orthologues from these data and from already published Wolbachia genomes (supergroups A, B, C and D) were used for phylogenomic analyses. We integrated various phylogenetic approaches as well as measures to identify and subsequently reduce systematic biases. We consequently present a robust and well-supported phylogenetic hypothesis for the evolution of Wolbachia strains. Our findings indicate that the ubiquitous Wolbachia supergroups A and B belong to a single, monophyletic lineage and consequently, the ability to adapt to a large range of taxonomically and physiologically diverse hosts has a single origin in that lineage. Furthermore, the Wolbachia strains that are obligate mutualists of nematodes are a paraphyletic assemblage, suggesting that host switches from arthopods to nematodes (or back) occurred at least twice in the evolutionary history of Wolbachia.

Results

Reconstructing Wolbachia’s evolutionary history

To reconstruct Wolbachia supergroup relationships via a phylogenomic pipeline, we utilized available genomic sequences of Wolbachia supergroups A, B, C and D as well as supergroup F Wolbachia sequences originating from a Strepsiptera genome project (Table 1). In addition, we performed WGS sequencing of four arthropod hosts carrying distinct Wolbachia strains so far not represented by genomic data (Table 1). BLAST searches in the corresponding assemblies allowed us to identify most of the 90 loci to be employed for phylogenetic analyses from wOc (87/90), wFol (82/90) and wCte (78/90). For wZoo and wMen, only 19 and 38 loci were recovered, respectively. Preliminary supergroup assignment with multilocus sequence typing (MLST) loci that were extracted from the assemblies showed that wOc and wMen clustered within arthropod and nematode supergroup F strains, and that wFol represents a distinct lineage of the Wolbachia radiation (Supplementary Fig. 1). Unexpectedly and in contrast to previously published results²⁴, wCte from the present study fell within supergroup B, suggesting that C. felis populations differ in their endosymbiont composition.

Table 1 Origin of sequence data used in this study.

Full size table

In the single-gene alignments used for subsequent analyses, no evidence for intragenic recombination or nucleotide substitution saturation was detected. The resulting masked supermatrices were composed of 21 taxa and 69,677 and 23,262 characters for nucleotides and amino acids, respectively. Ingroup relationships estimated from all data sets and analyses (Fig. 1; Supplementary Figs 2–5) resulted in the same, highly supported topology with the exception of the placement of supergroup H. All supergroups represented by >1 strain were recovered as monophyletic, with the ubiquitous arthropod-infecting Wolbachia A and B being reciprocally monophyletic. The nematode-infecting supergroups (C and D) form a monophyletic group with supergroup F, in which C and F are sister taxa. Only the placement of supergroup H is ambiguous. A sister group relationship with E was not recovered in all analyses (Supplementary Figs 2–5).

**Figure 1: Unrooted phylogram showing relationships between investigated *Wolbachia* strains.**

The analyses including outgroups Ehrlichia ssp. and Anaplama ssp. yielded identical topologies, again receiving almost maximal support for all nodes (Fig. 2; Supplementary Figs 6–17). Once more, the placement of supergroup H was not consistent across analyses and data sets. Notably, supergroup E was placed at the base of the Wolbachia radiation with maximal statistical support in all analyses (Fig. 2; Supplementary Figs 6–17). In none of our analytical approaches a conflicting rooting was proposed. Furthermore, both Shimodaira–Hasegawa (SH) and approximately unbiased (AU) tests favoured this rooting over any other (Table 2). Consequently, the strain that likely induces parthenogenesis in the collembolan F. candida²⁵ is the sister group to all other Wolbachia supergroups analysed.

**Figure 2: Rooted maximum likelihood phylogeny of 21 *Wolbachia* strains representing all sampled supergroups.**

Table 2 Results of Shimodaira–Hasegawa (SH) and approximately unbiased (AU) tests for alternative root positions of the Wolbachia phylogeny.

Full size table

To control for systematic biases in our phylogenetic reconstructions, we used various approaches, including visual checks for compositional biases via heat maps (Supplementary Fig. 18), data recoding, slow-fast analyses, single-gene analyses, partition jackknifing, exclusion of compositionally biased genes and usage of non-stationary, non-homogenous models (see Methods). None of these analyses demonstrated conflict in our original data set, but instead consistently converged to a single topology (Figs 1 and 2; Supplementary Figs 2–17).

Insights from shared gene analysis

To assess whether the newly proposed groupings are also reflected in shared genes among their genomes, we performed OrthoMCL-clustering using protein sequences of all Wolbachia supergroups. BLAST searches revealed a number of genes being present in all arthropod Wolbachia strains but missing in supergroups C and D (Supplementary Table 2). Most of these genes lack annotation, but two competence-related genes and one phage-related gene could be identified by reciprocal BLAST searches. In addition, we found that almost all of the 24 phage WO gene products we searched for are present in the assemblies of supergroups E and F (Supplementary Table 3).

Discussion

For phylogenomic analyses of Wolbachia strains, we used a set of 90 informative loci that were recently shown to resolve supergroup level relationships of Wolbachia¹⁶. We here present a phylogenetic hypothesis of seven Wolbachia supergroups that receives high statistical support throughout all analytical approaches and data sets. Our results suggest that the ability to opportunistically adapt to a large range of hosts has evolved only once in Wolbachia and that major host switches (from arthropods to nematodes or back) have occurred at least twice. This is the most comprehensive phylogenomic analysis of Wolbachia strains to date.

Only correct rooting of a phylogeny allows interpreting the directionality of evolutionary events and reconstruction of ancestral states²⁶. In some instances, however, distant outgroups may lead to biased reconstructions and long-branch artefacts²⁷. Recently, Bordenstein et al.²³ suggested that Wolbachia phylogeny might represent such a case, with closest relatives Anaplasma and Ehrlichia being separated by a comparatively long branch.

In the present study, we used multiple approaches to test for systematic biases such as rooting artefacts. The data set was analysed under different nucleotide and amino-acid substitution models (including the CAT model, which suppresses long-branch artefacts²⁸), both with and without outgroups. The impact of compositional biases was explored by visually inspecting compositional heterogeneities via heat maps (Supplementary Fig. 18), using a non-homogeneous, non-stationary nucleotide model of nucleotide sequence evolution and by excluding compositionally biased loci from the amino-acid supermatrix. Furthermore, we reduced the distance between Wolbachia and its outgroups by excluding fast-evolving third-codon positions, by excluding fast-evolving genes, by considering only transversions (in the RY-coded supermatrix) or by recoding amino-acid supermatrices. Confounding effects of potentially recombined genes were assessed with a partition jackknifing approach and with single-gene analyses. Four loci were identified that significantly reject the topology obtained from the complete matrix (SH test, P<0.01), which may be a result of recombination events. However, the topology obtained from a supermatrix without these genes did not differ from the original reconstruction, suggesting that recombination, if present, did not critically bias our results. Finally, SH and AU tests were performed to test for alternative rooting positions. Since none of these approaches suggested the presence of systematic errors or alternative, statistically supported topologies, we conclude that the here presented data and analyses enable the erection of a solid phylogenetic hypothesis for Wolbachia supergroups (consensus in Fig. 3). We further infer that the placement of supergroup E at the base of the Wolbachia tree can be considered as robust.

**Figure 3: Consensus supergroup-level *Wolbachia* phylogeny as determined in this study.**

Contrastingly, the placement of supergroup H proved to be not fully resolvable. Depending on the analysis employed, supergroup H was either the sister group of E, sister to all strains except E, sister to (A, B) or sister to (C, F, D). Furthermore, in PhyloBayes analysis the chains did not converge even after >20,000 generations, resulting in an unresolved position of wZoo. Without supergroup H, however, convergence was reached and all splits were highly supported (Supplementary Fig. 8). This inconsistency is very likely due to the limited amount of Wolbachia sequence data recovered from the assembly of wZoo—only 19 of 90 loci could be included in phylogenetic analyses. Since all other splits of the Wolbachia tree received maximal support in almost all approaches used, an increase in loci for wZoo will likely enable a stable placing of this supergroup as well.

However, supergoup H was most frequently placed at the base of the tree in our analyses (Supplementary Figs 2–17), either as a sister goup to E or as a sister group to a clade uniting all strains except E. Furthermore, in previous investigations supergroups E and H were consistently recovered as sister groups^{8,11,18,23,29,30} and no conflicting grouping was proposed so far. Consequently, a placement of supergroup H as a sister group to supergroup E has received most support so far and seems most likely, although it could not unequivocally be demonstrated with our analyses (Fig. 2).

Several important implications can be deduced from the here presented results. First, the last common ancestor of Wolbachia was likely an endosymbiont of arthropods with a limited host range. Although most obvious in supergroups C and D (which infect only filarial nematodes), a certain degree of host specificity can be observed in all strains except for supergroups A and B (Fig. 3): supergroups E and H are found only in springtails^31,32 and termites²⁹, respectively, and some supergroup F Wolbachia are also restricted to single host taxa^19,33. Thus, the ubiquitous arthropod Wolbachia that are found in 40% of terrestrial arthropods² belong to a single, derived phylogenetic lineage (supergroups A+B). The lifestyle of the last common ancestor of all Wolbachia strains cannot be reconstructed with confidence, as the lifestyles of the two basal branching lineages (supergroups E and H) are not fully understood. Furthermore, Wolbachia lifestyles are not always unambiguous to interpret³⁴ and the phylogenetic placement of further, potentially distinct Wolbachia lineages is still unclear²³. However, it has been demonstrated that Wolbachia induces parthenogenesis in F. candida and that in turn F. candida depends on Wolbachia to produce viable offspring^25,35. This argues for some degree of evolved dependency, which is scarcely distributed among arthropod Wolbachia, where CI seems to be the prevailing induced phenotype^3,34. Consequently, supergroups A and B may not only be phylogenetically derived, but also in terms of physiology and thus in impact on their hosts. Comparative genomic analyses especially of basal Wolbachia supergroups could corroborate this hypothesis.

Second, our results suggest a sister group relationship between supergroups C and F. This grouping was recovered in a recent analysis using sequences of 52 ribosomal proteins of six Wolbachia strains³⁶, as well as in all of our analyses. Since both nematodes and arthropods may carry supergroup F Wolbachia, at least one host switch from nematodes to arthropods (or vice versa) must have occurred within that group (Fig. 3). Some supergroup F Wolbachia act as mutualists in arthropods¹³ and in the filarial nematode Mansonella, this strain is essential for the survival of its host, which is similar to what can be observed for supergroups C and D¹⁴. Moreover, remnants of Wolbachia genes were found in naturally Wolbachia-free filarial nematodes, indicating multiple independent losses of the infection³⁷. Therefore, when considering phylogenetic evidence, mutualism may be common in supergroup F and more cases of so far undetected obligate mutualism can be expected in this supergroup. To assess whether supergroup F has emerged only recently in nematodes and thus originated from arthropod hosts¹⁸, a broader taxon sampling of supergroup F strains is needed.

Third, gene content analyses suggest that a number of genes were lost in the genomes of supergroups C and D Wolbachia (see Supplementary Table 2). Since the streamlined genomes of these nematode-infecting Wolbachia are a consequence of long-lasting mutualistic relationships with their hosts^38,39, these losses have most likely occurred independently in both lineages. Interestingly, two of the annotated genes present in all arthropod Wolbachia, but missing in supergroups C and D, are competence-related, that is, involved in uptake of external DNA (Supplementary Table 2). Exchange of genetic elements is common in Wolbachia and other endosymbionts⁴⁰, but may be reduced like any other nonessential functions in stable obligate symbioses⁴¹. Similarly, phage WO genes are absent in supergroups C and D, but might have been present at some time in these groups⁴². Our screen revealed that phage elements are present in all other Wolbachia supergroups (see Supplementary Table 3), which is further evidence for convergent secondary losses of phage genes in supergroups C and D.

This first comprehensive, rooted phylogeny of the genus Wolbachia shows that supergroups A and B are not only peculiar in the huge diversity of host interactions, their ability to regularly adapt to new hosts and in their pandemic spread, but also that they constitute a phylogenetically derived group within the radiation of Wolbachia strains. Most likely, the bacteria from which Wolbachia originated were less flexible in terms of their host choice. This lifestyle is to some extent reflected in the basal Wolbachia lineages E and H. Alternatively, these basal lineages may be the remnants of a past Wolbachia pandemic that has subsequently been replaced by supergroups A and B, or these lineages have specialized on a single host secondarily. Our results will thus be the basis for further exploring the evolutionary history of Wolbachia.

Methods

Sampling and sequencing

The data sets used in this study were compiled from published Wolbachia genomes (supergroups A, B, C and D), Anaplasma and Ehrlichia outgroups and Wolbachia supergroup F sequence data originating from the Mengenilla moldrzyki sequencing project⁴³ (Table 1). Furthermore, we performed WGS sequencing of supergroups for which comparable data were so far unpublished or unavailable: supergroup F Wolbachia from O. caerulescens (collected in Fürstenberg/Havel, Germany), supergroup H from Z. nevadensis (collected near Bamfield, BC, Canada), supergroup E from F. candida (kindly provided by David Russell and Ulrich Burkhardt, Görlitz, Germany) and Wolbachia from C. felis (kindly provided by Dieter Striese and Ronny Wolf, Görlitz, Germany and Leipzig, Germany, respectively). DNA was extracted from a single individual of each O. caerulescens (including its Wolbachia strain wOc) and Z. nevadensis (carrying wZoo), and from 10 pooled individuals of F. candida (with wFol) and C. felis (with wCte) by proteinase K digestion and subsequent chloroform extraction. Double-index sequencing libraries with average insert sizes of around 300 bp were prepared as previously described^44,45. The libraries were sequenced as a 125-bp paired-end run on an Illumina Hi-Seq 2000.

Raw data processing and assembly

Base calling was performed with freeIbis⁴⁶, adapter and primer sequences were clipped and false-paired reads were discarded. We filtered the data by removing all reads that included >5 bases with a quality score below 15. Raw data were submitted to the NCBI sequence read archive under accession numbers SRR1222146 (wZoo), SRR1222150 (wCte), SRR1222159 (wFol) and SRR1221705 (wOc). De novo assemblies were conducted with CLC Genomics Workbench 5.1 (CLC bio, Århus, Denmark) using default settings and with IDBA-UD 1.1.0 (ref. 47), using an initial k-mer size of 21, an iteration size of 10 and a maximum k-mer size of 81. For all subsequent analyses, the assemblies with highest N50 values were selected: for wOC, we used the CLC assembly; for wCte, wFol and wZoo, IDBA-UD assemblies were used. Assembly statistics are listed in Supplementary Table 1.

Alignment and phylogenetic analyses

In a recent phylogenomic analysis of Wolbachia supergroups A, B, C and D¹⁶, 90 orthologous loci were identified that meet the following criteria: (1) presence of a single copy in four investigated Wolbachia supergroups and outgroups (Anaplasma ssp. and Ehrlichia ssp.), (2) absence of recombination and (3) no evidence for nucleotide substitution saturation. Since these loci were shown to provide a well-resolved supergroup-level Wolbachia phylogeny¹⁶, we used the same set of orthologues in our analyses. We identified these loci in all assemblies using BLAST+ version 2.2.8 (ref. 48). Single loci were translated with TranslatorX version 1.1 (ref. 49), aligned with MAFFT version 7.037b⁵⁰ using the L-INS-i strategy and then back-translated. Thus we obtained codon-based nucleotide alignments as well as amino-acid alignments. To remove ambiguously aligned positions, we performed alignment masking with Gblocks version 0.91b⁵¹, allowing small block sizes and gaps (options b4=2 and b5=all). Amino-acid and nucleotide supermatices were constructed with FASconCAT⁵²; best-fitting evolutionary models for these were determined by their BIC (Bayesian information criterion) values with ProtTest version 3.4 (ref. 53) and jModelTest version 2.1.3 (ref. 54), respectively. We tested for recombination within our data sets using the Pairwise homoplasy index as implemented in PhiPack⁵⁵, with sliding-window sizes of 200, 100, 50 and 25 and 1,000 permutations each. Furthermore, test of nucleotide substitution saturation were performed using Xia’s⁵⁶ method, as implemented in DAMBE version 5.

Phylogenetic reconstructions of Wolbachia supergroup relationships were conducted with maximum likelihood (ML) methods and Bayesian inference (BI). For the nucleotide supermatrix, a ML tree was inferred with RAxML version 8.0.5 (ref. 57) using the model GTR+Γ+I. Branch support was estimated with 1,000 bootstrap replicates. BI was performed with MrBayes version 3.1.2 (ref. 58), using GTR+Γ+I. Two times four chains were run for 1 million generations, every 500th generation was sampled. After a deviation of split frequencies of ≤5% was determined, tree information was summarized excluding 250,000 generations as burnin. Posterior probabilities were inferred from clade frequencies of the majority rule consensus tree constructed from the remaining trees. Both BI and ML analyses were separately conducted with identical settings for nucleotide matrices without outgroups.

ML analysis of the amino-acid supermatrix was performed with RAxML using the model FLU+Γ+I and calculating bootstrap support from 1,000 replicates. In addition, for BI we employed PhyloBayes MPI version 1.5a (ref. 59) with the CAT-GTR model⁶⁰ that accounts for substitutional heterogeneities among amino-acid data sets. For all PhyloBayes analyses, two chains with at least 10,000 cycles were run (10,000–24,377; 14,666 on average). All trace parameters were plotted to test whether stationarity had been reached and to diagnose suitable burnin sizes. The chains were stopped after both trees and continuous parameters were diagnosed to have converged with the built-in methods of PhyloBayes (bpcomp & tracecomp). Posterior probabilities were calculated from the clade frequencies of the posterior sample of trees. ML and BI as described above were also conducted for an amino-acid data set without outgroups.

For provisional supergroup assignment, we used BLAST+ to search for Wolbachia MLST loci²⁴, aligned these with available MLST profiles from Wolbachia PubMLST database (http://pubmlst.org/wolbachia) that include a supergroup annotation and performed a ML tree search with RAxML.

Assessment of root position and tests for systematic errors

To assess the stability of the root position, we calculated 11 separate ML trees with RAxML while enforcing different topologies, each corresponding to a distinct rooting of the Wolbachia ingroup. We then compared the resulting trees with the best tree of the unconstrained ML analysis via a SH-test⁶¹, as implemented in RAxML. In addition, we calculated per-site log likelihoods for all 12 trees with RAxML and compared the topologies with an AU test using CONSEL version 1.2.0 (ref. 62). Both tests were performed with nucleotide and amino-acid supermatrices.

Since rooting artefacts may origin from distantly related outgroups²³, we took recoding and exclusion approaches to reduce the overall evolutionary distances within the data sets and to explore potentially alternative rooting positions. This approach was shown to be suitable to investigate systematic biases in similar data sets⁶³. For the nucleotide supermatrix, we performed ML analysis for a RY-coded supermatrix and for a data set without third-codon positions as described above. The amino-acid supermatrix was recoded with the dayhoff6 and dayhoff4 schemes in PhyloBayes. Then, analyses with PhyloBayes were run as described above. Next, we determined pairwise sequence identities (as proxy for evolutionary changes through time) for all loci with the function ‘dist.alignment’ of the R package SequinR⁶⁴. PhyloBayes was then used as described above to infer Wolbachia supergroup phylogeny based on amino-acid matrices without the 20 and 40 fastest-evolving genes.

To test for sequence composition biases, we first used BaCoCa Version 1.104r⁶⁵ to create descriptive statistics for our amino-acid supermatrix. Taxon to gene-specific heat maps were generated for the proportion of hydrophilic, polar, positively, negatively and neutrally charged amino-acid side chains. These proportions were calculated for all loci and taxa and subject to hierarchical clustering. The resulting heat maps were inspected for conspicuous clusters, especially of Wolbachia strains with outgroups. Heterogeneity in base composition was addressed by employing nhPhyML⁶⁶, which uses a non-homogeneous non-stationary model that accounts for variations in the base composition. Since Wolbachia supergroups were homogeneous in base composition, but the outgroups Anaplasma and Ehrlichia showed pronounced differences (Supplementary Fig. 14), we also performed ML analyses with the nucleotide supermatrix using only Anaplasma and only Ehrlichia outgroups.

Because ingroup taxa did not seem compositionally biased, we next identified the loci that significantly deviated from compositional homogeneity and thus potentially skewed our results. To this end, we ran a single chain for 5,000 points with PhyloBayes for each of the 90 loci. Then, we used the implemented test statistics of PhyloBayes (option -comp) to calculate z-scores and P values for compositional deviation. We then excluded all loci with a z-score>2 and a P value<0.05 (33 loci altogether) and reran the PhyloBayes analysis as described above.

To further assess what influence single loci have on the topology, we conducted a partition jackknifing approach⁶⁷. Out of 90 loci in total, we randomly picked 30 loci or 60 loci, with 100 permutations each. Then, we analysed each single jackknifed matrix with RAxML. Finally, we counted the number of times each node appeared in the jackknifed analyses as a proxy for the support of that node. Finally, we also analysed single loci with RAxML. We used only the 72 loci that had at least a single representative for all supergroups except supergroup H and removed the taxa for which not all of these 72 loci were available. All single-gene topologies were then summarized to a ‘primordial consensus’ tree using the method by Steel et al.⁶⁸, which accounts for events of potential lateral gene transfers.

Gene content analysis

To identify genes that might have been lost or gained during Wolbachia’s evolutionary history, we first downloaded the coding sequences of representative Wolbachia strains of supergroups A (wMel, wHa), B (wPip, wNo), C (wOo) and D (wBm) from NCBI. Next, we performed orthologue clustering with OrthoMCL version 2.0 (ref. 69) using default settings. We kept the clusters that contained only sequences from supergroups A and B and used them to run BLAST+ searches against the assemblies of wLs (supergroup C) and wDim (supergroup D). We discarded the clusters that returned a significant hit (cutoff at e-value 10E-4) and used the remaining clusters to identify potential orthologues in wFol, wZoo, wOc and wMen with BLAST+. Finally, we ran online BLAST searches on NCBI database to check whether queries and hits were coherently annotated. Furthermore, to gain insights into the evolutionary history of phage acquisition and loss across Wolbachia strains, we searched for gene products of the bacteriophage WO⁷⁰ in the assemblies wFol, wZoo, wOc and wMen.

Additional information

How to cite this article: Gerth, M. et al. Phylogenomic analyses uncover origin and spread of the Wolbachia pandemic. Nat. Commun. 5:5117 doi: 10.1038/ncomms6117 (2014).

Accession codes: Whole-genome-shotgun data have been deposited in NCBI sequence read archive under BioProject number PRJNA244005.

Accession codes

Accessions

BioProject

PRJNA244005

References

Zhou, W. G., Rousset, F. & O’Neill, S. Phylogeny and PCR-based classification of Wolbachia strains using wsp gene sequences. Proc. R. Soc. B 265, 509–515 (1998).
Article CAS PubMed PubMed Central Google Scholar
Zug, R. & Hammerstein, P. Still a host of hosts for Wolbachia: analysis of recent data suggests that 40% of terrestrial arthropod species are infected. PLoS ONE 7, e38544 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Werren, J. H., Baldo, L. & Clark, M. E. Wolbachia: master manipulators of invertebrate biology. Nat. Rev. Microbiol. 6, 741–751 (2008).
Article CAS PubMed Google Scholar
Schilthuizen, M. & Stouthamer, R. Horizontal transmission of parthenogenesis-inducing microbes in Trichogramma wasps. Proc. R. Soc. B 264, 361–366 (1997).
Article ADS CAS PubMed PubMed Central Google Scholar
Gerth, M., Röthe, J. & Bleidorn, C. Tracing horizontal Wolbachia movements among bees (Anthophila): a combined approach using MLST data and host phylogeny. Mol. Ecol. 22, 6149–6162 (2013).
Article PubMed Google Scholar
Taylor, M. J., Voronin, D., Johnston, K. L. & Ford, L. Wolbachia filarial interactions. Cell. Microbiol. 15, 520–526 (2013).
Article CAS PubMed Google Scholar
Vandekerckhove, T. T. M. et al. Phylogenetic analysis of the 16S rDNA of the cytoplasmic bacterium Wolbachia from the novel host Folsomia candida (Hexapoda, Collembola) and its implications for wolbachial taxonomy. FEMS Microbiol. Lett. 180, 279–286 (1999).
Article CAS PubMed Google Scholar
Bordenstein, S. R. & Rosengaus, R. B. Discovery of a novel Wolbachia supergroup in Isoptera. Curr. Microbiol. 51, 393–398 (2005).
Article CAS PubMed Google Scholar
Casiraghi, M. et al. Mapping the presence of Wolbachia pipientis on the phylogeny of filarial nematodes: evidence for symbiont loss during evolution. Int. J. Parasitol. 34, 191–203 (2004).
Article PubMed Google Scholar
Gorham, C. H., Fang, Q. Q. & Durden, L. A. Wolbachia endosymbionts in fleas (Siphonaptera). J. Parasitol. 89, 283–289 (2003).
Article CAS PubMed Google Scholar
Ros, V. I. D., Fleming, V. M., Feil, E. J. & Breeuwer, J. A. J. How diverse is the genus Wolbachia? multiple-gene sequencing reveals a putatively new Wolbachia supergroup recovered from spider mites (Acari: Tetranychidae). Appl. Environ. Microbiol. 75, 1036–1043 (2009).
Article CAS PubMed Google Scholar
Zeh, D. W., Zeh, J. a. & Bonilla, M. M. Wolbachia, sex ratio bias and apparent male killing in the harlequin beetle riding pseudoscorpion. Heredity 95, 41–49 (2005).
Article CAS PubMed Google Scholar
Hosokawa, T., Koga, R., Kikuchi, Y., Meng, X.-Y. & Fukatsu, T. Wolbachia as a bacteriocyte-associated nutritional mutualist. Proc. Natl Acad. Sci. USA 107, 769–774 (2010).
Article ADS CAS PubMed Google Scholar
Coulibaly, Y. I. et al. A randomized trial of doxycycline for Mansonella perstans infection. N. Engl. J. Med. 361, 1448–1458 (2009).
Article CAS PubMed PubMed Central Google Scholar
Zabal-Aguirre, M. et al. Wolbachia effects in natural populations of Chorthippus parallelus from the pyrenean hybrid zone. J. Evol. Biol. 27, 1136–1148 (2014).
Article CAS PubMed Google Scholar
Comandatore, F. et al. Phylogenomics and analysis of shared genes suggest a single transition to mutualism in Wolbachia of nematodes. Genome Biol. Evol. 5, 1668–1674 (2013).
Article PubMed PubMed Central Google Scholar
Lo, N. et al. Taxonomic status of the intracellular bacterium Wolbachia pipientis. Int. J. Syst. Evol. Microbiol. 57, 654–657 (2007).
Article CAS PubMed Google Scholar
Lefoulon, E. et al. A new type F Wolbachia from Splendidofilariinae (Onchocercidae) supports the recent emergence of this supergroup. Int. J. Parasitol. 42, 1025–1036 (2012).
Article CAS PubMed Google Scholar
Ferri, E. et al. New insights into the evolution of Wolbachia infections in filarial nematodes inferred from a large range of screened species. PLoS ONE 6, e20843 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Casiraghi, M. et al. Phylogeny of Wolbachia pipientis based on gltA, groEL and ftsZ gene sequences: clustering of arthropod and nematode symbionts in the F supergroup, and evidence for further diversity in the Wolbachia tree. Microbiology 151, 4015–4022 (2005).
Article CAS PubMed Google Scholar
Baldo, L., Bordenstein, S. R., Wernegreen, J. J. & Werren, J. H. Widespread recombination throughout Wolbachia genomes. Mol. Biol. Evol. 23, 437–449 (2006).
Article CAS PubMed Google Scholar
Fenn, K. et al. Phylogenetic relationships of the Wolbachia of nematodes and arthropods. PLoS Pathog. 2, e94 (2006).
Article PubMed PubMed Central Google Scholar
Bordenstein, S. R. et al. Parasitism and mutualism in Wolbachia: what the phylogenomic trees can and cannot say. Mol. Biol. Evol. 26, 231–241 (2009).
Article CAS PubMed Google Scholar
Baldo, L. et al. Multilocus sequence typing system for the endosymbiont Wolbachia pipientis. Appl. Environ. Microbiol. 72, 7098–7110 (2006).
Article CAS PubMed PubMed Central Google Scholar
Pike, N. & Kingcombe, R. Antibiotic treatment leads to the elimination of Wolbachia endosymbionts and sterility in the diplodiploid collembolan Folsomia candida. BMC Biol. 7, 54 (2009).
Article PubMed PubMed Central Google Scholar
Wheeler, W. C. Nucleic acid sequence phylogeny and random outgroups. Cladistics 6, 363–367 (1990).
Article PubMed Google Scholar
Huelsenbeck, J. P., Bollback, J. P. & Levine, A. M. Inferring the root of a phylogenetic tree. Syst. Biol. 51, 32–43 (2002).
Article PubMed Google Scholar
Lartillot, N., Brinkmann, H. & Philippe, H. Suppression of long-branch attraction artefacts in the animal phylogeny using a site-heterogeneous model. BMC Evol. Biol. 7, (Suppl 1): S4 (2007).
Article PubMed PubMed Central Google Scholar
Lo, N. & Evans, T. A. Phylogenetic diversity of the intracellular symbiont Wolbachia in termites. Mol. Phylogenet. Evol. 44, 461–466 (2007).
Article CAS PubMed Google Scholar
Vaishampayan, P. A. et al. Molecular evidence and phylogenetic affiliations of Wolbachia in cockroaches. Mol. Phylogenet. Evol. 44, 1346–1351 (2007).
Article CAS PubMed Google Scholar
Czarnetzki, A. B. & Tebbe, C. C. Detection and phylogenetic analysis of Wolbachia in Collembola. Environ. Microbiol. 6, 35–44 (2004).
Article CAS PubMed Google Scholar
Tanganelli, V., Fanciulli, P. P., Nardi, F. & Frati, F. Molecular phylogenetic analysis of a novel strain from Neelipleona enriches Wolbachia diversity in soil biota. Pedobiologia 57, 15–20 (2013).
Article Google Scholar
Baldo, L., Prendini, L., Corthals, A. & Werren, J. H. Wolbachia are present in Southern African scorpions and cluster with supergroup F. Curr. Microbiol. 55, 367–373 (2007).
Article CAS PubMed Google Scholar
Zug, R. & Hammerstein, P. Bad guys turned nice? A critical assessment of Wolbachia mutualisms in arthropod hosts. Biol. Rev. doi:10.1111/brv.12098 (2014).
Timmermans, M. J. T. N. & Ellers, J. Wolbachia endosymbiont is essential for egg hatching in a parthenogenetic arthropod. Evol. Ecol. 23, 931–942 (2009).
Article Google Scholar
Nikoh, N. et al. Evolutionary origin of insect-Wolbachia nutritional mutualism. Proc. Natl Acad. Sci. USA 111, 10257–10262 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
McNulty, S. N. et al. Endosymbiont DNA in endobacteria-free filarial nematodes indicates ancient horizontal genetic transfer. PLoS ONE 5, e11029 (2010).
Article ADS PubMed PubMed Central Google Scholar
Darby, A. C. et al. Analysis of gene expression from the Wolbachia genome of a filarial nematode supports both metabolic and defensive roles within the symbiosis. Genome Res. 22, 2467–2477 (2012).
Article CAS PubMed PubMed Central Google Scholar
Godel, C. et al. The genome of the heartworm, Dirofilaria immitis, reveals drug and vaccine targets. FASEB J. 26, 4650–4661 (2012).
Article CAS PubMed PubMed Central Google Scholar
Duron, O. Lateral transfers of insertion sequences between Wolbachia, Cardinium and Rickettsia bacterial endosymbionts. Heredity 111, 330–337 (2013).
Article CAS PubMed PubMed Central Google Scholar
Moran, N. A., McCutcheon, J. P. & Nakabachi, A. Genomics and evolution of heritable bacterial symbionts. Annu. Rev. Genet. 42, 165–190 (2008).
Article CAS PubMed Google Scholar
Kent, B. N. & Bordenstein, S. R. Phage WO of Wolbachia: lambda of the endosymbiont world. Trends Microbiol. 18, 173–181 (2010).
Article CAS PubMed PubMed Central Google Scholar
Niehuis, O. et al. Genomic and morphological evidence converge to resolve the enigma of Strepsiptera. Curr. Biol. 22, 1309–1313 (2012).
Article CAS PubMed Google Scholar
Kircher, M., Sawyer, S. & Meyer, M. Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform. Nucleic Acids Res. 40, e3 (2012).
Article CAS PubMed Google Scholar
Meyer, M. & Kircher, M. Illumina sequencing library preparation for highly multiplexed target capture and sequencing. Cold Spring Harb. Protoc. doi:10.1101/pdb.prot5448 (2010).
Renaud, G., Kircher, M., Stenzel, U. & Kelso, J. freeIbis: an efficient basecaller with calibrated quality scores for Illumina sequencers. Bioinformatics 29, 1208–1209 (2013).
Article CAS PubMed PubMed Central Google Scholar
Peng, Y., Leung, H. C. M., Yiu, S. M. & Chin, F. Y. L. IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth. Bioinformatics 28, 1420–1428 (2012).
Article CAS PubMed Google Scholar
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinformatics 10, 421 (2009).
Article PubMed PubMed Central Google Scholar
Abascal, F., Zardoya, R. & Telford, M. J. TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations. Nucleic Acids Res. 38, W7–13 (2010).
Article CAS PubMed PubMed Central Google Scholar
Katoh, K., Misawa, K., Kuma, K. & Miyata, T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 30, 3059–3066 (2002).
Article CAS PubMed PubMed Central Google Scholar
Castresana, J. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol. Biol. Evol. 17, 540–552 (2000).
Article CAS PubMed Google Scholar
Kuck, P. & Meusemann, K. FASconCAT: convenient handling of data matrices. Mol. Phylogenet. Evol. 56, 1115–1118 (2010).
Article PubMed Google Scholar
Darriba, D., Taboada, G. L., Doallo, R. & Posada, D. ProtTest 3: fast selection of best-fit models of protein evolution. Bioinformatics 27, 1164–1165 (2011).
Article CAS PubMed Google Scholar
Darriba, D., Taboada, G. L., Doallo, R. & Posada, D. jModelTest 2: more models, new heuristics and parallel computing. Nat. Methods 9, 772 (2012).
Article CAS PubMed PubMed Central Google Scholar
Bruen, T. C., Philippe, H. & Bryant, D. A simple and robust statistical test for detecting the presence of recombination. Genetics 172, 2665–2681 (2006).
Article CAS PubMed PubMed Central Google Scholar
Xia, X. DAMBE5: a comprehensive software package for data analysis in molecular biology and evolution. Mol. Biol. Evol. 30, 1720–1728 (2013).
Article CAS PubMed PubMed Central Google Scholar
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Article CAS PubMed PubMed Central Google Scholar
Ronquist, F. & Huelsenbeck, J. P. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19, 1572–1574 (2003).
Article CAS PubMed Google Scholar
Lartillot, N., Rodrigue, N., Stubbs, D. & Richer, J. PhyloBayes MPI: Phylogenetic reconstruction with infinite mixtures of profiles in a parallel environment. Syst. Biol. 62, 611–615 (2013).
Article CAS PubMed Google Scholar
Lartillot, N. & Philippe, H. A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process. Mol. Biol. Evol. 21, 1095–1109 (2004).
Article CAS PubMed Google Scholar
Shimodaira, H. & Hasegawa, M. Multiple comparisons of log-likelihoods with applications to phylogenetic inference. Mol. Biol. Evol. 16, 1114–1116 (1999).
Article CAS Google Scholar
Shimodaira, H. & Hasegawa, M. CONSEL: for assessing the confidence of phylogenetic tree selection. Bioinformatics 17, 1246–1247 (2001).
Article CAS PubMed Google Scholar
Husnik, F., Chrudimsky, T. & Hypša, V. Multiple origins of endosymbiosis within the Enterobacteriaceae (γ-Proteobacteria): convergence of complex phylogenetic approaches. BMC Biol. 9, 87 (2011).
Article PubMed PubMed Central Google Scholar
Charif, D. & Lobry, J. inStruct. Approaches to Seq. Evol eds Bastolla U., Porto M., Roman H. E., Vendruscolo M. 207–232Springer (2007).
Kück, P. & Struck, T. H. BaCoCa-a heuristic software tool for the parallel assessment of sequence biases in hundreds of gene and taxon partitions. Mol. Phylogenet. Evol. 70, 94–98 (2014).
Article PubMed Google Scholar
Boussau, B. & Gouy, M. Efficient likelihood computations with nonreversible models of evolution. Syst. Biol. 55, 756–768 (2006).
Article PubMed Google Scholar
Bleidorn, C. et al. On the phylogenetic position of Myzostomida: can 77 genes get it wrong? BMC Evol. Biol. 9, 150 (2009).
Article PubMed PubMed Central Google Scholar
Steel, M., Linz, S., Huson, D. H. & Sanderson, M. J. Identifying a species tree subject to random lateral gene transfer. J. Theor. Biol. 322, 81–93 (2013).
Article MathSciNet CAS PubMed MATH Google Scholar
Chen, F., Mackey, A. J., Stoeckert, C. J. & Roos, D. S. OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups. Nucleic Acids Res. 34, D363–D368 (2006).
Article CAS PubMed Google Scholar
Fujii, Y., Kubo, T., Ishikawa, H. & Sasaki, T. Isolation and characterization of the bacteriophage WO from Wolbachia, an arthropod endosymbiont. Biochem. Biophys. Res. Commun. 317, 1183–1188 (2004).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We are indebted to David Russel, Ulrich Burkhardt, Dieter Striese (Senckenberg Museum of Natural History Görlitz, Germany) and to Ronny Wolf (University of Leipzig, Germany) for providing specimens. We thank Andreas Rost (University of Leipzig, Germany) for help and continuous support with setting up computations on the computer cluster of the University of Leipzig. We thank Franziska Anni Franke (University of Leipzig, Germany) for fruitful discussions on the manuscript. This work was funded by the German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig and by the University of Leipzig.

Author information

Authors and Affiliations

Molecular Evolution and Systematics of Animals, Institute of Biology, University of Leipzig, Talstrasse 33, Leipzig, 04103, Germany
Michael Gerth, Anne Weigert & Christoph Bleidorn
Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, Leipzig, 04103, Germany
Marie-Theres Gansauge
German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Deutscher Platz 5d, Leipzig, 04103, Germany
Christoph Bleidorn

Authors

Michael Gerth
View author publications
You can also search for this author in PubMed Google Scholar
Marie-Theres Gansauge
View author publications
You can also search for this author in PubMed Google Scholar
Anne Weigert
View author publications
You can also search for this author in PubMed Google Scholar
Christoph Bleidorn
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.B. and M.G. designed the study. M.G., M.-T.G. and A.W. performed in vitro experiments. M.G. analysed the data and wrote the manuscript with help from all authors.

Corresponding author

Correspondence to Michael Gerth.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures 1-18, Supplementary Tables 1-3 and Supplementary Reference (PDF 1621 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gerth, M., Gansauge, MT., Weigert, A. et al. Phylogenomic analyses uncover origin and spread of the Wolbachia pandemic. Nat Commun 5, 5117 (2014). https://doi.org/10.1038/ncomms6117

Download citation

Received: 07 April 2014
Accepted: 01 September 2014
Published: 06 October 2014
DOI: https://doi.org/10.1038/ncomms6117

This article is cited by

One’s trash is someone else’s treasure: sequence read archives from Lepidoptera genomes provide material for genome reconstruction of their endosymbionts
- Victoria G. Twort
- Daniel Blande
- Anne Duplouy
BMC Microbiology (2022)
Wolbachia infection dynamics in a natural population of the pear psyllid Cacopsylla pyri (Hemiptera: Psylloidea) across its seasonal generations
- Liliya Štarhová Serbina
- Domagoj Gajski
- Jessica Dittmer
Scientific Reports (2022)
Genetic and endosymbiotic diversity of Greek populations of Philaenus spumarius, Philaenus signatus and Neophilaenus campestris, vectors of Xylella fastidiosa
- Despoina Ev. Kapantaidaki
- Spyridon Antonatos
- Panagiotis Milonas
Scientific Reports (2021)
Inhibition of a nutritional endosymbiont by glyphosate abolishes mutualistic benefit on cuticle synthesis in Oryzaephilus surinamensis
- Julian Simon Thilo Kiefer
- Suvdanselengee Batsukh
- Tobias Engl
Communications Biology (2021)
Culex quinquefasciatus carrying Wolbachia is less susceptible to entomopathogenic bacteria
- Leonardo M. Díaz-Nieto
- M. Florencia Gil
- Corina M. Berón
Scientific Reports (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.