Complete de novo assembly of Wolbachia endosymbiont of Diaphorina citri Kuwayama (Hemiptera: Liviidae) using long-read genome sequencing

Neupane, Surendra; Bonilla, Sylvia I.; Manalo, Andrew M.; Pelz-Stelinski, Kirsten S.

doi:10.1038/s41598-021-03184-0

Download PDF

Article
Open access
Published: 07 January 2022

Complete de novo assembly of Wolbachia endosymbiont of Diaphorina citri Kuwayama (Hemiptera: Liviidae) using long-read genome sequencing

Surendra Neupane¹^na1,
Sylvia I. Bonilla¹^na1,
Andrew M. Manalo¹ &
…
Kirsten S. Pelz-Stelinski¹

Scientific Reports volume 12, Article number: 125 (2022) Cite this article

3778 Accesses
10 Citations
5 Altmetric
Metrics details

Subjects

Abstract

Wolbachia, a gram-negative \(\mathrm{\alpha }\)-proteobacterium, is an endosymbiont found in some arthropods and nematodes. Diaphorina citri Kuwayama, the vector of ‘Candidatus Liberibacter asiaticus’ (CLas), are naturally infected with a strain of Wolbachia (wDi), which has been shown to colocalize with the bacteria pathogens CLas, the pathogen associated with huanglongbing (HLB) disease of citrus. The relationship between wDi and CLas is poorly understood in part because the complete genome of wDi has not been available. Using high-quality long-read PacBio circular consensus sequences, we present the largest complete circular wDi genome among supergroup-B members. The assembled circular chromosome is 1.52 megabases with 95.7% genome completeness with contamination of 1.45%, as assessed by checkM. We identified Insertion Sequences (ISs) and prophage genes scattered throughout the genomes. The proteins were annotated using Pfam, eggNOG, and COG that assigned unique domains and functions. The wDi genome was compared with previously sequenced Wolbachia genomes using pangenome and phylogenetic analyses. The availability of a complete circular chromosome of wDi will facilitate understanding of its role within the insect vector, which may assist in developing tools for disease management. This information also provides a baseline for understanding phylogenetic relationships among Wolbachia of other insect vectors.

Hybrid de novo whole-genome assembly and annotation of the model tapeworm Hymenolepis diminuta

Article Open access 03 December 2019

Chromosome-level genome assembly of Chouioia cunea Yang, the parasitic wasp of the fall webworm

Article Open access 26 July 2023

Complete genome and bimodal genomic structure of the amoebal symbiont Neochlamydia strain S13 revealed by ultra-long reads obtained from MinION

Article 14 November 2019

Introduction

The Asian citrus psyllid, Diaphorina citri Kuwayama, (Hemiptera: Liviidae), is a vector of ‘Candidatus Liberibacter asiaticus’ (CLas), a gram-negative \(\mathrm{\alpha }\)-proteobacteria that putatively causes citrus greening disease, also known as huanglongbing (HLB)¹. D. citri also harbor three endosymbionts: ‘Candidatus Carsonella ruddii’, ‘Candidatus Profftella armature’, and ‘Wolbachia’ (wDi)². Infected D. citri transmit CLas while feeding on citrus trees. Infection with CLas reduces fruit quality and yield, and eventually kills the citrus tree¹. CLas also interacts with host D. citri and its endosymbionts, including Wolbachia, a gram-negative \(\mathrm{\alpha }\)-proteobacteria^3,4,5. These studies reported that the abundance of wDi is related to the abundance of CLas in D. citri and regulates the phage lytic cycle genes in CLas as D. citri infected with “Ca. Liberibacter asiaticus” had a higher Wolbachia titer than the non-infected ones^5,6. The 56-amino-acid repressor protein of Wolbachia in the psyllid represses SC1_gp110 (holin) gene of Ca. Liberibacter asiaticus which is critical for the survival of both endosymbionts in the psyllid⁵. This suggests a potential role of Wolbachia in CLas transmission and underscores the need for a well characterized Wolbachia genome⁴ in gaining a better grasp of and combating this dreadful citrus disease.

In some arthropods, such as Drosophila melanogaster, Aedes aegypti, Culex pipiens, Acraea encedon, Armadillidium vulgare, and Asobara tabida, Wolbachia can alter host reproduction and increase viral resistance^7,8,9. The presence of Wolbachia can manipulate the cellular and reproductive processes by inducing cytoplasmic incompatibility, parthenogenesis, feminization, or male killing⁸. The infection of Aedes aegypti by Wolbachia strains, wMelCS (D. melanogaster), wRi (D. simulans) and wPip (Culex quinquefasciatus) had effects on fitness, maternal transmission, cytoplasmic incompatibility, tissue tropism and dengue virus blocking¹⁰. In addition, a recent study showed the importance of Wolbachia as Wolbachia-infected A. aegypti were resistant to Zika and dengue virus co-infection and were suitable for mitigating mosquito-borne diseases¹¹. The role of Wolbachia in Hemiptera, including D. citri, remains poorly understood. The previously released draft wDi genome used paired-end and mate-pair Illumina datasets for the D. citri metagenome¹². The draft wDi genome (AMZJ01000000.1) was estimated to be 1.25 Mb with 124 contigs with gaps. In this study, we utilized single molecule real-time (SMRT) sequencing by Pacific Biosciences (PacBio) technology to generate long reads from isolated wDi from the host cells¹³. Several challenges confronting whole genome sequencing and de novo assembly of wDi genome exist, including: (1) difficulties in culturing and isolating large amounts of high quality wDi DNA, (2) the incidence of many long repetitive elements and lateral gene transfers (LGTs) from Wolbachia to the host genome, and 3) the presence of Insertion Sequences (IS) and WO-prophage sequences that complicate the complete genome assembly^14,15,16,17. The obstacles for generating a single complete contig have been overcome using long-read sequencing methods, such as PacBio, that generate longer reads through the repeats¹⁵. In this study, we utilized HiCanu¹⁸ for the complete assembly of genome sequences from wDi sample, which could resolve near-identical genomic repeats. The assembly resulted in a circular genome of 1.52 Mb which is the largest complete genome among assembled Wolbachia genomes to date among supergroup-B members, except for the complete Wolbachia genome from Folsomia candida (wFol) of 1.8 Mb¹⁹ (supergroup-E), invasive cherry fruit fly Rhagoletis cingulata (wCin2)²⁰ of 1.53 Mb (supergroup-A). The genome dataset will enhance our ability to elucidate the interactions of wDi with its D. citri host and associated endosymbionts.

Result and discussion

wDi genome assembly

The purpose of this study was to obtain an enclosed Wolbachia genome from D. citri. Recently, we published wDi genomes from a single collection point of the same wDi culture used in this study, which were near complete but could not be circularized²¹. The sequencing of obligate endosymbionts such as Wolbachia is not an easy task because of their very low abundance, inability to grow outside a host, and inability to culture axenically²². In addition, collection of large amounts of high-quality DNA for whole genome sequencing requires a large quantity of bacteria. This requires a high number of infected host cells to obtain the obligate endosymbiont bacteria²². Thus, in this study, wDi samples were collected from combination of two collection points (cell passages) from the same culture to obtain high quality wDi DNA for whole genome sequencing. An overview of wDi extraction and genome assembly pipeline is shown in Fig. 1.

To produce a high-quality assembly, circular consensus sequences (CCS) were used. CCS are derived accurately from the noisy individual subreads which are consensus sequence obtained from multiple passes of a single template molecule^23,24. The raw PacBio sequencing data obtained from the SMRT cells produced 899,643 filtered subreads and a total of approximately four billion bases, with the longest subread length of 118 kb. High quality CCS reads upto 32 kb size were generated from raw PacBio reads for high quality assembly. The maximum number of CCS reads (> 4,000) generated from using SMRT® LINK v7.0 using Sequel II system were of high quality with Q60 (Fig. 2a,b). Further, 45-bp left adapter sequences were trimmed from CCS reads. In addition, the short reads < 1000 bp and worst 10% of read bases were discarded to ensure high-quality assembly with the coverage of 72.89 × . We utilized pacbio-hifi parameter in Canu v1.9 to solve the complexity of Wolbachia genomes and generate complete assembly with overlapping ends that can be trimmed for circularization. Pacbio-hifi, recently integrated in Canu v1.9 provides high repeat resolution than pacbio-corrected at least on complex genomes like Wolbachia¹⁸. By default, Canu v1.9 with pacbio-hifi option uses only overlaps that are below 0.03% error which is much lower than used with pacbio-corrected option. In this study, we applied an even lower rate, correctedErrorRate = 0.001, that reduces the risk for the mis-assembly. Before trimming, the assembled genome size was 1,530,940 bp. The genomes after circularization were checked for potential errors using Illumina sequencing data. Firstly, the quality of trimmed Illumina data was ensured using FastQC to determine the data quality using various quality metrics. Phred quality scores per-base for the sample was higher than 30 and GC content of 33%, following a normal distribution. The Illumina data provided median coverage of 925 × for the sample. The analyses corrected 91 SNPs, 10 small insertions totaling 73 bases, and three small deletions totaling 41 bases. The de novo assembled genome after correction was 1,528,786 bp in size with an average GC content of 34.08% (Table 1).

Table 1 Assembly statistics of wDi genome assembly.

Full size table

The complete genome is longer than the previously reported draft contigs of wDi which was estimated to be 1.25 Mb¹². The wDi genome is largest among assembled Wolbachia genomes as compared with other Wolbachia from arthropods and nematodes. Previously, the largest Wolbachia genomes were from Folsomia candida (1.8 Mb)¹⁹, invasive cherry fruit fly Rhagoletis cingulata (1.53 Mb)²⁰ and embryos of Aedes albopictus (1.48 Mb)²⁵.

Genome annotations and assessments

The wDi genome was annotated including protein coding genes, 5S, 16S, and 23S rRNA and tRNA genes. An overview of their genome features, including CDSs, rRNAs, and tRNAs was visualized in CG view Server (Fig. 3). PGAP annotations showed assembled wDi chromosome to contain total of 1,435 genes which are 1,394 coding sequences with 1,202 protein coding genes. Forty-one genes are related to RNAs (three RNAs, 34 tRNAs, and four noncoding RNAs) and 192 are pseudogenes. We compared the complete wDi chromosome with the draft wDi in various perspectives using various tools implemented in Microscope platform²⁶. The core genes and genome specific genes was identified comparing wDi_AMZJ.1¹² based on Microscope gene families with parameter of 80% amino acid identity and 80% alignment coverage. A total of 1,073 genes were shared between two wDi genomes, while 239 and 183 genes were specific to wDi assembled in this study and wDi_AMZJ.1¹², respectively, based on single transitive links (single linkage) with alignment coverage constraints and implemented in a software package (called SiLiX for SIngle LInkage Clustering of Sequences) (Figure S1; Table S1). Notably, dnaK (fragment of chaperone protein), metC (fragment of cystathionine beta-lyase/L-cysteine desulfhydrase), ylbg (putative DNA-binding transcriptional regulator), insF (transposase), rpoC (fragment of RNA polymerase subunit beta), kefB (fragment of K +: H + antiporter) constituted the largest fraction of genes in complete wDi. However, the Microscope platform's gene phyloprofile analysis revealed that homologs for those genes exist in draft wDi, with homology constraints of identity greater than or equal to 35 percent (Table S2). In complete wDi, tandem duplications revealed 36 locations containing 286 genes, whereas draft wDi revealed just 20 regions involving 64 genes. Tandem duplicated genes have an identity ≥ 35% with a minLRap ≥ 0.8 and are separated by a maximum of five consecutive genes. It is evident that tandem duplications play major role in expansion of gene families²⁷. In addition, the comparison between complete and draft wDi was done using lineplots, dotplots, and mauve alignment. The lineplot showed the strand conservation and inversions in the syntenic regions and shows high prevalence of transposases and insertion sequences throughout the complete wDi genome that are absent in the draft wDi (Fig. 4a). The dot plot shows the breaks and inversions when compared to the draft wDi (Fig. 4b). Mauve alignment showed some regions in the complete wDi genome whose locally collinear blocks (LCBs) were absent in the draft wDi (Fig. 4c). Each LCB is a homologous sequence region shared by two or more of the genomes under investigation and does not contain any homologous sequence rearrangements²⁸. We also looked at a number of critical elements such as transposases, Ankyrin, DNA-repair genes, and resolvases in complete and draft wDi that are responsible for both difficulty in assembly and genome expansion. In complete and draft wDi, we found 109 versus 15 transposases, 57 versus 54 proteins with ankyrin repeats, 14 versus 11 DNA repair proteins, and six versus one resolvases. The homolog for 56-amino-acid repressor protein (WP_017531870) of Wolbachia in the psyllid that represses SC1_gp110 (holin) gene of Ca. Liberibacter asiaticus was also found in the complete wDi genome (GZ065_v1_1041).

The BUSCO completeness scores of assembled wDi genome was also compared to Wolbachia reference genomes using bacteria_odb10 database (calculated in this study). The BUSCO completeness of the final assembled wDi genome showed 80.6% as compared to other reference Wolbachia genomes wOo (78.2%), wOv (78.2%), wFol (81.5%), wAlbB (84.7%), wBm (79.8%), wOo (78.2%), wMau (83.9%), wMel (83.1%), wPip (86.3%) and wRi (83.9%) suggesting similar number of ‘complete and single-copy’ genes recovered in wDi genome compared to reference Wolbachia genomes and is typical and reliable for comparative genomics among Wolbachia genomes²⁵ (Figure S2). It has been suggested that even the complete genomes of Wolbachia miss up to 9 to 25 genes from the BUSCO set because of their endosymbiotic lifestyle which makes genes redundant, and these genes probably are not missing from the assemblies and annotations²⁹. The final assembled wDi genome showed 94.0% completeness when the subset database, rickettsiales_odb10 was used for the BUSCO analysis. In addition, the checkM completeness of the assembled wDi genome was 95.73% with 1.45% contamination. The checkM completeness and contamination falls within the range of ≥ 95% complete with ≤ 5% contamination that makes excellent reference genome for analysis^30,31. The checkM contamination of the previously published complete wFol genome (1.8 Mb)¹⁹ was 1.82% (calculated in this study) which was assembled from filtered reads obtained from F. candida genome that was sequenced using PacBio sequencing technology (Table S3). In addition, the taxonomy to Wolbachia sp. was confirmed using Centrifuge v1.0.3 tool that showed all sequences belonging to Wolbachia species.

Insertion sequences (ISs), prophage genes, ORF7 and Ankyrin proteins

Insertion sequences are bacterial class-II transposons that are capable of replication and can spread throughout the genome using cut-and-paste mechanism³². ISs are classified into about 20 families and play key role in genome evolution^32,33. Specifically, 10% of the Wolbachia genomes consist of insertion sequence elements³⁴. A total of 138 ORFs related to ISs were found in the wDi genome, belonging to 14 different IS families (Figure S3; Table S4). The most represented IS families were IS982 (28 copies; 20.3%), IS481 (26 copies; 18.8%), and IS110 (25 copies; 18.1%). Although the ISs in the wDi genome are diverse, they have less ORFs than in the entire circular wAlbB (CP031221) chromosome belonging to supergroup B, which has nine IS families and 216 ORFs associated to IS elements, with IS982 and IS481 having 99 and 76 copies, respectively. The other supergroup B members, wPip possess IS982, wNo and wMau possessed IS110 and wRi possess IS66_ssgr_ISBst12 as a dominant IS family. The majority of the members of the supergroup A, wWpum, wCin2, wMel, wMel_I23 possess IS5_ssgr_IS1031 as a dominant IS family while, wDAna possess IS110, wCsol and wHa possess IS5_ssgr_IS903 as a dominant IS family. Wolbachia belonging to supergroups C, D that infect filarial nematodes such as wOo (one IS ORF) and wBm (three IS ORFs) possess highly reduced IS elements with IS4_ssgr_IS231 and IS630 as a dominant IS family, respectively. The supergroup E and F members, wFol and wCle possessed IS5_ssgr_IS1031 as a dominant IS family with 117 and 231 IS ORFs respectively (Fig. 5).

Prophages are subjected to selective pressure from their hosts, resulting in a variety of partial DNA genomic abnormalities such as recombination, gene loss, and progressive disintegration³⁵. The prophage genes are dynamic elements that mediate horizontal gene transfer and are widespread in Wolbachia genomes^36,37. Defective genomic prophages, also known as cryptic prophages, are virions that have lost their ability to generate virions and lyse host cells^35,38. The most major difference between intact and cryptic WO is that intact WO possesses a rather complete gene module that codes for head, baseplate, and tail proteins, allowing it to generate active virions³⁹.The prophage regions in the wDi genome showed five regions (four intact and one incomplete or cryptic) sized 55.8 kb, 23.1 kb, 32.2 kb, 11.9 kb and 34.6 kb containing 64, 33, 21, 18, and 24 proteins, respectively (Figure S2; Table S5). Altogether, prophage region constituted total of 164 prophage-associated loci scattered in four intact and one incomplete regions with the combined size of 137.9 kb (10.3%) in the wDi genome. Based on the existence of all genomic structures (phage attachment sites, genes encoding structural phage proteins, and genes coding for proteins involved in DNA regulation, insertion to the host genome, and lysis), the four entire WOwDi phages have the ability to create virions. One cryptic WOwDi sized 11.8 kb (location: 522,438–534,303) lacks phage baseplate and tail assembly proteins. wDi genome supports widely held belief that Wolbachia with cryptic prophages usually has at least one intact WO prophage⁴⁰. This shows the expansion of the prophage region when compared to other supergroup B members such as wAlbB_CP031221 (1.47%), wNo (4.09%), wMau (4.07%) and but comparable to wPip (1.48 Mb genome size) with 9.25% prophage sequences (with only one 59.8 kb sized intact prophage region with other four cryptic prophage regions) (Fig. 5). Surprisingly, PHASTER analysis revealed two cryptic prophage regions of 6.4 kb and 15.4 kb in wAlbB_CP031221 without the presence of intact prophage region. However, four WO-like islands (designated wAlbB WO like island 01 through wAlbB WO like island 04) and 19 prophage-associated loci (13 CDS, 6 pseudogenes) were discovered by BLAST comparisons to several WO phages totaling 111 prophage-associated loci with a combined size of 116 kb (8%) without active prophages²⁵. Other Wolbachia genome only with cryptic prophages were found in group A member, wWpum (Wolbachia in Wiebesia pumilae)³⁹ having no ability to produce active virions.

The WO prophage areas are sometimes used in cytoplasmic incompatibility genetic investigations⁴¹. The BLASTp searches of WOMelB WD0631 (NCBI accession number AAS14330.1) and WD0632 (AAS14331.1) in Microscope platform for CifA and CifB protein sequences, respectively⁴¹ found no homologs in the wDi strain for CifA but a few for CifB. Among CifB hits using HHpred⁴², GZ065_v1_1517, GZ065_v1_0240 follow Module B-1 (ModB-1 with PDDEXK nuclease family, and various other restriction endonucleases such as NucS, HSDR_N, and MmeI), and GZ065_v1_0695, GZ065_v1_0696, GZ065_v1_0704 follow Module B-3 [with ubiquitin-modification (Ulp-1) and protease-like domains (Sentrin-specific protease)]⁴¹.

In addition, the wDi genome revealed the presence of four different minor capsid gene ORF7 paralogs (GZ065_00870, GZ065_01245, GZ065_01575, and GZ065_6965) (Figure S3) as in Nasonia vitripennis A Wolbachia³⁷ which are present in the four different prophage sequence regions. The protein domain annotations of the assembled genomes showed 57 (4.0%) proteins in the wDi genome to contain at least one copy of an ankyrin repeat domain (Figures S3; Table S6) which is comparable to ANK proteins wMel, wRi, and wPip with about 4% of the total genes⁴³. These ANK proteins of about 33 amino acids play significant role in interactions between host and symbionts^34,44 and are found abundantly in genes of WO-prophage⁴⁴.

Many contemporary hypotheses propose that obligate endosymbionts should have limited genome sizes⁴⁵, similar to Wolbachia strains in filarial nematodes, which contain no or few insertion sequences, transposable elements, and prophage sequences, due to their obligate association with the host⁴⁶. Recent study have shown that the genome of the obligatory wFol²⁹ strain, on the other hand, is the biggest complete Wolbachia genome ever identified, with 1,801,626 base pairs (bp) and highly enriched in repeated and mobile elements (124 transposases, 96 ankyrin repeat proteins, 34 DNA-repair genes, and 19 resolvases). In wDi too, the genome is highly enriched in repeated and mobile elements (109 transposases, 57 proteins with ankyrin repeats, 14 DNA repair proteins, and six resolvases) than other supergroup-B members²⁹. All known Wolbachia strains are in a similar transitional stage, in which they are primarily vertically transferred and do not exist in specialized structures⁴⁷. As a result, their genome size is expected to vary depending on the host⁴⁷.

COG, eggNOG, and pfam annotations

COG automatic classification revealed 1,092 CDSs classified in at least one COG group in the wDi genome (Table S7). eggNOG annotations of protein coding genes assigned functions to 1,221 protein coding genes (Table S8). The top five pathways were related to “replication, recombination and repair”, “translation, ribosomal structure and biogenesis”, “energy production and conversion”, “posttranslational modification, protein turnover, chaperones”, and “coenzyme transport and metabolism”. The Pathway Tools was used to observe whether the metabolic pathways were complete or not. The analysis showed 40 complete metabolic pathways and 62 incomplete metabolic pathways (Table S9). The pfam annotation of wDi identified 1075 protein coding genes with unique pfam domains. The important pfam domains for mobile genetic elements such as DDE Transposase domain DDE_Tnp_1 (PF01609), DDE_Tnp_1_3 (PF13612), DDE_Tnp_4 (PF13359), DDE_Tnp_IS240 (PF13610.6), Retroviral Integrase domain rve (PF00665), rve_3 (PF13683), and reverse transcriptase domain RVT_1 (PF00078) were found abundantly in wDi genome (Table S10).

Toxin-antitoxin system and Type IV Secretion SSystem (T4SS) genes

Toxin–antitoxin (TA) systems are genetic components that consist of a toxin gene (proteins) and its antitoxin counterpart (protein or non-coding RNAs). In bacteria various processes, like translation, replication, cytoskeleton development, membrane integrity, and cell wall biosynthesis are affected by TA toxins⁴⁸. PGAP annotation in the wDi genome revealed the presence of Type II RelE/ParE toxin genes, GZ065_00055, GZ065_03670 (pseudogene) and one Type II RatA family toxin gene, GZ065_04425. Based on the BLASTp search using wPip antitoxin gene, WP_007302904.1, we identified GZ065_00050 as a possible antitoxin gene for RelE toxin. Type II RatA family toxin gene, GZ065_04425 was situated immediate to ssrS noncoding RNA gene (Rfam RF00013), separated by fewer than 18 nucleotides. Previously, RelE/ParE and RatA/ssrS toxin-antioxin modules were also reported in wCle, wFol, wPip, wMel, wRi, wAu, wHa, wNo⁴⁹.

Genes related to the Type IV Secretion System (T4SS) are another important group represented in Wolbachia. Bacteria utilize T4SSs to proliferate and survive inside the host secreting protein effectors, protein-DNA complexes⁵⁰. The wDi genome revealed the presence of 14 genes associated to T4SSs (Table S11). These genes were organized in two operons in each wDi genome. Operon 1 contains virB8, virB9-1, virB10, virB11, and virD4. Operon 2 contains virB3, virB4, virB6-1, and virB6-2. The virB2 and virB7 genes were found to be scattered elsewhere in the genomes. Interestingly, we found both virB2 (three copies) and virB7 (one copy) genes in the wDi genome. These genes have been reported as absent among Wolbachia and most members of the order Rickettsiales^51,52. However, recent studies have shown the presence of virB2 gene (pilus component) in Wolbachia pipientis from Ae. albopictus (wAlbB)²⁵, Wolbachia from Laodelphax striatellus⁵³, Candidatus Wolbachia bourtzisii (wDacA), Wolbachia pipientis wDacB from Dactylopius coccus⁵⁴, and Wolbachia from Muscidifurax uniraptor (wUni)⁵⁵. In addition, the virB7 gene (pilus-associated protein) was previously observed only in Wolbachia from Laodelphax striatellus (wStri)⁵³. Bing et al.⁵³ also showed wDi clustered together with wStri with a strong support in a monophyletic clade and suggested that these strains shared the same ancestor.

Comparative genomics of wDi with reference Wolbachia genomes

The Wolbachia pangenome describes 2,112 gene clusters with 18,800 genes that were identified in 15 Wolbachia genomes. The pangenome study resulted three bins that were unique to wDi genomes. The Bin_1 consisted of 58 gene clusters with 127 genes common in both complete and incomplete wDi_AMZJ.1¹² genomes, Bin_2 consisted of 29 gene clusters with 62 genes that were unique to the complete wDi genome, and Bin_3 consisted of 12 gene clusters with 13 genes that were unique to incomplete wDi_AMZJ.1¹² genome (Fig. 6a, Table S12). The largest fraction of genes in three bins constituted Ankyrin repeat proteins (n = 28; play important role in interactions between host and symbionts) and IS4 transposase (n = 11; play role in DNA mobility using “cut and paste” mechanism), chromosome segregation ATPases (n = 5; play important role in chromosome condensation and segregation during cytoplasmic incompatibility in male insects), curved DNA-binding protein CbpA, containing a DnaJ-like domain (n = 2; act as a molecular chaperone in an adaptive response to environmental stresses other than heat shock), DNA repair protein RadC (n = 2), DNA-directed RNA polymerase (n = 2), RecA-family ATPase (n = 6) , REP element-mobilizing transposase (n = 2), transcriptional regulator with XRE-family HTH domain (n = 2), Mg/Co/Ni transporter MgtE (n = 2; important in inorganic ion transport and metabolism) and rest were conserved protein with unknown function.

The ANI values among the wDi genome and reference Wolbachia genomes indicated the similarity in the range of 82% (supergroup D-wBm) to 95% (supergroup B-wAlbB) and 99.8% to incomplete wDi_AMZJ.1¹² genome (Fig. 6b). OrthoFinder assigned 21,264 genes (96.3% of total) to 1,924 orthogroups (Table S13) in the 15 Wolbachia genomes. There were 626 orthogroups with all species present and 407 of these consisted entirely of single-copy genes (Fig. 6c). The analysis showed 43 orthogroups unique to complete and draft wDi genomes.

Phylogenetics of wDi and other Wolbachia genomes

The IQ-TREE v 1.6.8 tool was used to construct a ML phylogenetic tree using the concatenated protein sequences of single copy genes including ribosomal proteins of reference Wolbachia genomes obtained from NCBI database (Table S14) with the wDi genome. The single copy genes were utilized instead of multilocus sequence typing loci (gatB, coxA, hcpA, fbpA, and ftsZ)⁵⁸ which are problematic in phylogenetic analyses and may not accurately represent the properties of different Wolbachia strains⁵⁹. The advent of sequencing technology and availability of complete and draft genomes of Wolbachia, recent phylogenetic studies have been done utilizing single copy gene sets^53,59,60 rather than whole-genome sequence typing⁶¹. Although comparisons of whole Wolbachia genome sequences is useful for strain differentiation, diversity estimates, and phylogenetic analyses, the size is cumbersome and not necessary to answer specific questions that can be addressed using genetic marker loci⁵⁹. The obtained tree (Fig. 7) indicated that the wDi genome belonged to supergroup-B Wolbachia strains (wVulC, wCon, wLug, wBta, wStri, wAlbB, wDacB, wLcl, wNo, wMau, wAus, Ob_Wba, wBol1-b, wMeg, and wPip) and made a clade with wStri (the Wolbachia from Korean Laodelphax striatellus population) and wStri_1 (the Wolbachia from Chinese L. striatellus population). Wolbachia are supergrouped (A, B, E–H), the Wolbachia endosymbionts of arthropods belong to supergroup-A and -B and of filarial nematodes belong to supergroup-C and -D^8,62. wPpe belongs to supergroup-L⁶³, whereas wCfeT strain is ancestrally to most other Wolbachia lineages (used as an outgroup)⁶⁴. The phylogenetic analysis by Saha et al.¹² also indicated that Wolbachia from D. citri belongs to supergroup-B using FtsZ and Wsp genes.

Conclusions

The genome sequence of the Wolbachia culture isolated from D. citri was completely assembled and compared with other Wolbachia genomes available in the NCBI database. This study is in accordance with the study by Sinha et al.²⁵, which demonstrated that high quality, complete Wolbachia genome assemblies can be achieved from long-read sequences of high coverage without enrichment, such as through Large Enriched Fragment Targeted Sequencing⁶⁷ and other target genome enrichment techniques^68,69. In this study, we used DNA from an axenic Wolbachia cultures for whole genome sequencing rather than filtering Wolbachia sequence reads from the whole insect genome sequence. The latter, referred to as a metagenomic sequencing approach, is a frequent practice that generates low coverage reads for Wolbachia genome assembly^70,71. Recent integration of the pacbio-hifi option in Canu (HiCanu) facilitates generation of complete assemblies consisting of repeat resolution on complex genomes like that of Wolbachia rather than pacbio-corrected assemblies in previous versions. In addition, concatenated protein sequences of single copy genes generated using hmm source from Campbell et al.⁶⁵ delineated supergroup-B Wolbachia of D. citri from other supergroups. The availability of a complete circular genome of the D. citri endosymbiont, Wolbachia, will facilitate the development of endosymbiont-mediated strategies for pest and disease management. This study expands the list of complete Wolbachia reference genomes that can be useful in studying evolutionary relationships among Wolbachia of arthropods and nematodes.

Materials and methods

Extraction of Wolbachia from D. citri (wDi)

D. citri were collected from a laboratory culture established in 2005 from a population collected in Polk Co. (28.0′ N, 81.9′ W), Lake Alfred, Florida, USA. Individual psyllids were placed on sterile diet rings for two days prior to Wolbachia extraction. The surface sterilized psyllid was homogenized in 1.0 mL of Schneider’s Drosophila (S2) medium (catalog number 21720024, Gibco) followed by centrifugation at 100 × g for five minutes. The supernatant was further centrifuged at 400 × g for five minutes to pellet wDi with insect debris. The pellet was resuspended with 1.0 mL of S2 medium separate wDi from impurities. The samples were centrifuged at 100 × g for five minutes to pellet impurities, and the supernatant was transferred to a new tube. The final centrifuge step was conducted at 4000 × g for five minutes, and the pelleted wDi was resuspended in fresh 1.0 mL of S2 media.

Infection of wDi in S2 cells and isolation of wDi from cell culture

Drosophila S2 cells (catalog number R69007, Invitrogen) were infected with Wolbachia extracted from Diaphorina citri (S2 + wDi)⁷² and maintained in Schneider’s Drosophila medium (catalog number 21720024, Gibco) containing 10% heat inactivated fetal bovine serum (catalog number 10082147, Gibco); 50 units of penicillin and 50 μg streptomycin sulfate (catalog number 15070063, Gibco) per mL (S2 complete media) Dobson et al.⁷³ according to standard procedures⁷⁴. The S2 + wDi cells were harvested and lysed by vortex using 3 mm borosilicate glass beads to isolate wDi. The supernatant samples were processed as described by Rasgon et al.⁷⁵. wDi cells from the same culture were collected on different dates (different cell passages, 26 and 28) and combined to obtain enough wDi DNA to produce a complete genome²¹.

wDi Genomic DNA (gDNA) extraction

The wDi gDNA was extracted using the MagAttract HMW DNA Mini kit (catalog number 67563, Qiagen) using manufacturer’s protocol with few modifications. The modifications were as follows: The bacterial pellet was resuspended in 180 µl ATL buffer [from DNeasy® Blood and Tissue Kit (catalog number 69506, Qiagen)] with 20 µl Proteinase K and incubated for 30 min at 56 °C. 15 μl MagAttract Suspension and 280 μl Buffer MB was added to the sample and mixed by pulse vortexing. The sample tubes were transferred to the tube holder of the Magnetic Rack (without the magnetic insert). The tube holder of the Magnetic Rack (without the magnetic insert) was placed onto the mixer and incubate at room temperature (15–25 °C) for 3 min at 1400 rpm. The magnetic insert was placed into the tube holder of the Magnetic Rack, wait (~ 1 min) until bead separation has been completed, and the supernatant was removed. The extracted gDNA was purified using the DNeasy PowerClean Cleanup kit (catalog number 1287750, Qiagen). gDNA was quantified using the Qubit 1 × dsDNA HS Assay kit (ThermoFisher Scientific) and DNA quality was assessed using the TapeStation Genomic DNA ScreenTape (Agilent Technologies).

Long-read (PacBio) sequencing

Sequencing of wDi gDNA was performed on six replicate samples (five samples are not included in this study). wDi gDNA (4–8 µg in 150 µl TE) was sheared down to 10 kb using Covaris g-TUBES (catalog number 520079, Covaris Inc.), using two passes at 7,000 rpm. The resulting size of the fragments was verified on the TapeStation Genomic DNA ScreenTape (Agilent Technologies). Barcoded, 10 kb insert-size libraries were constructed using 600–700 ng of pure and fragmented (10 kb) from each bacterial sample using the protocol of PacBio for multiplex SMRT sequencing of bacterial genomes (PacBio Manual PN 101–069-200–02) in conjunction with barcodes from the Barcoded Adaptor Kit 8A (PacBio PN 101–081-300). Briefly, the library construction reactions consisted of the following sequential steps: ExoVII treatment, DNA Damage Repair, End Repair and Blunt-end ligation of barcoded SMRT bell adaptors. After ligation, samples were pooled, purified using AMPure, and treated with ExoIII/ExoVII to eliminate excess adaptors and any damaged DNA. This procedure resulted in ~ 800 ng of adaptor ligated SMRT bell library. The final library was further size selected in the SageELF™ instrument (catalog number ELD7510), using 0.75% agarose gel cassettes and the 1–18 kb v2 cassette definition program. The desired SageELF™ fractions in the 5–20 kb range, averaging 10 kb (TapeStation) were cleaned using AMPure magnetic beads (0.6:1.0 beads to sample ratio) and eluted in 15 μl of 10 nM Tris HCl, pH 8.0. The library size selection by ELF step yielded 126 ng of ready-to-sequence material. Sequencing was performed on the PacBio SEQUEL instrument using the Chemistry 3.0 reagents in combination with the SMRT® LINK v 6.0 software. The library was added on the PacBio SEQUEL sample plate at 8 pM by diffusion-loading and 224 min pre-extension time for sequencing in LR-SMRT cells with 20-h data collection. All other steps for sequencing were done according to the recommended protocol by PacBio sequencing calculator.

Short-read (Illumina) sequencing

The gDNA samples for Illumina sequencing were fragmented using the Covaris to 400 bp following the manufacturer recommended protocol. The genomic libraries were constructed using 100 ng as the input and the NEBNext Ultra II DNA library prep kit for Illumina (New England Biolabs). Three PCR cycles were performed with each library prior to library validation using the TapeStation High Sensitivity D5000 ScreenTape (Agilent Technologies). Libraries were quantified using the Qubit 1 × dsDNA HS Assay kit (ThermoFisher Scientific) and molar concentration was calculated to pool the libraries in equimolar ratios. The pool was then quantified and 14 pM was loaded into the MiSeq flow cell. The run was set as a 300 paired-end run using the 600-cycles v3 kit.

De novo genome assembly

PacBio CCS were generated using SMRT® LINK v7.0 using Sequel II system. The parameters used for CCS generation were minimum full passes of three and minimum predicted accuracy of 99%. The left adapter sequences (45 bp) were trimmed using seqtk (https://github.com/lh3/seqtk). The reads smaller than 1000 bp were filtered out using filtlong (–min_length 1000, –keep_percent 90) (https://github.com/rrwick/Filtlong). The de novo assembly was done using Canu v1.9 (https://github.com/marbl/canu)⁷⁶ using the “pacbio-hifi” option¹⁸. The suggested circular chromosome was rendered using the following parameters: trim-assemble, genomeSize = 1.5 m, correctedErrorRate = 0.001, cnsErrorRate = 0.050, minReadLength = 3000. The resulted contig was circularized by introducing a ‘break’ in the single contig using Amos v3.1.0 and Minimus2 (http://amos.sourceforge.net/wiki/index.php/Minimus2) that trimmed the duplicate sequences in the beginning and end of the chromosome to produce a circular genome. The origin of replication was adjusted using Circlator v1.5.5⁷⁷.

Genome correction

The PacBio-only assembled genome can have a high probability of indel errors⁷⁸. Therefore, the assembled genome was checked for potential errors using Illumina data obtained from respective samples using the Pilon error-detection and correction tool⁷⁹. The adapters and low-quality Illumina sequences were filtered using program Trimmomatic v0.36 (ILLUMINACLIP: adapters.fasta:2:30:20 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:50)⁸⁰. The quality of trimmed reads was assessed using FastQC v0.11.7⁸¹. After cleaning, the reads were mapped to the PacBio chromosome using bwa v0.7.17⁸² using pair-end mode. The indexed bam output file obtained from bwa was utilized for indel correction using Pilon v1.22⁷⁹.

Genome annotations and assessments

Genome annotation was done using the standard NCBI Prokaryotic Genome Annotation Pipeline (PGAP)⁸³ and Microscope platform²⁶. PGAP annotations are available at NCBI GenBank. The annotations from Microscope platform were used for some comparative studies and mentioned when discussed below (represented by GZ065_v1_n). The completeness of the genome was assessed using Benchmarking Universal Single-Copy Orthologs (BUSCO) v4 using bacteria_obd10 database (Creation date: 2019–06-26, number of species: 4085, number of BUSCOs: 124) and rickettsiales_odb10 database (Creation date: 2020–03-06, number of species: 34, number of BUSCOs: 364)⁸⁴ and CheckM⁸⁵. Microscope platform was utilized for completeness using CheckM, Clusters of Orthologous Groups (COG) classification of proteins including functional annotation of protein-coding genes using eggNOG-Mapper v1.0.3⁸⁶ , eggNOG database v4.5.1⁸⁷, encoded pathway analysis via Pathway Tools v23⁸⁸ and the MicroCyc metabolic pathways database⁸⁹. The map of the circular genome with gene feature information was generated using CGView⁹⁰. The SiLiX software⁹¹ integrated in the Microscope platform that uses the MicroScope gene families (MICFAM) was used for the analysis of the components (core-genome, strain specific sequences) for complete and draft wDi. MAUVE²⁸ was used for complete and draft wDi genomes alignments with locally collinear blocks. Gepard⁹² was used for creating dot plot between complete and draft wDi genomes. LinePlot tool implemented in the Microscope platform was used to create a line plot for a global comparison, based on minimum synton size of eight genes. Protein sequences from Microscope platform were used for identifying Pfam domains using pfam_scan.pl script v1.5 (last accessed March 10, 2020) using Pfam database v31.0⁹³. The prophage regions were identified by PHAge Search Tool Enhanced Release (PHASTER. https://phaster.ca/)⁹⁴ (last accessed September 28, 2021). ISsaga web server http://issaga.biotoul.fr/issaga_index.php⁹⁵(last accessed September 28, 2021) was used to find Insertion Sequence (IS) elements using ISfinder database³³. HHpred⁴² was used for the detection of protein domains for identification of modules⁴¹ to categorize the possible cytoplasmic incompatibility genes. ORF7, or phage WO-B genome was identified from Pfam which are molecular markers for Wolbachia strain typing^96,97 and plays a possible role in inducing cytoplasmic incompatibility⁹⁸. The prophage sequences, IS elements, Ankyrin genes, T4SS genes and ORF7 sequences in the corresponding wDi genomes was represented in a circos plot using Circa (OMGenomics, http://omgenomics.com/circa/).

Comparative genomics of wDi genome with other Wolbachia genomes

Wolbachia metapangenome, ANI identity, and orthogroup analyses

The assembled wDi genome from this study was compared to various reference genomes: wPip⁹⁹, wAlbB¹⁰⁰, wAlbB_CP031221²⁵, wMel⁴⁴, wBm_CP034333⁶⁷, wBm¹⁰¹, wMau⁶⁷, wRi³⁴, wDAna¹⁰², wHa²², wMel_I23⁷⁰, wNo²² and wRec¹⁰³. The previously published, non-circular wDi genomes wDi_AMZJ.1¹² was also included in the comparison. The pangenome analyses were performed using anvio v5.5.0⁵⁷ (http://merenlab.org/software/anvio/). The taxonomy was assigned using Centrifuge v1.0.3¹⁰⁴. The COGs to the reference genomes were assigned using program ‘anvi-run-ncbi-cogs’. The program ‘anvi-pan-genome’ was used following flags and parameters: ‘-use-ncbi-blast’, ‘-minbit 0.5’, and ‘-mcl-inflation 5’ for the wDi genome and reference genomes. The similarity between the wDi and reference genomes were calculated using ‘anvi-compute-ani’ which utilizes PyANI⁵⁶ in ‘ANIb’ mode to compute average nucleotide identity across the genomes. The orthogroups across the wDi and reference genomes were identified using Orthofinder v2.4.0¹⁰⁵ and common orthogroups across multiple genomes were visualized via UpSet plot using Intervene (https://asntech.shinyapps.io/intervene/)¹⁰⁶.

Phylogenetic analysis

We constructed two maximum likelihood phylogenetic trees in different scale. The phylogenetic analysis was performed using protein sequences hits obtained via ‘anvi-get-sequences-for-hmm-hits,’ which utilizes the hidden markov model (hmm) source from Campbell et al.⁶⁵ using 139 single copy genes including 48 ribosomal genes. One small scale phylogenetic tree was constructed using seventeen complete Wolbachia chromosomes for studying and visualizing the abundance and variations of Insertion and prophage sequences. For big scale phylogenetic tree, seventy-seven Wolbachia genomes (taxid: 953) were downloaded from the NCBI database using command ncbi-genome-download to perform the phylogenetic analysis with wDi genome. The concatenated protein sequences of single copy genes were aligned using MUSCLE¹⁰⁷ and were subjected to ModelFinder¹⁰⁸ for RAxML tree using Bayesian Information Criterion (BIC). The best amino acid substitution model was used for construction of maximum likelihood phylogenetic tree using IQ-TREE v1.6.8⁶⁶ using ultrafast bootstrap mode with 5000 iterations. Branch support was estimated using the Shimodaira–Hasegawa (SH)-like approximate likelihood ratio test with 1,000 replicates. Modelfinder and IQ-TREE was integrated in a PhyloSuite v1.2.2 software¹⁰⁹ The rerooting, labeling, and color coding of the phylogenetic tree was performed using iTOL v5.7 (https://itol.embl.de/)¹¹⁰.

Data availability

The accessions SRR10985324, and SRR11075881 under Bioproject PRJNA603775 connected with biosample SAMN13940805 have been deposited at the NCBI. The assembled genome and annotations have been deposited at the NCBI GenBank database under the accession CP048819. All the supplemental materials have been uploaded in Figshare: https://doi.org/10.6084/m9.figshare.14397131. Figure S1. Venn diagram showing common and genome specific genes between complete wDi and draft wDi_AMZJ.1 genome. Figure S2. BUSCO assessment of the completeness of wDi genomes with reference sequences. Figure S3. Circos plot representation of various features in the wDi genome. The wDi genome is represented by the outer circle. The first, second, third, fourth and fifth inner circle represents the track for IS elements, Ankyrin genes, T4SS genes, prophage sequences, and ORF7 sequences, respectively in the wDi genome. Table S1 shows list of complete and draft wDi genome specific genes. Table S2 shows list of orthologs of complete and draft wDi using annotation from Microscope platform. Table S3 shows list of Wolbachia genomes sequenced and assembled using different technology and assembly tools. Table S4 shows Insertion Sequences (ISs) in the wDi genome. Table S5 shows prophage statistics in the wDi genome. Table S6 shows list of Ankyrin genes in the wDi genome. Table S7 shows COG automatic classification of protein coding genes in the wDi genome. Table S8 shows eggNOG annotations of protein coding genes in the wDi genome. Table S9 shows Metabolic pathways analysis in the wDi genome. Table S10 shows Pfam domain annotations for the wDi proteins of the wDi genome. Table S11 shows list of genes related to Type IV Secretion System in the wDi genome. Table S12 shows summary of Wolbachia Pan gene clusters. Table S13 shows Orthogroup analyses. Table S14 shows list of Wolbachia genome assemblies downloaded from the NCBI database, consisting of 139 single copy genes including 48 ribosomal genes from Campbell et al.⁶⁵ used for the hidden markov model (hmm) source, concatenated protein sequences, and phylogenetic tree construction file.

References

Gottwald, T. R. Current epidemiological understanding of citrus huanglongbing. Annu. Rev. Phytopathol. 48, 119–139 (2010).
Article CAS PubMed Google Scholar
Nakabachi, A. et al. Defensive bacteriome symbiont with a drastically reduced genome. Curr. Biol. 23, 1478–1484 (2013).
Article CAS PubMed Google Scholar
Pelz-Stelinski, K. & Killiny, N. Better together: Association with ‘Candidatus Liberibacter asiaticus’ increases the reproductive fitness of its insect vector, Diaphorina citri (Hemiptera: Liviidae). Ann. Entomol. Soc. Am. 109, 371–376 (2016).
Article CAS PubMed PubMed Central Google Scholar
Chu, C.-C., Gill, T. A., Hoffmann, M. & Pelz-Stelinski, K. S. Inter-population variability of endosymbiont densities in the Asian citrus psyllid (Diaphorina citri Kuwayama). Microb. Ecol. 71, 999–1007 (2016).
Article PubMed PubMed Central Google Scholar
Jain, M., Fleites, L. A. & Gabriel, D. W. A small Wolbachia protein directly represses phage lytic cycle genes in “Candidatus Liberibacter asiaticus” within psyllids. MSphere 2, e00171-e1117 (2017).
Article PubMed PubMed Central Google Scholar
Fagen, J. R. et al. Characterization of the relative abundance of the citrus pathogen Ca: Liberibacter asiaticus in the microbiome of its insect vector, Diaphorina citri, using high throughput 16S rRNA sequencing. Open Microbiol. J. 6, 29 (2012).
Article CAS PubMed PubMed Central Google Scholar
Serbus, L. R., Casper-Lindley, C., Landmann, F. & Sullivan, W. The Genetics and Cell Biology of Wolbachia-Host Interactions. Annu. Rev. Genet. 42, 683–707. https://doi.org/10.1146/annurev.genet.41.110306.130354 (2008).
Article CAS PubMed Google Scholar
Werren, J. H., Baldo, L. & Clark, M. E. Wolbachia: master manipulators of invertebrate biology. Nat. Rev. Genet. 6, 741–751. https://doi.org/10.1038/nrmicro1969 (2008).
Article CAS Google Scholar
Kamtchum-Tatuene, J., Makepeace, B. L., Benjamin, L., Baylis, M. & Solomon, T. The potential role of Wolbachia in controlling the transmission of emerging human arboviral infections. Curr. Opin. Infect. Dis. 30, 108–116. https://doi.org/10.1097/QCO.0000000000000342 (2017).
Article PubMed PubMed Central Google Scholar
Fraser, J. E. et al. Novel Wolbachia-transinfected Aedes aegypti mosquitoes possess diverse fitness and vector competence phenotypes. PLoS Pathog. 13, e1006751 (2017).
Caragata, E. P. et al. Pathogen blocking in Wolbachia-infected Aedes aegypti is not affected by Zika and dengue virus co-infection. PLOS Negl. Trop. Dis. 13, e0007443 (2019).
Saha, S. et al. Survey of endosymbionts in the Diaphorina citri metagenome and assembly of a Wolbachia wDi draft genome. PLoS ONE 7, 1 (2012).
Article Google Scholar
Eid, J. et al. Real-time DNA sequencing from single polymerase molecules. Science 323, 133–138 (2009).
Article ADS CAS PubMed Google Scholar
Hotopp, J. C. D. & Klasson, L. The complexities and nuances of analyzing the genome of Drosophila ananassae and its Wolbachia endosymbiont. G3-Genes Genom Genet. 8, 373–374 (2018).
CAS Google Scholar
Goodwin, S., McPherson, J. D. & McCombie, W. R. Coming of age: ten years of next-generation sequencing technologies. Nat. Rev. Genet. 17, 333 (2016).
Article CAS PubMed Google Scholar
Kent, B. N. et al. Complete bacteriophage transfer in a bacterial endosymbiont (Wolbachia) determined by targeted genome capture. Genome Biol. Evol. 3, 209–218 (2011).
Article CAS PubMed PubMed Central Google Scholar
Blaxter, M. Symbiont Genes in Host Genomes: Fragments with a Future?. Cell Host Microbe 2, 211–213. https://doi.org/10.1016/j.chom.2007.09.008 (2007).
Article CAS PubMed Google Scholar
Nurk, S. et al. HiCanu: accurate assembly of segmental duplications, satellites, and allelic variants from high-fidelity long reads. Genome Res. (2020).
Faddeeva-Vakhrusheva, A. et al. Coping with living in the soil: the genome of the parthenogenetic springtail Folsomia candida. BMC Genomics 18, 493. https://doi.org/10.1186/s12864-017-3852-x (2017).
Article CAS PubMed PubMed Central Google Scholar
Wolfe, T. M. et al. Comparative genome sequencing reveals insights into the dynamics of Wolbachia in native and invasive cherry fruit flies. Mol. Ecol. https://doi.org/10.1111/mec.15923 (2021).
Article PubMed Google Scholar
Neupane, S., Bonilla, S. I., Manalo, A. M. & Pelz-Stelinski, K. S. Near-Complete Genome Sequences of a Wolbachia Strain Isolated from Diaphorina citri Kuwayama Microbiol. Resour. Announc. 9, e00560-e1520. https://doi.org/10.1128/MRA.00560-20 (2020).
Article CAS Google Scholar
Ellegaard, K. M., Klasson, L., Näslund, K., Bourtzis, K. & Andersson, S. G. Comparative genomics of Wolbachia and the bacterial species concept. PLoS Genet. 9, 1 (2013).
Article Google Scholar
Travers, K. J., Chin, C.-S., Rank, D. R., Eid, J. S. & Turner, S. W. A flexible and efficient template format for circular consensus sequencing and SNP detection. Nucl. Acids Res. 38, e159–e159 (2010).
Article PubMed PubMed Central Google Scholar
Loomis, E. W. et al. Sequencing the unsequenceable: expanded CGG-repeat alleles of the fragile X gene. Genome Res. 23, 121–128 (2013).
Article CAS PubMed PubMed Central Google Scholar
Sinha, A., Li, Z., Sun, L. & Carlow, C. K. Complete Genome Sequence of the Wolbachia wAlbB Endosymbiont of Aedes albopictus. Genome Biol. Evol. 11, 706–720 (2019).
Article CAS PubMed PubMed Central Google Scholar
Vallenet, D. et al. MicroScope: an integrated platform for the annotation and exploration of microbial gene functions through genomic, pangenomic and metabolic comparative analysis. Nucl. Acids Res. 48, D579–D589 (2020).
CAS PubMed Google Scholar
Zhang, J. Evolution by gene duplication: An update. Trends Ecol. Evol. 18, 292–298 (2003).
Article Google Scholar
Darling, A. C. E., Mau, B., Blattner, F. R. & Perna, N. T. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 14, 1394–1403. https://doi.org/10.1101/gr.2289704 (2004).
Article CAS PubMed PubMed Central Google Scholar
Kampfraath, A. A. et al. Genome expansion of an obligate parthenogenesis-associated Wolbachia poses an exception to the symbiont reduction model. BMC Genom. 20, 106. https://doi.org/10.1186/s12864-019-5492-9 (2019).
Article CAS Google Scholar
Brady, A. & Salzberg, S. L. Phymm and PhymmBL: metagenomic phylogenetic classification with interpolated Markov models. Nat. Methods 6, 673–676. https://doi.org/10.1038/nmeth.1358 (2009).
Article CAS PubMed PubMed Central Google Scholar
Parks, D. H., MacDonald, N. J. & Beiko, R. G. Classifying short genomic fragments from novel lineages using composition and homology. BMC Bioinformatics 12, 328. https://doi.org/10.1186/1471-2105-12-328 (2011).
Article CAS PubMed PubMed Central Google Scholar
Chandler, M. & Mahillon, J. i Insertion sequences revisited, p. 305–366. In N. L. Craig, R. Craigie, M. Gellert, and A. Lambowitz (ed.), Mobile DNA II. American Society for Microbiology, Washington, D.C. (2002).
Siguier, P., Pérochon, J., Lestrade, L., Mahillon, J. & Chandler, M. ISfinder: the reference centre for bacterial insertion sequences. Nucl. Acids Res. 34, D32–D36 (2006).
Article CAS PubMed Google Scholar
Klasson, L. et al. The mosaic genome structure of the Wolbachia wRi strain infecting Drosophila simulans. Proc. Natl. Acad. Sci. 106, 5725–5730 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Canchaya, C., Proux, C., Fournous, G., Bruttin, A. & Brüssow, H. Prophage genomics. Microbiol. Mol. Biol. Rev. 67, 238–276 (2003).
Article CAS PubMed PubMed Central Google Scholar
Masui, S. et al. Bacteriophage WO and virus-like particles in Wolbachia, an endosymbiont of arthropods. Biochem. Biophys. Res. Commun. 283, 1099–1104 (2001).
Article CAS PubMed Google Scholar
Bordenstein, S. R. & Wernegreen, J. J. Bacteriophage Flux in Endosymbionts (Wolbachia): Infection Frequency, Lateral Transfer, and Recombination Rates. Mol. Biol. Evol. 21, 1981–1991. https://doi.org/10.1093/molbev/msh211 (2004).
Article CAS PubMed Google Scholar
Saridaki, A. et al. Wolbachia prophage DNA adenine methyltransferase genes in different Drosophila-Wolbachia associations. PLoS One 6, e19708 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Miao, Y.-h., Xiao, J.-h. & Huang, D.-w. Distribution and evolution of the bacteriophage WO and Its antagonism with Wolbachia. Front. Microbial. 11. https://doi.org/10.3389/fmicb.2020.595629 (2020).
Kent, B. N., Funkhouser, L. J., Setia, S. & Bordenstein, S. R. Evolutionary genomics of a temperate bacteriophage in an obligate intracellular bacteria (Wolbachia). PLoS One 6, e24984 (2011).
Lindsey, A. R. I. et al. Evolutionary genetics of cytoplasmic incompatibility genes cifA and cifB in prophage WO of Wolbachia. Genome Biol. Evol. 10, 434–451 (2018).
Article CAS PubMed PubMed Central Google Scholar
Söding, J., Biegert, A. & Lupas, A. N. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res. 33, W244–W248. https://doi.org/10.1093/nar/gki408 (2005).
Article CAS PubMed PubMed Central Google Scholar
Klasson, L. et al. Genome evolution of Wolbachia strain wPip from the Culex pipiens group. Mol. Biol. Evol. 25, 1877–1887 (2008).
Article CAS PubMed PubMed Central Google Scholar
Wu, M. et al. Phylogenomics of the reproductive parasite Wolbachia pipientis wMel: a streamlined genome overrun by mobile genetic elements. PLoS Biol.2, e69 (2004).
McCutcheon, J. P. & Moran, N. A. Extreme genome reduction in symbiotic bacteria. Nat. Rev. Microbiol. 10, 13–26. https://doi.org/10.1038/nrmicro2670 (2012).
Article CAS Google Scholar
Comandatore, F. et al. Supergroup C Wolbachia, mutualist symbionts of filarial nematodes, have a distinct genome structure. Open Biol. 5, 150099 (2015).
Article PubMed PubMed Central Google Scholar
Lo, W.-S., Huang, Y.-Y. & Kuo, C.-H. Winding paths to simplicity: genome evolution in facultative insect symbionts. FEMS Microbiol. Rev. 40, 855–874. https://doi.org/10.1093/femsre/fuw028 (2016).
Article CAS PubMed PubMed Central Google Scholar
Unterholzner, S. J., Poppenberger, B. & Rozhon, W. Toxin–antitoxin systems. Mob. Genet. Elements. 3, e26219. https://doi.org/10.4161/mge.26219 (2013).
Article CAS PubMed PubMed Central Google Scholar
Fallon, A. M. Computational evidence for antitoxins associated with RelE/ParE, RatA, Fic, and AbiEii-family toxins in Wolbachia genomes. Mol. Genet. Genomics 295, 891–909. https://doi.org/10.1007/s00438-020-01662-0 (2020).
Article CAS PubMed Google Scholar
Gonzalez-Rivera, C., Bhatty, M. & Christie, P. J. Mechanism and function of type IV secretion during infection of the human host. Microbiol. Spectr. 4 (2016).
Rancès, E., Voronin, D., Tran-Van, V. & Mavingui, P. Genetic and functional characterization of the type IV secretion system in Wolbachia. J. Bacteriol. 190, 5020–5030 (2008).
Article PubMed PubMed Central Google Scholar
Pichon, S. et al. Conservation of the Type IV secretion system throughout Wolbachia evolution. Biochem. Biophys. Res. Commun. 385, 557–562 (2009).
Article CAS PubMed Google Scholar
Bing, X.-L., Zhao, D.-S., Sun, J.-T., Zhang, K.-J. & Hong, X.-Y. Genomic Analysis of Wolbachia from Laodelphax striatellus (Delphacidae, Hemiptera) Reveals Insights into Its “Jekyll and Hyde” Mode of Infection Pattern. Genom. Biol. Evol. 12, 3818–3831. https://doi.org/10.1093/gbe/evaa006 (2020).
Article CAS Google Scholar
Ramírez-Puebla, S. T. et al. Genomes of Candidatus Wolbachia bourtzisii wDacA and Candidatus Wolbachia pipientis wDacB from the Cochineal Insect Dactylopius coccus (Hemiptera: Dactylopiidae). G3-Genes Genom. Genet. 6, 3343–3349. https://doi.org/10.1534/g3.116.031237 (2016).
Article CAS Google Scholar
Newton, I. L. G. et al. Comparative genomics of two closely related Wolbachia with different reproductive effects on hosts. Genom. Biol. Evol. 8, 1526–1542. https://doi.org/10.1093/gbe/evw096 (2016).
Article Google Scholar
Pritchard, L., Glover, R. H., Humphris, S., Elphinstone, J. G. & Toth, I. K. Genomics and taxonomy in diagnostics for food security: Soft-rotting enterobacterial plant pathogens. Anal. Methods. 8, 12–24 (2016).
Article Google Scholar
Eren, A. M. et al. Anvi’o: An advanced analysis and visualization platform for ‘omics data. PeerJ 3, e1319. https://doi.org/10.7717/peerj.1319 (2015).
Article PubMed PubMed Central Google Scholar
Baldo, L. et al. Multilocus sequence typing system for the endosymbiont Wolbachia pipientis. Appl. Environ. Microbiol. 72, 7098. https://doi.org/10.1128/AEM.00731-06 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar
Bleidorn, C. & Gerth, M. A critical re-evaluation of multilocus sequence typing (MLST) efforts in Wolbachia. FEMS Microbiol. Ecol. 94, 1. https://doi.org/10.1093/femsec/fix163 (2018).
Article CAS Google Scholar
Wang, X. et al. Phylogenomic analysis of Wolbachia strains reveals patterns of genome evolution and recombination. Genom. Biol. Evol. 12, 2508–2520. https://doi.org/10.1093/gbe/evaa219 (2020).
Article CAS Google Scholar
Pérez-Losada, M., Cabezas, P., Castro-Nallar, E. & Crandall, K. A. Pathogen typing in the genomics era: MLST and the future of molecular epidemiology. Infect. Genet. Evol. 16, 38–53. https://doi.org/10.1016/j.meegid.2013.01.009 (2013).
Article CAS PubMed Google Scholar
Glowska, E., Dragun-Damian, A., Dabert, M. & Gerth, M. New Wolbachia supergroups detected in quill mites (Acari: Syringophilidae Infect. Genet. Evol. 30, 140–146 (2015).
Article Google Scholar
Chung, M., Munro, J. B., Tettelin, H. & Dunning Hotopp, J. C. Using Core Genome Alignments To Assign Bacterial Species. mSystems 3, e00236–00218. https://doi.org/10.1128/mSystems.00236-18 (2018).
Vasconcelos, E. J. et al. Assessing cat flea microbiomes in Northern and Southern California by 16S rRNA next-generation sequencing. Vector Borne Zoonot. Dis. 18, 491–499 (2018).
Article Google Scholar
Campbell, J. H. et al. UGA is an additional glycine codon in uncultured SR1 bacteria from the human microbiota. Proc. Natl. Acad. Sci. 110, 5540–5545 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Nguyen, L.-T., Schmidt, H. A., Von Haeseler, A. & Minh, B. Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Article CAS PubMed Google Scholar
Lefoulon, E. et al. Large Enriched Fragment Targeted Sequencing (LEFT-SEQ) Applied to Capture of Wolbachia Genomes. Sci. Rep. 9, 5939 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Geniez, S. et al. Targeted genome enrichment for efficient purification of endosymbiont DNA from host DNA. Symbiosis 58, 201-207. https://doi.org/10.1007/s13199-012-0215-x (2012).
Article CAS PubMed Google Scholar
Dunning Hotopp, J. C., Slatko, B. E. & Foster, J. M. Targeted enrichment and sequencing of recent endosymbiont-host lateral gene transfers. Sci. Rep. 7, 857-857. https://doi.org/10.1038/s41598-017-00814-4 (2017).
Article ADS PubMed PubMed Central Google Scholar
Basting, P. J. & Bergman, C. M. Complete genome assemblies for three variants of the Wolbachia endosymbiont of Drosophila melanogaster. Microbiol. Resour. Announc. 8, 1 (2019).
Article Google Scholar
Wang, X. et al. Genome assembly of the A-Group Wolbachia in Nasonia oneida using linked-reads technology. Genom. Biol. Evol. 11, 3008–3013. https://doi.org/10.1093/gbe/evz223 (2019).
Article CAS Google Scholar
Rasgon, J. L., Ren, X. & Petridis, M. Can anopheles gambiae Be infected with Wolbachia pipientis? Insights from an in vitro system. Appl. Environ. Microbiol. 72, 7718. https://doi.org/10.1128/AEM.01578-06 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar
Dobson, S. L., Marsland, E. J., Veneti, Z., Bourtzis, K. & O’Neill, S. L. Characterization of Wolbachia host cell range via the in vitro establishment of infections. Appl. Environ. Microbiol. 68, 656–660 (2002).
Article ADS CAS PubMed PubMed Central Google Scholar
Baum, B. & Cherbas, L. Drosophila cell lines as model systems and as an experimental tool, pp. 391–424 in Drosophila. 391–424 (Springer, 2008).
Rasgon, J. L., Gamston, C. E. & Ren, X. Survival of Wolbachia pipientis in cell-free medium. Appl. Environ. Microbiol. 72, 6934–6937 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar
Koren, S. et al. Canu: Scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27, 722–736 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hunt, M. et al. Circlator: automated circularization of genome assemblies using long sequencing reads. Genome Biol. 16, 294 (2015).
Article PubMed PubMed Central Google Scholar
Watson, M. & Warr, A. Errors in long-read assemblies can critically affect protein prediction. Nat. Biotechnol. 37, 124–126. https://doi.org/10.1038/s41587-018-0004-z (2019).
Article CAS PubMed Google Scholar
Walker, B. J. et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PloS One 9 (2014).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Article CAS PubMed PubMed Central Google Scholar
Andrews, S. FastQC: a quality control tool for high throughput sequence data. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc (2010).
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26, 589–595 (2010).
Article PubMed PubMed Central Google Scholar
Tatusova, T. et al. NCBI prokaryotic genome annotation pipeline. Nucl. Acids. Res. 44, 6614–6624 (2016).
Article CAS PubMed PubMed Central Google Scholar
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Article PubMed Google Scholar
Parks, D. H., Imelfort, M., Skennerton, C. T., Hugenholtz, P. & Tyson, G. W. CheckM: Assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 25, 1043–1055 (2015).
Article CAS PubMed PubMed Central Google Scholar
Huerta-Cepas, J. et al. Fast genome-wide functional annotation through orthology assignment by eggNOG-mapper. Mol. Biol. Evol. 34, 2115–2122 (2017).
Article CAS PubMed PubMed Central Google Scholar
Huerta-Cepas, J. et al. eggNOG 4.5: a hierarchical orthology framework with improved functional annotations for eukaryotic, prokaryotic and viral sequences. Nucleic Acids Res. 44, D286-D293 (2015).
Karp, P. D. et al. Pathway Tools version 13.0: integrated software for pathway/genome informatics and systems biology. Brief. Bioinform. 11, 40–79 (2010).
Article CAS PubMed Google Scholar
Vallenet, D. et al. MicroScope in 2017: an expanding and evolving integrated resource for community expertise of microbial genomes. Nucl. Acids Res. 45, D517–D528. https://doi.org/10.1093/nar/gkw1101 (2017).
Article CAS PubMed Google Scholar
Grant, J. R. & Stothard, P. The CGView Server: A comparative genomics tool for circular genomes. Nucl. Acids Res. 36, W181–W184. https://doi.org/10.1093/nar/gkn179 (2008).
Article CAS PubMed PubMed Central Google Scholar
Miele, V., Penel, S. & Duret, L. Ultra-fast sequence clustering from similarity networks with SiLiX. BMC Bioinformatic.s 12, 116–116. https://doi.org/10.1186/1471-2105-12-116 (2011).
Krumsiek, J., Arnold, R. & Rattei, T. Gepard: A rapid and sensitive tool for creating dotplots on genome scale. Bioinformatics 23, 1026–1028. https://doi.org/10.1093/bioinformatics/btm039 (2007).
Article CAS PubMed Google Scholar
Finn, R. D. et al. The Pfam protein families database: Towards a more sustainable future. Nucleic Acids Res. 44, D279–D285 (2015).
Article PubMed PubMed Central Google Scholar
Arndt, D. et al. PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res. 44, W16–W21 (2016).
Article CAS PubMed PubMed Central Google Scholar
Varani, A. M., Siguier, P., Gourbeyre, E., Charneau, V. & Chandler, M. ISsaga is an ensemble of web-based methods for high throughput identification and semi-automatic annotation of insertion sequences in prokaryotic genomes. Genome Biol. 12, R30 (2011).
Article CAS PubMed PubMed Central Google Scholar
Sanogo, Y. O. & Dobson, S. L. WO bacteriophage transcription in Wolbachia-infected Culex pipiens. Insect Biochem. Mol. Biol. 36, 80–85 (2006).
Article CAS PubMed Google Scholar
Bordenstein, S. R., Marshall, M. L., Fry, A. J., Kim, U. & Wernegreen, J. J. The Tripartite Associations between Bacteriophage, Wolbachia, and Arthropods. PLOS Pathog. 2, e43. https://doi.org/10.1371/journal.ppat.0020043 (2006).
Article PubMed PubMed Central Google Scholar
Sinkins, S. P. et al. Wolbachia variability and host effects on crossing type in Culex mosquitoes. Nat. Biotechnol. 436, 257–260 (2005).
CAS Google Scholar
Klasson, L. et al. Genome evolution of Wolbachia strain wPip from the Culex pipiens group. Mol. Evol. Evol. 25, 1877–1887 (2008).
Article CAS Google Scholar
Gerth, M. & Bleidorn, C. Comparative genomics provides a timeframe for Wolbachia evolution and exposes a recent biotin synthesis operon transfer. Nat. Microb. 2, 1–7 (2016).
Google Scholar
Foster, J. et al. The Wolbachia genome of Brugia malayi: endosymbiont evolution within a human pathogenic nematode. PLoS Biol. 3 (2005).
Pichon, S. Type IV secretion system and ankyrin domain-containing proteins in Wolbachia-arthropods interactions, Université de Poitiers (2009).
Metcalf, J. A., Jo, M., Bordenstein, S. R., Jaenike, J. & Bordenstein, S. R. Recent genome reduction of Wolbachia in Drosophila recens targets phage WO and narrows candidates for reproductive parasitism. PeerJ 2, e529 (2014).
Kim, D., Song, L., Breitwieser, F. P. & Salzberg, S. L. Centrifuge: Rapid and sensitive classification of metagenomic sequences. Genome Res. 26, 1721–1729 (2016).
Article CAS PubMed PubMed Central Google Scholar
Emms, D. M. & Kelly, S. OrthoFinder: Solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol. 16, 157. https://doi.org/10.1186/s13059-015-0721-2 (2015).
Article CAS PubMed PubMed Central Google Scholar
Khan, A. & Mathelier, A. Intervene: A tool for intersection and visualization of multiple gene or genomic region sets. BMC Bioinformatics 18, 287. https://doi.org/10.1186/s12859-017-1708-7 (2017).
Article CAS PubMed PubMed Central Google Scholar
Edgar, R. C. MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucl. Acids Res. 32, 1792–1797 (2004).
Article CAS PubMed PubMed Central Google Scholar
Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K. F., Von Haeseler, A. & Jermiin, L. S. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat. Methods. 14, 587–589 (2017).
Article CAS PubMed PubMed Central Google Scholar
Zhang, D. et al. PhyloSuite: An integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies. Mol. Ecol. Res. 20, 348–355 (2020).
Article Google Scholar
Letunic, I. & Bork, P. Interactive tree of life (iTOL) v3: An online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res. 44, W242–W245 (2016).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors acknowledge Albert Mangual for maintaining cell cultures and assisting with wDi isolation. We thank Paul Carr for maintaining D. citri cultures from which wDi cells were isolated. The authors acknowledge the team at University of Florida Interdisciplinary Center for Biotechnology Research (UF-ICBR) NextGen DNA Sequencing Center, in particular David Moraga, Scientific Director, for his inputs. The University of Florida Research Computing Center provided computational resources and support that have contributed to the research results reported in this publication. Funding for this project was provided to K.S.P.-S by the United States Defense Advanced Research Projects Agency, United States (DARPA) (award D19AP00013).

Author information

These authors contributed equally: Surendra Neupane and Sylvia I. Bonilla

Authors and Affiliations

Entomology and Nematology Department, Citrus Research and Education Center/IFAS, University of Florida, Lake Alfred, Florida, 33850, USA
Surendra Neupane, Sylvia I. Bonilla, Andrew M. Manalo & Kirsten S. Pelz-Stelinski

Authors

Surendra Neupane
View author publications
You can also search for this author in PubMed Google Scholar
Sylvia I. Bonilla
View author publications
You can also search for this author in PubMed Google Scholar
Andrew M. Manalo
View author publications
You can also search for this author in PubMed Google Scholar
Kirsten S. Pelz-Stelinski
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.N., S.I.B., and K.S.P. designed the study and wrote the main manuscript text. S.N and S.I.B. analyzed data and prepared Figs. 1, 2, 3, 4, 5, 6, and 7. A.M.M. and S.N. collected the sequencing data. All authors reviewed the manuscript.

Corresponding author

Correspondence to Kirsten S. Pelz-Stelinski.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Supplementary Information 3.

Supplementary Information 4.

Supplementary Information 5.

Supplementary Information 6.

Supplementary Information 7.

Supplementary Information 8.

Supplementary Information 9.

Supplementary Information 10.

Supplementary Information 11.

Supplementary Information 12.

Supplementary Information 13.

Supplementary Information 14.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Neupane, S., Bonilla, S.I., Manalo, A.M. et al. Complete de novo assembly of Wolbachia endosymbiont of Diaphorina citri Kuwayama (Hemiptera: Liviidae) using long-read genome sequencing. Sci Rep 12, 125 (2022). https://doi.org/10.1038/s41598-021-03184-0

Download citation

Received: 21 June 2021
Accepted: 26 November 2021
Published: 07 January 2022
DOI: https://doi.org/10.1038/s41598-021-03184-0

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Result and discussion

wDi genome assembly

Genome annotations and assessments

Insertion sequences (ISs), prophage genes, ORF7 and Ankyrin proteins

COG, eggNOG, and pfam annotations

Toxin-antitoxin system and Type IV Secretion SSystem (T4SS) genes

Comparative genomics of wDi with reference Wolbachia genomes

Phylogenetics of wDi and other Wolbachia genomes

Conclusions

Materials and methods

Extraction of Wolbachia from D. citri (wDi)

Infection of wDi in S2 cells and isolation of wDi from cell culture

wDi Genomic DNA (gDNA) extraction

Long-read (PacBio) sequencing

Short-read (Illumina) sequencing

De novo genome assembly

Genome correction

Genome annotations and assessments

Comparative genomics of wDi genome with other Wolbachia genomes

Wolbachia metapangenome, ANI identity, and orthogroup analyses

Phylogenetic analysis

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links