Dual-Seq reveals genome and transcriptome of Caedibacter taeniospiralis, obligate endosymbiont of Paramecium

Pirritano, Marcello; Zaburannyi, Nestor; Grosser, Katrin; Gasparoni, Gilles; Müller, Rolf; Simon, Martin; Schrallhammer, Martina

doi:10.1038/s41598-020-65894-1

Download PDF

Article
Open access
Published: 16 June 2020

Dual-Seq reveals genome and transcriptome of Caedibacter taeniospiralis, obligate endosymbiont of Paramecium

Marcello Pirritano^1,2,
Nestor Zaburannyi³,
Katrin Grosser⁴^nAff6,
Gilles Gasparoni⁵,
Rolf Müller³,
Martin Simon^1,2 &
…
Martina Schrallhammer⁴

Scientific Reports volume 10, Article number: 9727 (2020) Cite this article

2100 Accesses
7 Citations
6 Altmetric
Metrics details

Subjects

Abstract

Interest in host-symbiont interactions is continuously increasing, not only due to the growing recognition of the importance of microbiomes. Starting with the detection and description of novel symbionts, attention moves to the molecular consequences and innovations of symbioses. However, molecular analysis requires genomic data which is difficult to obtain from obligate intracellular and uncultivated bacteria. We report the identification of the Caedibacter genome, an obligate symbiont of the ciliate Paramecium. The infection does not only confer the host with the ability to kill other cells but also renders them immune against this effect. We obtained the C. taeniospiralis genome and transcriptome by dual-Seq of DNA and RNA from infected paramecia. Comparison of codon usage and expression level indicates that genes necessary for a specific trait of this symbiosis, i.e. the delivery of an unknown toxin, result from horizontal gene transfer hinting to the relevance of DNA transfer for acquiring new characters. Prediction of secreted proteins of Caedibacter as major agents of contact with the host implies, next to several toxin candidates, a rather uncharacterized secretome which appears to be highly adapted to this symbiosis. Our data provides new insights into the molecular establishment and evolution of this obligate symbiosis and for the pathway characterization of toxicity and immunity.

Unveiling microbial diversity: harnessing long-read sequencing technology

Article 30 April 2024

Single-cell analysis reveals context-dependent, cell-level selection of mtDNA

Article Open access 24 April 2024

CoCas9 is a compact nuclease from the human microbiome for efficient and precise genome editing

Article Open access 24 April 2024

Introduction

Symbionts can have severe impact on host nutrition, metabolism, reproduction, immune and stress responses, and even behavior. Our knowledge about their biology is mostly limited to pathogenic representatives of medical or economic concern. Another bias is the focus on the minority which can be cultivated in artificial media, most probably due to the technical difficulties when facing uncultivated prokaryotes. But these symbionts represent only a small proportion of the natural diversity. A fundamental resource for understanding the biology of uncultivated intracellular symbionts is their genome sequence. It can be used to infer key aspects such as metabolic properties, phylogeny and evolution as well as additional traits crucial for the symbiosis. Next generation sequencing (NGS) techniques now allow for obtaining genome sequences of uncultivated symbionts of less well-studied host organisms. Still, the procedure entails multiple challenges such as low DNA quality and quantities of mixed samples with unfavorable abundances of target DNA, unavailability of reference genomes, etc. Choosing the ciliate Paramecium tetraurelia carrying a cytoplasmic infection with Caedibacter taeniospiralis (Gammaproteobacteria), we provide a roadmap for sequencing the genome and transcriptome of obligate intracellular bacteria.

We focus on this system as Paramecium can host diverse bacterial endosymbionts and only few genomes are available so far (i.e. fourHolospora draft genomes^1,2 and the only ones completed, “Candidatus Fokinia solitaria”³ and “Candidatus Deianiraea vastatrix”⁴). All belong to the sister families Rickettsiales and Holosporales, thus C. taeniospiralis represents a member of another clade of Proteobacteria. So far, it is the only described gammaproteobacterial symbiont of Paramecium. It can serve as a phylogenetic distant example for genome adaptation during the transition from a free-living to an intracellular lifestyle as well as co-evolution with the same host and ecological restraints as the alphaproteobacterial symbionts. The interaction between Paramecium tetraurelia and Caedibacter is especially intriguing as it is linked to the killer trait^5,6 which is the ability to eliminate symbiont-free paramecia⁷ gained from symbiosis with Caedibacter or Caedimonas⁸ bacteria. This trait is expressed by the bacterial symbionts. It comprises three components: the R-body, which constitutes an unusual protein delivery machine, an unidentified toxin, and an unknown resistance mechanism. If symbiont-free paramecia ingest released Caedibacter, the R-body is triggered to unroll and therewith delivers the toxin into the Paramecium cytoplasm. R-bodies themselves are not toxic and once the symbiont is eliminated the paramecia loose the resistance^9,10. In addition, this symbiosis can result in increased host cell densities without the provision of nutritional supplements¹⁰. The mechanisms how the symbionts cause these host phenotype modifications are not yet fully understood. With the C. taeniospiralis genome annotation we provide here a RNA-Seq corrected annotation of the former draft genome sequence¹¹ which was not sufficient for a solid annotation of genes and operons. Furthermore, we close several gaps by using additionally long reads from Oxford Nanopore Technologies (ONT). As a result, we are able to present a roadmap for assessing genomes of uncultivated symbionts, the first genome of a Paramecium symbiont outside Alphaproteobacteria, and a resource to address the question how this symbiont drastically changes the properties of its host.

Results and Discussion

Dual-Seq reveals the Caedibacter genome sequence and facilitates refined phylogenetic placement

As Caedibacter cannot be cultivated outside of its host cell, we used total DNA isolated from food bacteria depleted paramecia cultures for library preparation using Tagmentation¹¹. After assembly, we separated Caedibacter sequences from host chromosomes by GC percentage and coverage (Supplementary Fig. S1). The resulting assembly was further improved by ONT sequencing of high molecular weight DNA which allowed to close several gaps (Table 1). The resulting assembly consists of 17 scaffolds and one circular plasmid with a total size of 1.32 Mbp. Although we were not able to close the chromosome, this genome currently represents the best assembled genome (Table 2) inside a group of related bacteria (Fastidiosibacteriaceae¹²), now allowing for a more detailed phylogenetic characterization. We performed phylogenetic reconstructions (Fig. 1) based on 16S rRNA gene sequences and a concatenated alignment of 19 conserved protein-coding genes as well as genome-by-genome comparisons using average nucleotide identity (ANI) and digital DNA-DNA hybridization (Supplementary Table S1). The different analyses are in good agreement and place C. taeniospiralis within the little characterized family Fastidiosibacteriaceae, sister family to the facultative intracellular Francisellaceae which cause zoonotic diseases in fish and mammals. Notably, Caedibacter is the only organism within Fastidiosibacteriaceae with an obligate intracellular lifestyle, all other members were isolated from marine or freshwater samples and grow on artificial media. The closest relatives of C. taeniospiralis are Cysteiniphilum litorale¹³ and Cysteiniphilum halobium¹⁴.

Table 1 Properties of the assembled Caedibacter taeniospiralis genome prior to (first draft) and after annotation improvement by RNA-Seq and Oxford Nanopore sequencing (second draft).

Full size table

Table 2 Comparison of Fastidiosibacteraceae with available genome sequences.

Full size table

RNA-Seq corrected gene annotation

Gene annotation was carried out by analysing the genome assembly which revealed 1080 protein coding gene candidates, 36 tRNAs and 3 rDNA cluster¹¹ (Table 1). Dual RNA-Seq was carried out with the aim to improve this annotation and to verify transcriptional units and operons. Thus, prokaryotic mRNA was enriched for by food depletion using antibiotics as described previously¹¹ as well as subsequent double depletion of host and symbiont rRNA and host poly(A)RNA (Fig. 2A). We created directional RNA libraries to dissect which strand of the genome is transcribed. The Rockhopper tool¹⁵ was applied to define transcriptional units and operons based on the obtained RNA-Seq information. Accordingly, we improved the annotation by correcting individual ORFs and increased the number of protein coding genes to 1.091 and predicted 202 operons. Furthermore, we distinguished gene annotations of automatically annotated genes and human curated ones (see Fig. 2B,C). The latter involved manual inspection of the predicted protein sequences via database comparison (NCBI nucleotide collection respectively non-redundant protein sequences) as well as de novo annotation of previously not predicted protein sequences based on mRNA signals. Figure 2B,C show examples for annotation improvements in the Integrated Genomics Viewer (IGV) browser⁴⁸.

In addition to gene content, also genome synteny and operon composition now enable the comparison of the symbiont genome to free-living species and thus provide insights into the evolutionary adaption of the killer trait symbiosis. Figure 3A shows the comparison of the C. taeniospiralis str-operon juxtaposed to E. coli¹⁶ indicating the exchange of the elongation factor EF-G (tufA) against the ribosomal protein S10 (rpsJ) in Caedibacter. The replacement of the elongation factor which is encoded on a different contig (CDBSP s08) may have led to a distorted expression level as consequence of the destroyed co-transcription. When comparing the respective operon structure in the sequenced Fastidiosibacteriaceae and Caedibacter, an interesting evolution of its synteny can be observed (Fig. 3B). The str-operon composition of Caedibacter is a general feature of the Fastidiosibacteriaceae. Thus, this composition and obvious difference to E. coli has clearly not evolved as result of the symbiotic adaption of Caedibacter. Considering the organization of E. coli as ancestral, the shift towards the replacement of tufA with rpsJ, and general reorganization as present in Caedibacter potentially correlates with an increasing fastidiousness^12,13 of the bacteria regarding their cultivation or growth conditions. The rearrangement of the str-operon might be interpreted as one example for a decrease in regulatory fine-tuning in Fastidiosibacteriaceae which results in the strict dependence on the rather constant environment of a host organism such as Paramecium cytoplasm in case of Caedibacter. For this symbiont it was shown that even small changes of the cultivation temperature of the “poikilothermic” ciliates can have drastic effects to the symbiont density¹⁷.

Horizontal gene transfer as driver of intracellularity

While the majority of Paramecium endosymbionts belongs to a fast evolving branch of obligate intracellular Alphaproteobacteria, Caedibacter’s closest relatives are free-living. Typical hallmarks for the transition from free-living to intracellular life such as reduced genome size, gene loss, and fewer rRNA genes¹⁸ are evident in the genome of Caedibacter (Table 2). Interestingly, the typical positive correlation between a reduction in genome size and GC-content^18,19 is not detectable for Fastidiosibacteriaceae, as it is highest for the smallest genome (Table 2). While the importance of horizontal gene transfer (HGT) for rapid adaption and evolution of prokaryotes is broadly accepted, it has been suggested that obligate endosymbiotic bacteria might be protected from mobile genetic elements and DNA exchange due to their intracellular lifestyle. But as several studies demonstrate, e.g.^20,21,22, evidence for different kinds of HGT can be detected in endosymbionts. Thus, we searched the genome of Caedibacter for horizontally acquired genes involved in its symbiotic interaction with Paramecium. A fascinating question remains unanswered: Were these genes acquired prior to the transition to an intracellular lifestyle and actually enabled the free-living ancestor of Caedibacter to become an endosymbiont? Or did the HGT occur when Caedibacter already lived inside Paramecium and hence stabilized the symbiosis by e.g. the acquisition of the killer trait? Double infections with different symbionts have been reported for Paramecium, so this remains an intriguing hypothesis^8,23,24.

It has been speculated that central elements of the here described symbiosis, i.e. the Reb genes encoding for the R-body, have been acquired via HGT^6,8,25. The R-body protein delivery machinery is produced by the obligate Paramecium endosymbionts Caedibacter, belonging to Gammaproteobacteria and Caedimonas, member of Alphaproteobacteria⁸. In the latter, phage-like particles are often found associated to the R-body structure by transmission electron microscopy⁵. Thus, we searched the genome of Caedibacter for the presence of mobile genetic elements and other indications for HGT, which might also assist in the search for and identification of the toxin and resistance mechanism of the killer trait.

We confirmed the presence of a circular plasmid (pKAP51, 41.65 kb) that carries the Reb operon encoding for the R-body. This plasmid shares high similarities to plasmid pKAP298 (49.11 kb) of another strain of C. taeniospiralis, strain 298²⁶. It encodes hypothetical proteins, transposases and phage-derived genes. The main difference to plasmid pKAP298 is the excision of transposon Tn5403 (7.78 kb) which is lacking from pKAP51. As phages have been implicated to play a role in the killer trait either by structural evidence^5,6 or indirect experimental proof such as increase R-body production after UV-irradiation or induction with mitomycin C²⁷, we searched for phage genomes on the bacterial chromosome. We applied PHASTER (PHAge Search Tool Enhanced Release²⁸), to identify prophage sequences within the Caedibacter genome and detected one incomplete prophage of ca. 12 kb (Fig. 4). Noteworthy, this potential prophage encodes a component of a toxin-antitoxin system, HicB. Interestingly, C. taeniospiralis also possesses phage defense mechanisms as we identified a CRISPR locus headed by a AT-rich leader sequence which serves as promoter for the pre-crRNA synthesis followed by three identical repeats and 2 unique sequences, the so-called spacers (Supplementary Fig. S2). Together with a set of 7 CAS (CRISPR-associated) Type IC proteins, they constitute a CRISPR immune system. CRISPR are specific structures found in the majority of archaeal and many bacterial genomes that show characteristics of both tandem and interspaced repeats.

As mentioned before, an unknown toxin, which is delivered by the R-body, kills uninfected paramecia and hence is central to the symbiosis between Caedibacter and Paramecium. Additionally, the symbiosis provides immunity against this toxin raising the question for the underlying mechanism. This toxin shows a surprising high specificity for members of the genus Paramecium⁷, possibly correlating with the host specificity of the toxin producing bacteria⁹. Our genome annotation enables reverse genetics for the unknown toxin to understand the specificity to individual cells and furthermore the identification of the conditional resistance mechanism, as the immunity towards the killer trait is lost when the symbionts are eliminated from the killer paramecia. We started this analysis by plotting the codon adaptation index (CAI) of protein coding genes against gene expression. Figure 5A shows that there are indeed discrete outliers regarding the CAI, especially the highlighted Reb genes which build the R-body, a large protein structure which is crucial to deliver the toxin into the target cell’s cytoplasm. Thus, the outlier position of RebA, RebB, and RebD genes suggests a more recent HGT of these genes than the acquisition of the complete plasmid pKAP51 by Caedibacter, which is maybe even more evident when looking at the overall profile of plasmid- versus chromosomal encoded genes (Fig. 5C). The plasmid carries several transposons and transposases as well as phage-derived genes. It indeed has been speculated that it might derive from a phage genome²⁶. The finding that HGT contributed to the evolution of the killer trait in Paramecium enforces our understanding of genetic transfer between species to rapidly confer new characteristics as in the system aphid-Hamiltonella defensa-phage APSE-2²⁹. Another example involves the killer-effect in yeast strains which is based on the presence of two distinct mycoviruses which allow the secretion of toxic proteins to kill uninfected yeast cells as well as providing protective immunity³⁰. Thus, in all these systems viruses confer the ability to kill predators or competitors.

However, the Reb encoded R-body only delivers the toxin to sensitive cells. Searching for the toxin itself, candidates are the different toxin-antitoxin systems (Fig. 5B) present in the C. taeniospiralis genome and even as part of the incomplete prophage (Fig. 4) as they could provide both toxicity and non-permanent immunity. Taking advantage of heterologous expression of R-bodies in E. coli⁹, potential toxin candidates can now be easily screened by co-expression in the same E. coli vector. As feeding of R-body producing bacteria does not cause lethal effects in paramecia⁹, the addition of the killer trait toxin should cause cell death in sensitive strains.

The secretome is enriched in uncharacterized proteins

An alternative perspective on this endosymbiosis is the secretome analysis as all secreted proteins are in direct contact with the hosts cytoplasm and therefore the main candidates for intraspecific communication and adaptation. We performed in silico prediction of secreted proteins (Fig. 6A), many of them show high transcript levels (See Supplementary Table S2 for the full list of genes). From 95 proteins identified as secreted, 54 are hypothetical proteins and Fig. 6B indicates that the secretome shows clearly a lower annotation score meaning that reverse genetics can only describe potential functions for few secreted proteins. Among those is for example a component (IcmE) of a type IV secretion system. More parts (12 in total) of this large translocation machinery transporting proteins or protein/DNA complexes out of the symbionts’ cells are encoded on contigs CDBSP s01, s12, and s14. Type IV secretions systems are used by some pathogenic bacteria to inject virulence factors into their host cells³¹. However, the majority of secreted proteins are hypothetical, thus, their function remains unknown. This might simply be an indication for our limited knowledge of effector proteins in general, or instead due to a highly specialized secretome of this endosymbiont which likely constitutes not only effectors required for the alteration of the host’s transcriptome¹⁰ but also the antitoxin providing immunity against the killer trait. Thus, our data now allows to identify and to characterize these secreted proteins to gain deeper understanding into the molecular communication of the symbiosis and the identification of the killer trait toxin and resistance mechanism.

Methods

DNA isolation, library preparation and sequencing

Paramecium tetraurelia strain 51K (CCAP 1660/3F) infected with cytoplasmic C. taeniospiralis as verified by fluorescence in situ hybridisation with the species-specific probe Ctaen998⁸ was grown on beta-lactam hypersensitive E. coli Delta tolC³² to limit the contamination with food bacteria. Total DNA was isolated after ampicillin treatment¹¹ and used for Illumina and ONT sequencing. Tagmentation³³ was performed for Illumina library generation with different ratios of DNA/transposase (Supplementary Fig. S1) to optimize insert length in addition to two different polymerases (Q5, NEB and KAPA HiFi, Roche). After gel purification, Illumina MiSeq sequencing was carried out with a read-lenght of 2 × 300 nt. Reads were trimmed for adapters and low quality bases by the cutadapt (v1.4.1) wrapper trim galore (v0.3.3)³⁴. Additionally, MinION library preparation and sequencing was performed by Seq-IT (Kaiserslautern, Germany) using a 1D2 Sequencing Kit (LSK-308, ONT) and a MKI vR9 MinION flow cell (FLO-MIN107, ONT) to obtain longer DNA reads in order to improve the genome assembly. In total, 1.099 Gbp of raw ONT data with a mean read length of 11955 bp were obtained.

RNA isolation, library preparation and sequencing

Total RNA was isolated from cultures after antibiotics treatment using Tri-Reagent (Sigma). After additional DNAse digestion, RNA samples were depleted for eukaryotic and bacterial rRNAs by subsequent usage of the Yeast Ribo-Kit and the gram-negative bacterial Ribo-Zero Kits (Illumina). After additional depletion of poly(A) RNAs, directional RNA libraries were generated from the left-over RNA using the NEB ultra directional RNA library prep Kit (NEB). Libraries were sequenced on a Illumina HiSeq2500 Platform and trimmed as described above. RNA reads have been deposited under ENA accession number PRJEB36201.

Genome assembly and gene annotation

Genome assembly and gene annotation was carried out as described¹¹ using Illumina reads. Additionally, ONT reads were assembled using the Canu assembler³⁵ version 1.7 with default parameters. The resulting assembly was compared to and used for joining the contigs derived from the Illumina-only assembly in Geneious version 11.1.2³⁶. reducing the number of contigs from 24 to 18.

Gene annotation improvement and operon definition was carried out using Rockhopper v2.03¹⁵. To prevent inclusion of reads derived from the host, dual RNA-Seq reads were first mapped to the Paramecium genome using Bowtie2³⁷. Subsequently, mapping reads were subtracted. The remaining reads were then analyzed by Rockhopper with default settings. Genes that were annotated neither during the first annotation nor by the Rockhopper tool, but showed a distinct mRNA coverage signal, were annotated manually and verified via BLAST search of the corresponding nucleotide and protein sequences.

This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession PGGB00000000. The version described in this paper is version PGGB02000000. We also published the corresponding Geneious format including all annotations as well as the .fasta- and .gff-files of the genome at Zenodo (https://zenodo.org/; https://doi.org/10.5281/zenodo.3372731).

Transcripts per million (TPM) values of the Caedibacter genes were calculated by Rockhopper using the same RNA-Seq reads mentioned above.

Phylogenetic analyses and operon synteny

Phylogenetic distances were inferred based on either 16S rRNA gene sequence respectively a selection of 19 conserved bacterial single copy genes. The 16S rRNA gene sequence of C. taeniospiralis was aligned against its closest relatives with sequenced genomes (Supplementary Table S1). Phylogenetic trees were calculated using Bayesian Inference (BI; MrBayes 3.2.6³⁸; with a burn-in of 25% after 1,000,000 generations, and Maximum Likelihood (ML) with 1,000 bootstrap pseudoreplicates (PHYML 2.4.5³⁹). Furthermore, 19 conserved bacterial single copy genes were identified and extracted from each genome using AmphoraNet⁴⁰. Protein sequences (Supplementary Table S3) were concatenated and aligned (MUSCLE version 3.8.31⁴¹) comprising 6072 characters. BI phylogenetic analysis of AMPHORA concatenated multiprotein sequences were carried as described above. For pairwise genome comparisons, the average nucleotide identity was calculated using the BLAST+-based approach (ANIb) by JSpeciesWS version 3.1.2⁴². A heatmap was generated using CIMminer (http://discover.nci.nih.gov/cimminer). Digital DNA-DNA hybridization (dDDH) values were estimated with the Genome-to-Genome Distance Calculator (using GGDC 2.1 BLAST+⁴³) applying formula 2 which accounts for incomplete genome sequences. Genomes used for the in silico analyses are listed in Supplementary Table S1. Synteny analysis of the genetic context of the str-operon between the different indicated organisms was carried out using the Mauve genome alignment algorithm⁴⁴ running on default settings.

Genomic and post-genomic analyses

Prophage and CRISPR identification: Prophages were searched for using the PHASTER web server (from: http://phaster.ca/)²⁸. Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) were identified by CRISPRCasFinder (from: https://crisprcas.i2bc.pariss)⁴⁵.

CAI prediction/estimation: A codon usage table of Caedibacter was created by a web-based codon usage calculator⁴⁶. The codon adaptation index (CAI) of Caedibacter genes was calculated using the web-based CAI calculator from EMBOSS (http://www.bioinformatics.nl/cgibin/emboss/help/cai).

Secretome analysis: Prediction of potentially secreted proteins was performed by submitting a fasta-file containing the amino-acid sequence from all protein-coding genes of the Caedibacter genome to the SignalIP v5.0 webtool⁴⁷ using the default thresholds for signal peptide prediction.

References

Dohra, H., Tanaka, K., Suzuki, T., Fujishima, M. & Suzuki, H. Draft genome sequences of three Holospora species (Holospora obtusa, Holospora undulata, and Holospora elegans), endonuclear symbiotic bacteria of the ciliate Paramecium caudatum. FEMS Microbiology Letters 359, 16–18, https://doi.org/10.1111/1574-6968.12577, https://academic.oup.com/femsle/article-pdf/359/1/16/19129761/359-1-16.pdf (2014).
Garushyants, S. K. et al. Comparative genomic analysis of Holospora spp., intranuclear symbionts of paramecia. Frontiers in Microbiology 9, 738 (2018).
Article Google Scholar
Floriano, A. M. et al. The genome sequence of “Candidatus Fokinia solitaria”: insights on reductive evolution in Rickettsiales. Genome Biology and Evolution 10, 1120–1126 (2018).
Article CAS Google Scholar
Castelli, M. et al. Deianiraea, an extracellular bacterium associated with the ciliate Paramecium, suggests an alternative scenario for the evolution of Rickettsiales. The ISME Journal 13, 2280–2294 (2019).
Article Google Scholar
Schrallhammer, M. & Schweikert, M. The killer effect of Paramecium and its causative agents. In Endosymbionts in Paramecium, 227–246 (Springer, 2009).
Pond, F., Gibson, I., Lalucat, J. & Quackenbush, R. R-body-producing bacteria. Microbiology and Molecular Biology Reviews 53, 25–67 (1989).
CAS Google Scholar
Koehler, L., Flemming, F. E. & Schrallhammer, M. Towards an ecological understanding of the killer trait–a reproducible protocol for testing its impact on freshwater ciliates. European Journal of Protistology 68, 108–120 (2019).
Article Google Scholar
Schrallhammer, M., Castelli, M. & Petroni, G. Phylogenetic relationships among endosymbiotic R-body producer: Bacteria providing their host the killer trait. Systematic and Applied Microbiology 41, 213–220 (2018).
Article Google Scholar
Schrallhammer, M. et al. Tracing the role of R-bodies in the killer trait: absence of toxicity of R-body producing recombinant E. coli on paramecia. European Journal of Protistology 48, 290–296 (2012).
Article Google Scholar
Grosser, K. et al. More than the “killer trait”: infection with the bacterial endosymbiont Caedibacter taeniospiralis causes transcriptomic modulation in Paramecium host. Genome Biology and Evolution 10, 646–656 (2018).
Article Google Scholar
Zaburannyi, N. et al. Draft genome sequence and annotation of the obligate bacterial endosymbiont Caedibacter taeniospiralis, causative agent of the killer phenotype in Paramecium tetraurelia. Genome Announc. 6, e01418–17 (2018).
Article Google Scholar
Xiao, M. et al. Fastidiosibacter lacustris gen. nov., sp. nov., isolated from a lake water sample, and proposal of Fastidiosibacteraceae fam. nov. within the order Thiotrichales. International Journal of Systematic and Evolutionary Microbiology 68, 347–352 (2017).
Article Google Scholar
Liu, L. et al. Cysteiniphilum litorale gen. nov., sp. nov., isolated from coastal seawater. International Journal of Systematic and Evolutionary Microbiology 67, 2178–2183 (2017).
Article CAS Google Scholar
Xiao, M. et al. Facilibium subflavum gen. nov., sp. nov. and Cysteiniphilum halobium sp. nov., new members of the family Fastidiosibacteraceae isolated from coastal seawater. International Journal of Systematic and Evolutionary Microbiology 69, 3757–3764 (2019).
Article Google Scholar
Tjaden, B. A computational system for identifying operons based on RNA-Seq data. Methods (2019).
Post, L. E. & Nomura, M. DNA sequences from the str operon of Escherichia coli. Journal of Biological Chemistry 255, 4660–4666 (1980).
CAS PubMed Google Scholar
Dusi, E. et al. Vertically transmitted symbiont reduces host fitness along temperature gradient. Journal of Evolutionary Biology 27, 796–800 (2014).
Article CAS Google Scholar
Merhej, V., Royer-Carenzi, M., Pontarotti, P. & Raoult, D. Massive comparative genomic analysis reveals convergent evolution of specialized bacteria. Biology direct 4, 13 (2009).
Article Google Scholar
Almpanis, A., Swain, M., Gatherer, D. & McEwan, N. Correlation between bacterial G+C content, genome size and the G+C content of associated plasmids and bacteriophages. Microbial Genomics 4 (2018).
Blanc, G. et al. Lateral gene transfer between obligate intracellular bacteria: evidence from the Rickettsia massiliae genome. Genome Research 17, 1657–1664 (2007).
Article CAS Google Scholar
Ogata, H. et al. The genome sequence of Rickettsia felis identifies the first putative conjugative plasmid in an obligate intracellular parasite. PLOS Biology 3, https://doi.org/10.1371/journal.pbio.0030248 (2005).
Chafee, M. E., Funk, D. J., Harrison, R. G. & Bordenstein, S. R. Lateral phage transfer in obligate intracellular bacteria (Wolbachia): verification from natural populations. Molecular Biology and Evolution 27, 501–505 (2009).
Article Google Scholar
Szokoli, F. et al. Disentangling the taxonomy of Rickettsiales and description of two novel symbionts (“Candidatus Bealeia paramacronuclearis” and “Candidatus Fokinia cryptica”) sharing the cytoplasm of the ciliate protist Paramecium biaurelia. Appl. Environ. Microbiol. 82, 7236–7247 (2016).
Article CAS Google Scholar
Schrallhammer, M. et al. ‘Candidatus Megaira polyxenophila’ gen. nov., sp. nov.: Considerations on evolutionary history, host range and shift of early divergent rickettsiae. PLoS One 8 (2013).
Raymann, K., Bobay, L.-M., Doak, T. G., Lynch, M. & Gribaldo, S. A genomic survey of Reb homologs suggests widespread occurrence of R-bodies in proteobacteria. G3: Genes, Genomes, Genetics 3, 505–516 (2013).
Article CAS Google Scholar
Jeblick, J. & Kusch, J. Sequence, transcription activity, and evolutionary origin of the R-bodycoding plasmid pKAP298 from the intracellular parasitic bacterium Caedibacter taeniospiralis. Journal of Molecular Evolution 60, 164–173 (2005).
Article ADS CAS Google Scholar
Lalucat, J., Wells, B. & Gibson, I. Relationships between R bodies of certain bacteria. Micron and Microscopica Acta 17, 243–245 (1986).
Article Google Scholar
Arndt, D. et al. Phaster: a better, faster version of the phast phage search tool. Nucleic Acids Research 44, W16–W21 (2016).
Article CAS Google Scholar
Moran, N. A., Degnan, P. H., Santos, S. R., Dunbar, H. E. & Ochman, H. The players in a mutualistic symbiosis: insects, bacteria, viruses, and virulence genes. Proceedings of the National Academy of Sciences 102, 16919–16926 (2005).
Article ADS CAS Google Scholar
Schmitt, M. J. & Breinig, F. Yeast viral killer toxins: lethality and self-protection. Nature Reviews Microbiology 4, 212 (2006).
Article CAS Google Scholar
Sexton, J. A. & Vogel, J. P. Type IVb secretion by intracellular pathogens. Traffic 3, 178–185 (2002).
Article CAS Google Scholar
Lagkouvardos, I., Shen, J. & Horn, M. Improved axenization method reveals complexity of symbiotic associations between bacteria and acanthamoebae. Environmental Microbiology Reports 6, 383–388 (2014).
Article Google Scholar
Picelli, S. et al. Tn5 transposase and tagmentation procedures for massively scaled sequencing projects. Genome Research 24, 2033–2040 (2014).
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. Journal 17, 10–12 (2011).
Koren, S. et al. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Research 27, 722–736 (2017).
Article CAS Google Scholar
Kearse, M. et al. Geneious basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28, 1647–1649 (2012).
Article Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with bowtie 2. Nature Methods 9, 357 (2012).
Article CAS Google Scholar
Ronquist, F. et al. MrBayes 3.2: Efficient bayesian phylogenetic inference and model choice across a large model space. Systematic Biology 61, 539–542 (2012).
Article Google Scholar
Guindon, S. & Gascuel, O. A simple, fast, and accurate algorithm to estimate large phylogenies by Maximum Likelihood. Systematic Biology 52, 696–704 (2003).
Article Google Scholar
Kerepesi, C., Banky, D. & Grolmusz, V. AmphoraNet: the webserver implementation of the Amphora2 metagenomic workflow suite. Gene 533, 538–540 (2014).
Edgar, R. C. MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research 32, 1792–1797 (2004).
Richter, M., Rosselló-Móra, R., Oliver Glöckner, F. & Peplies, J. JSpeciesWS: A web server for prokaryotic species circumscription based on pairwise genome comparison. Bioinformatics 32, 929–931 (2015).
Article Google Scholar
Meier-Kolthoff, J. P., Auch, A. F., Klenk, H.-P. & Göker, M. Genome sequence-based species delimitation with confidence intervals and improved distance functions. BMC Bioinformatics 14, 60 (2013).
Article Google Scholar
Darling, A. C., Mau, B., Blattner, F. R. & Perna, N. T. MAUVE: Multiple alignment of conserved genomic sequence with rearrangements. Genome Research 14, 1394–1403 (2004).
Couvin, D. et al. CRISPRCasFinder, an update of CRISPRFinder, includes a portable version, enhanced performance and integrates search for cas proteins. Nucleic Acids Research 46, W246–W251 (2018).
Article CAS Google Scholar
Stothard, P. The sequence manipulation suite: Javascript programs for analyzing and formatting protein and DNA sequences. Biotechniques 28, 1102–1104 (2000).
Armenteros, J. J. A. et al. SignalIP 5.0 improves signal peptide predictions using deep neural networks. Nature Biotechnology 37, 420 (2019).
Article Google Scholar
Thorvaldsdottir, H., Robinson, J. T. & Mesirov, J. P. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Briefings in Bioinformatics 14 (2), 178–192 (2013).
Article CAS Google Scholar

Download references

Acknowledgements

Astrid Petersen is gratefully acknowledged for assistance with the DNA extraction. Marcello Pirritano received a scholarship by the Studienstiftung des deutschen Volkes.

Author information

Katrin Grosser
Present address: Deep Sequencing Unit, Max-Planck-Institute for Immunobiology and Epigenetics, Freiburg, Germany

Authors and Affiliations

Molecular Cell Biology and Microbiology, University of Wuppertal, Wuppertal, Germany
Marcello Pirritano & Martin Simon
Molecular Cell Dynamics Saarland University, Saarbrücken, Germany
Marcello Pirritano & Martin Simon
Department of Microbial Natural Products, Helmholtz Centre for Infection Research and Department of Pharmacy, Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Saarland University, Saarbrücken and German Centre for Infection Research (DZIF), Hannover, Germany
Nestor Zaburannyi & Rolf Müller
Microbiology, Institute of Biology II, Albert Ludwig University of Freiburg, Freiburg, Germany
Katrin Grosser & Martina Schrallhammer
Genetics, Centre for Human and Molecular Biology, Saarland University, Saarbruecken, Germany
Gilles Gasparoni

Authors

Marcello Pirritano
View author publications
You can also search for this author in PubMed Google Scholar
Nestor Zaburannyi
View author publications
You can also search for this author in PubMed Google Scholar
Katrin Grosser
View author publications
You can also search for this author in PubMed Google Scholar
Gilles Gasparoni
View author publications
You can also search for this author in PubMed Google Scholar
Rolf Müller
View author publications
You can also search for this author in PubMed Google Scholar
Martin Simon
View author publications
You can also search for this author in PubMed Google Scholar
Martina Schrallhammer
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.S. and M.Sch. conceived the study, M.S., M.Sch., M.P., K.G. conducted the experiment(s), M.P., N.Z., R.M., M.S., M.Sch., G.G. analysed the results. All authors reviewed the manuscript.

Corresponding authors

Correspondence to Martin Simon or Martina Schrallhammer.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Pirritano, M., Zaburannyi, N., Grosser, K. et al. Dual-Seq reveals genome and transcriptome of Caedibacter taeniospiralis, obligate endosymbiont of Paramecium. Sci Rep 10, 9727 (2020). https://doi.org/10.1038/s41598-020-65894-1

Download citation

Received: 16 September 2019
Accepted: 01 April 2020
Published: 16 June 2020
DOI: https://doi.org/10.1038/s41598-020-65894-1

This article is cited by

Symbionts of Ciliates and Ciliates as Symbionts
- Jyoti Dagar
- Swati Maurya
- Ravi Toteja
Indian Journal of Microbiology (2024)
Pseudomonas aeruginosa PA14 produces R-bodies, extendable protein polymers with roles in host colonization and virulence
- Bryan Wang
- Yu-Cheng Lin
- Lars E. P. Dietrich
Nature Communications (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.