Abstract
Interest in host-symbiont interactions is continuously increasing, not only due to the growing recognition of the importance of microbiomes. Starting with the detection and description of novel symbionts, attention moves to the molecular consequences and innovations of symbioses. However, molecular analysis requires genomic data which is difficult to obtain from obligate intracellular and uncultivated bacteria. We report the identification of the Caedibacter genome, an obligate symbiont of the ciliate Paramecium. The infection does not only confer the host with the ability to kill other cells but also renders them immune against this effect. We obtained the C. taeniospiralis genome and transcriptome by dual-Seq of DNA and RNA from infected paramecia. Comparison of codon usage and expression level indicates that genes necessary for a specific trait of this symbiosis, i.e. the delivery of an unknown toxin, result from horizontal gene transfer hinting to the relevance of DNA transfer for acquiring new characters. Prediction of secreted proteins of Caedibacter as major agents of contact with the host implies, next to several toxin candidates, a rather uncharacterized secretome which appears to be highly adapted to this symbiosis. Our data provides new insights into the molecular establishment and evolution of this obligate symbiosis and for the pathway characterization of toxicity and immunity.
Similar content being viewed by others
Introduction
Symbionts can have severe impact on host nutrition, metabolism, reproduction, immune and stress responses, and even behavior. Our knowledge about their biology is mostly limited to pathogenic representatives of medical or economic concern. Another bias is the focus on the minority which can be cultivated in artificial media, most probably due to the technical difficulties when facing uncultivated prokaryotes. But these symbionts represent only a small proportion of the natural diversity. A fundamental resource for understanding the biology of uncultivated intracellular symbionts is their genome sequence. It can be used to infer key aspects such as metabolic properties, phylogeny and evolution as well as additional traits crucial for the symbiosis. Next generation sequencing (NGS) techniques now allow for obtaining genome sequences of uncultivated symbionts of less well-studied host organisms. Still, the procedure entails multiple challenges such as low DNA quality and quantities of mixed samples with unfavorable abundances of target DNA, unavailability of reference genomes, etc. Choosing the ciliate Paramecium tetraurelia carrying a cytoplasmic infection with Caedibacter taeniospiralis (Gammaproteobacteria), we provide a roadmap for sequencing the genome and transcriptome of obligate intracellular bacteria.
We focus on this system as Paramecium can host diverse bacterial endosymbionts and only few genomes are available so far (i.e. fourHolospora draft genomes1,2 and the only ones completed, “Candidatus Fokinia solitaria”3 and “Candidatus Deianiraea vastatrix”4). All belong to the sister families Rickettsiales and Holosporales, thus C. taeniospiralis represents a member of another clade of Proteobacteria. So far, it is the only described gammaproteobacterial symbiont of Paramecium. It can serve as a phylogenetic distant example for genome adaptation during the transition from a free-living to an intracellular lifestyle as well as co-evolution with the same host and ecological restraints as the alphaproteobacterial symbionts. The interaction between Paramecium tetraurelia and Caedibacter is especially intriguing as it is linked to the killer trait5,6 which is the ability to eliminate symbiont-free paramecia7 gained from symbiosis with Caedibacter or Caedimonas8 bacteria. This trait is expressed by the bacterial symbionts. It comprises three components: the R-body, which constitutes an unusual protein delivery machine, an unidentified toxin, and an unknown resistance mechanism. If symbiont-free paramecia ingest released Caedibacter, the R-body is triggered to unroll and therewith delivers the toxin into the Paramecium cytoplasm. R-bodies themselves are not toxic and once the symbiont is eliminated the paramecia loose the resistance9,10. In addition, this symbiosis can result in increased host cell densities without the provision of nutritional supplements10. The mechanisms how the symbionts cause these host phenotype modifications are not yet fully understood. With the C. taeniospiralis genome annotation we provide here a RNA-Seq corrected annotation of the former draft genome sequence11 which was not sufficient for a solid annotation of genes and operons. Furthermore, we close several gaps by using additionally long reads from Oxford Nanopore Technologies (ONT). As a result, we are able to present a roadmap for assessing genomes of uncultivated symbionts, the first genome of a Paramecium symbiont outside Alphaproteobacteria, and a resource to address the question how this symbiont drastically changes the properties of its host.
Results and Discussion
Dual-Seq reveals the Caedibacter genome sequence and facilitates refined phylogenetic placement
As Caedibacter cannot be cultivated outside of its host cell, we used total DNA isolated from food bacteria depleted paramecia cultures for library preparation using Tagmentation11. After assembly, we separated Caedibacter sequences from host chromosomes by GC percentage and coverage (Supplementary Fig. S1). The resulting assembly was further improved by ONT sequencing of high molecular weight DNA which allowed to close several gaps (Table 1). The resulting assembly consists of 17 scaffolds and one circular plasmid with a total size of 1.32 Mbp. Although we were not able to close the chromosome, this genome currently represents the best assembled genome (Table 2) inside a group of related bacteria (Fastidiosibacteriaceae12), now allowing for a more detailed phylogenetic characterization. We performed phylogenetic reconstructions (Fig. 1) based on 16S rRNA gene sequences and a concatenated alignment of 19 conserved protein-coding genes as well as genome-by-genome comparisons using average nucleotide identity (ANI) and digital DNA-DNA hybridization (Supplementary Table S1). The different analyses are in good agreement and place C. taeniospiralis within the little characterized family Fastidiosibacteriaceae, sister family to the facultative intracellular Francisellaceae which cause zoonotic diseases in fish and mammals. Notably, Caedibacter is the only organism within Fastidiosibacteriaceae with an obligate intracellular lifestyle, all other members were isolated from marine or freshwater samples and grow on artificial media. The closest relatives of C. taeniospiralis are Cysteiniphilum litorale13 and Cysteiniphilum halobium14.
RNA-Seq corrected gene annotation
Gene annotation was carried out by analysing the genome assembly which revealed 1080 protein coding gene candidates, 36 tRNAs and 3 rDNA cluster11 (Table 1). Dual RNA-Seq was carried out with the aim to improve this annotation and to verify transcriptional units and operons. Thus, prokaryotic mRNA was enriched for by food depletion using antibiotics as described previously11 as well as subsequent double depletion of host and symbiont rRNA and host poly(A)RNA (Fig. 2A). We created directional RNA libraries to dissect which strand of the genome is transcribed. The Rockhopper tool15 was applied to define transcriptional units and operons based on the obtained RNA-Seq information. Accordingly, we improved the annotation by correcting individual ORFs and increased the number of protein coding genes to 1.091 and predicted 202 operons. Furthermore, we distinguished gene annotations of automatically annotated genes and human curated ones (see Fig. 2B,C). The latter involved manual inspection of the predicted protein sequences via database comparison (NCBI nucleotide collection respectively non-redundant protein sequences) as well as de novo annotation of previously not predicted protein sequences based on mRNA signals. Figure 2B,C show examples for annotation improvements in the Integrated Genomics Viewer (IGV) browser48.
In addition to gene content, also genome synteny and operon composition now enable the comparison of the symbiont genome to free-living species and thus provide insights into the evolutionary adaption of the killer trait symbiosis. Figure 3A shows the comparison of the C. taeniospiralis str-operon juxtaposed to E. coli16 indicating the exchange of the elongation factor EF-G (tufA) against the ribosomal protein S10 (rpsJ) in Caedibacter. The replacement of the elongation factor which is encoded on a different contig (CDBSP s08) may have led to a distorted expression level as consequence of the destroyed co-transcription. When comparing the respective operon structure in the sequenced Fastidiosibacteriaceae and Caedibacter, an interesting evolution of its synteny can be observed (Fig. 3B). The str-operon composition of Caedibacter is a general feature of the Fastidiosibacteriaceae. Thus, this composition and obvious difference to E. coli has clearly not evolved as result of the symbiotic adaption of Caedibacter. Considering the organization of E. coli as ancestral, the shift towards the replacement of tufA with rpsJ, and general reorganization as present in Caedibacter potentially correlates with an increasing fastidiousness12,13 of the bacteria regarding their cultivation or growth conditions. The rearrangement of the str-operon might be interpreted as one example for a decrease in regulatory fine-tuning in Fastidiosibacteriaceae which results in the strict dependence on the rather constant environment of a host organism such as Paramecium cytoplasm in case of Caedibacter. For this symbiont it was shown that even small changes of the cultivation temperature of the “poikilothermic” ciliates can have drastic effects to the symbiont density17.
Horizontal gene transfer as driver of intracellularity
While the majority of Paramecium endosymbionts belongs to a fast evolving branch of obligate intracellular Alphaproteobacteria, Caedibacter’s closest relatives are free-living. Typical hallmarks for the transition from free-living to intracellular life such as reduced genome size, gene loss, and fewer rRNA genes18 are evident in the genome of Caedibacter (Table 2). Interestingly, the typical positive correlation between a reduction in genome size and GC-content18,19 is not detectable for Fastidiosibacteriaceae, as it is highest for the smallest genome (Table 2). While the importance of horizontal gene transfer (HGT) for rapid adaption and evolution of prokaryotes is broadly accepted, it has been suggested that obligate endosymbiotic bacteria might be protected from mobile genetic elements and DNA exchange due to their intracellular lifestyle. But as several studies demonstrate, e.g.20,21,22, evidence for different kinds of HGT can be detected in endosymbionts. Thus, we searched the genome of Caedibacter for horizontally acquired genes involved in its symbiotic interaction with Paramecium. A fascinating question remains unanswered: Were these genes acquired prior to the transition to an intracellular lifestyle and actually enabled the free-living ancestor of Caedibacter to become an endosymbiont? Or did the HGT occur when Caedibacter already lived inside Paramecium and hence stabilized the symbiosis by e.g. the acquisition of the killer trait? Double infections with different symbionts have been reported for Paramecium, so this remains an intriguing hypothesis8,23,24.
It has been speculated that central elements of the here described symbiosis, i.e. the Reb genes encoding for the R-body, have been acquired via HGT6,8,25. The R-body protein delivery machinery is produced by the obligate Paramecium endosymbionts Caedibacter, belonging to Gammaproteobacteria and Caedimonas, member of Alphaproteobacteria8. In the latter, phage-like particles are often found associated to the R-body structure by transmission electron microscopy5. Thus, we searched the genome of Caedibacter for the presence of mobile genetic elements and other indications for HGT, which might also assist in the search for and identification of the toxin and resistance mechanism of the killer trait.
We confirmed the presence of a circular plasmid (pKAP51, 41.65 kb) that carries the Reb operon encoding for the R-body. This plasmid shares high similarities to plasmid pKAP298 (49.11 kb) of another strain of C. taeniospiralis, strain 29826. It encodes hypothetical proteins, transposases and phage-derived genes. The main difference to plasmid pKAP298 is the excision of transposon Tn5403 (7.78 kb) which is lacking from pKAP51. As phages have been implicated to play a role in the killer trait either by structural evidence5,6 or indirect experimental proof such as increase R-body production after UV-irradiation or induction with mitomycin C27, we searched for phage genomes on the bacterial chromosome. We applied PHASTER (PHAge Search Tool Enhanced Release28), to identify prophage sequences within the Caedibacter genome and detected one incomplete prophage of ca. 12 kb (Fig. 4). Noteworthy, this potential prophage encodes a component of a toxin-antitoxin system, HicB. Interestingly, C. taeniospiralis also possesses phage defense mechanisms as we identified a CRISPR locus headed by a AT-rich leader sequence which serves as promoter for the pre-crRNA synthesis followed by three identical repeats and 2 unique sequences, the so-called spacers (Supplementary Fig. S2). Together with a set of 7 CAS (CRISPR-associated) Type IC proteins, they constitute a CRISPR immune system. CRISPR are specific structures found in the majority of archaeal and many bacterial genomes that show characteristics of both tandem and interspaced repeats.
As mentioned before, an unknown toxin, which is delivered by the R-body, kills uninfected paramecia and hence is central to the symbiosis between Caedibacter and Paramecium. Additionally, the symbiosis provides immunity against this toxin raising the question for the underlying mechanism. This toxin shows a surprising high specificity for members of the genus Paramecium7, possibly correlating with the host specificity of the toxin producing bacteria9. Our genome annotation enables reverse genetics for the unknown toxin to understand the specificity to individual cells and furthermore the identification of the conditional resistance mechanism, as the immunity towards the killer trait is lost when the symbionts are eliminated from the killer paramecia. We started this analysis by plotting the codon adaptation index (CAI) of protein coding genes against gene expression. Figure 5A shows that there are indeed discrete outliers regarding the CAI, especially the highlighted Reb genes which build the R-body, a large protein structure which is crucial to deliver the toxin into the target cell’s cytoplasm. Thus, the outlier position of RebA, RebB, and RebD genes suggests a more recent HGT of these genes than the acquisition of the complete plasmid pKAP51 by Caedibacter, which is maybe even more evident when looking at the overall profile of plasmid- versus chromosomal encoded genes (Fig. 5C). The plasmid carries several transposons and transposases as well as phage-derived genes. It indeed has been speculated that it might derive from a phage genome26. The finding that HGT contributed to the evolution of the killer trait in Paramecium enforces our understanding of genetic transfer between species to rapidly confer new characteristics as in the system aphid-Hamiltonella defensa-phage APSE-229. Another example involves the killer-effect in yeast strains which is based on the presence of two distinct mycoviruses which allow the secretion of toxic proteins to kill uninfected yeast cells as well as providing protective immunity30. Thus, in all these systems viruses confer the ability to kill predators or competitors.
However, the Reb encoded R-body only delivers the toxin to sensitive cells. Searching for the toxin itself, candidates are the different toxin-antitoxin systems (Fig. 5B) present in the C. taeniospiralis genome and even as part of the incomplete prophage (Fig. 4) as they could provide both toxicity and non-permanent immunity. Taking advantage of heterologous expression of R-bodies in E. coli9, potential toxin candidates can now be easily screened by co-expression in the same E. coli vector. As feeding of R-body producing bacteria does not cause lethal effects in paramecia9, the addition of the killer trait toxin should cause cell death in sensitive strains.
The secretome is enriched in uncharacterized proteins
An alternative perspective on this endosymbiosis is the secretome analysis as all secreted proteins are in direct contact with the hosts cytoplasm and therefore the main candidates for intraspecific communication and adaptation. We performed in silico prediction of secreted proteins (Fig. 6A), many of them show high transcript levels (See Supplementary Table S2 for the full list of genes). From 95 proteins identified as secreted, 54 are hypothetical proteins and Fig. 6B indicates that the secretome shows clearly a lower annotation score meaning that reverse genetics can only describe potential functions for few secreted proteins. Among those is for example a component (IcmE) of a type IV secretion system. More parts (12 in total) of this large translocation machinery transporting proteins or protein/DNA complexes out of the symbionts’ cells are encoded on contigs CDBSP s01, s12, and s14. Type IV secretions systems are used by some pathogenic bacteria to inject virulence factors into their host cells31. However, the majority of secreted proteins are hypothetical, thus, their function remains unknown. This might simply be an indication for our limited knowledge of effector proteins in general, or instead due to a highly specialized secretome of this endosymbiont which likely constitutes not only effectors required for the alteration of the host’s transcriptome10 but also the antitoxin providing immunity against the killer trait. Thus, our data now allows to identify and to characterize these secreted proteins to gain deeper understanding into the molecular communication of the symbiosis and the identification of the killer trait toxin and resistance mechanism.
Methods
DNA isolation, library preparation and sequencing
Paramecium tetraurelia strain 51K (CCAP 1660/3F) infected with cytoplasmic C. taeniospiralis as verified by fluorescence in situ hybridisation with the species-specific probe Ctaen9988 was grown on beta-lactam hypersensitive E. coli Delta tolC32 to limit the contamination with food bacteria. Total DNA was isolated after ampicillin treatment11 and used for Illumina and ONT sequencing. Tagmentation33 was performed for Illumina library generation with different ratios of DNA/transposase (Supplementary Fig. S1) to optimize insert length in addition to two different polymerases (Q5, NEB and KAPA HiFi, Roche). After gel purification, Illumina MiSeq sequencing was carried out with a read-lenght of 2 × 300 nt. Reads were trimmed for adapters and low quality bases by the cutadapt (v1.4.1) wrapper trim galore (v0.3.3)34. Additionally, MinION library preparation and sequencing was performed by Seq-IT (Kaiserslautern, Germany) using a 1D2 Sequencing Kit (LSK-308, ONT) and a MKI vR9 MinION flow cell (FLO-MIN107, ONT) to obtain longer DNA reads in order to improve the genome assembly. In total, 1.099 Gbp of raw ONT data with a mean read length of 11955 bp were obtained.
RNA isolation, library preparation and sequencing
Total RNA was isolated from cultures after antibiotics treatment using Tri-Reagent (Sigma). After additional DNAse digestion, RNA samples were depleted for eukaryotic and bacterial rRNAs by subsequent usage of the Yeast Ribo-Kit and the gram-negative bacterial Ribo-Zero Kits (Illumina). After additional depletion of poly(A) RNAs, directional RNA libraries were generated from the left-over RNA using the NEB ultra directional RNA library prep Kit (NEB). Libraries were sequenced on a Illumina HiSeq2500 Platform and trimmed as described above. RNA reads have been deposited under ENA accession number PRJEB36201.
Genome assembly and gene annotation
Genome assembly and gene annotation was carried out as described11 using Illumina reads. Additionally, ONT reads were assembled using the Canu assembler35 version 1.7 with default parameters. The resulting assembly was compared to and used for joining the contigs derived from the Illumina-only assembly in Geneious version 11.1.236. reducing the number of contigs from 24 to 18.
Gene annotation improvement and operon definition was carried out using Rockhopper v2.0315. To prevent inclusion of reads derived from the host, dual RNA-Seq reads were first mapped to the Paramecium genome using Bowtie237. Subsequently, mapping reads were subtracted. The remaining reads were then analyzed by Rockhopper with default settings. Genes that were annotated neither during the first annotation nor by the Rockhopper tool, but showed a distinct mRNA coverage signal, were annotated manually and verified via BLAST search of the corresponding nucleotide and protein sequences.
This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession PGGB00000000. The version described in this paper is version PGGB02000000. We also published the corresponding Geneious format including all annotations as well as the .fasta- and .gff-files of the genome at Zenodo (https://zenodo.org/; https://doi.org/10.5281/zenodo.3372731).
Transcripts per million (TPM) values of the Caedibacter genes were calculated by Rockhopper using the same RNA-Seq reads mentioned above.
Phylogenetic analyses and operon synteny
Phylogenetic distances were inferred based on either 16S rRNA gene sequence respectively a selection of 19 conserved bacterial single copy genes. The 16S rRNA gene sequence of C. taeniospiralis was aligned against its closest relatives with sequenced genomes (Supplementary Table S1). Phylogenetic trees were calculated using Bayesian Inference (BI; MrBayes 3.2.638; with a burn-in of 25% after 1,000,000 generations, and Maximum Likelihood (ML) with 1,000 bootstrap pseudoreplicates (PHYML 2.4.539). Furthermore, 19 conserved bacterial single copy genes were identified and extracted from each genome using AmphoraNet40. Protein sequences (Supplementary Table S3) were concatenated and aligned (MUSCLE version 3.8.3141) comprising 6072 characters. BI phylogenetic analysis of AMPHORA concatenated multiprotein sequences were carried as described above. For pairwise genome comparisons, the average nucleotide identity was calculated using the BLAST+-based approach (ANIb) by JSpeciesWS version 3.1.242. A heatmap was generated using CIMminer (http://discover.nci.nih.gov/cimminer). Digital DNA-DNA hybridization (dDDH) values were estimated with the Genome-to-Genome Distance Calculator (using GGDC 2.1 BLAST+43) applying formula 2 which accounts for incomplete genome sequences. Genomes used for the in silico analyses are listed in Supplementary Table S1. Synteny analysis of the genetic context of the str-operon between the different indicated organisms was carried out using the Mauve genome alignment algorithm44 running on default settings.
Genomic and post-genomic analyses
Prophage and CRISPR identification: Prophages were searched for using the PHASTER web server (from: http://phaster.ca/)28. Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) were identified by CRISPRCasFinder (from: https://crisprcas.i2bc.pariss)45.
CAI prediction/estimation: A codon usage table of Caedibacter was created by a web-based codon usage calculator46. The codon adaptation index (CAI) of Caedibacter genes was calculated using the web-based CAI calculator from EMBOSS (http://www.bioinformatics.nl/cgibin/emboss/help/cai).
Secretome analysis: Prediction of potentially secreted proteins was performed by submitting a fasta-file containing the amino-acid sequence from all protein-coding genes of the Caedibacter genome to the SignalIP v5.0 webtool47 using the default thresholds for signal peptide prediction.
References
Dohra, H., Tanaka, K., Suzuki, T., Fujishima, M. & Suzuki, H. Draft genome sequences of three Holospora species (Holospora obtusa, Holospora undulata, and Holospora elegans), endonuclear symbiotic bacteria of the ciliate Paramecium caudatum. FEMS Microbiology Letters 359, 16–18, https://doi.org/10.1111/1574-6968.12577, https://academic.oup.com/femsle/article-pdf/359/1/16/19129761/359-1-16.pdf (2014).
Garushyants, S. K. et al. Comparative genomic analysis of Holospora spp., intranuclear symbionts of paramecia. Frontiers in Microbiology 9, 738 (2018).
Floriano, A. M. et al. The genome sequence of “Candidatus Fokinia solitaria”: insights on reductive evolution in Rickettsiales. Genome Biology and Evolution 10, 1120–1126 (2018).
Castelli, M. et al. Deianiraea, an extracellular bacterium associated with the ciliate Paramecium, suggests an alternative scenario for the evolution of Rickettsiales. The ISME Journal 13, 2280–2294 (2019).
Schrallhammer, M. & Schweikert, M. The killer effect of Paramecium and its causative agents. In Endosymbionts in Paramecium, 227–246 (Springer, 2009).
Pond, F., Gibson, I., Lalucat, J. & Quackenbush, R. R-body-producing bacteria. Microbiology and Molecular Biology Reviews 53, 25–67 (1989).
Koehler, L., Flemming, F. E. & Schrallhammer, M. Towards an ecological understanding of the killer trait–a reproducible protocol for testing its impact on freshwater ciliates. European Journal of Protistology 68, 108–120 (2019).
Schrallhammer, M., Castelli, M. & Petroni, G. Phylogenetic relationships among endosymbiotic R-body producer: Bacteria providing their host the killer trait. Systematic and Applied Microbiology 41, 213–220 (2018).
Schrallhammer, M. et al. Tracing the role of R-bodies in the killer trait: absence of toxicity of R-body producing recombinant E. coli on paramecia. European Journal of Protistology 48, 290–296 (2012).
Grosser, K. et al. More than the “killer trait”: infection with the bacterial endosymbiont Caedibacter taeniospiralis causes transcriptomic modulation in Paramecium host. Genome Biology and Evolution 10, 646–656 (2018).
Zaburannyi, N. et al. Draft genome sequence and annotation of the obligate bacterial endosymbiont Caedibacter taeniospiralis, causative agent of the killer phenotype in Paramecium tetraurelia. Genome Announc. 6, e01418–17 (2018).
Xiao, M. et al. Fastidiosibacter lacustris gen. nov., sp. nov., isolated from a lake water sample, and proposal of Fastidiosibacteraceae fam. nov. within the order Thiotrichales. International Journal of Systematic and Evolutionary Microbiology 68, 347–352 (2017).
Liu, L. et al. Cysteiniphilum litorale gen. nov., sp. nov., isolated from coastal seawater. International Journal of Systematic and Evolutionary Microbiology 67, 2178–2183 (2017).
Xiao, M. et al. Facilibium subflavum gen. nov., sp. nov. and Cysteiniphilum halobium sp. nov., new members of the family Fastidiosibacteraceae isolated from coastal seawater. International Journal of Systematic and Evolutionary Microbiology 69, 3757–3764 (2019).
Tjaden, B. A computational system for identifying operons based on RNA-Seq data. Methods (2019).
Post, L. E. & Nomura, M. DNA sequences from the str operon of Escherichia coli. Journal of Biological Chemistry 255, 4660–4666 (1980).
Dusi, E. et al. Vertically transmitted symbiont reduces host fitness along temperature gradient. Journal of Evolutionary Biology 27, 796–800 (2014).
Merhej, V., Royer-Carenzi, M., Pontarotti, P. & Raoult, D. Massive comparative genomic analysis reveals convergent evolution of specialized bacteria. Biology direct 4, 13 (2009).
Almpanis, A., Swain, M., Gatherer, D. & McEwan, N. Correlation between bacterial G+C content, genome size and the G+C content of associated plasmids and bacteriophages. Microbial Genomics 4 (2018).
Blanc, G. et al. Lateral gene transfer between obligate intracellular bacteria: evidence from the Rickettsia massiliae genome. Genome Research 17, 1657–1664 (2007).
Ogata, H. et al. The genome sequence of Rickettsia felis identifies the first putative conjugative plasmid in an obligate intracellular parasite. PLOS Biology 3, https://doi.org/10.1371/journal.pbio.0030248 (2005).
Chafee, M. E., Funk, D. J., Harrison, R. G. & Bordenstein, S. R. Lateral phage transfer in obligate intracellular bacteria (Wolbachia): verification from natural populations. Molecular Biology and Evolution 27, 501–505 (2009).
Szokoli, F. et al. Disentangling the taxonomy of Rickettsiales and description of two novel symbionts (“Candidatus Bealeia paramacronuclearis” and “Candidatus Fokinia cryptica”) sharing the cytoplasm of the ciliate protist Paramecium biaurelia. Appl. Environ. Microbiol. 82, 7236–7247 (2016).
Schrallhammer, M. et al. ‘Candidatus Megaira polyxenophila’ gen. nov., sp. nov.: Considerations on evolutionary history, host range and shift of early divergent rickettsiae. PLoS One 8 (2013).
Raymann, K., Bobay, L.-M., Doak, T. G., Lynch, M. & Gribaldo, S. A genomic survey of Reb homologs suggests widespread occurrence of R-bodies in proteobacteria. G3: Genes, Genomes, Genetics 3, 505–516 (2013).
Jeblick, J. & Kusch, J. Sequence, transcription activity, and evolutionary origin of the R-bodycoding plasmid pKAP298 from the intracellular parasitic bacterium Caedibacter taeniospiralis. Journal of Molecular Evolution 60, 164–173 (2005).
Lalucat, J., Wells, B. & Gibson, I. Relationships between R bodies of certain bacteria. Micron and Microscopica Acta 17, 243–245 (1986).
Arndt, D. et al. Phaster: a better, faster version of the phast phage search tool. Nucleic Acids Research 44, W16–W21 (2016).
Moran, N. A., Degnan, P. H., Santos, S. R., Dunbar, H. E. & Ochman, H. The players in a mutualistic symbiosis: insects, bacteria, viruses, and virulence genes. Proceedings of the National Academy of Sciences 102, 16919–16926 (2005).
Schmitt, M. J. & Breinig, F. Yeast viral killer toxins: lethality and self-protection. Nature Reviews Microbiology 4, 212 (2006).
Sexton, J. A. & Vogel, J. P. Type IVb secretion by intracellular pathogens. Traffic 3, 178–185 (2002).
Lagkouvardos, I., Shen, J. & Horn, M. Improved axenization method reveals complexity of symbiotic associations between bacteria and acanthamoebae. Environmental Microbiology Reports 6, 383–388 (2014).
Picelli, S. et al. Tn5 transposase and tagmentation procedures for massively scaled sequencing projects. Genome Research 24, 2033–2040 (2014).
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. Journal 17, 10–12 (2011).
Koren, S. et al. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Research 27, 722–736 (2017).
Kearse, M. et al. Geneious basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28, 1647–1649 (2012).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with bowtie 2. Nature Methods 9, 357 (2012).
Ronquist, F. et al. MrBayes 3.2: Efficient bayesian phylogenetic inference and model choice across a large model space. Systematic Biology 61, 539–542 (2012).
Guindon, S. & Gascuel, O. A simple, fast, and accurate algorithm to estimate large phylogenies by Maximum Likelihood. Systematic Biology 52, 696–704 (2003).
Kerepesi, C., Banky, D. & Grolmusz, V. AmphoraNet: the webserver implementation of the Amphora2 metagenomic workflow suite. Gene 533, 538–540 (2014).
Edgar, R. C. MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research 32, 1792–1797 (2004).
Richter, M., Rosselló-Móra, R., Oliver Glöckner, F. & Peplies, J. JSpeciesWS: A web server for prokaryotic species circumscription based on pairwise genome comparison. Bioinformatics 32, 929–931 (2015).
Meier-Kolthoff, J. P., Auch, A. F., Klenk, H.-P. & Göker, M. Genome sequence-based species delimitation with confidence intervals and improved distance functions. BMC Bioinformatics 14, 60 (2013).
Darling, A. C., Mau, B., Blattner, F. R. & Perna, N. T. MAUVE: Multiple alignment of conserved genomic sequence with rearrangements. Genome Research 14, 1394–1403 (2004).
Couvin, D. et al. CRISPRCasFinder, an update of CRISPRFinder, includes a portable version, enhanced performance and integrates search for cas proteins. Nucleic Acids Research 46, W246–W251 (2018).
Stothard, P. The sequence manipulation suite: Javascript programs for analyzing and formatting protein and DNA sequences. Biotechniques 28, 1102–1104 (2000).
Armenteros, J. J. A. et al. SignalIP 5.0 improves signal peptide predictions using deep neural networks. Nature Biotechnology 37, 420 (2019).
Thorvaldsdottir, H., Robinson, J. T. & Mesirov, J. P. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Briefings in Bioinformatics 14 (2), 178–192 (2013).
Acknowledgements
Astrid Petersen is gratefully acknowledged for assistance with the DNA extraction. Marcello Pirritano received a scholarship by the Studienstiftung des deutschen Volkes.
Author information
Authors and Affiliations
Contributions
M.S. and M.Sch. conceived the study, M.S., M.Sch., M.P., K.G. conducted the experiment(s), M.P., N.Z., R.M., M.S., M.Sch., G.G. analysed the results. All authors reviewed the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Pirritano, M., Zaburannyi, N., Grosser, K. et al. Dual-Seq reveals genome and transcriptome of Caedibacter taeniospiralis, obligate endosymbiont of Paramecium. Sci Rep 10, 9727 (2020). https://doi.org/10.1038/s41598-020-65894-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-020-65894-1
This article is cited by
-
Symbionts of Ciliates and Ciliates as Symbionts
Indian Journal of Microbiology (2024)
-
Pseudomonas aeruginosa PA14 produces R-bodies, extendable protein polymers with roles in host colonization and virulence
Nature Communications (2021)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.