The origin and impeded dissemination of the DNA phosphorothioation system in prokaryotes

Jian, Huahua; Xu, Guanpeng; Yi, Yi; Hao, Yali; Wang, Yinzhao; Xiong, Lei; Wang, Siyuan; Liu, Shunzhang; Meng, Canxing; Wang, Jiahua; Zhang, Yue; Chen, Chao; Feng, Xiaoyuan; Luo, Haiwei; Zhang, Hao; Zhang, Xingguo; Wang, Lianrong; Wang, Zhijun; Deng, Zixin; Xiao, Xiang

doi:10.1038/s41467-021-26636-7

Download PDF

Article
Open access
Published: 04 November 2021

The origin and impeded dissemination of the DNA phosphorothioation system in prokaryotes

Nature Communications volume 12, Article number: 6382 (2021) Cite this article

3783 Accesses
12 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Phosphorothioate (PT) modification by the dnd gene cluster is the first identified DNA backbone modification and constitute an epigenetic system with multiple functions, including antioxidant ability, restriction modification, and virus resistance. Despite these advantages for hosting dnd systems, they are surprisingly distributed sporadically among contemporary prokaryotic genomes. To address this ecological paradox, we systematically investigate the occurrence and phylogeny of dnd systems, and they are suggested to have originated in ancient Cyanobacteria after the Great Oxygenation Event. Interestingly, the occurrence of dnd systems and prophages is significantly negatively correlated. Further, we experimentally confirm that PT modification activates the filamentous phage SW1 by altering the binding affinity of repressor and the transcription level of its encoding gene. Competition assays, concurrent epigenomic and transcriptomic sequencing subsequently show that PT modification affects the expression of a variety of metabolic genes, which reduces the competitive fitness of the marine bacterium Shewanella piezotolerans WP3. Our findings strongly suggest that a series of negative effects on microorganisms caused by dnd systems limit horizontal gene transfer, thus leading to their sporadic distribution. Overall, our study reveals putative evolutionary scenario of the dnd system and provides novel insights into the physiological and ecological influences of PT modification.

DNA glycosylases provide antiviral defence in prokaryotes

Article Open access 17 April 2024

Amer A. Hossain, Ying Z. Pigli, … Luciano A. Marraffini

Elucidation of genes enhancing natural product biosynthesis through co-evolution analysis

Article 12 April 2024

Xinran Wang, Ningxin Chen, … Xiaozhou Luo

Diverse and abundant phages exploit conjugative plasmids

Article Open access 12 April 2024

Natalia Quinones-Olvera, Siân V. Owen, … Michael Baym

Introduction

DNA phosphorothioate (PT) modification is a unique modification of the DNA backbone, in which a non-bridging oxygen atom is swapped with a sulfur atom^1,2. PT modification is a sequence-selective, stereospecific, post-replicative modification governed by a family of proteins encoded by five genes, termed dnd genes (corresponding to the often-observed DNA degradation phenotype during electrophoresis)^3,4. In dnd gene clusters, the dndA, dndC, dndD, dndE genes are essential for PT modification⁵. Among them, DndA acts as a cysteine desulfurase and assembles DndC, which is an iron-sulfur cluster protein that has ATP pyrophosphatase activity and is predicted to have PAPS reductase activity^6,7. In some cases, DndA can be functionally replaced by the cysteine desulfurase IscS⁸. DndD is believed to provide energy for PT modification⁹, and a DndD homolog known as SpfD in Pseudomonas fluorescens Pf0-1 has ATPase activity that is possibly related to DNA structure alterations or nicking during sulfur incorporation¹⁰. Structural analysis of DndE indicates that this protein is involved in binding nicked dsDNA¹¹. Although DndB is not essential for PT modification, its homolog DptB in Salmonella enterica serovar Cerro 87 can negatively regulate the transcription of members of the dptBCDE gene cluster¹². Revealed by a co-purification experiment, IscS, DndC, DndD and DndE form a protein complex with the same stoichiometry, and the four proteins assemble into a pipeline according to their gene organization¹³.

Generally, three major physiological functions of PT modifications have been identified. First, PT modification can function as an antioxidant^14,15,16. Phosphorothioated DNA affords DNA with reducing ability, reacting with H₂O₂, peroxides and hydroxyl radicals in vivo, thus protecting genomic DNA as well as sensitive enzymes from intracellular oxidative damage^14,15. Correspondingly, the growth range of both the mesophile Escherichia coli and the extremophile Shewanella piezotolerans WP3 (hereafter referred to as WP3) under multiple stresses increased in response to the antioxidant function of PT modification¹⁶. Second, PT modification systems can be coupled with PT restriction components (DndFGH), which recognize DNA lacking PT modification in consensus motifs and cleave unmodified DNA, thus forming PT-based restriction-modification (R-M) systems^17,18. These novel PT R-M systems have been identified in 734 strains and contribute to the defense of microbes against foreign DNA¹⁹. Finally, DNA PT-based antiviral activities have been recently revealed in some archaea and bacteria, thus expanding the known arsenal of virus resistance systems^20,21. Moreover, epigenetic regulation of PT has been suggested to occur in P. fluorescens Pf0-1, in which the transcriptional efficiency of 4 genes was altered in the presence of PT modifications, according to an in vitro transcriptional assay. This assertion was further supported by characterization of the relationships between two PT-modified sites and the gene expression of dndB in Streptomyces lividans²². Nevertheless, additional convincing evidence, especially the influence of PT modification on the transcription of those non-dnd genes, is needed to confirm the epigenetic regulatory function of PT modification.

Since PT modification was first discovered in soil-inhabiting, antibiotic-producing Streptomyces species²³, dnd genes have been identified in several prokaryotic genomes and environmental samples^19,24,25,26. However, the distribution of these genes is rather sporadic^19,20,26. In a previous sequence search for the dnd system in NCBI nucleotide databases, only 1,349 positive hits were obtained¹⁹. Moreover, the detection of dnd genes in oceanic metagenomes showed that the number of dnd genes is often <1% of that of housekeeping genes²⁶. In contrast, a survey of 230 diverse bacterial and archaeal genomes revealed DNA methylation in 93% of the genomes and identified 1,459 candidate MTase genes, indicating widespread distribution of DNA methylation in prokaryotes²⁷. These results revealed an ecological paradox involving this novel DNA modification system: although PT modifications confer benefits to microbes, why are dnd systems only sporadically distributed in a limited number of microorganisms? In actuality, this phenomenon was noticed early, and possible explanations have been proposed in that, despite extensive horizontal gene transfer (HGT), the lability of PT-modified DNA under oxidative stress and its susceptibility to PT-dependent endonucleases has led to widespread but sporadic distribution of PT modifications in bacteria⁹. However, this speculation has not yet been tested. Additionally, a comprehensive evaluation of the occurrence, origin, and evolution of dnd systems is lacking, thus severely limiting the ability to resolve this ecological paradox.

Here, to address this paradox, we hypothesized that the laterally transferred dnd system would generate a significantly negative effect on the physiology of the recipient microbes, thus limiting the spread of the PT modification system among prokaryotes. In this study, we explore the distribution pattern and ancient ancestor of the dnd system in bacterial and archaeal genomes. A variety of experiments are conducted to demonstrate the effect of PT modification on recipient microorganisms after HGT by using the marine bacterium S. piezotolerans WP3 and its endogenous bacteriophage SW1 together as a model system. Taken together, we revealthe most likely evolutionary scenario of the dnd system and propose that its sporadic distribution is result from the restricted HGT due to transcriptional interference.

Results

Distribution of the dnd system among prokaryotes

To comprehensively reveal the distribution of the dnd system in prokaryotes, we searched the NCBI Reference Sequence (RefSeq) genome database, which contains the sequences of 22,280 bacterial and 388 archaeal genomes, for different dnd genes and gene clusters (Supplementary Data 1). Notably, except for the dndA/iscS gene, the occurrence of dnd genes was substantially limited (Fig. 1). Furthermore, dndBCDE gene clusters were present in only 1.82% and 0.77% of the bacterial and archaeal genomes, respectively, indicating the makedly limited distribution of the dnd system. Accordingly, the occurrence of PT R components (dndFGH) was also limited to 1.09% of the bacterial genomes, and this gene cluster was completely absent in the archaeal genomes (Fig. 1b).

**Fig. 1: Occurrence and distribution of *dnd* genes and gene clusters among prokaryotes.**

The distribution pattern of the dnd system was further demonstrated by coupling the data with those of phylogenetic trees that collapsed at the order and phylum levels for bacteria and archaea, respectively (Fig. 1c and Supplementary Figs. S1 and S2). In addition to that of dndA/iscS, enrichment of dndD, dndE and dndH genes was noted in several bacterial orders, including Nostocales, Themoanaerobacterales and Sphingobacteriales. However, the presence of dnd gene clusters (dndCD, dndBCDE and dndFGH) was substantially limited. Consistent with the findings of a previous report¹⁹, the occurrence of the PT R component was much lower than that of the M component, indicating the presence of solitary dndBCDE gene clusters, especially in Actinobacteria, Alphaproteobacteria, Betaproteobacteria and Gammaproteobacteria (Fig. 1c). In contrast, no enrichment of the dnd system was observed in Archaea. Notably, dnd genes and gene clusters were significantly enriched in Nostocales in the phylum Cyanobacteria, while the dndFGH gene cluster was absent (Fig. 1c). In addition, dnd systems that were identified in cyanobacterial genomes had various configurations with different components (Supplementary Fig. S3). The incomplete dnd gene cluster and the disorder of the operon structure imply a primitive form of dnd systems or that they have undergone vertical evolution for an extremely long time and therefore diversified within the cyanobacterial lineage. Combining this evidence, we supposed that the PT M system initially originated in ancient Cyanobacteria and that cognate PT R components were later developed within the Gammaproteobacteria.

The dnd system probably originated in ancient Cyanobacteria

The evolutionary history of the dnd system was further explored. By using stringent selection criteria for BLASTP (e-value cut off of 10⁻²⁰ and query coverage of 75%), we collected highly reliable sequences of DndD proteins, which are essential components of the dnd system and can be used as marker proteins for this system⁹. Phylogenetic analysis indicated that the DndD distribution generally did not match the corresponding species phylogeny (Fig. 2a), suggesting that HGT processes strongly influenced the dissemination of dnd genes. However, the DndD from the Cyanobacteria phylum formed a monophyletic group, and these cyanobacterial DndD sequences were not found in the remaining phylogenetic branches (Fig. 2a). Intriguingly, phylogenetic analysis of the DndC protein revealed a similar pattern (Supplementary Fig. S4). Therefore, these results indicated the presence of an independent and conserved evolutionary path in Cyanobacteria.

**Fig. 2: Putative origin of the *dnd* system in Cyanobacteria.**

Next, a Cyanobacteria phylogenomic tree was constructed to further explore the relationship between the evolution of dnd genes and Cyanobacteria (Fig. 2b and Supplementary Fig. S5). The results showed that dnd systems were absent in Sericytochromatia and Vampirovibrionia, which are sister classes of Cyanobacteriia (GTDB). In contrast, dnd gene clusters existed exclusively in Cyanobacteriales but not in other orders (Fig. 2b). The DndD proteins in Cyanobacteriales formed two major and one minor phylogenetic clade (Fig. 2c and Supplementary Fig. S6). Among them, the phylogenetic positions of one of the major clades (clade 1) in the well-supported tree were highly consistent with the positions in the enlarged Cyanobacteriales phylogenomic tree (Fig. 2c), suggesting that the dnd systems were vertically inherited in the Cyanobacteriales order. Based on this evidence, we propose that the ancestor of Cyanobacteriales strains could be considered a candidate host in which the dnd system originated.

The occurrence of dnd systems is negatively correlated with prophages

We subsequently aimed to reveal the causes of the sporadic distribution of the dnd system. Interestingly, we occasionally noticed that prophages were not detected in bacterial genomes that harbored dnd gene clusters (Supplementary Fig. S7). This phenomenon led us to assess the relationship between the occurrence of dnd systems and prophages in a large dataset. The sequences of all publicly available bacterial genomes in the NCBI RefSeq database were downloaded and analyzed. To facilitate statistical analysis, only the 32 genera that contained >100 genomes of high quality were included in the following analysis. We searched for prophages in 13,916 bacterial genomes belonging to these 32 genera, and 24,720 prophages were identified in 7,549 genomes (Supplementary Data 2). Notably, the occurrence of all dnd genes and gene clusters was significantly negatively correlated (linear regression R² > 0.5, P < 0.001; Pearson coefficient < −0.7, P < 0.001) with the occurrence of prophages (Fig. 3a and Supplementary Table S1). This conclusion was further supported by a similar analysis among different species (Supplementary Fig. S8 and Table S1). Generally, individual dnd genes generally have a wider occurrence range than gene clusters. Remarkably, in contrast to the high prevalence of prophages in bacteria, the occurrence of dnd gene clusters was limited to a low amount (Fig. 3a).

**Fig. 3: Negative correlations between the occurrence of *dnd* genes/gene clusters and prophages are predominant in bacteria.**

To further quantify the pervasiveness of the negative association between prophages and the dnd system, we calculated Kendall’s τ coefficient between the abundance of prophages and dnd genes/gene clusters in all genomes for each genus (Supplementary Data 3). The results revealed negative correlations in the majority of these genera (Fig. 3b). Notably, the abundance of prophages and PT M systems were significantly negatively correlated (P < 0.05) in 50% of the analyzed genera, which contained 8,578 genomes. Again, a similar correlation was detected between the prophages and PT R system. Specifically, the number of prophages and dndBCDE was most strongly negatively correlated in Streptomyces (Kendall’s τ coefficient = −0.61, P = 3.25E-07) and Vibrio (Kendall’s τ coefficient = −0.34, P = 3.82E-08), and the number of prophages was also strongly negatively correlated with dndFGH in Vibrio (Kendall’s τ coefficient = −0.33, P = 1.26E-07) (Fig. 3c). Moreover, the association between prophages and the dnd system was further measured at the species level, and similar results were obtained (Supplementary Data 3). Overall, these results clearly indicated that the occurrence of dnd systems is negatively correlated with prophages in bacterial genomes. We therefore hypothesized that horizontally transferred dnd genes and the accompanying PT modifications may activate prophages, and the subsequent cell lysis and decrease in microbial abundance would significantly inhibit dnd system dissemination among prokaryotes.

PT modifications activate prophage by altering the binding affinity of phage repressor and the transcription level of its encoding gene

To address the aforementioned hypothesis, the influences of PT modification on prophages were assessed in a model system that was derived from the temperate phage SW1 and its bacterial host S. piezotolerans WP3 and has previously been utilized to explore the physiological influence of PT modification^16,28. In particular, the whole dnd gene cluster (dndABCDE) from S. enterica serovar Cerro 87 was cloned into a pSW2 shutter vector to generate pSW2Dnd (Fig. 4a). Correspondingly, eight putative PT modification motifs in the fpsR-fpsA intergenic region were identified according to the conserved PT modification sites (5′-G_psAAC-3′/5′-G_psTTC-3′) of S. enterica serovar Cerro 87²⁹. Furthermore, two of these motifs were located within FpsR operators, which are responsible for the genetic switch of phage activation³⁰. Exposure of pSW2Dnd from WP3NR to peracetic acid (PAA) and Tris-acetate EDTA (TAE) buffer caused extensive DNA cleavage (Fig. 4b), which is the typical in vitro characteristic of PT-modified DNA³. DNA PT modifications were then quantitatively analyzed; the frequencies of d(G_psA) and d(G_psT) modifications were 373 ± 3 and 329 ± 14/10⁶ nt, respectively, which is equivalent to 11 d(G_psA) and 9 d(G_psT) modifications in every pSW2Dnd plasmid (Fig. 4c). Taken together, the above-described data demonstrated that the transferred dnd gene cluster was functional in WP3NR cells.

**Fig. 4: PT modification influences the gene transcription and DNA replication of phage SW1.**

The relative transcription levels (RTLs) of SW1 genes fpsA and fpsB in pSW2Dnd were subsequently assessed by comparison with those in pSW2DndΔE (Supplementary Fig. S9), which lacked a dndE gene and thus did not show sensitivity to cleavage by PAA-TAE³¹. These derivatives of pSW2 vectors remain stable during cultivation and could therefore be used to accurately evaluate the effects of PT modification on transcription (Supplementary Fig. S10). Interestingly, the RTLs of fpsA and fpsB were significantly higher in pSW2Dnd than in pSW2DndΔE (Fig. 4d), indicating that PT modification influenced the transcription of SW1. Here, the possibility that this differential expression resulted from the SOS response as previously reported in S. enterica was ruled out, as neither cell elongation nor upregulated expression of the key SOS genes recA and lexA (both of which are typical features of the SOS response³²) was observed in the PT-modified strain (Supplementary Fig. S11). Additionally, we also excluded the possibility of differences due to PT-specific endonucleases, because neither a homologous gene of dndFGH was found in the WP3 genome nor differences in nuclease activity were detected in different WP3NR strains (Supplementary Fig. S12). We therefore were convinced that PT modification directly caused a change in phage SW1 transcription.

To further assess whether PT modification influenced the life cycle of SW1, the copy numbers of double-stranded replicative form (RF) DNA and the single-stranded (ss) DNA of pSW2Dnd and pSW2DndΔE were quantified. The data showed that ssDNA substantially accumulated after PT modification, indicating that the genetic switch of SW1 was activated during this situation (Fig. 4e). To confirm the effect of PT modification on the bacteriophage, we transferred pSW2Dnd and pSW2DndΔE into WP3ΔRE, which is a non-restricting WP3 strain harboring intact phage SW1³³. As expected, the RTLs of fpsA and fpsB significantly increased in WP3ΔRE/Dnd compared with WP3ΔRE/DndΔE (Supplementary Fig. S13a). Notably, the ssDNA of SW1 accumulated to high levels in the PT-modified strain WP3ΔRE/Dnd (Supplementary Fig. S13b). Therefore, our results suggested that PT modification functions as an anti-repressor, activating phage gene transcription and DNA replication.

The binding of the phage-borne repressor FpsR to its cognate operator is the key controlling factor for the SW1 genetic switch³⁰. To further explore the underlying mechanism through which PT modification affects phage gene expression, 8 PT modification motifs located within the fpsR-fpsA intergenic region were point mutated in pSW2Dnd and pSW2DndΔE to generate two vectors: pSW2Dnd-IG and pSW2DndΔE-IG (Supplementary Fig. S14). As expected, subsequent real-time qPCR (RT-qPCR) showed that there were no significant differences in the RTLs of SW1 genes and in the copy number of phage ssDNA after the PT modification motifs were mutated (Fig. 5a).

**Fig. 5: PT modification activates phage SW1 by influencing the repressor FpsR.**

We then examined whether the activation of phage gene transcription was due to the effects of PT modification on the activity of RNA polymerase or FpsR. First, the possibility of influence on RNA polymerase was ruled out by an in vitro transcription assay, as no significant difference in RNA production was detected (Supplementary Fig. S15). To test another possibility, we constructed two vectors, pSW2DndΔR and pSW2DndΔEΔR, in which the fpsR gene was deleted in-frame from pSW2Dnd and pSW2DndΔE, respectively (Supplementary Fig. S9). The RTLs of fpsA and fpsB significantly increased in these two vectors, but there was no significant difference in the levels between pSW2DndΔR and pSW2DndΔEΔR (Fig. 5c). Moreover, significant ssDNA production was observed in both vectors because of the derepressing effect of fpsR deletion (Fig. 5d). Taken together, these results suggest that PT modification influences the regulatory function of FpsR, thus affecting phage gene transcription.

To further test whether DNA PT modification affected the binding affinity of FpsR, PT-modified DNA and normal DNA probes that covered the fpsA promoter were chemically synthesized (Fig. 5e). Notably, one of the FpsR operators (O4) and a PT modification site were located within the −35 and −10 regions, respectively, of the fpsA promoter. A real-time surface plasmon resonance (SPR) experiment was then performed (Fig. 5f), and the equilibrium dissociation constants (K_D) obtained from the curve fit were 7.01 nM and 3.80 nM for FpsR binding with normal and PT-modified DNA, respectively (Fig. 5f and Supplementary Table S2). Besides that, the RTLs of the fpsR gene were significantly decreased after PT modification (Fig. 5g), indicating a lower amount of FpsR in the PT-modified WP3 strains. These data thus support a model in which PT modification substantially increased the binding affinity of FpsR to the fpsA promoter, and decreased the amount of FpsR protein, thereby relieving the repression of phage gene transcription because of the competitive binding of operators with different statuses of modification involving FpsR (Fig. 5h). Moreover, considering the dynamic nature of PT motifications⁹, we proposed that a “lagged effect” resulting from the different time requirements for the change in PT modification status and the reestablishment of repression probably also contributed to the derepression process.

Horizontal transfer of the dnd system reduces competitive fitness by perturbing gene transcription

To further elucidate the reason for the sporadic distribution of the dnd system, we questioned whether exogenous introduction of the dnd gene cluster confers competitive disadvantages to a non-PT bacterium. To this end, a competition assay was performed via coculture of WP3NR/Dnd and WP3NR/DndΔE at 20 °C (the optimum growth temperature of WP3) (Fig. 6a). In the starting culture (T0), WP3NR/Dnd accounted for 52% of the population. After 1-day- and 5-day-long incubation periods, the percentage of WP3 PT strains in the population decreased to 48 and 31%, respectively (Supplementary Table S3). The relative fitness values of the WP3NR/DndΔE vs WP3NR/Dnd strains from T1 vs. T0, T5 vs. T0 and T5 vs. T1 were 1.038 ± 0.022, 1.041 ± 0.003 and 1.043 ± 0.010, respectively (Fig. 6b), indicating decreased competitive fitness due to the dnd system. The competition experiment was also performed at 28 °C (the maximal growth temperature of WP3). Similarly, the coculture was dominated by the non-PT strain (93%) after 5 days of cultivation (Supplementary Table S3), and the calculated relative fitness values from T1 vs. T0, T5 vs. T0 and T5 vs. T1 were 1.180 ± 0.047, 1.131 ± 0.013 and 1.119 ± 0.005, respectively (Fig. 6b), indicating that the PT system conferred a stronger competitive disadvantage to WP3 under high temperature stress.

**Fig. 6: The horizontally transferred *dnd* gene cluster substantially reduces the fitness of the deep-sea bacterium *S. piezotolerans* WP3.**

Next, we concurrently performed genomic PT mapping and global transcriptome analysis to explore why the horizontally transferred dnd system reduced the competitive fitness of the marine bacterium S. piezotolerans WP3. Single-molecule real-time (SMRT) sequencing was used, which led to the identification of 2,254 PT modification sites within the WP3NR genome (Fig. 6c and Supplementary Data 4)-1,203 5′-G_psAAC-3′ sites and 1,329 5′-G_psTTC-3′ sites, which accounted for 4.05% and 4.48%, respectively, of the total number of PT motifs in the genome. This coincides with the nature of PT modification, which has been revealed to be partial or incomplete^9,29. Moreover, the calculated frequencies of d(G_psA) (223.36/10⁶ nt) and d(G_psT) (246.75/10⁶ nt) modification in the WP3NR/Dnd genome were in accordance with the mass spectrometric evidence (200 ± 7 and 221 ± 24/10⁶ nt for G_psA and G_psT, respectively) (Supplementary Fig. S16), and both were comparable to the previously reported modification frequencies in S. enterica serovar Cerro 87³⁴. Additionally, d(G_psA) and d(G_psT) modifications were identified in the coding regions of 934 and 1,018 genes, respectively, and were evenly distributed throughout the genome (Fig. 6c).

Transcriptomic analysis was performed to investigate the influence of PT modification on the gene transcription profile of WP3NR. The RNA sequencing (RNA-seq) data were validated via RT-qPCR analysis, which showed a strong correlation coefficient (R² = 0.9653) (Supplementary Fig. S17), indicating that the transcriptomic data were reliable and could be used for follow-up analysis. Again, the possibility that this differential expression resulted from the SOS response was ruled out, as neither cell elongation nor upregulated expression of recA/lexA was detected in the PT-modified strain (Supplementary Fig. S18 and Table S4). Overall, 57 genes were found to be differentially expressed (FDR < 0.05 and FC > 2) between WP3NR/Dnd and WP3NR/DndΔE (Fig. 6d and Supplementary Table S4). Notably, hierarchical clustering analysis indicated that enriched differentially expressed genes (DEGs) (26/57) were associated mainly with cellular metabolic functions according to classification via the KEGG database (Fig. 6d), probably explaining the decreased fitness of the PT-modified strain. Further coupled analysis revealed that although the proportion of DEGs containing PT sites was slightly higher than that of other genes (39.3% vs. 33.1%), the average number of PT sites they contain did not significantly differ from that of other genes (Fig. 6e), indicating that transcription alteration of and PT modification in a single gene were likely not directly correlated.

Regarding the substantial competitive disadvantage of WP3NR/Dnd at high temperature (28 °C), we speculated that PT modification perturbed the transcription of genes encoding heat-shock proteins (HSPs), which include chaperones and proteases and are essential for overcoming changes that involve protein denaturation during thermal stress³⁵. To address this possibility, eight HSPs were chosen as representatives, and the RTLs of their encoding genes at different temperatures were measured in the WP3NR/Dnd and WP3NR/DndΔE strains. The expression of HSP-encoding genes in both strains was significantly upregulated at 28 °C compared with 20 °C. However, the extent of the upregulation of these genes in the PT-modified strain was significantly less than that in the control strains (Supplementary Fig. S19). Specifically, the transcript levels of the hslU, hsp70 and dnaJ genes were >10-fold lower in WP3NR/Dnd than in WP3NR/DndΔE. These results strongly suggested that PT modification interferes with the normal induction of the heat-shock response, resulting in a competitive disadvantage for WP3NR at high temperature (Fig. 6b).

Discussion

As the earliest oxygen-producing life forms, Cyanobacteria are thought to be responsible for the steady increase in oxygen concentration on Earth by oxygenic photosynthesis^36,37. The origins of Cyanobacteria can be dated back to 2.7 billion years ago³⁷, and these organisms have been considered the key players in the Great Oxygenation Event (GOE) on Earth ~2.33 billion years ago³⁸. Additionally, sulfur-based metabolism has been suggested to be very ancient, because sulfur-metabolizing cells have been preserved in 3.4-billion-year-old microfossils^39,40. In this study, evolutionary analysis indicated that the dnd system likely originated from ancient Cyanobacteriales, probably the ancestors of Nostocaceae (Nostocales, according to NCBI taxonomy). Nostocales belong to category IV of the subsection of Cyanobacteria, and their most notable feature is the ability to form heterocysts for nitrogen fixation^37,41. Fossil evidence indicates that their existence and distribution fit well with the timing of the end of the GOE over a long period of geological history (2,100–720 Ma)^42,43. Consistent with this, dnd systems were absent in Sericytochromatia and Vampirovibrionia (Fig. 2b), which are sister groups of Cyanobacteria, and their differentiation from Cyanobacteria occurred before the GOE⁴⁴. This evidence thus supports the inference that the dnd system originated after the GOE. As the PT R systems are absent in Cyanobacteria (Figs. 1c and 2b), it is highly probable that the dnd gene clusters were originally used as antioxidant systems, while their coupling with the PT M component occurred later. In accordance with this speculation, the distribution of antioxidant-related genes in Cyanobacteria was found to be consistent with that of the dnd system (Supplementary Fig. S20). Taken together, these results suggest that the dnd system formed after the GOE and that this system may have been initially used by Cyanobacteria to combat oxidative stress caused by rising oxygen concentrations on Earth. Meanwhile, considering the analogy between the orphan PT and methylation-based modification systems, the possibility that the dnd system evolved first for epigenetic regulation cannot currently be excluded (Fig. 7).

**Fig. 7: Schematic representation of the hypothetical evolutionary scenario of the *dnd* system.**

Intriguingly, dnd gene clusters were frequently identified within mobile genetic elements, implying that the dissemination of dnd systems depends mainly on HGT events⁴⁵. This notion was in line with the results of the phylogenetic analysis of dnd genes and PT sequence contexts^19,26. To address the hypothesis against the ecological paradox concerning the distribution of the dnd system, derivatives of the marine bacterium S. piezotolerans WP3 and filamentous phage SW1 were used as a model system in this study to investigate the influence of PT modification on the transcription and physiology of microbes. Owing to their versatile metabolic capabilities, members of the genus Shewanella inhabit diverse environments, including seawater, sediment, deep ocean, marine invertebrates, food, and occasionally clinical samples^46,47,48,49. A previous survey indicated that only 3 Shewanella species harbour a dnd system, leaving the majority of Shewanella not modified by PT¹⁹. Filamentous phages from the Inoviridae family are pervasive in prokaryotes across Earth’s biomes^50,51, and they have been identified in diverse environments and belong to the dominant virus groups in marine sediment^52,53,54,55. Moreover, due to their unique characteristic of not lysing host cells even in the induced state, they are very suitable for exploring the influences after prophage activation⁵⁶. Previously, dnd genes have been identified in several marine bacteria and oceanic metagenomes, and PT modification has been detected in DNA samples from the Oregon coast and the Sargasso Sea at varying depths²⁶. Therefore, the experiment conducted in this study simulated the HGT of a dnd system into a non-PT Shewanella bacterium, which probably occurs in natural marine environments.

As the most abundant biological entities on the planet, viruses, including bacteriophages, significantly influence the physiology, metabolism and life cycle of their microbial hosts^57,58,59,60. Specifically, prophages are highly abundant in prokaryotic genomes^61,62, and they could be considered as dangerous molecular time bombs due to their potential to be activated⁶³. In this study, the significant negative correlation between the presence of prophages and a dnd system was illustrated (Fig. 3), and the activation of filamentous phages triggered by PT modification was experimentally confirmed. Noteworthily, Caudovirales, including Siphoviridae, Myoviridae, and Podoviridae, were predominant, accounting for 99.39% of all the taxonomically classified prophages in the present study (n = 17822) (Supplementary Fig. S21 and Data 2). Furthermore, phage-borne repressors or regulators were identified in the majority (59.68%) of all these prophage elements, suggesting that a similar PT-induced activation may be widespread among taxonomically diverse bacteriophages. To confirm the activation of other types of prophages triggered by PT modification, we reanalyzed previously reported transcriptomic data of E. coli K-12 strain JW3350⁶⁴, which contains 9 different prophage elements in its genome (Supplementary Fig. S22). Among the 9 prophages, 6 contained a total of 14 genes whose transcription levels were significantly upregulated by PT modification. Notably, many of these upregulated genes encode proteins that have lethal effects on the bacterial host, such as holin, lysozyme, murein endopeptidase in DLP12 and Qin, cell death peptidase in e14, and toxin YeeV in CP4-44. Next, transformation assays were performed to verify whether the increasing expression of prophage genes due to PT modification would affect the transfer of plasmid DNA (Supplementary Fig. S22). As expected, the transformation frequency of the plasmid pSW2Dnd carrying the complete PT modification function was significantly lower than that of the control plasmids pSW2DndΔE and pSW2. To test the pervasiveness of this effect, we further conducted plasmid transfer assays in 2 Shewanella strains carrying multiple prophages (4 and 3 prophages in S. oneidensis MR-1 and S. putrefaciens W3-18-1, respectively), and similar results were obtained (Supplementary Fig. S23). These experimental evidences strongly suggest that the activation of the prophage gene and the subsequent detrimental consequence caused by PT modification probably led to HGT failure of the dnd system, thereby impeding its dissemination among prokaryotic genomes (Fig. 7).

Epigenetic modifications of DNA, which involve no alterations in the nucleic acid sequence, play important roles in cellular physiology⁶⁵. The most well-studied DNA modifications are base methylations, including the forms of 6-methyladenosine (m6A), 4-methylcytosine (m4C), and 5-methylcytosine (m5C), which are actively involved in the regulation of the expression of various genes^66,67. DNA methylase is presumably derived from the more ancient RNA methylase, which could be traceable to the last universal common ancestor (LUCA) of all life^68,69. As a novel type of modification of the DNA backbone, PT modification is also speculated to participate in gene regulation¹⁹. Previously, although 184 genes were identified as being differentially transcribed in the dndBCDE deletion mutant of S. enterica, subsequent experiments indicated that these changes were due to the SOS response resulting from the activity of DndFGH⁷⁰. The global transcriptional impact of PT modification was further investigated in P. fluorescens pf0-1, which harbours dndBCDE but lacks the cognate dndFGH¹⁹. However, only 10 DEGs were observed between the wild-type strain and the dndBCDE deletion mutant, and none of the promoter regions of these DEGs contained PT sites¹⁹. Nevertheless, an in vitro transcription assay indicated that PT modification disturbed gene transcription (4 genes out of the 10 tested genes)¹⁹. Recently, dndB gene expression was shown to be regulated by PT modification in S. lividans²². Generally, although the in vivo evidence is not convincing, these results strongly suggest that PT modification is involved in epigenetic regulation. The key relevant step forward is to verify whether and how PT modification is capable of regulating the transcription of non-dnd genes in vivo. In this study, the transcription of various functional genes was significantly altered in the PT-modified strain, thus providing important evidence further supporting the epigenetic regulation of PT modification.

Interestingly, it seemed that PT modifications are not directly correlated with the transcriptional changes of genes based on coupling analysis of modification and transcriptomic data. This phenomenon has been previously noted in P. fluorescens Pf0-1, and reasonable explanations were given accordingly¹⁹. Regarding this special feature of PT-dependent epigenetic regulation, we propose that an in-depth analysis, especially at the single-cell or single-molecule level, will facilitate determining the underlying mechanism in the future. Despite this unresolved problem, we proved in this study that PT modification can alter the binding affinity between transcriptional regulators and their cognate operator DNA, thereby causing changes in gene transcription (Fig. 5). The underlying mechanism at the molecular level can be further explained by the results of a previous theoretical study. Specifically, Rp-phosphorothioation (Rp-PT), which is the biologically produced isomer of the PT modification, substantially destabilized B-type DNA, enhanced the rigidity of the DNA backbone, and differentiated backbone stability as an interaction with base steps⁷¹. Therefore, we assumed that the interactions between PT and bases can alter the conformation of DNA and then influence the DNA-protein binding affinity.

As a multifunctional epigenetic system, the dnd system plays roles in defence against foreign DNA, the balance of intracellular redox homeostasis, antioxidation, and virus resistance⁹. Despite these advantages, deficiencies in PT modification have also been reported. For example, the PT modification of DNA leads to lethal genomic instability under hypochlorous acid stress⁷². Moreover, the antioxidant ability of PT modification is very weak; it could restore only approximately 50% of the antioxidant ability in the E. coli MG1655 strain with the loss of catalase and peroxidase¹⁶. Compared with that of other, more efficient antioxidant systems, such as those of catalase, SOD and SOR⁷³, the survival advantage of the dnd system conferred to the host is probably not substantial, thereby reducing the possibility that it is retained by the microbial population after HGT. Correspondingly, the introduction of the dnd gene cluster did not promote the growth of S. piezotolerans WP3 cells at its optimal temperature¹⁶. In contrast, a marked competitive disadvantage was noticed (Fig. 6b). To further support this notion, the dndBCDE gene cluster was transformed into E. coli MG1655, which harbor an intact antioxidant system. Unlike the situation in the Hpx⁻ strain, which is a catalase/peroxidase-disrupting strain¹⁶, the horizontally transferred PT system did not afford E. coli MG1655 growth advantages under multiple stress conditions (Supplementary Fig. S24). In accordance with this viewpoint, dnd systems were found to be preferentially maintained in some pathogens, such Mycobacterium abscessus (~50% of clinical isolates of M. abscessus exhibit the PT phenotype) and Clostridium difficile, both of which lack catalase and SOD^74,75, to compensate for their antioxidative activity.

In this study, an evolutionary scenario of the dnd system was proposed, and the underlying mechanism of restricted HGT was experimentally explored (Fig. 7). Our findings not only explain the paradox concerning the patchy distribution of dnd systems but also deepen our understanding of the relationship between the origin and dissemination of novel microbial functional traits and major events (especially the GOE) on Earth during long-term evolutionary history.

Methods

Identification of dnd genes and prophages in prokaryotes

In total, the sequence data of 22,280 and 388 bacterial and archaeal genomes were retrieved from the RefSeq genome database (Release 202) in December 2020. To avoid the effect of low genomic quality on the analysis of the distribution of the dnd system, only the “assembly level” of “complete genome” or “chromosome” was selected. The Dnd protein sequences in the Kyoto Encyclopedia of Genes and Genomes (KEGG) Orthology (KO) database of the KEGG database⁷⁶, including DndA/IscS (K04487), DndB (K19169), DndC (K19170), DndD (K19171), DndE (K19172), DndF (K19173), DndG (K19174), and DndH (K19175), were retrieved and used to construct a reference protein database. Dnd proteins encoded by prokaryotic genomes were identified by BLASTp using DIAMOND (v0.9.25.126)⁷⁷ against the reference database, with a cut-off value of e < 10⁻¹⁰, a coverage ≥50% and a parameter of “—more-sensitive”, as used in a previous study¹⁹. The dndCD, dndBCDE and dndFGH gene clusters were considered to be present when all dnd genes were adjacent. Additionally, the occurrence of dndFGH in 30 ORFs upstream or downstream of dndCD/dndBCDE was a requirement, as previously described¹⁹. The prophages in prokaryotic genomes were identified and annotated by VIBRANT (v1.2.1), with the default parameters⁷⁸. For the taxonomic classification of prophages, a majority-rules approach was used to assign viral taxonomy as previously described⁷⁹. Briefly, all proteins from prophages were subjected to BLASTp alignment against RefSeq virus, and a prophage was considered to belong to a viral family if ≥50% of the proteins were assigned to that family with a bitscore ≥50.

Phylogenetic analysis

To construct a phylogenetic tree of DndD, all DndD protein sequences were first identified by BLASTp (E-value cut off of 10⁻²⁰ and query coverage of 75%) and then downloaded from the NCBI database, aligned by the MAFFT algorithm (v7.313)⁸⁰, and filtered with trimAl (v1.2)⁸¹; a phylogenetic tree was then constructed using RAxML (v8.0)⁸² and the PROTGAMMAAUTO model, with 1,000 bootstraps; a phylogenetic tree of DndC was constructed with the same procedure. For the reference phylogenetic tree of prokaryotes, 120 and 122 concentrated conserved proteins were identified and aligned by GTDB-Tk (v1.3.0)⁸³ from the 771 bacterial and 96 archaeal genomes, respectively. The tree was constructed by FastTree 2 (v2.1.10)⁸⁴ based on the maximum-likelihood algorithm and visualized by MEGA X (v10.2.2)⁸⁵. For clarity, only the bacterial orders and archaeal phyla with ≥30 high-quality genomes were included in the phylogenetic tree. For the phylogenomic tree of Cyanobacteria, a collection of sequences of cyanobacterial genomes was downloaded from the NCBI database. The tree was constructed via GTDB-Tk⁸³ “classify_wf” and visualized by iTOL (v4)⁸⁶; Sericytochromatia was set as the root lineage⁴⁴. The phylogenetic tree of Cyanobacteriales DndD and the phylogenomic tree of Cyanobacteriales were constructed according to similar procedures.

Correlation analyses between prophages and dnd genes/gene clusters

For effective statistical analyses⁸⁷, only the genera and species with more than 100 and 30 genomes of high quality, respectively, were included for correlation analyses. Among these genera and species, the association between the occurrence of prophages and each dnd gene/gene cluster was measured with the Pearson correlation coefficient via the Python function “Pearsonr” from SciPy (v1.0)⁸⁸ and linear regression analyses in R (v3.5.3)⁸⁹. Within each genus and species, the association between the abundance of prophages and each dnd gene/gene cluster was measured with Kendall’s τ coefficient and P-values via the Python function “kendalltau” from SciPy⁸⁸. Kendall’s τ coefficient was chosen for this because the values included a large number of “0” and “1” and varied on a low degree, which were not considered continuous normal data for the calculation of linear correlations but instead were considered ordinal categorical variables for the calculation of rank correlations⁹⁰. To facilitate statistical analysis, only the genus and species with ≥30 genomes carrying at least 1 prophage or dnd gene/cluster were included.

Bacterial strains and culture conditions

All bacterial strains and plasmids used in this study are listed in Supplementary Table S5. The Shewanella strains were cultured in modified 2216E marine media (2216E) (5 g/l tryptone, 1 g/l yeast extract, 0.1 g/l FePO₄, 34 g/l NaCl) with shaking at 220 rpm at different temperatures. E. coli strains were incubated in lysogeny broth (LB) media (10 g/l tryptone, 5 g/l yeast extract, 10 g/l NaCl) supplemented with 50 μg/ml DL-α, ε-diaminopimelic acid (DAP) at 37 °C. For solid media, agar-A (Bio Basic Inc., Ontario, Canada) was added at 1.5% (w/v). The antibiotic chloramphenicol (Cm) (Sigma, St. Louis, USA) was added to the media at final concentrations of 25 μg/ml and 12.5 μg/ml for E. coli and Shewanella, respectively, when needed. The growth of the Shewanella and E. coli strains was determined using turbidity measurements at 600 nm in 2216E and LB media, respectively.

Construction of PT modification strains

DNA fragments coding for the dndA and dndBCD genes were amplified via PCR using plasmids pJTU3619 and pJTU3529, respectively, as templates. The dndA products were cloned between XhoI and KpnI into the corresponding sites of pSW2. The dndBCD fragments were then cloned between PstI and KpnI into the corresponding sites, yielding pSW2DndΔE. Vectors harbouring the fpsR deletion were constructed using the vectors pSW2Dnd and pSW2DndΔE. Briefly, two opposing primers targeting the fpsR gene were used to amplify the whole sequence of the two vectors, except for the coding region of fpsR. The PCR products were subsequently digested with ApaI and then self-ligated, yielding pSW2DndΔR and pSW2DndΔEΔR, respectively. For the construction of vectors with PT site mutations, fragments of dndABCDE, dndABCD, and pSW2 were amplified via PCR. The intergenic region between fpsA-fpsR, which harboured PT site mutations, was synthesized (Biosune, Shanghai, China) and then fused to dndABCDE/dndABCD and pSW2 fragments with a ClonExpress cloning kit (Vazyme, Nanjing, China), yielding pSW2Dnd-IG and pSW2DndΔE-IG, respectively. The vector constructs were transformed into WM3064, a DAP auxotrophic strain. The transformants were subsequently confirmed via enzyme digestion and DNA sequencing. The vectors were then introduced into WP3 strains by two-parent conjugation. The transconjugant was selected by Cm resistance and was verified via PCR. For construction of PT-modified E. coli strains, pJTU3619 and pJTU3529 plasmids were introduced into MG1655 cells by calcium chloride transformation. All the vectors were confirmed by DNA sequencing.

Determination of PT modification in plasmid and genomic DNA

PT modification in plasmid DNA was detected by a peracetic acid (PAA) cleavage assay as previously described¹⁵. Briefly, the plasmid was linearized using XhoI or EcoRI according to the manufacturer’s instructions. The linearized plasmid DNA was dissolved in 1× TAE to a concentration of 300 ng/ml. A total amount of 30 ng/ml DNA was incubated in PAA–TAE, which was made by mixing 1% 1 M stock PAA with TAE buffer (40 mM Tris base adjusted to pH 7.5 using acetic acid and 0.8 mM ethylenediaminetetraacetic acid (EDTA)) for 20 min. DNA was precipitated using propanol and then resuspended in TAE buffer for agarose gel electrophoresis analysis. PT modifications in pSW2Dnd and WP3NR/Dnd were quantified by LC-coupled, time-of-flight mass spectrometry as previously described²⁶. In brief, the DNA was digested using nuclease P1 and alkaline phosphatase. The digestion mixture containing PT dinucleotides was resolved on a Poroshell 120 SB-AQ column (Agilent Co., CA, USA). The high-performance LC (HPLC) column was then coupled to an Agilent 6410 Triple Quad LC–MS spectrometer (Agilent Co., CA, USA) with an electrospray ionization source in positive mode. Multiple reaction monitoring modes were used for the detection of product ions derived from the precursor ions. The instrument parameters, including precursor ion m/z, product ion m/z, fragmentor voltage and collision energy, were the same as previously described²⁶.

RNA isolation and RT-qPCR

The S. piezotolerans WP3 strains were inoculated into 2216E media, after which the culture was collected and immersed in liquid nitrogen immediately when the cells reached exponential phase. Total RNA was isolated with TRI reagent-RNA isolation kit (Molecular research center, Cincinnati, USA). The RNA samples were treated with DNase I at 37 °C for 1 h to remove DNA contamination. The purified RNA were reverse transcribed to cDNA by RevertAid First Strand cDNA Synthesis Kit (Fermentas, Maryland, USA). The primer pairs used to amplify the selected genes for RT-qPCR were designed using Primer Express software (v3.0.1) (Applied Biosystems, CA, USA). PCR cycling was conducted using 7500 System SDS software (v2.0.6) (Applied Biosystems) in 20 μl reaction mixtures that included 1× SYBR Green I Universal PCR Master Mix (Applied Biosystems), 0.5 μM each primer, and 1 μl cDNA template^91,92.

Phage DNA copy number determination

The copy numbers of pSW2 RF DNA and ssDNA were quantified as previously described⁹³. In brief, a pair of primers, SW1RFRTFor/SW1RFRTRev (Supplementary Table S6), were designed to quantify the copy number of RF DNA; the primer pair targeted the attP site of SW1, and RF DNA was used as the amplification template. Total DNA was used as a template for the first round of qPCR. The copy number equalled the number of RF DNA plus ssDNA combined. The template for the second round of qPCR was the total DNA treated with S1 nuclease (Thermo Fisher Scientific, MA, USA). After the ssDNA was removed from the total DNA, the quantification result was the copy number of RF DNA. Finally, the copy number of ssDNA was calculated from the two rounds of qPCR.

Nuclease activity assays

To assess intracellular nuclease activity, crude enzyme extracts of WP3NR strains were incubated together with DNA substrate. In particular, the WP3NR strains from cultures at the mid-logarithmic phase (OD₆₀₀ = 1.0) were pelleted by centrifugation. The harvested cell pellets were resuspended in lysis buffer [500 mM NaCl, 10% glycerol (w/v), 20 mM Tris–HCl (pH 8.0)] and sonicated on ice. The cell lysates were subsequently centrifuged at 10,000 × g for 20 min at 4 °C, after which the supernatants were obtained. Nine microliters of the crude enzyme extracts were incubated together with approximately 100 ng of purified DNA (PCR-amplified 16 S rRNA gene and genomic DNA of S. piezotolerans WP3) in reaction buffer (Takara, Dalian, China). The reaction mixtures were incubated at 20 °C for 1 h and then separated on a 1% agarose gel. The DNA was then stained with GelRed (Biotium, Hayward, USA) and visualized by a gel imaging system (Tanon, Shanghai, China).

Expression and purification of FpsR

FpsR was expressed and purified as previously described³³. Briefly, E. coli strain C41 (DE3) containing the FpsR expression vector was grown in 1 l of LB broth with 50 μg/ml kanamycin at 37 °C for 3 h. FpsR expression was induced by the addition of 0.5 mM isopropyl-β-D-thiogalactopyranoside (IPTG) when the OD₆₀₀ reached 1.0, and the culture was then incubated at 20 °C overnight. The cells were collected by centrifugation, resuspended in binding buffer (500 mM NaCl and 20 mM imidazole, 20 mM Tris-HCl, pH 8.0) and sonicated on ice. The cell extract was clarified by centrifugation at 10,000 × g for 20 min at 4 °C. Ni Sepharose High Performance (GE Healthcare, WI, USA) resin was used to purify the His-tagged FpsR according to the manufacturer’s instructions. The protein was eluted in elution buffer (500 mM NaCl and 500 mM imidazole, 20 mM Tris-HCl, pH 8.0). Imidazole was removed using HiTrap Desalting columns (GE Healthcare, WI, USA) according to the manufacturer’s instructions. The purified FpsR was stored at 4 °C, and its concentration was determined by the Bradford method using bovine serum albumin (BSA) as a standard. The purity of FpsR was confirmed by SDS-PAGE (15%) with visualization using Coomassie Brilliant Blue R-250.

In vitro transcription

In vitro transcription was performed as previously described¹⁹, with slight modifications. Briefly, PT-modified oligonucleotides (ssDNA) were chemically synthesized (Sangon Biotech, Shanghai, China), and annealed to obtain PT-modified dsDNA; non–PT-modified DNA templates were generated by PCR amplification. Both DNA templates were purified using a GenElute PCR Clean-Up Kit (Sigma, St. Louis, USA) and then quantified by using an Invitrogen Qubit dsDNA High-Sensitivity (HS) Assay Kit (Thermo Fisher Scientific, MA, USA). Afterwards, 100 ng of DNA template was transcribed in a 20-μl reaction mixture that included 2 units of E. coli RNA polymerase, holoenzyme (New England Biolabs, MA, USA), 4 μl of 5× E. coli RNA polymerase reaction buffer, 0.5 mM NTP Mix (Thermo Fisher Scientific, MA, USA), and 20 units of RNase inhibitor (Solarbio, Beijing, China). In vitro transcription was conducted at 37 °C for 12 h, after which the temperature was increased to 85 °C for 10 min to stop the reaction. The RNA product was quantified by using an Invitrogen Qubit RNA HS Assay (Thermo Fisher Scientific, MA, USA).

SPR measurements

The DNA probe used for SPR was a 50-bp fragment that included the FpsR binding site and PT site (Supplementary Table S6). Oligonucleotides (with and without PT modification) were chemically synthesized (Biosune, Shanghai, China) and purified with a Cycle Pure Kit (Omega Bio-Tek, Norcross, USA). The SPR measurements were performed via a Biacore 8 K instrument (GE Healthcare, WI, USA), as previously described³⁰, with slight modifications. Briefly, 10 nM biotinylated DNA was captured on the surface of a streptavidin sensor chip (Cytiva, MA, USA) at a flow rate of 30 μl/min for 120 s in phosphate-buffered saline (PBS) consisting of 0.05% (v/v) Tween-20 (pH 7.4). A series of concentrations of FpsR proteins were injected into the flow system and analyzed. All binding analyses were performed in PBS consisting of 0.05% (v/v) Tween-20 (pH 7.4), at both 4 °C and 20 °C. Both the association and dissociation times were set to 120 s. After dissociation, the chip surface was regenerated with 0.5% SDS (w/v) for 30 s and stabilized for 120 s. Prior to analysis, double reference subtractions were performed to eliminate bulk refractive index changes, injection noise, and data drift. The binding affinity was determined by global fitting to a Langmuir 1:1 binding model within the Biacore insight evaluation software (v1.0) (GE Healthcare, WI, USA)^91,92.

Genomic mapping of PT modification by SMRT sequencing

SMRT sequencing and PT modification detection were performed as previously described^19,20,29. In brief, the genomic DNA of WP3 was fragmented to an average size of 20 kb using g-TUBEs (Covaris, Woburn, MA, USA). The fragmented DNA was end repaired and ligated to hairpin adaptors. SMRT sequencing was carried out on a PacBio RS II instrument (Pacific Biosciences, Menlo Park, CA, USA). A total of 886,401,450 bp of sequencing data were produced, with an average read length of 5,897 bp. The genome was then assembled using HGAP 3.0⁹⁴ with the default parameters in SMRT Analysis Suite (v1.3) (Pacific Biosciences, Menlo Park, CA, USA). Base modification analysis was performed using the base modification detection workflow of SMRT Analysis v1.3, and the PT modifications in the whole genome were identified according to calculations of the interpulse duration (IPD) ratio as previously described²⁹.

Transcriptomic analysis

Strand-specific transcriptome sequencing was performed at Magigene Biotechnology Co., Ltd. (Guangdong, China). Briefly, rRNA was removed using an Epicentre Ribo-Zero rRNA Removal Kit (Epicentre, Madison, WI, USA), and a cDNA library was prepared with a NEBNext Ultra II Directional RNA Library Prep Kit for Illumina (NEB, Ipswich, MA, USA) according to the manufacturer’s instructions. The initial quantification of the library was carried out using a Qubit Fluorometer (Life Technologies, Carlsbad, CA, USA), and the insertion fragment size of the library was determined with an Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA). The effective concentration of the library was quantified accurately via qPCR (effective concentration >2 nM). The different libraries were pooled together in a flow cell according to the effective concentration and the target offline data volume. After clustering, the Illumina HiSeq sequencing platform (Illumina, San Diego, USA) was used for paired-end sequencing. The raw data were filtered and evaluated by fastp software (v0.19.7)⁹⁵, after which the clean reads were mapped to the S. piezotolerans WP3 genome (NC_011566.1) by HISAT software (v2.1.0)⁹⁶. RSEM (v1.3.1)⁹⁷ was used to calculate the read counts per sample, and the sequencing results were evaluated in terms of quality, alignment, saturation, and distribution of reads on the reference genome by DEGseq (v1.36.0)⁹⁸. Gene expression was calculated on the basis of the number of reads mapped to each gene using the fragments per kilobase per million mapped reads (FPKM) method⁹⁹ and analyzed by edgeR (v3.20.2)¹⁰⁰. The DEGs were identified according to the following standards: a false discovery rate (FDR) < 0.05 and an FPKM fold change ≥ 2 between two samples.

Competition assays

Cultures of WP3NR/Dnd and WP3NR/DndΔE were grown independently to the stationary phase in modified 2216E media to an OD₆₀₀ of 3.5. A total of 5 ml of each culture was mixed together and taken as the T0 sample, and 100 μl of the same mixture was inoculated into 9.9 ml of fresh 2216E media supplemented with Cm at a final concentration of 12.5 μg/ml. After incubation for 24 h, 100 μl of the competing cells was inoculated into 9.9 ml of fresh 2216E media, and the rest was taken as the T1 sample. The experiment was repeated the next day, and the sample was collected as T2. In total, the procedure was performed for 5 consecutive days¹⁰¹. All the samples were serially diluted with fresh 2216E media, and 100-μl aliquots of appropriately diluted samples were plated onto 2216E plates. A total of 100 colonies from plates containing 100–400 colonies were randomly picked and subjected to colony PCR in conjunction with the primers listed in Supplementary Table S6. The relative fitness (W)¹⁰² was calculated by the ratio of the number of doubling of WP3NR/Dnd and WP3NR/DndΔE.

Plasmid transfer assays

Three plasmids (pSW2Dnd, pSW2DndΔE and pSW2) were introduced into E. coli BW25113 and JW3350 cells by calcium chloride transformation. The number of transformants was determined by counting colonies on selective agar medium and was verified via PCR. The transformation frequency was calculated as the number of transformants per μg plasmid DNA. For conjugal transfer, these 3 plasmids were introduced into S. oneidensis MR-1 and S. putrefaciens W3-18-1 strains by two-parent conjugation between them and the plasmid harbouring E. coli WM3064 strains. The transconjugant was selected by Cm resistance and was verified via colony PCR. The conjugation frequency was calculated as the number of transconjugants per number of donors.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All bacterial and archaeal assembled genomes (n = 22,668) used in this study were collected from the NCBI RefSeq database (Release 202, https://www.ncbi.nlm.nih.gov/refseq/). The Dnd protein sequences were retrieved from the KEGG Orthology (KO) database (https://www.genome.jp/kegg/ko.html). All the detailed information of the identified dnd genes, gene clusters and prophages are described and available publicly in Supplementary Data 1 and 2. The epigenomic sequencing data of S. piezotolerans WP3NR/Dnd generated using the PacBio RSII platform are available in Figshare [https://figshare.com/articles/dataset/SMRT_sequencing_data_of_Shewanella_piezotolerans_WP3NR_Dnd/16809943]. The transcriptomic data from the current study have been deposited in the NCBI SRA under project ID PRJNA565632. Source data are provided with this paper.

References

Eckstein, F. Phosphorothioation of DNA in bacteria. Nat. Chem. Biol. 3, 689–690 (2007).
Article CAS PubMed Google Scholar
Wang, L. et al. Phosphorothioation of DNA in bacteria by dnd genes. Nat. Chem. Biol. 3, 709–710 (2007).
Article CAS PubMed Google Scholar
Zhou, X. et al. A novel DNA modification by sulphur. Mol. Microbiol. 57, 1428–1438 (2005).
Article CAS PubMed Google Scholar
Chen, S., Wang, L. & Deng, Z. Twenty years hunting for sulfur in DNA. Protein cell 1, 14–21 (2010).
Article PubMed PubMed Central CAS Google Scholar
Xu, T. et al. DNA phosphorothioation in Streptomyces lividans: mutational analysis of the dnd locus. BMC Microbiol. 9, 41 (2009).
Article PubMed PubMed Central CAS Google Scholar
You, D., Wang, L., Yao, F., Zhou, X. & Deng, Z. A novel DNA modification by sulfur: DndA is a NifS-like cysteine desulfurase capable of assembling DndC as an iron-sulfur cluster protein in Streptomyces liVidans. Biochemistry 46, 6126–6133 (2007).
Article CAS PubMed Google Scholar
Chen, F. et al. Crystal structure of the cysteine desulfurase DndA from Streptomyces lividans which is involved in DNA phosphorothioation. PLoS ONE 7, e36635 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
An, X. et al. A novel target of IscS in Escherichia coli: participating in DNA phosphorothioation. PLoS ONE 7, e51265 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Wang, L., Jiang, S., Deng, Z., Dedon, P. C. & Chen, S. DNA phosphorothioate modification-a new multi-functional epigenetic system in bacteria. FEMS Microbiol. Rev. 43, 109–122 (2019).
Article CAS PubMed Google Scholar
Yao, F., Xu, T., Zhou, X., Deng, Z. & You, D. Functional analysis of spfD gene involved in DNA phosphorothioation in Pseudomonas fluorescens Pf0-1. FEBS Lett. 583, 729–733 (2009).
Article CAS PubMed Google Scholar
Hu, W. et al. Structural insights into DndE from Escherichia coli B7A involved in DNA phosphorothioation modification. Cell Res. 22, 1203–1206 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Cheng, Q. et al. Regulation of DNA phosphorothioate modifications by the transcriptional regulator DptB in Salmonella. Mol. Microbiol. 97, 1186–1194 (2015).
Article CAS PubMed Google Scholar
Xiong, W., Zhao, G., Yu, H. & He, X. Interactions of Dnd proteins involved in bacterial DNA phosphorothioate modification. Front. Microbiol. 6, 1139 (2015).
PubMed PubMed Central Google Scholar
Dai, D. et al. DNA phosphorothioate modification plays a role in peroxides resistance in Streptomyces lividans. Front. Microbiol. 7, 1380 (2016).
PubMed PubMed Central Google Scholar
Xie, X. et al. Phosphorothioate DNA as an antioxidant in bacteria. Nucleic Acids Res. 40, 9115–9124 (2012).
Article CAS PubMed PubMed Central Google Scholar
Yang, Y. et al. DNA backbone sulfur-modification expands microbial growth range under multiple stresses by its anti-oxidation function. Sci. Rep. 7 (2017).
Xu, T., Yao, F., Zhou, X., Deng, Z. & You, D. A novel host-specific restriction system associated with DNA backbone S-modification in Salmonella. Nucleic Acids Res. 38, 7133–7141 (2010).
Article CAS PubMed PubMed Central Google Scholar
Liu, G. et al. Cleavage of phosphorothioated DNA and methylated DNA by the Type IV restriction endonuclease ScoMcrA. PLoS Genet. 6, e1001253 (2010).
Article CAS PubMed PubMed Central Google Scholar
Tong, T. et al. Occurrence, evolution, and functions of DNA phosphorothioate epigenetics in bacteria. Proc. Natl Acad. Sci. USA 115, E2988–E2996 (2018).
Article CAS PubMed PubMed Central Google Scholar
Xiong, L. et al. A new type of DNA phosphorothioation-based antiviral system in archaea. Nat. Commun. 10 (2019).
Xiong, X. et al. SspABCD-SspE is a phosphorothioation-sensing bacterial defence system with broad anti-phage activities. Nat. Microbiol. 5, 917–928 (2020).
Article CAS PubMed Google Scholar
Dai, D., Pu, T., Liang, J., Wang, Z. & Tang, A. Regulation of dndB gene expression in Streptomyces lividans. Front. Microbiol. 9, 2387 (2018).
Article PubMed PubMed Central Google Scholar
Zhou, X., Deng, Z., Firmin, J. L., Hopwood, D. A. & Kieser, T. Site-specific degradation of Streptomyces lividans DNA during electrophoresis in buffers contaminated with ferrous iron. Nucleic Acids Res. 16, 4341–4352 (1988).
Article CAS PubMed PubMed Central Google Scholar
Sun, Y. et al. DNA phosphorothioate modifications are widely distributed in the human microbiome. Biomolecules 10, 1175 (2020).
Article CAS PubMed Central Google Scholar
Khan, H. et al. DNA phosphorothioate modification facilitates the dissemination of mcr-1 and bla_NDM-1 in drinking water supply systems. Environ. Pollut. 268, 115799 (2021).
Article CAS PubMed Google Scholar
Wang, L. et al. DNA phosphorothioation is widespread and quantized in bacterial genomes. Proc. Natl Acad. Sci. USA 108, 2963–2968 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Blow, M. J. et al. The epigenomic landscape of prokaryotes. PLoS Genet. 12, e1005854 (2016).
Article PubMed PubMed Central CAS Google Scholar
Yang, X., Jian, H. & Wang, F. pSW2, a novel low-temperature-inducible gene expression vector based on a filamentous phage of the deep-sea bacterium Shewanella piezotolerans WP3. Appl. Environ. Microbiol. 81, 5519–5526 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Cao, B. et al. Genomic mapping of phosphorothioates reveals partial modification of short consensus sequences. Nat. Commun. 5, 3951 (2014).
Article ADS CAS PubMed Google Scholar
Jian, H. et al. Multiple mechanisms are involved in repression of filamentous phage SW1 transcription by the DNA-binding protein FpsR. J. Mol. Biol. 431, 1113–1126 (2019).
Article CAS PubMed Google Scholar
Lai, C. et al. In vivo mutational characterization of DndE involved in DNA phosphorothioate modification. PLoS ONE 9, e107981 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Schoemaker, J. M., Gayda, R. C. & Markovitz, A. Regulation of cell division in Escherichia coli: SOS induction and cellular location of the SulA protein, a key to lon-associated filamentation and death. J. Bacteriol. 158, 551–561 (1984).
Article CAS PubMed PubMed Central Google Scholar
Jian, H., Xiong, L., Xu, G., Xiao, X. & Wang, F. Long 5′ untranslated regions regulate the RNA stability of the deep-sea filamentous phage SW1. Sci. Rep. 6, 21908 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Chen, C. et al. Convergence of DNA methylation and phosphorothioation epigenetics in bacterial genomes. Proc. Natl Acad. Sci. USA 114, 4501–4506 (2017).
Article CAS PubMed PubMed Central Google Scholar
Maleki, F., Khosravi, A., Nasser, A., Taghinejad, H. & Azizian, M. Bacterial heat shock protein activity. J. Clin. Diagnostic Res. 10, BE01–BE03 (2016).
CAS Google Scholar
Knoll, A. H. Paleobiological perspectives on early microbial evolution. Cold Spring Harb. Perspect. Biol. 7, a018093 (2015).
Article PubMed PubMed Central Google Scholar
Schirrmeister, B. E., Gugger, M. & Donoghue, P. C. Cyanobacteria and the great oxidation event: evidence from genes and fossils. Palaeontology 58, 769–785 (2015).
Article PubMed PubMed Central Google Scholar
Luo, G. et al. Rapid oxygenation of Earth’s atmosphere 2.33 billion years ago. Sci. Adv. 2, e1600134 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Wacey, D., Kilburn, M. R., Saunders, M., Cliff, J. & Brasier, M. D. Microfossils of sulphur-metabolizing cells in 3.4-billion-year-old rocks of Western Australia. Nat. Geosci. 4, 698–702 (2011).
Article ADS CAS Google Scholar
Bontognali, T. R. R. et al. Sulfur isotopes of organic matter preserved in 3.45-billion-year-old stromatolites reveal microbial metabolism. Proc. Natl Acad. Sci. USA 109, 15146–15151 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Schirrmeister, B. E., Vos, J. M. D., Antonelli, A. & Bagheri, H. C. Evolution of multicellularity coincided with increased diversification of cyanobacteria and the great oxidation event. Proc. Natl Acad. Sci. USA 110, 1791–1796 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Pang, K. et al. Nitrogen-fixing heterocystous Cyanobacteria in the tonian period. Curr. Biol. 28, 616–622 (2018).
Article CAS PubMed Google Scholar
Demoulin, C. F. et al. Cyanobacteria evolution: Insight from the fossil record. Free Radic. Biol. Med. in press (2021).
Soo, R. M., Hemp, J., Parks, D. H., Fischer, W. W. & Hugenholtz, P. On the origins of oxygenic photosynthesis and aerobic respiration in Cyanobacteria. Science 355, 1436–1440 (2017).
Article ADS CAS PubMed Google Scholar
Ou, H.-Y. et al. dndDB: a database focused on phosphorothioation of the DNA backbone. PLoS ONE 4, e5132 (2009).
Article ADS PubMed PubMed Central CAS Google Scholar
Janda, J. M. & Abbott, S. L. The genus Shewanella: from the briny depths below to human pathogen. Crit. Rev. Microbiol. 40, 293–312 (2014).
Article PubMed Google Scholar
Fredrickson, J. K. et al. Towards environmental systems biology of Shewanella. Nat. Rev. Microbiol. 6, 592–603 (2008).
Article CAS PubMed Google Scholar
Hau, H. H. & Gralnick, J. A. Ecology and biotechnology of the genus Shewanella. Annu. Rev. Microbiol. 61, 237–258 (2007).
Article CAS PubMed Google Scholar
Nealson, K. H. & Scott, J. Ecophysiology of the Genus Shewanella. Prokaryotes 6, 1133–1151 (2006).
Article Google Scholar
Roux, S. et al. Cryptic inoviruses revealed as pervasive in bacteria and archaea across Earth’s biomes. Nat. Microbiol. 4, 1895–1906 (2019).
Article CAS PubMed PubMed Central Google Scholar
Hay, I. D. & Lithgow, T. Filamentous phages: masters of a microbial sharing economy. EMBO Rep. 20, e47427 (2019).
Article PubMed PubMed Central CAS Google Scholar
Mai-Prochnow, A. et al. ‘Big things in small packages: the genetics of filamentous phage and effects on fitness of their host’. FEMS Microbiol. Rev. 39, 465–487 (2015).
Article PubMed Google Scholar
Middelboe, M., Glud, R. N. & Finster, K. Distribution of viruses and bacteria in relation to diagenetic activity in an estuarine sediment. Limnol. Oceanogr. 48, 1447–1456 (2003).
Article ADS Google Scholar
Engelhardt, T., Orsi, W. D. & Jørgensen, B. B. Viral activities and life cycles in deep subseafloor sediments. Environ. Microbiol. Rep. 7, 868–873 (2015).
Article CAS PubMed Google Scholar
Dell’Anno, A., Corinaldesi, C. & Danovaro, R. Virus decomposition provides an important contribution to benthic deep-sea ecosystem functioning. Proc. Natl Acad. Sci. USA 112, E2014–E2019 (2015).
Article ADS PubMed PubMed Central CAS Google Scholar
Rakonjac, J. Filamentous Bacteriophages: Biology and Applications. eLS (2012).
Güemes, A. G. C. et al. Viruses as winners in the game of life. Annu. Rev. Virol. 3, 197–214 (2016).
Article CAS Google Scholar
Breitbart, M. Marine viruses: truth or dare. Annu. Rev. Mar. Sci. 4, 425–448 (2012).
Article ADS Google Scholar
Danovaro, R. et al. Marine viruses and global climate change. FEMS Microbiol. Rev. 35, 993–1034 (2011).
Article CAS PubMed Google Scholar
Rohwer, F. & Thurber, R. V. Viruses manipulate the marine environment. Nature 459, 207–212 (2009).
Article ADS CAS PubMed Google Scholar
Touchon, M., Bernheim, A. & Rocha, E. P. Genetic and life-history traits associated with the distribution of prophages in bacteria. ISME J. 10, 2744–2754 (2016).
Article CAS PubMed PubMed Central Google Scholar
Harrison, E. & Brockhurst, M. A. Ecological and evolutionary benefits of temperate phage: what does or doesn’t kill you makes you stronger. Bioessays 39, 201700112 (2017).
Article Google Scholar
Paul, J. H. Prophages in marine bacteria: dangerous molecular time bombs or the key to survival in the seas? ISME J. 2, 579–589 (2008).
Article CAS PubMed Google Scholar
Wu, X. et al. Epigenetic competition reveals density-dependent regulation and target site plasticity of phosphorothioate epigenetics in bacteria. PNAS 117, 14322–14330 (2020).
CAS PubMed PubMed Central Google Scholar
Willbanks, A. et al. The evolution of epigenetics: from prokaryotes to humans and its biological consequences. Genet. Epigenet. 8, 25–36 (2016).
Article PubMed PubMed Central Google Scholar
Razin, A. & Cedar, H. DNA methylation and gene expression. Microbiol. Rev. 55, 451–458 (1991).
Article CAS PubMed PubMed Central Google Scholar
Casadesús, J. & Low, D. Epigenetic gene regulation in the bacterial world. Microbiol. Mol. Biol. Rev. 70, 830–856 (2006).
Article PubMed PubMed Central CAS Google Scholar
Iyer, L. M., Abhiman, S. & Aravind, L. Natural history of eukaryotic DNA methylation systems. Prog. Mol. Biol. Transl. Sci. 101, 25–104 (2011).
Article CAS PubMed Google Scholar
Weiss, M. C. et al. The physiology and habitat of the last universal common ancestor. Nat. Microbiol. 1, 16116 (2016).
Article CAS PubMed Google Scholar
Gan, R. et al. DNA phosphorothioate modifications influence the global transcriptional response and protect DNA from double-stranded breaks. Sci. Rep. 4, 6642 (2014).
Article CAS PubMed PubMed Central Google Scholar
Chen, L. et al. Theoretical study on the relationship between Rp-phosphorothioation and base-step in S-DNA: based on energetic and structural analysis. J. Phys. Chem. B 119, 474–481 (2015).
Article CAS PubMed Google Scholar
Kellner, S. et al. Oxidation of phosphorothioate DNA modifications leads to lethal genomic instability. Nat. Chem. Biol. 13, 888–894 (2017).
Article CAS PubMed PubMed Central Google Scholar
Ślesak, I., Kula, M., Ślesak, H., Miszalski, Z. & Strzałka, K. How to define obligatory anaerobiosis? An evolutionary view on the antioxidant response system and the early stages of the evolution of life on Earth. Free Radic. Biol. Med. 140, 61–73 (2019).
Article PubMed CAS Google Scholar
Brioukhanov, A. L., Thauer, R. K. & Netrusov, A. I. Catalase and superoxide dismutase in the cells of strictly anaerobic microorganisms. Microbiol. (Russ. Acad. Sci.) 71, 330–335 (2002).
Google Scholar
Sebaihia, M. et al. The multidrug-resistant human pathogen Clostridium difficile has a highly mobile, mosaic genome. Nat. Genet. 38, 779–786 (2006).
Article PubMed CAS Google Scholar
Kanehisa, M. et al. Data, information, knowledge and principle: back to metabolism in KEGG. Nucleic Acids Res. 42, D199–D205 (2014).
Article CAS PubMed Google Scholar
Buchfink, B., Xie, C. & Huson, D. H. Fast and sensitive protein alignment using DIAMOND. Nat. Methods 12, 59–60 (2015).
Article CAS PubMed Google Scholar
Kieft, K., Zhou, Z. & Anantharaman, K. VIBRANT: automated recovery, annotation and curation of microbial viruses, and evaluation of viral community function from genomic sequences. Microbiome 8, 90 (2020).
Article PubMed PubMed Central Google Scholar
Gregory, A. C. et al. Marine DNA viral macro- and microdiversity from pole to pole. Cell 177, 1109–1123 (2019).
Article CAS PubMed PubMed Central Google Scholar
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 (2013).
Article CAS PubMed PubMed Central Google Scholar
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009).
Article PubMed PubMed Central CAS Google Scholar
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Article CAS PubMed PubMed Central Google Scholar
Chaumeil, P.-A., Mussig, A. J., Hugenholtz, P. & Parks, D. H. GTDB-Tk: a toolkit to classify genomes with the genome taxonomy database. Bioinformatics 36, 1925–1927 (2020).
CAS Google Scholar
Price, M. N., Dehal, P. S. & Arkin, A. P. FastTree 2–approximately maximum-likelihood treesfor large alignments. PLoS ONE 5, e9490 (2010).
Article ADS PubMed PubMed Central CAS Google Scholar
Kumar, S., Stecher, G., Li, M., Knyaz, C. & Tamura, K. MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 35, 1547–1549 (2018).
Article CAS PubMed PubMed Central Google Scholar
Letunic, I. & Bork, P. Interactive Tree Of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. 47, W256–W259 (2019).
Article CAS PubMed PubMed Central Google Scholar
Kwak, S. G. & Kim, J. H. Central limit theorem: the cornerstone of modern statistics. Korean J. Anesthesiol. 70, 144–156 (2017).
Article PubMed PubMed Central Google Scholar
Virtanen, P. et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020).
Article CAS PubMed PubMed Central Google Scholar
R: A language and environment for statistical computing (R Foundation for Statistical Computing, Vienna, Austria, 2013).
Chok, N. S. Pearson’s versus Spearman’s and Kendall’s correlation coefficients for continuous data Master of Science thesis, University of Pittsburgh, (2010).
Jian, H., Xu, G., Gai, Y., Xu, J. & Xiao, X. The histone-like nucleoid structuring protein (H-NS) is a negative regulator of the lateral flagellar system in the deep-sea bacterium Shewanella piezotolerans WP3. Appl. Environ. Microbiol. 82, 2388–2398 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Wang, F. et al. Environmental adaptation: genomic analysis of the piezotolerant and psychrotolerant deep-sea iron reducing bacterium Shewanella piezotolerans WP3. PLoS ONE 3, e1937 (2008).
Article ADS PubMed PubMed Central CAS Google Scholar
Jian, H., Xu, J., Xiao, X. & Wang, F. Dynamic modulation of DNA replication and gene transcription in deep-sea filamentous phage SW1 in response to changes of host growth and temperature. PLoS ONE 7, e41578 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Chin, C.-S. et al. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods 10, 563–569 (2016).
Article CAS Google Scholar
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
Article PubMed PubMed Central CAS Google Scholar
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).
Article CAS PubMed PubMed Central Google Scholar
Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinform. 12, 323 (2011).
Article CAS Google Scholar
Wang, L., Feng, Z., Wang, X., Wang, X. & Zhang, X. DEGseq: an R package for identifying differentially expressed genes from RNA-seq data. Bioinformatics 26, 136–138 (2010).
Article PubMed CAS Google Scholar
Trapnell, C. et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat. Biotechnol. 28, 511–515 (2010).
Article CAS PubMed PubMed Central Google Scholar
Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2010).
Article CAS PubMed Google Scholar
Gao, H. et al. Reduction of nitrate in Shewanella oneidensis depends on atypical NAP and NRF systems with NapB as a preferred electron transport protein from CymA to NapA. ISME J. 3, 966–976 (2009).
Article CAS PubMed Google Scholar
Lenski, R. E., Rose, M. R., Simpson, S. C. & Tadler, S. C. Long-term experimental evolution in Escherichia coli. I. Adaptation and divergence during 2000 generations. Am. Naturalist 138, 1315–1341 (1991).
Article Google Scholar

Download references

Acknowledgements

This work was financially supported by the National Key R&D Program of China (Grant no. 2018YFC0309800 to X.X.) and the National Natural Science Foundation of China (Grant no. 91851113 to H.J.; grants no. 41921006 and 41776173 to X.X.).

Author information

These authors contributed equally: Huahua Jian, Guanpeng Xu, Yi Yi.

Authors and Affiliations

State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Development Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
Huahua Jian, Guanpeng Xu, Yi Yi, Yali Hao, Yinzhao Wang, Siyuan Wang, Shunzhang Liu, Canxing Meng, Jiahua Wang, Yue Zhang, Xiaoyuan Feng, Zhijun Wang, Zixin Deng & Xiang Xiao
Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, China
Huahua Jian & Xiang Xiao
Key Laboratory of Combinatorial Biosynthesis and Drug Discovery, Ministry of Education, School of Pharmaceutical Sciences, Wuhan University, Wuhan, China
Lei Xiong, Chao Chen & Lianrong Wang
Simon F. S. Li Marine Science Laboratory, School of Life Sciences, The Chinese University of Hong Kong, Hong Kong, China
Xiaoyuan Feng, Haiwei Luo & Hao Zhang
Grandomics Biosciences, Wuhan, China
Xingguo Zhang

Authors

Huahua Jian
View author publications
You can also search for this author in PubMed Google Scholar
Guanpeng Xu
View author publications
You can also search for this author in PubMed Google Scholar
Yi Yi
View author publications
You can also search for this author in PubMed Google Scholar
Yali Hao
View author publications
You can also search for this author in PubMed Google Scholar
Yinzhao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Lei Xiong
View author publications
You can also search for this author in PubMed Google Scholar
Siyuan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Shunzhang Liu
View author publications
You can also search for this author in PubMed Google Scholar
Canxing Meng
View author publications
You can also search for this author in PubMed Google Scholar
Jiahua Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yue Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Chao Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoyuan Feng
View author publications
You can also search for this author in PubMed Google Scholar
Haiwei Luo
View author publications
You can also search for this author in PubMed Google Scholar
Hao Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Xingguo Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Lianrong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zhijun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zixin Deng
View author publications
You can also search for this author in PubMed Google Scholar
Xiang Xiao
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.X. and H.J. conceived and designed the research; G.X., Y.H., S.W., S.L., and C.M. performed the microbiological experiments; Y.Y., H.J., Y.W., J.W., Y.Z., X.F., H.L., and H.Z. performed the bioinformatic analysis; L.X., C.C., L.W., and Z.W. analyzed the PT phenotype; X.Z. helped in analyzing SMRT sequencing data; H.J., G.X. and Y.Y. analyzed and interpreted the data; H.J. wrote the manuscript; Z.D. provided useful comments to improve the manuscript; X.X. supervised the project. All the authors reviewed the results and approved the manuscript.

Corresponding author

Correspondence to Xiang Xiao.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Peter Weigele, and the other, anonymous, reviewer for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Dataset 1

Dataset 2

Dataset 3

Dataset 4

Reporting Summary

Source data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Jian, H., Xu, G., Yi, Y. et al. The origin and impeded dissemination of the DNA phosphorothioation system in prokaryotes. Nat Commun 12, 6382 (2021). https://doi.org/10.1038/s41467-021-26636-7

Download citation

Received: 25 November 2018
Accepted: 18 October 2021
Published: 04 November 2021
DOI: https://doi.org/10.1038/s41467-021-26636-7

This article is cited by

Unexplored diversity and ecological functions of transposable phages
- Mujie Zhang
- Yali Hao
- Huahua Jian
The ISME Journal (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.