The population structure of Clostridium tetani deduced from its pan-genome

Chapeton-Montes, Diana; Plourde, Lucile; Bouchier, Christiane; Ma, Laurence; Diancourt, Laure; Criscuolo, Alexis; Popoff, Michel Robert; Brüggemann, Holger

doi:10.1038/s41598-019-47551-4

Download PDF

Article
Open access
Published: 02 August 2019

The population structure of Clostridium tetani deduced from its pan-genome

Scientific Reports volume 9, Article number: 11220 (2019) Cite this article

9591 Accesses
15 Citations
2 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 19 November 2019

This article has been updated

Abstract

Clostridium tetani produces a potent neurotoxin, the tetanus neurotoxin (TeNT) that is responsible for the worldwide neurological disease tetanus, but which can be efficiently prevented by vaccination with tetanus toxoid. Until now only one type of TeNT has been characterized and very little information exists about the heterogeneity among C. tetani strains. We report here the genome sequences of 26 C. tetani strains, isolated between 1949 and 2017 and obtained from different locations. Genome analyses revealed that the C. tetani population is distributed in two phylogenetic clades, a major and a minor one, with no evidence for clade separation based on geographical origin or time of isolation. The chromosome of C. tetani is highly conserved; in contrast, the TeNT-encoding plasmid shows substantial heterogeneity. TeNT itself is highly conserved among all strains; the most relevant difference is an insertion of four amino acids in the C-terminal receptor-binding domain in four strains that might impact on receptor-binding properties. Other putative virulence factors, including tetanolysin and collagenase, are encoded in all genomes. This study highlights the population structure of C. tetani and suggests that tetanus-causing strains did not undergo extensive evolutionary diversification, as judged from the high conservation of its main virulence factors.

Comparative in silico genome analysis of Clostridium perfringens unravels stable phylogroups with different genome characteristics and pathogenic potential

Article Open access 24 March 2021

Spatiotemporal persistence of multiple, diverse clades and toxins of Corynebacterium diphtheriae

Article Open access 08 March 2021

Ancient Clostridium DNA and variants of tetanus neurotoxins associated with human archaeological remains

Article Open access 06 September 2023

Introduction

Tetanus is a worldwide neurological disease of man and animals, characterized by spastic paralysis of skeletal muscles. Neonatal tetanus is currently the most prevalent form in humans with estimated 34,000 deaths in 2015¹. The disease is caused by the tetanus toxin, solely produced by the Gram-positive, anaerobic, spore-forming species C. tetani, spores of which are widespread in the environment where they survive for long periods of time.

The tetanus toxin (TeNT) gene tent has been cloned and sequenced in 1986^2,3, and it has been found to be localized on a large plasmid⁴. The mode of action was characterized, i.e. the TeNT metalloprotease activity towards the SNARE protein VAMP/synaptobrevin^5,6 and its axonal retrograde transport to the central nervous system was analyzed in detail^7,8. More recently, nidogens were found to mediate TeNT binding to neuronal cells and subsequent internalization into neurons⁷.

Tetanus is a preventable disease and immunization with tetanus toxoid-containing vaccines are safe and confer efficient and long-term protection⁹. Large scale worldwide vaccination programs are supported by the World Health Organization to prevent tetanus⁹. Only one TeNT type is known which is currently used for vaccine preparation. The high efficiency of this vaccine suggests that all tetanus-causing C. tetani strains produce an identical or very similar toxin. Investigations on possible TeNT variations are based on genetic and genomic analyses of C. tetani strains. The first whole genome sequence of the toxigenic strain E88 was reported in 2003¹⁰. Up to now, only 12 strains have been sequenced at present^11,12,13. The study of Cohen et al. focuses on U.S. vaccine strains of C. tetani, but no larger effort was undertaken to assess genomic diversity among tetanus-causing strains from various geographic locations and isolated at different times. Here we report the genomic analysis of 26 newly sequenced C. tetani strains from recent and ancient periods. Our work highlights the conservation of the chromosome and the high heterogeneity of the toxin-encoding plasmid, as well as reveals the population structure of the species with two phylogenetically distinct clades. In addition, we found that albeit a strong conservation of TeNT sequences, five out of 38 C. tetani strains encode a toxin containing four additional amino acids in the receptor-binding domain that might impact the receptor-binding properties and antigenicity.

Results

Strain selection and whole genome sequencing

Whole genome sequencing (WGS) was performed for 26 C. tetani strains. These including 11 strains from the National Reference Center (NRC) of Anaerobic Bacteria (Institut Pasteur, France), isolated in the time period 1984–2017, two strains from Sanofi (France), and 13 strains from the Prevot’s collection (Institut Pasteur, France), isolated in the time period 1955–1965; an exception was the Harvard strain, which was isolated before 1949 (Table 1). Most of the strains have been isolated from human wounds, except one strain from cheese and one from a cat wound. WGS of these strains resulted in 26 draft genomes with 30 to 68 contigs (on average 46 contigs). The genome size varied from 2.735 to 2.951 Mb, on average 2.844 Mb, which is almost identical to the average size of the 12 previously sequenced C. tetani genomes (2.841 Mb; Table 2).

Table 1 Newly sequenced Clostridium tetani strains of this study and previously sequenced strains, their originand sequencing data.

Full size table

Table 2 Previously sequenced strains.

Full size table

Phylogenomic analysis of C. tetani

The genome sequences of the 26 strains and of the previously sequenced strains (Tables 1 and 2) were phylogetically analyzed by calling single nucleotide polymorphisms (SNPs) within the core genome using the tool ParSnp. According to this analysis, the core genome comprises 77% of the reference genome (strain E88), with a total number of 94,816 SNPs (Fig. 1). The SNP analysis revealed that C. tetani strains can be separated into two clades: clade 1 comprises the large majority of strains (33 strains), and clade 2, as a minor clade, comprises five strains (strains 778-17, 12124569, 184-08, 3483 and 63-05) (Fig. 1). Clade 1 strains can be further distinguished based on their SNPs into eight subclades (1A to 1H). About half of all clade 1 strains belong to the subclade 1A (15 strains), including the Harvard/Massachusetts-derived strains that are used in vaccine production and more recent isolates. These subclade 1A strains include strain A, E88, CN655, and ATCC19406. They all originate from strain Harvard, and are thus redundant. Among subclade 1A strains are also more recently isolated strains, such as strain C2, 4784-A, strain 3, 1586, that are independent isolates from the Prevot’s collection, and strains 75–97 and 407-86 that are more recent isolates from the anaerobic laboratory at the Institut Pasteur. Subclade 1A strains are highly similar, with only 292 SNPs in total (Fig. 1). The other subclades within clade 1 comprise one to three strains only. Subclades within clade 1 that are most distant to subclade 1A are subclade 1G, containing two strains, including an Indian isolate (strain Mfbjulcb2), and subclade 1 H, containing only strain 2017.061; this strain could represent a hybrid of clade 1 and 2 strains, or a founder of these two clades.

The phylogenomic analysis did not reveal an obvious separation of strains based on their geographical origin, disease association or isolation time, since recently isolated strains and those isolated 50–70 years ago are distributed in different (sub)clades.

Diversity of the tetanus toxin-encoding plasmid

All draft genomes were searched against the circular tetanus toxin-encoding plasmid pE88 from strain E88 (Brüggemann et al., 2003). This revealed that the plasmid sequence was not conserved in most strains. Only subclade 1A strains contained a plasmid identical to, or highly similar with pE88 (Fig. 2A). Exceptions were three subclade 1A strains (ATCC19406, 1586-Z1 and 407-86): no plasmid sequences were detected in the genome assemblies of these strains.

All other subclades of clade 1 as well as clade 2 strains contained plasmids that differed from pE88 (Fig. 2B–E). Even within some subclades of clade 1, i.e. subclades 1B, 1 C and 1D, strains differed regarding their plasmid sequences. Most distant to pE88 were the plasmids of clade 2 strains, as well as strains of subclades 1E to 1 H. The estimated plasmid sizes in these strains varied from 52.6 kb to 78.2 kb. This is an estimation, based on the sum of all contigs that showed similarity to previously sequenced C. tetani plasmids. For four strains (one strain of each subclade 1A, 1B, 1G and clade 2), a single contig was found that corresponds very likely to the complete plasmid (Table 1).

Next, we searched the plasmid sequences for the most important plasmid genes, i.e. the toxin gene tent and the upstream regulatory gene tetR which encodes for an alternative sigma factor¹⁴. The tent and tetR genes were found in all plasmid-carrying strains, except for the three strains of subclade 1E and one strain of subclade 1D (ATCC454). The plasmid in the latter strain was exceptional, showing a large deletion. TeNT was identical on protein level in all subclade 1A strains; in all other strains there were a few variations that corresponded largely with the different subclades (Fig. 3A). Strains of subclade 1 F carried the most distant TeNT variant compared to subclade 1A. Strains of subclade 1 F, 1 H, as well as clade 2 strain 778.17 encode a TeNT with four amino acids in the C-terminal domain (Fig. S1). This amino acid insertion has been confirmed by PCR amplification of the 3′ part of tent and subsequent sequencing (Fig. S2). Regarding additional tent-regulating elements, the plasmid of clade 1A strains encodes a two-component system (TCS), which has been found to positively control TeNT synthesis (manuscript in preparation); the TCS genes are present only in clade 1 strains of the subclades 1A, 1B and 1 C.

Interestingly, all plasmid-carrying strains, including the clade 2 strains contained the gene colT, encoding a collagenase, a putative virulence factor that is possibly involved in tissue colonization¹⁰. Only one additional gene was found to be conserved in all plasmids: ctc_p19, encoding a 516 amino acid protein of unknown function, but with homology to proteins of the replication initiator protein A (RPA) family, suggesting that this protein is essential for plasmid replication. A phylogenetic analysis of all ctc_p19 homologs revealed that the variation of the putative RPA corresponded largely with the separation of the different (sub)clades (Fig. 3B). This result suggests that in the majority of strains the plasmid was not recently acquired, but evolved together with the chromosome. An exception was the plasmid in the clade 2 strain 12124695, whose RPA sequence was more similar to those of clade 1 strains. The RPA sequences of the other four clade 2 strains (778-17, 184-08, 3483 and 63-05) as well as the subclade 1 H strain 2017.061 showed the most diverged RPA variant compared to subclade 1A RPA homologs. There were 13 amino acid replacements, including five radical replacements that are distributed over the entire protein length.

When the previously sequenced plasmid of the clade 2 strain 12124569 was taken as reference, it could be shown that this plasmid is not conserved among clade 2 strains (Fig. 2B). The majority of clade 2 strains possess a plasmid identical or highly similar to the one found in strain 3483 (Fig. 2C); this plasmid contains a clade 2-specific region of 9 kb downstream of the tetR-tent locus, encoding a type III restriction-modification (RM) system. The clade 1G strain 132CV contained a plasmid that is shared with the other clade 1G strain (Mfbjulcb2), and to some extent with strains of clades 1 F and 1D (Fig. 2D). These plasmids shared for example a 10 kb region, located in direct proximity to colT; it encodes a cluster of proteins with unknown functions, including proteins with predicted peptidase and transport activities. The plasmid of the clade 1B strain TMB2 contained a smaller plasmid (Fig. 2E), with a strain-specific region of 8 kb, encoding a type II restriction-modification system.

Taken together, the data show the high variability of the plasmid among C. tetani strains; however, a strong conservation can be seen regarding TeNT, TetR, ColT and the putative replication initiator protein.

Conservation of chromosomal genes

The sequenced C. tetani genomes are similar in size (Tables 1 and 2). Annotation of the genomes resulted in 2,654 to 2,960 coding sequences (CDS) per strain. The pan-genome encodes 3,915 CDS, i.e. the total number of detected CDS in the C. tetani population (Table S1). In total, 1,266 CDS (32% of CDS) are shared by all strains, and 2,292 (58% of CDS) by 90% of all strains. There are no strain-specific genes; 411 CDS are present in two strains only, and mainly comprise phage-related CDS.

Clade 2 strains carry on average 118 CDS more than clade 1 strains (2,900 and 2,782 CDS, respectively; Table S1). Clade 1-specific CDS can be identified, whose genes are mostly located in genomic islands scattered around the genome (Fig. 4). Ten islands were found that are largely subclade 1A-specific, thus not only missing in clade 2 but also in subclade 1B-1H strains (Fig. 4A, Table S2A). These include three (cryptic) prophages or fragments thereof, a plasmid-like element carrying a toxin-antitoxin system, two CRISPR/cas loci, a gene cluster encoding surface-layer proteins, an iron transport system and a putative cell wall/spore coat/envelope/membrane modification system. Genomes of other clade 1 strains contained different genomic islands that carry similar functions. For example, COR1, a subclade 1E strain harbors several islands that are only shared with another subclade 1E strain (strain 329) (Fig. 4B, Table S2B); these encode functions related to restriction-modification systems, surface modification, CRISPR/cas systems and prophages. Additional smaller clusters and single genes in subclade 1E strains are predicted to encode proteins for chemotaxis, multi-antimicrobial extrusion protein (Na⁺/drug antiporter), phosphate regulon regulator and sensor (PhoR, PhoB), cell wall-binding and adhesion proteins, and DNA modification. Clade 2 genomes harbor only a few clade-specific islands, containing genes related to (pro)phages, cell surface modification, metabolism of aromatic compounds, transport and DNA methylation (Fig. 4C, Table S2C). However, there is heterogeneity among clade 2 strains, in particular regarding the presence of prophages. Apart from these genomic islands, several strain-specific genes can be found, mostly related to transposases, cell-surface/adhesion/S-layer proteins, RM systems, ABC transport systems and efflux pumps, two-component systems, resistance determinants and defense/repair functions, and specific metabolic functions (Table S1).

Since CRISPR/cas systems can contribute to the diversification within a species, we had a closer look: strain E88 and subclade 1A derivate strains contain two loci (locus numbers 5 and 6 in Fig. 4A). One locus is composed of the cas genes cas6, csx8 (cas8a1), devR (cas7), cas5, cas3, cas4, cas1, cas2 (a type I-B-like system) with a repeat consensus 5′-GTATTAGTAGCACCATATTGGAATGTAAAT-3′; the other system is composed of cas6, csh1 (cas8a), csh2 (cas7), cas5, cas3, cas4, cas1 and cas2 (another type I–B-like system) with a repeat consensus 5′-ATTTAAATACAACTCTTGTTATTGTTCAAC-3′. In a few other genomes (1E strains COR1 and 329; 1 F strains 202-15, 358-99, 512-15) there is a type II-like system (locus number 4 in Fig. 4B) with the genes cas2, cas1, cas9 (csn1) harbouring the repeat consensus 5′-GTTATAGTTCCTAGTAAATTCTCCATATGCTATAAT-3′. This is in agreement with a previous study¹².

Additional host-interacting or virulence factors

Tetanolysin

All genomes carry a gene for tetanolysin (TetO), which is highly conserved; it is identical on protein level in 25 strains that all belong to clade 1. Overall, only minor variations are seen in the N-terminal part (8 amino acid changes and 11 conservative substitutions) among the 37 strains analyzed. All TetO homologs of clade 2 strains are distinct (data not shown).

Surface-attached proteins

Most clostridia contain a surface layer which is organized in a paracrystalline array surrounding the cell. Surface layers are usually composed of one or two surface-layer proteins (SLPs) or glycoproteins, which are expressed at very high levels and are anchored in the peptidoglycan of the cell wall. Clostridial SLPs have been found to retain one or several conserved motifs called clostridial cell wall binding repeat 2 (CWB2; Pfam 04122)¹⁵, usually located at the N-termini. In total, 19 CDSs with at least one copy of the characteristic CWB2 domain were identified in clade 1A genomes; in clade 1E and clade 2 strains 20 such CDSs are encoded per genome. The genomes of type 1E strains contain a genomic insertion that harbors genes for surface-attached proteins, including a CWB2 domain protein (Fig. 5A).

The main SLP in C. tetani was previously identified (locus tag CTC_RS02355)¹⁶. A BLAST search of this SLP against all other clostridial genomes showed some heterogeneity among strains, which indicates clade- and to some extent strain-specific surface structures (Fig. 5B). Clade 1A strains share a highly similar SLP; this SLP variant has three CWB2 repeats in the N-terminus and no known domain in the C-terminus. Also strains of clade 2 and clade 1B, 1G and 1 H, and most strains of clades 1 C and 1 F have this variant, with some minor substitutions in the C-terminus. However, other strains such as all clade 1E strains and some from other clades (1 C, 1D, 1 F) have different SLP variants. The difference is largely restricted to the C-terminus; interestingly, the SLP variant of clade 1E strains contains a bacterial IG-like domain in the C-terminus. In addition, every clade 2 strain has a second SLP variant; currently it is not known if both SLP variants are produced.

Discussion

Despite a low amino acid sequence identity (21 to 41%) with botulinum neurotoxins (BoNTs), TeNT retains a common structure and enzymatic mode of action with BoNTs^17,18,19. Interestingly, TeNT shows the highest level of identity (41%) with BoNT type B and both neurotoxins cleave the same substrate (synaptobrevin) at the same cleavage site¹⁹. It is likely that TeNT and BoNTs derive from a common ancestor gene. However, tent and bont genes have diverged in the course of evolution²⁰. Indeed, bont is located in an operon with ntnh that results from a bont duplication and that encodes the non-toxic non-hemagglutinin (NTNH) protein^21,22,23. NTNH retains a similar structure to that of BoNT but the zinc-proteolytic site is lacking^24,23. In addition, the ntnh-bont operon is associated with either a hemagglutinin (ha) or orfX operon, and BoNT combines with non-toxic proteins to form complexes of various sizes (review in^25,26). In contrast, tent is unique and does not show any duplication or operon association with genes encoding non-toxic proteins. Only, the regulatory gene tetR, that is related to botR, lies upstream of tent¹⁴. Moreover, in contrast to BoNTs that display large sequence variations and are distributed in 10 types and more than 40 subtypes^27,28, TeNT shows a remarkable genetic stability in toxigenic C. tetani strains.

At the protein level, TeNT amino acid changes can be found at 50 positions, but half correspond to conservative substitutions (Fig. S1). The most striking difference is an insertion of four amino acids (‘NSES/Y’), at position 1137 of TeNT in strains of subclades 1 F, 1 H and in one clade 2 strain (778-17). This insertion lies within the C-terminal receptor binding domain (pfam07951), indicating that those strains might have different receptor-binding properties and antigenicity. The zinc-dependent proteolytic site (HELIH) as well as the residues involved in the binding to ganglioside receptor (D1222, H1271, SNWY1290, G1300, and R1226)²⁹ are conserved in all TeNT sequences. An exception is H1271 that is replaced by R1271 in clade 2 strain 12124569. This indicates that all TeNT variants retain the same functional sites.

Until now, tent has been found only in C. tetani strains, whereas BoNTs are produced by diverse clostridial species and a few other bacterial species^{27,28,30,31,32}. C. tetani strains show a high level of genomic conservation with a core genome of about 77%. Most of the strains (32 of 37) are distributed in one clade (clade 1) with further subdivision in eight subclades (1A to 1 H). Most strains sequenced to date are subclade 1A strains. Thus, previous knowledge about C. tetani is very much restricted to those strains; all strains used for tetanus toxoid production are clade 1A strains. Five strains, historical and recent isolates, are more distantly related and constitute a second clade (clade 2). The main genomic variations include genes of prophages, cell surface proteins, and DNA modification and defence (RM, CRISPR/cas systems). The strains of the different clades and subclades do often not share parameters such as isolation time or geographic origin. This indicates no obvious evolutionary trend over time and space.

C. tetani and BoNT-producing clostridia are spore-forming bacteria, the main habitat of which is the environment. It is intriguing to better understand the environmental factors and selective pressures involved in the high genetic diversity of BoNT-producing clostridia, in contrast to the the conservation of C. tetani, respectively.

The gene tent is localized on large plasmids in C. tetani, that in contrast to the chromosomes show a high variability. Interestingly, bontB, which encodes the most related BoNT to TeNT, is also frequently localized on plasmids in C. botulinum^33,34. Interestingly, it was recently shown that BoNT-encoding plasmids can be transferred by conjugation between various Clostridium species³⁵. Here, no relatedness between C. tetani and C. botulinum plasmids was observed. Phylogenetic comparison of the putative replication initiator protein homologs encoded in all C. tetani plasmids indicates coevolution of plasmid and chromosome, since a similar separation into clades and subclades can be seen for most strains (Figs 1 and S1B). This also indicates that there was no recent plasmid acquisition by horizontal gene transfer, at least not between phylogenetically distant C. tetani strains. However, several strain-specific genes and regions can be found on the plasmids of individual strains. This highlights the evolutionary more dynamic nature of the plasmid, which can serve as a sink for horizontally transferred genes.

Only few additional virulence factors are known for C. tetani, such as the chromosomally encoded tetanolysin TetO and, putatively, the plasmid-encoded collagenase ColT; these two factors are highly conserved in all strains. TetO is a pore-forming toxin, which attacks macrophages and thus facilitates the early step of wound colonization by C. tetani³⁶. The N-terminal domain of cholesterol-dependent cytolysins including TetO is not directly involved in the interaction with the membrane receptor and the mechanism of pore formation; it likely stabilizes the toxin monomers and contributes to oligomer formation³⁷. Thus, the eight here detected amino acid changes in this domain (data not shown) seem not to be critical for toxin activity. The collagenase ColT also shows a high level of similarity at the amino acid level (94 to 100%) (data not shown). The minor variations in ColT reflect the clade distribution of the genome.

A surface-exposed proteineous layer, consisting of regularly assembled SLPs is common in Gram-negative and Gram-positive bacteria. Most clostridial species contain one or two SLPs. For example, C. difficile produces two main SLPs, but contains many (29) cell wall proteins (CWPs), harboring the characteristic CWB2 motif³⁸. C. tetani strains possess 19 or 20 CWB2-containing proteins, but some variation exists between strains that affect mostly the C-termini of such proteins, that are exposed into the extracellular space. Surface-attached proteins have multiple roles; notably, in C. difficile they have been found to mediate bacterial adhesion to the intestinal mucus layer and gastrointestinal cells as well as to induce a pro-inflammatory innate immune signaling via toll-like receptor 4 (TLR4)^39,40. C. tetani likely uses the surface-attached proteins in wound colonization. The numerous proteins and their diversity possibly reflect specific abilities and strain-specific variations to colonize tissues and, possibly, a strategy to divert host recognition. As for other bacteria such as C. difficile³⁹, SLPs could be used as vaccine candidates to prevent C. tetani colonization in addition to the tetanus toxoid.

Materials and Methods

Bacterial strains

The C. tetani strains used for this study are listed in Table 1. C. tetani strains were grown in TGY broth (pH 7.5) containing trypticase (Trypticase-Glucose Yeast Peptone BBL, BD Biosciences; 30 g/L), yeast extract (Bacto Yeast Extract, BD Biosciences; 20 g/L), glucose (5 g/L) and cystein, HCl (0.5 g/L) under anaerobic conditions.

DNA extraction and genome sequencing

Genomic DNA from all strains of C. tetani was extracted and purified as previously described^41,42. Whole genome sequencing (WGS) using the NEXTflex^® PCR-Free DNA-Seq kit for Illumina Plaforms (Bioo Scientific Corporation) were performed using MiSeq device (Illumina) in paired-end reads of 250 or 300 bases. Sequence files were generated using Illumina Analysis Pipeline version 1.8 (CASAVA). Reads were trimmed using a quality control pipeline⁴³. The assembly of sequence reads was performed using SOAPdenovo (version 1.05).

Genome comparison, phylogenomic and other bioinformatic analyses

For phylogenomic analyses, the core genome was identified and aligned with Parsnp, a program that is part of the Harvest software package⁴⁴. Parsnp aligns microbial genomes based on a suffix graph data structure; the output is a core-genome alignment that contains all SNPs, Indels, and structural variation within the core genome. Parsnp is further quality-filtering SNPs; only reliable core-genome SNPs are considered for reconstruction of the whole-genome phylogeny that can be visualized with Gingr, another program of the Harvest software package⁴⁴.

Gene prediction and annotation of all genomes were performed with RAST⁴⁵. Phylogenetic trees were visualized using Mega v7⁴⁶ and Interactive Tree Of Life (iTOL; https://itol.embl.de/). For comparative genome analyses and visualization, the program BRIG was used⁴⁷. To determine orthologous genes among the C. tetani strains we used the tool Proteinortho⁴⁸.

Sequence accession

The genome sequence accession numbers are: 1240, QMBG00000000; 132CV, QMAZ00000000; 1337, QMAN00000000; 157-15, QMAR00000000; 1586-U1, QMBI00000000; 1586-Z1, QMBH00000000; 2017-061, QMAP00000000; 202-15; QMAQ00000000; 329, QMBF00000000; 3483, QMDR00000000; 3582, QMBA00000000; 358-99, QMAV00000000; 407-86, QMAX00000000; 46-1-08, QMAT00000000; 4784A, QMBJ00000000; 512-15, QMBE00000000; 63-05, QMAU00000000; 641-84, QMAY00000000; 75-97, QMAW00000000; 778-17, QMAO00000000; 89-12, QMAS00000000; B4, QMBD00000000; COR1, QMBC00000000; Harvard, QMBL00000000; Strain-3, QMBK00000000; TMB2, QMBB00000000.

Previously sequenced strains have these accession numbers: E88, AE015927.1 (chromosome) and AF528097.1 (plasmid); CN655, JSWC00000000; strain A, JWIX00000000; strain C2, JRGG00000000; ATCC19406, JRGJ00000000 and FUWT00000000; ATCC9441, JRGH00000000; ATCC454, LBNB00000000; ATCC453, JRGI00000000; Mfbjulcb2, CP027782.1; 184.08, JSWD00000000; 12124569, HG530135.1 (chromosome) and HG530136.1 (plasmid).

Change history

19 November 2019
An amendment to this paper has been published and can be accessed via a link at the top of the paper.

References

Burgess, C. et al. Eliminating maternal and neonatal tetanus and closing the immunity gap. Lancet 389, 1380–1381 (2017).
Article PubMed Google Scholar
Fairweather, N. F. & Lyness, V. A. The complete nucleotide sequence of tetanus toxin. Nucleic. Acids Res. 14, 7809–7812 (1986).
Article CAS PubMed PubMed Central Google Scholar
Eisel, U. et al. Tetanus toxin: primary structure, expression in E. coli, and homology with botulinum toxins. EMBO J. 5, 2495–2502 (1986).
Article CAS PubMed PubMed Central Google Scholar
Finn, C. W. et al. The structural gene for tetanus neurotoxin is on a plasmid. Science 224, 881–884 (1984).
Article ADS CAS PubMed Google Scholar
Schiavo, G. et al. Tetanus toxin is a zinc protein and its inhibition of neurotransmitter release and protease activity depend on zinc. EMBO J. 11, 3577–3583 (1992).
Article CAS PubMed PubMed Central Google Scholar
Schiavo, G., Rossetto, O., Santucci, A., DasGupta, B. R. & Montecucco, C. Botulinum neurotoxins are zinc proteins. J. Biol. Chem. 267, 23479–23483 (1992).
Article CAS PubMed Google Scholar
Bercsenyi, K. et al. Tetanus toxin entry. Nidogens are therapeutic targets for the prevention of tetanus. Science 346, 1118–1123 (2014).
CAS PubMed Google Scholar
Surana, S. et al. The travel diaries of tetanus and botulinum neurotoxins. Toxicon 147, 58–67 (2018).
Article CAS PubMed Google Scholar
WHO. Tetanus vaccines: WHO position paper - February 2017. Wkly Epidemiol Rec 92, 53–76 (2017).
Google Scholar
Brüggemann, H. et al. The genome sequence of Clostridium tetani, the causative agent of tetanus disease. Proc. Ntl. Acad. Sci. (USA) 100, 1316–1321 (2003).
Article ADS CAS Google Scholar
Fournier, P. E. et al. Genome of a chronic osteitis-causing Clostridium tetani. New Microbes New Infect 2, 25–26 (2014).
Article PubMed PubMed Central Google Scholar
Cohen, J. E., Wang, R., Shen, R. F., Wu, W. W. & Keller, J. E. Comparative pathogenomics of Clostridium tetani. PLoS One 12, e0182909 (2017).
Article PubMed PubMed Central CAS Google Scholar
Brüggemann, H. et al. Genomics of Clostridium tetani. Res Microbiol 166, 326–331 (2015).
Article CAS PubMed Google Scholar
Marvaud, J. C., Eisel, U., Binz, T., Niemann, H. & Popoff, M. R. tetR is a positive regulator of the Tetanus toxin gene in Clostridium tetani and is homologous to botR. Infect. Immun. 66, 5698–5702 (1998).
Article CAS PubMed PubMed Central Google Scholar
Fagan, R. P. et al. A proposed nomenclature for cell wall proteins of Clostridium difficile. J Med Microbiol 60, 1225–1228 (2011).
Article CAS PubMed Google Scholar
Qazi, O. et al. Identification and characterization of the surface-layer protein of Clostridium tetani. FEMS Microbiol Lett 274, 126–131 (2007).
Article CAS PubMed Google Scholar
Masuyer, G., Conrad, J. & Stenmark, P. The structure of the tetanus toxin reveals pH-mediated domain dynamics. EMBO Rep 18, 1306–1317 (2017).
Article CAS PubMed PubMed Central Google Scholar
Minton, N. P. Molecular genetics of clostridial neurotoxins. Curr. Top. Microbiol. Immunol. 195, 161–194 (1995).
CAS PubMed Google Scholar
Schiavo, G. et al. Tetanus and Botulinum-B neurotoxins block neurotransmitter release by proteolytic cleavage of synaptobrevin. Nature (London) 359, 832–835 (1992).
Article ADS CAS Google Scholar
Mansfield, M. J. & Doxey, A. C. Genomic insights into the evolution and ecology of botulinum neurotoxins. Pathog Dis. 76(4), 4978416 (2018).
Article CAS Google Scholar
Doxey, A. C., Lynch, M. D., Muller, K. M., Meiering, E. M. & McConkey, B. J. Insights into the evolutionary origins of clostridial neurotoxins from analysis of the Clostridium botulinum strain A neurotoxin gene cluster. BMC Evol Biol 8, 316 (2008).
Article PubMed PubMed Central CAS Google Scholar
Popoff, M. R. & Bouvet, P. Genetic characteristics of toxigenic Clostridia and toxin gene evolution. Toxicon 75, 63–89 (2013).
Article CAS PubMed Google Scholar
Inui, K. et al. Toxic and nontoxic components of botulinum neurotoxin complex are evolved from a common ancestral zinc protein. Biochem Biophys Res Commun 419, 500–504 (2012).
Article CAS PubMed Google Scholar
Gu, S. et al. Botulinum neurotoxin is shielded by NTNHA in an interlocked complex. Science 335, 977–981 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Poulain, B., Molgo, J. & Popoff, M. R. In The Comprehensive Sourcebook of Bacterial Protein Toxins (eds J. Alouf, D. Ladant, & M. R. Popoff) Ch. 11, 287–336 (Elsevier, 2015).
Gu, S. & Jin, R. Assembly and function of the botulinum neurotoxin progenitor complex. Curr Top Microbiol Immunol 364, 21–44 (2013).
CAS PubMed Google Scholar
Peck, M. W. et al. Historical Perspectives and Guidelines for Botulinum Neurotoxin Subtype Nomenclature. Toxins (Basel) 9, (38 (2017).
Google Scholar
Doxey, A. C., Mansfield, M. J. & Montecucco, C. Discovery of novel bacterial toxins by genomics and computational biology. Toxicon 147, 2–12 (2018).
Article CAS PubMed Google Scholar
Rummel, A. Two Feet on the Membrane: Uptake of Clostridial Neurotoxins. Curr Top Microbiol Immunol 406, 1–37 (2017).
CAS PubMed Google Scholar
Mansfield, M. J., Adams, J. B. & Doxey, A. C. Botulinum neurotoxin homologs in non-Clostridium species. FEBS Lett 589, 342–348 (2015).
Article CAS PubMed Google Scholar
Popoff, M. R. Botulinum Neurotoxins: Still a Privilege of Clostridia? Cell Host Microbe 23, 145–146 (2018).
Article CAS PubMed Google Scholar
Zhang, S. et al. Identification of a Botulinum Neurotoxin-like Toxin in a Commensal Strain of Enterococcus faecium. Cell Host Microbe 23, 169–176 e166 (2018).
Article CAS PubMed PubMed Central Google Scholar
Carter, A. T., Austin, J. W., Weedmark, K. A., Corbett, C. & Peck, M. W. Three classes of plasmid (47-63 kb) carry the type B neurotoxin gene cluster of group II Clostridium botulinum. Genome Biol Evol. 6, 2076–2087 (2014).
Article PubMed PubMed Central CAS Google Scholar
Franciosa, G., Maugliani, A., Scalfaro, C. & Aureli, P. Evidence that plasmid-borne botulinum neurotoxin type B genes are widespread among Clostridium botulinum serotype B strains. PLoS One 4, e4829 (2009).
Article ADS PubMed PubMed Central CAS Google Scholar
Nawrocki, E. M., Bradshaw, M. & Johnson, E. A. Botulinum neurotoxin-encoding plasmids can be conjugatively transferred to diverse clostridial strains. Sci Rep 8, 3100 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Keyel, P. A., Heid, M. E. & Salter, R. D. Macrophage responses to bacterial toxins: a balance between activation and suppression. Immunol Res. 50, 118–123 (2011).
Article CAS PubMed PubMed Central Google Scholar
Tweten, R. K., Hotze, E. M. & Wade, K. R. The Unique Molecular Choreography of Giant Pore Formation by the Cholesterol-Dependent Cytolysins of Gram-Positive Bacteria. Annu Rev Microbiol 69, 323–340 (2015).
Article CAS PubMed PubMed Central Google Scholar
Fagan, R. P. & Fairweather, N. F. Biogenesis and functions of bacterial S-layers. Nat Rev Microbiol 12, 211–222 (2014).
Article CAS PubMed Google Scholar
Pechine, S., Bruxelle, J. F., Janoir, C. & Collignon, A. Targeting Clostridium difficile Surface Components to Develop Immunotherapeutic Strategies Against Clostridium difficile Infection. Front Microbiol 9, 1009 (2018).
Article PubMed PubMed Central Google Scholar
Mori, N. & Takahashi, T. Characteristics and Immunological Roles of Surface Layer Proteins in Clostridium difficile. Ann Lab Med 38, 189–195 (2018).
Article CAS PubMed PubMed Central Google Scholar
Popoff, M. R., Guillou, J. P. & Carlier, J. P. Taxonomic position of lecithinase-negative strains of Clostridium sordellii. J. Gen. Microbiol. 131, 1697–1703 (1985).
CAS PubMed Google Scholar
Dineen, S. S., Bradshaw, M. & Johnson, E. A. Neurotoxin gene clusters in Clostridium botulinum type A strains: sequence comparison and evolutionary implications. Cur. Microbiol. 46, 342–352 (2003).
Article CAS Google Scholar
Desvillechabrol, D. et al. Sequanix: a dynamic graphical interface for Snakemake workflows. Bioinformatics 34, 1934–1936 (2018).
Article CAS PubMed PubMed Central Google Scholar
Treangen, T. J., Ondov, B. D., Koren, S. & Phillippy, A. M. The Harvest suite for rapid core-genome alignment and visualization of thousands of intraspecific microbial genomes. Genome Biol 15, 524 (2014).
Article PubMed PubMed Central CAS Google Scholar
Aziz, R. K. et al. The RAST Server: rapid annotations using subsystems technology. BMC Genomics 9, 7 (2008).
Article CAS Google Scholar
Tamura, K., Stecher, G., Peterson, D., Filipski, A. & Kumar, S. MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Mol Biol Evol 30, 2725–2729 (2013).
Article CAS PubMed PubMed Central Google Scholar
Alikhan, N. F., Petty, N. K., Ben Zakour, N. L. & Beatson, S. A. BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons. BMC Genomics 12, 402 (2011).
Article CAS PubMed PubMed Central Google Scholar
Lechner, M. et al. Proteinortho: detection of (co-)orthologs in large-scale analysis. BMC Bioinformatics 12, 124 (2011).
Article PubMed PubMed Central Google Scholar
Levy, P. Y. et al. Clostridium tetani osteitis without tetanus. Emerg Infect Dis. 20, 1571–1573 (2014).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank C. Fund, France Génomique, for technical assistance, and we thank Genomics Platform (PF1), member of “France Génomique” consortium (ANR10-INBS-09-08), for high throughput sequencing. This work was supported by Sanofi-Pasteur and Institut Pasteur.

Author information

Authors and Affiliations

Bacterial Toxins, Institut Pasteur, Paris, France
Diana Chapeton-Montes & Michel Robert Popoff
Sanofi-Pasteur, Marcy l′Etoile, France
Lucile Plourde
Genomic Platform, Biomics, Institut Pasteur, Paris, France
Christiane Bouchier & Laurence Ma
CNR Bactéries anaérobies Botulisme, Institut Pasteur, Paris, France
Laure Diancourt
Hub Bioinformatique Biostatistique, Institut Pasteur, Paris, France
Alexis Criscuolo
Aarhus University, Department of Biomedicine, Aarhus, Denmark
Holger Brüggemann

Authors

Diana Chapeton-Montes
View author publications
You can also search for this author in PubMed Google Scholar
Lucile Plourde
View author publications
You can also search for this author in PubMed Google Scholar
Christiane Bouchier
View author publications
You can also search for this author in PubMed Google Scholar
Laurence Ma
View author publications
You can also search for this author in PubMed Google Scholar
Laure Diancourt
View author publications
You can also search for this author in PubMed Google Scholar
Alexis Criscuolo
View author publications
You can also search for this author in PubMed Google Scholar
Michel Robert Popoff
View author publications
You can also search for this author in PubMed Google Scholar
Holger Brüggemann
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.C. performed experiments, C.B., L.M., L.D. and A.C. were involved in genome sequencing, L.P. provided strains and reagents, H.B. and M.R.P. wrote the manuscript, H.B. performed genomics analysis.

Corresponding author

Correspondence to Michel Robert Popoff.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Table S1

Table S2 Fig S1 S2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chapeton-Montes, D., Plourde, L., Bouchier, C. et al. The population structure of Clostridium tetani deduced from its pan-genome. Sci Rep 9, 11220 (2019). https://doi.org/10.1038/s41598-019-47551-4

Download citation

Received: 26 December 2018
Accepted: 16 July 2019
Published: 02 August 2019
DOI: https://doi.org/10.1038/s41598-019-47551-4

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.