Genomic characteristics and comparative genomics analysis of Salmonella enterica subsp. enterica serovar Thompson isolated from an outbreak in South Korea

Lee, Woojung; Kim, Eiseul; Zin, Hyunwoo; Sung, Soohyun; Woo, Jungha; Lee, Min Jung; Yang, Seung-Min; Kim, Seung Hwan; Kim, Soon Han; Kim, Hae-Yeong

doi:10.1038/s41598-022-22168-2

Download PDF

Article
Open access
Published: 29 November 2022

Genomic characteristics and comparative genomics analysis of Salmonella enterica subsp. enterica serovar Thompson isolated from an outbreak in South Korea

Woojung Lee^1,2,
Eiseul Kim²,
Hyunwoo Zin¹,
Soohyun Sung¹,
Jungha Woo¹,
Min Jung Lee¹,
Seung-Min Yang²,
Seung Hwan Kim¹,
Soon Han Kim¹ &
…
Hae-Yeong Kim²

Scientific Reports volume 12, Article number: 20553 (2022) Cite this article

1968 Accesses
5 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Salmonella infections represent an important public health problem. In 2018, a multistate outbreak of S. enterica subsp. enterica serovar Thompson infection associated with contaminated chocolate cakes in schools was reported in South Korea. In this study, we sequenced the 37 S. Thompson strains isolated from chocolate cakes, egg whites, preserves, and cookware associated with the outbreak. In addition, we analyze the genomic sequences of 61 S. Thompson strains (37 chocolate cake-related outbreak strains, 4 strains isolated from outbreaks in South Korea and 20 strains available in the National Center for Biotechnology Information) to assess the genomic characteristics of outbreak-related strains by comparative genomics and phylogenetic analysis. The results showed that identically classified clusters divided strains into two clusters, sub-clusters A & I (with strains from 2018 in South Korea) and sub-clusters B & II (with strains from 2014 to 2015 in South Korea). S. Thompson isolated from South Korea were accurately distinguished from publicly-available strains. Unlike other S. Thompson genomes, those of chocolate cake outbreak-related strains had three Salmonella phages (SEN8, vB SosS Oslo, and SI7) integrated into their chromosome. Comparative genomics revealed several genes responsible for the specific genomic features of chocolate cake outbreak-related strains and three bacteriophages that may contribute to the pathogenicity of other S. Thompson strains.

Phylogenetic structure of Salmonella Enteritidis provides context for a foodborne outbreak in Peru

Article Open access 16 December 2020

Whole genome sequencing analyses revealed that Salmonella enterica serovar Dublin strains from Brazil belonged to two predominant clades

Article Open access 22 June 2022

Genomic analysis of Shiga toxin-producing Escherichia coli O157:H7 from cattle and pork-production related environments

Article Open access 01 July 2021

Introduction

Salmonella enterica is one of the most common foodborne pathogens, and it is responsible for > 99% of foodborne outbreaks caused by Salmonella species¹. Over 2600 serovars of S. enterica have been identified, including Typhimurium, Enteritidis, Typhi, Paratyphi, and Bareilly². S. enterica is the main foodborne pathogen that causes salmonellosis, which is characterized by diarrhea, vomiting, and high fever due to the consumption of contaminated foods. This species represents the second-most common and confirmed etiological agent, with 896 outbreaks and 23,662 hospitalizations reported between 2009 and 2015 in the United States³ and 6340 hospitalizations from 2014 to 2018 in South Korea, indicating that salmonellosis is one of the most common foodborne illnesses⁴.

S. enterica subsp. enterica serovar Thompson was first identified by Scott in 1926, and it belongs to the C1 serogroup⁵. S. Thompson has been identified as the cause of foodborne outbreaks associated with cilantro, arugula, chicken, beef, egg, bread, and smoked salmon consumption^6,7,8. In addition, S. Thompson, S. Livingstone, S. Bareilly, and S. Montevideo were isolated at increasing frequencies and implicated in several foodborne disease outbreaks in 2014⁹. In September 2018, a large outbreak of S. Thompson infection associated with chocolate cake consumption was reported in South Korea. On September 6, the Ministry of Food and Drug Safety and the Korea Centers for Disease Control and Prevention (KCDC) announced that 13 schools in the country had reported outbreaks of gastroenteritis until September 5, and the same chocolate cakes provided to those schools were suspected as the source of infection. Subsequently, the distribution and sales of these chocolate cakes were suspended. Thereafter, the Ministry of Food and Drug Safety and the KCDC reported outbreaks of gastroenteritis in 57 mass meal services that distributed the chocolate cake in 12 provinces, including Seoul, until September 10, resulting in 2207 cases¹⁰. The genomic characteristics of S. Thompson need to be elucidated to further understand and control this newly emerging foodborne pathogen.

Advances in whole-genome sequencing (WGS) technology have made it an economically viable alternative to conventional typing methods for outbreak investigations and public health surveillance¹¹. Comparative genomics using WGS data provide insights into the genomic characteristics of pathogenic bacteria, including candidate drug compounds, potential virulence determinants, mechanisms of pathogenicity, and their evolution in pathogens. In addition, several approaches are used to analyze WGS data for epidemiological and infection control purposes. A popular approach is to construct phylogenies based on single nucleotide polymorphism (SNP) variant calling, which identifies single nucleotide differences between isolates using a single reference genome^12,13. An alternative approach is a gene-by-gene typing method, indexing core or accessory gene variation due to mutations or recombination events^14,15. Consequently, this technology has been instrumental in improving diagnostics and public health microbiology¹⁶. Currently, WGS-based analysis is widely used to investigate outbreaks of pathogenic bacteria, such as Bacillus cereus, Escherichia coli, Vibrio parahaemolyticus, and Salmonella species^17,18.

Major differences are found between and within species in their ability to cause infections. Horizontal transfer and acquisition of virulence factors are major driving forces in the emergence and evolution of pathogenic isolates. Several mobile genetic elements, such as insertion sequences, plasmids, bacteriophages, and pathogenicity islands, are implicated in the horizontal transfer of virulence genes; bacteriophages represent one of the most important factors¹⁹.

Bacteriophages are the most abundant biological entity on Earth and have been estimated to kill 20–25% of microbes daily^20,21. Moreover, phages represent key contributors to bacterial ecology and evolution via obligate parasitism via either lytic or temperate life cycles, resulting in the direct or delayed lysis of bacterial hosts, respectively²². Phage–host interactions contribute considerably to genetic flux via horizontal gene transfer, which is responsible for the dissemination and acquisition of important bacterial phenotypes, such as enhanced colonization of the human gut epithelium, antimicrobial resistance, and toxin production^21,23.

In this study, we aimed to characterize S. Thompson isolated from a chocolate cake-related outbreak in South Korea by WGS, utilizing various bioinformatics tools for outbreak analysis. The phylogenetic analysis using whole-genome multilocus sequence typing (wgMLST) and whole-genome SNP (wgSNP) analyses would provide their evolutionary relationships, and their typical phylogenetic patterns in South Korea would be determined. In addition, comparative genomic analysis revealed several genes responsible for the specific genomic features of chocolate cake outbreak-related strains and identified three bacteriophages. Consequently, this study would be useful to extend our understanding on the evolutionary relationship and pathogenesis of S. Thompson strains isolated from the chocolate cake-related outbreak in South Korea, and it would provide basic information on the control and regulation of this pathogenic S. Thompson for food safety.

Results

Genomic features

The representative genomes of Salmonella strains MFDS1011643 (chocolate cake), MFDS1011657 (egg white), MFDS1011659 (preserved foods), and MFDS1011716 (cookware) isolated from the food poisoning outbreak associated with contaminated chocolate cakes were sequenced; all strains had one chromosome and one plasmid in common (Salmonella enterica subsp. enterica strain YU39 plasmid pYU39_89 (accession number CP011430.1)) (Table 1). In silico serotyping predicted an antigenic profile of 7:k:1,5 S. Thompson (O antigen: 7, H antigen phase 1: k, H antigen phase 2: 1,5) based on the Kauffmann-White scheme. Furthermore, the sequence type (ST) was identified by using the Salmonella species multilocus sequence typing (MLST) database. S. Thompson strains MFDS1011643, 1011657, 1011659, and 1011716 were typed as ST26, which is associated with strains isolated from the chocolate cake-related outbreak in 2018.

Table 1 Genomic properties of Salmonella enterica serovar Thompson isolates from the chocolate cake-related outbreak in South Korea, 2018.

Full size table

MFDS1011643 contained a 4,822,132-bp chromosome and a 91,380-bp plasmid that showed G + C contents of 52.2 and 47.2%, respectively. Gene prediction of this genome revealed 4898 protein-coding sequences, as well as 87 tRNA (transfer ribonucleic acid) and 22 rRNA (ribosomal ribonucleic acid) genes. Similar results were observed for strains MFDS1011657, MFDS1011659, and MFDS1011716, which contained chromosomes sized 4,822,555, 4,822,450, and 4,822,696 bp and plasmids sized 92,361, 92,361, and 92,357 bp, respectively; the chromosomes and plasmids of all of these genomes showed G + C contents of 52.2 and 47.3%, respectively. The chromosomes and plasmids of MFDS1011657, MFDS1011659, and MFDS1011716 contained 4888, 4892, and 4894 protein-coding sequences, respectively, and they identically harbored 87 tRNA and 22 rRNA genes.

Phylogenetic analysis

To investigate the genetic diversity of chocolate cake-related outbreak strains, wgMLST and wgSNP were used for the phylogenetic analysis of S. Thompson strains. The dataset is composed of 61 S. Thompson strains genomes: (1) chocolate cake-related outbreak strains (N = 37); (2) S. Thompson strains isolated from outbreaks in South Korea (N = 4); (3) S. Thompson strains, which are available in the NCBI (accessed on April 12, 2022) (N = 20).

The phylogenetic tree was analyzed based on wgMLST and wgSNP. The wgMLST-based analysis showed that identically classified clusters divided strains into two clusters, cluster A (with strains from 2018 in South Korea) and cluster B (with strains from 2014–2015 in South Korea) (Fig. 1a). Cluster A isolates differed from each other by 0 to 5 alleles. Cluster A isolates differed from cluster B isolates by 40 to 54 alleles, and S. Thompson strains in the NCBI differed by 89 to 584 alleles (Fig. 1b). The results of the wgSNP-based analysis were the same as those of wgMLST. The wgSNP-based analysis showed that identically classified clusters divided strains into two clusters, cluster I (with strains from 2018 in South Korea) and cluster II (with strains from 2014 to 2015 in South Korea) (Fig. 2a). Cluster I isolates differed from each other by 0 to 3 SNPs. Cluster I isolates differed from cluster II isolates by 31 to 34 SNPs, and S. Thompson strains in the NCBI differed by 60 to 415 SNPs (Fig. 2b). The presence of 20 or fewer SNPs indicates that bacteria are genetically very similar and recently originated from the same source^24,25. Therefore, the chocolate cake-related outbreak S. Thompson strains are distant from other isolates in South Korea. In addition, S. Thompson strains isolated in South Korea were accurately distinguished from S. Thompson strains reported in the NCBI.

Comparative genomics

The serotypes of all genomes were identified as 7:k:1,5 (S. Thompson) by in silico serotyping. Also, all genomes were typed as ST26. All genomes were assigned to two distinct groups in the phylogenetic analysis based on pangenome. The first cluster contained 37 outbreak-related strains and the second cluster contained 24 other strains (Fig. 3). Pan-genome analysis showed that 61 whole-genome sequences of S. Thompson strains had a pan-genome comprising 7226 genes, which was segregated into a core genome of 3740 genes, an accessory genome of 1657 genes, and a unique genome of 1829 genes. Of the 1657 accessory genes, 121 were identified as unique genes in the 37 outbreak strains, including replication endonuclease, head protein, phage tail protein, and hypothetical proteins. Unique genes of outbreak strains were assigned nine COG categories: cell cycle control, cell division, chromosome partitioning (D); cell wall/membrane/envelope biogenesis (M); cell motility (N); signal transduction mechanisms (T); transcription (K); replication, recombination, and repair (L); nucleotide transport and metabolism (F); general function prediction only (R); and function unknown (S). Functional categories, such as general function prediction only (R, 20.94%), transcription (K, 15.71%), and replication, recombination, and repair (L, 15.71%), were the most enriched among the unique genes of the outbreak strains.

In silico identification of virulence genes and antimicrobial resistance genes

Virulence gene mapping of 61 genomes was performed to determine the virulence repertoire of S. Thompson. In the 61 genomes, 162 virulence genes, including fimbrial adherence determinants, non-fimbrial adherence determinants, macrophage-inducible genes, magnesium-uptake genes, and regulation and secretion system genes, were identified. The detailed results of virulence gene mapping for the 61 S. Thompson genomes are shown in Supplementary Table 1. The virulence gene pattern of S. Thompson was similar for each strain, and a core set of virulence genes was conserved in all strains. Therefore, the genomes of S. Thompson isolated from the chocolate cake-related outbreak contained virulence factors similar to those present in other S. Thompson genomes.

All genomes including S. Thompson isolated from the chocolate cake-related outbreak carried SPI-1 and SPI-2 (Supplementary Table 1). Multiple virulence factors have been implicated in Salmonella pathogenesis. These factors include type 3 secretion systems (T3SSs) encoded in SPI-1, SPI-2, and other SPIs, as well as other factors, such as flagella, capsules, plasmids, and adhesion systems^26,27. Among these factors, fimbriae play a critical role in virulence by facilitating the interaction of bacteria with host cells and other solid substrates²⁸. Additionally, S. enterica produces two T3SSs encoded by SPI-1 and SPI-2²⁹. The T3SS-1 cluster is responsible for the early phase invasion of intestinal epithelial cells and M cells in the gut lumen and the activation of proinflammatory responses. In contrast, T3SS-2 is associated with the late phase of infection, including intracellular survival and replication within host phagocytes³⁰. Thus, S. Thompson strains isolated from the chocolate cakes harbored various virulence factors that may contribute to human pathogenicity.

The ResFinder database was used to forecast the antimicrobial resistance genes that 61 S. Thompson genomes contained. At least two or more antimicrobial resistance genes were present in every genome (Supplementary Table 2). All genomes including 37 chocolate cake-related outbreak strains had genes associated with aminoglycoside (amikacin and tobramycin). Among them, five genomes (14-Sa103, 688C, ABBSB1050-2, HFCDC-SM-846, and SH11G0791) possess multiple antimicrobial resistance genes, such as beta-lactam, tetracycline, and quinolone.

Phage characterization

Based on a comparison of genome sequences of 61 S. Thompson isolates, 37 S. Thompson isolates from the chocolate cake-related outbreak showed sequences of three unique bacteriophages in their chromosomes (Fig. 4). Based on the analysis of phages in 50 S. Thompson genomes, the 38.3-kb Salmonella phage SEN8 (accession number NC_047753), 50.3-kb Salmonella phage vB_SosS_Oslo (accession number NC_018279), and 31.5-kb Salmonella phage SI7 (NC_049460) were found only in the genomes of S. Thompson strains isolated from the outbreak. Therefore, while Salmonella pathogenicity islands (SPI) distribution was conserved among the evaluated S. Thompson genomes, the bacteriophage repertoire was diverse and contributed significantly to the genetic diversification of S. Thompson strains.

The purpose of this study is to predict the putative prophage sequence based on the genome sequence using in-silico phage search tool. Bacteriophage sequences were identified using PHASTER. PHASTER was used to predict putative prophage regions as “intact,” “questionable,” or “incomplete” based on the proportion of phage-related genes in the identified phage region. Prophages classified as “intact” by PHASTER and are the most likely to be complete and functional. In this study, the identified putative prophages were classified as “intact”. However, in order to confirm whether the identified prophage is induced or inactivated, excision induction experiments may need to be performed further.

Discussion

S. Thompson often causes human diarrheal disease and has the ability to spread virulence and antibiotic resistance genes³¹. Therefore, in-depth studies for S. Thompson of genetic characteristics such as mobile gene elements and virulence factors are needed. Comparative genomics using WGS data provide insights into the genomic characteristics of pathogenic bacteria, including potential virulence determinants. In this study, we sequenced and analyzed the S. Thompson strains isolated from this chocolate cake-related outbreak to understand the pathogenic characteristics of S. Thompson strains.

The WGS-based analyses are able to construct the highly resolved phylogeny needed to support the findings of the outbreak investigation. The application of WGS allowed clustering analysis of S. Thompson in an outbreak and wgMLST, wgSNP, and phylogenetic tree analyses. The wgMLST and wgSNP analysis showed that S. Thompson clones isolated from egg white, cake, preserved foods, and cookware had the same clonal origin. This result represents likely cross-contamination events where egg whites had been contaminated with S. Thompson clones found in the cake, preserved foods, and cookware. Also, phylogeny analysis showed that S. Thompson strains isolated from South Korea were accurately distinguished from S. Thompson strains in the NCBI. Consequently, wgMLST and wgSNP analyses showed significant differences in S. Thompson strains isolated from a multistate chocolate cake-related outbreak in 2018, suggesting that human-infecting Salmonella strains can cause symptomatic salmonellosis.

Salmonella genomes were prophages are abundant (roughly 9000 phage types)³², and many prophages carry virulence genes that encode proteins that play major roles in bacterial pathogenesis. Unlike other S. Thompson genomes, those of chocolate cake outbreak-related strains had three Salmonella phages (SEN8, vB SosS Oslo, and SI7) integrated into their chromosome. The phage SEN8 encodes fljA, which plays crucial roles in bacterial motility, chemotactic behavior, and host cell invasion as a virulence determinant³³. The phage SEN8 also encodes T1SS-associated proteins (LapB, LapC, and LapE), which function in relatively simple pathways that enable the secretion of a diverse range of virulence factors. The phage vB_SosS_Oslo encodes holin and endopeptidase Rz. Holin and endolysin are important for host cell lysis³⁴. Holins create holes in the cytoplasmic membrane, which serve as transport channels for endolysins that digest the peptidoglycan layer. In addition, Rz/Rz1-like proteins often enhance endolysin activity as accessory proteins³⁵. The phage SI7 encodes the copper-sensing two-component system response regulator CpxR, which is involved in the virulence of uropathogenic E. coli³⁶, S. Typhimurium³⁷, and Vibrio cholerae³⁸. Additionally, several studies have verified the role of CpxR in the antibiotic resistance of pathogenic bacteria. In particular, the phage holin and endopeptidase Rz were present simultaneously in the strains isolated from the chocolate cake-related outbreak. Our results show that chocolate cake outbreak-related strains had three unique Salmonella phages (SEN8, vB SosS Oslo, SI7) integrated into their chromosomal sequences, which contributed considerably to the genetic diversification of S. Thompson strains.

In conclusion, we reported the genome sequences and functional genomic features of 37 S. Thompson strains that were isolated from the large-scale outbreak associated with contaminated chocolate cakes in South Korea in 2018. Clustering and phylogenetic analysis from wgMLST and wgSNP suggested that S. Thompson strains isolated from a multistate chocolate cake-related outbreak in 2018 had low genetic relevance to previously reported S. Thompson strains^24,25. In addition, comparative genomic analysis provides comprehensive information on the disease potential of the outbreak strains. Comparative genomic analysis showed that the outbreak strains were similar to previously reported S. Thompson strains, except that the outbreak strains contained a few different genes related to phages. However, further studies are required to elucidate the association between unique phage-related genes and large-scale foodborne disease outbreaks. Along with the identified genomic features of outbreak strains, the accumulation of additional genome sequence information on diverse S. Thompson strains can serve as basic information for preventing foodborne disease outbreaks and assisting treatment in South Korea. Also, the information obtained by WGS can be further used to gain insight into antibiotic resistance, virulence genes, and molecular evaluation for pathogenesis.

Methods

Strain isolation and serotyping

Thirty-seven S. Thompson strains associated with the chocolate cake-related outbreak in 2018 were isolated from chocolate cakes, egg whites, preserves, and cookware. The strains isolated in this study are summarized in Table 2. The samples were plated on xylose lysine deoxycholate agar (Oxoid, Basingstoke, United Kingdom) and Rambach agar (CHROMagar, Paris, France) and incubated at 37 °C for 18–24 h. After isolation, the isolates were identified using the VITEK MS and VITEK 2 systems using a GN card (BioMerieux, Marcy-l'Étoile, France). Salmonella serotyping was performed according to the White-Kauffmann-Le Minor scheme using somatic (O) (Denka Seiken, Tokyo, Japan) and flagella (H) antisera (Becton & Dickinson, Sparks, MD, USA).

Table 2 Information on Salmonella enterica serovar Thompson outbreak-related isolates.

Full size table

Genome sequencing, assembly, and annotation

Genomic DNA (deoxyribonucleic acid) was extracted using a MagListo™ 5 M Genomic DNA Extraction Kit for cells and tissues (Bioneer, Daejeon, Korea), according to the manufacturer’s protocol. The integrity and concentration of DNA were determined via standard agarose gel electrophoresis and a Qubit™ 3.0 Fluorometer (Life Technologies, Carlsbad, CA, USA), respectively. A DNA library was prepared using the Nextera DNA Flex Library Prep Kit (Illumina, San Diego, CA, USA). Sequencing was performed using a MiSeq sequencing system (Illumina) and a MiSeq Reagent Kit v3 (600-cycle). Raw read data were first demultiplexed and quality-trimmed to remove low quality reads and base calls in CLC Genomics Workbench using a modified Mott trimming algorithm and a parameter value limit of 0.05; ambiguous nucleotides were trimmed using a maximum number of ambiguities of two. The contigs (FASTA sequence files) were assembled de novo using CLC Workbench version 12.0 (Qiagen, Hilden, Germany) with automatic word size 20 and bubble size 50, and a minimum contig length of 200 bp. Paired read distances were automatically detected and contigs were scaffolded where possible. Following assembly, the reads were mapped back to the contigs using a mismatch cost = 2, insertion cost = 3, deletion cost = 3, length fraction = 0.5, and similarity fraction = 0.8; contigs were updated and gaps were filled. To obtain high-quality data to determine a complete genome sequence, hybrid genome assembly was performed using additional long-read sequence data obtained using PacBio Sequel (Pacific Bioscience, Menlo Park, CA, USA). Hybrid assembly for raw FASTQ PacBio sequel long-read sequence data and Illumina MiSeq short-read FASTQ sequence data were performed using Unicycler (v0.4.9, https://github.com/rrwick/Unicycler; default options). The assembled genome was annotated using the Rapid Annotation using Subsystem Technology toolkit as implemented in the PATRIC annotation web service (v3.6.12).

DNA sequence analysis and bioinformatics

A comparative genomic analysis was performed for the 37 S. Thompson strains associated with the chocolate cake-related outbreak in 2018 and 24 other S. Thompson strains. The sequences of 4 strains isolated from other outbreaks³⁹ in South Korea and 20 other S. Thompson strains are available in the National Center for Biotechnology Information (NCBI, accessed on August 06, 2021). The genomes analyzed in this study are summarized in Supplementary Table 3. The pan-genome was analyzed using the bacterial pan-genome analysis (BPGA) tool (v. 1.3; default parameters). The assignment of cluster of orthologous groups (COG) functional categories for genes derived from the pan-genome were performed using USEARCH in the BPGA tool, against the COG database. Virulence factors and antimicrobial resistance genes were identified using the virulence factor database and ResFinder v.4.1, respectively^40,41. Plasmid features for complete genomes were confirmed by BLASTN search v. 2.13.0 (https://blast.ncbi.nlm.nih.gov/Blast.cgi). wgMLST was performed with BioNumerics 8.0 (Applied Maths, Sint-Martens-Latem, Belgium), where 15,874 loci for S. enterica were included for analysis. wgSNP analysis was performed with BioNumerics 8.0 (Applied Maths) using S. enterica LT2 as the reference genome, and “Strict SNP filtering” was applied. Clustering of the wgMLST and wgSNP results was performed by an unweighted pair group method with an arithmetic-means-based tree. A minimum spanning tree based on pairwise wgSNP differences was also constructed using BioNumerics. In addition, the web-based serotyping tool SeqSero 1.2⁴² was used to predict the antigenic profile of Salmonella strains.

Prophage prediction and analysis

Bacteriophage sequences within the unique genome sequence were identified using PHAge Search Tool Enhanced Release (PHASTER)⁴³. PHASTER was used to predict putative prophage regions as “intact,” “questionable,” or “incomplete” based on the proportion of phage-related genes in the identified phage region. Three intact prophage sequences were subjected to further analysis to confirm the prophage completeness. The basic local alignment search tool was used to align the three intact prophage sequences to those of Salmonella phages SEN8 (NC_047753), vB SosS Oslo (NC_018729), and SI7 (NC_049460) obtained from the NCBI database. Easyfig 2.2.5 was used to analyze the homologous regions between phage sequences⁴⁴.

Accession codes

The complete genome data of Salmonella enterica subsp. enterica serovar Thompson strains MFDS1011643, MFDS1011657, MFDS1011659, and MFDS1011716 have been deposited in GenBank with Accession No. JAKVTZ000000000, CP092627/CP092628, CP092510/CP092511, and CP092690/CP092691, respectively. The draft genome data of 33 S. Thompson strains associated with the chocolate cake-related outbreak in 2018 have been deposited in NCBI Sequence Read Archive (SRA) with Accession No. SRR18056016 ~ SRR18056048.

Data availability

Sequence data have been submitted to the NCBI archives (https://ncbi.nlm.nih.gov), including GenBank, Sequence Read Archive (SRA), under the accession numbers listed in Supplementary Table 3.

References

Lamas, A. et al. A comprehensive review of non-enterica subspecies of Salmonella enterica. Microbiol. Res. 206, 60–73 (2018).
Article PubMed CAS Google Scholar
Grimont, P. & Weill, F.-X. Antigenic Formulae of the Salmonella Servovars: WHO Collaborating Centre for Reference and Research on Salmonella 9th edn. (Institute Pasteur, 2007).
Google Scholar
Dewey-Mattia, D., Manikonda, K., Hall, A. J., Wise, M. E. & Crowe, S. J. Surveillance for foodborne disease outbreaks—United States, 2009–2015. MMWR Surveill. Summ. 67, 1–11 (2018).
Article PubMed PubMed Central Google Scholar
https://www.foodsafetykorea.go.kr/.
Scott, W. M. The, “Thompson” type of Salmonella. J. Hyg. 25, 398–405 (1926).
PubMed PubMed Central CAS Google Scholar
Campbell, J. V. et al. An outbreak of Salmonella serotype Thompson associated with fresh cilantro. J. Infect. Dis. 183, 984–987 (2001).
Article PubMed CAS Google Scholar
Kimura, A. C. et al. A multi-state outbreak of Salmonella serotype Thompson infection from commercially distributed bread contaminated by an ill food handler. Epidemiol. Infect. 133, 823–828 (2005).
Article PubMed PubMed Central CAS Google Scholar
Nygård, K. et al. Outbreak of Salmonella Thompson infections linked to imported rucola lettuce. Foodborne Pathog. Dis. 5, 165–173 (2008).
Article PubMed Google Scholar
Yun, Y. S., Lee, D. Y. & Tae, C. Prevalence and characteristics of Salmonella spp. isolated in Korea. Public Heal. Wkly [Report], KCDC (2015).
Eun, Y. et al. A large outbreak of Salmonella enterica serovar Thompson infections associated with chocolate cake in Busan, Korea. Epidemiol. Health 41, e2019002 (2019).
Article PubMed PubMed Central Google Scholar
Ashton, P. M. et al. Identification of Salmonella for public health surveillance using whole genome sequencing. PeerJ 4, e1752 (2016).
Article PubMed PubMed Central Google Scholar
Besser, J., Carleton, H. A., Gerner-Smidt, P., Lindsey, R. L. & Trees, E. Next-generation sequencing technologies and their application to the study and control of bacterial infections. Clin. Microbiol. Infect. 24, 335–341 (2018).
Article PubMed CAS Google Scholar
Parcell, B. J. et al. Pseudomonas aeruginosa intensive care unit outbreak: Winnowing of transmissions with molecular and genomic typing. J. Hosp. Infect. 98, 282–288 (2018).
Article PubMed PubMed Central CAS Google Scholar
Maiden, M. C. J. et al. MLST revisited: The gene-by-gene approach to bacterial genomics. Nat. Rev. Microbiol. 11, 728–736 (2013).
Article PubMed PubMed Central CAS Google Scholar
Pearce, M. E. et al. Comparative analysis of core genome MLST and SNP typing within a European Salmonella serovar Enteritidis outbreak. Int. J. Food Microbiol. 274, 1–11 (2018).
Article PubMed PubMed Central CAS Google Scholar
Punina, N. V., Makridakis, N. M., Remnev, M. A. & Topunov, A. F. Whole-genome sequencing targets drug-resistant bacterial infections. Hum. Genom. 9, 19 (2015).
Article CAS Google Scholar
Yang, S. M. et al. Development of a genoserotyping method for salmonella infantis detection on the basis of pangenome analysis. Microorganisms 9, 1–11 (2021).
Google Scholar
Wagner, E. et al. Virulence characterization and comparative genomics of Listeria monocytogenes sequence type 155 strains. BMC Genom. 21, 847 (2020).
Article CAS Google Scholar
Boyd, E. F. & Brüssow, H. Common themes among bacteriophage-encoded virulence factors and diversity among the bacteriophages involved. Trends Microbiol. 10, 521–529 (2002).
Article PubMed CAS Google Scholar
Hendrix, R. W. Bacteriophages: Evolution of the majority. Theor. Popul. Biol. 61, 471–480 (2002).
Article PubMed Google Scholar
Turner, D. et al. Comparative analysis of 37 Acinetobacter bacteriophages. Viruses 10, 5 (2018).
Article Google Scholar
Mikalová, L. et al. Novel temperate phages of Salmonella enterica subsp. salamae and subsp. diarizonae and their activity against pathogenic S. enterica subsp. enterica isolates. PLoS ONE 12, e0170734 (2017).
Article PubMed PubMed Central Google Scholar
Seed, K. D. Battling phages: How bacteria defend against viral attack. PLoS Pathog. 11, e1004847 (2015).
Article PubMed PubMed Central Google Scholar
Pightling, A. W. et al. Interpreting whole-genome sequence analyses of foodborne bacteria for regulatory applications and outbreak investigations. Front. Microbiol. 9, 01482 (2018).
Article Google Scholar
Stevens, E. L. et al. Use of whole genome sequencing by the federal interagency collaboration for genomics for food and feed safety in the United States. J. Food Prot. 85, 755–772 (2022).
Article PubMed Google Scholar
Daigle, F. Typhi genes expressed during infection or involved in pathogenesis. J. Infect. Dev. Ctries. 2, 431–437 (2008).
Article PubMed CAS Google Scholar
Sabbagh, S. C., Forest, C. G., Lepage, C., Leclerc, J. M. & Daigle, F. So similar, yet so different: Uncovering distinctive features in the genomes of Salmonella enterica serovars Typhimurium and Typhi. FEMS Microbiol. Lett. 305, 1–13 (2010).
Article PubMed CAS Google Scholar
Edwards, R. A., Schifferli, D. M. & Maloy, S. R. A role for Salmonella fimbriae in intraperitoneal infections. Proc. Natl. Acad. Sci. U. S. A. 97, 1258–1262 (2000).
Article PubMed PubMed Central CAS Google Scholar
Jennings, E., Thurston, T. L. M. & Holden, D. W. Salmonella SPI-2 type III secretion system effectors: Molecular mechanisms and physiological consequences. Cell Host Microbe 22, 217–231 (2017).
Article PubMed CAS Google Scholar
Bao, H., Wang, S., Zhao, J. H. & Liu, S. L. Salmonella secretion systems: Differential roles in pathogen-host interactions. Microbiol. Res. 241, 126591 (2020).
Article PubMed CAS Google Scholar
Zhang, J. et al. Resistance and pathogenicity of Salmonella Thompson isolated from incubation end of a poultry farm. Vet. Sci. 9, 349 (2022).
Article PubMed PubMed Central CAS Google Scholar
Wahl, A., Battesti, A. & Ansaldi, M. Prophages in Salmonella enterica: A driving force in reshaping the genome and physiology of their bacterial host?. Mol. Microbiol. 111, 303–316 (2019).
Article PubMed CAS Google Scholar
Colin, R., Ni, B., Laganenka, L. & Sourjik, V. Multiple functions of flagellar motility and chemotaxis in bacterial physiology. FEMS Microbiol. Rev. 45, fuab038 (2021).
Article PubMed PubMed Central CAS Google Scholar
Wang, I. N., Smith, D. L. & Young, R. Holins: The protein clocks of bacteriophage infections. Annu. Rev. Microbiol. 54, 799–825 (2000).
Article PubMed CAS Google Scholar
Krupovič, M., Cvirkaite-Krupovič, V. & Bamford, D. H. Identification and functional analysis of the Rz/Rz1-like accessory lysis genes in the membrane-containing bacteriophage PRD1. Mol. Microbiol. 68, 492–503 (2008).
Article PubMed Google Scholar
Debnath, I. et al. The Cpx stress response system potentiates the fitness and virulence of uropathogenic escherichia coli. Infect. Immun. 81, 1450–1459 (2013).
Article PubMed PubMed Central CAS Google Scholar
Humphreys, S. et al. Role of the two-component regulator CpxAR in the virulence of Salmonella enterica serotype Typhimurium. Infect. Immun. 72, 4654–4661 (2004).
Article PubMed PubMed Central CAS Google Scholar
Acosta, N., Pukatzki, S. & Raivio, T. L. The Cpx system regulates virulence gene expression in Vibrio cholerae. Infect. Immun. 83, 2396–2408 (2015).
Article PubMed PubMed Central CAS Google Scholar
Park, S. et al. Complete genome sequence of Salmonella Thompson strain MFDS1004024 isolated from crab-stick. Korean J. Microbiol. 54, 155–157 (2018).
Google Scholar
Liu, B., Zheng, D., Jin, Q., Chen, L. & Yang, J. VFDB 2019: A comparative pathogenomic platform with an interactive web interface. Nucleic Acids Res. 47, D687–D692 (2019).
Article PubMed CAS Google Scholar
Bortolaia, V. et al. ResFinder 4.0 for predictions of phenotypes from genotypes. J. Antimicrob. Chemother. 75, 3491–3500 (2020).
Article PubMed PubMed Central CAS Google Scholar
Zhang, S. et al. Salmonella serotype determination utilizing high-throughput genome sequencing data. J. Clin. Microbiol. 53, 1685–1692 (2015).
Article PubMed PubMed Central CAS Google Scholar
Arndt, D. et al. PHASTER: A better, faster version of the PHAST phage search tool. Nucleic Acids Res. 44, W16–W21 (2016).
Article PubMed PubMed Central CAS Google Scholar
Sullivan, M. J., Petty, N. K. & Beatson, S. A. Easyfig: A genome comparison visualizer. Bioinformatics 27, 1009–1010 (2011).
Article PubMed PubMed Central CAS Google Scholar

Download references

Acknowledgements

This research was supported by grants (No. 20161MFDS030) from the Ministry of Food and Drug Safety. The findings and conclusions of this article are ours and do not necessarily represent the views of the Ministry of Food and Drug Safety.

Author information

Authors and Affiliations

Division of Food Microbiology, National Institute of Food and Drug Safety Evaluation, Ministry of Food and Drug Safety, Cheongju, 28159, Korea
Woojung Lee, Hyunwoo Zin, Soohyun Sung, Jungha Woo, Min Jung Lee, Seung Hwan Kim & Soon Han Kim
Institute of Life Sciences & Resources and Department of Food Science and Biotechnology, Kyung Hee University, Yongin, 17104, Korea
Woojung Lee, Eiseul Kim, Seung-Min Yang & Hae-Yeong Kim

Authors

Woojung Lee
View author publications
You can also search for this author in PubMed Google Scholar
Eiseul Kim
View author publications
You can also search for this author in PubMed Google Scholar
Hyunwoo Zin
View author publications
You can also search for this author in PubMed Google Scholar
Soohyun Sung
View author publications
You can also search for this author in PubMed Google Scholar
Jungha Woo
View author publications
You can also search for this author in PubMed Google Scholar
Min Jung Lee
View author publications
You can also search for this author in PubMed Google Scholar
Seung-Min Yang
View author publications
You can also search for this author in PubMed Google Scholar
Seung Hwan Kim
View author publications
You can also search for this author in PubMed Google Scholar
Soon Han Kim
View author publications
You can also search for this author in PubMed Google Scholar
Hae-Yeong Kim
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

W.L., E.K., H.Z., S.S., J.W., M.J.L., S.-M.Y., and S.H.K. performed genome analysis and comparative genomics. W.L. wrote the main manuscript text and W.L. and E.K. prepared figs. 1, 2, 3, 4. S.H.K. and H.-Y.K. designed the work and revised the manuscript. All authors reviewed the manuscript.

Corresponding authors

Correspondence to Soon Han Kim or Hae-Yeong Kim.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lee, W., Kim, E., Zin, H. et al. Genomic characteristics and comparative genomics analysis of Salmonella enterica subsp. enterica serovar Thompson isolated from an outbreak in South Korea. Sci Rep 12, 20553 (2022). https://doi.org/10.1038/s41598-022-22168-2

Download citation

Received: 22 May 2022
Accepted: 11 October 2022
Published: 29 November 2022
DOI: https://doi.org/10.1038/s41598-022-22168-2

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.