Genomic insights of Salmonella isolated from dry fermented sausage production chains in Spain and France

The presence of Salmonella in dry fermented sausages is a source of recalls and outbreaks. The genomic diversity of 173 Salmonella isolates from the dry fermented sausage production chains (pig carcasses, pork, and sausages) of France and Spain was investigated through their core phylogenomic relationships and accessory genome profiles. Ten different serovars and thirteen sequence type profiles were identified. The most frequent serovar in sausages was the monophasic variant of S. Typhimurium (1,4,[5],12:i:-, 72%), while S. Derby was the most frequent in pig carcasses (51%). Phylogenomic clusters found in the S. 1,4,[5],12:i:-, S. Derby, S. Rissen and S. Typhimurium serovars identified closely related isolates, with fewer than 10 alleles and 20 SNPs of difference, showing Salmonella persistence along the pork production chain. Most of the S. 1,4,[5],12:i:- genomes contained the Salmonella genomic island-4 (SGI-4), Tn21 and an IncFIB plasmid. More than half of the S. Derby strains contained SGI-1 and Tn7. S. 1,4,[5],12:i:- genomes carried the most multidrug resistance genes (91% of the strains), whereas extended-spectrum β-lactamase genes were found in the Typhimurium and Derby serovars. Salmonella monitoring and characterization in the pork production chains, especially of the S. 1,4,[5],12:i:- serovar, is of special importance due to its multidrug resistance capacity and persistence in dry fermented sausages.

The most frequently reported serovars in pigs, as a food-animal source, and associated with human salmonellosis due to consumption of pork and products thereof in the EU in 2021 3 were the monophasic variant of S. Typhimurium (1,4,[5],12:i:-, 28.2%), S. Derby (22.3%), S. Typhimurium (15.3%) and S. Rissen (6.6%). Although these serovars are closely related genetically at the subspecies level (they all belong to Salmonella enterica subsp. enterica), they can differ significantly in their pathogenic potential [6][7][8]. Furthermore, within the same serovar, clones with a higher virulence and resistance potential may exist. Indeed, pathogenicity is directly associated with resistance to antimicrobials, biocides or heavy metals and with the virulence profile, traits usually acquired through mobile genetic elements (MGE) (i.e., transposons, integrons and plasmids) 9. Subsequently, the dissemination of these specific and emerging clones can be favoured by international trade in goods and human travel 10.
Whole genome sequencing (WGS) is currently the most robust method used in surveillance, microbial trace-back investigation, source attribution and risk assessment of food-borne microorganisms [11][12][13][14], including Salmonella strains and circulating clones. The two main WGS-typing techniques are single nucleotide polymorphism (SNP)-based and allele-based methods. In particular, core-genome single nucleotide polymorphisms (cgSNP) and core-genome multilocus sequence typing (cgMLST, with 3002 loci in the case of Salmonella spp.) are widely used for bacterial typing and phylogenomic analysis 12,15. In addition, accessory genome analyses allow exploration of the most variable part of the microbial pan-genome, comprising the vertically or horizontally transferred DNA incorporated in the bacterial chromosome or contained in plasmids 16.
By analysing 173 Salmonella isolates from the pork production chain, and more particularly from the dry fermented sausage (DFS) production chain, collected in France and Spain in the 1997-2021 period, the present study aimed to characterize the circulating clones with a high potential for resistance and virulence. The dissemination of the prevalent clones was also considered within the pork sector (from farm to fork) and through possible trade between France and Spain, two countries among the largest producers of DFS and pork in Europe.

Description of Salmonella serovars isolated in the French pork production chain
A total of 74 different serovars were identified among the 4717 Salmonella references of the French Salmonella Network collection from the pork production chain between 2002 and 2022 within the context of alerts, official control, surveys, surveillance, and control plans. Most of the references (97.9%) were isolated from 2008 to 2020 and, within this period, the main Salmonella serovars were S. 1,4,[5],12:i:-, S. Derby, S. Rissen and S. Typhimurium. Remarkably, S. 1,4,[5],12:i:- progressively increased from 2009 (6.4%) to become the predominant serovar in the pork production chain in 2014 (41.3%) and then stabilized (Fig. 1). In contrast, the proportion of S. Typhimurium decreased markedly from 2008 (45.5%) to 2020 (5.3%). From 2008 to 2020, there was a slight decrease in the proportion of S. Derby (from 36.4 to 26.0%) and S. Rissen (from 9.1 to 7.9%).

Genome panel characteristics
Whole genome sequence data of 173 Salmonella enterica isolates originating from pork and different stages of the DFS production chain in the northeast area of Spain and in France were analysed. The sources of the strains were pig carcass (49), pork (38), fresh sausage (16) and pork DFS (70) (Table 1). A total of 125 isolates were collected during Salmonella surveillance and 31 in the context of outbreaks (27 specifically isolated from DFS), while 14 and 3 came from the IRTA and ANSES culture collections, respectively (Supplementary Table S1).
Considering the cgMLST results, a maximum likelihood phylogenomic tree (Supplementary Fig. S3) and a minimum spanning tree (Fig. 2) were built. Both clustered the isolates by serovar, except for S. Derby, which had two different lineages due to its polyphyletic nature (ST39 and ST40 belonging to lineage 1 and ST71 to lineage 2) 17.
Clustering association analysis, using a cut-off of a maximum of 10 alleles of difference between genomes, revealed 22 clusters (named with the letters of the alphabet from A to V) that grouped 61 out of the 173 genomes (Supplementary Table S2, "cgMLST" tab). Of these 22 clusters, 2 belonged to S. Typhimurium (A-B), 14 to S. 1,4,[5],12:i:- (C-P), 4 to S. Derby (Q-T), 1 to S. Rissen (U) and 1 to S. Worthington (V) (Fig. 2). Nine of the clusters included Salmonella isolated specifically from DFS (B, F, H, I, J, L, M, O, V), 4 from pig carcass (K, R, S, U), 2 from pork (P, Q), 1 from fresh sausages (E), and the remaining 6 clusters included isolates from different matrixes. Two (D, G) out of 22 included DFS and pig carcass matrixes, 2 (A, S) pork and pig carcass, 1 (C) fresh sausage and pig carcass, and 1 (N, including six genomes) DFS, pork and pig carcass matrixes (Supplementary Table S2).
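As an illustration of the 10-allele cut-off, the following minimal Python sketch groups isolates by cgMLST allelic distance. It assumes single-linkage grouping (an isolate joins a cluster when it is within the cut-off of any member), which is one common reading of such a threshold; the source does not state the linkage used. The isolate names, the toy 30-locus profiles and the decision to skip missing loci are hypothetical.

```python
from itertools import combinations


def allelic_distance(a, b):
    """Count loci whose allele numbers differ; loci missing (None) in either profile are skipped."""
    return sum(1 for x, y in zip(a, b) if x is not None and y is not None and x != y)


def single_linkage_clusters(profiles, cutoff=10):
    """Group isolates linked by chains of pairwise distances <= cutoff (union-find)."""
    parent = {name: name for name in profiles}

    def find(node):
        # Follow parent pointers to the root, compressing the path on the way
        while parent[node] != node:
            parent[node] = parent[parent[node]]
            node = parent[node]
        return node

    for a, b in combinations(profiles, 2):
        if allelic_distance(profiles[a], profiles[b]) <= cutoff:
            parent[find(a)] = find(b)

    groups = {}
    for name in profiles:
        groups.setdefault(find(name), set()).add(name)
    # Singletons are not reported as clusters
    return [members for members in groups.values() if len(members) >= 2]


# Hypothetical toy profiles over 30 loci (the Salmonella cgMLST scheme has 3002 loci)
profiles = {
    "iso1": [1] * 30,
    "iso2": [2] * 4 + [1] * 26,            # 4 alleles from iso1
    "iso3": [2] * 4 + [3] * 8 + [1] * 18,  # 8 alleles from iso2, 12 from iso1
    "iso4": [9] * 30,                      # 30 alleles from every other isolate
}
clusters = single_linkage_clusters(profiles, cutoff=10)
print(clusters)
```

In this toy panel iso3 is 12 alleles away from iso1 but still joins the same cluster through iso2, which is the behaviour that distinguishes single linkage from a rule requiring every pair to be within the cut-off.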
Clusters whose isolates shared the same metadata and differed by ≤ 2 SNPs consisted of clonal isolates coming from the same sampling day or batch, or belonging to the same outbreak. For the S. 1,4,[5],12:i:- serovar, six clusters (4M, 6M, 7M, 10M, 11M, 14M) shared the same metadata. For each of the S. Derby, S. Rissen and S. Typhimurium serovars, only one cluster grouped isolates sharing the same metadata and with ≤ 2 SNPs of difference between core genomes (1D, 1R, and 2T, respectively).
The largest cluster (5M) contained a total of nine closely related S. 1,4,[5],12:i:- isolates, six of them isolated in 2018 from a DFS outbreak that occurred in Occitanie. Only one cgSNP differed between the outbreak-related isolates and the strain isolated in 2019, also from DFS in Occitanie, and there were six cgSNPs of difference with the strain isolated in 2019 from a pig carcass in Nouvelle-Aquitaine. In cluster 1T (identical cgSNP profiles), S. Typhimurium strains were isolated in 2013 and 2014 from a pig carcass and pork in Nouvelle-Aquitaine and Bretagne, respectively. Clusters 5M and 1T are examples of genotype persistence and survival over time in the pork sector in France.
Cluster 12M provided evidence of S. 1,4,[5],12:i:- genotype persistence in the French DFS production chain. The oldest strain of cluster 12M was sampled in 2014 in Centre-Val-de-Loire from pork, then another isolate was collected in 2016 in Occitanie from a pig carcass, and four isolates in 2018 in Auvergne-Rhône-Alpes from outbreak-related DFS. According to these results, this clone was circulating in the DFS production chain (slaughterhouse > cutting plant > retail) and in different French regions for at least four years (2014-2018).
Other examples of clones circulating from pig carcasses to final sausage products are shown by clusters 1M and 2M, which include region-specific isolates. Cluster 1M isolates were collected in November 2016 in the same region in France (Provence-Alpes-Côte d'Azur), from both a pig carcass and a fresh sausage. Cluster 2M isolates were collected in the same region in Spain (Girona), with one strain isolated in 2018 from a pig carcass and two isolated in 2019 from DFS (same batch).
Clusters 4D, 7D (S. Derby ST40), and 8D (S. Derby ST71) isolates were collected in different years from pig carcasses and pork from the same or different geographic locations, and had few SNPs of difference, suggesting a common S. Derby ancestor circulating in the pork sector and within French regions. Interestingly, cluster 13M is an example of a multi-country occurrence, with S. 1,4,[5],12:i:- isolates collected in Spain (Barcelona) in 2019 and in France (Bretagne) in 2021. The genomic distance among all the isolates is 16 SNPs.

Characterization of Salmonella isolates through accessory genome analysis
Resistome, virulome and MGE of all the genomes were examined to characterize the antimicrobial resistances, virulence potential, heavy metal, and biocide tolerances (Figs. 3, 4 and 5) (extended data in Supplementary Tables S4, S5 and S6).
The fosfomycin resistance gene fosA7_1 was only detected in S. Derby. Fifty-five percent of the S. Derby genomes analysed had the Tn7 and SGI-1 profiles, including the aadA2_1, sul1_5 and tet(A)_6 genes, which code for resistance to aminoglycosides, sulphonamides, and tetracyclines, respectively. These isolates clustered closely (ST40) and were recovered in both France and Spain from pig carcass and pork during 2014 and 2015. In contrast, S. Derby ST71 showed no AMR genes (except for aac(6ʹ)-Iaa) (Fig. 3).
Third-generation extended-spectrum β-lactamase (ESBL) resistance genes were found in the S. Typhimurium and S. Derby serovars. Specifically, in S. Typhimurium, two isolates contained blaCARB-2_1 and two other strains contained blaOXA-1_1, genes that encode a carbenicillinase and an oxacillinase, respectively. The blaCTX-M-1_1 ESBL gene, coding for cefotaxime resistance 18, was only found in one isolate of S. Derby ST40.

Genes involved in biofilm formation and in biocide and stress tolerance
A total of 121 genes related to biofilm formation, stress adaptation and biocide and chemical/metal compounds resistance, among others, were evaluated (Supplementary Table S6). The profile of bactericide resistance genes highly depended on the serovar (Figs.

Discussion
The relevance of S. 1,4,[5],12:i:- has risen in recent decades, and the increase of this serovar observed in France agrees with the information reported from Spain 20 and worldwide 21. S. 1,4,[5],12:i:- was first described as an atypical monophasic S. Typhimurium in 1987 22 and it spread in Spain during the 1990s 23. From then on, within the context of pork industry globalization 24, it has become dominant among the existing 2,600 Salmonella serovars, especially in pig herds [25][26][27]. S. 1,4,[5],12:i:- (ST34) is also the most abundant serovar in the evaluated panel of Salmonella genomes, corresponding to isolates from the pig production chain (i.e., pig carcasses, pork, pork sausages and dry fermented sausages) from both France (65%) and Spain (44%) during the 1997-2021 period. Pig carcasses, before being cut, are cooled down to refrigeration temperatures (0-4.4 °C), which has been reported to reduce Salmonella in meat, though without eliminating it completely 28. Interestingly, the number of S. 1,4,[5],12:i:- isolates is higher in fresh sausages and DFS than in pig carcasses, their main source of contamination. These two facts could indicate that there is a selection towards the 1,4,[5],12:i:- serovar along the pork and DFS production chain, a process which ends when the food matrix is fermented (i.e., acidified) and dried 5. DFS are a harsh environment for Salmonella and a progressive decrease of the pathogen has been described 29. Under these circumstances, the high stress tolerance described for S. 1,4,[5],12:i:- among Salmonella serovars 30 and its efficient colonization and survival abilities, which exceed those of its parent S. Typhimurium strain 31, could account for the higher prevalence of this serovar at the end of the pig production chain (i.e., fresh sausages and, particularly, DFS). The S. Derby and S. Rissen serovars seem less well adapted to the environment of DFS processing plants and to the production processes of DFS. Indeed, although their prevalence in pig herds has been stable during the last 20 years 32, as shown by French Salmonella Network data, our genomic panel showed a decrease of the S. Derby and S. Rissen serovars along the DFS production chain, from pig carcass (51% and 14%, respectively) to the final product (7% and 6%, respectively).
The phylogenomic relationship between the 173 isolates shows Salmonella clusters of two or more isolates with 10 or fewer allelic differences and 20 or fewer SNP differences in the core genome for S. 1,4,[5],12:i:-, S. Derby, S. Rissen, S. Typhimurium and S. Worthington. Most of the clusters indicated genotype persistence and survival over time in the pork sector and DFS production chain, while others were region-specific. Within our panel, only three isolates clustered together in the cgSNP analyses despite originating from different countries, suggesting evidence of international trade. The ancestry and phylogeny of Salmonella Typhimurium and S. 1,4,[5],12:i:- isolates have been studied in several pig-related environments (i.e., pig farms and slaughterhouses) 20,33,34, and only a few studies 35 have focused on the production chain of pork products from official control sampling. The phylogenomic results have revealed that 7 out of 9 Salmonella clusters identified within the DFS matrix were due to the monophasic S. 1,4,[5],12:i:-, thus increasing concern about it for official authorities and industry. The short-term substitution rate of S. Typhimurium has been reported to be 1-2 SNPs per genome per year, thus providing information on strain clonality or common ancestry 36. WGS-derived SNPs provided great cluster resolution in our panel, which showed dissemination and transmission of clonal S. Typhimurium isolates between regions (cluster 1T, 0 cgSNPs) and S. 1,4,[5],12:i:- isolates with a common ancestor in the DFS production chain (cluster 12M, 17 cgSNPs). Cross-country spread of Salmonella due to exportation of DFS was found in our phylogenetic results. Out of 173 samples analysed, three indicated cross-border contamination, a prevalence of 1.8% in our study. Other studies also reported Salmonella dissemination due to pig trade in Europe 20.
In agreement with previous studies 37, the S. Derby STs mainly found in the French pork sector were ST40 and ST39, and the same was observed for the Spanish genomes. Regardless of its polyphyletic nature, cgSNP analysis of S. Derby isolates was highly resolutive: closely related clusters were identified within ST40 and ST71, and genotypes with matrix or geographic persistence were shown. The WGS approach has also been used for microbial trace-back investigations, which have indicated DFS as the main source of Salmonella outbreaks 38,39. In our study, cluster 5M (15 cgSNPs) revealed seven 1,4,[5],12:i:- isolates from DFS related to an outbreak in 2018 and from a pig carcass in 2019, which emphasizes the importance of following good manufacturing practices and validating the DFS production process 40 together with adequate sampling plans and monitoring. Successful implementation of continuous monitoring of Salmonella has shown effective control of the dissemination of the pathogen 41. On the other hand, further studies should be carried out on the ability of some S. 1,4,[5],12:i:- clones to resist the cleaning and disinfection practices applied in DFS manufacturing processes.
MGE determine the potential for genomic plasticity and pathogenicity of a bacterium 42,43. Among the identified MGE, plasmid replicon types IncF and Col are the two most abundant replicon families in the dataset. IncFIB and IncFII virulence plasmids are among the best characterized and most abundant plasmids within the genus 44 and have been described as part of the ancestral virulence plasmids together with the rck, spv and pef virulence operons 45. The inheritance of these plasmids is primarily vertical, and serovar divergence theory may explain why only one S. Derby strain and all genomes of S. Typhimurium contain these plasmids 46. Colicinogenic (Col) plasmids, which encode colicin bacteriocins, are typical of Enterobacteriaceae and are abundant in animal guts 47. ColE10_1 is usually found in Salmonella and its relationship with the spread of quinolone resistance through the qnrS1 and qnrB19 genes has been described 48. In our panel, Col plasmids were found in genomes from S. 1,4,[5],12:i:- (9.80%), S. Derby (15.56%) and S. Rissen (18.18%), isolated from all studied matrixes. ColE10_1 was the most frequently detected Col plasmid; specifically, it was found in 6 S. Derby genomes (13.33%), but the qnrB19_1 gene, which confers resistance to quinolones, was not detected concurrently.
Furthermore, transposons and Salmonella Genomic Islands are MGE usually integrated in the chromosome that carry specific antimicrobial resistance genes (ARG), virulence factors and biocide resistance genes. In the 1980s, the acquisition of Tn21 and SGI-4 favoured the expansion of the 1,4,[5],12:i:- European epidemic clone 49. The majority of 1,4,[5],12:i:- genomes in the panel showed the Tn21 genetic element, which encodes mercury resistance (merRT) and antibiotic resistance (ASSuT profile) genes, and SGI-4, encoding genes involved in arsenic (ars operon) and copper (pco operon) resistance 31,49,50. Isolates of 1,4,[5],12:i:-, mainly from DFS (90.9%), had the ACSSuTTm profile, which is usually related to the acquisition of the class 1 integron 51. Stress conditions (e.g., cleaning and disinfection procedures) promote the gain of MGE 52 that can include genes conferring resistance to heavy metals and biocides and promoting biofilm formation, providing the ability to overcome stress conditions and favouring the survival and selection of the S. 1,4,[5],12:i:- serovar 51,53. In S. Derby ST40 there is a large cluster of isolates that carry genes for resistance to quaternary ammonium and mercury compounds, co-occurring with the aadA2, sul1 and tet(A) AMR genes. This was already described by Sévellec et al. 17 for the presence of SGI-1, which also included the tetA gene and extra mercury resistance genes (merA and merC) located in a Tn7 transposon.
Salmonella Typhimurium and its monophasic variant shared some genomic particularities (i.e., presence of SPI-2 and SPI-13) in comparison to the other studied serovars. SPI-2, which was found exclusively in S. Typhimurium and S. 1,4,[5],12:i:- isolates, is a 5-kb locus of horizontally acquired virulence genes that encodes a type III secretion system responsible for delivering effector proteins to the host cell after infection 54. SPI-13, which was found in some genomes of S. 1,4,[5],12:i:-, harbours genes that encode proteins putatively involved in bacterial metabolism; however, their functions remain largely uncharacterized 55.
Virulence factors related to Salmonella adherence (bcf operon) and infection (entAB/fepCG/ompA) were mutually exclusive in the Salmonella genomes. The bcf operon, standing for bovine colonization factor, encodes cryptic fimbriae and plays a role in the regulation of biofilm formation when Salmonella colonizes the intestines 56,57, though it has not been described to promote biofilm formation on industrial surfaces. In contrast, the ent operon encodes the ferric iron-binding siderophore enterobactin and the fep operon encodes the siderophore ABC transporter 58. Both the ent and fep operons, together with the ferric iron-binding siderophore salmochelin, constitute the primary ferric iron import system of Salmonella and are required for its persistent infection in macrophages 58. Outer membrane proteins (OMPs) have multiple functions, and an iron regulation function has also been attributed to them, specifically for the uptake of ferri-siderophore complexes 59,60. Nonetheless, ompA, encoding the outer membrane protein A, plays an important role in the intracellular virulence of Salmonella through self-protection from macrophage nitrosative stress 61 and the activation of the immune system response 62. The shdA gene was exclusively found in S. 1,4,[5],12:i:- and S. Infantis, and was unequally distributed among S. 1,4,[5],12:i:- isolates from the same sampling and present in different proportions in the studied matrixes (7.1% in pig carcass, 5.0% in pork, 6.3% in fresh sausages and 11.5% in DFS). The shdA gene encodes an OMP that is expressed while the pathogen inhabits the animal intestine and allows its specific binding to fibronectin 63, an extracellular adhesion molecule involved in muscular tissue repair. The presence of shdA could be an advantage for the attachment of S. 1,4,[5],12:i:- isolates to pig carcasses and fresh pork, enhancing its selection along the production chain and, together with the abovementioned stress tolerance, resulting in the persistence and survival of the serovar.
Multidrug resistant (MDR) Salmonella strains represent a serious challenge worldwide in the treatment and control of Salmonella infections, since these strains exhibit resistance to three or more antimicrobial classes 64. The proportion of MDR Salmonella isolates from pigs was 39.1% in the EU in 2021 65. Our results show that the most prevalent serovar in DFS, S. 1,4,[5],12:i:-, is also the serovar harbouring the most ARG in its genomes (i.e., 91% of S. 1,4,[5],12:i:- genomes had three or more ARG), thus supporting the concern about its worldwide spread. In addition, extended-spectrum β-lactamase (ESBL) genes, blaCARB-2_1 and blaOXA-1_1, were found in S. Typhimurium, and blaCTX-M-1_1 in S. Derby ST40, in pig carcasses and DFS from both countries, France and Spain, since 2006. The WGS approach allowed the detection of ESBL genes in a large genome dataset without in vitro susceptibility testing, as well as the monitoring of MDR Salmonella profiles, which is of interest for tracking resistome evolution and transfer in different ecosystems and for identifying emerging resistance hazards more quickly 66.
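The class-based MDR definition cited above (resistance to three or more antimicrobial classes) can be sketched in Python as follows. This is a minimal illustration and not the pipeline used in the study: the gene-to-class table only covers genes named in this article, the handling of allele suffixes such as "_1" is an assumption, and gene presence predicts rather than proves phenotypic resistance.

```python
# Illustrative gene-to-class table limited to genes named in this article
GENE_CLASS = {
    "aadA2": "aminoglycoside",
    "sul1": "sulphonamide",
    "tet(A)": "tetracycline",
    "fosA7": "fosfomycin",
    "blaCTX-M-1": "beta-lactam",
}


def resistance_classes(genes):
    """Return the set of antimicrobial classes predicted from a list of gene hits."""
    classes = set()
    for gene in genes:
        # Assumption: database hits carry an allele suffix such as "_1" that is dropped
        name = gene.rsplit("_", 1)[0]
        if name in GENE_CLASS:
            classes.add(GENE_CLASS[name])
    return classes


def is_mdr(genes, min_classes=3):
    """Flag a genome as MDR when its genes span at least min_classes distinct classes."""
    return len(resistance_classes(genes)) >= min_classes


# Hypothetical genomes: the first mirrors the S. Derby ST40 gene set described in the Results
print(is_mdr(["aadA2_1", "sul1_5", "tet(A)_6"]))   # three distinct classes
print(is_mdr(["aadA2_1", "aadA2_1", "sul1_5"]))    # three hits but only two classes
```

Counting distinct classes rather than individual genes matters here, because a genome with three genes from the same class would not meet the class-based definition.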
Several genetic markers of resistance to antibiotics and biocides, virulence factors and MGE have been found in the analysed Salmonella genomes, especially in S. 1,4,[5],12:i:-. Considering the high production figures of the pig and pork derivatives industry and the fact that DFS are ready-to-eat (RTE) products (i.e., eaten without the need for cooking), the transmission of Salmonella isolates and the corresponding resistance genes along the pork production chain is of concern. The ability of the enteric pathogen to survive along the DFS production process, overcoming disinfection cycles and the harsh conditions of DFS, and the remarkable presence of strains with an MDR genetic profile emphasize the need for Salmonella monitoring globally, paying special attention to the S. 1,4,[5],12:i:- serovar. Further research on phenotype verification would be needed to confirm the survival advantage provided by the genetic markers encountered in the genomic Salmonella panel. In this context, WGS technology is a powerful tool to establish precise phylogenetic relationships between genomic clusters of persistent and transmissible strains in the pork sector, confirming the spread of the S. 1,4,[5],12:i:- European epidemic clone and characterizing the differences in the resistome and virulence profile between Salmonella serovars and food matrixes. Sharing genome sequences of isolates together with the corresponding metadata is essential to perform international pathogen surveillance, quickly identify outbreaks, and move forward towards the One Health approach.

Salmonella isolates origin and selection
Fifty Spanish Salmonella isolates were analysed for this study: 36 isolates were provided by the official control food services of the Department of Health (Catalan Public Health Agency, Government of Catalonia) and 14 were from the IRTA culture collection. The official control isolates originated from different matrixes (pig carcass, n = 14; pork, n = 1; and DFS, n = 21) sampled in the frame of the "Biological Hazards Surveillance Program" (BHSP) and the "Salmonella control program" (SCP), from 2016-2019 and 2018-2019, respectively. The IRTA culture collection provided 12 Salmonella spp. genomes isolated from dry fermented sausages and 2 from pork from 1997-2018.
For French data, the ANSES database of the Salmonella Network was queried for Salmonella spp. isolates collected from 2002 to 2022 from pig carcass, pork, and sausages (including DFS and "fresh sausages" made from pork and spices). A total of 4717 references were obtained and evaluated to determine the proportion of the main serovars over time in the pork production chain in France. Of those, 143 isolates had the genome available, and 123 genomes were selected for this study. For genomes of pig carcass and pork origin, sample duplicates were removed and the French regional pig carcass and pork production data 67 were considered to balance the number of genomes for each French region (i.e., n = 35 genomes from pig carcasses and n = 35 from pork). All Salmonella genomes available from dry fermented sausages (n = 37) and fresh sausages (n = 16) were selected.
Specific information for Spanish and French Salmonella genomes, including matrix, is summarised in Supplementary Table S1.

Genome sequencing and bioinformatics
Genomic DNA of the 50 IRTA Salmonella spp. isolates was extracted and isolated with the QIAamp DNA Mini QIAcube Kit (QIAGEN) with the automatic QIAcube sample preparation system (QIAGEN). DNA was quantified spectrophotometrically (µDrop plate, Thermo Fisher Scientific, Waltham, MA, USA) and fluorometrically (Quant-iT™ 1X dsDNA HS Assay Kit, Invitrogen, Merelbeke, Belgium) in a Varioskan™ multiplate reader (Thermo Fisher Scientific, USA). DNA samples were sent to Macrogen, Inc (South Korea) for library preparation and sequencing. Nextera DNA XT technology (Illumina) was used for library preparation and indexing according to the manufacturer's recommendations. Paired-end sequencing (2 × 150 bases) was performed with an Illumina NovaSeq6000 sequencer.
The 123 French isolates were previously sequenced using Illumina chemistry producing paired-end reads as described by Radomski et al. 68 and Sévellec et al. 17 .
Spanish and French raw reads were quality checked and filtered as described by De Sousa Violante et al. 69 and with an in-house pipeline. In brief, Trimmomatic v0.40 70 was used for the trimming step, FastQC v0.11.5 to check the read quality and ConFindr v0.8.1 to identify intra- and cross-species contamination 71. An in silico PCR was performed to confirm the monophasic S. Typhimurium variant according to the primers described in the ISO/CD TS 6579-4 72.
The metadata of the final panel of 173 Salmonella spp. genomes set for bioinformatic analysis are reported in Supplementary Table S1.
The maximum likelihood phylogenomic trees were constructed from cgSNP results using RAxML v8.2.10 76, with the evolutionary model GTRCAT and 100 bootstraps. Trees were visualized and annotated using interactive Tree Of Life (iTOL) 77. A cutoff of 20 cgSNPs was set to define clusters of closely related isolates, based on the short-term substitution rate of 1-2 SNPs per genome per year for Salmonella 34,36,78 and the range of strain isolation dates (1999-2021), as recommended by the European Food Safety Authority (EFSA) for Salmonella epidemiologically related strains 11.
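The relationship between the 20-cgSNP cutoff, the substitution rate and the sampling window can be checked with a back-of-the-envelope calculation. The Python sketch below is only an illustration under a constant-rate assumption along a single lineage; it is not presented as the derivation used by the authors or by EFSA, and the function name is hypothetical.

```python
def years_to_accumulate(snp_cutoff, snps_per_year):
    """Years needed to accumulate snp_cutoff SNPs along one lineage at a constant rate."""
    if snps_per_year <= 0:
        raise ValueError("substitution rate must be positive")
    return snp_cutoff / snps_per_year


CUTOFF = 20                      # cgSNP cutoff stated in the Methods
RATE_LOW, RATE_HIGH = 1.0, 2.0   # SNPs per genome per year, as cited in the Methods

shortest = years_to_accumulate(CUTOFF, RATE_HIGH)  # fastest rate, fewest years
longest = years_to_accumulate(CUTOFF, RATE_LOW)    # slowest rate, most years
sampling_window = 2021 - 1999                      # isolation date range stated in the Methods

print(f"{CUTOFF} SNPs correspond to {shortest:.0f}-{longest:.0f} years; window = {sampling_window} years")
```

Under these assumptions, 20 SNPs correspond to roughly 10 to 20 years of divergence, which is of the same order as the 22-year isolation window.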
In silico detection of resistance and virulence genes from the ResFinder v4.4.2, BacMet v2.0 and VFDB v4.0 databases was performed using an in-house pipeline, whereas the detection of SPI and plasmids was performed through the SPIFinder v2.0 and PlasmidFinder v2.0.1 databases, respectively, available online at the Center for Genomic Epidemiology (CGE) (Denmark). The minimum threshold of genetic identity was set at 90% for the in-house pipeline and 95% for the online databases, and the coverage at 80% in both cases.
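The identity and coverage thresholds described above amount to a simple filter on gene hits. The following Python sketch shows one way to apply them; the Hit record, the example values and the inclusive comparison (>=) are assumptions made for illustration, since the source does not describe the output format of the in-house pipeline.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    gene: str
    identity: float  # percent identity to the database sequence
    coverage: float  # percent of the database sequence covered


def filter_hits(hits, min_identity, min_coverage=80.0):
    """Keep hits meeting both thresholds (inclusive comparison is an assumption)."""
    return [h for h in hits if h.identity >= min_identity and h.coverage >= min_coverage]


# Hypothetical hits for a single genome
hits = [
    Hit("tet(A)_6", identity=99.8, coverage=100.0),
    Hit("sul1_5", identity=92.0, coverage=85.0),
    Hit("aadA2_1", identity=97.0, coverage=60.0),
]

in_house = filter_hits(hits, min_identity=90.0)  # threshold stated for the in-house pipeline
online = filter_hits(hits, min_identity=95.0)    # threshold stated for the online databases
print([h.gene for h in in_house], [h.gene for h in online])
```

With these example values, the hit at 92% identity passes the in-house threshold but not the stricter online one, and the hit with 60% coverage is rejected in both cases.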

Figure 1. Serovar distribution of the main Salmonella serovars in the pork production chain in France from 2008 to 2020.

Figure 2. Minimum-spanning tree based on cgMLST analysis of the 173 Salmonella isolates. Each node represents a cgST. The node size is proportional to the number of isolates sharing the same genotype. The branch lengths correspond to allelic differences in log-scale. Clusters formed by nodes with a maximum of 10 allelic differences were labelled with coloured halos and, in parentheses, the number of different alleles between the most distant isolates in the cluster and the country of origin (SP: Spain; FR: France). Node colouring corresponds to the serovar in A and to the matrix type in B.

Figure 3. SNP core phylogenomic tree of 110 S. Typhimurium and S. 1,4,[5],12:i:- isolates including metadata, ST and accessory genome. The tree was constructed using the LT2 reference genome. A cutoff of ≤ 20 SNPs highlighted 16 clusters indicated with numbers and letters (e.g., 1T stands for Cluster 1 of S. Typhimurium and 1M stands for Cluster 1 of the monophasic variant, S. 1,4,[5],12:i:-) and, in parentheses, the number of different cgSNPs between the most distant isolates in the cluster. Outbreak-related isolates are indicated with a black triangle. Matrix origin and geographic location are indicated with a coloured strip. Sampling year and sequence type (ST) are indicated as labels. Mobile genetic elements (black), plasmids (orange), antimicrobial resistance genes (green), biocide resistance genes (pink) and virulence factor genes (blue) are indicated as a heat map. The accessory genome genes that were found in all the isolates are not represented.

Figure 4. SNP core phylogenomic tree of 45 S. Derby isolates, including metadata, ST (ST39 and ST40 (A) and ST71 (B)) and accessory genome. The tree was constructed using RM006 as reference genome. A cutoff of ≤ 20 SNPs highlighted 9 clusters indicated with numbers and letters, e.g., 1D stands for Cluster 1 of S. Derby, next to the strain label with the cgSNP value in parentheses. Outbreak-related isolates are indicated with a black triangle. Matrix origin and geographic location or region are indicated with a coloured strip. Sampling year and sequence type (ST) are indicated as labels. Mobile genetic elements (black), plasmids (orange), antimicrobial resistance genes (green), biocide resistance genes (pink) and virulence factor genes (blue) are indicated as a heat map. The accessory genome genes that were found in all the isolates of the S. Derby serovar are not represented in the figure.

Figure 5. SNP core phylogenomic tree of the 11 S. Rissen isolates including metadata, ST and accessory genome. The tree was constructed using GJ0703-2 as reference genome. A cutoff of ≤ 20 SNPs highlighted 1 cluster indicated with a number and letter, e.g., 1R stands for Cluster 1 of S. Rissen, next to the strain label with the cgSNP value in parentheses. Outbreak-related isolates are indicated with a black triangle. Matrix origin and geographic location or region are indicated with a coloured strip. Sampling date, in years, and sequence type (ST) are indicated as labels. No mobile genetic elements were observed. Plasmids (orange), antimicrobial resistance genes (green), biocide resistance genes (pink) and virulence factor genes (blue) are indicated as a heat map. The accessory genome genes that were found in all the isolates of the S. Rissen serovar are not represented in the figure. https://doi.org/10.1038/s41598-024-62141-9

Table 1. Summary of the Salmonella serovar distribution (%) in the different matrixes studied (pig carcass, pork meat, fresh sausage, dry fermented sausages (DFS)) determined in silico using SeqSero+, with the monophasic variant of S. Typhimurium confirmed by in silico PCR. Dash (-): serovar not present in the panel.