Abstract
Klebsiella pneumoniae is an important nosocomial infectious agent with a high antimicrobial resistance (AMR) burden. The application of long read sequencing technologies is providing insights into bacterial chromosomal and putative extra-chromosomal genetic elements (PEGEs) associated with AMR, but also epigenetic DNA methylation, which is thought to play a role in cleavage of foreign DNA and expression regulation. Here, we apply the PacBio sequencing platform to eight Portuguese hospital isolates, including one carbapenemase producing isolate, to identify methylation motifs. The resulting assembled chromosomes were between 5.2 and 5.5Mbp in length, and twenty-six PEGEs were found. Four of our eight samples carry blaCTX-M-15, a dominant Extended Spectrum Beta Lactamase in Europe. We identified methylation motifs that control Restriction–Modification systems, including GATC of the DNA adenine methylase (Dam), which methylates N6-methyladenine (m6A) across all our K. pneumoniae assemblies. There was a consistent lack of methylation by Dam of the GATC motif downstream of two genes: fosA, a locus associated with low level fosfomycin resistance, and tnpB transposase on IncFIB(K) plasmids. Overall, we have constructed eight high quality reference genomes of K. pneumoniae, with insights into horizontal gene transfer and methylation m6A motifs.
Similar content being viewed by others
Introduction
Klebsiella pneumoniae (Kp) are Gram-negative bacteria that are found in the normal flora of the mouth, intestines, skin and faeces, but in other parts of the body, such as the lungs, can cause severe morbidity with a diverse disease spectrum that can culminate in complicated invasive infections. This pathogen is increasingly recognized as an important etiological agent of healthcare associated infections. Kp has also been identified as a key route of introduction and dissemination of antimicrobial resistance (AMR) genes into other clinically significant pathogens1,2. In Europe, Kp with resistance to fluoroquinolones and carbapenems, and third-generation cephalosporins has been increasing, leading to reduced treatment options3.
The genome sequencing of Kp clinical isolates can provide insights into AMR, but also epigenetic information superimposed over nucleotide sequences4. The formation of epigenetic lineages enables the adaptation of bacterial populations to harsh or changing environments and modulates the interaction of pathogens with their eukaryotic hosts5,6,7. Epigenetic signals control DNA–protein interactions and can cause phenotypic change in the absence of mutation8. A common mechanism of epigenetic signalling is DNA methylation by orphan methyltransferases (MTases) such as Dam, which have roles in chromosome replication and segregation, nucleoid organization, cell cycle control, and DNA repair4,8,9. DNA methylation is also the key element of Restriction–Modification (R–M) systems that not only provide defence against foreign DNA, but also encourage bacterial evolution by driving the persistence of plasmids and other mobile genetic elements10,11.
Single Molecule, Real-Time (SMRT) platforms detect DNA modifications by measuring variation in the polymerase kinetics of DNA base incorporation during sequencing. The approach has the ability to detect genome-wide MTase N6-methyladenine (m6A) and N4-methylcytosine (m4C) target motifs at coverage levels recommended for assembly, and reveal phase variation of related genes12. DNA adenine methylase (Dam), which methylates m6A in the GATC sequence, plays a key role in DNA mismatch repair, as well as in bacterial virulence and gene expression, including in some strains of Kp10,13,14,15. R–M systems have also been observed on bacterial plasmids where they may contribute to their maintenance11. Kp is known to have type I and type II R–M systems16. The two systems have different mechanics and their distinct motif types have been reviewed elsewhere4,8. Briefly, type I R–M systems consist of specificity, modification and restriction subunits. The first two subunits are usually located together on a chromosome and are under the control of the same promoter17. The restriction subunit in the type I R–M system recognises long bipartite motifs, such as GGCAN8TCG. While not part of complete type II R–M, Dam is the most common MTase in Gamma-proteobacteria and it recognises palindromic 5′-GATC-3′ motifs4,8.
Here we applied SMRT sequencing to eight Kp isolates from Portugal with antibiotic susceptibility phenotyping to characterise the bacterial epigenome and explore the relationship between methylation and AMR. We focused on methylation, including around AMR genes, and on differences in the abundance of R–M recognition motifs on Kp chromosomal and mobile genetic elements. The abundance of some target methylation motifs was different between chromosomes and plasmids, especially the GATC motifs methylated by orphan MTase Dam. We also found that a GATC motif immediately downstream of the fosA gene, which confers low level fosfomycin resistance18, is consistently unmethylated in our samples. Isolates that had the tnpB transposase gene19 on the IncFIB(K) plasmid also consistently lacked methylation immediately downstream of this gene.
Results
Genome assemblies and phylogeny
Eight Kp isolates with different multi-locus sequence types (MLSTs)20 were sourced from two hospitals in Lisbon, Portugal, between 2005 and 2013, and sequenced on the PacBio RSII technology (Table 1). The assembled chromosomes were between 5.2 and 5.5 Mbp in length and had GC content values between 57.2 and 57.7%. By contrast, 26 putative extra-chromosomal genetic elements (PEGEs) ranged in length between 3.6 and 284.2 Kbp and did not segregate into clusters based on sequence length. The PEGEs have a GC content between 41.4 and 54.1%, except for an outlying 9.9 Kbp plasmid with 60.1%. This outlying plasmid had 100% coverage and 93% identity to several plasmids of Gram-negative bacteria (e.g. CP027616.1, CP023430.1) (Table 1). To put our samples in a broader context we constructed a maximum likelihood phylogenetic tree based on the alignment of seven MLST informative and nineteen core genome loci, which contained 83 representative Kp MLST groups sourced from the NCBI database (Fig. 1)21. All our isolates were the nearest neighbour of isolates with the same MLST, but there was varying heterogeneity in genetic distance between the samples from same MLST, resulting from using 19 additional loci that could differentiate geographical and temporal differences. For example, the nearest neighbour to our Kp3860 isolate (ST307) was one collected in Malta in the same year with an identical MLST and no sequence differences across the genes analysed. On the other hand, our Kp2209 isolate had some divergence from another ST133 sample isolated in Thailand eight years earlier.
Antimicrobial resistance loci
Consistent with their Portuguese origin, four of our eight samples carry blaCTX-M-15, a dominant extra-spectrum beta-lactamase in Europe and a source of resistance to third-generation cephalosporins22. Two isolates, Kp1675 and Kp1264, carry the gene on a chromosomal region flanked by Tn2 (Tn3 family) and IS26 (family IS6) mobile genetic elements. Two further isolates, Kp3860 and Kp2209, carry blaCTX-M-15 gene on IncFIB(K) plasmids. Finally, in addition to chromosomal blaCTX-M-15, Kp1264 also carries it on IncFIA(HI1) plasmid which belongs to the same incompatibility group as IncFIB(K)23. All five blaCTX-M-15 fragments (4 isolates) are embedded in the 9500 nt sequence that is near identical between samples (> 99.98% similarity), though Kp2209 has only 50% coverage compared to 93% coverage for the other sequences. This 9500 nt sequence also carries quinolone and aminoglycoside resistance genes. In three isolates (Kp1675, Kp1264 and Kp3860), the region containing blaCTX-M-15 also contains blaOXA-1, which is a penicillinase and a major correlate of resistance to piperacillin/tazobactam and co-amoxiclav in Kp and E. coli, and is commonly associated with co-carriage of aac(6′)-Ib-cr, which restricts aminoglycoside and fluoroquinolone treatment options24,25. Only the Kp2564 isolate bore a KPC-3 carbapenemase coding gene (blaKPC-3), which is located on an IncFIA plasmid. This carbapenemase is thought to have originated in clonal group 258 (CG258), to which this sample belongs26. KPC-3 has now spread across Kp MLSTs globally, including in Portugal27, where its prevalence has increased dramatically over the last decade28. All samples were tested for susceptibility to major anti-microbials (Table S1). Sample Kp2564 was the only one resistant to imipenem, which is consistent with presence of blaKPC-3. Of all samples only Kp1264 appeared resistant to fosfomycin.
Methylation analysis
Seven of the eight samples had sufficient quality and coverage to perform DNA methylation analysis using the SMRT portal. For the other sample (Kp2564), we successfully identified methylation motifs, but low sequencing coverage meant that for non-GATC motifs the detection of motifs methylation state was unreliable. After comparing the generality of motifs versus the frequency of motif methylation (Figure S1) for each isolate, we removed the motifs for which less than 60% of motif occurrences were methylated. For Kp2564 we focused only on GATC, AACN5RTGC and GCAYN5GTT motifs, the latter two being part of same type I bipartite recognition sequence. While our raw results did contain four m4C modifications, the highest modification rate was 17% and therefore omitted from further analysis.
All seven high quality assemblies contained a GATC recognition motif of the Dam based type R–M system. A total of ten type I R–M system motifs were found. For each of these ten we identified specific MTases (Table 2) by comparing their specificity subunits to those in the REBASE database16. For all but one specificity subunit, REBASE has 100% matches of the DNA target recognition domain. The recognition sequence motif on plasmid_5 of Kp2564 was the exception with 82% highest amino acid identity in the target domain. For Kp1363, SMRT Analysis returned four motifs (CGACCN4TGG, CGAYDN4TGG, CCAN4GATCA and CCAN5RTCG) without partner motif strings. The first and second motifs are imperfect reverse complements of the fourth. The third motif is likely to be recognised by the same specificity subunit, but may show low methylation (60%) due to the presence of a type II GATC motif within the type I motif. Given the highly similar number of sites (2336 for the first three motifs versus 2311 for the fourth), the similarity of motifs and only two specificity proteins in the assembly, we suspect that the four motifs are recognised by the same specificity subunit. A problematic sequence was GAAAN6GGG in Kp3860 for which the SMRT Analysis pipeline did not return a partner motif, nor plausible reverse complement. The assembly has one specificity protein, but has no exact match to known recognition sequences in REBASE, with the closest being GAGN6GGG from an E. coli specificity subunit (S.Eco77I), with identity 36% (e-value 7e-72).
Due to a lack of a corresponding restriction enzyme in Kp, the GATC motif is not involved in the defence against foreign DNA8. To assess if this is reflected in the number of motifs on the Kp chromosome and PEGEs, we examined the relative abundance of GATC motifs, as measured by the ratio of total length of observed motifs to those of genomic sequence. For the GATC motif, there was much greater abundance on the chromosomes compared to PEGEs (Wilcoxon P < 3 × 10−6) (Table S2, Fig. 2a). We found the same result in 673 high quality Kp assemblies from the NCBI RefSeq database (all N50s > 4.5Mbp; Fig. 2b), where the GATC abundance on the chromosome (mean 2.25%; standard deviation 0.017%) was greater than on PEGEs (mean 1.52%; std. dev. 0.088%) (Wilcoxon P < 0.001). We also investigated the abundance of type I R–M system motifs (i.e. non-GATC) by comparing the share of chromosomes occupied by the motifs that are recognised by assembly’s MTases versus the share occupied by recognition sequences from other assemblies. There was no strong difference in the abundance of type I recognition motifs on chromosomes or between chromosomes and PEGEs (Wilcoxon P > 0.230). Similarly, there was no strong difference between PEGEs that were recognised by their own assembly’s R–M system compared to those that were not recognised (Wilcoxon P = 0.187).
Unmethylated motifs
Methylated motifs (see Table 2) covered a large portion of the genome and were relatively evenly distributed. Therefore, we focused on unmethylated motifs 50bps upstream and downstream of the genes. Upstream sequences capture gene promoters, whereas downstream sequences provide a control. We focused on the genes and motifs that have higher than expected number of unmethylated motifs (Table 3) in most isolates (> 3/7) (see Data S1 for the full list of identified genes). All such cases had a GATC motif, with only fosA and tnpB IS3 family (IS2 group) transposase genes having a consistently unmethylated motif downstream. TnpB is a key component of mobile genetic elements’ transposition mechanism, and in four samples that had the gene, it was located on a large (197–284kbp) IncFIB(B) plasmid containing various heavy metal and AMR loci. The fosA gene provides Kp with an inherent low-level resistance to the fosfomycin antibiotic29,30. Only one isolate (Kp1264) was resistant to fosfomycin (Table S1), but it was one of two isolates (Kp1264 and Kp1363; Fig. 3) that had the R–M system inserted after fosA. Both fosA and tnpB genes lacked methylation at a GATC motif downstream of the gene, so the unmethylated GATC should not be directly affecting promoter region; however, the lack of methylation may indicate that MTase cannot access the site.
Of the seven isolates with high quality site methylation data, only Kp1264 (ST147) and Kp1363 (ST76) did not have the unmethylated motif downstream of fosA. In both isolates, the required GATC motif was missing due to insertion of an endonuclease gene (> 3 kbp) (Fig. 3). To assess the frequency of this insertion we analysed available Kp assemblies (n = 3584) from the NCBI database. We found that 99.4% of assemblies had the syrM1 gene upstream of chromosomal fosA. The exceptions were 23 samples from diverse geographical regions and sequence types (Table S3). Further, we observed a limited variety of genes downstream on the opposite strand of fosA (Table 4). The top five genes accounted for 96.2% of the assemblies, with the predominant one being the GntR family transcription regulator (68.3%)31. For the ST147 sequence type (NCBI, n = 131), we found most assemblies (130/131) had toxic protein SymE in the same position. The outlying assembly (GCA_004145685.1_ASM414568v1_genomic) had a truncated version of SymE due to the scaffold ending in the coding sequence. For the ST76 sequence type, all NCBI isolates (n = 7) and Kp1363 (one locus mismatch) had coding sequence of endoribonuclease SymE downstream of fosA (Table 4).
For a random subset of unmethylated motifs (from Table 3), we examined the IPDs of individual reads to evaluate the existence of distinct subpopulations with regard to the methylation status of genes, thereby investigating potential intra-populational epigenetic diversification (Figure S4). Standard SMRT Analysis output files report the methylated fraction of reads in the sample, but only for motifs that were called as methylated. We modified the SMRT Analysis algorithms to produce per base IPD values for all nucleotides. After analysing the underlying reads, we found that IPD levels in some genes, including fosA and dksA (DnaK suppressor protein), differentiated two subpopulations corresponding to methylated and unmethylated cells. By contrast, we observed no methylated subpopulation in mglB (D-ribose transporter subunit RbsB).
Discussion
Across seven Kp isolates with high quality PacBio data, we identified the common Dam based methylation mechanism involving GATC motifs, as well as several type I R–M systems. With a type I R–M system, we observed that the abundance of recognition motifs did not vary between chromosomes of different assemblies, but that some depletion of recognition motifs has occurred in PEGEs. This observation supports the role of type I R–M system as a defence mechanism against invasion by foreign DNA, but not its regulatory function, as chromosomal abundance was not correlated to specificity subunit. In contrast, there was a clear difference in abundance of the GATC motif on chromosomes and PEGEs, which suggests that the motif has a function as identified by previous research8,9.
Because GATC motifs are known to have a regulatory function, we expected to find that some genes had consistently unmethylated GATC in gene promoter regions8,9. However, the finding of consistently unmethylated GATC downstream of genes (e.g. fosA and tnpB) is intriguing. This outcome may be an indication of a secondary structure that is inaccessible to MTase, or that the GATC motif downstream of fosA and tnpB is a distant promoter or regulatory region. The phenotypic impact of this GATC motif requires follow-up experiments ideally via single nucleotide site-directed mutagenesis as to prevent methylation with minimum downstream effects. Analysis of this region has also led us to identify the limited variety of recombination events around fosA and their potential MLST specificity. Toxic protein SymE is part of the plasmid toxin-antitoxin system. To assess its prevalence in Kp we evaluated 131 randomly chosen ST147 samples from the NCBI database. We found the same fosA-SymE sequence in all but one of them, but in no other major ST. The potential impact of the SymE insertion on fosfomycin resistance should be evaluated in follow-up experiments. The other coding sequences in that location do not appear to be part of the toxin-antitoxin system32, and therefore this locus has the potential to evaluate the robustness of a Kp phylogeny based on sequence types. We would expect that sequence types with the same insertion would cluster together, and the opposite observation would indicate possible recombination of MLST genes.
The general lack of resistance to fosfomycin in Kp species, despite decades of active use of fosfomycin for infection control30,33, is in contrast to the emergence of resistance to other antimicrobials. However, Kp has been reported to acquire resistance in vitro within 24 hours of exposure to fosfomycin34; though this resistance does not seem to persist as clinical studies report the limited spread of fosfomycin resistance33. The identified unmethylated GATC motif downstream of fosA may be correlated with the rapid acquisition of fosfomycin resistance in individual isolates. Our dataset is too small to draw any robust statistical inference, but the only sample in our study resistant to fosfomycin, Kp1264, is one of the two samples which lacks a GATC motif downstream of fosA (Fig. 3). The plasmid toxin-antitoxin system downstream of fosA in both Kp1363 and Kp1264 requires constant transcription, which means the genomic region is accessible to transcription machinery; whereas unmethylated GATC may be the result of DNA regional conformation inaccessible to Dam. It is unlikely that a lack of methylation downstream of the fosA gene is itself the fosfomycin resistance mechanism. If correlation exists, the more likely explanation is that conformation of the genomic region that prevents methylation, also prevents fosA transcription. This hypothesis is particularly relevant for the treatment of complicated infections by carbapenemase-producing strains, for which fosfomycin is often used in combination with another drug (e.g. colistin) as a last resort therapeutic option35,36.
Overall, our work has provided new insights into methylation and potential MLST specificity. We have generated eight new reference genomes for Kp and analysed their methylomes. The findings reinforce the role of type I motifs as a defence mechanism against foreign DNA. We also identified a higher than expected rates of the GATC motif on Kp chromosomes which, together with absence of respective endonuclease, support existence of Dam function such as DNA mismatch repair37 and gene expression regulation9. We identified two genes, fosA and tnpB, which have an unmethylated GATC motif downstream of each locus. Methylation analysis may be useful for identification of distant regulatory regions or frequent secondary DNA structures. In particular, a lack of methylation downstream of fosA genes could explain not only rapid emergence of resistance in individual samples, but also lack of widespread resistance in Kp. Further, using a bioinformatics approach we detected the presence of both methylated and unmethylated sequencing reads existing within individual samples, potentially representing subpopulations of distinct epigenetic lineages, which could contribute to stochastic phenotypic switching mechanisms in bacteria38,39. Such heterogeneity can be investigated in more depth using a SMALR approach40. This important biology and other findings should be evaluated in larger genomic and functional studies, and could lead to new insights into Kp infection control.
Methods
Sample collection, culture and sequencing
All eight samples were collected from Hospital de Santa Maria (n = 6) and Hospital St Antonio dos Capuchos (n = 2) in Lisbon, Portugal. The samples were cultured and tested for AMR as described previously28. DNA was extracted from strain cultures, grown overnight at 37ºC on Mueller–Hinton Agar. DNA extraction was carried out using the Cetyl trimethylammonium bromide (CTAB) method previously described using Tris–Acetate buffer (10 mM, pH 8)41. Our samples were generated without TET1 oxidation; thus we could not examine 5mC methylation12,42. All DNA samples were sequenced at the Genome Institute Singapore on the PacBio RSII platform.
PCR and Sanger sequencing-based MLST analysis was based on fragments of seven housekeeping genes: rpoB (beta-subunit of RNA polymerase), gapA (glyceraldehyde 3-phosphate dehydrogenase), mdh (malate dehydrogenase), pgi (phosphoglucoseisomerase), phoE (phosphorine E), infB (translation initiation factor 2), and tonB (periplasmic energy transducer). Details of the MLST scheme including amplification and sequencing primers, allele sequences and MLSTs are available on the Institute Pasteur’s MLST Web site (https://bigsdb.pasteur.fr/klebsiella/klebsiella.html).
Bioinformatic analysis
A summary of the bioinformatic and analysis pipeline is provided (Figure S3). Data assembly was performed using three different software tools (HGAP3, Canu and Flye). HGAP3 is part of the SMRT Analysis toolkit (version 2.3), while Canu and Flye software are standalone43,44,45. Short read polishing was not performed due to > 100-fold long read coverage for each sample. We have aligned raw reads to the assemblies and examined region ± 2000 nt around fosA gene in each assembly. We did not observe46 any SNPs, InDels or alignment abnormalities. Between samples, the coverage of the region ranged from 77- to 219-fold. Generated assemblies were compared using the Busco enterobacteriales (odb10) dataset47. For each sample the assembly with highest Busco complete gene count was selected. The sum of complete and fragmented genes yielded the same “best” assemblies. While the HGAP3 assembly had marginally higher N50 score, in most cases its Busco scores were substantially lower. All but one of the selected assemblies had a scaffold longer than 5Mbp. The outlying assembly, Kp1675, had a chromosome split into two scaffolds which was considered an acceptable trade-off for a 5.3% higher share of complete genes. The in silico resistance profile, MLST, O-locus and K-locus types were determined using the kleborate tool48,49. Plasmid and Insertion Sequence (IS) identities were established using PlasmidFinder50 and ISFinder51.
Methylation analysis
We used the generated assemblies to perform methylation detection by applying the PacBio SMRT Analysis toolkit (v2.3). To discriminate between natural variation in IPD and true methylation the toolkit relies on expected distribution of unmethylated IPDs given the nucleotide sequence context. When IPD value exceeds the probability threshold, we used default p-value 0.01, the relevant base is called as modified. The calculations are implemented in KineticWorker.py, which is part of SMRT Analysis toolkit52. After comparing the generality of motifs versus the frequency of motif methylation for each isolate, we removed the motifs for which less than 60% of motif occurrences were methylated. We retained two motifs for the Kp2564 isolate which had methylation fractions below 60% (Table 2), where the low level of methylation was result of insufficient sequencing coverage.
NCBI datasets
To assess the abundance of GATC motif on Kp chromosomes, we downloaded from the NCBI RefSeq database all Kp assemblies where their longest chromosomal contig was at least 4.5Mbps (n = 673)53. For an analysis of the prevalence of GATC motif downstream of fosA gene, we downloaded all Kp assemblies in the NCBI pathogen database (as of November 15, 2019; n = 3584)21. For reconstruction of a Kp phylogenetic tree we determined the MLST20 of each of the 3584 assemblies using kleborate software48. For each MLST which had more than nine samples from different geographical locations (city, region, or country) and year of collection data, we included a single randomly selected assembly for phylogenetic reconstruction. A total of 83 isolates were used in the construction of the phylogenetic tree. The tree was constructed using IQTREE software54 with the GTR + F + G4 model.
Ethical statement
The study was approved by the Hospital de Santa Maria and Hospital St Antonio dos Capuchos Ethics Committees, and all methods were performed in accordance with the relevant guidelines and regulations, including informed consent from all patients.
Data availability
The assembled sequences are deposited in the European Nucleotide Archive (project PRJEB38289). Accession numbers and metadata are presented in Table S1. The authors confirm all supporting data, code and protocols have been provided within the article or through supplementary files.
References
Wyres, K. L. & Holt, K. E. Klebsiella pneumoniae as a key trafficker of drug resistance genes from environmental to clinically important bacteria. Curr. Opin. Microbiol. 45, 131–139 (2018).
Navon-Venezia, S., Kondratyeva, K. & Carattoli, A. Klebsiella pneumoniae: a major worldwide source and shuttle for antibiotic resistance. FEMS Microbiol. Rev. 41, 252–275 (2017).
European Centre for Disease Prevention and Control. Surveillance of antimicrobial resistance in Europe 2018. (2018).
Blow, M. J. et al. The epigenomic landscape of prokaryotes. PLOS Genet. 12, e1005854 (2016).
Beaulaurier, J. et al. Metagenomic binning and association of plasmids with bacterial host genomes using DNA methylation. Nat. Biotechnol. 36, 61–69 (2018).
Phelan, J. et al. Methylation in Mycobacterium tuberculosis is lineage specific with associated mutations present globally. Sci. Rep. 8(1), 160. https://doi.org/10.1038/s41598-017-18188-y (2018).
Estibariz, I. et al. The core genome m5C methyltransferase JHP1050 (M.Hpy99III) plays an important role in orchestrating gene expression in Helicobacter pylori. Nucleic Acids Res. 47, 2336–2348 (2019).
Sánchez-Romero, M. A., Cota, I. & Casadesús, J. DNA methylation in bacteria: From the methyl group to the methylome. Curr. Opin. Microbiol. 25, 9–16 (2015).
Adhikari, S. & Curtis, P. D. DNA methyltransferases and epigenetic regulation in bacteria. FEMS Microbiol. Rev. 40, 575–591 (2016).
Nye, T. M. et al. DNA methylation from a type I restriction modification system influences gene expression and virulence in streptococcus pyogenes. PLoS Pathog. 15(6), e1007841. https://doi.org/10.1371/journal.ppat.1007841 (2019).
Wang, R., Lou, J. & Li, J. A mobile restriction modification system consisting of methylases on the IncA/C plasmid. Mob. DNA 10, 26 (2019).
Beaulaurier, J., Schadt, E. E. & Fang, G. Deciphering bacterial epigenomes using modern sequencing technologies. Nat. Rev. Genet. 20, 157–172 (2019).
Casselli, T. et al. DNA methylation by restriction modification systems affects the global transcriptome profile in Borrelia burgdorferi. J. Bacteriol. 200(24), e00395–18. https://doi.org/10.1128/JB.00395-18 (2018).
Pirone-Davies, C. et al. Genome-wide methylation patterns in Salmonella enterica subsp. enterica Serovars. PLoS ONE 10(4), e0123639. https://doi.org/10.1371/journal.pone.0123639 (2015).
Kumar, S. et al. N4-cytosine DNA methylation regulates transcription and pathogenesis in Helicobacter pylori. Nucleic Acids Res. 46, 3429–3445 (2018).
Roberts, R. J., Vincze, T., Posfai, J. P. & Macelis, D. REBASE: Restriction enzymes and methyltransferases. Nucleic Acids Res. 31, 418–420 (2003).
Murray, N. E. Type I Restriction Systems: Sophisticated Molecular Machines (a Legacy of Bertani and Weigle). Microbiol. Mol. Biol. Rev. 64, 412–434 (2000).
Elliott, Z. S. et al. The Role of fosA in challenges with fosfomycin susceptibility testing of multispecies Klebsiella pneumoniae carbapenemase-producing clinical isolates. J. Clin. Microbiol. 57(10), e00634–19. https://doi.org/10.1128/JCM.00634-19 (2019).
Bao, W. & Jurka, J. Homologues of bacterial TnpB-IS605 are widespread in diverse eukaryotic transposable elements. Mob. DNA 4, 12 (2013).
Diancourt, L., Passet, V., Verhoef, J., Grimont, P. A. D. & Brisse, S. Multilocus sequence typing of Klebsiella pneumoniae nosocomial isolates. J. Clin. Microbiol. 43, 4178–4182 (2005).
Agarwala, R. et al. Database resources of the national center for biotechnology information. Nucleic Acids Res. https://doi.org/10.1093/nar/gkx1095 (2018).
Kazmierczak, K. M., de Jonge, B. L. M., Stone, G. G. & Sahm, D. F. Longitudinal analysis of ESBL and carbapenemase carriage among Enterobacterales and Pseudomonas aeruginosa isolates collected in Europe as part of the international network for optimal resistance monitoring (INFORM) global surveillance programme, 2013–17. J. Antimicrob. Chemother. https://doi.org/10.1093/jac/dkz571 (2020).
Partridge, S. R., Kwong, S. M., Firth, N. & Jensen, S. O. Mobile genetic elements associated with antimicrobial resistance. Clin. Microbiol. Rev. 31, e00088-e117 (2018).
Sugumar, M., Kumar, K. M., Manoharan, A., Anbarasu, A. & Ramaiah, S. Detection of OXA-1 β-lactamase gene of Klebsiella pneumoniae from blood stream infections (BSI) by conventional PCR and in-silico analysis to understand the mechanism of OXA mediated resistance. PLoS ONE 9(3), e91800. https://doi.org/10.1371/journal.pone.0091800 (2014).
Xiang Yang Zhou, Bordon, F., Sirot, D., Kitzis, M. D. & Gutmann, L. Emergence of clinical isolates of Escherichia coli producing TEM-1 derivatives or an OXA-1 β-lactamase conferring resistance to β-lactamase inhibitors. Antimicrob. Agents Chemother. 38, 1085–1089 (1994).
Chen, L. et al. Carbapenemase-producing Klebsiella pneumoniae: Molecular and genetic decoding. Trends Microbiol. 22, 686–696 (2014).
Rodrigues, C. et al. KPC-3-producing Klebsiella pneumoniae in Portugal linked to previously circulating non-CG258 lineages and uncommon genetic platforms (Tn4401d-IncFIA and Tn4401d-IncN). Front. Microbiol. 7, 1000. https://doi.org/10.3389/fmicb.2016.01000 (2016).
Perdigão, J. et al. Whole-genome sequencing resolves a polyclonal outbreak by extended-spectrum beta-lactam and carbapenem-resistant Klebsiella pneumoniae in a Portuguese tertiary-care hospital. Microb. Genomics https://doi.org/10.1099/mgen.0.000349 (2020).
Ito, R. et al. Widespread fosfomycin resistance in gram-negative bacteria attributable to the chromosomal fosA gene. MBio 8, e00749-e817 (2017).
Aghamali, M. et al. Fosfomycin: mechanisms and the increasing prevalence of resistance. J. Med. Biol. 68, 11–25 (2019).
Suvorova, I. A., Korostelev, Y. D. & Gelfand, M. S. GntR family of bacterial transcription factors and their DNA binding motifs: Structure, positioning and co-evolution. PLoS ONE 10, e0132618 (2015).
Shao, Y. et al. TADB: A web-based resource for Type 2 toxin-antitoxin loci in bacteria and archaea. Nucleic Acids Res. 39, D606–D611 (2011).
Karageorgopoulos, D. E., Wang, R., Yu, X.-H. & Falagas, M. E. Fosfomycin: evaluation of the published evidence on the emergence of antimicrobial resistance in Gram-negative pathogens. J. Antimicrob. Chemother. 67, 255–268 (2012).
Diep, J. K., Sharma, R., Ellis-Grosse, E. J., Abboud, C. S. & Rao, G. G. Evaluation of activity and emergence of resistance of Polymyxin B and ZTI-01 (fosfomycin for injection) against KPC-producing klebsiella pneumoniae. Antimicrob. Agents Chemother. 62(2), e01815–17. https://doi.org/10.1128/AAC.01815-17 (2018).
Grabein, B., Graninger, W., Rodríguez Baño, J., Dinh, A. & Liesenfeld, D. B. Intravenous fosfomycin—back to the future Systematic review and meta-analysis of the clinical literature. Clin. Microbiol. Infect. 23, 363–372 (2017).
Popovic, M., Steinort, D., Pillai, S. & Joukhadar, C. Fosfomycin: An old, new friend?. Eur. J. Clin. Microbiol. Infect. Diseases 29, 127–142 (2010).
Marinus, M. G. DNA Mismatch Repair. EcoSal Plus 5, (2012).
Beaumont, H. J. E., Gallie, J., Kost, C., Ferguson, G. C. & Rainey, P. B. Experimental evolution of bet hedging. Nature 462, 90–93 (2009).
Casadesús, J. & Low, D. A. Programmed heterogeneity: Epigenetic mechanisms in bacteria. J. Biol. Chem. 288, 13929–13935 (2013).
Beaulaurier, J. et al. Single molecule-level detection and long read-based phasing of epigenetic variations in bacterial methylomes. Nat. Commun. 6, 1–12 (2015).
Parish, T., Stoker, N. G., van Soolingen, D., de Haas, P. E. W. & Kremer, K. Restriction Fragment Length Polymorphism Typing of Mycobacteria. in Mycobacterium Tuberculosis Protocols 165–203 (Humana Press, 2003). http://dx.doi.org/https://doi.org/10.1385/1-59259-147-7:165
Clark, T. A. et al. Enhanced 5-methylcytosine detection in single-molecule, real-time sequencing via Tet1 oxidation. BMC Biol. 11, 4 (2013).
Chin, C. S. et al. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods 10, 563–569 (2013).
Koren, S. et al. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27, 722–736 (2017).
Kolmogorov, M., Yuan, J., Lin, Y. & Pevzner, P. A. Assembly of long, error-prone reads using repeat graphs. Nat. Biotechnol. 37, 540–546 (2019).
Robinson, J. T., Thorvaldsdóttir, H., Wenger, A. M., Zehir, A. & Mesirov, J. P. Variant review with the integrative genomics viewer. Can. Res. 77, e31–e34 (2017).
Seppey, M., Manni, M. & Zdobnov, E. M. BUSCO: Assessing genome assembly and annotation completeness. in Methods in Molecular Biology 1962, 227–245 (Humana Press Inc., 2019).
Wyres, K. L. et al. Identification of Klebsiella capsule synthesis loci from whole genome data. Microb. Genomics 2, e000102 (2016).
Wick, R. R., Heinz, E., Holt, K. E. & Wyres, K. L. Kaptive web: User-Friendly capsule and lipopolysaccharide serotype prediction for Klebsiella genomes. J. Clin. Microbiol. 56, 197–215 (2018).
Carattoli, A. et al. In Silico detection and typing of plasmids using plasmidfinder and plasmid multilocus sequence typing. Antimicrob. Agents Chemother. 58, 3895–3903 (2014).
Siguier, P., Perochon, J., Lestrade, L., Mahillon, J. & Chandler, M. ISfinder: the reference centre for bacterial insertion sequences. Nucleic Acids Res. 34, D32–6. https://doi.org/10.1093/nar/gkj014 (2006).
Flusberg, B. A. et al. Direct detection of DNA methylation during single-molecule, real-time sequencing. Nat. Methods 7, 461–465 (2010).
Tatusova, T. et al. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res. 44, 6614–6624 (2016).
Nguyen, L. T., Schmidt, H. A., Von Haeseler, A. & Minh, B. Q. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Acknowledgements
We gratefully acknowledge the Scientific Computing Group for data management and compute infrastructure at Genome Institute of Singapore for their assistance. The MRC eMedLab computing resource was used for bioinformatics and statistical analysis.
Funding
S.C. is funded by Bloomsbury SET, Medical Research Council UK (MR/M01360X/1, MR/R025576/1, and MR/R020973/1) and BBSRC (Grant No. BB/R013063/1) grants. T.G.C. is funded by the Medical Research Council UK (Grant Nos. MR/M01360X/1, MR/N010469/1, MR/R025576/1, and MR/R020973/1) and BBSRC UK (Grant No. BB/R013063/1). This study was supported in part by UID/DTP/04138/2019 from Fundação para a Ciência e Tecnologia (FCT), Portugal. J.Pe. is supported by FCT through Estímulo Individual ao Emprego Científico (CEECIND/00394/2017).
Author information
Authors and Affiliations
Contributions
J.Pe., S.C., A.D., and T.G.C. conceived and directed the project. A.D. coordinated sample collection. J.P., R.E. and A.M. undertook sample processing and DNA extraction. P.Fd.S., M.L.H. and S.C. coordinated sequencing. A.S, J.Ph. and JC performed bioinformatic and statistical analyses under the supervision of S.C. and T.G.C.. A.S., J.Pe., S.C. and T.G.C. interpreted results. A.S. wrote the first draft of the manuscript. All authors commented and edited on various versions of the draft manuscript and approved the final manuscript. A.S. and T.G.C. compiled the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Spadar, A., Perdigão, J., Phelan, J. et al. Methylation analysis of Klebsiella pneumoniae from Portuguese hospitals. Sci Rep 11, 6491 (2021). https://doi.org/10.1038/s41598-021-85724-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-021-85724-2
This article is cited by
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.