Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Integron cassettes integrate into bacterial genomes via widespread non-classical attG sites

Abstract

Integrons are genetic elements involved in bacterial adaptation which capture, shuffle and express genes encoding adaptive functions embedded in cassettes. These events are governed by the integron integrase through site-specific recombination between attC and attI integron sites. Using computational and molecular genetic approaches, here we demonstrate that the integrase also catalyses cassette integration into bacterial genomes outside of its known att sites. Once integrated, these cassettes can be expressed if located near bacterial promoters and can be excised at the integration point or outside, inducing chromosomal modifications in the latter case. Analysis of more than 5 × 105 independent integration events revealed a very large genomic integration landscape. We identified consensus recombination sequences, named attG sites, which differ greatly in sequence and structure from classical att sites. These results unveil an alternative route for dissemination of adaptive functions in bacteria and expand the role of integrons in bacterial evolution.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: The integron system and the att sites.
Fig. 2: Distribution of integrons across bacteria using the RefSeq NCBI database.
Fig. 3: Cassette integration and excision in genome sites.
Fig. 4: Computational analysis of deep sequencing data.
Fig. 5: Hotspots as donor and receptor sites during conjugation assay.
Fig. 6: The alternative route of the integron cassette dissemination.

Similar content being viewed by others

Data availability

Fastq sequences of genomic cassette integration are publicly available in NCBI SRA (accession no. SRR23447848, SRR23447849 and SRR23447850). Bacterial genomes were directly downloaded from the NCBI RefSeq database (https://ftp.ncbi.nlm.nih.gov/genomes/refseq/). Source data are provided with this paper.

Code availability

All scripts used to quantify the ability of IntegronFinder 2.0 to detect SALINs are available at https://gitlab.pasteur.fr/hub/salin. The Nextflow pipeline used to analyse the raw fastq files and generate Fig. 4 and Extended Data Fig. 6 is available at https://gitlab.pasteur.fr/gmillot/14985_loot. The MarkDuplicates-Picard tool of GATK is available at https://github.com/broadinstitute/gatk. Bedtools used to extract sequence around integration sites is available at https://github.com/arq5x/bedtools2. IntegronFinder v.2.0.2 is available at https://github.com/gem-pasteur/Integron_Finder.

References

  1. Stokes, H. W. & Hall, R. M. A novel family of potentially mobile DNA elements encoding site-specific gene-integration functions: integrons. Mol. Microbiol. 3, 1669–1683 (1989).

    CAS  PubMed  Google Scholar 

  2. Escudero, J. A., Loot, C., Nivina, A. & Mazel, D. The integron: adaptation on demand. Microbiol. Spectr. 3, MDNA3-0019-2014 (2015).

    PubMed  Google Scholar 

  3. Cury, J., Jove, T., Touchon, M., Neron, B. & Rocha, E. P. Identification and analysis of integrons and cassette arrays in bacterial genomes. Nucleic Acids Res. 44, 4539–4550 (2016).

    CAS  PubMed  PubMed Central  Google Scholar 

  4. Neron, B. et al. IntegronFinder 2.0: Identification and analysis of integrons across bacteria, with a focus on antibiotic resistance in Klebsiella. Microorganisms https://doi.org/10.3390/microorganisms10040700 (2022).

  5. Buongermino Pereira, M. et al. A comprehensive survey of integron-associated genes present in metagenomes. BMC Genomics 21, 495 (2020).

    CAS  PubMed  PubMed Central  Google Scholar 

  6. Guerin, E. et al. The SOS response controls integron recombination. Science 324, 1034 (2009).

    CAS  PubMed  Google Scholar 

  7. Baharoglu, Z. & Mazel, D. Vibrio cholerae triggers SOS and mutagenesis in response to a wide range of antibiotics: a route towards multiresistance. Antimicrob. Agents Chemother. 55, 2438–2441 (2011).

    CAS  PubMed  PubMed Central  Google Scholar 

  8. Richard, E., Darracq, B., Loot, C. & Mazel, D. Unbridled integrons: a matter of host factors. Cells https://doi.org/10.3390/cells11060925 (2022).

  9. Bouvier, M., Demarre, G. & Mazel, D. Integron cassette insertion: a recombination process involving a folded single strand substrate. EMBO J. 24, 4356–4367 (2005).

    CAS  PubMed  PubMed Central  Google Scholar 

  10. Francia, M. V., de la Cruz, F. & Garcia Lobo, J. M. Secondary-sites for integration mediated by the Tn21 integrase. Mol. Microbiol. 10, 823–828 (1993).

    CAS  PubMed  Google Scholar 

  11. Recchia, G. D., Stokes, H. W. & Hall, R. M. Characterisation of specific and secondary recombination sites recognised by the integron DNA integrase. Nucleic Acids Res. 22, 2071–2078 (1994).

    CAS  PubMed  PubMed Central  Google Scholar 

  12. Recchia, G. D. & Hall, R. M. Plasmid evolution by acquisition of mobile gene cassettes: plasmid pIE723 contains the aadB gene cassette precisely inserted at a secondary site in the incQ plasmid RSF1010. Mol. Microbiol. 15, 179–187 (1995).

    CAS  PubMed  Google Scholar 

  13. Francia, M. V. & Garcia Lobo, J. M. Gene integration in the Escherichia coli chromosome mediated by Tn21 integrase (Int21). J. Bacteriol. 178, 894–898 (1996).

    CAS  PubMed  PubMed Central  Google Scholar 

  14. Segal, H. & Elisha, B. G. Identification and characterization of an aadB gene cassette at a secondary site in a plasmid from Acinetobacter. FEMS Microbiol. Lett. 153, 321–326 (1997).

    CAS  PubMed  Google Scholar 

  15. Segal, H., Francia, M. V., Lobo, J. M. & Elisha, G. Reconstruction of an active integron recombination site after integration of a gene cassette at a secondary site. Antimicrob. Agents Chemother. 43, 2538–2541 (1999).

    CAS  PubMed  PubMed Central  Google Scholar 

  16. Souque, C., Escudero, J. A. & MacLean, R. C. Off-target integron activity leads to rapid plasmid compensatory evolution in response to antibiotic selection pressure. mBio 14, e0253722 (2023).

    PubMed  Google Scholar 

  17. Loot, C., Bikard, D., Rachlin, A. & Mazel, D. Cellular pathways controlling integron cassette site folding. EMBO J. 29, 2623–2634 (2010).

    CAS  PubMed  PubMed Central  Google Scholar 

  18. Vit, C. et al. Cassette recruitment in the chromosomal Integron of Vibrio cholerae. Nucleic Acids Res. 49, 5654–5670 (2021).

    CAS  PubMed  PubMed Central  Google Scholar 

  19. Jove, T., Da Re, S., Denis, F., Mazel, D. & Ploy, M. C. Inverse correlation between promoter strength and excision activity in class 1 integrons. PLoS Genet. 6, e1000793 (2010).

    PubMed  PubMed Central  Google Scholar 

  20. Nivina, A., Escudero, J. A., Vit, C., Mazel, D. & Loot, C. Efficiency of integron cassette insertion in correct orientation is ensured by the interplay of the three unpaired features of attC recombination sites. Nucleic Acids Res. 44, 7792–7803 (2016).

    CAS  PubMed  PubMed Central  Google Scholar 

  21. Mazel, D., Dychinco, B., Webb, V. A. & Davies, J. A distinctive class of integron in the Vibrio cholerae genome. Science 280, 605–608 (1998).

    CAS  PubMed  Google Scholar 

  22. Hyatt, D. et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11, 119 (2010).

    PubMed  PubMed Central  Google Scholar 

  23. Omotajo, D., Tate, T., Cho, H. & Choudhary, M. Distribution and diversity of ribosome binding sites in prokaryotic genomes. BMC Genomics 16, 604 (2015).

    PubMed  PubMed Central  Google Scholar 

  24. Le Roux, F., Binesse, J., Saulnier, D. & Mazel, D. Construction of a Vibrio splendidus mutant lacking the metalloprotease gene vsm by use of a novel counterselectable suicide vector. Appl. Environ. Microbiol. 73, 777–784 (2007).

    PubMed  Google Scholar 

  25. Betton, J. M. Cloning vectors for expression-PCR products. Biotechniques 37, 346–347 (2004).

    CAS  PubMed  Google Scholar 

  26. Rousset, F. et al. Genome-wide CRISPR-dCas9 screens in E. coli identify essential genes and phage host factors. PLoS Genet. 14, e1007749 (2018).

    PubMed  PubMed Central  Google Scholar 

  27. Loot, C. et al. The integron integrase efficiently prevents the melting effect of Escherichia coli single-stranded DNA-binding protein on folded attC sites. J. Bacteriol. 196, 762–771 (2014).

    PubMed  PubMed Central  Google Scholar 

  28. Loot, C. et al. Differences in integron cassette excision dynamics shape a trade-off between evolvability and genetic capacitance. mBio https://doi.org/10.1128/mBio.02296-16 (2017).

  29. Rutkai, E., Dorgai, L., Sirot, R., Yagil, E. & Weisberg, R. A. Analysis of insertion into secondary attachment sites by phage lambda and by int mutants with altered recombination specificity. J. Mol. Biol. 329, 983–996 (2003).

    CAS  PubMed  Google Scholar 

  30. Tanouchi, Y. & Covert, M. W. Combining comprehensive analysis of off-site lambda phage integration with a CRISPR-based means of characterizing downstream physiology. mBio https://doi.org/10.1128/mBio.01038-17 (2017).

  31. Crozat, E., Fournes, F., Cornet, F., Hallet, B. & Rousseau, P. Resolution of multimeric forms of circular plasmids and chromosomes. Microbiol. Spectr. https://doi.org/10.1128/microbiolspec.PLAS-0025-2014 (2014).

  32. Kluyver, T. et al. Jupyter Notebooks—a publishing format for reproducible computational workflows. In Positioning and Power in Academic Publishing: Players, Agents and Agendas (eds Loizides, F. & Schmidt, B.) 87–90 (IOS Press, 2016).

  33. McKinney, W. Data structures for statistical computing in Python. Proc. 9th Python Sci. Conf. 445, 51–56 (2010).

    Google Scholar 

  34. Cock, P. J. et al. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics 25, 1422–1423 (2009).

    CAS  PubMed  PubMed Central  Google Scholar 

  35. Waskom, M. L. seaborn: statistical data visualization. J. Open Source Softw. 6, 3021 (2021).

    Google Scholar 

  36. Hug, L. A. et al. A new view of the tree of life. Nat. Microbiol. 1, 16048 (2016).

    CAS  PubMed  Google Scholar 

  37. Biskri, L., Bouvier, M., Guerout, A. M., Boisnard, S. & Mazel, D. Comparative study of class 1 integron and Vibrio cholerae superintegron integrase activities. J. Bacteriol. 187, 1740–1750 (2005).

    CAS  PubMed  PubMed Central  Google Scholar 

  38. Demarre, G., Frumerie, C., Gopaul, D. N. & Mazel, D. Identification of key structural determinants of the IntI1 integron integrase that influence attC x attI1 recombination efficiency. Nucleic Acids Res. 35, 6475–6489 (2007).

    CAS  PubMed  PubMed Central  Google Scholar 

  39. Jaskolska, M., Adams, D. W. & Blokesch, M. Two defence systems eliminate plasmids from seventh pandemic Vibrio cholerae. Nature 604, 323–329 (2022).

    CAS  PubMed  PubMed Central  Google Scholar 

  40. Demarre, G. et al. A new family of mobilizable suicide plasmids based on the broad host range R388 plasmid (IncW) or RP4 plasmid (IncPa) conjugative machineries and their cognate E. coli host strains. Res. Microbiol. 156, 245–255 (2005).

    CAS  PubMed  Google Scholar 

  41. Guzman, L. M., Belin, D., Carson, M. J. & Beckwith, J. Tight regulation, modulation, and high-level expression by vectors containing the arabinose PBAD promoter. J. Bacteriol. 177, 4121–4130 (1995).

    CAS  PubMed  PubMed Central  Google Scholar 

  42. Val, M. E., Skovgaard, O., Ducos-Galand, M., Bland, M. J. & Mazel, D. Genome engineering in Vibrio cholerae: a feasible approach to address biological issues. PLoS Genet. 8, e1002472 (2012).

    CAS  PubMed  PubMed Central  Google Scholar 

  43. Bouvier, M., Ducos-Galand, M., Loot, C., Bikard, D. & Mazel, D. Structural features of single-stranded integron cassette attC sites and their role in strand selection. PLoS Genet. 5, e1000632 (2009).

    PubMed  PubMed Central  Google Scholar 

  44. Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).

    CAS  PubMed  PubMed Central  Google Scholar 

  45. Wagih, O. ggseqlogo: a versatile R package for drawing sequence logos. Bioinformatics 33, 3645–3647 (2017).

    CAS  PubMed  Google Scholar 

  46. de Lemos Martins, F., Fournes, F., Mazzuoli, M. V., Mazel, D. & Val, M. E. Vibrio cholerae chromosome 2 copy number is controlled by the methylation-independent binding of its monomeric initiator to the chromosome 1 crtS site. Nucleic Acids Res. 46, 10145–10156 (2018).

    PubMed  PubMed Central  Google Scholar 

  47. Madic, J. et al. Three-color crystal digital PCR. Biomol. Detect. Quantif. 10, 34–46 (2016).

    CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We thank G. Macaux for experimental help; all the lab members for helpful discussion; M. Monot and L. Ma from the Biomics platform, C2RT, Institut Pasteur, Paris, France, supported by France Génomique (ANR-10-INBS-09) and IBISA. This work was supported by the Institut Pasteur, the Centre National de la Recherche Scientifique (CNRS-UMR 3525), the Fondation pour la Recherche Médicale (FRM Grant No. EQU202103012569; D.M.), ANR Chromintevol (ANR-21-CE12-0002-01; C.L.) and the French Government’s Investissement d’Avenir programme Laboratoire d’Excellence ‘Integrative Biology of Emerging Infectious Diseases’ (ANR-10-LABX-62-IBEID; D.M.).

Author information

Authors and Affiliations

Authors

Contributions

C.L. and D.M. designed the research. C.L., E.R., C.V., B.D., D.L., V.P., F.L. and T.N. performed the experiments. G.A.M. and F.L. performed the computational analysis of deep sequencing data. E.L., J.C., B.N. and E.P.C.R. performed the bioinformatics genomics analysis. C.L. and G.A.M. wrote the draft of the manuscript. All authors read, amended and approved the final version of the manuscript.

Corresponding author

Correspondence to Céline Loot.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Microbiology thanks Sandra Da Re, Louis-Marie Bobay and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Random PCR approach used to determine the genome integration sites.

The genome integrated attC cassette is represented by a light grey arrow (coding sequence) followed by a dark grey triangle (attC). The red arrow indicates the integration site to determine. Primers used for the PCR amplification and sequencing are shown. Electrophoresis analysis of PCR products is shown. MW: Molecular Weight Marker, kb: kilobases.

Source data

Extended Data Fig. 2 Cassette recombination events mediated by IntI2 and IntI3 integrases.

The graph represents the recombination frequencies of attCaadA7 donor sites into attI and attCereA2 sites mediated by IntI2 and IntI3 integrases. The expressed integrases (IntI2 or IntI3) are indicated above the graph and the receptor sites in the axis-x legends. Bar charts show the mean of three independent experiments (n = 3, individual plots) and error bars show the standard deviation. Statistical comparisons (Student’s t test) are as follow: na. (not applicable); Grey asterisk (*) indicates the recombination frequency was below detection level.

Source data

Extended Data Fig. 3 Cassette integration events in genome in presence of an integron.

a, Experimental setup of the cassette integration assay (as in Fig. 3a). b, Electrophoresis migration of PCR products. Several PCR products are sequenced and the integration sites are indicated (att sites and gene location). c, Recombination frequencies of attCaadA7 donor sites into the att sites and genome. Integration sites are indicated in the axis-x legends. Bar charts show the mean of four independent experiments (n = 4, individual plots) and error bars show the standard deviation. Statistical comparisons (Student’s t test) are as follow: na. (not applicable); Grey asterisk (*) indicates the recombination frequency was below detection level. ts: thermosensitive; MW: Molecular Weight Marker, kb: kilobases.

Source data

Extended Data Fig. 4 Excision events observed for two genome-integrated cassettes.

a, Excisions occurring at the integration point (1) and reconstituting the mltA and ilvD genes are shown. Excisions occurring outside the integration point (2) inducing either genome deletion (excision point located in the genome) or insertion (excision point located in the integrated cassette) are shown. b, Sequences of the mltA and ilvD integration sites and of excision sites are shown. All the sequenced excision events (that is 64) are shown. Sizes in bps of each induced genome deletion (-) and insertion (+) are indicated. Cassette sequences are highlighted in grey and cleavage points are indicated by head of arrows. bps: base pairs.

Extended Data Fig. 5 Library construction for Deep sequencing.

All the steps of the library construction are indicated. Adaptors are represented by dark and light green rectangles. attC cassettes are represented by light grey rectangles (coding sequence) followed by dark grey triangles (attC).

Extended Data Fig. 6 Computational analysis of Deep sequencing data.

The figure shows the previous results obtained with IntI1 grouped with the new results obtained with IntI2 and IntI3. The legend is the same as in Fig. 4. The red boxes show the ybhO hotspot sites.

Extended Data Fig. 7 Digital PCR analysis of the oriC copy number relative to terC in E. coli recipient strain during a conjugation mimicking assay.

DNA extraction and PCR were performed at 0, 30, 60, 180 minutes (min) and at an overnight (O/N) time of incubation after the beginning of the conjugation. Bar charts show the mean of three independent experiments (n = 3, individual plots) and error bars show the standard deviation. Grey asterisk (*) indicates the recombination frequency was below detection level.

Source data

Extended Data Fig. 8 Analysis of the attG integration site motifs.

a, Consensus sequence of the six highest hotspot integration sites. A total of 20 bases around the cleavage point was inputted into the WebLogo program (http://weblogo.berkeley.edu/) to generate the motif. The cleavage occurs between the -1 and 1 bases. Bits refers to the information content. b, Recombination frequencies of attCaadA7 donor sites into the mutated ybhO hotspot and attI1 sites. The 4 nucleotide sequences indicated in the axis-x legends correspond to the position 6 to 9 given that position -1 corresponds to the G of the cleaved triplet site. The wt ybhO and attI1 sites are indicated in red at the top of the bars. Bar charts show the mean of three independent experiments (n = 3, individual plots) except for ybhO wt CAGC (with IntI1) for which n = 8. Error bars show the standard deviation. Statistical comparisons (Student’s t test) are as follow: ns. (not significant); all two-sided. Grey asterisk (*) indicates the recombination frequency was below detection level.

Source data

Extended Data Fig. 9 Determination of the attG recombination nature.

a, Single or double cleavage of attG sites?. Double cleavage: we expect heterogeneity of products due to the repair process. Single cleavage: we expect homogeneity of products due to the replication process. b, Bottom or top strand cleavage of attG sites?. Bottom strand cleavage: we expect an orientation + (light brown arrow) and a functional attC site. Top strand cleavage: we expect an orientation - (dark brown arrow) and a non-functional attC site. attG containing replicons are represented by dark lines and attC cassettes by grey ones. The precise cleavage point is indicated by a red arrow. ss and ds: single and double-stranded; bs: bottom strand.

Supplementary information

Reporting Summary

Peer Review File

Supplementary Table 1

Strains, transformed strains, plasmids and primers used in the present study.

Source data

Source Data Fig. 2

IntegronFinder data.

Source Data Fig. 3

Conjugation and excision assay results used for statistical analysis.

Source Data Fig. 5

Conjugation assay results used for statistical analysis.

Source Data Extended Data Fig. 2

Conjugation assay results used for statistical analysis.

Source Data Extended Data Fig. 3

Conjugation assay results used for statistical analysis.

Source Data Extended Data Fig. 7

Digital PCR data.

Source Data Extended Data Fig. 8

Conjugation assay results used for statistical analysis.

Unprocessed gels

Unprocessed gels from Extended Data Figs. 1 and 3.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Loot, C., Millot, G.A., Richard, E. et al. Integron cassettes integrate into bacterial genomes via widespread non-classical attG sites. Nat Microbiol 9, 228–240 (2024). https://doi.org/10.1038/s41564-023-01548-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/s41564-023-01548-y

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing