Abstract
Bacteria and archaea insert spacer sequences acquired from foreign DNAs into CRISPR loci to generate immunological memory. The Escherichia coli Cas1–Cas2 complex mediates spacer acquisition in vivo, but the molecular mechanism of this process is unknown. Here we show that the purified Cas1–Cas2 complex integrates oligonucleotide DNA substrates into acceptor DNA to yield products similar to those generated by retroviral integrases and transposases. Cas1 is the catalytic subunit and Cas2 substantially increases integration activity. Protospacer DNA with free 3′-OH ends and supercoiled target DNA are required, and integration occurs preferentially at the ends of CRISPR repeats and at sequences adjacent to cruciform structures abutting AT-rich regions, similar to the CRISPR leader sequence. Our results demonstrate the Cas1–Cas2 complex to be the minimal machinery that catalyses spacer DNA acquisition and explain the significance of CRISPR repeats in providing sequence and structural specificity for Cas1–Cas2-mediated adaptive immunity.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 51 print issues and online access
$199.00 per year
only $3.90 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
References
Barrangou, R. et al. CRISPR provides acquired resistance against viruses in prokaryotes. Science 315, 1709–1712 (2007)
van der Oost, J., Westra, E. R., Jackson, R. N. & Wiedenheft, B. Unravelling the structural and mechanistic basis of CRISPR–Cas systems. Nature Rev. Microbiol. 12, 479–492 (2014)
Mojica, F. J., Diez-Villasenor, C., Garcia-Martinez, J. & Soria, E. Intervening sequences of regularly spaced prokaryotic repeats derive from foreign genetic elements. J. Mol. Evol. 60, 174–182 (2005)
Bolotin, A., Quinquis, B., Sorokin, A. & Ehrlich, S. D. Clustered regularly interspaced short palindrome repeats (CRISPRs) have spacers of extrachromosomal origin. Microbiology 151, 2551–2561 (2005)
Pourcel, C., Salvignol, G. & Vergnaud, G. CRISPR elements in Yersinia pestis acquire new repeats by preferential uptake of bacteriophage DNA, and provide additional tools for evolutionary studies. Microbiology 151, 653–663 (2005)
Stern, A., Keren, L., Wurtzel, O., Amitai, G. & Sorek, R. Self-targeting by CRISPR: gene regulation or autoimmunity? Trends in Genet. 26, 335–340 (2010)
Carte, J., Wang, R., Li, H., Terns, R. M. & Terns, M. P. Cas6 is an endoribonuclease that generates guide RNAs for invader defense in prokaryotes. Genes Dev. 22, 3489–3496 (2008)
Haurwitz, R. E., Jinek, M., Wiedenheft, B., Zhou, K. & Doudna, J. A. Sequence- and structure-specific RNA processing by a CRISPR endonuclease. Science 329, 1355–1358 (2010)
Deltcheva, E. et al. CRISPR RNA maturation by trans-encoded small RNA and host factor RNase III. Nature 471, 602–607 (2011)
Brouns, S. J. et al. Small CRISPR RNAs guide antiviral defense in prokaryotes. Science 321, 960–964 (2008)
Garneau, J. E. et al. The CRISPR/Cas bacterial immune system cleaves bacteriophage and plasmid DNA. Nature 468, 67–71 (2010)
Jinek, M. et al. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science 337, 816–821 (2012)
Yosef, I., Goren, M. G. & Qimron, U. Proteins and DNA elements essential for the CRISPR adaptation process in Escherichia coli. Nucleic Acids Res. 40, 5569–5576 (2012)
Datsenko, K. A. et al. Molecular memory of prior infections activates the CRISPR/Cas adaptive bacterial immunity system. Nature Commun. 3, 945 (2012)
Swarts, D. C., Mosterd, C., van Passel, M. W. & Brouns, S. J. CRISPR interference directs strand specific spacer acquisition. PLoS ONE 7, e35888 (2012)
Nuñez, J. K. et al. Cas1–Cas2 complex formation mediates spacer acquisition during CRISPR–Cas adaptive immunity. Nature Struct. Mol. Biol. 21, 528–534 (2014)
Wiedenheft, B. et al. Structural basis for DNase activity of a conserved protein implicated in CRISPR-mediated genome defense. Structure 17, 904–912 (2009)
Babu, M. et al. A dual function of the CRISPR–Cas system in bacterial antivirus immunity and DNA repair. Mol. Microbiol. 79, 484–502 (2011)
Kim, T. Y., Shin, M., Huynh Thi Yen, L. & Kim, J. S. Crystal structure of Cas1 from Archaeoglobus fulgidus and characterization of its nucleolytic activity. Biochem. Biophys. Res. Commun. 441, 720–725 (2013)
Beloglazova, N. et al. A novel family of sequence-specific endoribonucleases associated with the clustered regularly interspaced short palindromic repeats. J. Biol. Chem. 283, 20361–20371 (2008)
Samai, P., Smith, P. & Shuman, S. Structure of a CRISPR-associated protein Cas2 from Desulfovibrio vulgaris. Acta Crystallogr. Sect. F Struct. Biol. Cryst. Commun. 66, 1552–1556 (2010)
Nam, K. H. et al. Double-stranded endonuclease activity in Bacillus halodurans clustered regularly interspaced short palindromic repeats (CRISPR)-associated Cas2 protein. J. Biol. Chem. 287, 35943–35952 (2012)
Li, M. & Craigie, R. Processing of viral DNA ends channels the HIV-1 integration reaction to concerted integration. J. Biol. Chem. 280, 29334–29339 (2005)
Cherepanov, P. LEDGF/p75 interacts with divergent lentiviral integrases and modulates their enzymatic activity in vitro. Nucleic Acids Res. 35, 113–124 (2007)
Hare, S. et al. A novel co-crystal structure affords the design of gain-of-function lentiviral integrase mutants in the presence of modified PSIP1/LEDGF/p75. PLoS Pathog. 5, e1000259 (2009)
Yang, J. Y., Jayaram, M. & Harshey, R. M. Positional information within the Mu transposase tetramer: catalytic contributions of individual monomers. Cell 85, 447–455 (1996)
Dinardo, S., Voelkel, K. A., Sternglanz, R., Reynolds, A. E. & Wright, A. Escherichia coli DNA topoisomerase I mutants have compensatory mutations in DNA gyrase genes. Cell 31, 43–51 (1982)
Pruss, G. J., Manes, S. H. & Drlica, K. Escherichia coli DNA topoisomerase I mutants: increased supercoiling is corrected by mutations near gyrase genes. Cell 31, 35–42 (1982)
Chow, S. A., Vincent, K. A., Ellison, V. & Brown, P. O. Reversal of integration and DNA splicing mediated by integrase of human immunodeficiency virus. Science 255, 723–726 (1992)
Au, T. K., Pathania, S. & Harshey, R. M. True reversal of Mu integration. EMBO J. 23, 3408–3420 (2004)
Engelman, A., Mizuuchi, K. & Craigie, R. HIV-1 DNA integration: mechanism of viral DNA cleavage and DNA strand transfer. Cell 67, 1211–1221 (1991)
Mizuuchi, K. & Adzuma, K. Inversion of the phosphate chirality at the target site of Mu DNA strand transfer: evidence for a one-step transesterification mechanism. Cell 66, 129–140 (1991)
Curcio, M. J. & Derbyshire, K. M. The outs and ins of transposition: from mu to kangaroo. Nature Rev. Mol. Cell Biol. 4, 865–877 (2003)
Arslan, Z., Hermanns, V., Wurm, R., Wagner, R. & Pul, U. Detection and characterization of spacer integration intermediates in type I-E CRISPR-Cas system. Nucleic Acids Res. 42, 7884–7893 (2014)
Tyson, G. W. & Banfield, J. F. Rapidly evolving CRISPRs implicated in acquired resistance of microorganisms to viruses. Environ. Microbiol. 10, 200–207 (2008)
Sheflin, L. G. & Kowalski, D. Altered DNA conformations detected by mung bean nuclease occur in promoter and terminator regions of supercoiled pBR322 DNA. Nucleic Acids Res. 13, 6137–6154 (1985)
Goren, M. G., Yosef, I., Auster, O. & Qimron, U. Experimental definition of a clustered regularly interspaced short palindromic duplicon in Escherichia coli. J. Mol. Biol. 423, 14–16 (2012)
Savitskaya, E., Semenova, E., Dedkov, V., Metlitskaya, A. & Severinov, K. High-throughput analysis of type I-E CRISPR/Cas spacer acquisition in E. coli. RNA Biol. 10, 716–725 (2013)
Shmakov, S. et al. Pervasive generation of oppositely oriented spacers during CRISPR adaptation. Nucleic Acids Res. 42, 5907–5916 (2014)
Deveau, H. et al. Phage response to CRISPR-encoded resistance in Streptococcus thermophilus. J. Bacteriol. 190, 1390–1400 (2008)
Semenova, E. et al. Interference by clustered regularly interspaced short palindromic repeat (CRISPR) RNA is governed by a seed sequence. Proc. Natl Acad. Sci. USA 108, 10098–10103 (2011)
Westra, E. R. et al. Type I-E CRISPR-cas systems discriminate target from non-target DNA through base pairing-independent PAM recognition. PLoS Genet. 9, e1003742 (2013)
Craigie, R. & Bushman, F. D. HIV DNA integration. Cold Spring Harbor Perspect. Med. 2, a006890 (2012)
Nowotny, M. Retroviral integrase superfamily: the structural perspective. EMBO Rep. 10, 144–151 (2009)
Hochstrasser, M. L. & Doudna, J. A. Cutting it close: CRISPR-associated endoribonuclease structure and function. Trends Biochem. Sci. 40, 58–66 (2015)
Paleček, E. Local supercoil-stabilized DNA structures. Crit. Rev. Biochem. Mol. Biol. 26, 151–226 (1991)
Engelman, A. & Craigie, R. Efficient magnesium-dependent human immunodeficiency virus type 1 integrase activity. J. Virol. 69, 5908–5911 (1995)
Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009)
Crooks, G. E., Hon, G., Chandonia, J. M. & Brenner, S. E. WebLogo: a sequence logo generator. Genome Res. 14, 1188–1190 (2004)
Acknowledgements
We are grateful to M. Chung, P. J. Kranzusch and A.V. Wright for technical assistance and members of the Doudna laboratory and J. Cate for discussions. This project was funded by US National Science Foundation grant no. 1244557 to J.A.D. and by NIH grant AI070042 to A.E. This work used the Vincent J. Coates Genomics Sequencing Laboratory at UC Berkeley, supported by NIH S10 Instrumentation Grants S10RR029668 and S10RR027303. J.K.N. is supported by a US National Science Foundation Graduate Research Fellowship and a UC Berkeley Chancellor’s Graduate Fellowship. A.S.Y.L. is supported as an American Cancer Society Postdoctoral Fellow (PF-14-108-01-RMC). J.A.D. is an Investigator of the Howard Hughes Medical Institute and a member of the Center for RNA Systems Biology.
Author information
Authors and Affiliations
Contributions
J.K.N. performed the biochemical experiments. A.S.Y.L. processed and analysed the high-throughput sequencing data. J.K.N., A.S.Y.L., A.E. and J.A.D. designed the study, analysed the data and wrote the manuscript.
Corresponding author
Ethics declarations
Competing interests
J.A.D. and J.K.N. have filed a related patent application.
Extended data figures and tables
Extended Data Figure 1 The integration reaction is dependent on the presence of protospacers, low salt and divalent metal ions.
a, In vitro integration assay alongside EcoRI- and Nb.BbvCI nickase-treated pCRISPR. b, Salt-dependence assay using Cas1 or Cas2 only and Cas1+Cas2. The titration corresponds to 0, 25, 50, 100 and 200 nM KCl, in addition to the salt carried in from the reaction reagents. c, Integration assays in the presence of 10 mM EDTA, Mg2+, Mn2+ or no additive. d, Integration assays with increasing protospacer concentrations. e, A comparison of post-reaction treatments as indicated. The data presented in a–e are representative of at least three replicates.
Extended Data Figure 2 Cas1 requires Cas2 for robust protospacer integration.
a, Schematic of the integration assays using 32P-labelled protospacers (PDB code 4P6I for Cas1–Cas2). b, Integration assays in the presence of increasing protein and 10 mM MnCl2. The titration corresponds to 0, 50, 100 and 200 nM protein. c, Same as b except in the presence of 10 mM MgCl2. The data presented in b and c are representative of at least three replicates.
Extended Data Figure 3 The catalytic activity of Cas1 is required for integration.
a, Close-up view of the Cas1 active site with the conserved residues shown in stick configurations (PDB 4P6I). b, Integration assays of purified Cas1 active site mutants complexed with wild-type Cas2. c, The same as b except using radiolabelled protospacers. The data presented in b and c are representative of at least three replicates.
Extended Data Figure 4 Band X corresponds to topoisomers of pCRISPR.
a, Agarose gel of purified relaxed and band X integration products. b, Analysis of the total reaction products, after phenol chloroform extraction and ethanol precipitation, on a pre-stained agarose gel. c, Same as b except ethidium bromide staining was performed after electrophoresis. d, PCR amplification products of various segments of pCRISPR using the relaxed, band X or pCRISPR template shown in a. The laddering effect of minor products using CRISPR locus primers likely reflects the propensity of CRISPR repeats to form DNA hairpins. The data presented in a–d are representative of at least three replicates.
Extended Data Figure 5 Cas1 catalyses the disintegration of half-site integrated protospacers.
a, Schematic of the four strands constituting the Y DNA substrate used in the disintegration assays. b, Native polyacrylamide gel analysis of the annealing products with either strand A or strand C radiolabelled. c, Native polyacrylamide gel analysis of disintegration assay products using Y DNA substrates with strand A labelled. d, Denaturing gel analysis of the disintegration assay products with strand A labelled.
Extended Data Figure 6 Cas1–Cas2 can integrate various lengths of double-stranded DNA with blunt- or 3′-overhang ends into a supercoiled target plasmid.
a, Integration assays using the indicated lengths of protospacer DNA. b, Integration assays using varying 5′ or 3′ overhang lengths. c, d, A comparison of integration assays using pCRISPR or Nb.BbvCI-nicked pCRISPR target. e, Integration assay using different target plasmids with or without a CRISPR locus. The green arrows correspond to the relaxed product of each target and the cyan arrows correspond to the band X product. The data presented in a–e are representative of at least three replicates.
Extended Data Figure 7 Cas1 tyrosine mutants support integration activity in vitro.
a, A close-up of the Cas1 active site with the tyrosine residues labelled in blue. b, Structure-based sequence alignment of Cas1 proteins, highlighting the tyrosine residues mutated to alanine in this study. c, Radiolabelled protospacer integration assay of Cas1 tyrosine mutants complexed with wild-type Cas2. The gel presented in c is representative of at least three replicates.
Extended Data Figure 8 High-throughput sequencing of integration products reveals sequence-specific integration.
a, Schematic of the workflow for high-throughput sequencing analysis of the integration sites. b, Raw map of the total reads along pCRISPR before collapsing into single peaks of protospacer–pCRISPR junctions depicted in Fig. 4. c, Same as b, except for the pUC19 target. d, Sequence of the leader-end of the CRISPR locus in E. coli. e, f, WebLogo analysis from the −5 to +5 positions surrounding the protospacer integration sites on the plus (e) and minus (f) of pCRISPR. The arrow points to the nucleotide that is covalently joined to the protospacer. g, h, Same as e, f, except for the pUC19 target.
Extended Data Figure 9 Cas1–Cas2 correctly orients the protospacer DNA during integration.
a–f, Mapped integration sites along the CRISPR locus of pCRISPR when using protospacer DNA with nucleotide ends ‘wild-type’ 3′ C and 3′ T (a), 3′ A and 3′ T (c), and 3′ C and 3′ C (e). The red arrow in c and e points to the nucleotide change in the protospacer DNA compared to the ‘wild-type’ sequence in a. The protospacer DNA 3′ nucleotide and the CRISPR locus strand biases in a, c, e are plotted in b, d and f, respectively, as percentages of integration events within the CRISPR locus. The black and clear bars represent the (−) and (+) strands of the CRISPR locus, respectively. NS corresponds to not significant and P < 0.0001 by chi-square test. The n values for b, d and f are 5,623, 5,685 and 12,453 reads along the CRISPR locus, respectively.
Extended Data Figure 10 Model of the CRISPR–Cas adaptive immunity pathway in E. coli.
Mature double-stranded protospacers bearing a 3′ C-OH are site-specifically integrated into the leader-end of the CRISPR locus. Correct protospacer integration (left) results in the 5′G/3′C as the first nucleotide of the spacer, proximal to the leader. After transcription of the CRISPR locus and subsequent crRNA processing, foreign DNA destruction is initiated by strand-specific recognition of the 3′-TTC-5′ PAM sequence in the target strand by the crRNA-guided Cascade complex. Incorrect protospacer integration (right) cannot initiate foreign DNA destruction due to the inability for the crRNA to recognize the strand with the 3′-TTC-5′ PAM. Thus, foreign DNA interference during CRISPR–Cas adaptive immunity relies on the Cas1–Cas2 complex for correctly orienting the protospacer during integration.
Rights and permissions
About this article
Cite this article
Nuñez, J., Lee, A., Engelman, A. et al. Integrase-mediated spacer acquisition during CRISPR–Cas adaptive immunity. Nature 519, 193–198 (2015). https://doi.org/10.1038/nature14237
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/nature14237
This article is cited by
-
A genomic catalogue of soil microbiomes boosts mining of biodiversity and genetic resources
Nature Communications (2023)
-
Soil viral diversity, ecology and climate change
Nature Reviews Microbiology (2023)
-
Histones direct site-specific CRISPR spacer acquisition in model archaeon
Nature Microbiology (2023)
-
Genetic advancements in obesity management and CRISPR–Cas9-based gene editing system
Molecular and Cellular Biochemistry (2023)
-
Alternative functions of CRISPR–Cas systems in the evolutionary arms race
Nature Reviews Microbiology (2022)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.