A hallmark of type I CRISPR–Cas systems is the presence of Cas3, which contains both the nuclease and helicase activities required for DNA cleavage during interference. In subtype I-D systems, however, the histidine-aspartate (HD) nuclease domain is encoded as part of a Cas10-like large effector complex subunit and the helicase activity in a separate Cas3’ subunit, but the functional and mechanistic consequences of this organisation are not currently understood. Here we show that the Sulfolobus islandicus type I-D Cas10d large subunit exhibits an unusual domain architecture consisting of a Cas3-like HD nuclease domain fused to a degenerate polymerase fold and a C-terminal domain structurally similar to Cas11. Crystal structures of Cas10d both in isolation and bound to S. islandicus rod-shaped virus 3 AcrID1 reveal that the anti-CRISPR protein sequesters the large subunit in a non-functional state unable to form a cleavage-competent effector complex. The architecture of Cas10d suggests that the type I-D effector complex is similar to those found in type III CRISPR–Cas systems and that this feature is specifically exploited by phages for anti-CRISPR defence.
CRISPR–Cas (clustered regularly interspaced short palindromic repeats–CRISPR-associated proteins)1 systems are present in most bacteria and archaea2 and provide sequence-specific protection against invading DNA or RNA3,4,5,6,7,8,9. CRISPR–Cas immunity is obtained through a three-step process10,11 consisting of adaptation (also known as spacer acquisition), involving incorporation of fragments of invading DNA or RNA into genomic CRISPR arrays; expression, transcription and maturation of CRISPR-RNAs (crRNAs) carrying a sequence complementary to the invading nucleic acids and assembly of the Cas–crRNA surveillance complex; and finally, interference, cleavage of foreign nucleic acids that have been encountered before by a Cas–crRNA effector complex3,11. CRISPR–Cas systems have been divided into two main classes depending on whether their effector complexes consist of multiple (Class 1) or a single (Cas9, Class 2) protein, six types (I–VI) based on the configuration of the signature Cas proteins, and finally 33 subtypes on the basis of functional and mechanistic features12. Type I (Class 1) CRISPR–Cas systems encode a large multi-subunit effector complex termed Cascade (CRISPR-associated complex for antiviral defence), which also includes a crRNA that recognises the invading DNA sequence during the interference stage. Formation of a DNA–RNA heteroduplex (an R-loop) inside the effector complex then triggers recruitment of stand-alone helicase and nuclease activities, which unwind and cleave DNA, respectively. In most type I systems, both these functions are carried out by the signature protein Cas32, which contains an N-terminal metal-dependent histidine-aspartate (HD) nuclease domain, responsible for endo-nucleolytic cleavage of the target, fused to an ATP-dependent and single-stranded DNA (ssDNA)-stimulated superfamily-2 (SF2) helicase (Supplementary Fig. 1a)13,14,15,16. However, in type I-A and I-D systems, the activities of Cas3 are carried out by separate proteins called Cas3’ (helicase) and Cas3” (nuclease). Type III CRISPR–Cas systems also have multi-protein effector complexes for which the large subunits are members of the Cas10 family2. Cas10 proteins contain an N-terminal HD nuclease domain that is evolutionarily distinct from Cas3”, two RNA recognition motif (RRM) domains homologous to those found in polymerases and cyclases, and a helical Thumb/C-terminal domain (Supplementary Fig. 1b)17. Consequently, type III effector complexes are intrinsic nucleases and do not need to recruit additional proteins for activity.
Type I-D CRISPR–Cas systems, which are found both in bacteria and archaea, are exceptional in that they contain a hybrid large subunit, Cas10d, bearing resemblance to both the signature proteins found in type I (Cas3) and type III (Cas10)18. Cas10d thus contains an N-terminal HD domain similar to those in type I Cas3 and Cas3” proteins, which is distinguished by lacking a circular permutation found in type III large subunit HD nucleases that results in a conserved His residue shifting from the N to the C terminal end of the domain2. The Cas10d HD domain is fused to a predicted inactivated polymerase consisting of two RRM domains reminiscent of those found in type III Cas10 subunits19. Recently, it was shown that an archaeal type I-D system from Sulfolobus islandicus is capable of cleaving both double-stranded DNA (dsDNA) and, remarkably, also ssDNA, which could be targeting rudivirus (SIRV) replicative intermediates20. dsDNA cleavage is catalysed by the Cas10d HD domain and in the presence of the Cas3’ helicase and ATP, multiple cleavage events take place on both DNA strands. ssDNA cleavage, on the other hand, appears to follow a 6-nucleotide (nt) ruler-like mechanism and is catalysed by the backbone subunit Csc2 (Cas7), in a way reminiscent of the protospacer RNA transcript cleavage observed in type III systems20. Here, we present the crystal structure of the Cas10d large effector complex subunit from the Sulfolobus islandicus I-D system, both in isolation and in complex with the Sulfolobus islandicus rod-shaped virus 3 (SIRV3) anti-CRISPR protein, AcrID1. The structures reveal an unusual domain architecture of the large subunit, which supports that the effector complex is similar to those found in type III CRISPR–Cas systems. Moreover, we show that binding of the anti-CRISPR protein sequesters Cas10d in a non-functional state, in which it cannot engage in interference. Taken together, our data provide a framework for understanding essential mechanistic features that distinguish type I-D from other type I and type III CRISPR–Cas subtypes.
S. islandicus Cas10d and SIRV3 AcrID1 form a circular higher-order complex
The crenarchaeon Sulfolobus islandicus LAL14/1 carries one subtype I-D CRISPR–Cas locus containing spacers matching the genome of the lytic rod-shaped virus, Sulfolobus islandicus rod-shaped virus 2 (SIRV2), for which it is a laboratory host21. To counter the CRISPR–Cas immunity, SIRV2 on its part expresses a 12.2 kDa anti-CRISPR (Acr) protein, AcrID1, which is conserved in many archaeal viruses including Sulfolobus islandicus rod-shaped virus 3 (SIRV3)22. SIRV3 AcrID1 was shown to inhibit type I-D immunity in S. islandicus through direct binding to the large subunit of the effector complex, Cas10d23. To understand the molecular architecture of the unique type I-D system as well as mechanism of inhibition by AcrID1, we reconstituted the Cas10d–AcrID1 complex in vitro and determined the crystal structure to 3.5 Å resolution by single-wavelength anomalous dispersion (SAD) phasing using Se–Met substitution (Supplementary Table 1). In the complex with Cas10d, AcrID1 forms a dimer as observed in the crystal structure of the isolated anti-CRISPR protein23, and makes extensive contacts with all major domains of Cas10d (Fig. 1a). The net result of these interactions is that two AcrID1 dimers sequester two Cas10d subunits into a large, circular 4:2 complex (Fig. 1b). The buried surface areas between the AcrID1 dimer and the two molecules of Cas10d constitute approximately 930 and 880 Å2 at the two distinct sites of interaction, and size exclusion chromatography supports that this higher-order complex represents the biologically relevant form23.
Cas10d has a unique domain architecture
As predicted from comparative sequence analysis, the 847 residue (94 kDa) S. islandicus Cas10d consists of six domains: A HD nuclease domain (1–186), a Cyclase/RRM domain (187–425), a zinc finger domain (426–490), a Palm/RRM domain (491–623), and a C-terminal helical region consisting of a polymerase-like Thumb domain (624–719) and an extension with predicted homology to Cas11 (720–847) (Figs. 1a, 2a and Supplementary Fig. 1b)12. The six domains are arranged in a right-angled wedge with the Cyclase/RRM and Cas11 domains at the extremes. The overall structure of the Cas10d subunit and relative domain organisation is unique when compared to the closest type III Cas10 orthologues for which structures are available (type III-A Csm1 and type III-B Cmr2, Supplementary Fig. 2)24,25. Most noteworthy is the fact that the Cyclase/RRM domain forms an extended structure that protrudes away from the core of the protein making the protein significantly more open than known type III Cas10 proteins (Fig. 2a and Supplementary Fig. 2).
Cas10d contains a hybrid HD nuclease domain
The HD domain is a general divalent metal-dependent hydrolytic phosphodiesterase domain26, which is configured to function as an endo and/or exo-nuclease in CRISPR–Cas interference14,15,24,27,28,29. HD nuclease domains are found in a wide range of nucleic acid enzymes and consist of a conserved core of 6–7 helices that organise the residues of the active site16. Outside this core motif, the structures vary considerably. The HD domain found in Cas10d appears to be a minimalised version of the core motif found in other type I systems, such as Thermus thermophilus Cas3 (Cas3Tth) and Thermobifida fusca Cas3 (Cas3Tfu, Fig. 2b)15,16. Compared to Cas3, Cas10d lacks two helices (α7 and α8), but compensates with an N-terminal extension (residues 1-26), part of which forms a helix (α1, residues 14–24) that spatially overlaps with α8 in Cas3Tth despite running in the opposite direction. Interestingly, this N-terminal topology is also found in type I-A Cas3” proteins, such as Methanocaldococcus jannaschii Cas3 (Cas3Mja, Supplementary Fig. 3a), which are isolated nuclease subunits30. The Cas10d HD domain thus appears to represent a structural hybrid between Cas3” (α1–α3, residues 13–59) and Cas3 (α2–α6, residues 26–134 and α8, residues 139–157). Finally, at its C-terminal end (residues 158–186), the Cas10d HD domain contains two additional helices (α9–α10), which spatially overlap with a helix and a short beta-hairpin in full length Cas3Tth (Supplementary Figs. 3b and 4).
The Cas10d HD domain binds a single metal ion
The HD nuclease domain is defined by the presence of three conserved sequence motifs, I, II, and V that each contain metal-coordinating His and Asp residues. Two of these are always adjacent in the sequence of motif II and have given rise to the name of the domain (HD)26. Interestingly, HD nucleases are generally not active with Mg2+, but require one or more transition metal ions (Mn2+, Ni2+, Cu2+, or Fe2+) for activity16. In the active site of the Cas3Tth HD domain, two His, two Asp, and a Ser residue coordinate a single divalent metal ion16. Other Cas3 HD domains, such as the one found in Cas3Tfu, have an extended set of residues that includes two additional histidines that come together to coordinate two ions (Fig. 2b, left)14. Structural and sequence alignment with Cas10d reveals that while the core HD domain active site is very similar to that found in Cas3Tth, the additional residues that coordinate the second metal ion (His115, His149, and His150 in Cas3Tfu) are not present, suggesting that this HD domain only binds a single (divalent) cation (Fig. 2b, right, and Supplementary Fig. 4). This difference likely affects catalysis and/or specificity of the Cas10d HD domain but since neither ions nor substrate are present in this structure, the final conclusion must await further analysis.
Cas10d contains a degenerate nucleotidyl transferase fold
Like type III Cas10 proteins, such as Csm1 and Cmr2, Cas10d contains two domains following the HD domain with ferredoxin/RRM-like folds, which we have termed the Cyclase/RRM and Palm/RRM domains based on their predicted homology to nucleotidyl transferases31. The first of these RRM domains (residues 187–425) maintains the interleaved βαβ-βαβ structure characteristic of an RRM domain but has two very short β-strands and appears partially degenerate (Fig. 2a, dark green and Supplementary Fig. 5a). Moreover, the helix of the last βαβ motif contains a large insertion (residues 230–407) forming a separate, mainly helical, domain of nearly 150 residues that extends away from the core of the protein (Fig. 2a, green). Between the two RRM domains, there is a zinc finger domain (residues 426–490) involving cysteines 429, 432, 478, and 481, which come together to tightly interact with a Zn2+ ion, similarly to what was observed for P. furiosus Cmr2/Cas10 (type III-B, Supplementary Fig. 2)24. The second RRM domain (Palm/RRM, residues 491–623) is more canonical except for a small β-hairpin insertion surrounding the helix of the first βαβ RRM motif (Supplementary Fig. 5b). This is highly reminiscent of the Palm and Fingers domains of some polymerases, such as prokaryotic DNA polymerase IV (Supplementary Fig. 5c)32. Due to the extended conformation of the Cyclase/RRM domain, the Palm/Fingers β-sheet structures in Cas10d appear more exposed compared to other large subunit structures where they are sandwiched between α-helices14,24,28,29. Interestingly, the Cas10d Palm/RRM domain is shorter than that found in many type III Cas10 proteins (e.g. Cmr2) and does not have the conserved β-hairpin motif (GGDD) nor the P-loop required for ATP binding, both of which are conserved in the Cmr2 and Csm1 type III subunits (Supplementary Fig. 6)24,25. Moreover, the Finger domain is placed more towards the back of the Palm domain compared to DNA polymerase IV allowing a β hairpin from the Cyclase/RRM domain to block the putative ATP binding pocket. Together, these observations suggest that the domain is not capable of nucleotide binding and thus inactive (Supplementary Fig. 5c). Conversely, the zinc finger motif is very well conserved between the large subunits of type III (Cas10 group) and Cas10d (Supplementary Figs. 1b, 2, and 6).
Finally, the C-terminal region of Cas10d consists of 10 α-helices, and is thus more similar to type III than type I large subunits24,28, for which the domain adopts an αβ fold (Fig. 2a, dark blue and purple)14,15,29. The domain can be broken down into a Thumb domain similar to those found in polymerases (residues 624–719) and a helical bundle, which is structurally similar to the small effector complex subunit Csm2/Cmr5/Cas11 (residues 720–847, Fig. 2a, purple). This is consistent with comparative sequence analysis, which has suggested that several large subunits contain Cas11-like domains at their C termini12. Taken together, our observation that the HD nuclease domain of Cas10d is a hybrid between the type I domains found in Cas3 and Cas3” while that the rest of the protein is more similar to type III large subunits suggests that the type I-D large subunit could represent an evolutionary intermediate between canonical type I and type III CRISPR–Cas systems.
SIRV3 AcrID1 makes extensive contacts to all major domains of Cas10d
In the 2:4 complex of Cas10d and AcrID1 there are no direct interactions between the two Cas10d subunits but several specific interactions between both AcrID1 monomers and each domain of Cas10d that effectively cross-link the Cas10d subunits (Fig. 1b). These interactions tie one AcrID1 monomer (chain A) to the HD nuclease, Palm/RRM, and Thumb/Cas11 domains of one Cas10d subunit and the other AcrID1 monomer (chain B) to the Cyclase/RRM and C-terminal Thumb domains of another Cas10d subunit (Fig. 3). Contacts to the Cyclase/RRM domain are extensive and involve both charged and hydrogen bond interactions to two exposed loops (273–275 and 292–307, Fig. 3a). Although these regions contain some similarity to Csm1/Cmr2, direct contacts appear to be specific and uniquely targeting Cas10d (Supplementary Fig. 6). The interactions between AcrID1 and the HD nuclease domain are especially intriguing because they involve an N terminal extension of Cas10d (residues 1–13) for which specific contacts are made to both Asn3 and Arg6 (Fig. 3b). Interestingly, this extension is not part of the structured HD domain and likely another unique feature of Cas10d (Supplementary Fig. 4). Structural alignment with divalent metal-containing HD domains shows that particularly His45, which is the active site residue closest to the N-terminus, is further away from the rest of the active site than in metal-bound Cas3 HD domain structures. Consequently, this could either be a result of the missing ion or AcrID1 interaction. Contacts between AcrID1 and the Palm/RRM domain do not appear as strong as those to the Cyclase/RRM domain and most involve the exposed loop region 516–531, which is also not present in type III CRISPR–Cas large subunits (Fig. 3c and Supplementary Fig. 6). Finally, a few contacts exist between AcrID1 and the Thumb/Cas11 domains of Cas10d (Fig. 3d).
AcrID1 prevents formation of a cleavage-competent I-D effector complex on target DNA
To understand the functional consequences of AcrID1 binding, we analysed the effect of AcrID1 on dsDNA cleavage by a complete type I-D effector complex reconstituted in vitro20. I-D backbone complex consisting of Csc1 (Cas5), Csc2 (Cas7) and crRNA was mixed with Cas10d co-purified with SS (small subunit, Cas11), Cas3’, ATP, and increasing amounts of AcrID1 before addition of double-stranded target DNA radio-labelled on the non-target strand (dsNTS). Upon incubation of the effector complex components with the dsNTS, but in the absence of AcrID1, several cleavage products appeared, which have been shown to be the result of the activity of the Cas10d HD domain (Fig. 4a)20. This activity was gradually reduced by addition of increasing amounts of recombinant AcrID1 and almost completely absent at a 1:4 molar ratio of Cas10d and the anti-CRISPR protein. Next, we investigated dsDNA binding by Cas10d/SS and the I-D backbone in the presence or absence of AcrID1 by electrophoretic mobility shift assay (EMSA, Fig. 4b). Pre-incubation of the main effector complex components (dsDNA, I-D backbone, Cas10d/SS, but not Cas3’) with increasing amounts of AcrID1 significantly reduced the binding to the target (Fig. 4b, left) while pre-incubation of the effector complex components with the dsDNA target before the addition of AcrID1 did not show this effect (Fig. 4b, right panel). To further delineate where AcrID1 interferes with effector complex assembly, we repeated the cleavage assay with preincubation of dsDNA with both backbone and Cas10d/SS or only one or the other, before addition of the remaining components, Cas3’/ATP and in the presence or absence of AcrID1 (Fig. 4c). This experiment confirmed that binding of both Cas10d/SS and backbone to dsDNA is both necessary and sufficient to prevent AcrID1 from interfering, while neither Cas10d/SS nor backbone binding to dsDNA is effective on their own. Taken together, this suggests that AcrID1, through its interaction with Cas10d, prevents formation of a cleavage-competent effector complex on DNA and moreover, that AcrID1 must interfere with Cas10d before a cleavage-competent complex with backbone and DNA has formed.
This result prompted the question of whether the extended conformation of Cas10d observed in the crystal structure is induced by AcrID1 binding or is an intrinsic feature of the protein. To answer this, we finally pursued a structure of the isolated Cas10d subunit. The isolated protein does not readily crystallise likely due to intrinsic flexibility, however, we were able to obtain crystals of a mildly trypsin-treated sample and determine the structure at 4.0 Å using molecular replacement (Supplementary Table 1). The new crystals contain two copies of Cas10d in the crystallographic asymmetric unit, for which the flexible N terminus (residues 1–13) that interacts with AcrID1, appears to have been cleaved off. Moreover, the Palm/RRM domain has been nicked by the protease resulting in the protein appearing as two bands on a denaturing polyacrylamide gel. Nevertheless, the structure confirms the overall, extended conformation of Cas10d as observed in complex with AcrID1 (Fig. 5a). The helical insertion in the Palm/RRM domain, which interacts directly with AcrID1 in the complex structure, has shifted away from the core of the protein by about 5 Å in comparison to the complex with AcrID1 (Fig. 5a and Supplementary Fig. 7). Together, this suggests that the flexibility of Cas10d is mainly caused by the extended Palm/RRM domain. Consequently, it is likely that this domain gets organised when Cas10d joins the backbone complex and forms the complete I-D effector complex. In summary, we conclude that the extended structure of Cas10d is an intrinsic feature and not an effect of AcrID1 binding.
In this paper, we demonstrate that the S. islandicus type I-D large subunit, Cas10d, has a unique structure consisting of a type I-like HD nuclease domain joined to a set of inactivated type III-like nucleotidyl transferase domains and a Cas11-like C-terminal domain. Moreover, we find that the overall domain organisation is unusual and more extended than other effector complex large subunits, likely due to an intrinsic flexibility of the Palm/RRM domain. AcrID1 is one of only three known SIRV anti-CRISPR proteins23,33,34,35. Acr proteins are typically small, highly divergent proteins that use unique features of their target Cas proteins for recognition36. Thus, sequence alignment of AcrID1 with Acr proteins targeting other type I CRISPR–Cas systems, such as Pseudomonas aeruginosa AcrF3 (anti-type I-F) or AcrE1 (anti-type I-E) reveal little similarity (Supplementary Fig. 8a). The interaction between AcrID1 and Cas10d appears uniquely tailored to the type I-D system, which mirrors what has been observed for other type I Cas-Acr complexes29,37,38,39. Even between closely homologous AcrID proteins from the Sulfolobus islandicus viruses SIRV2 and SIRV3, both targeting the type I-D CRISPR–Cas system, almost none of the interactions are conserved suggesting that these proteins bind in very unique ways to their target proteins (Supplementary Fig. 8b).
The isolated and AcrID1-bound structures of Cas10d leave open the question of what the conformation of Cas10d is inside the type I-D effector complex. However, some information can be gleaned from comparison to the closest homologous type III effector complexes, such as the T. onnurineus Csm (type III-A) and A. fulgidus Cmr (type III-B) complexes determined by cryo-EM40,41. Comparison of Cas10d with the conformation of the corresponding large subunit in the Csm complex, Csm1, and taking the observed flexibility of Cas10d into account suggests that the helical extension of the Cyclase/RRM domain likely folds back up upon interaction with the effector complex and target DNA (Fig. 5a). Consequently, this also suggests that AcrID1, by stabilising the open conformation, sequesters Cas10d in a non-functional state preventing access to the effector complex (Figs. 5a, 6 and Supplementary Fig. 8a). This is consistent with our functional data and might also suggest that the AcrID1 and DNA binding sites on Cas10d overlap as has been seen for other Acr–Cas complexes37,42. On the other hand, once a cleavage-competent complex has been formed between Cas10d, backbone, and target DNA, AcrID1 is no longer able to interfere (Fig. 6). This principle of inhibition is likely unique and is supported by recent findings showing that neither Cas10d nor the backbone complex can bind target DNA on their own20. It suggests that the interaction between Cas10d and target DNA is quite tenuous, which thus provides a window of opportunity for interference by an anti-CRISPR protein that can capture the large subunit. The additional interactions present in the circular 4:2 complex between Cas10d and AcrID1 as well as the stabilisation this complex confers to an intrinsically flexible subunit, likely play a role in allowing this to happen.
Finally, super-positioning of Cas10d on the large subunit of the A. fulgidus Cmr complex (Cmr2, type III-B) reveals a striking structural overlap of the Cas11-like domain of Cas10d with the Cas11 subunit of the type III-B system (Cmr5, rmsd 7.5 Å over 612 atoms), suggesting that the C-terminal domain of Cas10d in fact plays the role of the small subunit inside the type I-D effector complex (Fig. 5b). This is consistent with the lack of a cas11 gene in the type I-D systems as well as previously observed structural similarities between small subunits and the C-terminal domain of several large subunits43. Moreover, recent studies have found that Cas10d co-purifies with a small 14 kDa protein (SS), which likely results from an alternative translation start position at the beginning of the Cas11 domain in the gene encoding the large subunit20,44. Comparison with the Cmr complex, which contains two copies of the Cmr5 (Cas11) small subunits further suggests that in the type I-D effector complex, these roles may be played by multiple copies of the C-terminal Cas11 domain of Cas10d generated by the alternative translation start. Structural studies of Cas10d in the context of the effector complex will hopefully shed light on this intriguing aspect of the type I-D effector complex40,41.
Protein expression and purification
The plasmids pET30a-cas10d and pET30a-acrID1 were used for expression of recombinant Cas10d and AcrID1, both with C-terminal His6 tags23. Separate 2 L cultures of E. coli B834 (DE3) cells (Novagen) carrying plasmids expressing either Cas10d or AcrID1 were grown in lysogeny broth (LB) medium supplemented with 50 µg/mL kanamycin at 37 °C with shaking, until OD600 reached approximately 0.6–0.8. The cells were harvested by centrifugation (1500 × g for 10 min at 4 °C) and used to inoculate 6 L of M9 minimal medium (Molecular Dimensions), supplemented with 50 µg/mL kanamycin, glucose, vitamins and amino acids with the exception of L-methionine. The cells were grown for 3 h at 37 °C after which 20 mg/mL seleno–methionine (Se–Met, Molecular Dimensions) was added and the cells were further incubated until OD600 reached 0.8. Protein expression was then induced with 0.5 mM isopropyl-β-D-thio-galactopyranoside (IPTG) and expression continued overnight at 20 °C. For protein purification, cells were resuspended in lysis buffer (50 mM Tris-Cl, pH = 8.0, 300 mM NaCl, 20 mM imidazole, 3 mM β-mercaptoethanol, β-ME) supplemented with 1 mM phenyl methyl sulfonyl fluoride (PMSF) and 10 µg/mL DNase I. The cells were lysed by sonication and cell debris removed by centrifugation (23,000 × g for 45 min at 4 °C). The cell lysate was heated at 75 °C for 20 min to remove the bulk of E. coli proteins as a first step of purification, then centrifuged to clear the supernatant containing either Cas10d-His or AcrID1-His. The samples were then applied to a pre-packed 5 mL HisTrap column (Cytiva), pre-equilibrated in lysis buffer and eluted using a stepwise gradient into lysis buffer with increasing concentrations (30–300 mM) of imidazole. The eluate containing the protein of interest was then applied on a 1 mL Heparin column (Cytiva), pre-equilibrated in 50 mM Bicine, pH = 8.5, 50 mM NaCl, 5 mM β-ME, and eluted with a linear gradient into the same buffer containing 1 M NaCl. The sample was finally purified by size exclusion chromatography using a Superdex 200 increase 10/300 column (Cytiva), pre-equilibrated in 25 mM Hepes-NaOH, pH = 7.5, 150 mM NaCl, 5 mM β-ME. Peak fractions of Cas10d were pooled and concentrated to 5–15 mg/mL and used for crystallisation in isolation or to form the AcrID1-bound complex23. For complex formation, Cas10d and AcrID1 were mixed in a 1:2 (Cas10d:AcrID1) molar ratio and incubated at 75 °C for 1 min. The complex was then isolated using a Superdex 200 increase 10/300 column (Cytiva) pre-equilibrated in 30 mM Hepes-NaOH, pH = 7.5, 150 mM NaCl, 5 mM β-ME and concentrated to 4 mg/mL. For the isolated Cas10d structure, a sample of the protein was treated with bovine trypsin in a 1:500 (trypsin:Cas10d) molar ratio for 50 min at room temperature. The proteolysis reaction was quenched with 1 mM PMSF after which the sample was purified on a Superdex 200 increase 10/300 column (Cytiva), pre-equilibrated in 25 mM Na-Hepes, pH = 7.5, 300 mM NaCl, 5 mM β-ME. The protein eluted in a similar position to the uncut protein, but sodium dodecylsulphate polyacrylamide gel electrophoresis (SDS-PAGE) revealed the presence of two bands corresponding to approximately 45 and 50 kDa. The protein was concentrated to ~5 mg/ml using a 30 kDa MWCO VivaSpin sample concentrator (Sigma-Aldrich) before crystallisation screening. All protein concentrations were estimated using a NanoDrop measuring OD280 and theoretical extinction coefficients calculated from the known protein sequences.
Crystallisation and structure determination
The purified Cas10d–AcrID1 complex was crystallised in 0.1 M Hepes, pH = 7.5, 4% (w/v) PEG 8000, 5% (v/v) ethylene glycol by sitting drop vapour diffusion prepared in Swissci MRC 2-drop crystallisation plates (Hampton Research) using a Mosquito® nanoliter protein crystallisation robot (TTP Labtech). Initial crystals appeared after two days at 19 °C and further optimised for data collection. All crystals were cryo-cooled in the reservoir buffer supplemented with 30% (v/v) ethylene glycol. After extensive crystal screening and optimisation, a single-wavelength Se anomalous dataset extending to 3.5 Å (CC1/2 = 0.55 at 3.48 Å) was obtained at 100 K and 0.9793 Å X-ray wavelength, identified using a fluorescence energy scan for selenium using mxCUBE at the PETRA III beamline P14 in Hamburg, Germany (see Supplementary Table 1 for crystallographic statistics). The raw data were processed and scaled using XDS and XSCALE45, and data quality assessed using CCP446 and Phenix.xtriage47. Initial phases were calculated using Phenix.autosol and the map improved by density modification as implemented in the Phenix suite47. The initial Se-phased map was of sufficient quality to place an AcrID1 dimer based on the existing structure23, while Cas10d was manually built in Coot48 using the Se-phased experimental map and using the Se anomalous map as guide for methionine positions, which was critical to correct tracing. The model was improved by iterative model building and refinement in Phenix.refine. The clash score and Ramachandran statistics were improved using Namdinator49, while shifts in registry and model geometry were corrected by model optimisation in ISOLDE50 combined with refinement in Phenix.refine. The final structure has R-work=23.7% and R-free=26.6% and a relatively high, average B factor (137 Å2) likely owing to the high level of flexibility of Cas10d (Supplementary Table 1).
For the isolated Cas10d structure, initial crystals appeared by sitting drop vapour diffusion overnight at 19 °C in 100 mM Hepes-NaOH pH = 7.5, 8% (w/v) PEG 8000, 8 % (v/v) ethylene glycol and further optimised by addition of 1% (w/v) PEG 3350. The crystals were cryo-cooled in the reservoir buffer supplemented with 25% (v/v) ethylene glycol before native, monochromatic x-ray data extending to 4.0 Å were collected at the Diamond Light Source beamline I04 in Oxford, UK. The raw data were processed and scaled using XDS and XSCALE45, and data quality was assessed using Phenix.xtriage47. The structure was determined using molecular replacement (Phenix.phaser)51 using Cas10d from the Cas10d–AcrID1 complex as a search model. The crystals of isolated Cas10d contain two copies of the protein per crystallographic asymmetric unit, one of which appears to have a partial RRM/Cyclase domain, presumably due to the proteolytic treatment. In the other molecule, which is more intact, the RRM/Cyclase domain has shifted ~5 Å compared to the complex structure. Inspection of the packing further revealed that parts of the intact structures would clash in the new crystal, suggesting this is the reason that proteolysis allowed crystallisation of the isolated protein. Both chains were rebuilt in Coot and iteratively refined using Phenix.refine. The final structure has R-work = 34.3% and R-free = 37.0% and a very high, average B factor (228 Å2), presumably due to the additional flexibility induced by cleavage of the protein (Supplementary Table 1). All structure figures were created in PyMOL.
Labelling of dsDNA target
DNA oligonucleotides (Supplementary Table 2) were purchased from Integrated DNA Technologies (IDT) and purified by recovering the corresponding bands from a 12% denaturing polyacrylamide gel. The DNA of the non-target strand was 5’-labelled with [γ-32P]-ATP (PerkinElmer) and T4 polynucleotide kinase (New England Biolabs) before double-stranded DNA (dsDNA) was generated by annealing the labelled non-target strand with 2-fold molar excess of unlabelled target strand.
I-D CRISPR–Cas effector purification
A S. islandicus LAL14/1 derivative strain lacking all CRISPR arrays52 as well as the type I-A and type III-B systems but retaining the I-D effector module transformed with pEXA2-Csc1-S5–6 was used to purify backbone complex (Csc1, Csc2, and crRNA) essentially as described before20. Briefly, cells were grown in SCVY medium and the His-tagged complex purified using a His-trap column (Cytiva) followed by gel filtration using a Superdex 200 10/300 column (Cytiva) eluting in 20 mM Hepes-OH, pH = 7.5, and 300 mM NaCl. Recombinant Cas10d for the functional assays co-purified with small subunit (SS) and was purified from E. coli as described previously23. Recombinant MBP-Tev-Cas3’-His was overexpressed from pMAL-TEV-Cas3’ in E. coli BL21 (DE3) Rosetta overnight at 20 °C and purified by HisTrap and gel filtration as described above, cleaved by Tev protease overnight at 4 °C, and purified by HisTrap and gel filtration again, eluting in a final buffer containing 20 mM Hepes-OH, pH = 7.5, and 300 mM NaCl.
DNA cleavage assays
DNA cleavage assays were conducted in 10 μL of reaction volume containing 0.02 mg/mL backbone complex, 0.03 mg/mL Cas10d, 0.01 mg/mL Cas3’, 100 μM ATP, 20 nM dsDNA substrate and the indicated amounts of AcrID1 in a cleavage buffer (20 mM MES-NaOH pH = 6.0, 10 mM MgCl2, 1 mM DTT) at 65 °C for 1 h. Reactions were stopped by adding 2× RNA loading dye (New England Biolabs). For electrophoresis, samples were heated for 3 min at 95 °C and analysed by 12% denaturing polyacrylamide gel electrophoresis (PAGE). For visualisation, a phospho-imager screen was exposed to the gel overnight and scanned with Typhoon FLA-7000. DNA cleavage gels (Fig. 4a, c) were run at identical conditions. Source data are provided as a Source data file.
Electrophoretic mobility shift assays
Each reaction mixture contained 0.02 mg/mL backbone complex, 0.03 mg/mL Cas10d, 20 nM dsDNA substrate and the indicated amounts of AcrID1 in cleavage buffer (20 mM MES-NaOH pH = 6.0, 10 mM MgCl2, 1 mM DTT) and was incubated at 65 °C for 30 min. An equal volume of 2× native loading buffer (0.02% bromophenol blue, 40% (v/v) glycerol) was added and the samples were loaded on an 8% native PAGE running at 25 °C in 40 mM Tris, 20 mM acetic acid, pH = 8.4. Migration on the gel was visualised by exposure to a phospho-imager screen overnight, which was subsequently scanned on a Typhoon FLA-7000.
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Coordinates and structure factors for Cas10d–AcrID1 (6THH) and for isolated Cas10d (6YES) have been deposited in the Protein Data Bank. Other publicly available datasets include PDB ID 6MUT, Cryo-EM structure of ternary Csm-crRNA-target RNA with anti-tag sequence complex in type III-A CRISPR–Cas system; PDB ID 3X1L, Crystal Structure of the CRISPR–Cas RNA Silencing Cmr Complex Bound to a Target Analog; PDB ID 4UW2, Crystal structure of Csm1 in T.onnurineus; PDB ID 4QQW, Crystal structure of T. fusca Cas3; PDB ID 4W8Y, Structure of full length Cmr2 from Pyrococcus furiosus (Manganese bound form); PDB ID 5B7I, Cas3-AcrF3 complex; PDB ID 3S4L, The CRISPR-associated Cas3 HD domain protein MJ0384 from Methanocaldococcus jannaschii; PDB ID 3SK9, Crystal structure of the Thermus thermophilus cas3 HD domain; and PDB ID 1JX4, Crystal Structure of a Y-family DNA Polymerase in a Ternary Complex with DNA Substrates and an Incoming Nucleotide. Source data are provided with this paper.
Bolotin, A., Quinquis, B., Sorokin, A. & Ehrlich, S. D. Clustered regularly interspaced short palindrome repeats (CRISPRs) have spacers of extrachromosomal origin. Microbiology 151, 2551–2561 (2005).
Makarova, K. S. et al. An updated evolutionary classification of CRISPR-Cas systems. Nat. Rev. Microbiol. 13, 722–736 (2015).
Barrangou, R. et al. CRISPR provides acquired resistance against viruses in prokaryotes. Science 315, 1709–1712 (2007).
van der Oost, J., Jore, M. M., Westra, E. R., Lundgren, M. & Brouns, S. J. CRISPR-based adaptive and heritable immunity in prokaryotes. Trends Biochem Sci. 34, 401–407 (2009).
Garneau, J. E. et al. The CRISPR/Cas bacterial immune system cleaves bacteriophage and plasmid DNA. Nature 468, 67–71 (2010).
Sontheimer, E. J. & Marraffini, L. A. Microbiology: slicer for DNA. Nature 468, 45–46 (2010).
Westra, E. R. et al. The CRISPRs, they are a-changin’: how prokaryotes generate adaptive immunity. Annu Rev. Genet. 46, 311–339 (2012).
Sashital, D. G., Wiedenheft, B. & Doudna, J. A. Mechanism of foreign DNA selection in a bacterial adaptive immune system. Mol. Cell 46, 606–615 (2012).
Sorek, R., Lawrence, C. M. & Wiedenheft, B. CRISPR-mediated adaptive immune systems in bacteria and archaea. Annu Rev. Biochem 82, 237–266 (2013).
Marraffini, L. A. & Sontheimer, E. J. CRISPR interference: RNA-directed adaptive immunity in bacteria and archaea. Nat. Rev. Genet. 11, 181–190 (2010).
Wiedenheft, B., Sternberg, S. H. & Doudna, J. A. RNA-guided genetic silencing systems in bacteria and archaea. Nature 482, 331–338 (2012).
Makarova, K. S. et al. Evolutionary classification of CRISPR-Cas systems: a burst of class 2 and derived variants. Nat. Rev. Microbiol. 18, 67–83 (2020).
Sinkunas, T. et al. Cas3 is a single-stranded DNA nuclease and ATP-dependent helicase in the CRISPR/Cas immune system. EMBO J. 30, 1335–1342 (2011).
Gong, B. et al. Molecular insights into DNA interference by CRISPR-associated nuclease-helicase Cas3. Proc. Natl Acad. Sci. USA 111, 16359–16364 (2014).
Huo, Y. et al. Structures of CRISPR Cas3 offer mechanistic insights into Cascade-activated DNA unwinding and degradation. Nat. Struct. Mol. Biol. 21, 771–777 (2014).
Mulepati, S. & Bailey, S. Structural and biochemical analysis of nuclease domain of clustered regularly interspaced short palindromic repeat (CRISPR)-associated protein 3 (Cas3). J. Biol. Chem. 286, 31896–31903 (2011).
Makarova, K. S., Aravind, L., Grishin, N. V., Rogozin, I. B. & Koonin, E. V. A. DNA repair system specific for thermophilic Archaea and bacteria predicted by genomic context analysis. Nucleic Acids Res. 30, 482–496 (2002).
Koonin, E. V., Makarova, K. S. & Zhang, F. Diversity, classification and evolution of CRISPR-Cas systems. Curr. Opin. Microbiol 37, 67–78 (2017).
Makarova, K. S. et al. Evolution and classification of the CRISPR-Cas systems. Nat. Rev. Microbiol 9, 467–477 (2011).
Lin, J. et al. DNA targeting by subtype I-D CRISPR-Cas shows type I and type III features. Nucleic Acids Res. 48, 10470–10478 (2020).
Jaubert, C. et al. Genomics and genetics of Sulfolobus islandicus LAL14/1, a model hyperthermophilic archaeon. Open Biol. 3, 130010 (2013).
Papathanasiou, P. et al. Stable maintenance of the rudivirus SIRV3 in a carrier state in Sulfolobus islandicus despite activation of the CRISPR-Cas immune response by a second virus SMV1. Rna Biol. 16, 557–565 (2019).
He, F. et al. Anti-CRISPR proteins encoded by archaeal lytic viruses inhibit subtype I-D immunity. Nat. Microbiol 3, 461–469 (2018).
Benda, C. et al. Structural model of a CRISPR RNA-silencing complex reveals the RNA-target cleavage activity in Cmr4. Mol. Cell 56, 43–54 (2014).
Jung, T. Y. et al. Crystal structure of the Csm1 subunit of the Csm complex and its single-stranded DNA-specific nuclease activity. Structure 23, 782–790 (2015).
Aravind, L. & Koonin, E. V. The HD domain defines a new superfamily of metal-dependent phosphohydrolases. Trends Biochem Sci. 23, 469–472 (1998).
Mulepati, S. & Bailey, S. In vitro reconstitution of an Escherichia coli RNA-guided immune system reveals unidirectional, ATP-dependent degradation of DNA target. J. Biol. Chem. 288, 22184–22192 (2013).
Jia, N., Jones, R., Sukenick, G. & Patel, D. J. Second messenger cA4 formation within the composite Csm1 Palm pocket of type III-A CRISPR-Cas Csm complex and its release path. Mol. Cell 75, 933–943 e936 (2019).
Wang, X. et al. Structural basis of Cas3 inhibition by the bacteriophage protein AcrF3. Nat. Struct. Mol. Biol. 23, 868–870 (2016).
Beloglazova, N. et al. Structure and activity of the Cas3 HD nuclease MJ0384, an effector enzyme of the CRISPR interference. EMBO J. 30, 4616–4627 (2011).
Makarova, K. S., Aravind, L., Wolf, Y. I. & Koonin, E. V. Unification of Cas protein families and a simple scenario for the origin and evolution of CRISPR-Cas systems. Biol. direct 6, 38 (2011).
Ling, H., Boudsocq, F., Woodgate, R. & Yang, W. Crystal structure of a Y-family DNA polymerase in action: a mechanism for error-prone and lesion-bypass replication. Cell 107, 91–102 (2001).
Athukoralage, J. S. et al. An anti-CRISPR viral ring nuclease subverts type III CRISPR immunity. Nature 577, 572–575 (2020).
Bhoobalan-Chitty, Y., Johansen, T. B., Di Cianni, N. & Peng, X. Inhibition of Type III CRISPR-Cas immunity by an Archaeal virus-encoded anti-CRISPR. Protein Cell 179, 448–458 (2019). e411.
Peng X., Mayo-Munoz D., Bhoobalan-Chitty Y. & Martinez-Alvarez L. Anti-CRISPR Proteins in Archaea. Trends Microbiol. 11, 913–921 (2020).
Hwang, S. & Maxwell, K. L. Meet the Anti-CRISPRs: widespread protein inhibitors of CRISPR-Cas systems. CRISPR J. 2, 23–30 (2019).
Chowdhury, S. et al. Structure reveals mechanisms of viral suppressors that intercept a CRISPR RNA-guided surveillance complex. Cell 169, 47–57 (2017). e11.
Guo, T. W. et al. Cryo-EM structures reveal mechanism and inhibition of DNA targeting by a CRISPR-Cas surveillance complex. Cell 171, 414–41 (2017).
Wang, J. et al. A CRISPR evolutionary arms race: structural insights into viral anti-CRISPR/Cas responses. Cell Res. 26, 1165–1168 (2016).
Osawa, T., Inanaga, H., Sato, C. & Numata, T. Crystal structure of the CRISPR-Cas RNA silencing Cmr complex bound to a target analog. Mol. Cell 58, 418–430 (2015).
Jia, N. et al. Type III-A CRISPR-Cas Csm complexes: assembly, periodic RNA cleavage, dnase activity regulation, and autoimmunity. Mol. Cell 73, 264–277 e265 (2019).
Liu, L., Yin, M., Wang, M. & Wang, Y. Phage AcrIIA2 DNA mimicry: structural basis of the CRISPR and anti-CRISPR arms race. Mol. Cell 73, 611–620 (2019). e613.
Reeks, J., Naismith, J. H. & White, M. F. CRISPR interference: a structural perspective. Biochem J. 453, 155–166 (2013).
McBride T. M., et al. Diverse CRISPR-Cas complexes require independent translation of small and large subunits from a single gene. bioRxiv. Prerint at https://doi.org/10.1101/2020.04.18.045682 (2020).
Kabsch, W. Xds. Acta Crystallogr D. Biol. Crystallogr. 66, 125–132 (2010).
Winn, M. D. et al. Overview of the CCP4 suite and current developments. Acta Crystallogr D. Biol. Crystallogr. 67, 235–242 (2011).
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr D. Biol. Crystallogr. 66, 213–221 (2010).
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr D. Biol. Crystallogr. 60, 2126–2132 (2004).
Kidmose, R. T. et al. Namdinator - automatic molecular dynamics flexible fitting of structural models into cryo-EM and crystallography experimental maps. IUCrJ 6, 526–531 (2019).
Croll, T. I. ISOLDE: a physically realistic environment for model building into low-resolution electron-density maps. Acta Crystallogr D. Struct. Biol. 74, 519–530 (2018).
McCoy, A. J. et al. Phaser crystallographic software. J. Appl Crystallogr. 40, 658–674 (2007).
He, F., Vestergaard, G., Peng, W., She, Q. & Peng, X. CRISPR-Cas type I-A Cascade complex couples viral infection surveillance to host transcriptional regulation in the dependence of Csa3b. Nucleic Acids Res. 45, 1902–1913 (2017).
We are thankful to Saravanan Panneerselvam and Thomas Lykke-Møller Sørensen for assistance during data collection at EMBL PETRA beamlines P14 and P13, respectively, and Kamel El Omari at Diamond Light Source for discussions on low-resolution model building. We are also thankful to Jesper Lykkegaard Karlsen for computational support and Tristan Croll for his help through using ISOLDE for improving the model. Funding for this work came from a Lundbeck Foundation post-doctoral grant to M.C.M. and grants from the Novo Nordisk Foundation to X.P. (grant no. NNF17OC0031154) and D.E.B. (grant no. NNF17OC0028072).
The authors declare no competing interests.
Peer review information Nature Communications thanks Malcolm White and other, anonymous, reviewers for their contributions to the peer review of this work. Peer review reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Manav, M.C., Van, L.B., Lin, J. et al. Structural basis for inhibition of an archaeal CRISPR–Cas type I-D large subunit by an anti-CRISPR protein. Nat Commun 11, 5993 (2020). https://doi.org/10.1038/s41467-020-19847-x
Nature Communications (2022)
Insights into the inhibition of type I-F CRISPR-Cas system by a multifunctional anti-CRISPR protein AcrIF24
Nature Communications (2022)
Nature Reviews Molecular Cell Biology (2021)