Cas13d, the type VI-D CRISPR-Cas effector, is an RNA-guided ribonuclease that has been repurposed to edit RNA in a programmable manner. Here we report the detailed structural and functional analysis of the uncultured Ruminococcus sp. Cas13d (UrCas13d)-crRNA complex. Two hydrated Mg2+ ions aid in stabilizing the conformation of the crRNA repeat region. Sequestration of divalent metal ions does not alter pre-crRNA processing, but abolishes target cleavage by UrCas13d. Notably, the pre-crRNA processing is executed by the HEPN-2 domain. Furthermore, both the structure and sequence of the nucleotides U(-8)-C(-1) within the repeat region are indispensable for target cleavage, and are specifically recognized by UrCas13d. Moreover, correct base pairings within two separate spacer regions (an internal and a 3′-end region) are essential for target cleavage. These findings provide a framework for the development of Cas13d into a tool for a wide range of applications.
CRISPR-Cas system is a heritable, adaptive immune system that protects bacteria and archaea against phages and other invasive MGEs1,2,3,4. The most prominent feature of the system is the array of genomic DNA known as clustered regularly interspaced short palindromic repeats (CRISPR). CRISPR serves as an archive that stores information about previous infections in form of DNA sequences (termed spacers) acquired from the invading MGEs5,6,7. CRISPR-Cas systems can be assigned to class 1 and class 2 systems, which are further divided into types and subtypes2,4,6. Class 2 systems (including types II, V, and VI) deploy a single Cas effector protein to accomplish CRISPR RNA (crRNA) biogenesis and interference4,6,8. Due to their simplicity, class 2 systems have become powerful tools for genome editing and other applications in life sciences9,10,11,12,13,14,15,16.
Particularly interesting among the class 2 systems is the recently identified type VI system, which encompasses subtypes VI-A (effector Cas13a, also known as C2c2), VI-B (effector Cas13b), VI-C (effector Cas13c), and VI-D (effector Cas13d)14,17,18,19,20,21. Cas13 family is the only family of class 2 Cas enzymes known to exclusively target single-stranded RNA. RNA cleavage is mediated by two R-X4-H motifs, which are characteristic of higher eukaryotes and prokaryotes nucleotide (HEPN)-binding domains22,23,24,25,26. Typically, Cas13 effectors exhibit collateral non-specific RNase activity triggered upon binding target RNA sequence17,19,20,21,26. Also, Cas13 enzymes have been engineered as programmable RNA-binding modules for targeting and editing RNA19,20,27,28,29,30.
Analogously to the other members of the Cas13 superfamily, Cas13d also contains two R-X4-H HEPN motifs, but bears little overall similarity to the amino acid sequences of Cas13a and Cas13b17,18,19,20,21. Owing to its biochemical properties, especially small size, lack of constraints on the target flanking sequences, and highly efficient and specific target RNA cleavage, Cas13d is a promising candidate for RNA engineering and editing20,21. However, more widespread application is hindered by the lacking structural analysis of mechanisms underlying Cas13d activity.
Recently, cryo-EM structures of Eubacterium siraeum Cas13d (EsCas13d) in crRNA-bound, and target-bound states have been reported31. Nevertheless, some important issues remain unanswered or require further validation. For example, the pre-crRNA processing site has not been identified. Moreover, an electron density that likely represents a Mg2+ ion was found to interact with the highly ordered nucleotides at the 3′-end of the crRNA repeat region both in EsCas13d binary and ternary complexes31. Due to limited resolutions of the cryo-EM structures31, this prominent feature should be inspected in a higher-resolution structure of Cas13d. Besides, two different results were reported regarding the dependence of pre-crRNA processing on Mg2+ ion21,31, which requires further verification. Whether the crRNA repeat region is recognized in both structure-specific and sequence-specific manner needs detailed investigation. Last but not least, the mismatch tolerance between the crRNA spacer region and the target RNA should be thoroughly tested. Here, we try to answer these questions by using high-resolution crystal structures of Cas13d and biochemical studies, and provide additional molecular information that can be used in rational design of the CRISPR-Cas13d system for a wide range of potential applications.
Overall structure of the UrCas13d-crRNA binary complex
In order to understand the structural basis of pre-crRNA processing and mature crRNA recognition by Cas13d, we solved the crystal structure of the uncultured Ruminococcus sp. Cas13d (UrCas13d, also known as RspCas13d)-crRNA complex at 2.15 Å resolution using the single-wavelength anomalous diffraction method (Supplementary Table 1). The binary complex was obtained by the selenomethionine (SeMet)-derived UrCas13d (R288A/R823A) mutant co-expressed with CRISPR RNA in vivo. The SeMet-labeled structure was subsequently used as a model to solve the crystal structure of the unlabeled UrCas13d (R288A/R823A) binary complex at 1.86 Å resolution using the molecular replacement method.
Due to its minimal size among type VI CRISPR effectors, UrCas13d exhibits an atypically compact architecture (Fig. 1a–c and Supplementary Movie 1). Although the conventional bilobed architecture consisting of a recognition (REC) lobe and a nuclease (NUC) lobe is still discernible in UrCas13d, its compact structure blurs this bilobed notion to certain extent. The REC lobe consists of the N-terminal domain (NTD) and Helical-1 domain (Fig. 1a–d). The NUC lobe consists of two HEPN domains (HEPN-1 and HEPN-2) and Helical-2 domain. The mature crRNA is sandwiched within a positively charged channel formed by the REC and NUC lobes (Fig. 1d–h).
In the binary complex, except for NTD, the predominant secondary structures of UrCas13d are α-helices (Supplementary Fig. 1). NTD contains a seven-stranded, antiparallel β-sheet and short α-helix. HEPN-1 is composed of ten α-helices, and HEPN-2 domain is composed of eleven α-helices. Two HEPN domains interact with each other primarily through helix-α3 and helix-α28. Notably, the long helix-α28 of HEPN-2 domain extends to the opposite sides of the binary complex and roughly runs parallel to the crRNA structure. Helical-1 domain contains eight α-helices. Helical-2 domain contains eight α-helices, and wraps around helix-α28 of HEPN-2 domain.
A mature 50-nt crRNA was determined in the binary complex (Fig. 1b, e and Supplementary Fig. 2). It originated from an in vivo transcript encoded by a co-expressed CRISPR template, which was further processed by UrCas13d itself in E. coli cells. The crRNA consists of a 30-nt repeat region (C(-30)-C(-1)) and 20-nt spacer region (U(1)-A(20)) (Fig. 1e). The repeat region displays a stem-loop architecture. Its secondary structure is remarkably similar to the predicted one, with the exception of base pairing of A(-29) (Fig. 1b and Supplementary Fig. 3). The stem region shows a distorted A-form RNA duplex, which includes seven Watson-Crick base pairs and a wobble G-U base pair (Fig. 1b). The loop region contains four nucleotides (A(-20)-U(-17)). Interestingly, the nucleotides U(-8)-C(15) display four successive U-shaped turns (Fig. 1e).
Hydrated Mg2+ ions aid in stabilizing the crRNA repeat region
The high resolution of the binary complex unambiguously shows an octahedral-shaped electron density around the phosphate group of the nucleotide C(-7). Considering that Mg2+ and Na+ are metal ions provided in buffer systems during the purification and crystallization, a penta-hydrated Mg2+ was built into the present structure. The magnesium ion directly coordinates with five water molecules and an oxygen atom from the phosphate group of the nucleotide C(-7) (Fig. 2a). Moreover, the (Mg(H2O)5)2+ ion forms hydrogen bonds with the phosphate groups of the nucleotides A(-5), A(-3), and A(-2), as well as the side chain of residue D178 (Fig. 2a, b and Supplementary Movie 2). The detailed interactions between the (Mg(H2O)5)2+ ion and the surrounding chemical groups are shown in Fig. 3. In terms of position, the hydrated Mg2+ is situated in the center of an U-shaped turn formed by the nucleotides U(-8)-A(-2) of the repeat region. Notably, residues D178, K181 and K524 are involved in stabilizing the conformation of the (Mg(H2O)5)2+ ion and the coordinated nucleotides (Fig. 2b). These residues are conserved among Cas13d family members (Supplementary Fig. 4), and substituting any of them significantly decreases target RNA cleavage (Fig. 2c). Therefore, the penta-hydrated Mg2+ ion plays an important role in stabilizing the conformation of the crRNA repeat region.
In addition, a tetra-hydrated Mg2+ was found to coordinate with two backbone oxygen atoms of residues R137 and E140 from a loop region of NTD (Fig. 2d, e and Supplementary Movie 3). This (Mg(H2O)4)2+ ion forms hydrogen bonds with backbone atoms of residues T135, R137, S141, and S142, which helps to maintain the conformation of this loop region. Meanwhile, the side chains of H136 and S138 point outwards and form hydrogen bonds with the phosphate groups of U(-24) and G(-25), respectively. The H136A mutation slightly decreases target RNA cleavage (Fig. 2c). These imply that the (Mg(H2O)4)2+ ion is involved in the conformational stabilization of the crRNA repeat region.
To investigate whether metal ions are essential for pre-crRNA processing and target cleavage by UrCas13d, we carried out the cleavage assays with different types of metal ions. A pre-crRNA containing a repeat-spacer-repeat sequence was designed to serve as the processing target. The results clearly show that all tested divalent metal ions including Mg2+ are unnecessary for pre-crRNA processing, but essential for target RNA cleavage (Fig. 2f, g).
Specific recognition of the crRNA repeat region by UrCas13d
Conformationally, the U-shaped turn of the crRNA repeat region is stabilized not only by the penta-hydrated Mg2+, but also by extensive interactions with residues of UrCas13d (Figs. 3 and 4a). Although no residue directly obstructs base pairing between A(-29) and U(-8), the nucleotide A(-29) pairs with U(-6) (Fig. 1b). U(-8) and C(-7) swing away from the stem duplex and form a bubble in the repeat region. The (Mg(H2O)5)2+ ion coordinates with the phosphate group of C(-7), which apparently affects the conformational arrangement of U(-8) and C(-7). Besides, U(-8) stacks with A(-2), residue K56 interacts with the phosphate group of U(-8), and the side chain of Q171 forms hydrogen bond with the 2′-hydroxyl group of U(-8). The pyrimidine ring of C(-7) forms hydrogen bond with the side chain of N167, and also stacks with the side chains of Q171 and R145 (Fig. 4a, b). In addition, the side chain of R145 and the backbone nitrogen atom of M148 interact with the phosphate group of U(-6). All interactions described above help with the formation of the A(-29):U(-6) pairing. It should be noted that, when compared with a standard A-form RNA, nucleotides U(-8), A(-4), and A(-3) within the U-shaped turn have unusual conformations with respect to their sugar pucker or glycosyl bond (Supplementary Fig. 5).
Although the nucleotides A(-5)-C(-1) of the repeat region are unpaired, they are recognized in both the sugar-phosphate backbone-dependent and base-specific manners (Fig. 4a, b). The nucleotide A(-5) stacks with U(-6) and the side chain of N906 hydrogen bonds with the 2′-hydroxyl group of A(-5) (Fig. 4a). Residues N900 and R903 interact with the phosphate group of A(-5). The purine ring of A(-4) is flipped outwards to form hydrogen bond with the side chain of D914, and stacks with the side chains of R744 and R915. The side chains of K181 and R903 interact with the phosphate group of A(-4), and the side chain of R915 interacts with the 2′-hydroxyl group of A(-4) (Fig. 4a). The side chain of K181 also interacts with the phosphate group of A(-3). Furthermore, the purine ring of A(-2) is recognized by the side chains of K56, K61, N79 and N527 (Fig. 4b). The pyrimidine ring of C(-1) is recognized by G78 and R431, and partially stacks with the side chain of F75. The 2′-hydroxyl group of C(-1) hydrogen bonds with the side chain of K52.
To understand the functional importance of residues involved in recognition of the repeat region, we performed mutational analysis. As shown in Supplementary Fig. 6, single mutations of all aforementioned residues did not influence pre-crRNA processing to a detectable level. In contrast, single mutations D178A, K181A, and K524A mentioned above, as well as single mutations K61A, R744A, and R903A, greatly decrease target RNA cleavage. In addition, single mutations K56A, R145A, and Q171A also decrease target RNA cleavage to a certain level (Fig. 4c).
Rational engineering of the crRNA repeat region
The crRNA extends over the long axis of protein and interacts with every domain of UrCas13d (Fig. 1c, d). A large number of positively charged residues interact with the phosphodiester backbone of crRNA (Fig. 3). Notably, the nucleotides U(-24)-A(-13) of the stem-loop structure protrude from UrCas13d and are mostly exposed to the environmental solvent (Fig. 4d). Deletion of four Watson-Crick base pairs (U(-24)-A(-21)/U(-16)-A(-13)) within this exposed region (designated the Δstem mutant) has no obvious effect on pre-crRNA processing and target RNA cleavage (Fig. 4e, f). Moreover, the time-course study shows that, when compared with pre-crRNA or crRNA, their respective Δstem mutants have a similar efficiency in pre-crRNA processing and target RNA cleavage by UrCas13d and its mutants (Supplementary Fig. 7). The rest of the stem duplex is bound inside the channel formed by NTD, HEPN-1, and HEPN-2 domains. The sugar-phosphate backbone of this bound duplex interacts with the side chains of S53, S54, S138, R145, K799, N800, and K905 (Figs. 2e, 3 and 4b). NTD resides on the stem duplex and contributes most interactions with the crRNA repeat region (Fig. 3).
Importance of the crRNA repeat region for target RNA cleavage
To investigate the functional importance of the nucleotides U(-8)-C(-1) from the repeat region, we carried out various substitutions and deletions in this region and further inspected their pre-crRNA processing and target RNA cleavage capabilities. The results show that single deletions of A(-4), C(-7), and U(-8) decrease pre-crRNA cleavage, and single mutations A(-2)G, A(-4)U, A(-4)G, and C(-7)G have the similar effect (Fig. 4g and Supplementary Fig. 8a). Notably, single mutation or deletion of A(-2), A(-4), C(-7), or U(-8) significantly compromises target RNA cleavage (Fig. 4h). Therefore, both structure and sequence of the repeat region are critical for target RNA cleavage.
Recognition of the spacer region by UrCas13d
The spacer region of crRNA forms three U-shaped turns and is anchored inside the channel formed by all domains of UrCas13d except NTD (Fig. 1c–e). The recognition of the spacer region by UrCas13d is mostly dependent on its sugar-phosphate backbone (Fig. 3). Helical-1 domain contributes multiple interactions with the spacer region. More precisely, residues K370, G373, K439, and Y443 interact with the nucleotides U(1)-U(4) (Fig. 5a), whereas residues N337, K367, S398, K402, and K405 interact with the nucleotides U(13)-C(15) (Fig. 5b). Moreover, the nucleotides C(5)-A(12) are recognized by residues from Helical-2 and two HEPN domains (Fig. 5b, c). Residue K512 interacts with the phosphate groups of G(6), while residue K736 simultaneously interacts with the phosphate groups of G(6), A(7), and U(8). The side chain of R631 stacks with the purine ring of A(10) and interacts with the phosphate groups of U(9) and U(8). In addition, the 3′-tail of the nucleotides A(16)-A(20) extends to a positively charged groove formed by Helical-2 and HEPN-1 domains. The phosphate backbone of this region is stabilized mainly by residues from Helical-2 domain (Fig. 5d).
To further explore the functional importance of residues involved in interactions with the spacer region, the mutational studies were performed. The results indicate that single mutations N515A, N620A, and K736A unambiguously decrease target RNA cleavage (Fig. 5e, f).
Pre-crRNA processing site within HEPN-2 domain
In line with the previous studies20,21, the mature crRNA containing a 30-nt repeat region was determined in the high-resolution UrCas13d binary complexes (Fig. 1b, e). The electron density of the nucleotides C(-30)-C(-1) is clear (Supplementary Fig. 2). To understand how pre-crRNA was processed by UrCas13d, residues around the nucleotide C(-30) were selected to carry out the mutational studies (Fig. 6a and Supplementary Movie 4). The L803A, C806A, K887A, K891A, F904A, I909A, and Q911A mutations were found to have little effect on pre-crRNA cleavage (Fig. 6b). Notably, the K905A mutation abolishes pre-crRNA cleavage. Besides, the R802A mutation significantly decreases this activity (Fig. 6b). Protein sequence alignment shows that these two residues are conserved within the Cas13d family members (Supplementary Fig. 4). Thus, the mutational and structural analyses indicate that residues R802 and K905 from HEPN-2 domain are critical for pre-crRNA cleavage, which possibly adopts a base-catalyzed mechanism.
Active site for target RNA cleavage within two HEPN domains
The previous study has found that two R-X4-H HEPN ribonuclease motifs of Cas13d are responsible for target RNA cleavage and that the mutation of four catalytic residues in R-X4-H motifs abolishes the cleavage activity20. We further observed that a single R288A or H828A mutation of UrCas13d was sufficient to abrogate target RNA cleavage, the H293A mutant had a greatly decreased cleavage activity, whereas the R823A mutant still showed detectable activity to a certain degree (Fig. 5f). In the present model, the distance between the backbone Cα atoms of residues R288 and R823 of UrCas13d is ~15.3 Å (Fig. 6c and Supplementary Movie 5). The corresponding distances in LshCas13a and BzCas13b are ~17.8 Å and ~18.6 Å, respectively (Fig. 6d, e). This implies that, in comparison with LshCas13a and BzCas13b, UrCas13d requires a rather small conformational rearrangement to achieve its activation, which is consistent with the compact architecture of Cas13d.
Mismatch tolerance between the spacer region and target RNA
PAM-adjacent seed region within the crRNA spacer is known to be essential for target recognition in Cas9, Cas12a, and Cas12b32,33,34,35,36. Unlike these Cas effectors, Cas13a possesses a central seed region17,26,37. In the case of the UrCas13d binary complex, both internal nucleotides U(4)-U(8) and 3′-end nucleotides A(14)-A(20) within the spacer region are exposed and accessible to the environmental solvent. This may allow UrCas13d to use the internal or 3′-end nucleotides of the spacer to interrogate target RNA. To identify the mismatch tolerance between the spacer and target RNA, we prepared 11 target RNA variants by tiling 2-nt mismatches across the 22-nt targeted region and performed the cleavage assays (Fig. 6f and Supplementary Fig. 8b). The results demonstrated that mismatches of the nucleotides C(5)-U(8) and U(13)-C(22) abolished target cleavage (Fig. 6g). This suggests that correct base pairings within two separate spacer regions (an internal and a 3′-end region) are essential for target RNA cleavage.
To further investigate the structural differences between the UrCas13d-crRNA complex and the EsCas13d-crRNA complex, the protein backbones of two binary complexes were superimposed. The overall architectures of both binary complexes are compact and similar, with the root-mean-square deviation (RMSD) of 1.62 Å between 735 aligned Cα atoms of UrCas13d (922 amino acid residues) and EsCas13d (954 amino acid residues) (Supplementary Fig. 9a). Relative to the UrCas13d binary complex, the exposed stem-loop structure of the crRNA repeat region undergoes a ~3 Å movement towards the solvent environment in the EsCas13d binary complex. In the respective complexes, there are obvious differences in the exact position and orientation of the 5′-end nucleotide C(-30) (Supplementary Fig. 9b). C(-30) in the UrCas13d binary complex is located between two critical residues (R802 and K905) and the C-terminus of UrCas13d. Such positioning of C(-30) may be favorable for the pre-crRNA processing. In contrast, C(-30) in the EsCas13d binary complex swings away from two critical residues (R828 and K932) of pre-crRNA processing site, and the C-terminus of EsCas13d is located between these two residues and C(-30). In addition, in the target RNA-cleavage sites of two binary complexes, two catalytic residues (H293 of UrCas13d and H300 of EsCas13d) are reversely located in the respective HEPN-1 domains (Supplementary Fig. 9c).
Current availability of the type VI CRISPR-effector structures allows us to further analyze structural and functional features of Cas13d by comparing similarities and differences between Cas13a, Cas13b, and Cas13d25,26,31,38,39. Overall, the domain architecture of Cas13d shares certain spatial consistency with Cas13a, whereas Cas13b displays a distinct arrangement (Fig. 7a–e). The crRNA-target duplex is recognized by two HEPN and two Helical domains in Cas13d31; in Cas13a, it is recognized inside the pocket formed by HEPN-1, HEPN-2, Helical-2, and Helical-3 domains26. As inferred from the charge distribution on the surface of protein, Cas13b may accommodate the crRNA-target duplex within the pocket formed by Helical-1, Helical-2, RRI-1, and HEPN-1 domains38.
While Helical-1 domain of Cas13a and RRI-2 domain of Cas13b are responsible for pre-crRNA processing25,38,39, their counterpart is lacking in the compact Cas13d. Instead, residues from HEPN-2 domain play a critical role in pre-crRNA processing by Cas13d. Therefore, HEPN-2 domain of Cas13d not only contributes a R-X4-H motif for target RNA cleavage, but also provides a catalytic site for pre-crRNA cleavage.
The mature crRNAs show different recognition patterns within the Cas13a, Cas13b, and Cas13d binary complexes (Fig. 7a–h). In case of Cas13a, the repeat region of crRNA, which is characterized by stem-loop structure, is recognized by NTD and Helical-1 domains25,26,39. Without a counterpart of Helical-1 domain in Cas13a, the repeat region of crRNA is recognized by NTD and two HEPN domains in Cas13d, which still adopts a stem-loop structure. However, in Cas13d, about half of the stem-loop structure is exposed to the surrounding solvent, and the deletion of four Watson-Crick base pairs within this exposed region alters neither pre-crRNA processing nor target RNA cleavage. Notably, the nucleotides G(-9)-C(-1) of the repeat regions in both Cas13a and Cas13d exhibit a similar secondary structure and spatial U-shaped turn (Fig. 7f, g, i, j). A penta-hydrated Mg2+ greatly stabilizes the U-shaped turn in the UrCas13d binary complex and possibly in the EsCas13d ternary complex, whereas a positively charged residue plays a similar role in stabilizing this region in the Cas13a ternary complex (Fig. 7i, j). Unlike the 5′-end repeat region of crRNA in the Cas13a and Cas13d complexes, the repeat region is located at the 3′-end of crRNA in the Cas13b binary complex and shows a more complicated architecture containing four sub-regions, which is recognized by Helical-2, two RRI domains and the linker region in Cas13b (Fig. 7c, h)38. Due to the fact that most of interactions between the crRNA repeat region and Cas13b exist in the recognition of the crRNA sugar-phosphate backbone, Cas13b is likely to recognize the crRNA repeat region in a structure-specific rather than sequence-specific manner38. By contrast, in Cas13a and Cas13d, both structure and sequence of the crRNA repeat region are specifically recognized and critical for target RNA cleavage.
The pre-crRNA processing by Cas13a proceeds in a divalent metal-ion-independent manner, whereas the divalent metal ion is indispensable for target RNA cleavage17,18,39. However, two different views on the essentiality of Mg2+ for the pre-crRNA processing have been reported21,31. Here, we find that, similar to Cas13a, the divalent metal ion is not essential for pre-crRNA cleavage, but is critical for target cleavage by Cas13d. In the present study, two hydrated Mg2+ ions were found to be important for stabilizing the conformation of the crRNA repeat region, out of which the (Mg(H2O)5)2+ ion makes more important contributions. This suggests that the divalent metal ion may not be directly involved in catalyzing Cas13d-guided target RNA cleavage, but they could be important for maintaining the Cas13d-crRNA-target RNA complex in the conformation suitable for target cleavage.
In this study, we found that correct base pairings within the internal and the 3′-end spacer regions are essential for target recognition in Cas13d. The seed region may exist in either or both of these separate spacer regions in UrCas13d. In Cas13a, a central seed region has been determined17,26,37. Interestingly, a HEPN-nuclease switch region within the spacer has been identified adjacent to the seed region in Cas13a, and mismatches within this switch region may impede the conformational activation of Cas13a37. Whether the HEPN-nuclease switch region exists in Cas13d needs further investigation. In the Cas13b binary complex, given that the central region of the spacer is disordered and exposed to the surrounding solvent38, it is possible that Cas13b uses this region of the spacer for target interrogation. Further studies are required to determine the exact seed region of Cas13b.
In summary, the present study determines the pre-crRNA processing site, explains the importance of two hydrated Mg2+ ions for stabilizing the conformation of the crRNA repeat region, reveals the dependence of target RNA cleavage on structure and sequence of the crRNA repeat region, and identifies the mismatch tolerance between the crRNA spacer region and target RNA. Our findings provide valuable information for rational engineering that would aid in developing the CRISPR-Cas13d system into an efficient tool for research, therapeutics and biotechnology.
Purification of the UrCas13d binary complex assembled in vivo
The full-length gene encoding UrCas13d was synthesized (Supplementary Table 2) and inserted between NdeI and XhoI sites of the pET-30b vector. CRISPR template was synthesized (Supplementary Table 3) and cloned between NdeI and XhoI sites of the pCDFDuet-1 vector.
To acquire the binary complex assembled in vivo, vectors encoding the UrCas13d R288A/R823A mutant and CRISPR template were co-transformed into E.coli Rosetta (DE3) cells (Novagen). The cells were then grown at 37 °C for 3 h until the cell concentration reached OD600 of approximately 0.6, which was followed by induction with 0.2 mM IPTG at 18 °C for 12 h. The cells were harvested and lysed in buffer A (25 mM Tris-HCl (pH 7.5), 500 mM NaCl, 5 mM β-mercaptoethanol, 1 mM phenylmethanesulfonyl fluoride (PMSF)). The lysate was centrifuged and the supernatant was loaded onto a Ni-NTA Superflow (QIAGEN) column. The nickel column was washed with buffer A containing 10 mM imidazole, after which the binary complex was eluted with buffer A containing 50 mM imidazole. Afterwards, the eluted target complex was successively purified with a Q FF column and a Heparin HP column (GE Healthcare) using a salt gradient between buffer B1 (25 mM Tris-HCl (pH 7.5), 150 mM NaCl, 2 mM MgCl2, 2 mM dithiothreitol (DTT)) and buffer B2 (25 mM Tris-HCl (pH 7.5), 1 M NaCl, 2 mM MgCl2, 2 mM DTT) in both steps. The target complex was then dialyzed against buffer C1 (25 mM Tris-HCl (pH 7.5), 200 mM NaCl, 2 mM MgCl2, 2 mM DTT) and further purified using a Superdex 200 Increase (10/300 GL) column (GE Healthcare). Fractions containing the pure binary complex were collected, concentrated and aliquoted before storing at −80 °C for later crystallization uses. The SeMet-labeled binary complex was expressed in E. coli Rosetta (DE3) cells grown in M9 minimal medium supplemented with SeMet, Lys, Phe, Thr, Val, Leu, and Ile. The SeMet UrCas13d binary complex was purified following the procedure described above.
Purification of the uncomplexed UrCas13d
Full-length UrCas13d gene was expressed in E. coli Rosetta (DE3) cells (Novagen). After the cells were harvested and lysed in buffer A, the target protein was purified with a Ni-NTA Superflow (QIAGEN) column using buffer A and then with the SP FF column (GE Healthcare) using a salt gradient between buffer B3 (25 mM Tris-HCl (pH 7.5), 250 mM NaCl, 2 mM DTT) and buffer B4 (25 mM Tris-HCl (pH 7.5), 1 M NaCl, 2 mM DTT). Fractions of UrCas13d were dialyzed against buffer B3, concentrated, aliquoted and stored for later pre-crRNA or target RNA cleavage assays.
In vitro synthesis and purification of RNA fragments
T7 RNA polymerase was used in this study for in vitro synthesis of RNA fragments. Briefly, the synthetic DNA fragments encoding the sequences of pre-crRNA, crRNA, and target RNA were individually cloned into a modified pUC-119 vector between StuI and HindIII sites40. After the recombinant vector was amplified in E.coli DH5α cells (Novagen), it was linearized by HindIII, and then purified using phenol-chloroform extraction and ethanol precipitation. Transcription reactions were performed at 37 °C for 3 h, in a buffer composed of 100 mM HEPES-K (pH 7.9), 20 mM MgCl2, 30 mM DTT, 3 mM NTPs, 2 mM spermidine, 30 ng/μl linearized template, and 100 μg/ml home-made T7 RNA polymerase. The reaction sample was resolved in denaturing polyacrylamide gel containing 8 M urea. Target RNA bands were excised from the gel and eluted using the Elutrap system (GE Healthcare). The eluted RNA was collected by ethanol precipitation, dissolved in DEPC-treated H2O and preserved at −80 °C.
In order to generate desired mutants (Supplementary Tables 3, 4, and 5), PCR reactions using mutation-specific oligonucleotides and vectors encoding the wild-type UrCas13d, pre-crRNA, crRNA, or target RNA as templates were carried out. The results of mutagenesis were verified by sequencing.
The SeMet-labeled UrCas13d R288A/R823A binary complex assembled in vivo was crystallized by the hanging-drop vapor diffusion method at 16 °C. Crystals were obtained by mixing 1 μl of complex solution (A280 nm = 11.8) and 1 μl of reservoir solution (0.5% (w/v) Tryptone, 0.1 M HEPES sodium (pH 7.0), 16% (w/v) Polyethylene glycol 3350). Crystals grew to their full size within 6 days, after which they were harvested, cryoprotected in reservoir solution supplemented with 20% (v/v) glycerol and flash-frozen.
In addition, the unlabeled UrCas13d R288A/R823A binary complex assembled in vivo was crystallized by the hanging-drop vapor diffusion method at 16 °C. Crystals were obtained by mixing 1 μl of complex solution (A280 nm = 11.7) and 1 μl of reservoir solution (8% (v/v) Tacsimate (pH 7.0), 17% (w/v) Polyethylene glycol 3350). Crystals grew to their full size within 7 days, after which they were harvested, cryoprotected in reservoir solution supplemented with 20% (v/v) glycerol and flash-frozen.
Data collection and structure determination
X-ray diffraction data was collected at Shanghai Synchrotron Radiation Facility (SSRF) on beamline BL-17U1. Datasets were processed automatically by the program XIA2 integrated into data collection platform of the beamline41, and scaled by the program AIMLESS in CCP442,43. The crystal structure of the SeMet-labeled UrCas13d R288A/R823A binary complex was solved by single-wavelength anomalous dispersion method with the program AutoSol in PHENIX44. An asymmetric unit of this SeMet-labeled crystal structure contains two binary complexes. Next, using the SeMet-labeled structure as a model, the crystal structure of the unlabeled UrCas13d R288A/R823A binary complex was solved by the molecular replacement method with the program Phaser-MR in PHENIX44. An asymmetric unit of this crystal structure contains one binary complex. Model building was carried out in COOT45. Multiple rounds of refinement were performed with REFMAC in CCP443,46. All figures of the structures were prepared using PyMOL (http://pymol.org). Data collection and refinement statistics are listed (Supplementary Table 1).
Pre-crRNA cleavage assay
Pre-crRNA and the mutants were designed to contain two repeat regions separated by a spacer region (Supplementary Table 3). Reaction mixtures were prepared by incubating 9.6 μM purified UrCas13d with 4.8 μM pre-crRNA or the pre-crRNA mutants in the assay buffer (25 mM Tris-HCl (pH 7.5), 250 mM NaCl, 2 mM MgCl2, 2 mM DTT). Total volume of the reaction mixture was adjusted to 20 μl. The cleavage was allowed to proceed at 37 °C for 30 min, after which the reactions were terminated in liquid nitrogen and subsequently quenched at 75 °C for 5 min by adding 2× loading buffer (2× TBE buffer, 12 M urea). Samples were then run on a 20% PAGE TBE-urea denaturing gel and the cleavage products were visualized with the toluidine blue staining. For pre-crRNA cleavage in the presence of different metal ions or ethylenediaminetetraacetic acid (EDTA), the assay buffer contained 2 mM divalent metal ions or 10 mM EDTA without 2 mM DTT. Each cleavage assay was independently carried out three times to verify the repeatability of the results.
Target RNA cleavage assay
Reaction mixtures were prepared by incubating 1.6 μM UrCas13d with 0.8 μM crRNA or the crRNA mutants in the assay buffer (25 mM Tris-HCl (pH 7.5), 250 mM NaCl, 2 mM MgCl2, 2 mM DTT) at 37 °C for 15 min, and then a final concentration of 16 μM target RNA or the mismatched target RNAs was mixed with the prepared UrCas13d and crRNA sample at 20:2:1 molar ratio. Total volume of the reaction mixture was adjusted to 20 μl. The cleavage assays were then carried out at 37 °C for 45 min. For target RNA cleavage in the presence of different metal ions or EDTA, the assay buffer contained 2 mM divalent metal ions or 10 mM EDTA without 2 mM DTT. Each cleavage assay was independently carried out three times to verify the repeatability of the results. Reactions were terminated and samples were analyzed as described for the pre-crRNA cleavage assay.
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
The atomic coordinates and structure factors of the SeMet-labeled UrCas13d binary complex and the unlabeled UrCas13d binary complex have been deposited in the Protein Data Bank under the accession codes 6IV8 and 6IV9. The source data underlying Figs. 2c, f, g, 4c, e–h, 5e, f, 6b, g and Supplementary Figs. 6, 7a, b, e, f, 8 are provided as a Source Data file. Other data are available from the corresponding author upon reasonable request.
Van der Oost, J., Westra, E. R., Jackson, R. N. & Wiedenheft, B. Unravelling the structural and mechanistic basis of CRISPR-Cas systems. Nat. Rev. Microbiol. 12, 479–492 (2014).
Makarova, K. S. et al. An updated evolutionary classification of CRISPR-Cas systems. Nat. Rev. Microbiol. 13, 722–736 (2015).
Koonin, E. V., Makarova, K. S. & Wolf, Y. I. Evolutionary genomics of defense systems in Archaea and Bacteria. Annu Rev. Microbiol 71, 233–261 (2017).
Wright, A. V., Nunez, J. K. & Doudna, J. A. Biology and Applications of CRISPR Systems: harnessing Nature’s Toolbox for Genome Engineering. Cell 164, 29–44 (2016).
Amitai, G. & Sorek, R. CRISPR-Cas adaptation: insights into the mechanism of action. Nat. Rev. Microbiol. 14, 67–76 (2016).
Koonin, E. V., Makarova, K. S. & Zhang, F. Diversity, classification and evolution of CRISPR-Cas systems. Curr. Opin. Microbiol. 37, 67–78 (2017).
Hille, F. et al. The Biology of CRISPR-Cas: backward and forward. Cell 172, 1239–1259 (2018).
Shmakov, S. et al. Diversity and evolution of class 2 CRISPR-Cas systems. Nat. Rev. Microbiol. 15, 169–182 (2017).
Wang, H., La Russa, M. & Qi, L. S. CRISPR/Cas9 in genome editing and beyond. Annu Rev. Biochem. 85, 227–264 (2016).
Jiang, F. & Doudna, J. A. CRISPR-Cas9 structures and mechanisms. Annu Rev. Biophys. 46, 505–529 (2017).
Hsu, P. D., Lander, E. S. & Zhang, F. Development and applications of CRISPR-Cas9 for genome engineering. Cell 157, 1262–1278 (2014).
Knott, G. J. & Doudna, J. A. CRISPR-Cas guides the future of genetic engineering. Science 361, 866–869 (2018).
Zetsche, B. et al. Cpf1 is a single RNA-guided endonuclease of a class 2 CRISPR-Cas system. Cell 163, 759–771 (2015).
Shmakov, S. et al. Discovery and Functional Characterization of Diverse Class 2 CRISPR-Cas Systems. Mol. Cell 60, 385–397 (2015).
Kim, D. et al. Genome-wide analysis reveals specificities of Cpf1 endonucleases in human cells. Nat. Biotechnol. 34, 863–868 (2016).
Kleinstiver, B. P. et al. Genome-wide specificities of CRISPR-Cas Cpf1 nucleases in human cells. Nat. Biotechnol. 34, 869–874 (2016).
Abudayyeh, O. O. et al. C2c2 is a single-component programmable RNA-guided RNA-targeting CRISPR effector. Science 353, aaf5573 (2016).
East-Seletsky, A. et al. Two distinct RNase activities of CRISPR-C2c2 enable guide-RNA processing and RNA detection. Nature 538, 270–273 (2016).
Smargon, A. A. et al. Cas13b Is a Type VI-B CRISPR-Associated RNA-Guided RNase Differentially Regulated by Accessory Proteins Csx27 and Csx28. Mol. Cell 65, 618–630 e7 (2017).
Konermann, S. et al. Transcriptome Engineering with RNA-Targeting Type VI-D CRISPR Effectors. Cell 173, 665–676 e14 (2018).
Yan, W. X. et al. Cas13d Is a Compact RNA-Targeting Type VI CRISPR Effector Positively Modulated by a WYL-Domain-Containing Accessory Protein. Mol. Cell 70, 327–339 e5 (2018).
Jiang, W., Samai, P. & Marraffini, L. A. Degradation of Phage Transcripts by CRISPR-Associated RNases Enables Type III CRISPR-Cas Immunity. Cell 164, 710–721 (2016).
Niewoehner, O. & Jinek, M. Structural basis for the endoribonuclease activity of the type III-A CRISPR-associated protein Csm6. RNA 22, 318–329 (2016).
Kazlauskiene, M., Kostiuk, G., Venclovas, C., Tamulaitis, G. & Siksnys, V. A cyclic oligonucleotide signaling pathway in type III CRISPR-Cas systems. Science 357, 605–609 (2017).
Liu, L. et al. Two Distant Catalytic Sites Are Responsible for C2c2 RNase Activities. Cell 168, 121–134 e12 (2017).
Liu, L. et al. The Molecular Architecture for RNA-Guided RNA Cleavage by Cas13a. Cell 170, 714–726 e10 (2017).
Abudayyeh, O. O. et al. RNA targeting with CRISPR-Cas13. Nature 550, 280–284 (2017).
Cox, D. B. T. et al. RNA editing with CRISPR-Cas13. Science 358, 1019–1027 (2017).
Gootenberg, J. S. et al. Nucleic acid detection with CRISPR-Cas13a/C2c2. Science 356, 438–442 (2017).
Gootenberg, J. S. et al. Multiplexed and portable nucleic acid detection platform with Cas13, Cas12a, and Csm6. Science 360, 439–444 (2018).
Zhang, C. et al. Structural Basis for the RNA-Guided Ribonuclease Activity of CRISPR-Cas13d. Cell 175, 212–223 e17 (2018).
Jiang, F., Zhou, K., Ma, L., Gressel, S. & Doudna, J. A. A Cas9-guide RNA complex preorganized for target DNA recognition. Science 348, 1477–1481 (2015).
Swarts, D. C., van der Oost, J. & Jinek, M. Structural Basis for Guide RNA Processing and Seed-Dependent DNA Targeting by CRISPR-Cas12a. Mol. Cell 66, 221–233 e4 (2017).
Yang, H., Gao, P., Rajashankar, K. R. & Patel, D. J. PAM-Dependent Target DNA Recognition and Cleavage by C2c1 CRISPR-Cas Endonuclease. Cell 167, 1814–1828 e12 (2016).
Liu, L. et al. C2c1-sgRNA Complex Structure Reveals RNA-Guided DNA Cleavage Mechanism. Mol. Cell 65, 310–322 (2017).
Yamano, T. et al. Crystal Structure of Cpf1 in Complex with Guide RNA and Target DNA. Cell 165, 949–962 (2016).
Tambe, A., East-Seletsky, A., Knott, G. J., Doudna, J. A. & O’Connell, M. R. RNA Binding and HEPN-Nuclease Activation Are Decoupled in CRISPR-Cas13a. Cell Rep. 24, 1025–1036 (2018).
Zhang, B. et al. Structural insights into Cas13b-guided CRISPR RNA maturation and recognition. Cell Res 28, 1198–1201 (2018).
Knott, G. J. et al. Guide-bound structures of an RNA-targeting A-cleaving CRISPR-Cas13a enzyme. Nat. Struct. Mol. Biol. 24, 825–833 (2017).
Serganov, A. et al. Ribosomal protein S15 from Thermus thermophilus–cloning, sequencing, overexpression of the gene and RNA-binding properties of the protein. Eur. J. Biochem 246, 291–300 (1997).
Winter, G., Lobley, C. M. & Prince, S. M. Decision making in xia2. Acta Crystallogr D. Biol. Crystallogr 69, 1260–1273 (2013).
Evans, P. R. & Murshudov, G. N. How good are my data and what is the resolution? Acta Crystallogr D. Biol. Crystallogr 69, 1204–1214 (2013).
Winn, M. D. et al. Overview of the CCP4 suite and current developments. Acta Crystallogr D. Biol. Crystallogr 67, 235–242 (2011).
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr D. Biol. Crystallogr 66, 213–221 (2010).
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr D. Biol. Crystallogr 60, 2126–2132 (2004).
Vagin, A. A. et al. REFMAC5 dictionary: organization of prior chemical knowledge and guidelines for its use. Acta Crystallogr D. Biol. Crystallogr 60, 2184–2195 (2004).
This work was supported by the National Natural Science Foundation of China Grants (31770948, 31570875, and 31200559) and the Special Funds of the Central Government Guiding Local Science and Technology Development (2017L3009). We thank Fujian Normal University for financial support (Z0210509). The diffraction data were collected at the beamline BL-17U1 of Shanghai Synchrotron Radiation Facility (SSRF). We thank Xiao-Jian Hu and Huan Zhou for helpful discussions. We thank Zhijian J. Chen, Zhao-Qing Lou and Ning Gao for critical review of the manuscript.
The authors declare no competing interests.
Journal peer review information: Nature Communications thanks Hong Li and other anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.