Structural insights into DNA cleavage activation of CRISPR-Cas9 system

Huai, Cong; Li, Gan; Yao, Ruijie; Zhang, Yingyi; Cao, Mi; Kong, Liangliang; Jia, Chenqiang; Yuan, Hui; Chen, Hongyan; Lu, Daru; Huang, Qiang

doi:10.1038/s41467-017-01496-2

Download PDF

Article
Open access
Published: 09 November 2017

Structural insights into DNA cleavage activation of CRISPR-Cas9 system

Cong Huai¹^na1,
Gan Li¹^na1,
Ruijie Yao¹,
Yingyi Zhang^2,3,
Mi Cao^2,3,
Liangliang Kong^2,3,
Chenqiang Jia¹,
Hui Yuan¹,
Hongyan Chen¹,
Daru Lu¹ &
…
Qiang Huang¹

Nature Communications volume 8, Article number: 1375 (2017) Cite this article

13k Accesses
94 Citations
14 Altmetric
Metrics details

Subjects

Abstract

CRISPR-Cas9 technology has been widely used for genome engineering. Its RNA-guided endonuclease Cas9 binds specifically to target DNA and then cleaves the two DNA strands with HNH and RuvC nuclease domains. However, structural information regarding the DNA cleavage-activating state of two nuclease domains remains sparse. Here, we report a 5.2 Å cryo-EM structure of Cas9 in complex with sgRNA and target DNA. This structure reveals a conformational state of Cas9 in which the HNH domain is closest to the DNA cleavage site. Compared with two known HNH states, our structure shows that the HNH active site moves toward the cleavage site by about 25 and 13 Å, respectively. In combination with EM-based molecular dynamics simulations, we show that residues of the nuclease domains in our structure could form cleavage-compatible conformations with the target DNA. Together, these results strongly suggest that our cryo-EM structure resembles a DNA cleavage-activating architecture of Cas9.

Improving prime editing with an endogenous small RNA-binding protein

Article Open access 03 April 2024

Jun Yan, Paul Oyler-Castrillo, … Britt Adamson

Genome engineering with Cas9 and AAV repair templates generates frequent concatemeric insertions of viral vectors

Article 08 April 2024

Fabian P. Suchy, Daiki Karigane, … Hiromitsu Nakauchi

Structural basis of Integrator-dependent RNA polymerase II termination

Article Open access 03 April 2024

Isaac Fianu, Moritz Ochmann, … Patrick Cramer

Introduction

The genome editing process by the CRISPR-Cas9 system^{1, 2} is mediated by a single-guide RNA (sgRNA) and consists of two steps: DNA binding and DNA cleavage. In brief, the sgRNA-guided Cas9 first binds to the target DNA by interacting with its protospacer adjacent motif (PAM); then, the two nuclease domains (HNH and RuvC) catalyze the splitting of the scissile bonds in two DNA strands, respectively^{3, 4}. Several studies^5,6,7 reported the ternary complex structures of Cas9, sgRNA, and target DNA, and elucidated how Cas9 binds to its target DNA. Also, it is known that the HNH domain cuts the 20-mer target sequence complementary to sgRNA, and the RuvC domain cleaves the non-target sequence^{8, 9}.

Structural studies were essential for understanding the molecular mechanism of DNA binding to the bilobe-like Cas9–sgRNA binary complex^{5,6,7, 10}. However, structural information about the DNA cleavage step are incomplete. For example, the HNH active site in the earlier crystal structures is at a position far from the cleavage site of the target strand (>32 Å; HNH-state 1 in Supplementary Fig. 1a, b)^{6, 7}; so the complex is not in a DNA cleavage-activating state conformation. Although a recent study revealed a second conformational sate, in which the HNH domain is closer to the cleavage site¹¹, the distance from the C_α atom of catalytic residue 840 to the attacked phosphorus is still more than 19 Å (HNH-state 2 in Supplementary Fig. 1c). Also, a certain number of nucleotides from the 20-mer non-target sequence are missing in the available crystal structures^{7, 11} (Supplementary Fig. 1). So far no atomic model has been built for the ternary complex with a full-length DNA target in the DNA cleavage-activating state. Therefore, it is worthwhile exploring new complex structures, in particular active structures in the native cleavage environment, in order to provide new insights into the molecular mechanisms of DNA cleavage by CRISPR-Cas9.

Here we use cryo-electron microscopy (cryo-EM) to determine the structure of Streptococcus pyogenes Cas9 (SpCas9) in the DNA cleavage-activating state. As a result, a 5.2 Å cryo-EM structure of SpCas9 in complex with sgRNA and target DNA is reported. Compared with two known HNH states of SpCas9 (Supplementary Fig. 1), this structure captures a distinct conformational state in which the HNH domain is nearest to the DNA cleavage site. EM-based molecular dynamics (MD) simulations show that residues of the nuclease domains in this structure could form cleavage-compatible conformations with the target DNA. Therefore, this study provides mechanistic insights into the DNA cleavage by Cas9, and will facilitate engineering efforts to improve the specificity and efficiency of the CRISPR-Cas9 system.

Results

Cryo-EM structure of the Cas9–sgRNA–DNA ternary complex

To determine the cryo-EM structure of the Cas9–sgRNA–DNA ternary complex in Fig. 1a, SpCas9 with two nuclease activity-dead mutations (D10A, H840A) was incubated with a 55-bp target DNA from the fumarylacetoacetate hydrolase (Fah) gene¹² and the corresponding 98-nucleotide (nt) sgRNA to form the ternary complex (Supplementary Fig. 2). The complexes were then rapidly frozen to acquire cryo-EM images for single-particle reconstruction^{13, 14} (Supplementary Fig. 3a; Supplementary Table 1).

As illustrated by representative images in Fig. 1b, 2D class averaging shows the formation of the ternary complex. The 16-bp PAM proximal end of the DNA chain was clearly identified (arrow 1 in Fig. 1b) and weaker density likely corresponding to the connecting region of the PAM distal end to SpCas9 is also visible (arrow 2). After 3D classification and refinement (Supplementary Fig. 3b), we obtained an EM density map at an overall resolution of 5.2 Å with the Fourier shell correlation (FSC) at 0.143 (Supplementary Fig. 3c). Consistent with the 2D averaging images, the density map suggests that the PAM proximal end is in a stable, base-paired form. In contrast, no significant density was observed for the 16-bp PAM distal end, but only two small, separate density projections on the surface. Their positions are similar to that of the 10-bp PAM distal end in the recent cryo-EM structure with a 40-bp target DNA (6.0 Å resolution)¹¹ (Supplementary Fig. 4), and may be the connecting regions of two unwound DNA strands to SpCas9. These observations suggest that most of the ternary complexes in the cryo-EM experiments possess a PAM distal end in a loosely flexible form, which might be caused in the process of the RNA:DNA heteroduplex formation.

Our ternary complex has a well-defined bilobed topology, which is fully consistent with the overall domain architecture observed in the previous crystal structures (PDB IDs 4OO8 and 4UN3)^{6, 7}. Simple rigid-body fitting showed that the complex domains in the crystal structures fitted well into the EM density map. As shown in Fig. 1c, d, the density map at a higher contour level accurately resolves most of the secondary structural elements in these structures, including helices of SpCas9, the RNA:DNA heteroduplex, PAM, and the base-paired PAM proximal end. For example, most of the helices in the highest resolution (2.5 Å) crystal structure (4OO8) are located in the EM density regions with higher contour levels, and were exactly fitted into those regions (Fig. 1c), except for those in HNH, L1, and L2 regions, and one in the RuvC domain close to HNH (Supplementary Fig. 5). Meanwhile, sgRNA, the RNA:DNA heteroduplex, PAM, and the PAM proximal end in 4UN3 are also clearly resolved (Fig. 1d). For example, the major and minor grooves in the base-paired PAM proximal end were accurately identified. The clear definition of the secondary structural elements further validates the reliability of our cryo-EM 3D construction, as illustrated in Supplementary Fig. 3b. When considering that the crystal structures 4OO8 and 4UN3 are ternary complexes without the full-length non-target DNA strand, our EM map implies that the resolved helices in Fig. 1c possess similar structures in the absence and presence of the full-length non-target strand. On the other hand, when bound to the full-length non-target strand, the HNH, L1, and L2 regions and part of the RuvC domain might have different conformations from those observed in the crystal structures (Supplementary Fig. 5).

Significantly, similar to the recent study¹¹, our EM map also reveals a large conformational change of the HNH domain (residues 775–906) with respect to that in 4OO8 or 4UN3 (Supplementary Fig. 1). However, the HNH domain in our map is closer to the cleavage site by contacting the PI and REC1 domains (Fig. 2a). In contrast, no comparable density is present in the cryo-EM structure at 6.0 Å resolution¹¹ (Fig. 2b). We conclude that our cryo-EM experiment visualized a distinct conformational state of SpCas9 in a nearly native environment primed for the DNA cleavage (hereafter designated as HNH-state 3). So this structure provides further experimental evidence for the activation mechanism of the DNA cleavage: the ternary complex formation induces an HHN conformational change that moves its active site toward the cleavage site, and meanwhile mediates the binding of the non-target DNA strand to the RuvC active site^{11, 15}. The HNH mobility appears to be an intrinsic feature of the CRISPR-Cas9 system that is required for its cleavage function.

EM-based atomic model

As shown in Fig. 1c, d, the unambiguous placement of the non-HNH elements of the crystal structures into the density map allowed us to build atomic models for the ternary complex in the HNH-state 3 by combining homology modeling¹⁶, molecular dynamics flexible fitting (MDFF)¹⁷, and structural averaging refinement¹⁸. Since we did not observe significant density for the PAM distal end, we determine an atomic model for the SpCas9–sgRNA–DNA ternary complex, which includes only two base pairs of the PAM distal end linking to target/non-target sequence. Thus, the EM density-based atomic model consists of the experimental SpCas9 mutant (D10A, H840A), sgRNA, and 41 base pairs of the target DNA (Supplementary Fig. 6).

As shown in the atomic model in Fig. 3a, the HNH domain fits well into the density regions corresponding to the HNH-state 3, and contacts the REC1 and PI domains mainly by the segments of residues 861–864, 872–876, and 903–906 (Fig. 3b). Compared to the HNH-state 1, the HNH domain as a whole rotates about 170° around an axis, roughly, which is located vertically at the middle of a β-sheet segment (β6, residues 757–764) and an α-helix (α45, residues 925–940) (Fig. 3c). To facilitate the movement, the linker L2 (residues 906–923) undergoes a helix-to-loop conformational change, similar to that of the Staphylococcus aureus Cas9 (SaCas9)¹⁰. It seems that the HNH-state 2 is an intermediate state between the HNH states 1 and 3 (Fig. 3c). Through the rotation, with respect to the HNH states 1 and 2, the HNH active site is shifted toward the cleavage site by about 25 and 13 Å, respectively (Fig. 3d), so that the distance between the C_α atom of residue 840 and the scissile P reduces from ~32 (state 1) and ~19 Å (state 2) to ~10 Å (state 3), respectively. These distance reductions provide evidence that our cryo-EM study did indeed capture a distinct SpCas9 state in which the HNH active site is the closest to the scissile bond of the target DNA.

Our model also reveals the overall topology of the non-target strand bound to SpCas9. As shown in Figs. 1 and 3, the clear assignment of the EM density to SpCas9, sgRNA, the RNA:DNA heteroduplex, PAM, and the PAM proximal end allowed us to assign the remaining density to the first 41 bases of the 55-mer non-target strand in the model (Fig. 4a). As seen, the remaining density extends from the based-paired, PAM proximal end to the connecting region of the PAM distal end to SpCas9 in Supplementary Fig. 4. This connecting region is likely located at base −22 of the non-target strand. Because of the close contact of the HNH domain with the PI domain, about 7 bases of the 20-mer non-target sequence are enclosed in the channel formed by the HNH, RuvC and PI domains (from bases −7 to −1), including the scissile bond of the non-target strand (Fig. 4b). Therefore, about 13 bases of the non-target sequence are bound to the surface of Spcas9.

In addition, our cryo-EM structure reveals an interaction site of the 16-bp PAM proximal end with SpCas9 (Fig. 4c). Compared with our structure, the PAM proximal ends in the previously determined structures^{7, 11} are relatively short and this SpCas9–DNA interaction was not observed in those structures. By rigid-body fitting of the crystal structure (4UN3), we found that the SpCas9 protein binds to the PAM proximal end by its segment 1151–1156 in the PI domain (Fig. 4c). Since four residues of this segment are positively charged lysines, they might bind to the negatively charged, DNA backbone by electrostatic forces. The EM-based atomic model indicates that this segment interacts with bases 14–15 of the non-target strand, about 11 bases downstream PAM (Fig. 4d). Therefore, we speculate that this SpCas9–DNA interaction may play a role in the process of PAM recognition.

MD simulations

Since the cryo-EM density map shows a new domain architecture and backbone conformations of the complex, we wanted to know whether the wild-type residues of the HNH and RuvC domains in this architecture might be able to form the cleavage-compatible conformations with the DNA strands and the Mg²⁺ ions. Because it is difficult to use the catalytic active wild-type SpCas9 protein to form a stable complex with the target DNA, structural data are not yet available for directly building atomic models of the wild-type active sites of SpCas9. Therefore, we used our EM-based atomic model as an template to build the wild-type active site models. We changed its two activity-dead mutations into the wild-type residues (Asp10, His840) and then refined the residue positions by large-scale MD simulations¹⁹, in order to predict the most probable conformations of the wild-type HNH and RuvC residues with the target DNA and Mg²⁺ ions (Supplementary Fig. 6).

Interestingly, the MD simulations showed that two wild-type nuclease domains in the experimental cryo-EM architecture may form cleavage-compatible conformations with the target DNA for using the conserved HNH and RuvC catalytic mechanisms. Previous biochemical studies have suggested that the HNH domain uses a well-established, one-metal-ion hydrolysis mechanism to split the phosphodiester bond (P-O) of the target strand between bases 3 and 4 upstream of PAM^{4, 20, 21}. Consistent with this mechanism, the MD model shows that the phosphate group between bases 3 and 4 binds to the HNH catalytic cleft enclosed by Asp837, Asp839, and His840 (Fig. 5a). A Mg²⁺ ion binds to the carboxyl groups of Asp837 and Asp839 in this model and coordinates with the oxygen atoms of the phosphate group; besides, the N_δ atom of the general base His840 points toward the phosphate group, which could facilitate a nucleophilic attack with a water molecule to target the attacked phosphorus²². The distance from the N_δ atom to the attacked phosphorus is ~8 Å, which is long enough to could accommodate 1–2 water molecule(s) for hydrolysis. Further MD simulation confirmed that water molecules could stay stable at the HNH active site and on average, about 19 water molecules were present around the bound Mg²⁺ within 7 Å in the simulations (Supplementary Fig. 7a, b).

Meanwhile, consistent with the two-metal-ion hydrolysis mechanism^{6, 23, 24}, the MD model shows that two Mg²⁺ ions are bound to the carboxyl groups of Asp10, Glu762, Asp986. (Fig. 5b). The Mg²⁺-bound phosphate group between bases −4 and −3 of the non-target strand is the nearest one to the possible general base His983 or His982 for the hydrolysis. This suggests that the cleavage may take place at the P-O3′ bond of bases −4 and −3 by the two-metal-ion mechanism^{4, 20, 21}. Similar to what was observed for the HNH domain, the MD simulations showed that about 14 water molecules could stay in the RuvC active site around the center of two bound Mg²⁺ ions (within 10 Å) (Supplementary Fig. 7a, c).

Structure-based analysis of SpCas9 mutants

To further verify the above structural model by MD, we also used this model and Rosetta program²⁵ to predict RuvC residues that are important for the DNA binding. The structure-based prediction suggested that residues of the RuvC domain might make significant contributions to the binding (Supplementary Fig. 8a). Accordingly, we expressed seven SpCas9 mutants (S15A, Q920A, S964A, R967A, K974A, R976A, and N1317A) and examined their DNA cleavage activities.

We found that three RuvC mutants (S15A, K974A, R976A) were affected in their cleavage activities: S15A reduces the DNA cleavage rate, and K974A and R976A disrupt or reduce the DNA cleavage ratio (Fig. 5c), which is in agreement with a previous study²⁶. Capillary electrophoresis analysis indicated that the mutations only affected the cleavage of the non-target strand, but not that of the target strand (Fig. 5d), which supports that the MD model represents the bound state of the non-target strand to the RuvC domain. Indeed, our model shows that all three residues are located in the vicinity of the RuvC active site and interact with the non-target strand by hydrogen bonding (Supplementary Fig. 8b), and thus provides a structural explanation for their crucial roles in the DNA cleavage²⁶. However, to fully understand the DNA cleavage mechanisms by SpCas9, further ternary complex structures at higher resolutions are needed in the future.

Discussion

Although the CRISPR-Cas9 technology has been widely used for precision genome engineering, structural data for its DNA cleavage step are still incomplete. In this study, we have determined a cryo-EM structure for the SpCas9–sgRNA–DNA ternary complex at 5.2 Å resolution, and visualized a conformational state of SpCas9, in which the HNH active site is the closest to the target DNA cleavage site. The atomic model and the EM-based MD simulations strongly support that this cryo-EM structure represents a DNA cleavage-activating architecture of SpCas9, and thereby marks an important step toward elucidating how the Cas9 nuclease domains form the catalytic conformations commitment for splitting the DNA strands. Thus, our study provides valuable insights into the DNA cleavage mechanism by the CRISPR-Cas9 system, and opens new opportunities for improvement of its specificity and efficiency by exploiting potential Cas9–DNA interactions.

Methods

SpCas9 protein expression and preparation of nucleic acids

The sequence encoding SpCas9 and activity-dead SpCas9 (dCas9) were cloned into a pET-28a expression vector between the NdeI and XhoI sites, followed a His₆-GST tag and a TEV protease cleavage site at N-terminal. The primer sequences for constructing the plasmids are listed in Supplementary Table 2. These two proteins were expressed and purified by Novoprotein Co. Ltd. In brief, the expression vectors were transformed into Escherichia coli strain BL21 (DE3) (Tiangen Biotech), cultured in LB medium with 0.1 mM IPTG at 16 °C for 20 h for protein expression. The cells were harvested and lysed two times using a French press (JN-3000 PLUS, JNBIO) at 10,000 psi in lysis buffer (20 mM HEPES, pH 7.5, 500 mM KCl) at 4 °C. Clarified lysate was bound in batch to Ni-NTA agarose (Qiagen), washed with washing buffer (lysis buffer with 30 mM imidazole, pH 7.5), and eluted with elution buffer (lysis buffer with 100 mM imidazole, pH 7.5). The proteins were then incubated overnight at 4 °C with TEV protease to remove the His₆-GST tag. After further purification by heparin affinity chromatography, Sepharose HiTrap (GE Healthcare) and HiLoad Superdex 200 16/60 columns (GE Healthcare), the purified SpCas9 and dCas9 proteins were concentrated to 10.8 mg mL⁻¹ and 15.4 mg mL⁻¹, respectively, in Cas9 storage buffer (20 mM HEPES, pH 7.5, 150 mM KCl and 1 mM TCEP)⁴, and stored in −80 °C.

All the other SpCas9 mutants were introduced by sited-directed PCR and were subcloned into a pET-21a vector with His₆-tag at N-terminal. The primer sequences for constructing the SpCas9 mutants are listed in Supplementary Table 2. These mutant proteins were expressed in E. coli strain Rosetta (Tiangen Biotech) with 0.1 mM IPTG incubation, and were purified by chromatography with Ni-NTA agarose (Qiagen) on BioLogic LP system (Bio-Rad), similar to the purification of the wild-type SpCas9. The eluent proteins were concentrated to 1–2 mg mL⁻¹ by 100,000 MWCO centrifugal filter (Merck Millipore) in the Cas9 storage buffer with 50% glycine, and stored in −20 °C.

The 98-nt sgRNA was in vitro transcribed using PCR-generated DNA templates by MEGAshortscript T7 Transcription Kit (Thermo Fisher), and was purified by MEGAclear Transcription Clean-Up Kit (Thermo Fisher). The 55-mer target/non-target DNA strands were commercially synthetized (Sangon Technology), pre-annealed to double-strand DNA substrate (dsDNA) by heating to 95 °C and then slowly cooling down to room temperature in NEB buffer 2.1. The primer sequences used for sgRNA transcription are listed in Supplementary Table 2.

Cryo-EM

Purified dCas9 protein was mixed with the 98-nt sgRNA and 55-bp target DNA at a molar ratio of 1:1.2:1.5 to form the Cas9–sgRNA–DNA complex. The ternary complex was prepared in three steps: first, dCas9 protein was incubated with sgRNA in the reaction buffer (20 mM Tris-Cl (pH 7.5), 100 mM KCl, 5 mM MgCl₂, 1 mM DTT)⁵ at 37 °C for 10 min. Second, dsDNA was added and incubated for another hour; third, complex solution was cultured at 18 °C overnight to disperse the proteins. The final concentrations of dCas9, sgRNA, and DNA were 2.0, 2.4, and 3.0 μM, respectively. The formation of the ternary complex was confirmed by agarose gel electrophoresis.

The cryo-EM frozen sample was prepared by a standard plunge freezing procedure¹⁴. In brief, 2.2 μL dispersed complex solution was loaded onto a glow-discharged holey carbon grid (Quantifoil, 1.2 μm hole size, 200 meshes) and plunge-frozen in liquid ethane using Vitrobot (FEI) system. The sample was blotted for 4 s in the environment of 16 °C and 100% humidity. Then, the grids with frozen sample were imaged on a Titan Krios (FEI) EM operated at 300 kV, and collected using a K2 Summit camera (Gatan). The micrographs were recorded at a nominal magnification of ×18,000 (calibrated pixel size of 1.3 Å on the specimen) using low-dose exposure (10 e/s pixel²), exposure time 7.6 s, leading to a total accumulated dose of 45 e Å⁻². Each of the images recorded by K2 was with a random defocus ranging from −1.3 to −3.5 μm and was fractionated into 38 subframes.

Single-particle 3D reconstruction

All recorded cryo-EM images were aligned with whole-image motion correction¹⁴ and the subframes were summed together to a single micrograph for subsequent calculation. The defocus value of each micrograph was estimated by CTFFIND3²⁷. As illustrated in Supplementary Fig. 3b, about 300,000 particles were picked from 592 micrographs and analyzed in the single-particle 3D reconstruction using RELION (version 1.4)²⁸. First, four rounds of 2D reference-free alignment and classification (40 iterations) were carried out. A subset of ~67,000 particles from good 2D classes were selected for the unsupervised 3D classification using five classes. The crystal structure (PDB ID 4UN3) in the HNH-state 1 and in complex with a 5-bp PAM proximal end was low-pass filtered to 60 Å and then used as the reference. Then, the largest class with ~86% particles was selected for the subsequent high-resolution 3D refinement. The 3D auto-refine procedure yielded a unmasked density map at an overall resolution of 7.28 Å according to the gold-standard FSC = 0.143 criterion. Finally, the RELION post-processing with auto-masking and a B-factor of −80 Å² was carried out to sharpen the map to the resolution of 5.20 Å.

Building and refinement of atomic models

The atomic model of the ternary complex was built based on the cryo-EM density map and the crystal structures of SpCas9 in complex with sgRNA and short target DNAs (PDB IDs 4OO8 and 4UN3), with a series of modeling methods described in Supplementary Fig. 6. The crystal structure in 4OO8 was used to build the initial model of the full-length SpCas9 by homology modeling with MODELLER¹⁶. The sgRNA and target DNA in 4UN3 were used to build the initial model of the 98-nt sgRNA and the 41-bp target DNA by base replacement and sequence extension. The non-target sequence in the RuvC active site was placed by structural alignment using the structure of a typical RuvC domain (PDB 4LD0)²⁹ as the template. The initial position of the HNH domain was placed according to a reference structure (PDB 2QNC)³⁰ and the cryo-EM density. The initial all-atom model of the ternary complex was constructed by assembling the full-length SpCas9 with the experimental sgRNA, target DNA according to their relative positions in 4OO8 and 4UN3. The Mg²⁺ ions in the HNH and RuvC active sites were placed using their corresponding positions in 2QNC and 4LD0 as the templates, respectively. This complex model was then docked into the cryo-EM density map using the COLORES program³¹, and subsequently flexibly fitted into the map using the typical MDFF protocols¹⁷ with the program NAMD (Version 2.10)³². To prevent overfitting, harmonic restraints were applied to relevant internal coordinates to preserve the secondary structures of SpCas9 and the sgRNA:DNA heteroduplex. To improve the final model, several rounds of structural refinements were carried out by Rosetta macromolecular modeling suite³³.

The MD model was built with the EM-based atomic model by changing its two activity-dead mutations (Ala10, Ala840) into the wild-type residues (Asp10, His840). Then, this complex model was refined by the MD-based structure averaging method¹⁹. To the end, MD simulations with explicit water model were carried out (see the following subsection for simulation details). In the simulations, the atomic coordinates of the ternary complex were saved at 20-ps intervals. The final atomic model of the wild-type ternary complex was obtained by averaging the MD snapshot structures from four independent simulations of ~80 ns. The structures of the complex and its domains were visualized via UCSF chimera³⁴ and PyMOL (http://pymol.org).

MD simulation with explicit water model

The MD simulations with explicit water model were conducted using NAMD³². The CHARMM36 force field³⁵ and the TIP3P water³⁶ model were employed to model the simulation system of the ternary complex with the water solvent. To build a simulation system, the all-atom structure of the ternary complex was solvated in the center of a cubic box of water with a minimum distance of 12 Å from the complex surface to the edge of the box. The Na⁺ and Cl⁻ ions were used to mimic an ionic concentration of 0.15 M in the system, including certain number of additional Na⁺ ions that neutralizes the net negative charge of the complex. Periodic boundary conditions were used in the simulations, i.e., the Van der Waals interactions were treated with a cut-off distance of 10 Å using a smooth switching function from 8 Å, the electrostatic interactions were calculated with particle mesh Ewald (PME) method using a local interaction distance of 10 Å, the SHAKE algorithm was employed to constrain bonds involving hydrogen atom, and thereby a time step of 2.0 fs was used. The simulations were performed in the isobaric-isothermal (NPT) ensemble, at a constant pressure of 1 bar and a constant temperature of 298 K controlled by Langevin dynamics.

Computational analysis of SpCas9–DNA interface

The RosettaDNA program in the Rosetta macromolecular modeling suite³³ was used to identify the key residues of the RuvC domain that interact with the 20-mer non-target sequence, based on the MD-refined model of the wild-type ternary complex. For each residue at the interface of the RuvC domain and the non-target sequence, the extent of optimal for affinity was calculated with RosettaDNA²². As illustrated in Supplementary Fig. 8a, those interface residues with higher extents were considered to be significant for the RuvC binding to the non-target sequence, and thereby important for the cleavage activity of the RuvC nuclease domain.

DNA cleavage assay

The substrate DNA for the DNA cleavage assay was amplified by PCR with a pair of fluorescent labeled primers, thus, the target DNA strand was labeled by Fam and the non-target strand was labeled by Rox. To test the cleavage activity of SpCas9 mutants, the substrate DNA fragments (200 ng or ~0.6 pmol) was incubated with equal molar SpCas9/SpCas9 mutant-sgRNA duplex for 5, 15, 30, or 60 min at 37 °C in SpCas9 cleavage buffer (20 mM Tris-Cl, pH 7.5, 100 mM KCl, 5 mM MgCl₂, 1 mM DTT, 5% glycerol)⁵. The reactions were terminated by heating the solution at 70 °C for 10 min. Half of the cleavage products were resolved by electrophoresis on 1% agarose gel stained with DNA_GREEN (TIANDZ) and 1/10 of the products were mixed with highly deionized-formamide (Hidi) and analyzed by capillary electrophoresis.

Data availability

The cryo-EM map of the SpCas9–sgRNA–DNA ternary complex is deposited in the Electron Microscopy Data Bank under accession code EMD-8236. The atomic coordinates of the complex are deposited in the Protein Data Bank under the accession code 5Y36. Other data are available from the corresponding authors upon reasonable request.

References

Wright, A. V., Nunez, J. K. & Doudna, J. A. Biology and applications of CRISPR systems: harnessing nature’s toolbox for genome engineering. Cell 164, 29–44 (2016).
Article CAS PubMed Google Scholar
Hsu, P. D., Lander, E. S. & Zhang, F. Development and applications of CRISPR-Cas9 for genome engineering. Cell 157, 1262–1278 (2014).
Article CAS PubMed PubMed Central Google Scholar
Gasiunas, G., Barrangou, R., Horvath, P. & Siksnys, V. Cas9-crRNA ribonucleoprotein complex mediates specific DNA cleavage for adaptive immunity in bacteria. Proc. Natl Acad. Sci. USA 109, E2579–E2586 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Jinek, M. et al. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science 337, 816–821 (2012).
Article ADS CAS PubMed Google Scholar
Jinek, M. et al. Structures of Cas9 endonucleases reveal RNA-mediated conformational activation. Science 343, 1247997 (2014).
Article PubMed PubMed Central CAS Google Scholar
Nishimasu, H. et al. Crystal structure of Cas9 in complex with guide RNA and target DNA. Cell 156, 935–949 (2014).
Article CAS PubMed PubMed Central Google Scholar
Anders, C., Niewoehner, O., Duerst, A. & Jinek, M. Structural basis of PAM-dependent target DNA recognition by the Cas9 endonuclease. Nature 513, 569–573 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Westra, E. R. et al. CRISPR immunity relies on the consecutive binding and degradation of negatively supercoiled invader DNA by Cascade and Cas3. Mol. Cell 46, 595–605 (2012).
Article CAS PubMed PubMed Central Google Scholar
Wilkinson, R. & Wiedenheft, B. A CRISPR method for genome engineering. F1000Prime Rep. 6, 3 (2014).
Article PubMed PubMed Central CAS Google Scholar
Nishimasu, H. et al. Crystal structure of Staphylococcus aureus Cas9. Cell 162, 1113–1126 (2015).
Article CAS PubMed PubMed Central Google Scholar
Jiang, F. et al. Structures of a CRISPR-Cas9 R-loop complex primed for DNA cleavage. Science 351, 867–871 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Ruppert, S. et al. Deficiency of an enzyme of tyrosine metabolism underlies altered gene expression in newborn liver of lethal albino mice. Gene Dev. 6, 1430–1443 (1992).
Article CAS PubMed Google Scholar
Scheres, S. H. Semi-automated selection of cryo-EM particles in RELION-1.3. J. Struct. Biol. 189, 114–122 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Li, X. et al. Electron counting and beam-induced motion correction enable near-atomic-resolution single-particle cryo-EM. Nat. Methods 10, 584–590 (2013).
Article CAS PubMed PubMed Central Google Scholar
Sternberg, S. H., LaFrance, B., Kaplan, M. & Doudna, J. A. Conformational control of DNA target cleavage by CRISPR-Cas9. Nature 527, 110–113 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Fiser, A. & Sali, A. Modeller: generation and refinement of homology-based protein structure models. Methods Enzymol. 374, 461–491 (2003).
Article CAS PubMed Google Scholar
Trabuco, L. G., Villa, E., Mitra, K., Frank, J. & Schulten, K. Flexible fitting of atomic structures into electron microscopy maps using molecular dynamics. Structure 16, 673–683 (2008).
Article CAS PubMed PubMed Central Google Scholar
Mirjalili, V. & Feig, M. Protein structure refinement through structure selection and averaging from molecular dynamics ensembles. J. Chem. Theory Comput. 9, 1294–1303 (2013).
Article CAS PubMed Google Scholar
Mirjalili, V., Noyes, K. & Feig, M. Physics-based protein structure refinement through multiple molecular dynamics trajectories and structure averaging. Proteins 82, 196–207 (2014).
Article CAS PubMed Google Scholar
Gasiunas, G., Barrangou, R., Horvath, P. & Siksnys, V. Cas9-crRNA ribonucleoprotein complex mediates specific DNA cleavage for adaptive immunity in bacteria. Proc. Natl Acad. Sci. USA 109, E2579–E2586 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Chen, H., Choi, J. & Bailey, S. Cut site selection by the two nuclease domains of the Cas9 RNA-guided endonuclease. J. Biol. Chem. 289, 13284–13294 (2014).
Article CAS PubMed PubMed Central Google Scholar
Broo, K. S., Brive, L., Ahlberg, P. & Baltzer, L. Catalysis of hydrolysis and transesterification reactions of p-nitrophenyl esters by a designed helix-loop-helix dimer. J. Am. Chem. Soc. 119, 11362–11372 (1997).
Article CAS Google Scholar
Yang, W., Lee, J. Y. & Nowotny, M. Making and breaking nucleic acids: two-Mg2+-ion catalysis and substrate specificity. Mol. Cell 22, 5–13 (2006).
Article CAS PubMed Google Scholar
Palermo, G. et al. Catalytic metal ions and enzymatic processing of DNA and RNA. Acc. Chem. Res. 48, 220–228 (2015).
Article CAS PubMed Google Scholar
Ashworth, J. & Baker, D. Assessment of the optimization of affinity and specificity at protein-DNA interfaces. Nucleic Acids Res. 37, e73 (2009).
Article PubMed PubMed Central CAS Google Scholar
Slaymaker, I. M. et al. Rationally engineered Cas9 nucleases with improved specificity. Science 351, 84–88 (2016).
Article ADS CAS PubMed Google Scholar
Mindell, J. A. & Grigorieff, N. Accurate determination of local defocus and specimen tilt in electron microscopy. J. Struct. Biol. 142, 334–347 (2003).
Article PubMed Google Scholar
Scheres, S. H. RELION: implementation of a Bayesian approach to cryo-EM structure determination. J. Struct. Biol. 180, 519–530 (2012).
Article CAS PubMed PubMed Central Google Scholar
Górecka, K. M., Komorowska, W. & Nowotny, M. Crystal structure of RuvC resolvase in complex with Holliday junction substrate. Nucleic Acids Res. 41, 9945–9955 (2013).
Article PubMed PubMed Central CAS Google Scholar
Biertumpfel, C., Yang, W. & Suck, D. Crystal structure of T4 endonuclease VII resolving a Holliday junction. Nature 449, 616–620 (2007).
Article ADS PubMed CAS Google Scholar
van der Oost, J., Westra, E. R., Jackson, R. N. & Wiedenheft, B. Unravelling the structural and mechanistic basis of CRISPR-Cas systems. Nat. Rev. Microbiol. 12, 479–492 (2014).
Article PubMed PubMed Central CAS Google Scholar
Phillips, J. C. et al. Scalable molecular dynamics with NAMD. J. Comput. Chem. 26, 1781–1802 (2005).
Article CAS PubMed PubMed Central Google Scholar
Leaver-Fay, A. et al. An object-oriented software suite for the simulation and design of macromolecules. Methods in Enzymology, 487, 545–574 (2011).
Article CAS PubMed PubMed Central Google Scholar
Pettersen, E. F. et al. UCSF chimera-A visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Article CAS PubMed Google Scholar
Hart, K. et al. Optimization of the CHARMM additive force field for DNA: improved treatment of the BI/BII conformational equilibrium. J. Chem. Theory Comput. 8, 348–362 (2012).
Article CAS PubMed PubMed Central Google Scholar
Jorgensen, W. L., Chandrasekhar, J., Madura, J. D., Impey, R. W. & Klein, M. L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 79, 926–935 (1983).
Article ADS CAS Google Scholar

Download references

Acknowledgements

We are grateful to Dr Bo Wan for his help in the initial phase of this project, Dr Huaxing Zhu, Yuanping Liao (Novoprotein Co., Ltd.) for their assistance in protein expression, Drs. Peng Nan, Jungang Zhou and Jianping Liu for their assistance in protein purification and sample preparation, Dr Junrui Li for their assistance in the single-particle reconstruction, and to Dandan Zhang (Shanghai Supercomputer Center) for her assistance in MD simulation. The cryo-EM data were collected at the National Center for Protein Science Shanghai (NCPSS), Shanghai Institutes for Biological Sciences/Shanghai Science Research Center, Chinese Academy of Sciences. This work was supported by the grants from the Natural Science Foundation of China (31671386 and 91430112 to Q.H., and 31571371 to D.L.), the Shanghai Natural Science Foundation (13ZR1402400 to Q.H.), and the Special Program for Applied Research on Super Computation of the NSFC-Guangdong Joint Fund (the second phase, to Q.H.).

Author information

Cong Huai and Gan Li contributed equally to this work.

Authors and Affiliations

State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, 200438, China
Cong Huai, Gan Li, Ruijie Yao, Chenqiang Jia, Hui Yuan, Hongyan Chen, Daru Lu & Qiang Huang
National Center for Protein Science Shanghai, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031, China
Yingyi Zhang, Mi Cao & Liangliang Kong
Shanghai Science Research Center, Chinese Academy of Sciences, Shanghai, 201204, China
Yingyi Zhang, Mi Cao & Liangliang Kong

Authors

Cong Huai
View author publications
You can also search for this author in PubMed Google Scholar
Gan Li
View author publications
You can also search for this author in PubMed Google Scholar
Ruijie Yao
View author publications
You can also search for this author in PubMed Google Scholar
Yingyi Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Mi Cao
View author publications
You can also search for this author in PubMed Google Scholar
Liangliang Kong
View author publications
You can also search for this author in PubMed Google Scholar
Chenqiang Jia
View author publications
You can also search for this author in PubMed Google Scholar
Hui Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Hongyan Chen
View author publications
You can also search for this author in PubMed Google Scholar
Daru Lu
View author publications
You can also search for this author in PubMed Google Scholar
Qiang Huang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Q.H. and D.L. designed and initiated the project. C.H., G.L., Y.Z., M.C., L.K. and Q.H. performed the cryo-EM experiments. C.H., C.J., H.Y., H.C. and D.L. expressed and purified proteins for the experiments, and conducted the DNA cleavage assay. G.L., Q.H., R.Y. and C.H. carried out the single-particle 3D reconstruction of the cryo-EM structure. Q.H., G.L. and C.H. performed the building and refinement of atomic models. Q.H., C.H. and G.L. wrote the manuscript, and Q.H. revised the manuscript with the assistance from G.L. and C.H., and Q.H. supervised the project.

Corresponding authors

Correspondence to Daru Lu or Qiang Huang.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Huai, C., Li, G., Yao, R. et al. Structural insights into DNA cleavage activation of CRISPR-Cas9 system. Nat Commun 8, 1375 (2017). https://doi.org/10.1038/s41467-017-01496-2

Download citation

Received: 06 February 2017
Accepted: 21 September 2017
Published: 09 November 2017
DOI: https://doi.org/10.1038/s41467-017-01496-2

This article is cited by

Engineered domain-inlaid Nme2Cas9 adenine base editors with increased on-target DNA editing and targeting scope
- Ding Zhao
- Xun Gao
- Zhanjun Li
BMC Biology (2023)
Genome-wide CRISPR off-target prediction and optimization using RNA-DNA interaction fingerprints
- Qinchang Chen
- Guohui Chuai
- Qi Liu
Nature Communications (2023)
Elongation roadblocks mediated by dCas9 across human genes modulate transcription and nascent RNA processing
- Inna Zukher
- Gwendal Dujardin
- Nick J. Proudfoot
Nature Structural & Molecular Biology (2023)
Sniper2L is a high-fidelity Cas9 variant with high activity
- Young-hoon Kim
- Nahye Kim
- Hyongbum Henry Kim
Nature Chemical Biology (2023)
CRISPR/Cas9 gRNA activity depends on free energy changes and on the target PAM context
- Giulia I. Corsi
- Kunli Qu
- Jan Gorodkin
Nature Communications (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.