Real-time observation of DNA recognition and rejection by the RNA-guided endonuclease Cas9

Singh, Digvijay; Sternberg, Samuel H.; Fei, Jingyi; Doudna, Jennifer A.; Ha, Taekjip

doi:10.1038/ncomms12778

Download PDF

Article
Open access
Published: 14 September 2016

Real-time observation of DNA recognition and rejection by the RNA-guided endonuclease Cas9

Digvijay Singh¹^nAff9^nAff10^nAff11,
Samuel H. Sternberg²,
Jingyi Fei^3,4^nAff12,
Jennifer A. Doudna^2,5,6,7,8 &
…
Taekjip Ha ORCID: orcid.org/0000-0003-2195-6258^1,3,4^nAff9^nAff10^nAff11

Nature Communications volume 7, Article number: 12778 (2016) Cite this article

17k Accesses
163 Citations
174 Altmetric
Metrics details

Subjects

Abstract

Binding specificity of Cas9–guide RNA complexes to DNA is important for genome-engineering applications; however, how mismatches influence target recognition/rejection kinetics is not well understood. Here we used single-molecule FRET to probe real-time interactions between Cas9–RNA and DNA targets. The bimolecular association rate is only weakly dependent on sequence; however, the dissociation rate greatly increases from <0.006 s⁻¹ to >2 s⁻¹ upon introduction of mismatches proximal to protospacer-adjacent motif (PAM), demonstrating that mismatches encountered early during heteroduplex formation induce rapid rejection of off-target DNA. In contrast, PAM-distal mismatches up to 11 base pairs in length, which prevent DNA cleavage, still allow formation of a stable complex (dissociation rate <0.006 s⁻¹), suggesting that extremely slow rejection could sequester Cas9–RNA, increasing the Cas9 expression level necessary for genome-editing, thereby aggravating off-target effects. We also observed at least two different bound FRET states that may represent distinct steps in target search and proofreading.

Structural basis for mismatch surveillance by CRISPR–Cas9

Article Open access 02 March 2022

CRISPR–Cas12a-mediated DNA clamping triggers target-strand cleavage

Article Open access 14 July 2022

The energy landscape for R-loop formation by the CRISPR–Cas Cascade complex

Article 06 July 2023

Introduction

CRISPR (clustered regularly interspaced short palindromic repeats)–Cas systems provide adaptive immunity against foreign genetic elements in bacteria and archaea¹. In type II systems, the Cas9 endonuclease functions together with a dual-guide RNA comprising CRISPR RNA (crRNA) and trans-activating crRNA (tracrRNA) to target 20 base pair (bp) DNA sequences (cognate sequence) for double-stranded cleavage². Efficient targeting requires RNA–DNA complementarity as well as a specific motif flanking the target sequence called the PAM (protospacer adjacent motif, 5′-NGG-3′ for Streptococcus pyogenes Cas9)^2,3,4. Cas9–RNA complexes have proven to be extremely versatile tools for genome-engineering applications⁵, and minimizing off-target effects^6,7 remains an active area of study.

Numerous studies have assessed off-target DNA binding and cleavage by the Cas9–RNA complex, both in vitro and in vivo^{2,4,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43}. While subtly different conclusions have been reached depending on the exact method of analysis, these studies agreed about specificity being heavily influenced by the presence of a PAM, a 7–12 bp-long seed sequence proximal to the PAM, and the concentration of Cas9 and guide-RNA. Most of previous studies lacked dynamic information on DNA targeting; yet, in order to improve the efficacy of processing only the correct targets, we need such information on targeting dynamics. Single-molecule methods are ideal for this task because they can detect wide-ranging interactions (transient to long-lived) and identify multiple states in real time⁴⁴. Moreover, they can be used to obtain the dynamic information on short and specific DNA sequences, thus enabling the sequence-specific estimation of various kinetic parameters^45,46,47. Several single-molecule studies have examined sequence specificity in CRISPR targeting^{40,41,42,43,48,49}. Here we report a systematic investigation of the binding and dissociation kinetics of Cas9–RNA as a function of sequence mismatches to determine how quickly the cognate sequence is recognized and how quickly partially matching sequences are rejected.

Results

DNA interrogation by Cas9–RNA

We used single-molecule fluorescence resonance energy transfer^50,51 (smFRET) to directly observe individual Cas9–RNA complexes binding to DNA targets in real time. Donor (Cy3) and acceptor (Cy5) fluorophores were conjugated to modified nucleotides in the DNA target and crRNA, respectively, so that FRET between them would report on Cas9–RNA binding to the DNA (Fig. 1a and Supplementary Fig. 1). Fluorescence labelling did not compromise target cleavage (Supplementary Fig. 2). After introducing 20 nM Cas9–RNA complexes to cognate DNA target molecules immobilized on passivated microscope slides, two distinct populations were observed centred at FRET=0.92 and 0, respectively (Fig. 1b,c). The labelling sites are separated by 30 Å (ref. 52; Supplementary Fig. 1), consistent with the observation of the high FRET value upon Cas9–RNA binding. In control experiments using a non-cognate (fully mismatched) DNA target with PAM (Supplementary Table 1), or guide-RNA without Cas9, the 0.92 FRET state was not observed (Fig. 1c). Therefore, we assigned the 0.92 FRET state to a stably formed Cas9–RNA–DNA complex. The high FRET state was long-lived, with a lifetime (>3 min) limited only by fluorophore photobleaching (Supplementary Fig. 3d). A catalytically dead Cas9 mutant (dCas9; D10A and H840A mutations^2,3) showed signal indistinguishable from active Cas9 (Supplementary Fig. 3), indicating that DNA products remain tightly bound after cleavage as was observed previously⁴ (Supplementary Fig. 2). To capture the moment of binding, we added Cas9–RNA into the flow cell during data acquisition. FRET efficiency increased from 0 to 0.92 in a single step (Fig. 1d), suggesting that any intermediates on-path to target binding, if present, cannot be resolved at the time resolution of our measurements (0.1 s).

**Figure 1: Cas9–RNA binding to a cognate sequence.**

Effect of DNA target mismatches on Cas9–RNA binding

We next examined how DNA targets with imperfect RNA–DNA complementarity are discriminated against and rejected by Cas9–RNA. We prepared a series of donor-labelled, fully duplexed DNA containing mismatches relative to the guide-RNA (Supplementary Table 1 and Fig. 2a). The mismatches were introduced either from the PAM-proximal side or from the PAM-distal side, and are denoted using the naming convention x–y_mm where xth through yth bps are mismatched. The fraction of DNA bound by Cas9–RNA (ratio between counts with FRET >0.75 and total counts in FRET histograms) remained identical to the cognate DNA up to 12 PAM-distal mismatches (17–20_mm, 13–20_mm, 12–20_mm, 11–20_mm, 10–20_mm, 9–20_mm; Fig. 2b,d). The bound state remained stable, with the observed lifetimes limited only by fluorophore photobleaching (Supplementary Fig. 3d). A large decrease in the bound fraction occurred only when the number of mismatches from the distal end exceeded 13 bp (7–20_mm, 6–20_mm, 5–20_mm), corresponding to less than seven matched bp from the PAM-proximal end. In contrast, even 2 bp mismatches from the PAM-proximal end (1–2_mm) were deleterious for Cas9–RNA binding and binding to 4 bp PAM-proximal mismatches (1–4_mm) was indistinguishable from binding to fully mismatched (1–20_mm), underscoring the importance of the PAM-proximal seed region (Fig. 2c,d).

**Figure 2: Cas9–RNA binding to DNA with proximal or distal mismatches.**

Different DNA-binding modes of Cas9–RNA

For DNA targets to which Cas9–RNA binds weakly, we observed a second bound state with a mid-FRET peak at ∼0.42, in addition to the 0.92 high FRET state. Single-molecule time trajectories (Fig. 3a) and transition density plots reporting on the relative transition frequencies after hidden Markov modelling analysis⁵³ (Fig. 3b and Supplementary Figs 4–6) revealed reversible transitions between the unbound (FRET<0.2) state and both mid and high FRET=2 bound states, and lifetime analysis as a function of Cas9–RNA concentration confirmed that transitions are due to Cas9–RNA association/dissociation events (Supplementary Figs 7 and 8). The mid FRET was more frequently observed as the number of mismatches increased (Fig. 3b and Supplementary Fig. 4), and were also observed for DNA targets without PAM or without matching sequence, indicating that it does not require either (Supplementary Figs 9 and 10). The high FRET state was rarely observed without PAM (Supplementary Fig. 10). We propose that the Cas9–RNA has two binding modes (Fig. 4). The mid-FRET state (sampling mode) likely does not involve RNA–DNA heteroduplex formation and may represent a mode of PAM surveillance. It is possible that local diffusion may give rise to the time-averaged FRET value of the mid-FRET state. Sequence-independent sampling of DNA target for PAM can occur at multiple locations in the DNA target, and the mid-FRET state, representative of this sampling, expectedly had a broad FRET distribution (Supplementary Figs 4–6). If PAM is recognized during transient biding in the sampling mode, RNA–DNA heteroduplex formation follows, resulting in the high FRET state. Multimodal binding kinetics were also observed for Cascade in Type I CRISPR systems; however, its shorter-lived binding mode plays a priming function, which is absent for Cas9 (ref. 42).

**Figure 3: Cas9–RNA-bound state lifetimes for different DNA targets.**

**Figure 4: The proposed model of bimodal Cas9–RNA binding along with the kinetics of Cas9–RNA DNA targeting as a function of mismatches.**

Effect of mismatches on binding and dissociation kinetics

Survival probability distributions of dwell times in the bound state (FRET>0.2) before transitioning to the unbound state were best described by a double-exponential decay (Supplementary Fig. 11). The amplitude-weighted lifetime of the bound state (τ_avg) decreased precipitously even with just 2 bp PAM-proximal mismatches, likely because the R-loop failed to extend beyond the mismatches. In contrast, 12 bp mismatches were necessary from the distal end for any detectable decrease in τ_avg (Fig. 3c). Similar pattern was also observed for lifetime of transitions from high to zero FRET state, whereas the lifetimes of the mid to zero FRET state remained short for all DNA targets tested, on average ∼0.1 s (Fig. 3c), supporting our proposal that the mid-FRET state is a sampling mode that does not require sequence recognition. In contrast to the bound state lifetimes, the lifetimes of the unbound state were only weakly dependent on sequence (Fig. 3c and Supplementary Fig. 9), yielding the bimolecular association rate constant (k_on) of ∼6 × 10⁶ M⁻¹ s⁻¹ with some reduction for DNA targets without PAM. Overall, our kinetic analysis showed that mismatches affect Cas9–RNA binding mainly through changes in the dissociation rate. Complete kinetic model along with rates of transitions between different states for some DNA targets are in Fig. 4 and Supplementary Fig. 12.

The relative importance of PAM-proximal bps over the PAM-distal bps Fig. 2d, 3c supports the model of unidirectional extension of the RNA–DNA heteroduplex starting from the PAM-proximal end^4,48. For 5–20_mm that has a maximum of 4 bp heteroduplex extension from PAM, the bound state lifetime was ∼0.5 s, that is, such potential targets are rapidly rejected. The bound state lifetime increased to ∼8 s for 8–20_mm with 7 bp heteroduplex, and to ∼16 s for 9–20_mm with 8 bp heteroduplex. For 9 bp or more heteroduplexes, the measured lifetime was limited by photobleaching lifetime of ∼3 min (Supplementary Fig. 3). Therefore, DNA sequences with nine or more matching bp from the PAM-proximal end have extremely long times, and Cas9–RNA would be unable to reject such sequences rapidly. A prediction of this model is that inserting a roadblock of mismatches near this boundary would prematurely terminate heteroduplex extension such that dissociation kinetics would be independent of the presence of a matched sequence beyond the block. To test this prediction, we created two ‘roadblock’ targets, 9–12_mm and 5–8_mm. Indeed, the binding fraction and the lifetime of the bound state (Supplementary Fig. 13 and Fig. 3d) for 9–12_mm and 5–8_mm were similar to those of 9–20_mm and 5–20_mm, respectively, confirming our prediction.

Discussion

A previous single-molecule⁴⁸ study that investigated the Cas9—RNA-induced RNA–DNA heteroduplex formation via magnetic tweezers observed 11 PAM-proximal matches to be sufficient for stable RNA–DNA heteroduplex formation for StCas9 (the Cas9 orthologue from Streptococcus thermophilus). In our current study using SpCas9 (from S. pyogenes, simply referred to as Cas9), we found 9–10 PAM-proximal matches to be sufficient for ultrastable Cas9–RNA binding. The stability of RNA-guided CRISPR enzymes and DNA targets depends on energetic contributions of RNA–DNA heteroduplex and interactions between the DNA target and amino-acid residues of the CRISPR enzymes. The latter has been fine-tuned^54,55 through protein engineering to create more specific Cas9 variants and the small differences between different Cas9 orthologues may stem from the variations in the interactions between the DNA target and protein residues.

A two-step mechanism of Cas9–RNA binding involving PAM surveillance in the sampling mode and RNA–DNA heteroduplex formation upon PAM recognition (Fig. 4) is also supported by structural analysis of Cas9 and Cas9–RNA–DNA ternary complexes, in which interactions between PAM-interacting amino-acid motifs in Cas9 and the PAM of the DNA target precede and guide the further RNA–DNA heteroduplex formation^19,52,56. Our observation that the heteroduplex lifetime increases greatly between 6 and 8 bps can be explained by the recently determined Cas9–RNA structure⁵⁶, in which Watson–Crick faces of eight PAM-proximal nucleotides are solvent-exposed, thus primed for heteroduplex formation. Once an RNA–DNA heteroduplex of 8 bp or more is formed, Cas9–RNA establishes a stable complex with the DNA, regardless of PAM-distal mismatches. Therefore, Cas9–RNA is unable to rapidly reject such off-target DNA, which it cannot cleave, and is sequestered by off-target DNA, limiting the speed of genome editing. This effect would increase the minimal amount of Cas9–RNA required for genome editing, and may in turn lead to an increase in off-target cleavage. For applications requiring binding only, for example, genome decoration or gene regulation, binding specificity will be almost entirely determined by the first 8 or 9 bps away from PAM, greatly reducing the ability to target well-defined sequences in a large genome. For example, we found that 1,126 positions in the human reference genome match PAM plus 8 bp in the sequence we used (Supplementary Table 2). For future improvements in Cas9 proteins, we suggest that one should focus on rapid rejection of such off targets. Our observations may also further inform the design of the guide-RNA and the DNA targets with minimal off-target effects^{57,58,59,60,61,62,63,64,65,66,67,68}.

Methods

Preparation of DNA targets

All DNA oligonucleotides were purchased from Integrated DNA Technologies (Coralville, IA 52241). The Cy3 label in the DNA target is located 3 bp upstream of the PAM (5′-NGG-3′) and was achieved via conjugation of Cy3 N-hydroxysuccinimido (NHS) to an amino group attached to a modified thymine through a C6 linker (amino-dT). The entire panel of DNA targets used in our measurements is available in Supplementary Table 1. A 22-nucleotide-long biotinylated adaptor strand was used for surface immobilization (Supplementary Fig. 1b). DNA targets were prepared by mixing all three component strands and heating to 90 °C followed by cooling to room temperature over 3 h.

Expression and purification of Cas9 and dCas9

The protein purification protocol was adapted from pervious methods^2,19 as follows: a fusion construct inserted into a custom pET-based expression vector was used for protein expression. The fusion construct consisted of the sequence encoding Cas9 (Cas9 residues 1–1,368 from S. pyogenes) and an N-terminal decahistidine-maltose-binding protein (His10-MBP) tag, followed by a peptide sequence containing a tobacco etch virus protease cleavage site. The fusion protein was expressed in Escherichia coli strain BL21 Rosetta 2 (DE3; EMD Biosciences), grown in 2xYT medium at 18 °C for 16 h following induction with 0.5 mM isopropyl β-D-1-thiogalactopyranoside (IPTG). The harvested cells were lysed in 50 mM Tris pH 7.5, 500 mM NaCl, 5% glycerol, 1 mM tris(2-carboxyethyl)phosphine (TCEP), supplemented with protease inhibitor cocktail (Roche), and then homogenized (Avestin). Following ultracentrifugation, the supernatant-clarified cell lysate was separated from the cellular debris and bound in batch to Ni-NTA agarose (Qiagen). The resin was washed extensively with 50 mM Tris pH 7.5, 500 mM NaCl, 10 mM imidazole, 5% glycerol and 1 mM TCEP, and the bound protein was eluted in a single step with 50 mM Tris pH 7.5, 500 mM NaCl, 300 mM imidazole, 5% glycerol and 1 mM TCEP. TEV protease was added to the elutant and cleavage of the protein fusion was allowed to proceed overnight. Cas9 was then dialysed into Buffer A (20 mM Tris-Cl pH 7.5, 125 mM KCl, 5% glycerol and 1 mM TCEP) for 3 h at 4 °C, before being applied on a 5 ml HiTrap SP HP sepharose column (GE Healthcare). After washing with Buffer A for three column volumes, Cas9 was eluted using a linear gradient from 0 to 100% Buffer B (20 mM Tris-Cl pH 7.5, 1 M KCl, 5% glycerol and 1 mM TCEP) over 20 column volumes. The protein was further purified using gel filtration chromatography on a Superdex 200 16/60 column (GE Healthcare) in Cas9 Storage Buffer (20 mM Tris-Cl pH 7.5, 200 mM KCl, 5% glycerol and 1 mM TCEP). Cas9 was stored at −80 °C. Catalytically dead Cas9 (dCas9; D10A/H840A mutations) was prepared with the same protocol.

Preparation of guide-RNA and Cas9–RNA

The guide-RNA consists of crRNA and tracrRNA. The crRNA with an amino-dT was purchased from Integrated DNA Technologies and was labelled using Cy5-NHS. The tracrRNA was prepared using in vitro transcription as described previously⁴. The guide-RNA was assembled freshly for each experiment by mixing equimolar amount of Cy5-labelled crRNA with tracrRNA, heated to 80 °C followed by slow cooling to room temperature. The guide-RNA was then complexed with Cas9 (two to three times the stoichiometric amount of guide-RNA) to form the Cas9–RNA complex for use in imaging experiments. RNA sequences are available in Supplementary Table 1. A detailed schematic of the DNA and the Cas9–RNA design can be found in the Supplementary Fig. 1. The Cas9–RNA activity on the cognate sequence used in this study was characterized previously⁴. Our biochemical assays showed that fluorophore labelling in the DNA target or crRNA had not impaired DNA target cleavage. (Supplementary Fig. 2).

Single-molecule detection and data analysis

Cy3-labelled DNA targets were immobilized on the polyethylene glycol-passivated surface using neutravidin–biotin interaction. The DNA target molecules were then imaged in the presence of Cy5-labelled Cas9–RNA (referred to as Cas9–RNA for brevity here) using the total internal reflection fluorescence microscopy. Imaging was performed at room temperature in a buffer (20 mM Tris-HCl, 100 mM KCl, 5 mM MgCl₂, 5% (v/v) glycerol, 0.2 mg ml⁻¹ bovine serum albumin, 1 mg ml⁻¹ glucose oxidase, 0.04 mg ml⁻¹ catalase, 0.8% dextrose and saturated Trolox (∼3 mM)). The time resolution for all the experiments was 100 ms, unless stated otherwise. Detailed methods of smFRET data acquisition and analysis were described previously⁵¹. The FRET efficiency of a single molecule was approximated as FRET=I_A/(I_D+I_A), where I_D and I_A are the background and leakage-corrected emission intensities of the donor and acceptor, respectively.

FRET histograms and Cas9–RNA-bound DNA fraction

The first five frames (100 ms each) of each of the molecule’s FRET time trajectories were used as data points to construct the FRET histograms. The first 10 frames were used for the FRET histograms in Supplementary Fig. 2. The Cas9–RNA-bound DNA fraction was calculated as the fraction of data points with FRET >0.75 and the total number of data points in the FRET histograms. For each DNA target, the single-molecule FRET time trajectories from independent experiments were combined together to construct the FRET histograms as described.

Lifetime analysis of bound and unbound states

To confirm that the FRET signal indeed reports on Cas9–RNA binding, the lifetimes of the zero FRET (FRET<0.2) and the putative bound state (mid- and high-FRET states taken as a single state, FRET>0.2) were determined as a function of Cas9–RNA concentration (Cas9–RNA). On the basis of this cutoff of FRET=0.2, the survival probability of the zero FRET state versus time could fit well with a single exponential decay, and the decay rate increased linearly with [Cas9–RNA]. In contrast, the survival probability versus time for the bound state had to be fit with a double exponential decay and the decay rates did not depend on [Cas9–RNA] (Supplementary Fig. 2). Therefore, a bimolecular association/disassociation kinetics was used for the analysis of DNA binding by Cas9–RNA.

Lifetime of the bound state via thresholding

In order to perform an unbiased analysis of apparently three-state FRET fluctuations observed from binding-challenged DNA targets, we employed hidden Markov model analysis and generated idealized FRET time trajectories⁵³, assuming that there are three distinct FRET states (high-, mid- and zero-FRET states). To estimate the lifetime of the putative bound states, the survival probability of all the bound state events (mid- and high-FRET states taken as a single state, FRET>0.2) versus time was fit using a double exponential decay profile (A₁ exp(−t/τ₁)+A₂ exp(−t/τ₂)) (Supplementary Fig. 11). The final bound state lifetime (τ_{avg, observed}) is an amplitude-weighted average of two distinct lifetimes τ₁ and τ₂, that is, τ_{avg, observed}=A₁τ₁+A₂τ₂ (k_observed=1/τ_{avg, observed}).

Association rates

We determined the observed rates of binding k_binding using two independent methods. First, the Cas9–RNA binding events were captured in real time by flowing Cas9–RNA into the sample chamber with immobilized DNA target molecules (Supplementary Fig. 7a). Second, for the binding-challenged DNA targets that showed reversible association/disassociation, the smFRET time trajectories obtained under steady-state conditions were used to extract the unbound state duration between adjacent binding events (Supplementary Fig. 7b). These dwell times in the unbound state were then used to get the rate of association by fitting their survival probability distribution to a single exponential decay (Supplementary Fig. 8a).

Rates of transitions between different states

Generation of idealized FRET time trajectories using the hidden Markov model⁵³ yielded three different FRET states (zero, mid and high) along with the probabilities of transitions between the various FRET states. The log of transition probabilities between any two states was used to estimate the mean transition probability between the two given states, which was then used to estimate the rate as following:

k_A–B (s⁻¹)=T_{p (A−B)} × Sampling rate of image acquisition

where k_A–B is the rate of transition from state A to B and T_{p (A−B)} is the mean probability of transition from A to B.

If each frame is acquired over 0.1 s, then the sampling rate (1/0.1)=10 s⁻¹.

Correction factors

Because the high FRET state was very long-lived for certain DNA targets (that is, 8–20_mm, 9–20_mm, 9–12_mm, 1–2_mm), their dwell times were not accurately captured because of photobleaching-induced truncation of smFRET time trajectories. The same is true for the dwell time of the unbound state. We made the following correction to obtain the actual rate.

k_actual=k_observed−k_{photobleach (high/zero FRET state)} where k_observed is the rate calculated above and k_{photobleach (high/zero FRET state)} is the rate of photobleaching of the high- or zero-FRET state. Finally, we obtain τ_avg=1/k_actual.

Counts of DNA target sequences in human genome

The human genome assembly (GRCh38.p6) was analysed using custom MATLAB scripts to calculate the total occurrences of DNA target sequences used in this study, which is referred to as the actual count (Supplementary Table 2). The total number of occurrences expected for a sequence, assuming a random distribution of A, T, G and C nucleotides, is referred to as the probabilistic count and is calculated as follows: probabilistic count=(¼)ⁿ × total number of bp in human genome (3.2 billion) where ¼ is the probability of occurrence of any given nucleotide at a position in the sequence and n is the number of bp in the genome.

Data availability

Any additional data that support the findings of this study are available from the corresponding author upon request.

Additional information

How to cite this article: Singh, D. et al. Real-time observation of DNA recognition and rejection by the RNA-guided endonuclease Cas9. Nat. Commun. 7:12778 doi: 10.1038/ncomms12778 (2016).

References

Wright, A. V., Nunez, J. K. & Doudna, J. A. Biology and applications of CRISPR systems: harnessing nature's toolbox for genome engineering. Cell 164, 29–44 (2016).
Article CAS Google Scholar
Jinek, M. et al. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science 337, 816–821 (2012).
Article ADS CAS Google Scholar
Gasiunas, G., Barrangou, R., Horvath, P. & Siksnys, V. Cas9–crRNA ribonucleoprotein complex mediates specific DNA cleavage for adaptive immunity in bacteria. Proc. Natl Acad. Sci. USA 109, E2579–E2586 (2012).
Article ADS CAS Google Scholar
Sternberg, S. H., Redding, S., Jinek, M., Greene, E. C. & Doudna, J. A. DNA interrogation by the CRISPR RNA-guided endonuclease Cas9. Nature 507, 62–67 (2014).
Article ADS CAS Google Scholar
Doudna, J. A. & Charpentier, E. Genome editing. The new frontier of genome engineering with CRISPR-Cas9. Science 346, 1258096 (2014).
Article Google Scholar
Wu, X., Kriz, A. J. & Sharp, P. A. Target specificity of the CRISPR-Cas9 system. Quant. Biol. 2, 59–70 (2014).
Article CAS Google Scholar
O'Geen, H., Yu, A. S. & Segal, D. J. How specific is CRISPR/Cas9 really? Curr. Opin. Chem. Biol. 29, 72–78 (2015).
Article CAS Google Scholar
Sapranauskas, R. et al. The Streptococcus thermophilus CRISPR/Cas system provides immunity in Escherichia coli. Nucleic Acids Res. 39, 9275–9282 (2011).
Article CAS Google Scholar
Semenova, E. et al. Interference by clustered regularly interspaced short palindromic repeat (CRISPR) RNA is governed by a seed sequence. Proc. Natl Acad. Sci. USA 108, 10098–10103 (2011).
Article ADS CAS Google Scholar
Cong, L. et al. Multiplex genome engineering using CRISPR/Cas systems. Science 339, 819–823 (2013).
Article ADS CAS Google Scholar
Fu, Y. et al. High-frequency off-target mutagenesis induced by CRISPR-Cas nucleases in human cells. Nat. Biotechnol. 31, 822–826 (2013).
Article CAS Google Scholar
Hsu, P. D. et al. DNA targeting specificity of RNA-guided Cas9 nucleases. Nat. Biotechnol. 31, 827–832 (2013).
Article CAS Google Scholar
Jiang, W., Bikard, D., Cox, D., Zhang, F. & Marraffini, L. A. RNA-guided editing of bacterial genomes using CRISPR-Cas systems. Nat. Biotechnol. 31, 233–239 (2013).
Article CAS Google Scholar
Jinek, M. et al. RNA-programmed genome editing in human cells. Elife 2, e00471 (2013).
Article Google Scholar
Mali, P. et al. CAS9 transcriptional activators for target specificity screening and paired nickases for cooperative genome engineering. Nat. Biotechnol. 31, 833–838 (2013).
Article CAS Google Scholar
Pattanayak, V. et al. High-throughput profiling of off-target DNA cleavage reveals RNA-programmed Cas9 nuclease specificity. Nat. Biotechnol. 31, 839–843 (2013).
Article CAS Google Scholar
Cho, S. W. et al. Analysis of off-target effects of CRISPR/Cas-derived RNA-guided endonucleases and nickases. Genome Res. 24, 132–141 (2014).
Article CAS Google Scholar
Doench, J. G. et al. Rational design of highly active sgRNAs for CRISPR-Cas9-mediated gene inactivation. Nat. Biotechnol. 32, 1262–1267 (2014).
Article CAS Google Scholar
Jinek, M. et al. Structures of Cas9 endonucleases reveal RNA-mediated conformational activation. Science 343, 1247997 (2014).
Article Google Scholar
Gagnon, J. A. et al. Efficient mutagenesis by Cas9 protein-mediated oligonucleotide insertion and large-scale assessment of single-guide RNAs. PLoS ONE 9, e98186 (2014).
Article ADS Google Scholar
Smith, C. et al. Whole-genome sequencing analysis reveals high specificity of CRISPR/Cas9 and TALEN-based genome editing in human iPSCs. Cell Stem Cell 15, 12–13 (2014).
Article CAS Google Scholar
Shen, B. et al. Efficient genome modification by CRISPR-Cas9 nickase with minimal off-target effects. Nat. Methods 11, 399–402 (2014).
Article CAS Google Scholar
Zhang, Y. et al. Comparison of non-canonical PAMs for CRISPR/Cas9-mediated DNA cleavage in human cells. Sci. Rep. 4, 5405 (2014).
Article Google Scholar
Frock, R. L. et al. Genome-wide detection of DNA double-stranded breaks induced by engineered nucleases. Nat. Biotechnol. 33, 179–186 (2015).
Article CAS Google Scholar
Iyer, V. et al. Off-target mutations are rare in Cas9-modified mice. Nat. Methods 12, 479 (2015).
Article CAS Google Scholar
Kim, D. et al. Digenome-seq: genome-wide profiling of CRISPR-Cas9 off-target effects in human cells. Nat. Methods 12, 237–243 (2015).
Article CAS Google Scholar
Paulis, M. et al. A pre-screening FISH-based method to detect CRISPR/Cas9 off-targets in mouse embryonic stem cells. Sci. Rep. 5, 12327 (2015).
Article ADS CAS Google Scholar
Singh, R., Kuscu, C., Quinlan, A., Qi, Y. & Adli, M. Cas9-chromatin binding information enables more accurate CRISPR off-target prediction. Nucleic Acids Res. 43, e118 (2015).
Article Google Scholar
Tan, E.-P., Li, Y., Velasco-Herrera, M. D. C., Yusa, K. & Bradley, A. Off-target assessment of CRISPR-Cas9 guiding RNAs in human iPS and mouse ES cells. Genesis 53, 225–236 (2015).
Article CAS Google Scholar
Tsai, S. Q. et al. GUIDE-seq enables genome-wide profiling of off-target cleavage by CRISPR-Cas nucleases. Nat. Biotechnol. 33, 187–197 (2015).
Article CAS Google Scholar
Wang, X. et al. Unbiased detection of off-target cleavage by CRISPR-Cas9 and TALENs using integrase-defective lentiviral vectors. Nat. Biotechnol. 33, 175–178 (2015).
Article CAS Google Scholar
Xu, H. et al. Sequence determinants of improved CRISPR sgRNA design. Genome Res. 25, 1147–1157 (2015).
Article CAS Google Scholar
Doench, J. G. et al. Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9. Nat. Biotechnol. 34, 184–191 (2016).
Article CAS Google Scholar
Cencic, R. et al. Protospacer adjacent motif (PAM)-distal sequences engage CRISPR Cas9 DNA target cleavage. PLoS ONE 9, e109213 (2014).
Article ADS Google Scholar
Duan, J. et al. Genome-wide identification of CRISPR/Cas9 off-targets in human genome. Cell Res. 24, 1009–1012 (2014).
Article CAS Google Scholar
Kuscu, C., Arslan, S., Singh, R., Thorpe, J. & Adli, M. Genome-wide analysis reveals characteristics of off-target sites bound by the Cas9 endonuclease. Nat. Biotechnol. 32, 677–683 (2014).
Article CAS Google Scholar
Wu, X. et al. Genome-wide binding of the CRISPR endonuclease Cas9 in mammalian cells. Nat. Biotechnol. 32, 670–676 (2014).
Article CAS Google Scholar
O'Geen, H., Henry, I. M., Bhakta, M. S., Meckler, J. F. & Segal, D. J. A genome-wide analysis of Cas9 binding specificity using ChIP-seq and targeted sequence capture. Nucleic Acids Res. 43, 3389–3404 (2015).
Article Google Scholar
Polstein, L. R. et al. Genome-wide specificity of DNA binding, gene regulation, and chromatin remodeling by TALE- and CRISPR/Cas9-based transcriptional activators. Genome Res. 25, 1158–1169 (2015).
Article CAS Google Scholar
Rutkauskas, M. et al. Directional R-loop formation by the CRISPR-Cas surveillance complex cascade provides efficient off-target site rejection. Cell Rep. S2211-1247, 00135–00137 (2015).
Google Scholar
Josephs, E. A. et al. Structure and specificity of the RNA-guided endonuclease Cas9 during DNA interrogation, target binding and cleavage. Nucleic Acids Res. 43, 8924–8941 (2015).
Article CAS Google Scholar
Blosser, T. R. et al. Two distinct DNA binding modes guide dual roles of a CRISPR-Cas protein complex. Mol. Cell 58, 60–70 (2015).
Article CAS Google Scholar
Knight, S. C. et al. Dynamics of CRISPR-Cas9 genome interrogation in living cells. Science 350, 823–826 (2015).
Article ADS CAS Google Scholar
Joo, C., Balci, H., Ishitsuka, Y., Buranachai, C. & Ha, T. Advances in single-molecule fluorescence methods for molecular biology. Annu. Rev. Biochem. 77, 51–76 (2008).
Article CAS Google Scholar
Ragunathan, K., Joo, C. & Ha, T. Real-time observation of strand exchange reaction with high spatiotemporal resolution. Structure 19, 1064–1073 (2011).
Article CAS Google Scholar
Lee, J. Y. et al. DNA recombination. Base triplet stepping by the Rad51/RecA family of recombinases. Science 349, 977–981 (2015).
Article ADS CAS Google Scholar
Ragunathan, K., Liu, C. & Ha, T. RecA filament sliding on DNA facilitates homology search. Elife 1, e00067 (2012).
Article Google Scholar
Szczelkun, M. D. et al. Direct observation of R-loop formation by single RNA-guided Cas9 and Cascade effector complexes. Proc. Natl Acad. Sci. USA 111, 9798–9803 (2014).
Article ADS CAS Google Scholar
Redding, S. et al. Surveillance and processing of foreign DNA by the Escherichia coli CRISPR-Cas system. Cell 163, 854–865 (2015).
Article CAS Google Scholar
Ha, T. et al. Probing the interaction between two single molecules: fluorescence resonance energy transfer between a single donor and a single acceptor. Proc. Natl Acad. Sci. USA 93, 6264–6268 (1996).
Article ADS CAS Google Scholar
Roy, R., Hohng, S. & Ha, T. A practical guide to single-molecule FRET. Nat. Methods 5, 507–516 (2008).
Article CAS Google Scholar
Anders, C., Niewoehner, O., Duerst, A. & Jinek, M. Structural basis of PAM-dependent target DNA recognition by the Cas9 endonuclease. Nature 513, 569–573 (2014).
Article ADS CAS Google Scholar
McKinney, S. A., Joo, C. & Ha, T. Analysis of single-molecule FRET trajectories using hidden Markov modeling. Biophys. J. 91, 1941–1951 (2006).
Article ADS CAS Google Scholar
Slaymaker, I. M. et al. Rationally engineered Cas9 nucleases with improved specificity. Science 351, 84–88 (2016).
Article ADS CAS Google Scholar
Kleinstiver, B. P. et al. High-fidelity CRISPR-Cas9 nucleases with no detectable genome-wide off-target effects. Nature 529, 490–495 (2016).
Article ADS CAS Google Scholar
Nishimasu, H. et al. Crystal structure of Cas9 in complex with guide RNA and target DNA. Cell 156, 935–949 (2014).
Article CAS Google Scholar
Hsu, P. D. et al. DNA targeting specificity of RNA-guided Cas9 nucleases. Nat. Biotechnol. 31, 827–832 (2013).
Article CAS Google Scholar
Bae, S., Park, J. & Kim, J.-S. Cas-OFFinder: a fast and versatile algorithm that searches for potential off-target sites of Cas9 RNA-guided endonucleases. Bioinformatics 30, 1473–1475 (2014).
Article CAS Google Scholar
Heigwer, F., Kerr, G. & Boutros, M. E-CRISP: fast CRISPR target site identification. Nat. Methods 11, 122–123 (2014).
Article CAS Google Scholar
Ren, X. et al. Enhanced specificity and efficiency of the CRISPR/Cas9 system with optimized sgRNA parameters in Drosophila. Cell Rep. 9, 1151–1162 (2014).
Article CAS Google Scholar
Wang, T., Wei, J. J., Sabatini, D. M. & Lander, E. S. Genetic screens in human cells using the CRISPR-Cas9 system. Science 343, 80–84 (2014).
Article ADS CAS Google Scholar
Chari, R., Mali, P., Moosburner, M. & Church, G. M. Unraveling CRISPR-Cas9 genome engineering parameters via a library-on-library approach. Nat. Methods 12, 823–826 (2015).
Article CAS Google Scholar
Farboud, B. & Meyer, B. J. Dramatic enhancement of genome editing by CRISPR/Cas9 through improved guide RNA design. Genetics 199, 959–971 (2015).
Article CAS Google Scholar
Moreno-Mateos, M. A. et al. CRISPRscan: designing highly efficient sgRNAs for CRISPR-Cas9 targeting in vivo. Nat. Methods 12, 982–988 (2015).
Article CAS Google Scholar
Stemmer, M., Thumberger, T., Del Sol Keyer, M., Wittbrodt, J. & Mateo, J. L. CCTop: an intuitive, flexible and reliable CRISPR/Cas9 target prediction tool. PLoS ONE 10, e0124633 (2015).
Article Google Scholar
Tsai, S. Q. et al. GUIDE-seq enables genome-wide profiling of off-target cleavage by CRISPR-Cas nucleases. Nat. Biotechnol. 33, 187–197 (2015).
Article CAS Google Scholar
Wong, N., Liu, W. & Wang, X. WU-CRISPR: characteristics of functional guide RNAs for the CRISPR/Cas9 system. Genome Biol. 16, 218 (2015).
Article Google Scholar
Labun, K. et al. CHOPCHOP v2: a web tool for the next generation of CRISPR genome engineering. Nucleic Acids Res. 44, W272–W276 (2016).
Article CAS Google Scholar

Download references

Acknowledgements

We thank current and past members of the Ha and Doudna group for various suggestions. The project was supported by grants from the National Science Foundation (PHY-1430124 to T.H. and 1244557 to J.A.D.) and National Institutes of Health (GM065367; GM112659 to T.H.); T.H. and J.A.D. are investigators with the Howard Hughes Medical Institute.

Author information

Digvijay Singh & Taekjip Ha
Present address: Present address: Department of Biophysics and Biophysical Chemistry, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA,
Digvijay Singh & Taekjip Ha
Present address: Present address: Department of Biophysics, Johns Hopkins University, Baltimore, Maryland 21205, USA,
Digvijay Singh & Taekjip Ha
Present address: Present address: Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21205, USA,
Jingyi Fei
Present address: Present address: Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois 60637, USA,

Authors and Affiliations

Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, 61801, Illinois, USA
Digvijay Singh & Taekjip Ha
Department of Chemistry, University of California, Berkeley, 94720, California, USA
Samuel H. Sternberg & Jennifer A. Doudna
Department of Physics and Center for the Physics of Living Cells, University of Illinois at Urbana-Champaign, Urbana, 61801, Illinois, USA
Jingyi Fei & Taekjip Ha
Howard Hughes Medical Institute, Baltimore, 21205, Maryland, USA
Jingyi Fei & Taekjip Ha
Department of Molecular and Cell Biology, University of California, Berkeley, 94720, California, USA
Jennifer A. Doudna
Howard Hughes Medical Institute, Berkeley, 94720, California, USA
Jennifer A. Doudna
Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, 94720, California, USA
Jennifer A. Doudna
Innovative Genomics Initiative, University of California, Berkeley, 94720, California, USA
Jennifer A. Doudna

Authors

Digvijay Singh
View author publications
You can also search for this author in PubMed Google Scholar
Samuel H. Sternberg
View author publications
You can also search for this author in PubMed Google Scholar
Jingyi Fei
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer A. Doudna
View author publications
You can also search for this author in PubMed Google Scholar
Taekjip Ha
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.S., S.H.S., J.F., T.H. and J.A.D. designed the experiments. D.S. conducted all the single-molecule experiments and synthesized guide-RNA. S.H.S. prepared Cas9, dCas9 and guide RNAs, and conducted biochemical DNA cleavage assays. D.S. and J.F. performed the data analysis. All authors discussed the data; D.S., S.H.S., J.F. and T.H. wrote the manuscript.

Corresponding authors

Correspondence to Jennifer A. Doudna or Taekjip Ha.

Ethics declarations

Competing interests

S.H.S and J.A.D. are inventors on a related patent application. The other authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures 1-13, Supplementary Tables 1-2 and Supplementary References (PDF 7284 kb)

Peer review file (PDF 319 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Singh, D., Sternberg, S., Fei, J. et al. Real-time observation of DNA recognition and rejection by the RNA-guided endonuclease Cas9. Nat Commun 7, 12778 (2016). https://doi.org/10.1038/ncomms12778

Download citation

Received: 11 April 2016
Accepted: 01 August 2016
Published: 14 September 2016
DOI: https://doi.org/10.1038/ncomms12778

This article is cited by

Sniper2L is a high-fidelity Cas9 variant with high activity
- Young-hoon Kim
- Nahye Kim
- Hyongbum Henry Kim
Nature Chemical Biology (2023)
Improving the sensitivity of in vivo CRISPR off-target detection with DISCOVER-Seq+
- Roger S. Zou
- Yang Liu
- Taekjip Ha
Nature Methods (2023)
A tweak and a peek: How Cas9 pries open double-stranded DNA to check its sequence
- Selma Sinan
- Rick Russell
Nature Structural & Molecular Biology (2022)
Salinity tolerance mechanisms and their breeding implications
- Mandeep Singh
- Usha Nara
- Sittal Thapa
Journal of Genetic Engineering and Biotechnology (2021)
Targeted insertion of large genetic payloads using cas directed LINE-1 reverse transcriptase
- Femila Manoj
- Laura W. Tai
- Thomas E. Kuhlman
Scientific Reports (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.