Abstract
Cpf1 is an RNA-guided endonuclease that can be programmed to cleave DNA targets. Specific features, such as containing a short crRNA, creating a staggered cleavage pattern and having a low off-target rate, render Cpf1 a promising gene-editing tool. Here, we present a new Cpf1 ortholog, EeCpf1, as a genome-editing tool; this ortholog is derived from the gut bacterial species Eubacterium eligens. EeCpf1 exhibits a higher cleavage activity with the Mn2+ metal cofactor and efficiently cuts the target DNA with an engineered, nucleotide extended crRNA at the 5′ target site. When mouse blastocysts were injected with multitargeting crRNAs against the IL2R-γ gene, an essential gene for immunodeficient mouse model production, EeCpf1 efficiently generated IL2R-γ knockout mice. For the first time, these results demonstrate that EeCpf1 can be used as an in vivo gene-editing tool for the production of knockout mice. The utilization of engineered crRNA with multiple target sites will help to explore the in vivo DNA cleavage activities of Cpf1 orthologs from other species that have not been demonstrated.
Similar content being viewed by others
Introduction
Clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated (Cas) systems eliminate invading genetic elements in prokaryotes1. CRISPR/Cas effector proteins are RNA-guided endonucleases that can be programmed to cleave DNA or/and RNA targets. Based on the cas gene content and the mechanism of action, CRISPR/Cas systems are classified into two classes and six types. The multiprotein complex of the class 1 systems detects foreign nucleic acids and then degrades DNA or/and RNA2,3. Class 2 systems of Cas9 (type II) and Cpf1 (type V) consist of single-component effector proteins that have been repurposed for genome editing in various organisms4,5. Cas9 is guided by a hybrid of CRISPR RNA (crRNA) and a trans-activating crRNA (tracrRNA) to cleave the target DNA with a protospacer adjacent motif (PAM). SpCas9, derived from Streptococcus pyogenes, was the first CRISPR/Cas protein to enable targeted mutagenesis and is still the most widely used genome-editing tool6,7. Recently, the type V of Cpf1 has also been rapidly characterized and developed into a range of genome editing and regulation tools8. Some features of the Cpf1 system differ from those of the Cas9 system, although their target recognition mechanism is similar. While the Cas9-RNA complex contains two RNA molecules, Cpf1 is guided by a single crRNA (~41 nt). The PAM site of Cas9 is located 3′ to the target DNA, while in Cpf1, it is located 5′ to the target DNA. Cas9 typically uses a guanine-rich PAM, such as NGG, while Cpf1 utilizes a thymidine-rich PAM, such as TTTN. Cas9 makes a blunt cut adjacent to the PAM, while Cpf1 generates a 5 base pair (bp) staggered cut 17 nt downstream of the PAM9. These specific features of Cpf1, together with the feature of lower off-target cleavage rates compared to those of Cas9, can broaden the spectrum of genome editing to various fields10. In search of Cpf1 orthologs capable of genome editing, a variety of Cpf1 proteins was explored by the PSI-BLAST program, and 52 nonredundant CRISPR-Cpf1 loci were previously identified8,11,12. Among them, seven Cpf1 orthologs (Francisella novicida U112 (FnCpf1), Acidaminococcus sp. BV36L (AsCpf1), Lachnospiraceae bacterium ND2006 (LbCpf1), Thiomicrospira sp. Xs5 (TsCpf1), Moraxella bovoculi AAX08_00205 (Mb2Cpf1), Moraxella bovoculi AAX11_00205 (Mb3Cpf1), and Butyrivibrio sp. NC3005 (BsCpf1)) were shown to have DNA cleavage functions in vivo, while the Cpf1 variants of FnCpf1, AsCpf1 and LbCpf1 have been studied most intensively and have been used as gene-editing tools13,14. Although EeCpf1 from Eubacterium eligens was first identified by its sequence homology, its in vivo DNA cleavage activity has never been reported. Recently, we found that the catalytic mutant of EeCpf1 functions as an efficient transcriptional regulator for gene expression in bacteria. Here, we show the in vivo DNA cleavage activity of EeCpf1 for the first time and present it as an efficient genome-editing tool.
Materials and Methods
Cloning, protein expression and purification
The gene encoding Cpf1 (WP_012739647.1) was amplified from the genomic DNA of E. eligens (ATCC 27750) by PCR and was ligated into a modified pET-22b(+) plasmid to produce the protein with a 6xHis-tag and a Cysteine Protease Domain (CPD) tag at the C-terminus. The resulting pET22b_EeCpf1-CPD plasmid was transformed into the E. coli strain BL21-Codon Plus (DE3)-RIL (Agilent Technologies). E. coli that harbored EeCpf1-CPD were cultured in LB medium that contained ampicillin to an OD600 of 0.6 and were induced by adding 1 mM IPTG at an incubation temperature of 18 °C for 16 hours. The cells were collected by centrifugation (6000 g, 30 min), resuspended in 300 mL of lysis buffer (30 mM Tris-HCl (pH 7.5), 150 mM NaCl, 5 mM β-mercaptoethanol, and 10% glycerol), and disrupted by sonication in an ice bath (VC-600 sonicator; Sonics & Materials). The supernatant was purified by centrifugation (10000 g, 30 min, 4 °C), and the protein was purified using the HisTrap HP, Heparin HP, and Superdex 200 pg columns (GE Healthcare) with an AKTA FPLC system (GE Healthcare) and elution buffer (30 mM Tris-HCl (pH 7.5), 150 mM NaCl, 5 mM β-mercaptoethanol, and 10% glycerol). The C-terminal 6xHis-tag and CPD tag were cleaved with a 200 μM phytic acid treatment15. The EedCpf1 mutant that contained the D880A substitution (pET22b-EedCpf1-CPD) was generated with a site-directed mutagenesis kit (Enzynomics) and was purified in the same way as the wild-type protein. The sequences of all bacterial expression plasmids can be found in Supplementary Table 1.
In vitro transcription of crRNAs
The targeting sequence consisted of 24 nucleotides, followed by an ‘TTTN’ sequence called the protospacer adjacent motif (PAM). For in vitro transcription, the template DNAs were amplified with the overlap PCR method. The amplified template DNA was purified with a commercial gel extraction kit (Bioneer). In vitro transcription was conducted with the purified DNA template using the MEGAshortscript T7 Transcription Kit (Invitrogen) according to the manufacturer’s instructions. The synthesized crRNAs were purified by ethanol precipitation.
In vitro nuclease activity assays
To determine the nuclease activity that targeted the pUC19 plasmid, purified EeCpf1 or EedCpf1 (160 nM) and crRNA (7.6 μM) were incubated at 37 °C for 5 min in reaction buffer (30 mM Tris-HCl (pH 7.5), 100 mM NaCl) with 1 mM MnCl2. The reaction was initiated by the addition of the pUC19 plasmid (200 ng) and was incubated at 37 °C for 20 min. The reaction was quenched by the addition of proteinase K (Enzynomics) and incubated at 37 °C for 10 min. All samples were analyzed on a 1% agarose gel. For the in vitro activity assay toward the IL2R-γ sequence, the region that contained four target sequences in the IL2R-γ gene was amplified by PCR. The amplified product was purified using a gel extraction kit and was used as a substrate for the IL2R-γ sequence targeting assay. The experiment was conducted following the same process as that of the nuclease activity assay with the pUC19 plasmid, except the amplified substrate was used instead of the plasmid.
Generation of mutant mice by injection of the EeCpf1/crRNA mixture
The care, use, and treatment of all mice in this study were in strict agreement with the Korean Ministry of Food and Drug Safety (MFDS) guidelines. Protocols were reviewed and approved by the Institutional Animal Care and Use Committee of the Korea Research Institute of Bioscience (KRIBB). Female C57BL/6J mice (6 weeks of age) were superovulated by intraperitoneal injection with 5 IU pregnant mare serum gonadotropin (PSMG, Sigma), followed 46 hours later by an injection of 5 IU human chorionic gonadotropin (hCG, Sigma). Immediately after the hCG injection, female mice were mated 1:1 with male mice (12 weeks of age) of the same strain with proven fertility. The animals were sacrificed 14 hours after hCG administration, and the oviducts were collected. The oocyte-cumulus complexes were released from the oviducts, and the embryos were transferred to microinjection dishes that contained M2 medium (Sigma) under mineral oil. The EeCpf1/crRNA reagent mixture was prepared by dilution of the components into distilled water to obtain the following concentrations: 0.6 µM EeCpf1 protein and 6.1 µM IL2R-γ crRNAs. The reagent mixture was introduced into the cytoplasm of the embryos by microinjection. The injected embryos were cultured in M16 medium (Sigma) under mineral oil. The surviving two-cell stage embryos were surgically implanted into the oviducts of pseudopregnant females.
Genomic sequence analysis
For PCR amplification, the embryos were lysed in 10 µl of blastocyst lysis buffer (100 mM Tris-HCl (pH 8.3), 100 mM KCl, 0.02% gelatin, 0.45% Tween 20, 10 mg/µl yeast tRNA and 20 mg/ml proteinase K). The samples were incubated at 56 °C for 10 min followed by 95 °C for 10 min and then stored at −4 °C. Four microliters of the crude samples was subjected to PCR amplification. The changes in the genomic DNA sequences of the blastocysts were analyzed by Sanger sequencing analysis (Bioneer, Korea) of a PCR fragment that was amplified from the IL2R-γ gene (primers used: FR, 5′-CAGCTCTTCAGGAACCCTACCAGTTTC-3′ and RP, 5′-CCCCCCCTTAACTGTTTAACCTCAGTC-3′).
Selection and analysis of off-target sites
Potential off-target sites were selected using Cas-OFFinder (http://www.rgenome.net/Cas-Offinder) with a criterion of less than two bulges and mismatches. On-target and potential off-target sites were amplified by nested PCR. Whether candidate off-target sites were mutated was determined using a T7EI digestion assay and Sanger sequencing.
Results
Characterization of the CRISPR/Cas system in Eubacterium eligens
The human gut-derived bacterium Eubacterium eligens has one CRISPR locus in the circular chromosome (2,144,190 bp) determined by the CRISPR database analysis. The CRISPR locus in E. eligens contains 36 bp of repeat sequences and 25–29 bp of spacers. When the ORFs near the CRISPR loci were analyzed, the type V system was located next to the cas1, cas2 and cpf1 (cas12a) genes (Fig. 1A). Interestingly, the cas1 gene in E. eligens, where the protein is expected to be involved in the adaptation stage of the CRISPR system, is significantly smaller than any other cas1 genes reported so far16. The repeat sequences are predicted to form a highly conserved crRNA scaffold in Cpf1 proteins, such as FnCpf1, AsCpf1, and LbCpf117 (Fig. 1B). The conservation of the stem-loop scaffolds indicates that the EeCpf1 protein may recognize the 5′ T-rich PAM sequence according to previous data11. Based on sequential alignment with three Cpf1 orthologs, the RuvC domain of EeCpf1 retains two essential catalytic residues (Asp880 and Glu965) that are conserved in the Cpf1 family (Fig. 1C). EeCpf1 showed an ~35% sequence homology with the reportedly editable mammalian gene Cpf1s.
In vitro DNA cleavage of EeCpf1
To characterize EeCpf1 for its nucleotide cleavage activity, we expressed and purified EeCpf1 proteins from E. coli and then reconstituted the Cpf1 ribonucleoproteins (RNPs) with in vitro-transcribed crRNAs. Previously, an in vitro PAM identification assay revealed that the PAM sequence is predominantly T-rich (5′-TTTN-3′) in EeCpf118. We used a double-stranded plasmid (pUC19) bearing the 5′-TTTN-3′ PAM as a DNA substrate and synthesized the crRNA that corresponded to a target in the plasmid (Fig. 2A). The in vitro DNA cleavage assay showed that EeCpf1 cleaved the target DNA of the plasmid in a crRNA-dependent manner to produce linear DNA (Fig. 2B). In the absence of crRNA, EeCpf1 produced a band (lane 7) that migrated with a pattern that corresponded to the pUC19 plasmid that was nicked by Nt.BspQI; this indicates that EeCpf1 can nick dsDNA in the absence of a crRNA. Since the nuclease activity of Cpf1 is known to be metal-dependent, we further determined the metal ion dependency of EeCpf1. The results showed that the metal ion Mn2+, as well as Mg2+, Ni2+ and Ca2+ but not Cu2+ or Zn2+, enabled EeCpf1 to cleave the target DNA substrate (Fig. 2C). We generated an active-site mutant of the RuvC domain that contained a D880A substitution and examined its effect on DNA cleavage activity. The EeCpf1 (D880A) mutant abolished both nick and double-stranded DNA cleavage activity (Fig. 2D). These data demonstrate that EeCpf1 shares the same crRNA-mediated DNA cleavage feature as those observed in other Type V systems.
EeCpf1 can edit the mammalian genomes of mouse cells
Next, we explored the capacity of the EeCpf1 protein to cleave endogenous genomic loci in mammalian cells. We expressed and purified the human codon-optimized EeCpf1 proteins from E. coli. Two nuclear localization signals (NLSs) were attached to each N- and C-terminus of EeCpf1 to ensure their nuclear compartmentalization in mammalian cells. The interleukin 2 receptor gamma (IL2R-γ), an essential enzyme in lymphocyte development and one of the candidate genes for the production of immunodeficient mice, was designated as a target. Using Cas-OFFinder and off-target analysis, four sites (two sites in exon 3 and one each in exons 4 and 5) with low sequence homologies to other sequences were selected within the IL2R-γ gene to avoid off-target mutagenesis (Supplementary Fig. 1), and the corresponding four crRNAs were designed19 (Fig. 3A). Previously, the extension of crRNA was reported to enhance the gene editing efficiency of AsCpf1 inside cells20. To empower the gene editing efficiency of EeCpf1, we designed each crRNA with the addition of a U-rich tail (U4AU6) to the 3′-end of the RNA. When the activity of EeCpf1 was measured in vitro, four target sites in the IL2R-γ gene that were generated by PCR were all specifically cleaved by the preassembled EeCpf1 RNPs (Fig. 3B). Subsequently, we microinjected the recombinant EeCpf1 protein and a mixture of four crRNAs into one-cell-stage embryos, and we cultured the mouse embryos in vitro and obtained blastocysts. Sanger sequencing results showed that five out of 35 (15%) blastocysts carried mutations in the IL2R-γ gene. In exon 3, a 20 bp sequence was deleted by overlapping the targets of crRNA1 and/or crRNA2 with a mutation efficiency of 6%. No mutation was found in exon 4 that was generated by crRNA3. The target site in exon 5 showed a 1 bp deletion and a 1 bp change with a 10% efficiency (Fig. 3C). The target specificity of EeCpf1 was evaluated for the five genome-wide off-target sites with mismatches ranging from 3- to 10-bp (Supplementary Fig. 2A). The results showed no detectable off-target effects in IL2R-γ-mutated blastocysts from the T7E1 assay (Supplementary Fig. 2B) or Sanger sequencing analyses (Supplementary Fig. 3A–C), which is in agreement with the low off-target effects of Cpf1 proteins in mice19. Together, the DNA sequencing charts exhibited five kinds of insertion and deletion mutations at the three target sites by EeCpf1 to yield the mutagenesis embryo of the IL2R-γ gene.
To produce IL2R-γ knockout mice, we microinjected a mixture of two crRNAs (target1/target2) and the Eecpf1 protein into 125 one-cell-stage embryos and obtained 76 two-cell-stage embryos (survival rate 60.8%). The 76 surviving embryos were transferred into pseudopregnant C57BL/6J female mice, and nine live animals were born. T7EI-based genotyping analyses identified one mutant (11%) out of nine F0 generation mice (Fig. 4A). Sanger sequencing analyses showed that the F0 heterozygote carried mutation sites with 4 bp deletions and 3 bp changes, which were consistently observed in different tissues (Fig. 4B), indicating no mosaicism among those three F0 biopsies (Fig. 4B,C). These results demonstrate that EeCpf1 could enable genome editing in mammalian cells.
Discussion
We presented the in vivo DNA cleavage activity of Cpf1 from E. eligens for the first time; we used this activity for gene editing to produce knockout mice. Among the four targeted sites in the IL2R-γ gene, which were all specifically cleaved by the EcCpf1 RNP complex in vitro, three sites were successfully mutated at the target loci in the mouse blastocysts, while one site was not mutated at the target loci. We assumed that there may be an epigenetic modification or chromatin structural change around the target region that impaired the accessibility to the target site by EeCpf121. Alternatively, the secondary structure of crRNA3 could have affected the formation of the EeCpf1 RNP complex22. Conclusively, the engineering of crRNA by the addition of a U-rich tail to the 3′-end of the RNA and multiple site targeting by crRNAs is an effective method to induce mutagenesis in genome nucleotides by EeCpf1. Recent reports have proposed that the 3′-overhang of the crRNA may have contributed to the effective binding of the RNA to the Cpf1 protein, yielding stable formation of the ribonucleoprotein complexes inside the cells20. In addition, the EeCpf1/engineered crRNA RNP complex did not show cytotoxicity or off-target effects in mouse embryos. In this respect, it will be interesting to know whether some Cpf1 orthologs, for which activity was only reported in vitro, will exhibit specific in vivo DNA cleavage activities if the same method from this study is applied. Notably, the weak activity of FnCpf1 was observed in mammalian cells, whereas FnCpf1 exhibits robust activity in plant cells23; this indicates that Cpf1 orthologs may have different activities depending on the organism. Therefore, the availability of additional Cpf1 orthologs with specific target cleavage activities that were presented in this study will further expand the genome editing options for a wide range of organisms. Furthermore, E. eligens is a common gut Firmicute bacterium and a major contributor to the gut microbiome24. Since perturbation of the microbiota and metabolome has been associated with various diseases and metabolic conditions, targeted manipulation of the microbiome by Cpf1 originating from E. eligens would be one of the potential therapeutic applications of the EcCpf1 protein.
References
Horvath, P. & Barrangou, R. CRISPR/Cas, the immune system of bacteria and archaea. Science 327, 167–170 (2010).
Jore, M. M. et al. Structural basis for CRISPR RNA-guided DNA recognition by Cascade. Nat Struct Mol Biol 18, 529–536 (2011).
Park, K. H. et al. RNA activation-independent DNA targeting of the Type III CRISPR-Cas system by a Csm complex. EMBO Rep 18, 826–840 (2017).
Shmakov, S. et al. Discovery and Functional Characterization of Diverse Class 2 CRISPR-Cas Systems. Molecular cell 60, 385–397 (2015).
Makarova, K. S. et al. An updated evolutionary classification of CRISPR-Cas systems. Nat Rev Microbiol 13, 722–736 (2015).
Jinek, M. et al. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science 337, 816–821 (2012).
Hsu, P. D., Lander, E. S. & Zhang, F. Development and applications of CRISPR-Cas9 for genome engineering. Cell 157, 1262–1278 (2014).
Zetsche, B. et al. Cpf1 is a single RNA-guided endonuclease of a class 2 CRISPR-Cas system. Cell 163, 759–771 (2015).
Zetsche, B. et al. Multiplex gene editing by CRISPR-Cpf1 using a single crRNA array. Nat Biotechnol 35, 31–34 (2017).
Kim, D. et al. Genome-wide analysis reveals specificities of Cpf1 endonucleases in human cells. Nat Biotechnol 34, 863–868 (2016).
Teng, F. et al. Enhanced mammalian genome editing by new Cas12a orthologs with optimized crRNA scaffolds. Genome Biol 20, 15 (2019).
Toth, E. et al. Mb- and FnCpf1 nucleases are active in mammalian cells: activities and PAM preferences of four wild-type Cpf1 nucleases and of their altered PAM specificity variants. Nucleic acids research 46, 10272–10285 (2018).
Kleinstiver, B. P. et al. Genome-wide specificities of CRISPR-Cas Cpf1 nucleases in human cells. Nat Biotechnol 34, 869–874 (2016).
Jiang, Y. et al. CRISPR-Cpf1 assisted genome editing of Corynebacterium glutamicum. Nat Commun 8, 15179 (2017).
Shen, A. et al. Simplified, enhanced protein purification using an inducible, autoprocessing enzyme tag. PLoS One 4, e8119 (2009).
Mohanraju, P. et al. Diverse evolutionary roots and mechanistic variations of the CRISPR-Cas systems. Science 353, aad5147 (2016).
Li, B. et al. Engineering CRISPR-Cpf1 crRNAs and mRNAs to maximize genome editing efficiency. Nat Biomed Eng 1 (2017).
Kim, S. K. et al. Efficient Transcriptional Gene Repression by Type V-A CRISPR-Cpf1 from Eubacterium eligens. ACS Synth Biol 6, 1273–1282 (2017).
Cho, S. W. et al. Analysis of off-target effects of CRISPR/Cas-derived RNA-guided endonucleases and nickases. Genome Res 24, 132–141 (2014).
Bin Moon, S. et al. Highly efficient genome editing by CRISPR-Cpf1 using CRISPR RNA with a uridinylate-rich 3′-overhang. Nat Commun 9, 3651 (2018).
Pulecio, J., Verma, N., Mejia-Ramirez, E., Huangfu, D. & Raya, A. CRISPR/Cas9-Based Engineering of the Epigenome. Cell Stem Cell 21, 431–447 (2017).
Park, H. M. et al. Extension of the crRNA enhances Cpf1 gene editing in vitro and in vivo. Nat Commun 9, 3313 (2018).
Endo, A., Masafumi, M., Kaya, H. & Toki, S. Efficient targeted mutagenesis of rice and tobacco genomes using Cpf1 from Francisella novicida. Sci Rep 6, 38169 (2016).
Chung, W. S. et al. Modulation of the human gut microbiota by dietary fibres occurs at the species level. BMC Biol 14, 3 (2016).
Acknowledgements
This research was partly supported by the Marine Biotechnology Program of the Korea Institute of Marine Science and Technology Promotion (KIMST), the Ministry of Oceans and Fisheries (MOF) (No. 20170488), the National Research Fund (NRF-2018R1A2A2A05021648) and the KRIBB Research Initiative.
Author information
Authors and Affiliations
Contributions
W.C.A., K.H.P. and E.J.W. conceived the idea. D.Y.Y., Y.S.K. and B.H.O. provided scientific suggestions. W.C.A., K.H.P., H.N.S., Y.A. and S.J.L. performed protein purification and In vitro experiments. I.S.B., M.J. and K.W.Y. performed In vivo experiments. The manuscript was written by K.H.P. and E.J.W.
Corresponding author
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Ahn, WC., Park, KH., Bak, I.S. et al. In vivo genome editing using the Cpf1 ortholog derived from Eubacterium eligens. Sci Rep 9, 13911 (2019). https://doi.org/10.1038/s41598-019-50423-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-019-50423-6
This article is cited by
-
Biochemical characterization of the two novel mgCas12a proteins from the human gut metagenome
Scientific Reports (2022)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.