A large number of genetic variants have been associated with human diseases. However, the lack of a genetic diversification approach has impeded our ability to interrogate functions of genetic variants in mammalian cells. Current screening methods can only be used to disrupt a gene or alter its expression. Here we report the fusion of activation-induced cytidine deaminase (AID) with nuclease-inactive clustered regularly interspaced short palindromic repeats (CRISPR)-associated protein 9 (dCas9) for efficient genetic diversification, which enabled high-throughput screening of functional variants. Guided by single guide (sg)RNAs, dCas9-AID-P182X (AIDx) directly changed cytidines or guanines to the other three bases independent of AID hotspot motifs, generating a large repertoire of variants at desired loci. Coupled with a uracil-DNA glycosylase inhibitor, dCas9-AIDx converted targeted cytidines specifically to thymines, creating specific point mutations. By targeting BCR-ABL with dCas9-AIDx, we efficiently identified known and new mutations conferring imatinib resistance in chronic myeloid leukemia cells. Thus, targeted AID-mediated mutagenesis (TAM) provides a forward genetic tool to screen for gain-of-function variants at base resolution.
This is a preview of subscription content
Subscribe to Journal
Get full journal access for 1 year
only $9.92 per issue
All prices are NET prices.
VAT will be added later in the checkout.
Tax calculation will be finalised during checkout.
Get time limited or full article access on ReadCube.
All prices are NET prices.
Sequence Read Archive
Bell, R.J. et al. Cancer. The transcription factor GABP selectively binds and activates the mutant TERT promoter in cancer. Science 348, 1036–1039 (2015).
Daub, H., Specht, K. & Ullrich, A. Strategies to overcome resistance to targeted protein kinase inhibitors. Nat. Rev. Drug Discov. 3, 1001–1010 (2004).
Nachman, M.W. & Crowell, S.L. Estimate of the mutation rate per nucleotide in humans. Genetics 156, 297–304 (2000).
Takahasi, K.R., Sakuraba, Y. & Gondo, Y. Mutational pattern and frequency of induced nucleotide changes in mouse ENU mutagenesis. BMC Mol. Biol. 8, 52 (2007).
Di Noia, J.M. & Neuberger, M.S. Molecular mechanisms of antibody somatic hypermutation. Annu. Rev. Biochem. 76, 1–22 (2007).
Peled, J.U. et al. The biochemistry of somatic hypermutation. Annu. Rev. Immunol. 26, 481–511 (2008).
Yoshikawa, K. et al. AID enzyme-induced hypermutation in an actively transcribed gene in fibroblasts. Science 296, 2033–2036 (2002).
Mojica, F.J., Díez-Villaseñor, C., García-Martínez, J. & Soria, E. Intervening sequences of regularly spaced prokaryotic repeats derive from foreign genetic elements. J. Mol. Evol. 60, 174–182 (2005).
Pourcel, C., Salvignol, G. & Vergnaud, G. CRISPR elements in Yersinia pestis acquire new repeats by preferential uptake of bacteriophage DNA, and provide additional tools for evolutionary studies. Microbiology 151, 653–663 (2005).
Sorek, R., Kunin, V. & Hugenholtz, P. CRISPR--a widespread system that provides acquired resistance against phages in bacteria and archaea. Nat. Rev. Microbiol. 6, 181–186 (2008).
Hsu, P.D., Lander, E.S. & Zhang, F. Development and applications of CRISPR-Cas9 for genome engineering. Cell 157, 1262–1278 (2014).
Doudna, J.A. & Charpentier, E. Genome editing. The new frontier of genome engineering with CRISPR-Cas9. Science 346, 1258096 (2014).
Mali, P., Esvelt, K.M. & Church, G.M. Cas9 as a versatile tool for engineering biology. Nat. Methods 10, 957–963 (2013).
Jinek, M. et al. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science 337, 816–821 (2012).
Barreto, V., Reina-San-Martin, B., Ramiro, A.R., McBride, K.M. & Nussenzweig, M.C. C-terminal deletion of AID uncouples class switch recombination from somatic hypermutation and gene conversion. Mol. Cell 12, 501–508 (2003).
Shinkura, R. et al. Separate domains of AID are required for somatic hypermutation and class-switch recombination. Nat. Immunol. 5, 707–712 (2004).
Martin, A. et al. Activation-induced cytidine deaminase turns on somatic hypermutation in hybridomas. Nature 415, 802–806 (2002).
Qi, L.S. et al. Repurposing CRISPR as an RNA-guided platform for sequence-specific control of gene expression. Cell 152, 1173–1183 (2013).
Zheng, N.Y., Wilson, K., Jared, M. & Wilson, P.C. Intricate targeting of immunoglobulin somatic hypermutation maximizes the efficiency of affinity maturation. J. Exp. Med. 201, 1467–1478 (2005).
Dörner, T., Foster, S.J., Farner, N.L. & Lipsky, P.E. Somatic hypermutation of human immunoglobulin heavy chain genes: targeting of RGYW motifs on both DNA strands. Eur. J. Immunol. 28, 3384–3396 (1998).
Di Noia, J. & Neuberger, M.S. Altering the pathway of immunoglobulin hypermutation by inhibiting uracil-DNA glycosylase. Nature 419, 43–48 (2002).
Mol, C.D. et al. Crystal structure of human uracil-DNA glycosylase in complex with a protein inhibitor: protein mimicry of DNA. Cell 82, 701–708 (1995).
Jore, M.M. et al. Structural basis for CRISPR RNA-guided DNA recognition by Cascade. Nat. Struct. Mol. Biol. 18, 529–536 (2011).
Druker, B.J. & Lydon, N.B. Lessons learned from the development of an abl tyrosine kinase inhibitor for chronic myelogenous leukemia. J. Clin. Invest. 105, 3–7 (2000).
Nardi, V., Azam, M. & Daley, G.Q. Mechanisms and implications of imatinib resistance mutations in BCR-ABL. Curr. Opin. Hematol. 11, 35–43 (2004).
Gambacorti-Passerini, C.B. et al. Molecular mechanisms of resistance to imatinib in Philadelphia-chromosome-positive leukaemias. Lancet Oncol. 4, 75–85 (2003).
Nagar, B. et al. Structural basis for the autoinhibition of c-Abl tyrosine kinase. Cell 112, 859–871 (2003).
Gorre, M.E. et al. Clinical resistance to STI-571 cancer therapy caused by BCR-ABL gene mutation or amplification. Science 293, 876–880 (2001).
Branford, S. et al. High frequency of point mutations clustered within the adenosine triphosphate-binding region of BCR/ABL in patients with chronic myeloid leukemia or Ph-positive acute lymphoblastic leukemia who develop imatinib (STI571) resistance. Blood 99, 3472–3475 (2002).
Fischlechner, M. et al. Evolution of enzyme catalysts caged in biomimetic gel-shell beads. Nat. Chem. 6, 791–796 (2014).
Tawfik, D.S. & Griffiths, A.D. Man-made cell-like compartments for molecular evolution. Nat. Biotechnol. 16, 652–656 (1998).
Yi, L. et al. Engineering of TEV protease variants by yeast ER sequestration screening (YESS) of combinatorial libraries. Proc. Natl. Acad. Sci. USA 110, 7229–7234 (2013).
Romanini, D.W., Peralta-Yahya, P., Mondol, V. & Cornish, V.W. A Heritable Recombination system for synthetic Darwinian evolution in yeast. ACS Synth. Biol. 1, 602–609 (2012).
Esvelt, K.M., Carlson, J.C. & Liu, D.R. A system for the continuous directed evolution of biomolecules. Nature 472, 499–503 (2011).
Yu, K., Roy, D., Bayramyan, M., Haworth, I.S. & Lieber, M.R. Fine-structure analysis of activation-induced deaminase accessibility to class switch region R-loops. Mol. Cell. Biol. 25, 1730–1736 (2005).
Schrader, C.E. et al. Mutations occur in the Ig Smu region but rarely in Sgamma regions prior to class switch recombination. EMBO J. 22, 5893–5903 (2003).
Yu, K., Chedin, F., Hsieh, C.L., Wilson, T.E. & Lieber, M.R. R-loops at immunoglobulin class switch regions in the chromosomes of stimulated B cells. Nat. Immunol. 4, 442–451 (2003).
Komor, A.C., Kim, Y.B., Packer, M.S., Zuris, J.A. & Liu, D.R. Programmable editing of a target base in genomic DNA without double-stranded DNA cleavage. Nature 533, 420–424 (2016).
Tewhey, R. et al. Direct identification of hundreds of expression-modulating variants using a multiplexed reporter assay. Cell 165, 1519–1529 (2016).
Findlay, G.M., Boyle, E.A., Hause, R.J., Klein, J.C. & Shendure, J. Saturation editing of genomic regions by multiplex homology-directed repair. Nature 513, 120–123 (2014).
Wang, T., Wei, J.J., Sabatini, D.M. & Lander, E.S. Genetic screens in human cells using the CRISPR-Cas9 system. Science 343, 80–84 (2014).
Chang, X. & Ma, Y.Q. Using targeted AID-mediated mutagenesis (TAM) to diversify a genomic locus in mammalian cells. Protoc. Exch. http://dx.doi.org/10.1038/protex.2016.062 (2016).
Pinello, L. et al. Analyzing CRISPR genome-editing experiments with CRISPResso. Nat. Biotechnol. 34, 695–697 (2016).
We thank B. Li and C. Jiang for suggestions on data analysis; H. Zhang and A. Chen for assistance with high-throughput sequencing. This work was supported by 31370858 from National Natural Science Foundation of China (NSFC), 2014CB943600 from Ministry of Science and Technology (China) (MOST), 13PJ1409300 from Shanghai Municipal Science and Technology Committee (SMSTC), and National Thousand Talents Program for Distinguished Young Scholars (China).
X.C. has filed intellectual protection of using dCas9-AIDx for genetic diversification and protein evolution (application no. CN201610423512.8).
Integrated supplementary information
Supplementary Figure 1 dCas9-AIDx required enzymatic activity of AID for correcting a premature stop codon in GFP.
a. GFP sequence with a premature stop codon. The premature stop codon is labeled in red with yellow highlight. sgRNAs (indicated by arrows) and PAM sequences (in blue) are specified.
b. dCas9-AIDx requires enzymatic activity of AID. sgRNAs targeting GFP were co-transfected with Cas9, dCas9, dCas9-AID-P182X (E58Q) or dCas9-AID-P182X. Seven days after transfection, the cells were analyzed for GFP expression by flow cytometry.
a. Design and sequences of sgRNAs targeting the AAVS1 locus (Hg19, chr19: 55626975- 55627279).
b-e. As in Fig. 2, dCas9-AIDx was transfected into 293T cells with three sgRNAs targeting the AAVS1 locus (e). Seven days after transfection, AAVS1 locus was PCR amplified and subjected to Miseq sequencing and mutations in the AAVS1 locus was analyzed. Substitution frequencies were calculated as reads with substitutions/ total reads covering the indicated bases (%). Parental 293T cells (b), cells transfected with AIDx(c), and cells transfected with dCas9-AIDx together with sgRNAs against GFP were used as control(d). sgRNA targeted regions were marked with gray boxes, and arrows indicate the location and direction of the sgRNAs-targeted DNA. Note, the y-axis is in log scale. Data are representative of three independent experiments.
Note, due to the lack of AID hotspot motifs in the AAVS1 locus, AIDx itself failed to induce mutations above the background.
The reporter cells were transfected with dCas9-AIDx together with either sgRNAs targeting GFP or sgRNAs targeting AAVS1. DNA sequences being 250-400bp downstream of the start codon of GFP (GFP) or within a 100 bp region of the AAVS1 locus (AAVS1) were considered as sgRNA-targeted regions respectively for the analysis.
a. The percentages of C /G nucleotides being mutated (substitution frequency >=0.1%) out of total C/ G were summarized from three independent experiments.
b. Average mutation rates (per base per cell cycle) of the C/G nucleotides in the sgRNA-targeted GFP or AAVS1 DNA were determined. Data are the summary of three independent experiments, and error bars show the standard deviation of the mean. **, p<0.01 in two-tailed Student’s t test.
c. The number of mutation combinations containing one or two mutant bases within the targeted region (GFP, 250 to 400; AAVS1, 201 to 300) was determined in 293T cells transfected with dCas9-AIDx and pooled sgRNAs targeting GFP or AAVS1. Data are the summary of two independent experiments, and error bars show the range. *, p<0.05 in two-tailed Student’s t test.
Supplementary Figure 4 Synergistic effects were observed by multiple sgRNAs targeting the same DNA window in inducing SNVs.
a. WT Cas9 but not dCas9-AIDx induces indels. The reporter cells were transfected with pooled sgRNA against GFP together with WT Cas9 or dCas9-AIDx. The indels in the GFP DNA were analyzed by CRISPResso. Data are summary of three independent experiments. Error bars stand for the standard deviation of the mean.
b,c. Synergistic effects of multiple sgRNAs targeting the same DNA. The reporter cells were transfected with dCas9-AIDx together with the pool of four sgRNAs(b) or one sgRNA(c). The substitution frequencies were calculated as reads with substitutions/total reads covering the base (%), and sgRNA targeted regions are shaded in gray. The x-axis indicates the relative position (bp) to the start codon of the GFP DNA. Note, the scale of y-axis in b (pooled sgRNAs) is 10-20 times greater than it in c (single sgRNA). Data are representative of three independent experiments.
d. dCas9-AIDx converts C or G into the other three bases. Proportions of each substitution out of total substitution for each nucleotide (% of total substitution) were summarized from three independent experiments. Error bars stand for the standard deviation of the mean. **, p<0.01; *, p<0.05 in two tailed Student’s t test.
As in Figure 2, the reporter cells were transfected with AIDx (a,c,e) or dCas9-AIDx together pooled-sgRNAs against GFP or AAVS1 (b,d,f), and sequence variants in the c-Myc (a,b Hg19 chr8: 128748974 -128749156), Bcl6 (c,d Hg19 chr3: 187462766 - 187463044) or PIM1 (e,f Hg19 chr6: 37138381- 37138554) were determined by sequencing. Note AIDx, but not dCas9-AIDx itself introduced a few mutations in these sites. Data are representative of three independent experiments.
g,h. The reporter cells were transfected with AIDx or dCas9-AIDx with pooled sgRNAs against AAVS1, and mutations in the GFP DNA were analyzed. Data are representative of three independent experiments.
As in Fig. 2, HEK293T cells were transfected with dCas9-AIDx and pooled sgRNAs against GFP (a) or against AAVS1 (b). The off-target sites of indicated sgRNAs were predicted as described52, and top three off-target sites of each sgRNAs were amplified and sequenced. The mutation frequencies of nucleosides in the protospacer were determined. The mismatched and mutated bases were labeled with blue or red respectively. Data shown are average of two independent experiments.
Supplementary Figure 7 dCas9-AIDx preferentially mutated cytidines in protospacers when paired with a single sgRNA.
HEK293T reporter cells were transfected with dCas9-AIDx and Ugi expression constructs, together with individual sgRNA against GFP (a) or individual sgRNA against AAVS1 (b). Seven days after transfection, amplified DNA from GFP or AAVS1 locus was sequenced and substitution frequency was calculated as reads with substitutions/total reads covering the base (%). sgRNA targeted sequences were highlighted with gray boxes and marked with arrows for direction. Four nucleotides with the highest substitution frequencies were indicated on the sequence with grey boxes. The gradient of gray indicates relative substitution frequencies. The most frequent substitution with each individual sgRNA is also indicated. Data are representative of two independent experiments.
a. dCas9-AIDx induces nucleotide substitutions predominantly within protospacer with highest activity at -12 and -16 bps upstream of the PAM sequence. 293T cells were transfected with dCas9-AIDx, Ugi and a single sgRNA, and substitution frequency was calculated as reads with C to T substitution/ total reads covering the base (%) within a -20bp to + 50 bp window relative to the PAM sequence. The x-axis indicates the relative location to PAM sequence (bp). The aggregated substitution frequency is denoted on the y-axis based on 12 individual sgRNAs (4 against GFP, 3 against AAVS1 and 5 against ABL kinase). Data are representative of two independent experiments.
b. dCas9-AIDx has progressive activity beyond protospacer when combined with multiple sgRNAs targeting the same region of DNA. The reporter cells were transfected with dCas9-AIDx, Ugi and pooled sgRNAs against GFP as in Fig. 2. Substitution frequency was calculated as reads with substitutions/total reads covering the base (%). Note, almost all the C/G nucleotides were mutated (substitution frequency >0.1%, dash line in the figure) from the 5’ of the first sgRNA to the 3’ end of the fourth sgRNA. Data are representative of two independent experiments.
c. as in b, the reporter cells were transfected with dCas9-AIDx, Ugi and pooled sgRNAs against AAVS1. The substitution frequency of the target DNA was determined as above. Data are representative of two independent experiments.
(a) Domain structure of ABL kinase; (b). sgRNAs were designed to target Exon 6 ±60 bps upstream and downstream. Exon sequence is capitalized and marked in gray. (c). cDNA of exon 6 in ABL gene was amplified and sequenced from DMSO treated K562 cells. The substitution frequency of each base was calculated as reads with substitutions/ total reads covering the base (%).
Supplementary Figure 10 Abl mutations identified in the TAM screening confer Imatinib resistance in K562 cells.
(a). Summary of the point mutations identified in two independent Imatinib-resistance screenings. The nucleotide and amino acid substitution of each mutation are presented together with their substitution frequencies. The mutations identified in both experiments were highlighted with yellow. Note C956 were mutated to either G or T in the first screening, which leads to T319S or T319I mutation respectively.
(b,c) The indicated ABL mutations were introduced into an MSCV-BCR/ABL-IRES-GFP vector, which was used to transduce K562 cells. The GFP+ cells were sorted and treated with either 4μM (b) or 10μM (c) Imatinib. The cells were counted everyday after the Imatinib treatment. Data are representative of two independent experiments.
Supplementary Figure 11 Cas9-nickase (nCas9)-AIDx enhanced the mutagenesis efficiency with single sgRNA but caused extensive indels with pooled sgRNAs.
a. nCas9 with AIDx enhanced nucleoside conversion efficiency. Reporter cells were transfected with a single sgRNA against GFP together with either dCas9-AIDx+Ugi (left) or nCas9-AIDx+Ugi (right). Substitution frequencies of each base were calculated.
b,c. nCas9-AIDx failed to turn on GFP. The reporter cells were transfected with nCas9-AIDx (right) together with AAVS1 sgRNA (Ctrl sgRNA) or pooled GFP sgRNAs. GFP expression was determined seven days after transfection. dCas9-AIDx (left) was shown as positive controls. b, Representative FACS plots; c, summary of three independent experiments.
d. nCas9-AIDx induced extensive Indels when combined with pooled sgRNAs against GFP. As in b, Indels in the GFP locus were determined by high-throughput sequencing. Summary of two independent experiments. Error bars stand for the standard deviation of the mean. **, p<0.01 in two tailed Student’s t test.
293T cells were transfected with dCas9-AIDx together with 10 sgRNAs (a), 7 sgRNAs (b), 5 sgRNAs (c) or 3 sgRNAs (d) targeting a 100bp window (201-300bp) within the exon 6 of ABL kinase. In the case of 10 sgRNAs, 3 sgRNAs targeting adjacent to this region are included, since only 7 sgRNAs can be designed within the 201-300 bp window. Substitution frequencies were calculated as reads with substitutions/ total reads covering the indicated bases (%). sgRNA targeted regions were marked with gray boxes, and arrows indicate the location and direction of the sgRNAs-targeted DNA. Data are representative of two independent experiments.
(e) As in Fig. S3a, percentages of mutated C/G nucleosides were calculated and compared among cells receiving indicated number of sgRNAs.
(f), substitution frequencies of mutant C/Gs were plotted and compared ompared among cells receiving indicated number of sgRNAs.
(g) As in Fig.S3c the number of mutation combinations within the targeted region (201 to 300) was determined in 293T cells transfected with dCas9-AIDx and indicated number of pooled sgRNAs targeting the ABL kinase.
Data are summary of two independent experiments. Error bars stand for the standard deviation of the mean. *, p<0.05 in two tailed Student’s t test.
About this article
Cite this article
Ma, Y., Zhang, J., Yin, W. et al. Targeted AID-mediated mutagenesis (TAM) enables efficient genomic diversification in mammalian cells. Nat Methods 13, 1029–1035 (2016). https://doi.org/10.1038/nmeth.4027
Development of an efficient and precise adenine base editor (ABE) with expanded target range in allotetraploid cotton (Gossypium hirsutum)
BMC Biology (2022)
Chinese Medicine (2022)
Nature Reviews Cancer (2022)
Science China Life Sciences (2022)
Current Genetics (2022)