Letter | Published:

Computational redesign of endonuclease DNA binding and cleavage specificity

Naturevolume 441pages656659 (2006) | Download Citation



The reprogramming of DNA-binding specificity is an important challenge for computational protein design that tests current understanding of protein–DNA recognition, and has considerable practical relevance for biotechnology and medicine1,2,3,4,5,6. Here we describe the computational redesign of the cleavage specificity of the intron-encoded homing endonuclease I-MsoI7 using a physically realistic atomic-level forcefield8,9. Using an in silico screen, we identified single base-pair substitutions predicted to disrupt binding by the wild-type enzyme, and then optimized the identities and conformations of clusters of amino acids around each of these unfavourable substitutions using Monte Carlo sampling10. A redesigned enzyme that was predicted to display altered target site specificity, while maintaining wild-type binding affinity, was experimentally characterized. The redesigned enzyme binds and cleaves the redesigned recognition site 10,000 times more effectively than does the wild-type enzyme, with a level of target discrimination comparable to the original endonuclease. Determination of the structure of the redesigned nuclease-recognition site complex by X-ray crystallography confirms the accuracy of the computationally predicted interface. These results suggest that computational protein design methods can have an important role in the creation of novel highly specific endonucleases for gene therapy and other applications.

Access optionsAccess options

Rent or Buy article

Get time limited or full article access on ReadCube.


All prices are NET prices.


  1. 1

    Uil, T. G., Haisma, H. J. & Rots, M. G. Therapeutic modulation of endogenous gene function by agents with designed DNA-sequence specificities. Nucleic Acids Res. 31, 6064–6078 (2003)

  2. 2

    Bibikova, M. et al. Stimulation of homologous recombination through targeted cleavage by chimeric nucleases. Mol. Cell. Biol. 21, 289–297 (2001)

  3. 3

    Porteus, M. H. & Baltimore, D. Chimeric nucleases stimulate gene targeting in human cells. Science 300, 763 (2003)

  4. 4

    Wickelgren, I. Molecular biology. Spinning junk into gold. Science 300, 1646–1649 (2003)

  5. 5

    Stoddard, B. L. Homing endonuclease structure and function. Q. Rev. Biophys. 38, 1–47 (2005)

  6. 6

    Urnov, F. D. et al. Highly efficient endogenous human gene correction using designed zinc-finger nucleases. Nature 435, 646–651 (2005)

  7. 7

    Lucas, P., Otis, C., Mercier, J. P., Turmel, M. & Lemieux, C. Rapid evolution of the DNA-binding site in LAGLIDADG homing endonucleases. Nucleic Acids Res. 29, 960–969 (2001)

  8. 8

    Rohl, C. A., Strauss, C. E., Misura, K. M. & Baker, D. Protein structure prediction using Rosetta. Methods Enzymol. 383, 66–93 (2004)

  9. 9

    Havranek, J. J., Duarte, C. M. & Baker, D. A simple physical model for the prediction and design of protein–DNA interactions. J. Mol. Biol. 344, 59–70 (2004)

  10. 10

    Voigt, C. A., Gordon, D. B. & Mayo, S. L. Trading accuracy for speed: A quantitative comparison of search algorithms in protein sequence design. J. Mol. Biol. 299, 789–803 (2000)

  11. 11

    Kono, H. & Sarai, A. Structure-based prediction of DNA target sites by regulatory proteins. Proteins 35, 114–131 (1999)

  12. 12

    Pabo, C. O. & Nekludova, L. Geometric analysis and comparison of protein–DNA interfaces: why is there no simple code for recognition? J. Mol. Biol. 301, 597–624 (2000)

  13. 13

    Luscombe, N. M., Laskowski, R. A. & Thornton, J. M. Amino acid-base interactions: a three-dimensional analysis of protein–DNA interactions at an atomic level. Nucleic Acids Res. 29, 2860–2874 (2001)

  14. 14

    Morozov, A. V., Havranek, J. J., Baker, D. & Siggia, E. D. Protein–DNA binding specificity predictions with structural models. Nucleic Acids Res. 33, 5781–5798 (2005)

  15. 15

    Seligman, L. M. et al. Mutations altering the cleavage specificity of a homing endonuclease. Nucleic Acids Res. 30, 3870–3879 (2002)

  16. 16

    Chevalier, B., Turmel, M., Lemieux, C., Monnat, R. J. Jr & Stoddard, B. L. Flexible DNA target site recognition by divergent homing endonuclease isoschizomers I-CreI and I-MsoI. J. Mol. Biol. 329, 253–269 (2003)

  17. 17

    Heath, P. J., Stephens, K. M., Monnat, R. J. Jr & Stoddard, B. L. The structure of I–Crel, a group I intron-encoded homing endonuclease. Nature Struct. Biol. 4, 468–476 (1997)

  18. 18

    Seeman, N. C., Rosenberg, J. M. & Rich, A. Sequence-specific recognition of double helical nucleic acids by proteins. Proc. Natl Acad. Sci. USA 73, 804–808 (1976)

  19. 19

    Sussman, D. et al. Isolation and characterization of new homing endonuclease specificities at individual target site positions. J. Mol. Biol. 342, 31–41 (2004)

  20. 20

    Doyon, J. B., Pattanayak, V., Meyer, C. B. & Liu, D. R. Directed evolution and substrate specificity profile of homing endonuclease I-SceI. J. Am. Chem. Soc. 128, 2477–2484 (2006)

  21. 21

    Gouble, A. et al. Efficient in toto targeted recombination in mouse liver by meganuclease-induced double-strand break. J. Gene Med. published online 13 February 2006 (doi:10.1002/jgm.879) (2006)

  22. 22

    Arnould, S. et al. Engineering of large numbers of highly specific homing endonucleases that induce recombination on novel DNA targets. J. Mol. Biol. 355, 443–458 (2006)

  23. 23

    Dunbrack, R. L. Jr & Cohen, F. E. Bayesian statistical analysis of protein side-chain rotamer preferences. Protein Sci. 6, 1661–1681 (1997)

  24. 24

    Onufriev, A., Bashford, S. D. & Case, D. A. Exploring protein native states and large-scale conformational changes with a modified generalized Born model. Proteins 55, 383–394 (2004)

  25. 25

    Press, W. H., Flannery, B. P., Teukolsky, S. A. & Vetterling, W. T. Numerical Recipes in C: The Art of Scientific Computing (Cambridge Univ. Press, New York, 1992)

  26. 26

    Brunger, A. T. et al. Crystallography and NMR system: a new software suite for macromolecular structure determination. Acta Crystallogr. D 54, 905–921 (1998)

Download references


We thank J. L. Eklund for assistance with binding assays, and B. W. Shen for assistance with data collection and refinement. This work was supported by fellowships from the Jane Coffin Childs Memorial Fund (J.J.H.), the National Science Foundation (C.M.D.), and grants from the National Institute of Health (R.J.M. and B.L.S.), the Howard Hughes Medical Institute (D.B.), and the Gates Foundation Grand Challenges Program (B.L.S., D.B., R.J.M.). Author Contributions J.J.H. and C.M.D. developed the original protein–DNA interface design methods and code. J.A. made further code and method developments, generated and assessed the computational predictions, and performed mutagenesis, biochemical characterization, and crystallization. D.S. collected and processed the crystallographic data, and aided in protein purification and structure refinement.

Author information


  1. Howard Hughes Medical Institute and Department of Biochemistry

    • Justin Ashworth
    • , James J. Havranek
    • , Carlos M. Duarte
    •  & David Baker
  2. Departments of Pathology and Genome Sciences, University of Washington, Seattle, Washington, 98195, USA

    • Raymond J. Monnat Jr
  3. Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue, Washington, 98109, Seattle, USA

    • Django Sussman
    •  & Barry L. Stoddard


  1. Search for Justin Ashworth in:

  2. Search for James J. Havranek in:

  3. Search for Carlos M. Duarte in:

  4. Search for Django Sussman in:

  5. Search for Raymond J. Monnat Jr in:

  6. Search for Barry L. Stoddard in:

  7. Search for David Baker in:

Competing interests

The atomic coordinates of the redesigned I-MsoI endonuclease bound to its cognate DNA have been deposited in the Protein Data Bank with the accession number 2FLD. Reprints and permissions information are available at npg.nature.com/reprintsandpermissions. The authors declare no competing financial interests.

Corresponding authors

Correspondence to Justin Ashworth or David Baker.

Supplementary information

  1. Supplementary Notes

    This file contains Supplementary Figures 1–6, Supplementary Tables, Supplementary Methods and additional references. (PDF 1471 kb)

About this article

Publication history



Issue Date



Further reading


By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.