Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Protocol
  • Published:

Measuring the activity of protein variants on a large scale using deep mutational scanning

Abstract

Deep mutational scanning marries selection for protein function to high-throughput DNA sequencing in order to quantify the activity of variants of a protein on a massive scale. First, an appropriate selection system for the protein function of interest is identified and validated. Second, a library of variants is created, introduced into the selection system and subjected to selection. Third, library DNA is recovered throughout the selection and deep-sequenced. Finally, a functional score for each variant is calculated on the basis of the change in the frequency of the variant during the selection. This protocol describes the steps that must be carried out to generate a large-scale mutagenesis data set consisting of functional scores for up to hundreds of thousands of variants of a protein of interest. Establishing an assay, generating a library of variants and carrying out a selection and its accompanying sequencing takes on the order of 4–6 weeks; the initial data analysis can be completed in 1 week.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Figure 1: Deep mutational scanning workflow.
Figure 2: Variable library sequencing methods.
Figure 3: Using the Enrich software to analyze deep mutational scanning data.
Figure 4: Creating a barcoded library from Gibson-assembled oligonucleotides.

Similar content being viewed by others

References

  1. Cunningham, B.C. & Wells, J.A. High-resolution epitope mapping of hGH-receptor interactions by alanine-scanning mutagenesis. Science 244, 1081–1085 (1989).

    Article  CAS  Google Scholar 

  2. Pál, G., Kouadio, J.-L.K., Artis, D.R., Kossiakoff, A.A. & Sidhu, S.S. Comprehensive and quantitative mapping of energy landscapes for protein-protein interactions by rapid combinatorial scanning. J. Biol. Chem. 281, 22378–22385 (2006).

    Article  Google Scholar 

  3. Brocchieri, L. & Karlin, S. Protein length in eukaryotic and prokaryotic proteomes. Nucleic Acids Res. 33, 3390–3400 (2005).

    Article  CAS  Google Scholar 

  4. Fowler, D.M. et al. High-resolution mapping of protein sequence-function relationships. Nat. Methods 7, 741–746 (2010).

    Article  CAS  Google Scholar 

  5. Araya, C.L. & Fowler, D.M. Deep mutational scanning: assessing protein function on a massive scale. Trends Biotechnol. 29, 435–442 (2011).

    Article  CAS  Google Scholar 

  6. Fowler, D.M. & Fields, S. Deep mutational scanning: a new style of protein science. Nat. Methods 11, 801–807 (2014).

    Article  CAS  Google Scholar 

  7. Levin, A.M. & Weiss, G.A. Optimizing the affinity and specificity of proteins with molecular display. Mol. Biosyst. 2, 49–57 (2006).

    Article  CAS  Google Scholar 

  8. Ernst, A. et al. Coevolution of PDZ domain-ligand interactions analyzed by high-throughput phage display and deep sequencing. Mol. Biosyst. 6, 1782–1790 (2010).

    Article  CAS  Google Scholar 

  9. Gold, M.G. et al. Engineering A-kinase anchoring protein (AKAP)-selective regulatory subunits of protein kinase A (PKA) through structure-based phage selection. J. Biol. Chem. 288, 17111–17121 (2013).

    Article  CAS  Google Scholar 

  10. Reynolds, K.A., McLaughlin, R.N. & Ranganathan, R. Hot spots for allosteric regulation on protein surfaces. Cell 147, 1564–1575 (2011).

    Article  CAS  Google Scholar 

  11. Fujino, Y. et al. Robust in vitro affinity maturation strategy based on interface-focused high-throughput mutational scanning. Biochem. Biophys. Res. Commun. 428, 395–400 (2012).

    Article  CAS  Google Scholar 

  12. Procko, E. et al. Computational design of a protein-based enzyme inhibitor. J. Mol. Biol. 425, 3563–3575 (2013).

    Article  CAS  Google Scholar 

  13. Whitehead, T.A. et al. Optimization of affinity, specificity and function of designed influenza inhibitors using deep sequencing. Nat. Biotechnol. 30, 543–548 (2012).

    Article  CAS  Google Scholar 

  14. Traxlmayr, M.W. et al. Construction of a stability landscape of the CH3 domain of human IgG1 by combining directed evolution with high throughput sequencing. J. Mol. Biol. 423, 397–412 (2012).

    Article  CAS  Google Scholar 

  15. Tinberg, C.E. et al. Computational design of ligand-binding proteins with high affinity and selectivity. Nature 501, 212–216 (2013).

    Article  CAS  Google Scholar 

  16. Kim, I., Miller, C.R., Young, D.L. & Fields, S. High-throughput analysis of in vivo protein stability. Mol. Cell Proteomics 12, 3370–3378 (2013).

    Article  CAS  Google Scholar 

  17. Adkar, B.V. et al. Protein model discrimination using mutational sensitivity derived from deep sequencing. Structure 20, 371–381 (2012).

    Article  CAS  Google Scholar 

  18. Hietpas, R.T., Jensen, J.D. & Bolon, D.N.A. Experimental illumination of a fitness landscape. Proc. Natl. Acad. Sci. USA. 108, 7896–7901 (2011).

    Article  CAS  Google Scholar 

  19. Wu, N.C. et al. Systematic identification of H274Y compensatory mutations in influenza A virus neuraminidase by high-throughput screening. J. Virol. 87, 1193–1199 (2013).

    Article  CAS  Google Scholar 

  20. Wagenaar, T.R. et al. Resistance to vemurafenib resulting from a novel mutation in the BRAFV600E kinase domain. Pigment Cell Melanoma Res. 27, 124–133 (2014).

    Article  CAS  Google Scholar 

  21. Ge, B. et al. Survey of allelic expression using EST mining. Genome Res. 15, 1584–1591 (2005).

    Article  CAS  Google Scholar 

  22. Hiatt, J.B., Patwardhan, R.P., Turner, E.H., Lee, C. & Shendure, J. Parallel, tag-directed assembly of locally derived short sequence reads. Nat. Methods 7, 119–122 (2010).

    Article  CAS  Google Scholar 

  23. Starita, L.M. et al. Activity-enhancing mutations in an E3 ubiquitin ligase identified by high-throughput mutagenesis. Proc. Natl. Acad. Sci. USA 110, E1263–E1272 (2013).

    Article  CAS  Google Scholar 

  24. Melamed, D., Young, D.L., Gamble, C.E., Miller, C.R. & Fields, S. Deep mutational scanning of an RRM domain of the Saccharomyces cerevisiae poly(A)-binding protein. RNA 19, 1537–1551 (2013).

    Article  CAS  Google Scholar 

  25. Firnberg, E. & Ostermeier, M. PFunkel: efficient, expansive, user-defined mutagenesis. PLoS ONE 7, e52031 (2012).

    Article  CAS  Google Scholar 

  26. Jain, P.C. & Varadarajan, R. A rapid, efficient, and economical inverse polymerase chain reaction-based method for generating a site saturation mutant library. Anal. Biochem. 449C, 90–98 (2013).

    Google Scholar 

  27. Gibson, D.G. et al. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat. Methods 6, 343–345 (2009).

    Article  CAS  Google Scholar 

  28. Knight, R. & Yarus, M. Analyzing partially randomized nucleic acid pools: straight dope on doping. Nucleic Acids Res. 31, e30 (2003).

    Article  Google Scholar 

  29. Sidhu, S.S. & Koide, S. Phage display for engineering and analyzing protein interaction interfaces. Curr. Opin. Struct. Biol. 17, 481–487 (2007).

    Article  CAS  Google Scholar 

  30. Meacham, F. et al. Identification and correction of systematic error in high-throughput sequence data. BMC Bioinformatics 12, 451 (2011).

    Article  Google Scholar 

  31. Loman, N.J. et al. Performance comparison of benchtop high-throughput sequencing platforms. Nat. Biotechnol. 30, 434–439 (2012).

    Article  CAS  Google Scholar 

  32. Fowler, D.M., Araya, C.L., Gerard, W. & Fields, S. Enrich: software for analysis of protein function by enrichment and depletion of variants. Bioinformatics 27, 3430–3431 (2011).

    Article  CAS  Google Scholar 

  33. Araya, C.L. et al. A fundamental protein property, thermodynamic stability, revealed solely from large-scale measurements of protein function. Proc. Natl. Acad. Sci. USA 109, 16858–16863 (2012).

    Article  CAS  Google Scholar 

  34. Gai, S.A. & Wittrup, K.D. Yeast surface display for protein engineering and characterization. Curr. Opin. Struct. Biol. 17, 467–473 (2007).

    Article  CAS  Google Scholar 

  35. Dai, M. et al. Using T7 phage display to select GFP-based binders. Protein Eng. Des. Sel. 21, 413–424 (2008).

    Article  CAS  Google Scholar 

  36. Cox, M.P., Peterson, D.A. & Biggs, P.J. SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data. BMC Bioinformatics 11, 485 (2010).

    Article  Google Scholar 

  37. Gietz, R.D. & Schiestl, R.H. High-efficiency yeast transformation using the LiAc/SS carrier DNA/PEG method. Nat. Protoc. 2, 31–34 (2007).

    Article  CAS  Google Scholar 

  38. Roux, K.H. Optimization and troubleshooting in PCR. Cold Spring Harb. Protoc. 2009 10.1101/pdb.ip66 (2009).

  39. McLaughlin, R.N., Poelwijk, F.J., Raman, A., Gosal, W.S. & Ranganathan, R. The spatial architecture of protein function and adaptation. Nature 491, 138–142 (2012).

    Article  CAS  Google Scholar 

Download references

Acknowledgements

This work was supported by grants P41 GM103533 (to S.F.) and F32 GM084699 (to D.M.F.) from the US National Institute of General Medical Sciences. S.F. is supported by the Howard Hughes Medical Institute.

Author information

Authors and Affiliations

Authors

Contributions

D.M.F. and J.J.S. developed and refined the protocols; and D.M.F., J.J.S. and S.F. wrote the paper.

Corresponding author

Correspondence to Douglas M Fowler.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Fowler, D., Stephany, J. & Fields, S. Measuring the activity of protein variants on a large scale using deep mutational scanning. Nat Protoc 9, 2267–2284 (2014). https://doi.org/10.1038/nprot.2014.153

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/nprot.2014.153

This article is cited by

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing