Article | Published:

Systematic dissection and optimization of inducible enhancers in human cells using a massively parallel reporter assay

Nature Biotechnology volume 30, pages 271277 (2012) | Download Citation

Abstract

Learning to read and write the transcriptional regulatory code is of central importance to progress in genetic analysis and engineering. Here we describe a massively parallel reporter assay (MPRA) that facilitates the systematic dissection of transcriptional regulatory elements. In MPRA, microarray-synthesized DNA regulatory elements and unique sequence tags are cloned into plasmids to generate a library of reporter constructs. These constructs are transfected into cells and tag expression is assayed by high-throughput sequencing. We apply MPRA to compare >27,000 variants of two inducible enhancers in human cells: a synthetic cAMP-regulated enhancer and the virus-inducible interferon-β enhancer. We first show that the resulting data define accurate maps of functional transcription factor binding sites in both enhancers at single-nucleotide resolution. We then use the data to train quantitative sequence-activity models (QSAMs) of the two enhancers. We show that QSAMs from two cellular states can be combined to design enhancer variants that optimize potentially conflicting objectives, such as maximizing induced activity while minimizing basal activity.

Access optionsAccess options

Rent or Buy article

Get time limited or full article access on ReadCube.

from$8.99

All prices are NET prices.

Accessions

Gene Expression Omnibus

References

  1. 1.

    et al. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447, 799–816 (2007).

  2. 2.

    Initial impact of the sequencing of the human genome. Nature 470, 187–197 (2011).

  3. 3.

    & Targeting cancer by transcriptional control in cancer gene therapy and viral oncolysis. Adv. Drug Deliv. Rev. 61, 554–571 (2009).

  4. 4.

    & Bioluminescent assays for high-throughput screening. Assay Drug Dev. Technol. 5, 127–136 CrossRef (2007).

  5. 5.

    , , , & Improved Tet-responsive promoters with minimized background expression. BMC Biotechnol. 10, 81 (2010).

  6. 6.

    , & Transcriptional Regulation in Eukaryotes: Concepts, Strategies, and Techniques. Edn. 2 (Cold Spring Harbor Laboratory Press, 2009).

  7. 7.

    et al. Synthesis of high-quality libraries of long (150mer) oligonucleotides by a novel depurination controlled process. Nucleic Acids Res. 38, 2522–2540 (2010).

  8. 8.

    et al. High-resolution analysis of DNA regulatory elements by synthetic saturation mutagenesis. Nat. Biotechnol. 27, 1173–1175 (2009).

  9. 9.

    , , & Using deep sequencing to characterize the biophysical mechanism of a transcriptional regulatory sequence. Proc. Natl. Acad. Sci. USA 107, 9158–9163 (2010).

  10. 10.

    , & An atomic model of the interferon-beta enhanceosome. Cell 129, 1111–1123 (2007).

  11. 11.

    & Transcriptional enhancers: Intelligent enhanceosomes or flexible billboards? J. Cell. Biochem. 94, 890–898 (2005).

  12. 12.

    , , , & Quantitative sequence-activity models (QSAM)–tools for sequence design. Nucleic Acids Res. 21, 733–739 (1993).

  13. 13.

    , & Quantitative analysis of the relationship between nucleotide sequence and functional activity. Nucleic Acids Res. 14, 6661–6679 (1986).

  14. 14.

    & Transcriptional regulation by the phosphorylation-dependent factor CREB. Nat. Rev. Mol. Cell Biol. 2, 599–609 (2001).

  15. 15.

    & Different binding specificities and transactivation of variant CRE's by CREB complexes. Nucleic Acids Res. 22, 1463–1469 (1994).

  16. 16.

    et al. The CGTCA sequence motif is essential for biological activity of the vasoactive intestinal peptide gene cAMP-regulated enhancer. Proc. Natl. Acad. Sci. USA 85, 6662–6666 (1988).

  17. 17.

    , & Selection of optimal kappa B/Rel DNA-binding motifs: interaction of both subunits of NF-kappa B with DNA is required for transcriptional activation. Mol. Cell. Biol. 12, 4412–4421 (1992).

  18. 18.

    , , , & Assembly of a functional beta interferon enhanceosome is dependent on ATF-2-c-jun heterodimer orientation. Mol. Cell. Biol. 20, 4814–4825 (2000).

  19. 19.

    & Excess information at bacteriophage T7 genomic promoters detected by a random cloning technique. Nucleic Acids Res. 17, 659–674 (1989).

  20. 20.

    Pattern Recognition and Machine Learning (Springer, 2006).

  21. 21.

    , , , & Construction and model-based analysis of a promoter library for E. coli: an indispensable tool for metabolic engineering. BMC Biotechnol. 7, 34 (2007).

  22. 22.

    et al. Parallel on-chip gene synthesis and application to optimization of protein expression. Nat. Biotechnol. 29, 449–452 (2011).

  23. 23.

    et al. High-fidelity gene synthesis by retrieval of sequence-verified DNA identified using high-throughput pyrosequencing. Nat. Biotechnol. 28, 1291–1294 (2010).

  24. 24.

    et al. The NIH Roadmap Epigenomics Mapping Consortium. Nat. Biotechnol. 28, 1045–1048 (2010).

  25. 25.

    , , & Synthetic promoter elements obtained by nucleotide sequence variation and selection for activity. Proc. Natl. Acad. Sci. USA 97, 3038–3043 (2000).

  26. 26.

    , , & Synthetic design of strong promoters. Proc. Natl. Acad. Sci. USA 107, 2538–2543 (2010).

  27. 27.

    Adaptation in Natural and Artificial Systems: AN Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence Edn. 1 (MIT Press, 1992).

  28. 28.

    & Controlling the false discovery rate: a practical and powerful approach to multiple testing. J.R. Stat. Soc. B 57, 289–300 (1995).

  29. 29.

    & The upward bias in measures of information derived from limited samples. Neural Comput. 7, 399–407 (1995).

Download references

Acknowledgements

The authors would like to thank E.M. LeProust and S. Chen of Agilent for oligonucleotide library synthesis, R.P. Deering for assistance with Sendai virus infections and the staff of the Broad Institute and the Bauer Core facilities for assistance with data generation. This project was supported by funds from the Broad Institute, the Harvard Stem Cell Institute (T.S.M.), National Human Genome Research Institute grant R01HG004037 (M.K.), the Simons Center for Quantitative Biology at Cold Spring Harbor Laboratory (J.B.K.), National Science Foundation (NSF) grant PHY-0957573 (C.G.C., T.T.) and NSF grant PHY-1022140 (A. Mur.).

Author information

Author notes

    • Alexandre Melnikov
    • , Anand Murugan
    •  & Xiaolan Zhang

    These authors contributed equally to this work.

Affiliations

  1. Broad Institute, Cambridge, Massachusetts, USA.

    • Alexandre Melnikov
    • , Xiaolan Zhang
    • , Li Wang
    • , Peter Rogov
    • , Soheil Feizi
    • , Andreas Gnirke
    • , Manolis Kellis
    • , Eric S Lander
    •  & Tarjei S Mikkelsen
  2. Department of Physics, Princeton University, Princeton, New Jersey, USA.

    • Anand Murugan
    • , Tiberiu Tesileanu
    •  & Curtis G Callan Jr
  3. Simons Center for Systems Biology, Institute for Advanced Study, Princeton, New Jersey, USA.

    • Tiberiu Tesileanu
    •  & Curtis G Callan Jr
  4. MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts, USA.

    • Soheil Feizi
    •  & Manolis Kellis
  5. Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA.

    • Justin B Kinney
  6. Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.

    • Eric S Lander
  7. Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, USA.

    • Eric S Lander
  8. Harvard Stem Cell Institute and Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, Massachusetts, USA.

    • Tarjei S Mikkelsen

Authors

  1. Search for Alexandre Melnikov in:

  2. Search for Anand Murugan in:

  3. Search for Xiaolan Zhang in:

  4. Search for Tiberiu Tesileanu in:

  5. Search for Li Wang in:

  6. Search for Peter Rogov in:

  7. Search for Soheil Feizi in:

  8. Search for Andreas Gnirke in:

  9. Search for Curtis G Callan in:

  10. Search for Justin B Kinney in:

  11. Search for Manolis Kellis in:

  12. Search for Eric S Lander in:

  13. Search for Tarjei S Mikkelsen in:

Contributions

A. Mel., X.Z., P.R., A.G. and T.S.M. developed MPRA and performed the molecular biology experiments. L.W. cultured the cells, and performed the plasmid transfections and luciferase assays. A.Mur., T.T., S.F., C.G.C., J.B.K., M.K., E.S.L. and T.S.M. analyzed the data. T.S.M. wrote the main text with substantial input from all authors. C.G.C. and J.B.K. wrote the Supplementary Notes with substantial input from A. Mur. and T.S.M.

Competing interests

A patent application describing ideas presented in this article has been filed by the Broad Institute.

Corresponding author

Correspondence to Tarjei S Mikkelsen.

Supplementary information

PDF files

  1. 1.

    Supplementary Text and Figures

    Supplementary Tables 5,6, Supplementary Notes and Supplementary Figs. 1–10

Excel files

  1. 1.

    Supplementary Table 1

    CRE variants

  2. 2.

    Supplementary Table 2

    IFNB variants

  3. 3.

    Supplementary Table 3

    CRE mutagenesis/models

  4. 4.

    Supplementary Table 4

    IFNB mutagenesis/models

About this article

Publication history

Received

Accepted

Published

DOI

https://doi.org/10.1038/nbt.2137

Further reading