Protocol | Published:

Mapping genome-wide transcription-factor binding sites using DAP-seq

Nature Protocols volume 12, pages 16591672 (2017) | Download Citation

Abstract

To enable low-cost, high-throughput generation of cistrome and epicistrome maps for any organism, we developed DNA affinity purification sequencing (DAP-seq), a transcription factor (TF)-binding site (TFBS) discovery assay that couples affinity-purified TFs with next-generation sequencing of a genomic DNA library. The method is fast, inexpensive, and more easily scaled than chromatin immunoprecipitation sequencing (ChIP-seq). DNA libraries are constructed using native genomic DNA from any source of interest, preserving cell- and tissue-specific chemical modifications that are known to affect TF binding (such as DNA methylation) and providing increased specificity as compared with in silico predictions based on motifs from methods such as protein-binding microarrays (PBMs) and systematic evolution of ligands by exponential enrichment (SELEX). The resulting DNA library is incubated with an affinity-tagged in vitro-expressed TF, and TF–DNA complexes are purified using magnetic separation of the affinity tag. Bound genomic DNA is eluted from the TF and sequenced using next-generation sequencing. Sequence reads are mapped to a reference genome, identifying genome-wide binding locations for each TF assayed, from which sequence motifs can then be derived. A researcher with molecular biology experience should be able to follow this protocol, processing up to 400 samples per week.

Access optionsAccess options

Rent or Buy article

Get time limited or full article access on ReadCube.

from$8.99

All prices are NET prices.

References

  1. 1.

    , & Lessons from domestication: targeting cis-regulatory elements for crop improvement. Trends Plant Sci. (2016).

  2. 2.

    , & The genetics of transcription factor DNA binding variation. Cell (2016).

  3. 3.

    , , , & Structure and evolution of transcriptional regulatory networks. Curr. Opin. Struct. Biol. 14, 283–291 (2004).

  4. 4.

    et al. Diverse transcription factor binding features revealed by genome-wide ChIP-seq in C. elegans. Genome Res. 21, 245–254 (2011).

  5. 5.

    et al. A cis-regulatory map of the Drosophila genome. Nature 471, 527–531 (2011).

  6. 6.

    et al. Architecture of the human regulatory network derived from ENCODE data. Nature 489, 91–100 (2012).

  7. 7.

    et al. Unlocking the secrets of the genome. Nature 18, 927–930 (2009).

  8. 8.

    et al. ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia. Genome Res. 22, 1813–1831 (2012).

  9. 9.

    et al. Determination and inference of eukaryotic transcription factor sequence specificity. Cell 158, 1431–1443 (2014).

  10. 10.

    et al. DNA-binding specificities of human transcription factors. Cell 152, 327–339 (2013).

  11. 11.

    et al. DNA-dependent formation of transcription factor pairs alters their binding specificity. Nature 527, 384–388 (2015).

  12. 12.

    et al. Competition between DNA methylation and transcription factors determines binding of NRF1. Nature 528, 575–579 (2015).

  13. 13.

    et al. DNA methylation presents distinct binding sites for human transcription factors. Elife 2013 (2013).

  14. 14.

    et al. Integrative genomics identifies the corepressor SMRT as a gatekeeper of adipogenesis through the transcription factors C/EBPB and KAISO. Mol. Cell 46, 335–350 (2012).

  15. 15.

    et al. Cistrome and epicistrome features shape the regulatory DNA landscape. Cell (2016).

  16. 16.

    , , , & A widespread role of the motif environment in transcription factor binding across diverse protein families. Genome Res. (2015).

  17. 17.

    et al. HaloTag: a novel protein labeling technology for cell imaging and protein analysis. ACS Chem. Biol. (2008).

  18. 18.

    & Non-targeted transcription factors motifs are a systemic component of ChIP-seq datasets. Genome Biol. 15, 412 (2014).

  19. 19.

    et al. Human body epigenome maps reveal noncanonical DNA methylation variation. Nature 523, 212–216 (2015).

  20. 20.

    et al. Unique cell-type-specific patterns of DNA methylation in the root meristem. Nat. Plants 2, 16058 (2016).

  21. 21.

    & DNase-seq: a high-resolution technique for mapping active gene regulatory elements across the genome from mammalian cells. Cold Spring Harb. Protoc. (2010).

  22. 22.

    , , , & Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat. Methods 10, 1213–1218 (2013).

  23. 23.

    & Epidermal Cells: Methods and Protocols (ed. Turksen, K.) 49–59 (Springer, 2014).

  24. 24.

    et al. Epigenomic diversity in a global collection of Arabidopsis thaliana accessions. Cell 166, 492–506 (2016).

  25. 25.

    , & Nucleic acid modifications in regulation of gene expression. Cell Chem. Biol. 23, 74–85 (2016).

  26. 26.

    Arabidopsis Interactome Mapping Consortium. Evidence for network evolution in an Arabidopsis interactome map. Science 333, 601–607 (2011).

  27. 27.

    et al. Mapping transcription factor interactome networks using HaloTag protein arrays 113, E4238–E4247.

  28. 28.

    & Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).

  29. 29.

    et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).

  30. 30.

    , & High resolution genome wide binding event finding and motif discovery reveals transcription factor spatial binding constraints. PLoS Comput. Biol. 8, e1002638 (2012).

  31. 31.

    et al. Integrative genomics viewer. Nat. Biotechnol. 29, 24–26 (2011).

  32. 32.

    & MEME-ChIP: motif analysis of large DNA datasets. Bioinformatics 27, 1696–1697 (2011).

  33. 33.

    , , , & Impact of artifact removal on ChIP quality metrics in ChIP-seq and ChIP-exo data. Front. Genet. 5, 75 (2014).

  34. 34.

    & motifStack: plot stacked logos for single or multiple DNA, RNA and amino acid sequence. (2015).

  35. 35.

    & BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).

  36. 36.

    & in Protein Chromatography (Humana Press) 681, 259–280 (2011).

  37. 37.

    Structural Genomics Consortium. et al. Protein production and purification. Nat. Methods 5, 135–146 (2008).

  38. 38.

    , , , & MethylC-seq library preparation for base-resolution whole-genome bisulfite sequencing. Nat. Protoc. 10, 475–83 (2015).

  39. 39.

    & Immunoblot analysis. J. Vis. Exp. 2, 2008 (2008).

Download references

Acknowledgements

This work was supported by grants from the National Science Foundation (MCB1024999) and the Gordon and Betty Moore Foundation (GBMF3034) to J.R.E., as well as from the National Science Foundation to A.G. (IOS1114484 and IOS1546873). J.R.E. is a Howard Hughes Medical Institute Investigator.

Author information

Author notes

    • Ronan C O'Malley

    Present address: United States Department of Energy Joint Genome Institute, Walnut Creek, California, USA.

Affiliations

  1. Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, California, USA.

    • Anna Bartlett
    • , Ronan C O'Malley
    • , Mary Galli
    • , Joseph R Nery
    •  & Joseph R Ecker
  2. Plant Biology Laboratory, The Salk Institute for Biological Studies, La Jolla, California, USA.

    • Ronan C O'Malley
    • , Shao-shan Carol Huang
    •  & Joseph R Ecker
  3. Waksman Institute of Microbiology, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA.

    • Mary Galli
    •  & Andrea Gallavotti
  4. Howard Hughes Medical Institute, The Salk Institute for Biological Studies, La Jolla, California, USA.

    • Joseph R Ecker

Authors

  1. Search for Anna Bartlett in:

  2. Search for Ronan C O'Malley in:

  3. Search for Shao-shan Carol Huang in:

  4. Search for Mary Galli in:

  5. Search for Joseph R Nery in:

  6. Search for Andrea Gallavotti in:

  7. Search for Joseph R Ecker in:

Contributions

R.C.O. and J.R.E. designed the original protocol. R.C.O., A.B., S.-s.C.H., M.G., and A.G. modified and updated the protocol to its current state. J.R.N. performed all the sequencing. A.B., M.G., S.-s.C.H., and J.R.E. wrote the manuscript with contributions from all authors.

Competing interests

The authors declare no competing financial interests.

Corresponding author

Correspondence to Joseph R Ecker.

About this article

Publication history

Published

DOI

https://doi.org/10.1038/nprot.2017.055

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.