Identifying ChIP-seq enrichment using MACS

Feng, Jianxing; Liu, Tao; Qin, Bo; Zhang, Yong; Liu, Xiaole Shirley

doi:10.1038/nprot.2012.101

Protocol
Published: 30 August 2012

Nature Protocols volume 7, pages 1728–1740 (2012)

Abstract

Model-based analysis of ChIP-seq (MACS) is a computational algorithm that identifies genome-wide locations of transcription/chromatin factor binding or histone modification from ChIP-seq data. MACS consists of four steps: removing redundant reads, adjusting read position, calculating peak enrichment and estimating the empirical false discovery rate (FDR). In this protocol, we provide a detailed demonstration of how to install MACS and how to use it to analyze three common types of ChIP-seq data sets with different characteristics: the sequence-specific transcription factor FoxA1, the histone modification mark H3K4me3 with sharp enrichment and the H3K36me3 mark with broad enrichment. We also explain how to interpret and visualize the results of MACS analyses. The algorithm requires ∼3 GB of RAM and 1.5 h of computing time to analyze a ChIP-seq data set containing 30 million reads, an estimate that increases with sequence coverage. MACS is open source and is available from http://liulab.dfci.harvard.edu/MACS/.
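The four steps named in the abstract can be illustrated with a minimal Python sketch. This is not the MACS implementation: the function names, the tuple layout `(chrom, strand, pos)`, and the inputs are hypothetical, the fragment size `d` and the background rate `lam` are assumed to be given rather than estimated from the data, and the FDR is assumed to be the ratio of peaks called in the control to peaks called in the ChIP sample at the same threshold.

```python
import math
from collections import Counter


def remove_redundant(reads, max_dup=1):
    """Step 1: keep at most max_dup reads per identical (chrom, strand, pos)."""
    seen = Counter()
    kept = []
    for read in reads:
        seen[read] += 1
        if seen[read] <= max_dup:
            kept.append(read)
    return kept


def shift_reads(reads, d):
    """Step 2: move each 5' end d/2 toward its 3' end.

    Plus-strand reads move right and minus-strand reads move left, so both
    point at the assumed center of the sequenced fragment.
    """
    half = d // 2
    return [
        (chrom, pos + half if strand == "+" else pos - half)
        for chrom, strand, pos in reads
    ]


def poisson_sf(k, lam):
    """Step 3: P(X >= k) for X ~ Poisson(lam), used as an enrichment p-value.

    Computed as 1 - CDF(k - 1); adequate for a sketch, but it loses
    precision for very small p-values.
    """
    if k <= 0:
        return 1.0
    term = math.exp(-lam)  # P(X = 0)
    cdf = term
    for i in range(1, k):
        term *= lam / i  # P(X = i) from P(X = i - 1)
        cdf += term
    return max(0.0, 1.0 - cdf)


def empirical_fdr(n_control_peaks, n_chip_peaks):
    """Step 4 (assumed definition): control peaks / ChIP peaks."""
    return n_control_peaks / n_chip_peaks if n_chip_peaks else 0.0


# Toy input: two identical plus-strand reads and one minus-strand read.
reads = [("chr1", "+", 100), ("chr1", "+", 100), ("chr1", "-", 300)]
unique = remove_redundant(reads)
shifted = shift_reads(unique, d=100)
p_value = poisson_sf(k=10, lam=2.0)
fdr = empirical_fdr(n_control_peaks=5, n_chip_peaks=100)
```

In this toy input the duplicate plus-strand read is dropped, and the two remaining reads move toward each other by 50 bp, which is the intent of the position adjustment: tags from opposite strands converge on the presumed binding site.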

**Figure 2: Peak model built by MACS using the FoxA1 data set.**

**Figure 3: IGV visualization of MACS results using the FoxA1 data set.**

**Figure 4: IGV visualization of MACS results using the University of Washington H3K4me3 data set.**

**Figure 5: IGV visualization of MACS results using the Broad Institute H3K36me3 data set.**

Acknowledgements

This project was supported by the National Natural Science Foundation of China (31028011 and 31071114); the National Basic Research Program of China (973 Program: 2010CB944904 and 2011CB965104); US National Institutes of Health grant HG4069; and the Excellent Young Teachers Program of Tongji University (2010KJ041).

Author information

Jianxing Feng and Tao Liu: These authors contributed equally to this work.

Authors and Affiliations

Department of Bioinformatics, School of Life Sciences and Technology, Tongji University, Shanghai, China
Jianxing Feng, Bo Qin & Yong Zhang
Department of Biostatistics and Computational Biology, Harvard School of Public Health, Dana-Farber Cancer Institute, Boston, Massachusetts, USA
Tao Liu & Xiaole Shirley Liu

Contributions

Y.Z., T.L. and X.S.L. developed the original MACS algorithm. T.L. developed the current version of the MACS program. J.F. and B.Q. performed the data analysis. J.F., T.L. and X.S.L. wrote the initial manuscript. All authors contributed to the discussion and writing of the final manuscript.

Corresponding authors

Correspondence to Yong Zhang or Xiaole Shirley Liu.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

About this article

Cite this article

Feng, J., Liu, T., Qin, B. et al. Identifying ChIP-seq enrichment using MACS. Nat Protoc 7, 1728–1740 (2012). https://doi.org/10.1038/nprot.2012.101

Issue Date: September 2012
