Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Letter
  • Published:

Genome-wide mapping of 5-hydroxymethylcytosine in embryonic stem cells

Abstract

5-hydroxymethylcytosine (5hmC) is a modified base present at low levels in diverse cell types in mammals1,2,3,4,5. 5hmC is generated by the TET family of Fe(II) and 2-oxoglutarate-dependent enzymes through oxidation of 5-methylcytosine (5mC)1,2,4,5,6,7. 5hmC and TET proteins have been implicated in stem cell biology and cancer1,4,5,8,9, but information on the genome-wide distribution of 5hmC is limited. Here we describe two novel and specific approaches to profile the genomic localization of 5hmC. The first approach, termed GLIB (glucosylation, periodate oxidation, biotinylation) uses a combination of enzymatic and chemical steps to isolate DNA fragments containing as few as a single 5hmC. The second approach involves conversion of 5hmC to cytosine 5-methylenesulphonate (CMS) by treatment of genomic DNA with sodium bisulphite, followed by immunoprecipitation of CMS-containing DNA with a specific antiserum to CMS5. High-throughput sequencing of 5hmC-containing DNA from mouse embryonic stem (ES) cells showed strong enrichment within exons and near transcriptional start sites. 5hmC was especially enriched at the start sites of genes whose promoters bear dual histone 3 lysine 27 trimethylation (H3K27me3) and histone 3 lysine 4 trimethylation (H3K4me3) marks. Our results indicate that 5hmC has a probable role in transcriptional regulation, and suggest a model in which 5hmC contributes to the ‘poised’ chromatin signature found at developmentally-regulated genes in ES cells.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Figure 1: Comparison of 5hmC enrichment methods.
Figure 2: Genomic distribution of 5hmC or 5mC enriched regions of the genome.
Figure 3: Properties of HERGs at transcription start sites.

Similar content being viewed by others

Accession codes

Primary accessions

Gene Expression Omnibus

Data deposits

Data have been deposited at GEO under accession number GSE28682.

References

  1. Tahiliani, M. et al. Conversion of 5-methylcytosine to 5-hydroxymethylcytosine in mammalian DNA by MLL partner TET1. Science 324, 930–935 (2009)

    Article  ADS  CAS  Google Scholar 

  2. Kriaucionis, S. & Heintz, N. The nuclear DNA base 5-hydroxymethylcytosine is present in Purkinje neurons and the brain. Science 324, 929–930 (2009)

    Article  ADS  CAS  Google Scholar 

  3. Szwagierczak, A., Bultmann, S., Schmidt, C. S., Spada, F. & Leonhardt, H. Sensitive enzymatic quantification of 5-hydroxymethylcytosine in genomic DNA. Nucleic Acids Res. 38, e181 (2010)

    Article  Google Scholar 

  4. Ito, S. et al. Role of Tet proteins in 5mC to 5hmC conversion, ES-cell self-renewal and inner cell mass specification. Nature 466, 1129–1133 (2010)

    Article  ADS  CAS  Google Scholar 

  5. Ko, M. et al. Impaired hydroxylation of 5-methylcytosine in myeloid cancers with mutant TET2 . Nature 468, 839–843 (2010)

    Article  ADS  CAS  Google Scholar 

  6. Iyer, L. M., Tahiliani, M., Rao, A. & Aravind, L. Prediction of novel families of enzymes involved in oxidative and other complex modifications of bases in nucleic acids. Cell Cycle 8, 1698–1710 (2009)

    Article  CAS  Google Scholar 

  7. Loenarz, C. & Schofield, C. J. Oxygenase catalyzed 5-methylcytosine hydroxylation. Chem. Biol. 16, 580–583 (2009)

    Article  CAS  Google Scholar 

  8. Koh, K. P. et al. Tet1 and Tet2 regulate 5-hydroxymethylcytosine production and cell lineage specification in mouse embryonic stem cells. Cell Stem Cell 8, 200–213 (2011)

    Article  CAS  Google Scholar 

  9. Delhommeau, F. et al. Mutation in TET2 in myeloid cancers. N. Engl. J. Med. 360, 2289–2301 (2009)

    Article  Google Scholar 

  10. Zhang, H., Li, X. J., Martin, D. B. & Aebersold, R. Identification and quantification of N-linked glycoproteins using hydrazide chemistry, stable isotope labeling and mass spectrometry. Nature Biotechnol. 21, 660–666 (2003)

    Article  CAS  Google Scholar 

  11. Song, C. X. et al. Selective chemical labeling reveals the genome-wide distribution of 5-hydroxymethylcytosine. Nature Biotechnol. 29, 68–72 (2011)

    Article  CAS  Google Scholar 

  12. Hayatsu, H. & Shiragami, M. Reaction of bisulfite with the 5-hydroxymethyl group in pyrimidines and in phage DNAs. Biochemistry 18, 632–637 (1979)

    Article  CAS  Google Scholar 

  13. Huang, Y. et al. The behaviour of 5-hydroxymethylcytosine in bisulfite sequencing. PLoS ONE 5, e8888 (2010)

    Article  ADS  Google Scholar 

  14. Harris, T. D. et al. Single-molecule DNA sequencing of a viral genome. Science 320, 106–109 (2008)

    Article  ADS  CAS  Google Scholar 

  15. Bowers, J. et al. Virtual terminator nucleotides for next-generation DNA sequencing. Nature Methods 6, 593–595 (2009)

    Article  CAS  Google Scholar 

  16. Lister, R. et al. Human DNA methylomes at base resolution show widespread epigenomic differences. Nature 462, 315–322 (2009)

    Article  ADS  CAS  Google Scholar 

  17. Saxonov, S., Berg, P. & Brutlag, D. L. A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters. Proc. Natl Acad. Sci. USA 103, 1412–1417 (2006)

    Article  ADS  CAS  Google Scholar 

  18. Feng, S. et al. Conservation and divergence of methylation patterning in plants and animals. Proc. Natl Acad. Sci. USA 107, 8689–8694 (2010)

    Article  ADS  CAS  Google Scholar 

  19. Creyghton, M. P. et al. Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proc. Natl Acad. Sci. USA 10.1073/pnas.1016071107 (24 November 2010)

  20. Boyer, L. A. et al. Polycomb complexes repress developmental regulators in murine embryonic stem cells. Nature 441, 349–353 (2006)

    Article  ADS  CAS  Google Scholar 

  21. Guttman, M. et al. Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs. Nature Biotechnol. 28, 503–510 (2010)

    Article  CAS  Google Scholar 

  22. Mikkelsen, T. S. et al. Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature 448, 553–560 (2007)

    Article  ADS  CAS  Google Scholar 

  23. Ku, M. et al. Genomewide analysis of PRC1 and PRC2 occupancy identifies two classes of bivalent domains. PLoS Genet. 4, e1000242 (2008)

    Article  Google Scholar 

  24. Meissner, A. et al. Genome-scale DNA methylation maps of pluripotent and differentiated cells. Nature 454, 766–770 (2008)

    Article  ADS  CAS  Google Scholar 

  25. Zhang, H. et al. TET1 is a DNA-binding protein that modulates DNA methylation and gene transcription via hydroxylation of 5-methylcytosine. Cell Res. 20, 1390–1393 (2010)

    Article  ADS  Google Scholar 

  26. Lee, T. I. et al. Control of developmental regulators by Polycomb in human embryonic stem cells. Cell 125, 301–313 (2006)

    Article  CAS  Google Scholar 

  27. Bernstein, B. E. et al. A bivalent chromatin structure marks key developmental genes in embryonic stem cells. Cell 125, 315–326 (2006)

    Article  CAS  Google Scholar 

  28. Fouse, S. D. et al. Promoter CpG methylation contributes to ES cell gene regulation in parallel with Oct4/Nanog, PcG complex, and histone H3 K4/K27 trimethylation. Cell Stem Cell 2, 160–169 (2008)

    Article  CAS  Google Scholar 

Download references

Acknowledgements

We thank B. Ren for assistance in next generation sequencing using the Illumina platform. We thank M. Guttman for making his RNASeq data set available to us. W.A.P. is supported by a predoctoral graduate research fellowship from the National Science Foundation, and Y.H. by a postdoctoral fellowship from the Leukemia and Lymphoma Society. R.L. is supported by a California Institute for Regenerative Medicine Training Grant. This study was supported by the National Institute of Health grants RC1 DA028422, R01 AI44432 and 1 R01 HD065812-01A1 and a grant from the California Institute of Regenerative Medicine (to A.R.), a pilot grant from Harvard Catalyst, The Harvard Clinical and Translational Science Center (NIH Grant 1 UL1 RR 025758-02) and NIH K08 HL089150 (to S.A.), and a grant from the Mary. K. Chapman Foundation (to J.R.E.).

Author information

Authors and Affiliations

Authors

Contributions

W.A.P., Y.B. and S.A. devised the GLIB method. W.A.P., S.A., H.R.H. and E.M.M. optimized the GLIB method. Y.H. generated the anti-CMS antiserum, and Y.H. and W.A.P. optimized the anti-CMS pull-down. W.A.P. and Y.H. grew ES cells. W.A.P. prepared GLIB samples for sequencing, Y.H. prepared CMS samples, H.R.H. performed MeDIPs. Helicos sequencing and mapping was performed by P.K. and P.M.M., Illumina sequencing and mapping was performed by R.L. and J.R.E., and U.J.P. was responsible for bioinformatic analysis. M.K. performed the anti-5hmC dot blot. W.A.P. and M.T. performed anti-5hmC pull-downs. H.R.H. and S.M. performed and optimized in vitro tests of Tet substrate specificity. W.A.P., S.A. and A.R. wrote the manuscript. S.A. and A.R. coordinated research.

Corresponding authors

Correspondence to Suneet Agarwal or Anjana Rao.

Ethics declarations

Competing interests

P.K. and P.M.M. are employees of Helicos Biosciences.

Supplementary information

nature10102-s1.pdf Supplementary Information

This file contains Supplementary Figures 1-5 with legends, Supplementary Methods, additional references and Supplementary Tables 1-3 (PDF 1197 kb)

Supplementary Table 4. GLIB Peak Annotation.

This table shows the location of every GLIB HERG and the genomic features it is enclosed within (or in the case of exons, transcription start sites, or enhancers, touches). (XLS 22370 kb)

Supplementary Table 5. GLIB Peak Locations.

This table shows the location of each GLIB HERG in the Mus Musculus (mm9) genome. (TXT 5055 kb)

Supplementary Table 6. CMS Peak Annotation.

This table shows the location of every CMS HERG and the genomic features it is enclosed within (or in the case of exons, transcription start sites, or enhancers, touches). (XLS 23176 kb)

Supplementary Table 7. CMS Peak Locations.

This table shows the location of each CMS HERG in the Mus Musculus (mm9) genome. (TXT 4601 kb)

Supplementary Table 8. 5mC Peak Annotation.

This table shows the location of every MERG and the genomic features it is enclosed within (or in the case of exons, transcription start sites, or enhancers, touches). (XLS 21412 kb)

Supplementary Table 9. 5mC Peak Locations.

This table shows the location of each MERG in the Mus Musculus (mm9) genome. (TXT 2648 kb)

Supplementary Table 9. Visualization of hoxb locus

This file shows reads from the GLIB (reads.glib.hmc) and anti-CMS (reads.cms.hmc) precipitations. Also shown are the reads from the –BGT control (reads.glib.bg), the bisulphite treated input (reads.cms.bg), and the HERGs from each method (peaks.glib and peaks.cms). (TXT 3101 kb)

Supplementary Table 11. List of hydroxymethylated TSS genes (by GLIB).

This table lists every gene that overlaps with a HERG (as determined by GLIB) at or immediately prior to the TSS (-800bp to +200bp). Featured is the RefSeq output, the promoter CpG class and histone methylation state22, the expresson decile in ES cells21, the presence or absence of polycomb features at the promoter23, change in expression upon differentiation to embryoid bodies20, and upregulation or downregulation in response to Tet1 depletion8. (XLS 4771 kb)

Supplementary Table 12. List of hydroxymethylated TSS genes (by anti-CMS).

This table lists every gene that overlaps with a HERG (as determined by anti-CMS precipitation) at or immediately prior to the TSS (-800bp to +200bp). Featured is the RefSeq output, the promoter CpG class and histone methylation state22, the expresson decile in ES cells21, the presence or absence of polycomb features at the promoter23, change in expression upon differentiation to embryoid bodies20, and upregulation or downregulation in response to Tet1 depletion8. (XLS 4291 kb)

Supplementary Table 13. List of methylated TSS genes (by MeDIP).

This table lists every gene that overlaps with a MERG at or immediately prior to the TSS (-800bp to +200bp). Featured is the RefSeq output, the promoter CpG class and histone methylation state22, the expresson decile in ES cells21, the presence or absence of polycomb features at the promoter23, the change in expression upon differentiation to embryoid bodies20, and upregulation or downregulation in response to Tet1 depletion8. (XLS 644 kb)

PowerPoint slides

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pastor, W., Pape, U., Huang, Y. et al. Genome-wide mapping of 5-hydroxymethylcytosine in embryonic stem cells. Nature 473, 394–397 (2011). https://doi.org/10.1038/nature10102

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/nature10102

This article is cited by

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing