Nature Genetics | Article
-
Previous article
Nature Genetics | Article
Evolution of high-level ethambutol-resistant tuberculosis through interacting mutations in decaprenylphosphoryl-β-D-arabinose biosynthetic and utilization pathway genes -
Next article
Nature Genetics | Article
Mapping the fine structure of a eukaryotic promoter input-output function
Epigenetic memory at embryonic enhancers identified in DNA methylation maps from adult mouse tissues
- Gary C Hon1,
- Nisha Rajagopal2,
- Yin Shen1,
- David F McCleary1,
- Feng Yue1,
- My D Dang1,
- Bing Ren1, 2, 3, 4, 5,
- Journal name:
- Nature Genetics
- Volume:
- 45,
- Pages:
- 1198–1206
- Year published:
- DOI:
- doi:10.1038/ng.2746
- Received
- Accepted
- Published online
Abstract
Mammalian development requires cytosine methylation, a heritable epigenetic mark of cellular memory believed to maintain a cell's unique gene expression pattern. However, it remains unclear how dynamic DNA methylation relates to cell type–specific gene expression and animal development. Here, by mapping base-resolution methylomes in 17 adult mouse tissues at shallow coverage, we identify 302,864 tissue-specific differentially methylated regions (tsDMRs) and estimate that >6.7% of the mouse genome is variably methylated. Supporting a prominent role for DNA methylation in gene regulation, most tsDMRs occur at distal cis-regulatory elements. Unexpectedly, some tsDMRs mark enhancers that are dormant in adult tissues but active in embryonic development. These 'vestigial' enhancers are hypomethylated and lack active histone modifications in adult tissues but nevertheless exhibit activity during embryonic development. Our results provide new insights into the role of DNA methylation at tissue-specific enhancers and suggest that epigenetic memory of embryonic development may be retained in adult tissues.
Subject terms:
At a glance
Figures
-
Figure 1: Distinct tissue-specific methylomes. (a) UCSC Genome Browser snapshot of DNA methylomes derived from tissues of a single mouse near the Hoxa locus. Each track spans percent mCG (%mCG) values between 0% and 100%. Tracks are colored by tissue type, as determined by the clustering in c. (b) Box plots of the percent mCG distributions for each tissue, calculated from non-overlapping 10-kb bins spanning the mouse genome. Box-plot edges indicate the 25th and 75th percentiles, central bars indicate medians and whiskers indicate non-outlier extremes. Dashed lines indicate 80% and 90% mCG. (c) Dendrogram constructed from 1-kb regions exhibiting significant tissue-specific methylation (P = 0.001, χ2 test). Distance is measured as 1 – Pearson's correlation coefficient. (d) Global abundance of methylation in the non-CG context, expressed as the difference between the fraction of non-CG cytosines that are methylated and the bisulfite non-conversion rate. (e) Abundance of large domains bearing low (L), medium (M) or high (H) methylation, as determined by HMMs independently trained on each tissue.
-
Figure 2: Identification of tissue-specific methylated regions. (a) UCSC Genome Browser snapshot of tsDMRs identified by HMM segmentation of the χ2 statistic for methylation variation (top two tracks). The abundance of DNA methylation in each tissue is indicated below. The tissue specificity of highlighted tsDMRs is indicated above. (b) Quantification of DNA methylation abundance at tsDMRs from a. (c) An HMM was used to segment the mouse genome into regions of low (L, gray), medium (M, white) and high (H, red) tissue specificity for DNA methylation. Shown is the fraction of the genome spanned by these regions, with the high-specificity group denoting tsDMRs. (d) Size distribution of tsDMRs (H) and non-tsDMRs (L). Box plot edges indicate the 25th and 75th percentiles, and whiskers indicate non-outlier extremes. (e) Distributions of average values (left) and standard deviations (right) of percent mCG for tsDMRs (H) and non-tsDMRs (L). Dashed lines indicate the median, m.
-
Figure 3: Tissue-specific methylated regions are predominantly regulatory elements. (a) Enrichment of H3K4me1, H3K4me3, H3K27ac and DNA methylation in liver cells for liver tsDMRs and tsDMRs identified in other tissues. Bone marrow is shown immediately below liver in the H3K4me3 plot. (b) Average PhastCons conservation scores relative to tsDMRs. Higher values indicate greater conservation. (c) Left, heatmap representing the abundance of DNA methylation for tsDMRs in all tissues. Each row represents a tsDMR, and rows are grouped into horizontal blocks that represent the tsDMRs of a particular tissue, with the number of elements in the block shown to the right. A given tsDMR may appear in more than one block. Right, the percentage of tsDMRs in each block within 500 bp of promoters, distal regulatory elements (enhancers and CTCF-binding sites), genic regions and intergenic regions. The genome-wide average for each element is included as the bottom row.
-
Figure 4: Tissue-specific conservation of regulatory elements. (a,b) Average PhastCons conservation scores relative to tsDMRs that are proximal (within 2.5 kb) (a) or distal (beyond 2.5 kb) (b) to annotated transcription start sites (TSSs). Higher values indicate greater conservation.
-
Figure 5: Transcription factor binding motif enrichment near tsDMRs. (a) Heatmap representing the enrichment of transcription factor binding motifs for the tsDMRs identified in each tissue. Each row represents a motif, and the corresponding transcription factors for selected motifs are labeled on the right. (b) Predicted regulatory elements in heart (p300-binding sites (gray), chromatin-predicted enhancers (blue), tsDMRs (red)) were overlapped, and the density of known tissue-specific motifs (ETS, FOXO1, GATA4, JUN, MEF2A, NF1, SOX6, STAT3, TEAD4) relative to these aligned intersected sites is shown. (c) Cumulative density of graphs in b with respect to the absolute distance to the predicted regulatory element. The horizontal dashed line indicates 50% cumulative density, and vertical dotted lines indicate the resolution in base pairs of each set of predicted regulatory elements to achieve 50% cumulative density.
-
Figure 6: AD-I tsDMRs demonstrate features of dormant enhancers. (a) Heatmap representing the density of H3K4me1 and H3K27ac marks relative to all tsDMRs identified in heart, kidney and olfactory bulb. Also indicated are AD-A tsDMRs and AD-I tsDMRs. (b) AD-A and AD-I tsDMRs were identified for ten tissues where histone modification data are publicly available28 (spleen, thymus, bone marrow, intestine, liver, heart, kidney, cerebellum, cortex, olfactory bulb). Shown are the average profiles of H3K4me1, H3K27ac and DNA methylation. Each line represents the average over a tissue or replicate pair. (c) Heatmap representing the enrichment of transcription factor binding motifs for the AD-I and AD-A tsDMRs of each tissue. Each row represents a motif, and several rows are labeled on the right. The numbers of AD-A and AD-I tsDMRs are indicated in parentheses above. (d) Heatmap representing the enrichment of GO biological process terms obtained by running the GREAT tool35 on AD-I tsDMRs. (e) Average PhastCons conservation scores for AD-A tsDMRs and AD-I tsDMRs identified in cerebellum, kidney and heart. Higher values indicate greater conservation.
-
Figure 7: AD-I tsDMRs are active during development. (a) For AD-I tsDMRs identified in adult cerebellum, adult cortex and adult heart, shown is the average enrichment of H3K4me1 and H3K27ac marks in adult tissue and E14.5 tissue. Two biological replicates for each sample are shown. (b) AD-A and AD-I tsDMRs were associated with genes within 50 kb. Shown is the distribution of expression for genes associated with AD-A tsDMRs and AD-I tsDMRs in respective adult tissue. For AD-I tsDMRs for which RNA sequencing data exist at earlier developmental time points (cerebellum, cortex, olfactory bulb: brain E14.5; heart: heart E14.5; liver: liver E14.5), also shown is the distribution of expression of AD-I tsDMR–associated genes in developmental tissue. P values, Wilcoxon. (c) Heatmap of the overlap enrichment of AD-I tsDMRs with enhancers predicted in developing embryos (left) and adult cells (right). Enrichment is determined relative to sets of random AD-I tsDMRs. (d) Box plots of overlap enrichment from c of AD-I tsDMRs with enhancers predicted in developing or adult tissues. (e) Overlap of p300-binding sites34 identified in embryonic forebrain, midbrain and limb with AD-I tsDMRs (red) identified in ectodermal tissue (cerebellum, cortex, olfactory bulb) and all other tissues. As a comparison, overlap was also performed against random sets of AD-I tsDMRs (gray), shown as mean ± s.d. (f) In vivo reporter assays of enhancer activity for VISTA enhancers mm447 and mm414, as obtained from the VISTA enhancer browser39. (g) UCSC Genome Browser snapshot of AD-I tsDMRs identified in cerebellum (highlighted), which overlap with an in vivo–validated enhancer active in developing mouse midbrain. Also shown are active histone modifications (H3K4me1, H3K4me3, H3K27ac) and DNase I hypersensitivity sites (DHSs) in developing mouse brains (E14.5, E18.5) and adult cerebellum. For all box plots, edges indicate the 25th and 75th percentiles, and whiskers indicate non-outlier extremes.
-
Figure 8: Vestigial enhancers across development and strains. (a) Heatmaps representing the enrichment of chromatin (me1, H3K4me1; ac, H3K27ac) at predicted enhancers with active chromatin in E14.5 mouse brain that consistently retain or lose active chromatin marks in all adult brain tissues (cerebellum, cortex, olfactory bulb). The DNA methylation status in adult tissues is indicated on the right. (b) Percentage of E14.5 mouse brain enhancers belonging to various epigenetic states in adult brain tissue (cer, cerebellum; cor, cortex; olf, olfactory bulb). E14.5 enhancers were overlapped with adult enhancers (A), and the remainder were overlapped with adult tsDMRs (D). The remaining fraction is labeled as inactive (I). (c) Heatmap comparing the enrichment of transcription factor binding motifs for vestigial enhancers in brain tissue T1 with those in brain tissue T2. Each row represents a motif, and several rows are labeled on the right. (d) Top, heatmap of DNA methylation status for various tissues and mouse strains, centered on vestigial enhancers identified in C57BL/6 mouse cortex. Bottom, box plots comparing C57BL/6 cortex vestigial enhancer methylation with that of each tissue or strain. Distance is measured as the absolute difference in methylation of each vestigial enhancer between two tissues. Box plot edges indicate the 25th and 75th percentiles, and whiskers indicate non-outlier extremes. *P < 1 × 10−3, Wilcoxon. (e) Average profiles of DNA methylation for cortical tissue from C57BL/6, C/129 and 129/C mice centered at C57BL/6 cortex vestigial enhancers.
Accession codes
Primary accessions
Gene Expression Omnibus
References
- Feng, S., Jacobsen, S.E. & Reik, W. Epigenetic reprogramming in plant and animal development. Science 330, 622–627 (2010).
- Jones, P.A. Functions of DNA methylation: islands, start sites, gene bodies and beyond. Nat. Rev. Genet. 13, 484–492 (2012).
- Bird, A. DNA methylation patterns and epigenetic memory. Genes Dev. 16, 6–21 (2002).
- Bonasio, R., Tu, S. & Reinberg, D. Molecular signals of epigenetic states. Science 330, 612–616 (2010).
- Okano, M. & Li, E. Genetic analyses of DNA methyltransferase genes in mouse model system. J. Nutr. 132, 2462S–2465S (2002).
- Meissner, A. Epigenetic modifications in pluripotent and differentiated cells. Nat. Biotechnol. 28, 1079–1088 (2010).
- Polo, J.M. et al. A molecular roadmap of reprogramming somatic cells into iPS cells. Cell 151, 1617–1632 (2012).
- Ball, M.P. et al. Targeted and genome-scale strategies reveal gene-body methylation signatures in human cells. Nat. Biotechnol. 27, 361–368 (2009).
- Lister, R. et al. Human DNA methylomes at base resolution show widespread epigenomic differences. Nature 462, 315–322 (2009).
- Stadler, M.B. et al. DNA-binding factors shape the mouse methylome at distal regulatory regions. Nature 480, 490–495 (2011).
- Okano, M., Bell, D.W., Haber, D.A. & Li, E. DNA methyltransferases Dnmt3a and Dnmt3b are essential for de novo methylation and mammalian development. Cell 99, 247–257 (1999).
- Hemberger, M., Dean, W. & Reik, W. Epigenetic dynamics of stem cells and cell lineage commitment: digging Waddington's canal. Nat. Rev. Mol. Cell Biol. 10, 526–537 (2009).
- Irizarry, R.A. et al. The human colon cancer methylome shows similar hypo- and hypermethylation at conserved tissue-specific CpG island shores. Nat. Genet. 41, 178–186 (2009).
- Hansen, K.D. et al. Increased methylation variation in epigenetic domains across cancer types. Nat. Genet. 43, 768–775 (2011).
- Ehrlich, M. et al. Amount and distribution of 5-methylcytosine in human DNA from different types of tissues of cells. Nucleic Acids Res. 10, 2709–2721 (1982).
- Novakovic, B. et al. DNA methylation–mediated down-regulation of DNA methyltransferase-1 (DNMT1) is coincident with, but not essential for, global hypomethylation in human placenta. J. Biol. Chem. 285, 9583–9593 (2010).
- Schroeder, D.I. et al. The human placenta methylome. Proc. Natl. Acad. Sci. USA 110, 6037–6042 (2013).
- Gilbert, S. Developmental Biology (Sinauer Associates, Sunderland, MA, 2000).
- Lister, R. et al. Hotspots of aberrant epigenomic reprogramming in human induced pluripotent stem cells. Nature 471, 68–73 (2011).
- Xie, W. et al. Base-resolution analyses of sequence and parent-of-origin dependent DNA methylation in the mouse genome. Cell 148, 816–831 (2012).
- Hon, G.C. et al. Global DNA hypomethylation coupled to repressive chromatin domain formation and gene silencing in breast cancer. Genome Res. 22, 246–258 (2012).
- Berman, B.P. et al. Regions of focal DNA hypermethylation and long-range hypomethylation in colorectal cancer coincide with nuclear lamina–associated domains. Nat. Genet. 44, 40–46 (2012).
- Peric-Hupkes, D. et al. Molecular maps of the reorganization of genome–nuclear lamina interactions during differentiation. Mol. Cell 38, 603–613 (2010).
- Hiratani, I. et al. Genome-wide dynamics of replication timing revealed by in vitro models of mouse embryogenesis. Genome Res. 20, 155–169 (2010).
- Heintzman, N.D. et al. Histone modifications at human enhancers reflect global cell-type-specific gene expression. Nature 459, 108–112 (2009).
- Heintzman, N.D. et al. Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome. Nat. Genet. 39, 311–318 (2007).
- Rada-Iglesias, A. et al. A unique chromatin signature uncovers early developmental enhancers in humans. Nature 470, 279–283 (2011).
- Shen, Y. et al. A map of the cis-regulatory sequences in the mouse genome. Nature 488, 116–120 (2012).
- Hawkins, R.D. et al. Dynamic chromatin states in human ES cells reveal potential regulatory sequences and genes involved in pluripotency. Cell Res. 21, 1393–1409 (2011).
- Creyghton, M.P. et al. Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proc. Natl. Acad. Sci. USA 107, 21931–21936 (2010).
- Siepel, A. et al. Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res. 15, 1034–1050 (2005).
- Brawand, D. et al. The evolution of gene expression levels in mammalian organs. Nature 478, 343–348 (2011).
- Heinz, S. et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell 38, 576–589 (2010).
- Visel, A. et al. ChIP-seq accurately predicts tissue-specific activity of enhancers. Nature 457, 854–858 (2009).
- McLean, C.Y. et al. GREAT improves functional interpretation of cis-regulatory regions. Nat. Biotechnol. 28, 495–501 (2010).
- Reichert, H. Evolutionary conservation of mechanisms for neural regionalization, proliferation and interconnection in brain development. Biol. Lett. 5, 112–116 (2009).
- Blow, M.J. et al. ChIP-Seq identification of weakly conserved heart enhancers. Nat. Genet. 42, 806–810 (2010).
- Taberlay, P.C. et al. Polycomb-repressed genes have permissive enhancers that initiate reprogramming. Cell 147, 1283–1294 (2011).
- Visel, A., Minovitsky, S., Dubchak, I. & Pennacchio, L.A. VISTA Enhancer Browser—a database of tissue-specific human enhancers. Nucleic Acids Res. 35, D88–D92 (2007).
- Hallonet, M., Hollemann, T., Pieler, T. & Gruss, P. Vax1, a novel homeobox-containing gene, directs development of the basal forebrain and visual system. Genes Dev. 13, 3106–3114 (1999).
- Muzio, L. et al. Emx2 and Pax6 control regionalization of the pre-neuronogenic cortical primordium. Cereb. Cortex 12, 129–139 (2002).
- ENCODE Project Consortium. A user's guide to the encyclopedia of DNA elements (ENCODE). PLoS Biol. 9, e1001046 (2011).
- Thurman, R.E. et al. The accessible chromatin landscape of the human genome. Nature 489, 75–82 (2012).
- Wu, H. et al. Dnmt3a-dependent nonpromoter DNA methylation facilitates transcription of neurogenic genes. Science 329, 444–448 (2010).
- Brinkman, A.B. et al. Sequential ChIP–bisulfite sequencing enables direct genome-scale investigation of chromatin and DNA methylation cross-talk. Genome Res. 22, 1128–1138 (2012).
- Bocker, M.T. et al. Genome-wide promoter DNA methylation dynamics of human hematopoietic progenitor cells during differentiation and aging. Blood 117, e182–e189 (2011).
- Doi, A. et al. Differential methylation of tissue- and cancer-specific CpG island shores distinguishes human induced pluripotent stem cells, embryonic stem cells and fibroblasts. Nat. Genet. 41, 1350–1353 (2009).
- Feinberg, A.P. & Vogelstein, B. Hypomethylation distinguishes genes of some human cancers from their normal counterparts. Nature 301, 89–92 (1983).
- Monk, M., Boubelik, M. & Lehnert, S. Temporal and regional changes in DNA methylation in the embryonic, extraembryonic and germ cell lineages during mouse embryo development. Development 99, 371–382 (1987).
- Gama-Sosa, M.A. et al. The 5-methylcytosine content of DNA from human tumors. Nucleic Acids Res. 11, 6883–6894 (1983).
- Herman, J.G. et al. Silencing of the VHL tumor-suppressor gene by DNA methylation in renal carcinoma. Proc. Natl. Acad. Sci. USA 91, 9700–9704 (1994).
- ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
- Bestor, T., Laudano, A., Mattaliano, R. & Ingram, V. Cloning and sequencing of a cDNA encoding DNA methyltransferase of mouse cells. The carboxyl-terminal domain of the mammalian enzymes is related to bacterial restriction methyltransferases. J. Mol. Biol. 203, 971–983 (1988).
- Langmead, B., Trapnell, C., Pop, M. & Salzberg, S.L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009).
- Rajagopal, N. et al. RFECS: a random-forest based algorithm for enhancer identification from chromatin state. PLoS Comput. Biol. 9, e1002968 (2013).
- Maurano, M.T. et al. Systematic localization of common disease-associated variation in regulatory DNA. Science 337, 1190–1195 (2012).
- Meyer, L.R. et al. The UCSC Genome Browser database: extensions and updates 2013. Nucleic Acids Res. 41, D64–D69 (2013).
- Dixon, J.R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012).
- Schug, J. et al. Promoter features related to tissue specificity as measured by Shannon entropy. Genome Biol. 6, R33 (2005).
Author information
Affiliations
-
Ludwig Institute for Cancer Research, La Jolla, California, USA.
- Gary C Hon,
- Yin Shen,
- David F McCleary,
- Feng Yue,
- My D Dang &
- Bing Ren
-
Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, California, USA.
- Nisha Rajagopal &
- Bing Ren
-
Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, California, USA.
- Bing Ren
-
Institute of Genomic Medicine, University of California, San Diego School of Medicine, La Jolla, California, USA.
- Bing Ren
-
Moores Cancer Center, University of California, San Diego School of Medicine, La Jolla, California, USA.
- Bing Ren
Contributions
G.C.H., N.R. and F.Y. performed bioinformatics analysis. G.C.H., Y.S., D.F.M. and M.D.D. performed experiments. G.C.H. and B.R. prepared the manuscript.
Competing financial interests
The authors declare no competing financial interests.
Author details
Gary C Hon
Search for this author in:
Nisha Rajagopal
Search for this author in:
Yin Shen
Search for this author in:
David F McCleary
Search for this author in:
Feng Yue
Search for this author in:
My D Dang
Search for this author in:
Bing Ren
Search for this author in:
Supplementary information
PDF files
- Supplementary Text and Figures (1,785 KB)
Supplementary Figures 1–8 and Supplementary Note
Excel files
- Supplementary Table 1 (17,230 KB)
tsDMRs in mouse tissues
- Supplementary Table 2 (21 KB)
Motif analysis of tsDMRs not recovered by known regulatory elements
- Supplementary Table 3 (13,208 KB)
AD-A and AD-I tsDMRs in mouse tissues
- Supplementary Table 4 (11 KB)
Overlap of embryonic enhancers with adult AD-I tsDMRs
- Supplementary Table 5 (17,387 KB)
Predicted enhancers
- Supplementary Table 6 (16 KB)
Known motifs used in analysis