Abstract
The 11 zinc finger (ZF) protein CTCF regulates topologically associating domain formation and transcription through selective binding to thousands of genomic sites. Here, we replaced endogenous CTCF in mouse embryonic stem cells with green-fluorescent-protein-tagged wild-type or mutant proteins lacking individual ZFs to identify additional determinants of CTCF positioning and function. While ZF1 and ZF8–ZF11 are not essential for cell survival, ZF8 deletion strikingly increases the DNA binding off-rate of mutant CTCF, resulting in reduced CTCF chromatin residence time. Loss of ZF8 results in widespread weakening of topologically associating domains, aberrant gene expression and increased genome-wide DNA methylation. Thus, important chromatin-templated processes rely on accurate CTCF chromatin residence time, which we propose depends on local sequence and chromatin context as well as global CTCF protein concentration.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$29.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
Data availability
All sequencing the datasets generated for this study are available in the GEO data repository under accession number GSE154009. Selected RNA-seq and Hi-C results can be found in Supplementary Tables 1 and 2. Previously published genomics data that were re-analysed here are available under the following accession codes: GSE123636 (RNA-seq of RBRi CTCF-expressing ESCs); GSE98671 (RNA-seq and Hi-C of auxin-inducible degron-mediated CTCF-depleted ESCs); GSE62380 (H3K27Ac, H3K4me3, H3K27me3, H3K36me3 and H3K79me2 ChIP-seq of ESCs); GSE96611 (H3K4Me2 ChIP-seq and ATAC-seq of ESCs); GSE44286 (Oct4 ChIP-seq of ESCs); GSE40910 (MNAse-seq of ESCs); GSE20485 (RNA polymerase II ChIP-seq of ESCs); and GSE136860 (PRO-seq of ESCs). Source data are provided with this paper. All other data that support the conclusions of this study are available from the authors on reasonable request.
Code availability
All codes used in this manuscript are publicly available except for Python scripts to analyse MeD-seq data, which are proprietary (Erasmus MC/Methylomics).
References
Bonev, B. & Cavalli, G. Organization and function of the 3D genome. Nat. Rev. Genet. 17, 661–678 (2016).
Rowley, M. J. & Corces, V. G. Organizational principles of 3D genome architecture. Nat. Rev. Genet. 19, 789–800 (2018).
Stadhouders, R., Filion, G. J. & Graf, T. Transcription factors and 3D genome conformation in cell-fate decisions. Nature 569, 345–354 (2019).
Lieberman-Aiden, E. et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009).
Rao, S. S. et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665–1680 (2014).
Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012).
Nora, E. P. et al. Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature 485, 381–385 (2012).
Sexton, T. et al. Three-dimensional folding and functional organization principles of the Drosophila genome. Cell 148, 458–472 (2012).
Schoenfelder, S. & Fraser, P. Long-range enhancer–promoter contacts in gene expression control. Nat. Rev. Genet. 20, 437–455 (2019).
Groschel, S. et al. A single oncogenic enhancer rearrangement causes concomitant EVI1 and GATA2 deregulation in leukemia. Cell 157, 369–381 (2014).
Lupianez, D. G. et al. Disruptions of topological chromatin domains cause pathogenic rewiring of gene–enhancer interactions. Cell 161, 1012–1025 (2015).
Rhee, H. S. & Pugh, B. F. Comprehensive genome-wide protein–DNA interactions detected at single-nucleotide resolution. Cell 147, 1408–1419 (2011).
Schmidt, D. et al. Waves of retrotransposon expansion remodel genome organization and CTCF binding in multiple mammalian lineages. Cell 148, 335–348 (2012).
Hashimoto, H. et al. Structural basis for the versatile and methylation-dependent binding of CTCF to DNA. Mol. Cell 66, 711–720.e3 (2017).
Nakahashi, H. et al. A genome-wide map of CTCF multivalency redefines the CTCF code. Cell Rep. 3, 1678–1689 (2013).
Holwerda, S. J. & de Laat, W. CTCF: the protein, the binding partners, the binding sites and their chromatin loops. Philos. Trans. R. Soc. Lond. B Biol. Sci. 368, 20120369 (2013).
Dekker, J. & Mirny, L. The 3D genome as moderator of chromosomal communication. Cell 164, 1110–1121 (2016).
Merkenschlager, M. & Nora, E. P. CTCF and cohesin in genome folding and transcriptional gene regulation. Annu. Rev. Genomics Hum. Genet 17, 17–43 (2016).
Sanborn, A. L. et al. Chromatin extrusion explains key features of loop and domain formation in wild-type and engineered genomes. Proc. Natl Acad. Sci. USA 112, E6456–E6465 (2015).
Fudenberg, G., Abdennur, N., Imakaev, M., Goloborodko, A. & Mirny, L. A. Emerging evidence of chromosome folding by loop extrusion. Cold Spring Harb. Symp. Quant. Biol. 82, 45–55 (2017).
Davidson, I. F. et al. DNA loop extrusion by human cohesin. Science 366, 1338–1345 (2019).
Gomez-Marin, C. et al. Evolutionary comparison reveals that diverging CTCF sites are signatures of ancestral topological associating domains borders. Proc. Natl Acad. Sci. USA 112, 7542–7547 (2015).
Vietri Rudan, M. et al. Comparative Hi-C reveals that CTCF underlies evolution of chromosomal domain architecture. Cell Rep. 10, 1297–1309 (2015).
Haarhuis, J. H. I. et al. The cohesin release factor WAPL restricts chromatin loop extension. Cell 169, 693–707.e14 (2017).
Rao, S. S. P. et al. Cohesin loss eliminates all loop domains. Cell 171, 305–320.e24 (2017).
Schwarzer, W. et al. Two independent modes of chromatin organization revealed by cohesin removal. Nature 551, 51–56 (2017).
Wutz, G. et al. Topologically associating domains and chromatin loops depend on cohesin and are regulated by CTCF, WAPL, and PDS5 proteins. EMBO J. 36, 3573–3599 (2017).
Zuin, J. et al. Cohesin and CTCF differentially affect chromatin architecture and gene expression in human cells. Proc. Natl Acad. Sci. USA 111, 996–1001 (2014).
Nora, E. P. et al. Targeted degradation of CTCF decouples local insulation of chromosome domains from genomic compartmentalization. Cell 169, 930–944.e22 (2017).
Hansen, A. S. et al. Distinct classes of chromatin loops revealed by deletion of an RNA-binding region in CTCF. Mol. Cell 76, 395–411.e13 (2019).
Saldana-Meyer, R. et al. RNA interactions are essential for CTCF-mediated genome organization. Mol. Cell 76, 412–422.e5 (2019).
Sleutels, F. et al. The male germ cell gene regulator CTCFL is functionally different from CTCF and binds CTCF-like consensus sites in a nucleosome composition-dependent manner. Epigenetics Chromatin 5, 8 (2012).
van de Nobelen, S. et al. CTCF regulates the local epigenetic state of ribosomal DNA repeats. Epigenetics Chromatin 3, 19 (2010).
Heath, H. et al. CTCF regulates cell cycle progression of αβ T cells in the thymus. EMBO J. 27, 2839–2850 (2008).
Stik, G. et al. CTCF is dispensable for immune cell transdifferentiation but facilitates an acute inflammatory response. Nat. Genet. 52, 655–661 (2020).
Vian, L. et al. The energetics and physiological impact of cohesin extrusion. Cell 173, 1165–1178.e20 (2018).
Burke, L. J. et al. CTCF binding and higher order chromatin structure of the H19 locus are maintained in mitotic chromatin. EMBO J. 24, 3291–3300 (2005).
Hansen, A. S., Pustova, I., Cattoglio, C., Tjian, R. & Darzacq, X. CTCF and cohesin regulate chromatin loop stability with distinct dynamics. eLife 6, e25776 (2017).
Geverts, B., van Royen, M. E. & Houtsmuller, A. B. Analysis of biomolecular dynamics by FRAP and computer simulation. Methods Mol. Biol. 1251, 109–133 (2015).
Caudron-Herger, M. et al. R-DeeP: proteome-wide and quantitative identification of RNA-dependent proteins by density gradient ultracentrifugation. Mol. Cell 75, 184–199.e10 (2019).
Hansen, A. S., Amitai, A., Cattoglio, C., Tjian, R. & Darzacq, X. Guided nuclear exploration increases CTCF target search efficiency. Nat. Chem. Biol. 16, 257–266 (2019).
Agarwal, H., Reisser, M., Wortmann, C. & Gebhardt, J. C. M. Direct observation of cell-cycle-dependent interactions between CTCF and chromatin. Biophys. J. 112, 2051–2055 (2017).
Cai, Y. et al. Experimental and computational framework for a dynamic protein atlas of human cell division. Nature 561, 411–415 (2018).
Stadhouders, R. et al. Transcription factors orchestrate dynamic interplay between genome topology and gene regulation during cell reprogramming. Nat. Genet. 50, 238–249 (2018).
Whyte, W. A. et al. Master transcription factors and mediator establish super-enhancers at key cell identity genes. Cell 153, 307–319 (2013).
Grosveld, F., van Staalduinen, J. & Stadhouders, R. Transcriptional regulation by (super)enhancers: from discovery to mechanisms. Annu. Rev. Genomics Hum. Genet. https://doi.org/10.1146/annurev-genom-122220-093818 (2021).
Ing-Simmons, E. et al. Spatial enhancer clustering and regulation of enhancer–proximal genes by cohesin. Genome Res. 25, 504–513 (2015).
Beagrie, R. A. et al. Complex multi-enhancer contacts captured by genome architecture mapping. Nature 543, 519–524 (2017).
Crane, E. et al. Condensin-driven remodelling of X chromosome topology during dosage compensation. Nature 523, 240–244 (2015).
Krijger, P. H. et al. Cell-of-origin-specific 3D genome structure acquired during somatic cell reprogramming. Cell Stem Cell 18, 597–610 (2016).
Mumbach, M. R. et al. HiChIP: efficient and sensitive analysis of protein-directed genome architecture. Nat. Methods 13, 919–922 (2016).
Beagan, J. A. & Phillips-Cremins, J. E. On the existence and functionality of topologically associating domains. Nat. Genet. 52, 8–16 (2020).
Giorgetti, L. et al. Structural organization of the inactive X chromosome in the mouse. Nature 535, 575–579 (2016).
Darrow, E. M. et al. Deletion of DXZ4 on the human inactive X chromosome alters higher-order genome architecture. Proc. Natl Acad. Sci. USA 113, E4504–E4512 (2016).
Deng, X. et al. Bipartite structure of the inactive mouse X chromosome. Genome Biol. 16, 152 (2015).
Kentepozidou, E. et al. Clustered CTCF binding is an evolutionary mechanism to maintain topologically associating domains. Genome Biol. 21, 5 (2020).
Gates, L. A., Foulds, C. E. & O’Malley, B. W. Histone marks in the ‘driver’s seat’: functional roles in steering the transcription cycle. Trends Biochem. Sci. 42, 977–989 (2017).
Aranda, S., Mas, G. & Di Croce, L. Regulation of gene transcription by Polycomb proteins. Sci. Adv. 1, e1500737 (2015).
Zhou, Y. et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat. Commun. 10, 1523 (2019).
Boers, R. et al. Genome-wide DNA methylation profiling using the methylation-dependent restriction enzyme LpnPI. Genome Res. 28, 88–99 (2018).
Bell, A. C. & Felsenfeld, G. Methylation of a CTCF-dependent boundary controls imprinted expression of the Igf2 gene. Nature 405, 482–485 (2000).
Hark, A. T. et al. CTCF mediates methylation-sensitive enhancer-blocking activity at the H19/Igf2 locus. Nature 405, 486–489 (2000).
Wang, H. et al. Widespread plasticity in CTCF occupancy linked to DNA methylation. Genome Res. 22, 1680–1688 (2012).
Wiehle, L. et al. DNA (de)methylation in embryonic stem cells controls CTCF-dependent chromatin boundaries. Genome Res. 29, 750–761 (2019).
Oomen, M. E., Hansen, A. S., Liu, Y., Darzacq, X. & Dekker, J. CTCF sites display cell cycle-dependent dynamics in factor binding and nucleosome positioning. Genome Res. 29, 236–249 (2019).
Hansen, A. S., Cattoglio, C., Darzacq, X. & Tjian, R. Recent evidence that TADs and chromatin loops are dynamic structures. Nucleus 9, 20–32 (2018).
Fudenberg, G. et al. Formation of chromosomal domains by loop extrusion. Cell Rep. 15, 2038–2049 (2016).
Heinz, S. et al. Transcription elongation can affect genome 3D structure. Cell 174, 1522–1536.e22 (2018).
Busslinger, G. A. et al. Cohesin is positioned in mammalian genomes by transcription, CTCF and Wapl. Nature 544, 503–507 (2017).
Chen, X. et al. Integration of external signaling pathways with the core transcriptional network in embryonic stem cells. Cell 133, 1106–1117 (2008).
Bardou, P., Mariette, J., Escudie, F., Djemiel, C. & Klopp, C. jvenn: an interactive Venn diagram viewer. BMC Bioinformatics 15, 293 (2014).
Dragestein, K. A. et al. Dynamic behavior of GFP-CLIP-170 reveals fast protein turnover on microtubule plus ends. J. Cell Biol. 180, 729–737 (2008).
Leslie, K. & Galjart, N. Going solo: measuring the motions of microtubules with an in vitro assay for TIRF microscopy. Methods Cell Biol. 115, 109–124 (2013).
Magaraki, A. et al. A novel approach to differentiate rat embryonic stem cells in vitro reveals a role for RNF12 in activation of X chromosome inactivation. Sci. Rep. 9, 6068 (2019).
Schneider, C. A., Rasband, W. S. & Eliceiri, K. W. NIH Image to ImageJ: 25 years of image analysis. Nat. Methods 9, 671–675 (2012).
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).
Kim, T., Seo, H. D., Hennighausen, L., Lee, D. & Kang, K. Octopus-toolkit: a workflow to automate mining of public epigenomic and transcriptomic next-generation sequencing data. Nucleic Acids Res. 46, e53 (2018).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Brouwer, R. W., van den Hout, M. C., Grosveld, F. G. & van Ijcken, W. F. NARWHAL, a primary analysis pipeline for NGS data. Bioinformatics 28, 284–285 (2012).
Feng, X., Grossman, R. & Stein, L. PeakRanger: a cloud-enabled peak caller for ChIP-seq data. BMC Bioinformatics 12, 139 (2011).
Lawrence, M., Gentleman, R. & Carey, V. rtracklayer: an R package for interfacing with genome browsers. Bioinformatics 25, 1841–1842 (2009).
Lawrence, M. et al. Software for computing and annotating genomic ranges. PLoS Comput. Biol. 9, e1003118 (2013).
Morgan, M. et al. ShortRead: a bioconductor package for input, quality assessment and exploration of high-throughput sequence data. Bioinformatics 25, 2607–2608 (2009).
Anders, S. & Huber, W. Differential expression analysis for sequence count data. Genome Biol. 11, R106 (2010).
Heinz, S. et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell 38, 576–589 (2010).
Serra, F. et al. Automatic analysis and 3D-modelling of Hi-C data using TADbit reveals structural features of the fly chromatin colors. PLoS Comput. Biol. 13, e1005665 (2017).
Imakaev, M. et al. Iterative correction of Hi-C data reveals hallmarks of chromosome organization. Nat. Methods 9, 999–1003 (2012).
Yan, K. K., Yardimci, G. G., Yan, C., Noble, W. S. & Gerstein, M. HiC-spector: a matrix library for spectral and reproducibility analysis of Hi-C contact maps. Bioinformatics 33, 2199–2201 (2017).
Vidal, E. et al. OneD: increasing reproducibility of Hi-C samples with abnormal karyotypes. Nucleic Acids Res. 46, e49 (2018).
Ramirez, F. et al. High-resolution TADs reveal DNA sequences underlying genome organization in flies. Nat. Commun. 9, 189 (2018).
Roayaei Ardakany, A., Gezer, H. T., Lonardi, S. & Ay, F. Mustache: multi-scale detection of chromatin loops from Hi-C and Micro-C maps using scale-space representation. Genome Biol. 21, 256 (2020).
Flyamer, I. M., Illingworth, R. S. & Bickmore, W. A. Coolpup.py: versatile pile-up analysis of Hi-C data. Bioinformatics 36, 2980–2985 (2020).
Acknowledgements
We would like to thank I. Zampeta for performing the Dnmt3a western blots, and R. van der Linden, R. Janssens and M. de Bruijn for performing flow cytometry. This work was supported by grants from the Netherlands Organisation for Scientific Research (ALW 822.02.018) and the Dutch Cancer Society (KWF EMCR 2008-4109). G.S. was supported by a Marie Skłodowska-Curie fellowship (H2020-MSCA-IF-2016, miRStem) and by the ‘Fundación Científica de la Asociación Española Contra el Cáncer’. R.S. is supported by the Netherlands Organization for Scientific Research (VENI 91617114) and an Erasmus MC Fellowship.
Author information
Authors and Affiliations
Contributions
W.S., F.S. and M.v.d.R. generated and characterized the GFP-tagged ESC lines. W.S., F.S., M.v.d.H., Z.O., W.F.J.v.IJ., S.B., S.C.H., R.S., N.G. and M.B. performed and analysed the RNA-seq and ChIP-seq experiments. S.C.H., S.B., B.G., W.A.v.C., A.H. and N.G. performed and analysed the imaging experiments. S.B., G.S., E.V. and R.S. performed and analysed the Hi-C experiments. S.B., R.B., J.B., J.G., N.G. and R.S. performed and analysed the DNA methylation experiments. S.B., S.M., J.G. and N.G. performed and analysed the FISH experiments. R.R., T.G., A.H., F.G., R.S. and N.G. conceived many of the experiments presented in this work and analysed results. All authors contributed to the writing of this manuscript.
Corresponding authors
Ethics declarations
Competing interests
R.B., J.B., W.F.V.v.IJ. and J.G. report being shareholders in Methylomics B.V., a commercial company that applies MeD-seq to develop methylation markers for cancer staging. The other authors declare no conflicts of interest or financial interests.
Additional information
Peer review information Nature Cell Biology thanks Bing Ren and the other, anonymous, reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Characterization of ESC lines expressing GFP-CTCF or mutant CTCF proteins.
a) CTCF zinc finger position and sequence. ZFs are based on ref. 14. Cysteines (C) and histidines (H) important for zinc coordination are boxed. b) Designed amino acid deletions in the various CTCF ZF mutant proteins. c) Results of the ES cell rescue experiment. For each construct that was transfected, the number of colonies analyzed (left column) and of colonies with full Cre-mediated Ctcfdelneo allele deletion (middle column) are shown. Colonies were analyzed by various methods to verify the intended deletion. The final conclusion regarding functional CTCF substitution is depicted in the right hand column. Of note, we only obtained one ES cell line expressing del1. Sequencing revealed that it contained a deletion of one extra amino acid (K) at the N-terminus of the originally intended del1 stretch (which begins with TFQ), and that the D residue situated at the C-terminus of the intended deletion stretch (which ends with HTD) was not deleted but was instead mutated into an E. Both changes are indicated in Fig. 1a. d) Log2 fold change in CTCF ChIP-Seq signal in del8- versus GFP-CTCF-expressing ES cells. e) Motif analysis of DNA binding by CTCF ZF mutants compared to GFP-CTCF (top row). Sites with more than 2-fold increased binding were included (number of sites are between brackets). Core (C), upstream (U), and downstream (D) motifs are indicated. Arrows point to U motif nucleotides. These are shifted towards the core in the del8 mutant. Asterisks point to GC residues appearing in all mutant motifs.
Extended Data Fig. 2 Interphase localization and behaviour of GFP-tagged CTCF proteins.
a) Fluorescence intensities in the indicated ES cell lines, measured by FACS, which reveals mean fluorescence intensity (MFI) per cell. WT: wild type ES cells, GC: GFP-CTCF-expressing ES cells, 1: del1-, 8: del8-, 9: del9-, 10: del10-, 11: del11-expressing ES cells (n = 2 independent experiments for each line, 10,000 cells measured per experiment). b) Fluorescence intensity comparison of GFP-CTCF, del8, and EB3-GFP. To estimate the average number of expressed GFP-CTCF or del8 molecules in ESC nuclei (see right hand panel for still image of an ES cell colony expressing GFP-CTCF; scale bar: 12 micron), we determined the fluorescence intensity of various dilutions of a purified GFP-tagged protein (EB3-GFP) of known concentration, and generated a calibration curve (lower graph, regression line (red) and function are shown). We next measured GFP-CTCF (GC) and del8 (d8) fluorescence intensity (FI) per nuclear area in arbitrary units (a.u., left hand graph, GC: n = 44 nuclei from 6 independent ESC colonies, d8: n = 70 nuclei from 6 independent ESC colonies, median values with 75% percentile indicated). We then compared intensities of EB3-GFP to that of GFP-CTCF or del8. Based on this analysis we calculated that GFP-CTCF has an average nuclear concentration of 290 nM and del8 of 190 nM. c) Affinity of GFP-CTCF and del8 for chromatin. Taking into account the concentrations of diffusing GFP-CTCF and del 8 (95 nM and 264 nM, respectively, see Fig. 2f), we calculated experimental kon (= apparent kon/[Free CTCF]) and affinity (Kd = koff/ Experimental kon) of GFP-CTCF and del8 for chromatin.
Extended Data Fig. 3 Behaviour of GFP-CTCF- and del8-expressing cell lines.
a) Proliferation assay. GFP-CTCF-expressing ESCs (GC), and three ES cell lines expressing del8 (d8.1, d8.2, d8.3) were seeded at 100000 cells/dish at day 0. We counted cells after 1, 2, or 3 days (n = 3 experiments per cell line per time point, averages ± SD). b) Intracellular distribution of GFP-CTCF (GC) and of del8 in three independently isolated ES cell lines expressing (d8.1, d8.2, d8.3). Asterisk depicts a nuclear area of low fluorescence in the GC line, representing the nucleolus. This area is not visible in the del8 lines. Scale bar: 8 micron. c) Fluorescence intensity (FI) in arbitrary units (a.u.) in the indicated ES cell lines, measured by confocal microscopy in two independent experiments. GC, GFP-CTCF-expressing ES cells (n = 183 nuclei measured), d8: del8-expressing ES cells (d8.1: n = 322 nuclei measured, d8.2: n = 159 nuclei measured, d8.3: n = 167 nuclei measured). Data are presented as whisker plots with median and IQR. Mean values are also indicated (black stripes inside whisker plots, for d8.2 and d8.3 median and mean values overlap). d) Fluorescence recovery after photobleaching (FRAP) experiments in ES cells expressing GFP-CTCF (GC) or del8 (d8.1, d8.2, d8.3). Average values of two FRAP experiments are shown (GC: 22 cells measured, d8.1: 32 cells measured, d8.2: 24 cells measured, d8.3: 27 cells measured). For clarity we did not indicate SEMs. Note that d8.2 is the line used in many of the other experiments presented in this manuscript whereas the GC line is a second independently isolated line expressing GFP-CTCF. Recovery of the del8 mutant proteins is very similar in the three lines, in part because the nuclear concentrations (as examined in panel (c)) are similar. Recovery of GFP-CTCF in the second GC line is also very similar to that of GFP-CTCF in the first GC line shown in Fig. 2.
Extended Data Fig. 4 Analysis of 3D genome topology in ES cells expressing GFP-CTCF or the del8 mutant.
a) Principal component analysis (PCA) of the top 1000 genes with highest standard deviation in ES cells expressing del8 (d8) or GFP-CTCF (GC). We performed RNA-Seq on three biological replicate cultures of three independently isolated d8 lines (d8.1, d8.2, d8.3) and two GC lines (GC1, GC2). One RNA-Seq sample of clone d8.3 showed poor quality metrics and was therefore excluded. b) Reproducibility scores (based on spectral decomposition; on a 0 to 1 scale, see Methods for details) across in-situ Hi-C matrices between the indicated pairwise comparisons of conditions. Note high reproducibility scores (>0.7) amongst biological replicates, which significantly drop (<0.4) when correlating in-situ Hi-C datasets from GC and d8-expressing cells. c) Distance decay curves of chromosome-wide interactions for wildtype and del8 expressing ES cells. d) Absolute PC1 values of genome-wide A and B compartment regions (100 kb bins) that switch compartment when comparing GC and d8-expressing cells. Data shown are merged values from three independent biological replicates. Center lines denote median values; box limits indicate 25th-75th percentiles with whiskers extending 1.5 times the interquartile range. e) Change in mRNA expression levels as measured by RNA-Seq for genes located in genomic bins that switch compartment when comparing GC- and del8-expressing cells. Data shown are values from 2-3 independent biological replicates obtained from 2-3 independently generated ESC clones. Center lines denote median values; box limits indicate 25th-75th percentiles with whiskers extending 1.5 times the interquartile range. f) Absolute PC1 values of genome-wide A and B compartment regions (100 kb bins) for untreated and CTCF-depleted cells. Data were obtained from ref. 29. Data shown are merged values from two independent biological replicates. Center lines denote median values; box limits indicate 25th-75th percentiles with whiskers extending 1.5 times the interquartile range. Pie chart indicated genomic regions (100 kb bin size) that switch compartment (4.1% of the genome) due to CTCF depletion. ***P < 0.001; A: P = 0.00039, B: P = 0.00081 (two-sided Mann Whitney U test). g) 2D meta-plot quantifying inter-TAD interaction frequencies between superenhancers in untreated and CTCF-depleted cells (ref. 29).
Extended Data Fig. 5 Aspects of TAD organization in ES cells expressing GFP-CTCF or del8.
a) Percentage of conserved TAD borders between GC and d8 ESCs (1 bin = 50 kb). b) TAD size distribution in GC and d8 ESCs. Data shown are merged values from three independent biological replicates (presented as whisker plots with IQR and median). c) Log2 fold change (FC) in IS at TAD borders in GC ESCs. Borders were categorized based on >20% increase in IS (‘gain’), >20% decrease in IS (‘loss’) or <20% change in IS (‘stable’) in d8 compared to GC ESCs. d) Overlap between A/B compartment junctions and three TAD border categories. Outward or inward extension indicates positive or negative enrichment, respectively. Dashed circles indicate relative enrichment of 1. Table shows border and overlap statistics. e) IS TAD border correlation plot, comparing IS change in GC vs d8 to 48 h of acute CTCF depletion (data obtained from ref. 29). f) Differential IS at TAD borders in GC vs d8 (left) or after CTCF depletion (ref. 29) (right). Data shown are merged values from two independent biological replicates (presented as whisker plots with IQR and median). g) IS changes (GC-d8 differential values) for TAD borders bound or not bound by CTCF in GC cells. Data shown are merged values from three independent biological replicates (presented as whisker plots with IQR and median; ***P = 0.00094 (two-sided Mann Whitney U test)). h) Overlap between SMC1 loop anchors (top) or published ES cell loops (bottom) and three TAD border categories. Outward or inward extension indicates positive or negative enrichment, respectively. Dashed circles indicates a relative enrichment of 1. i) Meta-plot of Hi-C interaction signals for validated ES cell loops (ref. 50) in GC- and d8-expressing ES cells. j) Sex chromosome analysis of GC- and d8-expressing ESCs. Left upper panel: sex chromosome-specific PCR (m, f: DNA from male or female mouse, respectively). Right hand panels: representative fluorescent x-chromosome images (red). DAPI (green); scale bar: 5 micron. Bar plot (bottom left): x-chromosome number per ESC (n = 5 independent counting experiments, averages ± SD). k) Differential Hi-C interaction map of chromosome X. CTCF peak calls in GC- and d8-expressing cells are indicated on the left. l) Gene expression output (sum of all RPKM values from genes within a border) of three TAD border categories. n.s. = non significant; P > 0.05 (two-sided Mann Whitney U test). m) Precision nuclear run-on sequencing (PRO-Seq) at CTCF sites within ‘stable’ and ‘loss’ category TAD borders. PRO-Seq signals were divided into regions of 2 kb upstream or downstream of CTCF border sites; signal was also compared to a randomly shuffled set of all CTCF peaks (P < 0.0001 for all ‘loss’ versus ‘stable’ comparisons; two-sided Mann Whitney U Test). Data show values from two independent biological replicates (presented as whisker plots with IQR and median). n) CTCF binding site distribution with respect to distance to nearest TSS. Upper graph: all CTCF binding sites, lower graph: UC-containing sites. Inset: detailed distribution in a 5 kb window surrounding the TSS.
Extended Data Fig. 6 Gene expression and DNA methylation changes in ES cells expressing the del8 CTCF mutant.
a) Dnmt3a levels in GC or d8-expressing ESCs. Left: immunofluorescence staining with anti-Dnmt3a antibodies (red); DAPI (blue). We enhanced brightness and contrast to visualize the low Dnmt3a signal in GC cells. This leads to an overexposed image in the d8 panel. Right: quantification of Dnmt3a signal (data are presented as whisker plots with IQR and median, GC, n = 52 nuclei measured, d8, 60 nuclei measured; >5 ESC colonies measured per line, data are from one representative experiment; P = 4.89 E-12, two-sided t-test). b) Western blot analysis of cell lysates from two GC-expressing ESC lines (GC1, GC2) and three d8-expressing ESC lines (d8.1, d8.2, d8.3). Western blots were probed with Dnmt3a (upper) or tubulin (lower) antibodies. ‘L’ and ‘S’ depict long and short Dnmt3a isoforms, respectively. Quantification of Dnmt3a levels (relative to GC2 by measuring Dnmt3a intensity over tubulin. Data are shown below the lanes. The western blot was done three times. c) Dnmt3b levels in GC- and d8-expressing ESCs. Data were taken from the RNA-Seq in Supplementary Table 2 (GC, n = 6 RNA-Seq experiments, d8, n = 8 experiments). ***P < 0.0052 (two-sided Wald test corrected for multiple testing). d) 5MC levels in GC or d8-expressing ESCs. Left: immunofluorescence staining with anti-5mC antibodies (red). Right: quantification of fluorescent 5mC signal (data are presented as whisker plots with IQR and median, GC, n = 49 nuclei, d8, 61 nuclei; >5 ESC colonies measured per line, data are from one representative experiment; P = 1.61 E-12, two-sided t-test). e) Principal component analysis (PCA) of differentially methylated regions (DMRs) showing >2 fold change in methylation in d8- or GC-expressing ESCs. We performed MedSeq in duplicate for three independent d8 lines (d8.1, d8.2, d8.3) and two GC lines (GC1, GC2). f) Genomic localization of differentially methylated regions (DMRs) divided in hyper-methylated (top, n = 290) or hypo-methylated (bottom, n = 82) regions in del8-expressing cells as compared to wildtype ES cells. g) Correlation scatter plot comparing changes in DNA methylation (log2 d8-GC, y-axis) and changes in CTCF ChIP-Seq signal (log2 d8-GC, x-axis) for 87 DMRs (>3 fold change in DNA methylation) that contain a CTCF binding site.
Supplementary information
Supplementary Information
Supplementary Notes 1 and 2 and Supplementary Fig. 1
Supplementary Tables
Supplementary Table 1: Hi-C data statistics. Supplementary Table 2: Differential gene expression analysis using RNA-seq data from GFP–CTCF and del8-expressing ES cells. (A) RPKM. (B) DEG. Supplementary Table 3: Primers used for ZF constructs and sex chromosome analysis.
Supplementary Video 1
GFP–CTCF behaviour during mitosis.
Source data
Source Data Fig. 1
Unprocessed western blots.
Source Data Fig. 2
FACS gating strategy.
Source Data Fig. 2
Statistical source data.
Source Data Fig. 3
Statistical source data.
Source Data Fig. 4
Statistical source data.
Source Data Fig. 5
Statistical source data.
Source Data Fig. 6
Statistical source data.
Source Data Fig. 7
Unprocessed western blots.
Source Data Fig. 7
Statistical source data.
Source Data Extended Data Fig. 1
Statistical source data.
Source Data Extended Data Fig. 2
Statistical source data.
Source Data Extended Data Fig. 3
Statistical source data.
Source Data Extended Data Fig. 4
Statistical source data.
Source Data Extended Data Fig. 5
Unprocessed agarose gel PCR.
Source Data Extended Data Fig. 5
Statistical source data.
Source Data Extended Data Fig. 6
Unprocessed western blots.
Source Data Extended Data Fig. 6
Statistical source data.
Rights and permissions
About this article
Cite this article
Soochit, W., Sleutels, F., Stik, G. et al. CTCF chromatin residence time controls three-dimensional genome organization, gene expression and DNA methylation in pluripotent cells. Nat Cell Biol 23, 881–893 (2021). https://doi.org/10.1038/s41556-021-00722-w
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41556-021-00722-w
This article is cited by
-
A continuum of zinc finger transcription factor retention on native chromatin underlies dynamic genome organization
Molecular Systems Biology (2024)
-
The PTM profiling of CTCF reveals the regulation of 3D chromatin structure by O-GlcNAcylation
Nature Communications (2024)
-
The impact of DNA methylation on CTCF-mediated 3D genome organization
Nature Structural & Molecular Biology (2024)
-
Auxin-inducible degron 2 system deciphers functions of CTCF domains in transcriptional regulation
Genome Biology (2023)
-
CTCF is a DNA-tension-dependent barrier to cohesin-mediated loop extrusion
Nature (2023)