Super-resolution imaging reveals distinct chromatin folding for different epigenetic states

Journal name:
Nature
Volume:
529,
Pages:
418–422
Date published:
DOI:
doi:10.1038/nature16496
Received
Accepted
Published online

Metazoan genomes are spatially organized at multiple scales, from packaging of DNA around individual nucleosomes to segregation of whole chromosomes into distinct territories1, 2, 3, 4, 5. At the intermediate scale of kilobases to megabases, which encompasses the sizes of genes, gene clusters and regulatory domains, the three-dimensional (3D) organization of DNA is implicated in multiple gene regulatory mechanisms2, 3, 4, 6, 7, 8, but understanding this organization remains a challenge. At this scale, the genome is partitioned into domains of different epigenetic states that are essential for regulating gene expression9, 10, 11. Here we investigate the 3D organization of chromatin in different epigenetic states using super-resolution imaging. We classified genomic domains in Drosophila cells into transcriptionally active, inactive or Polycomb-repressed states, and observed distinct chromatin organizations for each state. All three types of chromatin domains exhibit power-law scaling between their physical sizes in 3D and their domain lengths, but each type has a distinct scaling exponent. Polycomb-repressed domains show the densest packing and most intriguing chromatin folding behaviour, in which chromatin packing density increases with domain length. Distinct from the self-similar organization displayed by transcriptionally active and inactive chromatin, the Polycomb-repressed domains are characterized by a high degree of chromatin intermixing within the domain. Moreover, compared to inactive domains, Polycomb-repressed domains spatially exclude neighbouring active chromatin to a much stronger degree. Computational modelling and knockdown experiments suggest that reversible chromatin interactions mediated by Polycomb-group proteins play an important role in these unique packaging properties of the repressed chromatin. Taken together, our super-resolution images reveal distinct chromatin packaging for different epigenetic states at the kilobase-to-megabase scale, a length scale that is directly relevant to genome regulation.

At a glance

Figures

  1. Chromatin in different epigenetic states exhibits distinct packaging and power-law scaling.
    Figure 1: Chromatin in different epigenetic states exhibits distinct packaging and power-law scaling.

    a, Enrichment profile of H3K4me2 (red), H3K27me3 (light blue) and unmodified H3 (black) in three genomic regions, each harbouring an example Active, Inactive or Repressed domain (indicated by brackets). Marker enrichment, as defined in Supplementary Methods, was determined from ChIP-seq data20. b, 3D-STORM images of the three distinct epigenetic domains in a, labelled by in situ hybridization with DNA probes conjugated to the photoswitchable dye Alexa-647, shown with their corresponding conventional images in the inset. Each epigenetic domain appears as a single region in nearly all cells due to homologous pairing in the tetraploid Kc167 cells. c, log–log plot of the median domain volume as a function of domain length for Active (red solid circles), Inactive (black solid circles) and Repressed (light blue solid circles) domains, as well as for Repressed domains in Ph-knockdown (Ph KD) cells (light blue hollow circles). Error bars represent 95% confidence intervals derived from resampling (n ≈ 50 cells). The lines indicate power-law fits, with the scaling exponent b shown in the legend. d, As in c but for the radius of gyration as a function of domain length with the scaling exponent c shown in the legend.

  2. Different types of epigenetic domains exhibit distinct subdomain scaling and intermixing.
    Figure 2: Different types of epigenetic domains exhibit distinct subdomain scaling and intermixing.

    a, Marker enrichment profile of three genomic regions with the example epigenetic domains marked by brackets and imaged subdomains marked by green and magenta lines. b, Linear plot of the radius of gyration as a function of the subdomain length (green symbols), compared to those for the whole domain data (red, black or light blue circles), for Active (left), Inactive (middle) and Repressed chromatin (right). Different green symbols (triangle and squares) represent subdomains of two different parent domains. Power-law fits of subdomains (green solid lines) and whole domains (red, black and light blue dashed lines) are shown with the scaling exponent c given in the legends. The green lines in the right panel are to guide the eye. c, Two-colour, 3D-STORM images of example pairs of subdomains within Active (left), Inactive (middle), and Repressed (right) domains. Portions of the two subdomains that overlap in 3D are shown in white. The two subdomains are labelled with Alexa-647- and Alexa-750-tagged DNA probes, respectively. d, Quantification of overlap fraction between the subdomains for Active (red), Inactive (black), or Repressed (light blue) chromatin (Supplementary Methods). Error bars represent 95% confidence intervals derived from resampling (n ≈ 50 cells).

  3. Neighbouring chromatin domains show different amount of intermixing for different types of epigenetic boundaries.
    Figure 3: Neighbouring chromatin domains show different amount of intermixing for different types of epigenetic boundaries.

    a, Marker enrichment profile of a genomic region harbouring three epigenetic domains marked by the magenta and green lines. b, Two-colour, 3D-STORM images for neighbouring epigenetic domains with Repressed–Active and Inactive–Active boundaries in wild-type cells, as well as for the Repressed–Active boundaries in Ph-knockdown (Ph KD) cells. c, Quantification of overlap fraction between the indicated neighbouring domains (Supplementary Methods). Error bars represent 95% confidence intervals derived from resampling (n ≈ 50 cells).

  4. Computational modelling of chromatin packaging for Inactive and Repressed domains.
    Figure 4: Computational modelling of chromatin packaging for Inactive and Repressed domains.

    a, Radius of gyration of polymer domains as a function of the domain length for simulated polymers confined in a small volume to emulate Inactive chromatin. The line indicates the power-law fit with the scaling exponent c = 0.33. b, Radius of gyration of subdomains of a parent Inactive domain (green triangles) compared to the whole-domain scaling data (black circles) with power-law fits and scaling exponents shown. c, Radius of gyration as a function of the domain length for the simulated sticky polymer domain, embedded in a non-stick, confined polymer, to emulate Repressed chromatin. The light blue line represent the power-law fit with the scaling exponent c = 0.22. The black dotted line indicates the point at which the closest possible packing of monomers is reached, causing c to deviate from 0.22 and approach 0.33 beyond this point. The exponent c also deviates from 0.22 at the small monomer number end because such a short polymer cannot sufficiently bend. d, Radius of gyration of the subdomains (green triangles) of the parent sticky domain in comparison with the whole-domain scaling data (light blue circles). e, Snapshots of simulations showing adjacent subdomains in the Inactive (non-sticky) chromatin (left), adjacent subdomains of the Repressed (sticky) chromatin (middle), and adjacent Repressed and Inactive chromatin domains (right). f, Quantification of the overlap fraction (Supplementary Methods) between the adjacent polymer regions illustrated in e. Error bars indicate 95% confidence intervals derived from resampling (n ≈ 20 simulations).

  5. Schematic of oligonucleotide probe design and synthesis.
    Extended Data Fig. 1: Schematic of oligonucleotide probe design and synthesis.

    A unique pair of index primers are used in a PCR reaction to selectively amplify the templates for the probe set of interest from a complex pool of custom, array-derived oligonucleotides. These templates are then amplified and converted to RNA in an in vitro transcription reaction. The RNA products are converted back to DNA in a reverse-transcription reaction using a primer labelled with an activator dye, Alexa-405, which incorporates the dye into the resulting single-stranded DNA probe. Finally, a 32-nucleotide oligonucleotide attached to Alexa-647 is hybridized to all of the probes. The photoswitchable dye, Alexa-647, is used for STORM imaging. The activator dye, Alexa-405, facilitates the 405-nm light induced reactivation of the Alexa-647 dye.

  6. Effect of cell fixation on chromatin sizes.
    Extended Data Fig. 2: Effect of cell fixation on chromatin sizes.

    a, Top, example images of DNA in a Kc167 cell, visualized with the viable DNA dye Hoechst 33342, both in the live cell before fixation and in the same cell after applying our fixation buffer (osmotically balanced methanol-free formaldehyde in PBS). Bottom, same as the top panels but for a Kc167 cell before and after fixation with methanol, a fixative that is known to cause a shrinkage effect. b, Quantifications of the distances between chromatin features in live and fixed cells. Corresponding chromatin features were identified in the live and corresponding fixed cell images through scale-invariant feature transform (SIFT) registration. We measured distances between pairs of identified SIFT features in each cell, and calculated the ratio between the median inter-feature distances before and after fixation for each cell. Plotted here are the histograms of ratios determined from many cells for fixation with our osmotically balanced methanol-free formaldehyde fixation buffer (magenta), for a ‘mock fixed’ condition in which the growth media was replaced with fresh media without any fixation reagent (cyan), and for fixation with methanol (red), n ≈ 80 cells in each cases. The average ratios are 1.009 ± 0.003 and 1.008 ± 0.003 for fixation with our fixation buffer and the mock fixation, respectively, indicating a lack of shrinkage effect. In contrast, the average ratio for the methanol-fixation case (0.868 ± 0.005) is appreciably less than one, indicating a chromosome shrinkage induced by methanol. c, STORM images of TRF1-mMaple3 labelled telomeres in live and fixed HEK293 cells. Cells are fixed with our osmotically balanced methanol-free formaldehyde in PBS. Two examples of telomere STORM images are shown for each condition. d, Quantifications of the radius of gyration of the telomeric domains in live and fixed cells. We determined the radius of gyration for each telomere structure and plotted here are the histograms of the radii of gyration across about 150 telomeres from around 30 cells for live (cyan) and fixed (magenta) cells. The average radius of gyration is 77 ± 3 nm for live cells and 78 ± 2 nm for fixed cells, again indicating that there is no significant chromatin shrinkage effect upon fixation. The telomere size measurement is not limited by our image resolution with mMaple3 (~30 nm).

  7. Volume, radius of gyration and other shape characteristics for chromatin domains of various domain lengths in three different epigenetic states.
    Extended Data Fig. 3: Volume, radius of gyration and other shape characteristics for chromatin domains of various domain lengths in three different epigenetic states.

    a, Scheme of Drosophila chromosomes (X, 2L, 2R, 3L, and 3R) with the position of the imaged epigenetic domains marked (red, Active domains A-01 to A-23; black, Inactive domains I-01 to I-14; blue, Repressed domains R-01 to R-11). b, log–log plot of the median domain volume as a function of domain contour length reproduced from Fig. 1c but with the domain ID labelled. c, As in Fig. 1c but plotted on a linear–linear scale. d, Linear plot of the median radius of gyration as a function of domain contour length. e, Coefficient of variation (CV) in density per voxel for all domains as a function of domain length. CV in density is defined as the ratio of the standard deviation of density to the average density within the domain-occupied volume, which characterizes how uniformly the chromatin is distributed in space within these domains (Supplementary Methods). f, Ratio of surface area to volume2/3 for all domains as a function of domain length. This surface-to-volume parameter characterizes the complexity of the physical shapes taken by the domains in 3D (Supplementary Methods). Error bars represent 95% confidence intervals derived from resampling (n ≈ 50 cells).

  8. Conventional images of chromatin domains and domain volume characterization based on conventional images.
    Extended Data Fig. 4: Conventional images of chromatin domains and domain volume characterization based on conventional images.

    a, Blow-up view of the conventional images of chromatin domains shown in Fig. 1b. The left column shows the raw conventional, wide-field images, with pixel size defined by our camera. The right column shows the corresponding anti-aliased and de-noised images. b, Quantification of the median domain volume determined from conventional images (foreground symbols), overlaid on the median volume determined from STORM data plotted in Fig. 1c (faint background symbols and lines). Error bars represent 95% confidence intervals derived from resampling (n ≈ 50 cells). Note that the conventional images may not only cause an artificial increase in domain size, especially severe for those domains whose physical sizes are smaller than the image resolution, but can also lead to an apparent decrease in domain size in some cases when the thin protrusions were too dim to detect by conventional imaging.

  9. Distributions of domain volume and radius of gyration of different epigenetic domains and subdomains over all imaged cells.
    Extended Data Fig. 5: Distributions of domain volume and radius of gyration of different epigenetic domains and subdomains over all imaged cells.

    a, Histograms of domain volume for all imaged cells for each of the domains shown in Extended Data Fig. 2a. Red, Active domains; black, Inactive domains; light blue, Repressed domains. The domain IDs are indicated in the upper right corner of each plot. The x-axis (volume) range has been adjusted for each domain to ensure the readability of the histogram. b, Histograms of the radius of gyration for each of the imaged domains in all cells. c, Histograms of the radius of gyration for subdomains of Active (red), Inactive (black) and Repressed (blue) chromatin, shown in Fig. 2b and Extended Data Fig. 6, for all imaged cells. The subdomain IDs are indicated in the upper right corner of each plot.

  10. Additional data on the scaling behaviour of subdomains of Repressed chromatin.
    Extended Data Fig. 6: Additional data on the scaling behaviour of subdomains of Repressed chromatin.

    a, Enrichment profile of H3K4me2 (red), H3K27me3 (light blue) and unmodified H3 (black) in a genomic region harbouring the Repressed domain R-11 (Antennapedia complex). b, The radius of gyration of subdomains (green triangles) of R-11 as a function of subdomain length compared to the scaling of whole Repressed domains (light blue circles). The data shown in Fig. 2b, right panel are for the R-10 domain (Bithorax complex). The light blue dashed line indicates the power-law fit for the whole domain data. Green lines are to guide the eye. Error bars represent 95% confidence intervals (n ≈ 50 cells).

  11. Effect of polyhomeotic (Ph) knockdown.
    Extended Data Fig. 7: Effect of polyhomeotic (Ph) knockdown.

    a, Quantification of relative change in gene expression by qPCR (mean ± s.e.m., n = 3 biological replicates) upon ph-p and ph-d double knockdown. Grey bars, expression fold change of ph-p and ph-d upon double knockdown. Light blue bars, expression fold change of five Polycomb target genes, Ubx, Abd-B, Dfd, Antp, en. Red bars, expression fold change of three control genes, Act5c, alphaTub84b and Gapdh1, that are not targeted by Polycomb. Expression fold change was determined as the ratio between the signal detected in ph-p and ph-d double knockdown cells and that detected in wild-type cells. b, Average expression fold change upon ph-p and ph-d double knockdown for all genes in all of the Active (red), Inactive (black) and Repressed (light blue) domains included in our study. The expression fold change is defined as the ratio of expression level measured in Ph-knockdown cells to that measured in the wild-type control cells determined by next generation RNA sequencing (mean ± s.e.m., n = 45, 89 and 532 genes for Repressed, Inactive and Active domains, respectively, 2 biological replicates). Expression level was measured in units of read fragments per kilobase per million reads (FPKM). Note, some genes (9 from Repressed regions, 11 from Inactive regions and 2 from Active regions) are excluded from the average expression fold change calculation because they received zero counts in the wild-type control cells. c, Example images of the R-10 domain (Bithorax complex) in wild-type (left) and Ph-knockdown (right) cells. d, Radius of gyration versus domain length for subdomains of R-10 in wild-type cells (solid green triangles) and Ph-knockdown cells (hollow green triangles). Error bars represent 95% confidence intervals (n ≈ 50 cells).

  12. Locus-to-locus variation observed for the three types of epigenetic domains after normalization based on the observed scaling law over domain length.
    Extended Data Fig. 8: Locus-to-locus variation observed for the three types of epigenetic domains after normalization based on the observed scaling law over domain length.

    a, Top, the normalized volume for domains of Active (left), Inactive (middle), and Repressed (right) chromatin. Normalized volume is defined as the ratio of median volume of the domain to the expected volume calculated from the power-law scaling fits shown in Fig. 1c. Error bars represent 78% confidence intervals, such that there is a less than 5% chance that domains with non-overlapping error bars are not distinct (n ≈ 50 cells). b, Volcano plots of the relative differences in volume between all pairs of Active domains (left), Inactive domains (middle), or Repressed domains (right), after the normalization shown in a. Each data point represents one pair of domains with their ratio of the normalized volumes plotted on the x-axis and the P value of their normalized volume difference plotted on the y-axis. The dashed line is at a P value of 0.05. All dots above this line represent pairs of domains in which the normalized volume of one domain is statistically distinguishable from that of the other domain. c, Standard deviation of the normalized volumes for each domain type. Error-bars represent 95% confidence intervals (n ≈ 50 cells).

  13. Additional factors correlating with the domain volume after normalizing the effect of domain length for Active domains.
    Extended Data Fig. 9: Additional factors correlating with the domain volume after normalizing the effect of domain length for Active domains.

    To normalize for the effect of domain length, we determined the percent deviation of the volumes of Active domains from the power-law scaling trend line shown in Fig. 1c, and hereafter refer to this value as percent deviation from trend line. a, Correlation of the percent deviation from trend line with the binding density of the insulator proteins BEAF32 (left) and CTCF (right). Binding density was determined from the density of peaks per kb in Dam-ID data20. Peaks were defined as local maxima at least 2 standards deviation above the mean. b, As in a but for correlation with transcription start site (TSS) density (left) and RNA-seq total read density (right). The TSS density is defined as the average number of TSSs per kb in the domain and the RNA-seq total read density is defined as the total number of reads mapping to the domain measured using RNA sequencing divided by the domain length in kb. Error bars represent 95% confidence intervals (n ≈ 50 cells). c, Pearson correlation coefficients and corresponding P values for the correlation of percent deviation from trend line with the indicated genomic factors. Average gene expression refers to the average expression value in FPKM (read fragments per kilobase per million reads) of all genes in the domain. Maximum gene expression refers to the FPKM of the most highly expressed gene in the domain. Su(Hw) is an insulator protein like BEAF32 and CTCF. We noticed a weak trend in which domains with higher binding densities of the insulators BEAF32 or Su(Hw) are slightly more compact. Although this trend is consistent with the hypothesis that insulator proteins may function as loop forming factors and that loops may lead to more compact domains12, the correlation detected here was not statistically significant. Further analysis with improved sensitivity in detection of BEAF32 or Su(Hw) binding sites might uncover a stronger effect, so our data do not rule out the insulator loop hypothesis. Similarly, we caution that the positive correlation observed with the density of CTCF binding sites might reflect the preference of CTCF to bind open chromatin regions (such as enhancers and promoters), and does not necessarily suggest that CTCF binding induces a more open chromatin state.

Tables

  1. Epigenetic domains and subdomains probed
    Extended Data Table 1: Epigenetic domains and subdomains probed

Accession codes

Primary accessions

Gene Expression Omnibus

References

  1. Kornberg, R. D. & Lorch, Y. Twenty-five years of the nucleosome, fundamental particle of the eukaryote chromosome. Cell 98, 285294 (1999)
  2. Bickmore, W. A. The spatial organization of the human genome. Annu. Rev. Genomics Hum. Genet. 14, 6784 (2013)
  3. Gibcus, J. H. & Dekker, J. The hierarchy of the 3D genome. Mol. Cell 49, 773782 (2013)
  4. Levine, M., Cattoglio, C. & Tjian, R. Looping back to leap forward: transcription enters a new era. Cell 157, 1325 (2014)
  5. Cremer, T. & Cremer, M. Chromosome territories. Cold Spring Harb. Perspect. Biol. 2, a003889 (2010)
  6. Gorkin, D. U., Leung, D. & Ren, B. The 3D genome in transcriptional regulation and pluripotency. Cell Stem Cell 14, 762775 (2014)
  7. Sexton, T. & Cavalli, G. The role of chromosome domains in shaping the functional genome. Cell 160, 10491059 (2015)
  8. Galupa, R. & Heard, E. X-chromosome inactivation: new insights into cis and trans regulation. Curr. Opin. Genet. Dev. 31, 5766 (2015)
  9. Bernstein, B. E., Meissner, A. & Lander, E. S. The mammalian epigenome. Cell 128, 669681 (2007)
  10. Bickmore, W. A. & van Steensel, B. Genome architecture: domain organization of interphase chromosomes. Cell 152, 12701284 (2013)
  11. Rivera, C. M. & Ren, B. Mapping human epigenomes. Cell 155, 3955 (2013)
  12. Gómez-Díaz, E. & Corces, V. G. Architectural proteins: regulators of 3D genome organization in cell fate. Trends Cell Biol. 24, 703711 (2014)
  13. Beliveau, B. J. et al. Versatile design and synthesis platform for visualizing genomes with Oligopaint FISH probes. Proc. Natl Acad. Sci. USA 109, 2130121306 (2012)
  14. Beliveau, B. J. et al. Single-molecule super-resolution imaging of chromosomes and in situ haplotype visualization using Oligopaint FISH probes. Nat. Commun. 6, 7147 (2015)
  15. Boyle, S., Rodesch, M. J., Halvensleben, H. A., Jeddeloh, J. A. & Bickmore, W. A. Fluorescence in situ hybridization with high-complexity repeat-free oligonucleotide probes generated by massively parallel synthesis. Chromosome Res. 19, 901909 (2011)
  16. Yamada, N. A. et al. Visualization of fine-scale genomic structure by oligonucleotide-based high-resolution FISH. Cytogenet. Genome Res. 132, 248254 (2011)
  17. Chen, K. H., Boettiger, A. N., Moffitt, J. R., Wang, S. & Zhuang, X. Spatially resolved, highly multiplexed RNA profiling in single cells. Science 348, 412 (2015)
  18. Rust, M. J., Bates, M. & Zhuang, X. Sub-diffraction-limit imaging by stochastic optical reconstruction microscopy (STORM). Nature Methods 3, 793796 (2006)
  19. Huang, B., Wang, W., Bates, M. & Zhuang, X. Three-dimensional super-resolution imaging by stochastic optical reconstruction microscopy. Science 319, 810813 (2008)
  20. Filion, G. J. et al. Systematic protein location mapping reveals five principal chromatin types in Drosophila cells. Cell 143, 212224 (2010)
  21. Simon, J. A. & Kingston, R. E. Occupying chromatin: Polycomb mechanisms for getting to genomic targets, stopping transcriptional traffic, and staying put. Mol. Cell 49, 808824 (2013)
  22. Grossniklaus, U. & Paro, R. Transcriptional silencing by polycomb-group proteins. Cold Spring Harb. Perspect. Biol. 6, a019331 (2014)
  23. Cheutin, T. & Cavalli, G. Polycomb silencing: from linear chromatin domains to 3D chromosome folding. Curr. Opin. Genet. Dev. 25, 3037 (2014)
  24. Sexton, T. et al., Three-dimensional folding and functional organization principles of the Drosophila genome. Cell 148, 458472 (2012)
  25. Lieberman-Aiden, E. et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289293 (2009)
  26. Mirny, L. A. The fractal globule as a model of chromatin architecture in the cell. Chromosome Res. 19, 3751 (2011)
  27. Halverson, J. D., Smrek, J., Kremer, K. & Grosberg, A. Y. From a melt of rings to chromosome territories: the role of topological constraints in genome folding. Rep. Prog. Phys. 77, 022601 (2014)
  28. Nicodemi, M. & Pombo, A. Models of chromosome structure. Curr. Opin. Cell Biol. 28, 9095 (2014)
  29. Isono, K. et al. SAM domain polymerization links subnuclear clustering of PRC1 to gene silencing. Dev. Cell 26, 565577 (2013)
  30. Spitz, F. & Furlong, E. E. M. Transcription factors: from enhancer binding to developmental control. Nature Rev. Genet. 13, 613626 (2012)

Download references

Author information

Affiliations

  1. Howard Hughes Medical Institute, Department of Chemistry and Chemical Biology, Department of Physics, Harvard University, Cambridge, Massachusetts 02138, USA

    • Alistair N. Boettiger,
    • Bogdan Bintu,
    • Jeffrey R. Moffitt,
    • Siyuan Wang &
    • Xiaowei Zhuang
  2. Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA

    • Brian J. Beliveau &
    • Chao-ting Wu
  3. Institute for Medical Engineering and Science, and Department of Physics, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, USA

    • Geoffrey Fudenberg,
    • Maxim Imakaev &
    • Leonid A. Mirny

Contributions

A.N.B. and X.Z. conceived the project. A.N.B., B.B., and X.Z. designed the experiments and simulations with input from C.-t.W. and L.A.M.; A.N.B. and B.B. performed the experiments, data analysis and simulations. J.R.M. conceived the high yield probe synthesis method, and J.R.M. and A.N.B developed this method. B.J.B. assisted with initial probe synthesis. S.W. assisted with live-cell STORM experiments. G.F. and M.I. assisted with methods for simulation. A.N.B., B.B., L.A.M., C.-t.W. and X.Z. interpreted the results. A.N.B., B.B., J.R.M. and X.Z. wrote the manuscript, with inputs from B.J.B., G.F., M.I., L.A.M. and C.-t.W.

Competing financial interests

The authors declare no competing financial interests.

Corresponding author

Correspondence to:

RNA-Seq data have been deposited at Gene Expression Omnibus (GEO) under the accession code GSE75060.

Author details

Extended data figures and tables

Extended Data Figures

  1. Extended Data Figure 1: Schematic of oligonucleotide probe design and synthesis. (213 KB)

    A unique pair of index primers are used in a PCR reaction to selectively amplify the templates for the probe set of interest from a complex pool of custom, array-derived oligonucleotides. These templates are then amplified and converted to RNA in an in vitro transcription reaction. The RNA products are converted back to DNA in a reverse-transcription reaction using a primer labelled with an activator dye, Alexa-405, which incorporates the dye into the resulting single-stranded DNA probe. Finally, a 32-nucleotide oligonucleotide attached to Alexa-647 is hybridized to all of the probes. The photoswitchable dye, Alexa-647, is used for STORM imaging. The activator dye, Alexa-405, facilitates the 405-nm light induced reactivation of the Alexa-647 dye.

  2. Extended Data Figure 2: Effect of cell fixation on chromatin sizes. (254 KB)

    a, Top, example images of DNA in a Kc167 cell, visualized with the viable DNA dye Hoechst 33342, both in the live cell before fixation and in the same cell after applying our fixation buffer (osmotically balanced methanol-free formaldehyde in PBS). Bottom, same as the top panels but for a Kc167 cell before and after fixation with methanol, a fixative that is known to cause a shrinkage effect. b, Quantifications of the distances between chromatin features in live and fixed cells. Corresponding chromatin features were identified in the live and corresponding fixed cell images through scale-invariant feature transform (SIFT) registration. We measured distances between pairs of identified SIFT features in each cell, and calculated the ratio between the median inter-feature distances before and after fixation for each cell. Plotted here are the histograms of ratios determined from many cells for fixation with our osmotically balanced methanol-free formaldehyde fixation buffer (magenta), for a ‘mock fixed’ condition in which the growth media was replaced with fresh media without any fixation reagent (cyan), and for fixation with methanol (red), n ≈ 80 cells in each cases. The average ratios are 1.009 ± 0.003 and 1.008 ± 0.003 for fixation with our fixation buffer and the mock fixation, respectively, indicating a lack of shrinkage effect. In contrast, the average ratio for the methanol-fixation case (0.868 ± 0.005) is appreciably less than one, indicating a chromosome shrinkage induced by methanol. c, STORM images of TRF1-mMaple3 labelled telomeres in live and fixed HEK293 cells. Cells are fixed with our osmotically balanced methanol-free formaldehyde in PBS. Two examples of telomere STORM images are shown for each condition. d, Quantifications of the radius of gyration of the telomeric domains in live and fixed cells. We determined the radius of gyration for each telomere structure and plotted here are the histograms of the radii of gyration across about 150 telomeres from around 30 cells for live (cyan) and fixed (magenta) cells. The average radius of gyration is 77 ± 3 nm for live cells and 78 ± 2 nm for fixed cells, again indicating that there is no significant chromatin shrinkage effect upon fixation. The telomere size measurement is not limited by our image resolution with mMaple3 (~30 nm).

  3. Extended Data Figure 3: Volume, radius of gyration and other shape characteristics for chromatin domains of various domain lengths in three different epigenetic states. (384 KB)

    a, Scheme of Drosophila chromosomes (X, 2L, 2R, 3L, and 3R) with the position of the imaged epigenetic domains marked (red, Active domains A-01 to A-23; black, Inactive domains I-01 to I-14; blue, Repressed domains R-01 to R-11). b, log–log plot of the median domain volume as a function of domain contour length reproduced from Fig. 1c but with the domain ID labelled. c, As in Fig. 1c but plotted on a linear–linear scale. d, Linear plot of the median radius of gyration as a function of domain contour length. e, Coefficient of variation (CV) in density per voxel for all domains as a function of domain length. CV in density is defined as the ratio of the standard deviation of density to the average density within the domain-occupied volume, which characterizes how uniformly the chromatin is distributed in space within these domains (Supplementary Methods). f, Ratio of surface area to volume2/3 for all domains as a function of domain length. This surface-to-volume parameter characterizes the complexity of the physical shapes taken by the domains in 3D (Supplementary Methods). Error bars represent 95% confidence intervals derived from resampling (n ≈ 50 cells).

  4. Extended Data Figure 4: Conventional images of chromatin domains and domain volume characterization based on conventional images. (182 KB)

    a, Blow-up view of the conventional images of chromatin domains shown in Fig. 1b. The left column shows the raw conventional, wide-field images, with pixel size defined by our camera. The right column shows the corresponding anti-aliased and de-noised images. b, Quantification of the median domain volume determined from conventional images (foreground symbols), overlaid on the median volume determined from STORM data plotted in Fig. 1c (faint background symbols and lines). Error bars represent 95% confidence intervals derived from resampling (n ≈ 50 cells). Note that the conventional images may not only cause an artificial increase in domain size, especially severe for those domains whose physical sizes are smaller than the image resolution, but can also lead to an apparent decrease in domain size in some cases when the thin protrusions were too dim to detect by conventional imaging.

  5. Extended Data Figure 5: Distributions of domain volume and radius of gyration of different epigenetic domains and subdomains over all imaged cells. (398 KB)

    a, Histograms of domain volume for all imaged cells for each of the domains shown in Extended Data Fig. 2a. Red, Active domains; black, Inactive domains; light blue, Repressed domains. The domain IDs are indicated in the upper right corner of each plot. The x-axis (volume) range has been adjusted for each domain to ensure the readability of the histogram. b, Histograms of the radius of gyration for each of the imaged domains in all cells. c, Histograms of the radius of gyration for subdomains of Active (red), Inactive (black) and Repressed (blue) chromatin, shown in Fig. 2b and Extended Data Fig. 6, for all imaged cells. The subdomain IDs are indicated in the upper right corner of each plot.

  6. Extended Data Figure 6: Additional data on the scaling behaviour of subdomains of Repressed chromatin. (130 KB)

    a, Enrichment profile of H3K4me2 (red), H3K27me3 (light blue) and unmodified H3 (black) in a genomic region harbouring the Repressed domain R-11 (Antennapedia complex). b, The radius of gyration of subdomains (green triangles) of R-11 as a function of subdomain length compared to the scaling of whole Repressed domains (light blue circles). The data shown in Fig. 2b, right panel are for the R-10 domain (Bithorax complex). The light blue dashed line indicates the power-law fit for the whole domain data. Green lines are to guide the eye. Error bars represent 95% confidence intervals (n ≈ 50 cells).

  7. Extended Data Figure 7: Effect of polyhomeotic (Ph) knockdown. (182 KB)

    a, Quantification of relative change in gene expression by qPCR (mean ± s.e.m., n = 3 biological replicates) upon ph-p and ph-d double knockdown. Grey bars, expression fold change of ph-p and ph-d upon double knockdown. Light blue bars, expression fold change of five Polycomb target genes, Ubx, Abd-B, Dfd, Antp, en. Red bars, expression fold change of three control genes, Act5c, alphaTub84b and Gapdh1, that are not targeted by Polycomb. Expression fold change was determined as the ratio between the signal detected in ph-p and ph-d double knockdown cells and that detected in wild-type cells. b, Average expression fold change upon ph-p and ph-d double knockdown for all genes in all of the Active (red), Inactive (black) and Repressed (light blue) domains included in our study. The expression fold change is defined as the ratio of expression level measured in Ph-knockdown cells to that measured in the wild-type control cells determined by next generation RNA sequencing (mean ± s.e.m., n = 45, 89 and 532 genes for Repressed, Inactive and Active domains, respectively, 2 biological replicates). Expression level was measured in units of read fragments per kilobase per million reads (FPKM). Note, some genes (9 from Repressed regions, 11 from Inactive regions and 2 from Active regions) are excluded from the average expression fold change calculation because they received zero counts in the wild-type control cells. c, Example images of the R-10 domain (Bithorax complex) in wild-type (left) and Ph-knockdown (right) cells. d, Radius of gyration versus domain length for subdomains of R-10 in wild-type cells (solid green triangles) and Ph-knockdown cells (hollow green triangles). Error bars represent 95% confidence intervals (n ≈ 50 cells).

  8. Extended Data Figure 8: Locus-to-locus variation observed for the three types of epigenetic domains after normalization based on the observed scaling law over domain length. (224 KB)

    a, Top, the normalized volume for domains of Active (left), Inactive (middle), and Repressed (right) chromatin. Normalized volume is defined as the ratio of median volume of the domain to the expected volume calculated from the power-law scaling fits shown in Fig. 1c. Error bars represent 78% confidence intervals, such that there is a less than 5% chance that domains with non-overlapping error bars are not distinct (n ≈ 50 cells). b, Volcano plots of the relative differences in volume between all pairs of Active domains (left), Inactive domains (middle), or Repressed domains (right), after the normalization shown in a. Each data point represents one pair of domains with their ratio of the normalized volumes plotted on the x-axis and the P value of their normalized volume difference plotted on the y-axis. The dashed line is at a P value of 0.05. All dots above this line represent pairs of domains in which the normalized volume of one domain is statistically distinguishable from that of the other domain. c, Standard deviation of the normalized volumes for each domain type. Error-bars represent 95% confidence intervals (n ≈ 50 cells).

  9. Extended Data Figure 9: Additional factors correlating with the domain volume after normalizing the effect of domain length for Active domains. (176 KB)

    To normalize for the effect of domain length, we determined the percent deviation of the volumes of Active domains from the power-law scaling trend line shown in Fig. 1c, and hereafter refer to this value as percent deviation from trend line. a, Correlation of the percent deviation from trend line with the binding density of the insulator proteins BEAF32 (left) and CTCF (right). Binding density was determined from the density of peaks per kb in Dam-ID data20. Peaks were defined as local maxima at least 2 standards deviation above the mean. b, As in a but for correlation with transcription start site (TSS) density (left) and RNA-seq total read density (right). The TSS density is defined as the average number of TSSs per kb in the domain and the RNA-seq total read density is defined as the total number of reads mapping to the domain measured using RNA sequencing divided by the domain length in kb. Error bars represent 95% confidence intervals (n ≈ 50 cells). c, Pearson correlation coefficients and corresponding P values for the correlation of percent deviation from trend line with the indicated genomic factors. Average gene expression refers to the average expression value in FPKM (read fragments per kilobase per million reads) of all genes in the domain. Maximum gene expression refers to the FPKM of the most highly expressed gene in the domain. Su(Hw) is an insulator protein like BEAF32 and CTCF. We noticed a weak trend in which domains with higher binding densities of the insulators BEAF32 or Su(Hw) are slightly more compact. Although this trend is consistent with the hypothesis that insulator proteins may function as loop forming factors and that loops may lead to more compact domains12, the correlation detected here was not statistically significant. Further analysis with improved sensitivity in detection of BEAF32 or Su(Hw) binding sites might uncover a stronger effect, so our data do not rule out the insulator loop hypothesis. Similarly, we caution that the positive correlation observed with the density of CTCF binding sites might reflect the preference of CTCF to bind open chromatin regions (such as enhancers and promoters), and does not necessarily suggest that CTCF binding induces a more open chromatin state.

Extended Data Tables

  1. Extended Data Table 1: Epigenetic domains and subdomains probed (786 KB)

Supplementary information

PDF files

  1. Supplementary Information (701 KB)

    This file contains Supplementary Methods and Supplementary References.

Excel files

  1. Supplementary Table 1 (6.1 MB)

    This file lists of all the sequences for the oligonucleotide probe sets used to label the epigenetic and subdomains shown in the Extended Data Table.

Additional data