Complex small-world regulatory networks emerge from the 3D organisation of the human genome

Brackley, C. A.; Gilbert, N.; Michieletto, D.; Papantonis, A.; Pereira, M. C. F.; Cook, P. R.; Marenduzzo, D.

doi:10.1038/s41467-021-25875-y

Download PDF

Article
Open access
Published: 01 October 2021

Complex small-world regulatory networks emerge from the 3D organisation of the human genome

Nature Communications volume 12, Article number: 5756 (2021) Cite this article

5044 Accesses
13 Citations
40 Altmetric
Metrics details

Subjects

Abstract

The discovery that overexpressing one or a few critical transcription factors can switch cell state suggests that gene regulatory networks are relatively simple. In contrast, genome-wide association studies (GWAS) point to complex phenotypes being determined by hundreds of loci that rarely encode transcription factors and which individually have small effects. Here, we use computer simulations and a simple fitting-free polymer model of chromosomes to show that spatial correlations arising from 3D genome organisation naturally lead to stochastic and bursty transcription as well as complex small-world regulatory networks (where the transcriptional activity of each genomic region subtly affects almost all others). These effects require factors to be present at sub-saturating levels; increasing levels dramatically simplifies networks as more transcription units are pressed into use. Consequently, results from GWAS can be reconciled with those involving overexpression. We apply this pan-genomic model to predict patterns of transcriptional activity in whole human chromosomes, and, as an example, the effects of the deletion causing the diGeorge syndrome.

Deciphering multi-way interactions in the human genome

Article Open access 20 September 2022

Gabrielle A. Dotson, Can Chen, … Indika Rajapakse

Nonlinear control of transcription through enhancer–promoter interactions

Article Open access 13 April 2022

Jessica Zuin, Gregory Roth, … Luca Giorgetti

Defining genome architecture at base-pair resolution

Article 09 June 2021

Peng Hua, Mohsin Badat, … James O. J. Davies

Introduction

Transcription—the copying of DNA into RNA—is tightly regulated. Early insights into regulatory mechanisms came from work on binary on/off genetic switches controlled by one or just a few transcription factors such as the lambda and lac repressor in Escherichia coli¹. Similar regulatory mechanisms are present in eukaryotes, albeit with additional complexity. For instance, a fibroblast cell can be reprogrammed into a muscle cell by a single master regulator (MYOD)^2,3 or into pluripotent stem cells by four Yamanaka factors (Oct4, Sox2, c-Myc, Klf4)⁴.

Genome-wide association studies (GWAS) lead to quite a different view: gene regulation is widely distributed and involves interactions between hundreds (perhaps thousands) of loci scattered around the genome^5,6. GWAS allow quantitative trait loci (QTLs) affecting any measurable genetic trait to be ranked in an unbiased way. With complex traits like human height, and diseases such as schizophrenia and type II diabetes, the top ten QTLs in the rank order combine to yield only modest effects, while the top one-hundred still account for less than half of the total genetic effect. Hundred more QTLs are expected to be identified as sample sizes and data resolution improve^5,6,7. Expression QTLs (eQTLs) are QTLs affecting transcription of other DNA regions. Perhaps surprisingly, these are rarely found in genes encoding transcription factors or other proteins; instead, they usually involve single-nucleotide changes in non-coding elements that bind transcription factors such as active enhancers and promoters^8,9,10.

Results from GWAS lead to the view that most gene-regulatory networks are incredibly complex, with the activity of a given gene being affected by a panoply of eQTLs, each having a tiny effect. This is captured by the “omnigenic” model, which is based on a set of gene-interaction equations^5,6 such that the activity of almost any gene affects that of almost every other one. This model provides a useful and appealing framework to view GWAS results. However, it is difficult to compare its outputs with experimental data because it contains many parameters that are currently unknown and require fitting to training datasets.

In general, existing models for gene regulation traditionally assume post-transcriptional and biochemically mediated interactions between different genes^11,12, and disregard the role of three-dimensional (3D) chromatin structure. Here we propose an alternative but complementary framework that links transcriptional regulation directly to 3D genome structure, deliberately neglecting downstream biochemical regulation to enable unambiguous interpretation of our results. This framework is motivated by experiments showing that chromatin folding can lead to contacts between enhancers and promoters affecting transcription, and that 3D structure changes in disease^13,14. Additionally, because our modelling is essentially fitting-free, its output can be directly compared to experiments. When the agreement is good, our model is validated; when poor, it points to some missing ingredient (such as biochemical feedback) that could be included in future models.

We use stochastic computer simulations of a polymer model for chromosome organization, in which a chain of beads represents a chromatin fibre, and a set of spheres complexes of transcription factors and RNA polymerases—which we will call “TFs” for short. Some chromatin beads are identified as transcription units (TUs), and we call them TU beads. They contain binding sites for TFs, and can be sites of transcriptional initiation (we do not discriminate between genic and non-genic promoters). As a simple starting point we only consider one type of TF that binds specifically and multivalently to TU beads, and non-specifically (i.e., with weak affinity) to every other bead. We perform 3D Brownian dynamics simulations that evolve the diffusive dynamics of the chain and associated factors. We previously showed that similar polymer models yield structures resembling those seen using chromosome–conformation–capture (3C)^{15,16,17,18,19} and microscopy²⁰. Here, we link 3D structure to expression and transcriptional dynamics by measuring how often a TU bead is transcribed—which we do by computing the fraction of time it binds a TF. To establish the methodology, we model a 3 Mbp chromatin fragment, before going on to simulate whole human chromosomes.

Our simulations capture many features of eukaryotic regulation. For example, transcription is stochastic and bursty (in agreement with single-cell transcriptomics data), and the predicted pattern of transcriptional activity in human chromosomes correlates significantly with that observed experimentally. We also find that small-world (percolating) networks that encapsulate much of the rich complexity observed in GWAS emerge through spatial effects alone. In other words, the activity of most (probably all) TUs in our model is affected by the activity of most (probably all) other segments in the genome. We find such pan-genomic regulation critically requires non-saturating concentrations of TFs—as normally found in vivo—and that increasing concentrations dramatically simplifies the networks. This enables us to reconcile the GWAS-based view that regulatory networks are complicated with the observation that overexpressing one or a few TFs can decisively alter cell state.

Results

We first consider a simple system where a 3 Mbp chromatin fragment is represented by a chain of 1000 beads (each 30 nm in diameter, and corresponding to 3 kbp). We select at random N = 39 beads and identify them as TUs (Fig. 1a; see “Methods” and Supplementary Note 1 for more details). The linear density of TUs in the fragment is similar to that in human chromosome 22. Additionally, n spheres (also 30 nm in diameter) represent TFs (recall these are complexes of transcription factors and RNA polymerase II). TFs bind reversibly to TUs via a strong attractive interaction, and to all other beads weakly and non-specifically. An important feature is that TFs switch between active (binding) and inactive (non-binding) state at rate α. Many factors switch like this in vivo (e.g., due to phosphorylation and de-phosphorylation), and switching is required to account for the rapid exchange of factors and polymerases between bound and free states seen in live-cell photobleaching experiments²¹. As ~7 out of 8 polymerases attempting to initiate at promoters dissociate with a half-life of ~2.4 s²², our complexes generally behave like those in vivo.

**Fig. 1: Patterns of transcriptional activity.**

While our results refer to a single patterning of TUs along the fibre, they are representative of any arbitrary random positioning of TUs: in other words the qualitative trends we present below are robust and do not depend on the particular choice of the 1D pattern of TUs along the fibre in any way.

We say a TU bead is transcribed whenever a TF lies close to it (see “Methods”), and the transcriptional activity of a TU is then the fraction of time it is transcribed during a simulation. To reflect the situation in mammalian cells (Supplementary Note 3 and ref. ²³), we typically assume there are fewer TFs than TU beads (i.e., n = 10 TFs in the active binding state at any time, compared to 39 TUs).

By interrogating TF-chromatin interactions at regular time intervals over hundreds of simulations, we build up a population picture of transcription. A typical configuration of the 3 Mbp fragment is shown in Fig. 1B. Strikingly, bound TFs spontaneously cluster, despite there being no attractive interactions between TUs or between TFs. Such clustering is driven by the “bridging-induced attraction”^16,24,25 that arises due to a positive feedback: when a TF forms a molecular bridge between two chromatin regions and forms a loop, the local chromatin concentration increases, making further TF binding more likely. Clusters then grow until limited by entropic costs of crowding (Fig. S1A). Most of the non-trivial phenomena described below result from such clustering. Clustering requires TF multivalency, as monovalent factors do not cluster²⁴. However, the assumption of multivalency, which is common in the polymer physics literature¹⁵, is well-founded. Several TFs are known to be bivalent or multivalent²⁶, and, more importantly, our spheres represent complexes of TFs and polymerases, so they will behave as multivalent binders even when the individual TFs in the complex are monovalent. Although clustering does not require any interactions between TFs, adding a weak attraction between them, as might arise for instance due to macromolecular crowding or electrostatic interactions between intrinsically disordered regions, should not qualitatively change any of the results discussed here (at least as long as TFs still microphase separate into clusters rather than undergoing macroscopic phase separation).

The clusters we observe, and which emerge through the bridging-induced attraction, are qualitatively similar to those seen in vivo, which are variously described as transcriptional compartments, hubs, super-enhancer (SE) clusters, phase-separated droplets/condensates, and factories^{7,10,27,28,29}. They are also similar to the contact domains seen in microC³⁰, which are formed by accessible DNA sites clustering together in 3D space. Clustering arising through the bridging-induced attraction has recently been found in vitro for systems of DNA and cohesin (which binds multivalently to DNA)³¹.

Transcriptional activity varies along the chromatin fibre and is highly stochastic

As TFs have the same affinity for all TUs, one might expect each TU to be bound with equal likelihood; however, transcriptional activity (the fraction of time a TU is transcribed) varies from ~10–90% (Fig. 1C). What causes this variation? As TF copy number is limiting, and as bound TFs cluster, most transcription occurs in clusters—as is the case in vivo^7,32,33,34. Since TUs are positioned irregularly along the fragment, some have closer neighbours in 1D sequence space than others, and these are inevitably the ones most likely to cluster and be transcribed. Instead, those far from their neighbours are less likely to cluster and are less active. Accordingly, the transcriptional activity of a TU anticorrelates with distance to the nearest TU along the fibre (Fig. S1B; the Spearman correlation is r ≃ −0.94, p value p < 10⁻¹²).

While Fig. 1C pertains to population averages of 1000 simulations, it is informative to consider each simulation independently (as in single-cell transcriptomics). Such analysis shows that transcriptional activity is stochastic, varying substantially from simulation to simulation: a TU active in some simulations may be silent in others (Fig. 1D).

Transcriptional bursting

During a simulation, chromatin conformation can change dramatically (Fig. 2A). Such changes often yield transcriptional “bursts”—periods of continued activity followed by silent periods (Fig. 2B)—as TUs with intermediate levels of activity repeatedly join a cluster to give a burst and then dissociate. Notably, TUs lying close to each other in sequence space often start and stop bursts coordinately due to the intrinsic positive feedback in the system (Fig. S1A).

These results are consistent with experimental observations: single cell Hi-C³⁵ and transcriptomics³⁶ show that the structure and function of each individual cell is unique, and bursting is well documented^37,38,39,40 with nearby promoters often firing together³⁸.

Local chromatin architecture creates small-world percolating transcription networks

To investigate correlations between transcriptional activities of different TUs, we compute the Pearson correlation matrix between the activities of all possible TU pairs, and identify an emergent regulatory network in which TUs form nodes (Figs. 3A and S2). Specifically, we draw an edge between two TUs whenever there is a statistically significant positive or negative correlation between their transcriptional dynamics (Fig. 3A). This network arises only due to spatial interactions, as we assume no underlying biochemical regulation.

**Fig. 3: Regulatory networks formed by TU beads are percolating at low TF concentrations.**

The network shows a striking property. With n = 10 active TFs, most nodes are connected (Fig. 3Aii), and the fraction of TUs participating in the largest connected component is close to 1 (Fig. 3B). Such a network is said to be “percolating”, which means that any two nodes are connected by a path along edges. Our percolating networks are also “small-world”, which means that most nodes can be reached from every other node by a small number of steps⁴¹—we provide quantitative measurements of the small world-ness of our networks in the SI (Supplementary Note 4). The small-world phenomenology is consistent with the multitude of small-effect eQTLs detected by GWAS^5,6. Notably, the regulation we observe acts at the transcriptional level, and not post-transcriptionally as envisaged by the omnigenic model^5,6.

How might our simple model give rise to complex regulatory networks? By analysing simulation trajectories, we noted that TUs lying near each other in 1D sequence space often joined the same cluster in 3D. As a result, the activity of these clustered beads is highly positively correlated. At the same time, cluster formation sequesters TFs and so reduces the likelihood that another cluster forms elsewhere. As a result, most long-range correlations are negative (Fig. 3A).

Crucially, these network properties depend on there being a low TF copy-number (as in vivo²³) so TU beads do not become saturated. We therefore reasoned that increasing copy number should suppress correlations as more rarely transcribed TUs are pressed into use. Indeed, increasing n reduces long-range negative correlations (Fig. 3Aiii,iv), and the fraction of nodes in the largest-connected component falls (Fig. 3B). Another way to think about this result is: if resources are plentiful, there is no need for sharing or competition, and all TUs can bind a TF independently of each other. If TFs do not switch and are permanently in the binding state (and n = 10), the network becomes even more highly connected (Fig. 3Ai).

Modelling effect of mutations and SNPs in regulatory elements

GWAS reveals that single-nucleotide polymorphisms (SNPs) in regulatory elements and TUs can lead to many small changes in transcriptional activity across the genome. To model this, we abrogate TF binding to one TU in the chain. Bead 930 is chosen first because it is usually highly active (Fig. 1C). This single “knock-out” affects in a statistically significant way the activity of almost half of the other TUs, both near and far away in sequence space (Fig. 4Aii). The immediately adjacent TU (i.e., bead 931) is down-regulated the most, while more distant ones are up-regulated (due to loss of a strong competitor). This knock-out also rewires the whole network, even though it still retains its small-world character (Fig. 4Aiii). Both positive and negative interactions are affected along the whole chain, as shown by a heat map of the change in Pearson correlation between TU transcriptional activities (Fig. 4Aiv).

**Fig. 4: Modelling SNPs and eQTL action.**

We next systematically knock out each TU in turn. To quantify global effects, we define a “transcriptional difference” between the wild type and each knock-out based on a standard Euclidian-distance metric (SI, Supplementary Note 2); the larger this quantity, the more different the two states are. This difference varies >10-fold between different mutations (Fig. 4Bi).

Together, these observations are reminiscent of the behaviour of SNPs and eQTLs. Thus, each TU mutant can be seen as a SNP underlying an eQTL; then, those with low and high transcriptional differences (Fig. 4Bi,ii) are low- and high-effect eQTLs (low-effect mutants are often isolated in sequence space), and those with wide effects (e.g., bead 930 in Fig. 4A) may be viewed as omnigenic.

Modelling loops, heterochromatin and euchromatin

In mammalian genomes, promoter-enhancer pairs are often contained in loops stabilized by cohesin and the CCCTC-binding factor (CTCF)^42,43,44. To investigate how such loops might affect transcription, we incorporated eight permanent and non-overlapping loops at different positions in the chain (Fig. 5A, loops a–h). In reality, such loops may arise from extrusion by cohesin halted at convergent CTCF loops⁴². Our assumption of stable, permanent loops is quantitatively accurate in the limit in which the interaction between cohesin and CTCF is strong and long-lived. However, we expect the trends to be qualitatively similar for more transient loops consistent with the loop extrusion model as in refs. ^19,43.

**Fig. 5: Looping subtly affects transcriptional activity.**

The inclusion of stable loops has subtle effects. For example, loop h encompasses three TUs (beads 905, 907, 930), and expression of one is slightly boosted compared to the unlooped case (Fig. 5B, C). This is consistent with the idea that looping switches on some genes during development⁴⁵, and can increase enhancer–promoter interactions^46,47. However, up-regulation requires appropriate positioning of a TU within the loop. For instance, loop d encompasses two TUs (beads 396 and 404), and has no effect on their activity. Broadly speaking, looping up-regulates activity, but not invariably so, and—perhaps surprisingly—two of the three most up-regulated TUs (beads 33 and 886) are not contained in loops (Fig. 5C). Looping also extensively rewires the regulatory network (Fig. 5D, E). Globally, the increase in activity is modest, as incorporating all beads into closely packed loops only increases total activity by ~10%, with—once again—some TUs being down- as well as up-regulated (Fig. S3). This is consistent with experiments showing that the interplay between looping and expression is complex⁴⁸ but slight (e.g., knocking down human cohesin leaves expression of 87% genes unaffected, with global levels changing <30%⁴⁹).

In simulations thus far, TFs bind strongly to TU beads, and weakly to all others to model binding to open euchromatin^19,50. To investigate the effects of heterochromatin—which binds few TFs, carries few histone marks⁵¹, and is gene poor and traditionally viewed as transcriptionally inert—we perform simulations where four of the most-active TUs (905, 907, 930, and 931) are embedded in a non-binding segment (running from bead 901–940). This has a dramatic effect (Fig. 6A–C): the activity of the TU beads now embedded in the non-binding island are at least halved, some nearby neighbors are down-regulated, and more distant ones up-regulated (again due to a reduction in competition; Fig. 6B, C). The regulatory network is also rewired (Fig. 5D, E).

**Fig. 6: Neighboring heterochromatin affects transcriptional activity.**

Just as embedment in a non-binding segment down-regulates a TU bead, embedment in a weak-binding (euchromatic) one up-regulates it (Fig. S4). This shows our model effectively captures position effects where the local chromatin context strongly influences activity⁵².

Modelling a whole human chromosome

We next model a whole mid-sized human chromosome (HSA 14, length 107 Mbp; Fig. 7A) in a well-characterized and differentiated diploid cell (HUVEC, human umbilical vein endothelial cell). Now, multivalent and switchable TFs (20% active at any moment) at a non-saturating concentration bind to a string with 35784 beads. As chromosome territories are often ellipsoidal, simulations are performed in an ellipsoid of appropriate size^7,53; consequently, chromatin density is now higher than in simulations detailed above, with volume fractions comparable to those in vivo (~14%).

**Fig. 7: Comparison of transcriptional activities of TUs on HSA14 in HUVECs determined using simulations and GRO-seq.**

Chromatin beads are classified using DNase-hypersensitity data and ChIP-seq data for H3K27ac. DNase-hypersensitive sites (DHS) are excellent markers to locate promoters and enhancers (and so TF-binding sites^19,54), whereas H3K27ac modifications strongly correlate with open chromatin¹⁹. Therefore, if the 3 kbp region corresponding to a chromatin bead has a DHS, then that bead is a TU; if it has H3K27ac, it is a euchromatin bead, and all other beads are non-binding (heterochromatic). We call this the “DHS” model. As properties of different chromatin segments have been catalogued using “hidden-Markov models” (HMMs) applied to many data sets⁵¹, we alternatively classify beads according to HMM state; we call this the “HMM model” (Fig. S5). For more details, see Supplementary Note 3.

Simulations using the DHS model again yield clusters enriched in TUs and TFs (Fig. 7B). As before, aggregating data from many simulations allow determination of transcriptional activities of every bead, which we compare with those of corresponding regions determined experimentally⁵⁵ by GRO-seq (global run-on sequencing⁵⁶); activities of all 3 kbp regions are ranked from high to low, binned into quintiles, and compared. In Fig. 7C, squares near the diagonal from bottom-left to top-right have high ranks (shown as red and yellow) compared to those off-diagonal (blue and purple) indicating good concordance between simulations and data. A specific sub-set of beads corresponding to SEs—which are highly active in vivo⁵⁷—are also highly active in simulations (shown as white dots concentrated at top right). Plots showing the rank of transcriptional activities in simulations and experiments in selected genomic regions are shown in Fig. S6. Simulations yield patterns qualitatively closer to those obtained with GRO-seq than those given by poly(A)⁺ RNA-seq, as the latter only include genic transcription. Concordance between results from simulations and GRO-seq is confirmed by the Spearman rank correlation (~0.38 for all beads; p < 10⁻¹²; this measure is used because it is less sensitive to outliers; Fig. 7D). Restricting analysis just to TUs provides a more stringent comparison (as all TUs bind TFs with equal affinity); it still yields a significant correlation (r ≃ 0.32, p < 10⁻¹²; Fig. 7D). As neighbouring high-affinity regions tend to have roughly similar transcriptional rates in both simulations and data, we also average rates found in active “patches” (contiguous sets of beads which are either all TUs or all labelled as euchromatin), but found this has no significant effect (Fig. 7D). Concordance was confirmed using our HMM model (Fig. 7D, right, and Fig. S5). Adding cohesin-mediated looping to simulations involving the DHS model did not significantly change agreement with experimental data (e.g., for TUs only, r ≃ 0.33, p < 10⁻¹²). Similar agreement with GRO-seq data was obtained from simulations applied to the H1 human embryonic stem-cell line (for TUs using the DHS model, r ≃ 0.29, p < 10⁻¹²), and to the GM12878 cell line (DHS model, r ≃ 0.33, p < 10⁻¹²).

As in the chromosome fragment simulations (Fig. S1B), the transcriptional activity of a TU in our model anticorrelates with the distance to the nearest TU. In our HSA14 simulations, the presence of heterochromatin slightly reduces the absolute value of the correlation, which however remains highly significant (Spearman correlation r ≃ −0.83, p < 10⁻¹²). Interestingly, the experimental GRO-seq signal of a DHS also anticorrelates with the distance to the nearest DHS in a significant way, although more weakly than in simulations (Fig. S7; over the whole genome the Spearman correlation is r ~ −0.23, p < 10⁻¹²).

Networks inferred from simulations are qualitatively similar to experimental ones

Regulatory networks emerging from our whole-chromosome simulations are again small-world and highly connected (Fig. S8 and Supplementary Note 4). To facilitate comparison with previous results, we select four segments of HSA14 that have the same length as the one considered in Fig. 3 (i.e., 3 Mbp), and roughly the same density of TUs; all four segments again have highly connected components (compare Fig. S8 and Fig. 3). However, patterns in real chromosomes and artificial fragments are quite different. In HSA14 networks, there are more positive interactions between sets of adjacent TUs and other sets that are >10 beads distant in sequence space (black lines across the middle of circles in Fig. S8).

Whole-chromosome networks also have the following statistical properties. First, their node-degree distribution decays exponentially (Fig. S9A)—as found in gene networks⁵⁸ but not in transcription factor interaction networks, which are often scale-free⁵⁹. Second, they are modular (as clusters arising due to the bridging-induced attraction are the basic co-regulated building blocks)—again as found in gene⁵⁸ and eQTL⁶⁰ networks. [Modularity is apparent from the blocks visible in the correlation matrices, such as in Fig. S2.] Third, node degree broadly correlates with transcriptional activity (Spearman correlation 0.59, p value < 10⁻¹²)—as in gene coregulation networks⁵⁸.

Contact maps found by simulations are qualitatively similar to Hi-C

We previously showed¹⁶ that simulations involving two different TFs (binding to active and inactive regions, respectively) yield contact maps much like those found with Hi-C⁴². Therefore, we expected the present simulations to reflect Hi-C data poorly as they involve only one TF binding to the minor (i.e., active) fraction of the genome, so contacts made by this structured minority would be obscured by those due to the unstructured majority. Even so, simulations yield contact maps broadly similar to those obtained by Hi-C (Fig. 7E). To measure the agreement, we use a comparison based on contact maps restricted to TUs as anchors—which may be considered as equivalent to interactions obtained by promoter-capture HiC⁶¹. These yield good concordance (Fig. 7E; Pearson coefficient r = 0.82; r = 0.47 when monitoring only long-range contacts between TUs at least 300 kbp away, p < 10⁻⁶ in both cases). The exponent with which contact probability decays with 1D distance is ~−1.1 in experiments, and ~−0.8 in simulations (fitted for 1D distances between ~30 kbp and 1.5 Mbp), both broadly consistent with the −1 value expected for a fractal globule⁶². The small discrepancy may point to our simulations slightly overestimating the weight of long-range contacts, perhaps because we do not include loop extrusion.

Overall the results obtained in our HSA14 simulations show that a simple model based on 3D chromatin organisation captures much of the complexity in 3D structure and transcription of a whole human chromosome.

Modelling chromosome 22 carrying the diGeorge deletion

Our approach can, in principle, be applied to study any chromosome providing appropriate genomic data are available (e.g., on DNase hypersensitivity and histone acetylation). As a proof of principle, we studied the effect of deleting ~2.55 Mbp from HSA22—an alteration which is associated with the diGeorge syndrome (Fig. 8A) (https://dosage.clinicalgenome.org/clingen_region.cgi?id=ISCA-37446). This syndrome affects ~1 in 4000 people, and the variable symptoms include congenital heart problems, frequent infections, developmental delays, and learning problems.

**Fig. 8: Modelling effects of the DiGeorge deletion in HSA22.**

We predict a multitude of small effects in TU activity, both near and far away from the deletion (see the Manhattan plot in Fig. 8Bi). In particular, most TUs are slightly up-regulated, as fewer TUs compete for the same number of factors, and the TUs which change the most have intermediate transcriptional activities in the wild type (Fig. S10). The p values associated with the change in transcriptional activities vary widely, and comparison of the observed distribution with the null hypothesis (indicating that changes in measured transcription are due to random variation) shows the observed is highly enriched in small p values (Fig. 8Bii), as is generally the case with results from GWAS^5,6. The regulatory network is also re-wired (Fig. 8C). Results are consistent with measurements of differential gene expressions in patients, which showed both a large number of up-regulated and down-regulated genes⁶³. A more quantitative comparison between experiments and simulations would benefit from having GRO-seq data that include non-genic transcription.

Clearly, this approach opens up a rich field of study. For instance, while there may be processes which occur in vivo which are not represented in our model, it could still give an indication of the genes most likely to be affected by any chromosome rearrangement.

Discussion

We have described a parsimonious 3D stochastic model for transcriptional dynamics based on multivalent binding of factors and polymerases (TFs) to genic and non-genic transcriptional units (TUs) in a chain representing a chromatin fibre. A distinctive feature of our framework is that it is fitting-free, which means the model is truly predictive and can provide a mechanistic understanding of the phenomena we observe. On the other hand, the absence of fitting renders it challenging to obtain a fully quantitative agreemeent between modelling and experiment.

In our simulations two types of fibres were considered: a 3 Mbp fragment with randomly-positioned TUs, which is useful to exemplify emerging trends, and human chromosomes 14 and 22 where TUs were appropriately positioned according to bioinformatic data. Despite deliberately excluding any explicit underlying network of biochemical regulation, our model nevertheless yields some notable results. These depend on having a low TF copy-number—a feature compatible with observations in vivo²³. First, since TFs bind with the same affinity to all TUs, one might expect the latter to all be transcribed similarly, but they are not (Fig. 1). This is largely due to inter-TU spacing; TUs lying close together in 1D sequence space tend to be the most active (Fig. 1C) with positively correlated dynamics reminiscent of transcriptional bursting (Fig. 2B). This is because they often cluster into structures which are analogous to the phase-separated transcription hubs/factories seen experimentally^7,10, or to contact domains formed by accessible DNA sites found by high-resolution mapping of chromatin interactions by microC³⁰. Second, switching off binding at any TU significantly affects the activity of many others, both near and far away in sequence space (Fig. 4). Third, introducing stable loops has subtle effects (Fig. 5), consistent with the result that cohesin knock-outs and degrons lead to small global changes in expression⁴⁹, although they can be important for inducible gene response in selected cases⁴⁶. Fourth, transcriptional activity of a TU is strongly affected by the local environment in ways that are reminiscent of the silencing of a gene by incorporation into heterochromatin⁵² (Fig. 6), or activation by embedment in euchromatin (Fig. S4). Fifth, the stochasticity seen in individual simulations reflects that detected by single-cell transcriptomics and single-cell Hi-C. Nevertheless, this variability does not prevent emergence of robust phenotypes in a cell population. Sixth, our simple fitting-free model predicts patterns of transcriptional activity in human chromosomes that promisingly and significantly correlate with experimental GRO-seq data (Fig. 7). This suggests that chromatin structure significantly constrains transcriptional activity. We hypothesise that additional downstream biochemical regulation, not included in our model, may provide a tool to adjust this underlying “structural” pattern of activity in a way which may be required for appropriate biological function.

Finally, our results enable us to reconcile two conflicting sets of data, namely that regulatory networks are both complex (as GWAS shows that thousands of loci around the genome control complex phenotypes^5,6) and simple (as over-expressing just four Yamanaka factors switches cell fate⁴). Thus, our simulations reveal complex small-world networks of mutual up- and down-regulation (Figs. 3 and S8), consistent with GWAS results. However, increasing TF copy-number dramatically simplifies network structure (Fig. 3). We suggest such a simplification occurs when a fibroblast is reprogrammed into a pluripotent stem cell by over-expressing the Yamanaka factors; the high factor concentration simplifies the network so that the factors can combine to switch the phenotype (Fig. S11).

Taken together, these results suggest the activity—or inactivity—of every genomic region affects that of every other region to some extent. We describe our framework as “pan-genomic” (Fig. S11). This is reminiscent of the omnigenic model^5,6 in the sense that many loci are involved, all having small effects. However, it differs as it provides an underlying mechanism for pangenomic effects, by positing a direct and immediate effect of structure on regulation at the transcriptional level, which contrasts with the non-trivial post-transcriptional pathways envisioned by the omnigenic model. Additionally, our pangenomic model yields a natural framework to qualitatively understand mutually exclusive gene expression, when switching on one gene in a family turns off all others (as in developing olfactory neurons⁶⁴). The current model to explain this phenomenon postulates a coupling between cis-acting up-regulation and trans-acting down-regulation. The pangenomic networks we find provide exactly this type of regulatory interactions (Fig. 3). Our results are also consistent with recent experiments and mathematical models showing that subtle changes in 3D structure can lead to large changes in transcription^65,66. On the other hand, it is challenging within our current model to account for local negative feedback mechanisms leading to noise reduction or oscillations¹¹, as these are more likely to arise biochemically (an example is the p53–Mdm2 system which achieves stabilisation of the cellular concentration of p53 via a negative feedback loop⁶⁷).

In conclusion, we have developed a framework that can be applied to predict the transcriptional activity of any genomic fragment in health or disease (Figs. 7 and 8) providing appropriate experimental data are available. Predictive power can be enhanced by incorporating additional TFs, and more suitable datasets of histone marks. Other features that can improve correlations between experiments and simulations are a more accurate modelling of cohesin loop formation by loop extrusion, and of the heteromorphic nature of chromatin¹⁹. We hope to report on work incorporating the latter two features in the future.

Methods

Polymer modelling

We model chromatin fibres and chromosomes as bead-and-spring polymers. A fibre has M monomers, each of size σ (corresponding to 3 kbp, or 30 nm²⁴), and r_i denotes the position of the ith monomer in 3D space. Multivalent transcription factors (either active or inactive) are modelled as spheres, again with size σ for simplicity. There are n multivalent factors in a simulation (where n is varied systematically, see text and “Results” section for details), and N high-affinity binding sites, which we refer to as TU (or TU beads).

Any two monomers (i and j) in the chromatin fibre interact purely repulsively, via a Weeks–Chandler–Anderson potential, given by

$${U}_{\,{{\mbox{WCA}}}\,}^{ij}=4{k}_{\mathrm B}T\left[{\left(\frac{\sigma }{{r}_{ij}}\right)}^{12}-{\left(\frac{\sigma }{{r}_{ij}}\right)}^{6}+\frac{1}{4}\right]$$

(1)

if r_ij < 2^1/6σ and 0 otherwise, where r_ij is the separation of beads i and j. There is also a finite extensible non-linear elastic (FENE) spring acting between consecutive beads in the chain to enforce chain connectivity. This is given by

$${U}_{\,{{\mbox{FENE}}}\,}^{ij}=-\frac{{K}_{\mathrm f}{R}_{0}^{2}}{2}{{{{{{\mathrm{ln}}}}}}}\,\left[1-{\left(\frac{{r}_{ij}}{{R}_{0}}\right)}^{2}\right]$$

(2)

where i and j are neighbouring beads, R₀ = 1.6σ is the maximum separation between the beads, and K_f = 30k_BT/σ² is the spring constant. With simulations including permanent cohesin loops (Fig. 7 in the main text, and Supplementary Fig. S4), neighbouring monomers and monomers forming loops interact via harmonic, rather than FENE springs,

$${U}_{\,{{\mbox{harmonic}}}\,}^{ij}={K}_{\mathrm h}{\left({r}_{ij}-\bar{R}\right)}^{2}$$

(3)

where i and j are neighbouring beads, K_h = 100k_BT/σ² is the harmonic spring constant, and $\bar{R}$ is the equilibrium spring distance. For these simulations, we use $\bar{R}=1.1\sigma$ for bonds joining neighbouring monomers along the chain, and $\bar{R}=1.8\sigma$ for bonds joining loop-forming monomers. The harmonic potential is used instead of the FENE one to enhance numerical stability.

Finally, a triplet of neighbouring beads interact via a Kartky–Porod term to model the stiffness of the chromatin fibre. This term explicitly reads as follows:

$${U}_{\,{{\mbox{KP}}}\,}^{ij}=\frac{{k}_{\mathrm B}T{\ell }_{\mathrm p}}{\sigma }\left[1-\frac{{\overrightarrow{t}}_{i}\cdot {\overrightarrow{t}}_{j}}{| {\overrightarrow{t}}_{i}| | {\overrightarrow{t}}_{j}| }\right]$$

(4)

where i and j are neighbouring beads, ${\overrightarrow{t}}_{i}$ is the tangent vector connecting beads i to i + 1, and ℓ_p is related to the persistent length of the chain: this parameter is set to 3σ in our simulation, which corresponds to a relatively flexible fibre—the resulting persistence length is within the range of values estimated for chromatin from experiments and computer simulations⁶⁸.

The interaction between a chromatin bead, a, and a multivalent TF, b, is modeled through a truncated and shifted Lennard–Jones potential, given by

$${U}_{{{{{{{{\rm{LJ}}}}}}}}}^{ab}=4{\epsilon }_{ab}\left[{\left(\frac{\sigma }{{d}_{ab}}\right)}^{12}-{\left(\frac{\sigma }{{d}_{ab}}\right)}^{6}-{\left(\frac{\sigma }{{r}_{c}}\right)}^{12}+{\left(\frac{\sigma }{{r}_{c}}\right)}^{6}\right],$$

(5)

for d_ab (the distance between the centres of chromatin bead and protein) smaller than r_c, and 0 otherwise. The parameter r_c is the interaction cut-off; it is set to r_c = 2^1/6σ for inactive proteins or for active proteins and non-binding chromatin beads (this cutoff results in a Weeks–Chandler–Anderson potential and purely repulsive interactions), or to r_c = 1.8σ for an active protein and a binding chromatin bead (this results in an attractive interaction). In all cases, the potential is shifted to zero at the cut-off in order to have a smooth potential. Purely repulsive interactions are modeled by setting ϵ_ab = k_BT, while attractive interactions are modeled using ϵ_ab = 3k_BT for active TF and low-affinity beads, and to ϵ_ab = 8k_BT for active TF and high-affinity (TU) beads.

A TU bead (or more generally any chromatin bead in Fig. 8D in the main text) is said to be transcribed if it is bound to a factor—i.e., if there is at least a TF whose centre lies within a range r_c = 1.8σ away from the bead centre.

The time evolution of each bead in the simulation (whether TF or chromatin bead) is governed by the following Langevin equation:

$${m}_{i}\frac{{ d}^{2}{\overrightarrow{r}}_{i}}{ d{t}^{2}}=-\nabla {U}_{i}-{\gamma }_{i}\frac{ d{\overrightarrow{r}}_{i}}{d}t+\sqrt{2{k}_{B}T{\gamma }_{i}}{\overrightarrow{\eta }}_{i}(t),$$

(6)

where U_i is the total potential experienced by bead i, m_i ≡ m and γ_i ≡ γ are its mass and friction coefficient (equal for all beads in our simulations), and ${\overrightarrow{\eta }}_{i}$ is a stochastic noise vector with the following mean and variance:

$$\langle \overrightarrow{\eta }(t)\rangle =0;\quad \langle {\eta }_{i,\alpha }(t){\eta }_{j,\beta }(t^{\prime} )\rangle ={\delta }_{ij}{\delta }_{\alpha \beta }\delta (t-t^{\prime} ),$$

(7)

where the Latin and Greek indices run over particles and Cartesian components, respectively, and δ denotes here the Kronecker delta.

As is customary⁶⁹, we set m/ξ = τ_LJ = τ_B, with the LJ time ${\tau }_{{{{{{{{\rm{LJ}}}}}}}}}=\sigma \sqrt{m/\epsilon }$ and the Brownian time τ_B = σ²/D, where ϵ is the simulation energy unit, equal to k_BT, and D = k_BT/γ is the diffusion coefficient of a bead of size σ. From the Stokes friction coefficient for spherical beads of diameter σ we have that ξ = 3πη_solσ where η_sol is the solution viscosity. One can map this to physical units by setting T = 300 K and σ = 30 nm, as above, and by setting the viscosity to the effective viscosity of the nucleoplasm, which is scale-dependent and ranges between 10 and100 cP for objects of the size of our chromatin bead⁷⁰. This leads to τ_LJ = τ_B = 3πη_solσ³/ϵ ≃ 0.6–6 ms. The Brownian time τ_B is our unit of time in simulations. The numerical integration of Eq. (6) is performed using a standard velocity-Verlet algorithm with time step Δt = 0.01τ_B and is implemented in the LAMMPS engine⁷¹. Protein switching is including by stochastically changing the type of TF beads every 10,000 timesteps (equivalently, every 100 Brownian times), with probabilities such that the switching off rate is of α = 10⁻⁵${\tau }_{\mathrm B}^{-1}$, or 0.017–0.17 s⁻¹. In simulations of the toy model (Figs. 1–7 in the main text and Suppl. Figs. S1–S4), the switching on rate is equal to α; in chromosome 14/22 simulations (Fig. 8 in the main text and Suppl. Fig. S5), it is equal to α/4. Consequently, in steady state the average number of active and inactive proteins is equal in simulations of the toy model, whereas the average number of inactive proteins is fourfold larger than that of active proteins in chromosome 14/22 simulations.

For more details on simulations, see Supplementary Notes 1 and 3.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The datasets generated during and/or analysed during the current study have been deposited in Edinburgh DataShare [https://doi.org/10.7488/ds/3110]. To compare the predicted transcriptional activity of chromosome 14 outputted by our simulations with experiments, we use GRO-seq data. For HUVECs, we use the datasets GEO: GSM2486801, GSM2486802, GSM2486803. For hESCs, we use GEO: GSM1579367, GSM1579368. Super-enhancer regions considered here are those identified in ref. ⁵⁷, and available in the dbSUPER database [http://asntech.org/dbsuper/].

Code availability

The code used for the simulation is LAMMPS, which is publicly available at https://lammps.sandia.gov/. Custom codes written to analyse data are available from the corresponding author upon request, or they can be downloaded from https://git.ecdf.ed.ac.uk/dmarendu/omnigenomic-model (access can be requested from the corresponding author).

References

Alberts, B., Johnson, A., Lewis, J., Morgan, D. & Raff, M. Molecular Biology of the Cell (Taylor & Francis, 2014).
Davis, R. L., Weintraub, H. & Lassar, A. B. Expression of a single transfected cDNA converts fibroblasts to myoblasts. Cell 51, 987–1000 (1987).
Article CAS PubMed Google Scholar
Dall’Agnese, A. et al. Transcription factor-directed re-wiring of chromatin architecture for somatic cell nuclear reprogramming toward trans-differentiation. Mol. Cell 76, 453–472 (2019).
Article PubMed PubMed Central CAS Google Scholar
Takahashi, K. et al. Induction of pluripotent stem cells from adult human fibroblasts by defined factors. Cell 131, 861–872 (2007).
Article CAS PubMed Google Scholar
Boyle, E. A., Li, Y. I. & Pritchard, J. K. An expanded view of complex traits: from polygenic to omnigenic. Cell 169, 1177–1186 (2017).
Article CAS PubMed PubMed Central Google Scholar
Liu, X., Li, Y. I. & Pritchard, J. K. Trans effects on gene expression can drive omnigenic inheritance. Cell 177, 1022–1034 (2019).
Article CAS PubMed PubMed Central Google Scholar
Cook, P. R. & Marenduzzo, D. Transcription-driven genome organization: a model for chromosome structure and the regulation of gene expression tested through simulations. Nucleic Acids Res. 46, 9895–9906 (2018).
Article CAS PubMed PubMed Central Google Scholar
Andersson, R. et al. An atlas of active enhancers across human cell types and tissues. Nature 507, 455–461 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Javierre, B. M. et al. Lineage-specific genome architecture links enhancers and non-coding disease variants to target gene promoters. Cell 167, 1369–1384 (2016).
Article CAS PubMed PubMed Central Google Scholar
Cramer, P. Organization and regulation of gene transcription. Nature 573, 45–54 (2019).
Article ADS CAS PubMed Google Scholar
Sneppen, K., Krishna, S. & Semsey, S. Simplified models of biological networks. Annu. Rev. Biophys. 39, 43–59 (2010).
Article CAS PubMed Google Scholar
Smolen, P., Baxter, D. A. & Byrne, J. H. Modeling transcriptional control in gene networks—methods, recent results, and future directions. Bull. Math. Biol. 62, 247–292 (2000).
Article CAS PubMed MATH Google Scholar
Pombo, A. & Dillon, N. Three-dimensional genome architecture: players and mechanisms. Nat. Rev. Mol. Cell Biol. 16, 245–257 (2015).
Article CAS PubMed Google Scholar
Spielmann, M., Lupiáñez, D. G. & Mundlos, S. Structural variation in the 3D genome. Nat. Rev. Genet. 19, 453–467 (2018).
Article CAS PubMed Google Scholar
Barbieri, M. et al. Complexity of chromatin folding is captured by the strings and binders switch model. Proc. Natl Acad. Sci. USA 109, 16173–16178 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Brackley, C. A., Johnson, J., Kelly, S., Cook, P. R. & Marenduzzo, D. Simulated binding of transcription factors to active and inactive regions folds human chromosomes into loops, rosettes and topological domains. Nucleic Acids Res. 44, 3503–3512 (2016).
Article CAS PubMed PubMed Central Google Scholar
Gilbert, N. & Marenduzzo, D. Genome organization: experiments and modeling. Chromosome Res. 25, 1 (2017).
Article CAS PubMed PubMed Central Google Scholar
Pereira, M. C. F. et al. Complementary chromosome folding by transcription factors and cohesin. Preprint at bioRxiv https://doi.org/10.1101/305359 (2018).
Buckle, A., Brackley, C. A., Boyle, S., Marenduzzo, D. & Gilbert, N. Polymer simulations of heteromorphic chromatin predict the 3D folding of complex genomic loci. Mol. Cell 72, 786–797 (2018).
Article CAS PubMed PubMed Central Google Scholar
Finn, E. H. et al. Extensive heterogeneity and intrinsic variation in spatial genome organization. Cell 176, 1502–1515 (2019).
Article CAS PubMed PubMed Central Google Scholar
Brackley, C. A. et al. Ephemeral protein binding to DNA shapes stable nuclear bodies and chromatin domains. Biophys. J. 28, 1085–1093 (2017).
Article CAS Google Scholar
Steurer, B. et al. Live-cell analysis of endogenous GFP-RPB1 uncovers rapid turnover of initiating and promoter-paused RNA polymerase II. Proc. Natl Acad. Sci. USA 115, E4368–E4376 (2018).
Article CAS PubMed PubMed Central Google Scholar
Brewster, R. C. et al. The transcription factor titration effect dictates level of gene expression. Cell 156, 1312–1323 (2014).
Article CAS PubMed PubMed Central Google Scholar
Brackley, C. A., Taylor, S., Papantonis, A., Cook, P. R. & Marenduzzo, D. Nonspecific bridging-induced attraction drives clustering of DNA-binding proteins and genome organization. Proc. Natl Acad. Sci. USA 110, E3605–E3611 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Brackley, C. Polymer compaction and bridging-induced clustering of protein-inspired patchy particles. J. Phys. Condens. Matter 32, 314002 (2020).
Article CAS PubMed Google Scholar
Kilic, S., Bachmann, A. L., Bryan, L. C. & Fierz, B. Multivalency governs HP1α association dynamics with the silent chromatin state. Nat. Commun. 6, 7313 (2015).
Article ADS CAS PubMed Google Scholar
Cook, P. R. The organization of replication and transcription. Science 284, 1790–1795 (1999).
Article CAS PubMed Google Scholar
Papantonis, A. et al. TNFα signals through specialized factories where responsive coding and miRNA genes are transcribed. EMBO J. 31, 4404–4414 (2012).
Article CAS PubMed PubMed Central Google Scholar
Shrinivas, K. et al. Enhancer features that drive formation of transcriptional condensates. Mol. Cell 75, 549–561 (2019).
Article CAS PubMed PubMed Central Google Scholar
Hsieh, T.-H. S. et al. Resolving the 3D landscape of transcription-linked mammalian chromatin folding. Mol. Cell 78, 539–553 (2020).
Article CAS PubMed PubMed Central Google Scholar
Ryu, J.-K. et al. Bridging-induced phase separation induced by cohesin SMC protein complexes. Sci. Adv. 7, eabe5905 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Pombo, A. et al. Regional specialization in human nuclei: visualization of discrete sites of transcription by RNA polymerase III. EMBO J. 18, 2241–2253 (1999).
Article CAS PubMed PubMed Central Google Scholar
Faro-Trindade, I. & Cook, P. R. A conserved organization of transcription during embryonic stem cell differentiation and in cells with high C value. Mol. Biol. Cell 17, 2910–2920 (2006).
Article CAS PubMed PubMed Central Google Scholar
Beagrie, R. A. et al. Complex multi-enhancer contacts captured by genome architecture mapping. Nature 543, 519–524 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Nagano, T. et al. Single-cell Hi-C reveals cell-to-cell variability in chromosome structure. Nature 502, 59 (2013).
Article ADS CAS PubMed Google Scholar
Macaulay, I. C. & Voet, T. Single cell genomics: advances and future perspectives. PLoS Genet. 10, e1004126 (2014).
Article PubMed PubMed Central CAS Google Scholar
Muerdter, F. & Stark, A. Gene regulation: activation through space. Curr. Biol. 26, R895–R898 (2016).
Article CAS PubMed Google Scholar
Fukaya, T., Lim, B. & Levine, M. Enhancer control of transcriptional bursting. Cell 166, 358–368 (2016).
Article CAS PubMed PubMed Central Google Scholar
Bartman, C. R., Hsu, S. C., Hsiung, C. C.-S., Raj, A. & Blobel, G. A. Enhancer regulation of transcriptional bursting parameters revealed by forced chromatin looping. Mol. Cell 62, 237–247 (2016).
Article CAS PubMed PubMed Central Google Scholar
Suter, D. M. et al. Mammalian genes are transcribed with widely different bursting kinetics. Science 332, 472–474 (2011).
Article ADS CAS PubMed Google Scholar
Humphries, M. D. & Gurney, K. Network ‘small-world-ness’: a quantitative method for determining canonical network equivalence. PLoS ONE 3, e0002051 (2008).
Article ADS PubMed CAS Google Scholar
Rao, S. S. et al. A 3d map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665 – 1680 (2014).
Article PubMed Central CAS Google Scholar
Fudenberg, G. et al. Formation of chromosomal domains by loop extrusion. Cell Rep. 15, 2038–2049 (2016).
Article CAS PubMed PubMed Central Google Scholar
Brackley, C. A. et al. Non-equilibrium chromosome looping via molecular slip-links. Phys. Rev. Lett. 119, 138101 (2017).
Article ADS CAS PubMed Google Scholar
Oti, M., Falck, J., Huynen, M. A. & Zhou, H. Ctcf-mediated chromatin loops enclose inducible gene regulatory domains. BMC Genomics 17, 252 (2016).
Article PubMed PubMed Central CAS Google Scholar
Cuartero, S. et al. Control of inducible gene expression links cohesin to hematopoietic progenitor self-renewal and differentiation. Nat. Immunol. 19, 932–941 (2018).
Article CAS PubMed PubMed Central Google Scholar
Sasca, D. et al. Cohesin-dependent regulation of gene expression during differentiation is lost in cohesin-mutated myeloid malignancies. Blood 134, 2195–2208 (2019).
Article PubMed PubMed Central Google Scholar
Robson, M. I., Ringel, A. R. & Mundlos, S. Regulatory landscaping: how enhancer-promoter communication is sculpted in 3d. Mol. Cell 74, 1110–1122 (2019).
Article CAS PubMed Google Scholar
Rao, S. S. et al. Cohesin loss eliminates all loop domains. Cell 171, 305–320 (2017).
Article CAS PubMed PubMed Central Google Scholar
Gilbert, N. et al. Chromatin architecture of the human genome: gene-rich domains are enriched in open chromatin fibers. Cell 118, 555–566 (2004).
Article CAS PubMed Google Scholar
Ernst, J. et al. Mapping and analysis of chromatin state dynamics in nine human cell types. Nature 473, 43 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Timms, R. T., Tchasovnikarova, I. A. & Lehner, P. J. Position-effect variegation revisited: hushing up heterochromatin in human cells. BioEssays 38, 333–343 (2016).
Article CAS PubMed Google Scholar
Wang, Y., Nagarajan, M., Uhler, C. & Shivashankar, G. Orientation and repositioning of chromosomes correlate with cell geometry-dependent gene expression. Mol. Biol. Cell 28, 1997–2009 (2017).
Article CAS PubMed PubMed Central Google Scholar
Consortium, E. P. An integrated encyclopedia of dna elements in the human genome. Nature 489, 57–74 (2012).
Article ADS CAS Google Scholar
Niskanen, H. et al. Endothelial cell differentiation is encompassed by changes in long range interactions between inactive chromatin regions. Nucleic Acids Res. 46, 1724–1740 (2017).
Article PubMed Central CAS Google Scholar
Jordán-Pla, A., Pérez-Martínez, M. E. & Pérez-Ortín, J. E. Measuring RNA polymerase activity genome-wide with high-resolution run-on-based methods. Methods 159, 177–182 (2019).
Article PubMed CAS Google Scholar
Khan, A. & Zhang, X. dbsuper: a database of super-enhancers in mouse and human genome. Nucleic Acids Res. 44, D164–D171 (2015).
Article PubMed PubMed Central CAS Google Scholar
Belcastro, V. et al. Transcriptional gene network inference from a massive dataset elucidates transcriptome organization and gene function. Nucleic Acids Res. 39, 8677–8688 (2011).
Article CAS PubMed PubMed Central Google Scholar
Ouma, W. Z., Pogacar, K. & Grotewold, E. Topological and statistical analyses of gene regulatory networks reveal unifying yet quantitatively different emergent properties. PLoS Comput. Biol. 14, e1006098 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Fagny, M. et al. Exploring regulation in tissues with eQTL networks. Proc. Natl Acad. Sci. USA 114, E7841–E7850 (2017).
Article CAS PubMed PubMed Central Google Scholar
Mifsud, B. et al. Mapping long-range promoter contacts in human cells with high-resolution capture Hi-C. Nat. Genet. 47, 598 (2015).
Article CAS PubMed Google Scholar
Mirny, L. A. The fractal globule as a model of chromatin architecture in the cell. Chromosome Res. 19, 37–51 (2011).
Article CAS PubMed PubMed Central Google Scholar
Jalbrzikowski, M. et al. Transcriptome profiling of peripheral blood in 22q11. 2 deletion syndrome reveals functional pathways related to psychosis and autism spectrum disorder. PLoS ONE 10, e0132542 (2015).
Article PubMed PubMed Central CAS Google Scholar
Alsing, A. K. & Sneppen, K. Differentiation of developing olfactory neurons analysed in terms of coupled epigenetic landscapes. Nucleic Acids Res. 41, 4755–4764 (2013).
Article CAS PubMed PubMed Central Google Scholar
Xiao, J. Y., Hafner, A. & Boettiger, A. N. How subtle changes in 3D structure can create large changes in transcription. eLife 10, e64320 (2021).
Article PubMed PubMed Central Google Scholar
Zuin, J. et al. Nonlinear control of transcription through enhancer-promoter interactions. Preprint at bioRxiv https://doi.org/10.1101/2021.04.22.440891 (2021).
Harris, S. L. & Levine, A. J. The p53 pathway: positive and negative feedback loops. Oncogene 24, 2899–2908 (2005).
Article CAS PubMed Google Scholar
Dekker, J., Rippe, K., Dekker, M. & Kleckner, N. Capturing chromosome conformation. Science 295, 1306–1311 (2002).
Article ADS CAS PubMed Google Scholar
Kremer, K. & Grest, G. S. Dynamics of entangled linear polymer melts: a molecular-dynamics simulation. J. Chem. Phys. 92, 5057–5086 (1990).
Article ADS CAS Google Scholar
Michieletto, D., Orlandini, E. & Marenduzzo, D. Polymer model with epigenetic recoloring reveals a pathway for the de novo establishment and 3D organization of chromatin domains. Phys. Rev. X 6, 041047 (2016).
Google Scholar
Plimpton, S. Fast parallel algorithms for short-range molecular dynamics. J. Comp. Phys. 117, 1–19 (1995).
Article ADS CAS MATH Google Scholar

Download references

Acknowledgements

We thank the European Research Council (ERC CoG 648050 THREEDCELLPHYSICS) for support.

Author information

Authors and Affiliations

SUPA, School of Physics and Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh, EH9 3FD, UK
C. A. Brackley, D. Michieletto, M. C. F. Pereira & D. Marenduzzo
MRC Human Genetics Unit, MRC Institute of Genetics & Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh, EH4 2XU, UK
N. Gilbert & D. Michieletto
Institute of Pathology, University Medical Center, Georg-August University of Göttingen, 37075, Göttingen, Germany
A. Papantonis
Sir William Dunn School of Pathology, University of Oxford, South Parks Road, Oxford, OX1 3RE, UK
P. R. Cook

Authors

C. A. Brackley
View author publications
You can also search for this author in PubMed Google Scholar
N. Gilbert
View author publications
You can also search for this author in PubMed Google Scholar
D. Michieletto
View author publications
You can also search for this author in PubMed Google Scholar
A. Papantonis
View author publications
You can also search for this author in PubMed Google Scholar
M. C. F. Pereira
View author publications
You can also search for this author in PubMed Google Scholar
P. R. Cook
View author publications
You can also search for this author in PubMed Google Scholar
D. Marenduzzo
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.A.B., N.G., D. Michieletto, A.P., P.R.C., M.C.F.P., and D. Marenduzzo designed research; C.A.B., M.C.F.P., and D. Marenduzzo performed research; C.A.B., N.G., D. Michieletto, A.P., M.C.F.P., P.R.C., and D. Marenduzzo analysed the data and wrote the manuscript.

Corresponding author

Correspondence to D. Marenduzzo.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Brackley, C.A., Gilbert, N., Michieletto, D. et al. Complex small-world regulatory networks emerge from the 3D organisation of the human genome. Nat Commun 12, 5756 (2021). https://doi.org/10.1038/s41467-021-25875-y

Download citation

Received: 27 May 2021
Accepted: 30 August 2021
Published: 01 October 2021
DOI: https://doi.org/10.1038/s41467-021-25875-y

This article is cited by

Transcription modulates chromatin dynamics and locus configuration sampling
- Giada Forte
- Adam Buckle
- Chris A. Brackley
Nature Structural & Molecular Biology (2023)
Deciphering multi-way interactions in the human genome
- Gabrielle A. Dotson
- Can Chen
- Indika Rajapakse
Nature Communications (2022)
The 3D architecture of the pepper genome and its relationship to function and evolution
- Yi Liao
- Juntao Wang
- Changming Chen
Nature Communications (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.