Multiscale modelling of chromatin 4D organization in SARS-CoV-2 infected cells

Chiariello, Andrea M.; Abraham, Alex; Bianco, Simona; Esposito, Andrea; Fontana, Andrea; Vercellone, Francesca; Conte, Mattia; Nicodemi, Mario

doi:10.1038/s41467-024-48370-6

Download PDF

Article
Open access
Published: 13 May 2024

Multiscale modelling of chromatin 4D organization in SARS-CoV-2 infected cells

Nature Communications volume 15, Article number: 4014 (2024) Cite this article

896 Accesses
43 Altmetric
Metrics details

Subjects

Abstract

SARS-CoV-2 can re-structure chromatin organization and alter the epigenomic landscape of the host genome, but the mechanisms that produce such changes remain unclear. Here, we use polymer physics to investigate how the chromatin of the host genome is re-organized upon infection with SARS-CoV-2. We show that re-structuring of A/B compartments can be explained by a re-modulation of intra-compartment homo-typic affinities, which leads to the weakening of A-A interactions and the enhancement of A-B mixing. At the TAD level, re-arrangements are physically described by a reduction in the loop extrusion activity coupled with an alteration of chromatin phase-separation properties, resulting in more intermingling between different TADs and a spread in space of the TADs themselves. In addition, the architecture of loci relevant to the antiviral interferon response, such as DDX58 or IFIT, becomes more variable within the 3D single-molecule population of the infected model, suggesting that viral infection leads to a loss of chromatin structural specificity. Analysing the time trajectories of pairwise gene-enhancer and higher-order contacts reveals that this variability derives from increased fluctuations in the chromatin dynamics of infected cells. This suggests that SARS-CoV-2 alters gene regulation by impacting the stability of the contact network in time.

SARS-CoV-2 restructures host chromatin architecture

Article 23 March 2023

Viral remodeling of the 4D nucleome

Article Open access 25 April 2024

Statistics of chromatin organization during cell differentiation revealed by heterogeneous cross-linked polymers

Article Open access 14 June 2019

Introduction

The SARS-CoV-2 outbreak had an important impact on society and science. Several efforts have been made to understand the effects of the virus on host cells from different points of view, ranging from studying the immunological response to the virus¹ to investigating the effects of infection on epigenetic regulation² or researching therapeutic molecular targets³. SARS-CoV-2 is able to impact the chromatin architecture^4,5 of the host cell, which in general is an important control layer for gene regulation^6,7. Indeed, virus infection has been shown, for instance, to alter genome organization of olfactory receptors in humans and hamsters, providing a potential mechanism to explain anosmia⁵, one of the typical symptoms of Covid-19. More recently, it has been shown that SARS-CoV-2 deeply impacts genome organization at multiple length scales, ranging from some kilobases up to A/B compartment level, and influences the activity of gene categories⁴ fundamental to the immunological response¹, such as genes involved in the interferon (IFN) response and pro-inflammatory genes. Although those studies shed light on the effects of SARS-CoV-2 on genome organization, the physical mechanisms regulating how the virus changes the host cell 3D chromatin structure are not clearly understood.

Here, we employ models from polymer physics^8,9 and Molecular Dynamics (MD) simulations to quantitatively study multiscale chromatin re-arrangements resulting from SARS-CoV-2 infection of the host cell. In general, polymer models have shown to be a valuable tool to investigate genome organization in the cell nucleus, as they are able to describe the physical mechanisms shaping chromosome folding⁹ and to explain several features of genome architecture, e.g., the heterogeneity of chromatin structure in single cells¹⁰ or the structural re-arrangements caused by genomic mutations and their impact on gene expression¹¹. At very large genomic length scales (several Mbs), we show that a simple polymer made of consecutive compartments (i.e., block-copolymer model^12,13), in which homo- and hetero-typic interactions are defined within and between compartments, is able to explain the weakening of A compartment and enhancement of A-B mixing⁴ experimentally observed in SARS-CoV-2 infected genome, by basically reducing the intra-compartment homo-typic A-A affinity. At TAD level (from hundreds Kbs to some Mbs), we show that a model combining loop-extrusion^14,15 and phase-separation^13,16 effectively describes the experimentally observed intra-TAD weakening in SARS-CoV-2 infected cells⁴, which results from a reduction of extruders density coupled with an alteration of phase-separation properties of chromatin filament. Importantly, we find that this alteration is not observed in a polymer model describing chromatin organization in human coronavirus HCoV-OC43 infected cells (causing common cold), suggesting that alteration of phase-separation is a peculiar feature of SARS-CoV-2 infection. Furthermore, using the same model informed with HiC data^10,11, we investigate the architecture of genomic loci containing DDX58 and IFIT genes, which are of relevant immunological interest since linked to the antiviral interferon (IFN) response¹⁷ of the host cell. Specifically, analysis of polymer structures reveals that in SARS-CoV-2 model the population of single-molecule 3D configurations results more variable and less coherent with respect to non-infected condition, suggesting that the alteration of activity observed for IFN genes^1,4 can be due to a general loss of structural specificity caused by alteration of physical mechanisms driving 3D chromatin organization. By leveraging on our Molecular Dynamics simulations, we show that the model of SARS-CoV-2 exhibits a more scattered time dynamics, leading to a reduction of contact stability between pairs or hubs of multiple regulatory elements.

Overall, our polymer-physics based study provides insights into how viral infection affects chromatin organization and suggests that this occurs through the combined alteration of the loop-extrusion and phase-separation properties of chromatin, indicating a potential mechanistic link between the observed genome re-structuring and mis-regulation of, e.g., key genes involved in the immunological response within the host cell.

Results

We study chromatin re-organization of host cell genome infected by SARS-CoV-2. To this aim, we consider recently published HiC data⁴ in control condition, i.e., not infected human A549 cells expressing ACE2 (referred to as Mock) and in human A549 cells expressing ACE2 at 24-h post SARS-CoV-2 infection, in which HiC data highlighted re-arrangements at multiple length scales, involving A/B compartment, TADs and regulatory contacts within specific loci⁴.

Modeling of chromatin re-structuring in A/B compartments

One of the main structural re-arrangements on chromatin architecture resulting from SARS-CoV-2 infection of the host genome occurs at A/B compartment level. Specifically, it has been observed that viral infection results in a general weakening of A-compartment concomitantly with an enhanced A/B compartment mixing⁴, as schematically depicted in Fig. 1a. To quantitatively investigate such effect, we first focused on a simple model of chromatin at A/B compartment level. We employed the Strings and Binders Switch polymer model^13,18, where chromatin folding is driven by a phase-separation mechanism (Methods), similar to other models proposed for chromatin compartmentalization^12,19,20,21. Briefly, we consider a simple block copolymer where A and B compartments are modeled as two different types of binding sites (represented as different colors) which can homo-typically interact with cognate molecules (named binders) with an affinity E_A-A and E_B-B, driving A-A or B-B interactions within the same compartment (Fig. 1b). On the other hand, binders can also mediate A-B or B-A hetero-typic interactions, with a general affinity E_A-B (Methods). To ensure micro-phase separation of A and B blocks, we always consider E_A-A > E_A-B and E_B-B > E_A-B¹⁹. We first considered models with balanced interactions E_A-A = E_B-B and varied the homo-typic affinity (here, hetero-typic affinity E_A-B is kept constant, Methods). In general, low homo-typic interactions result in a reduced compartmentalization and increased A/B mixing, as shown by the model contact maps (Fig. 1c and Supplementary Fig. 1a), the first eigenvector E1 from Principal Component Analysis (PCA) and the saddle-plots of the sorted eigenvector components^22,23 (Supplementary Fig. 1b, Methods). Analogous effects are observed by increasing hetero-typic affinity E_A-B, keeping constant homo-typic E_A-A = E_B-B (Supplementary Fig. 2, Methods). In addition, models with unbalanced interactions with E_B-B > E_A-A result in both A/B mixing and, importantly, in weakening of A-compartment shown in the contact maps (Fig. 1c, Supplementary Fig. 3a) and in asymmetric saddle plots (Supplementary Fig. 3b). Therefore, we reasoned that a combination of models with balanced and unbalanced interactions can fit A/B compartment alteration in SARS-CoV-2 infected genomes. We then fitted the best combination of interactions to reproduce the average compartment profile (using saddle-plot maps) obtained from HiC data in Mock and SARS-CoV-2-infected cells (Methods). Interestingly, Mock HiC data are mainly described (almost 90%) by a model with balanced homo-typic interactions (i.e., E_A-A = E_B-B Fig. 1d, bottom left panel), indicating a similarity in the A and B average compartmentalization level and consistent with existing models of A/B compartmentalization²⁰. Conversely, data in SARS-CoV-2 infected cells are best described by a combination of unbalanced homo-typic interactions where E_B-B > E_A-A ( > 60%) consistently with the general weakening of A-compartment and above-mentioned enhanced A/B mixing, with balanced interactions only marginally involved (about 20%, Fig. 1d, bottom right). Importantly, albeit very simple, this model exhibits a high level of agreement with experimental data, as shown by the comparison between Log2 FC (SARS-CoV-2/Mock) of saddle-plot matrices (Pearson r = 0.77, Fig. 1e). Analogous results are found by fitting a combination of different hetero-typic affinities, keeping fixed balanced interactions E_A-A = E_B-B. Indeed, we find that SARS-CoV-2 data are better described by a combination with higher hetero-typic affinities with respect to the Mock case (Supplementary Fig. 4a), although the saddle-plot changes are captured with less accuracy (Pearson r = 0.6, Supplementary Fig. 4b), indicating an important role for the model with unbalanced affinities.

To test the robustness of our results on a real genomic region, we repeated the above discussed analysis using as case of study chromosome 11, with A and B blocks defined using the 1^st eigenvector from PCA (Supplementary Fig. 5a, Methods). MD simulations of this model return contact maps accurately describing A/B compartment profile contained in the HiC data (Supplementary Fig. 5b, c, Methods). Specifically, we find that Mock data are best described by a combination with ∼70% balanced interactions (Supplementary Fig. 5d), in line with the previously discussed result but also highlighting a not negligible role for unbalanced interactions even in the not-infected case, likely due to the distinct machineries behind A and B compartments formation²⁴. Conversely, SARS-CoV-2 data are best described by a combination with higher level of unbalanced affinities (∼80%, Supplementary Fig. 5d, e) in agreement with the weakening of A-compartment. Overall, these results show that chromatin re-arrangements observed in infected host genome can be explained by a re-modulation of affinities which in turn affects the tendency of compartments to microphase separate, as also shown by the 3D rendering of polymer structures representing A and B compartments in Mock (Fig. 1f, Supplementary Figs. 4d, 5f, left panel) and SARS-CoV-2 (Fig. 1f, Supplementary Figs. 4d, 5f, right panel) infected conditions.

Viral infection impacts loop-extrusion and phase-separation features at TAD level

Next, we investigated how SARS-CoV-2 infection impacts genome organization at TAD level, i.e., genomic scales ranging from tens of kbs to some Mbs. Indeed, it has been shown that viral infection produces a general weakening of intra-TAD contacts along with a slightly increase of inter-TADs interactions⁴ (Fig. 2a) and concomitantly with a general reduction of Cohesin level⁴, suggesting a reduction of loop-extrusion activity. To test this hypothesis and give a mechanistic insight to this result, we used a polymer physics model combining both loop-extrusion^14,15 (LE) and phase-separation^13,18 (PS) mechanisms (Fig. 2b, Methods), which recently has been shown to successfully describe chromatin organization at single cell level¹⁰. In this scenario, LE and PS simultaneously act and the pattern of chromatin contacts observed in HiC data results from an interplay between both processes (Fig. 2c). By varying the main system parameters, i.e., interaction affinity and average distance between extruders (or equivalently their number, Methods) (Supplementary Fig. 6a), we generated several different polymer populations with their simulated contact maps (Supplementary Fig. 6b) and contact probability profiles (Supplementary Fig. 6c). In this way, we were able to identify the polymer model best fitting the contact probability obtained from HiC data (Methods), in the genomic distance ranging from the sub-TAD level (approx. 10 kb) to inter-TADs contacts (some Mbs, Methods). The model is able to explain with accuracy experimental data, as shown by the fit of the average contact probability (as shown by χ² values in Supplementary Fig. 6d) in Mock (Fig. 2d, left bottom panel) and in SARS-CoV-2 infected (Fig. 2d, right bottom panel) conditions. Importantly, the best model describing Mock data revealed an average distance between extruders of approximately 100−150 kb (Supplementary Fig. 6d, left panel), consistent with previous estimates obtained from other HiC datasets¹⁵. Conversely, the best model fitting the SARS-CoV-2 infected HiC data was best described by a consistently decreased number of extruders (approximately halved, Supplementary Fig. 6d, right panel), in full agreement with experimental observations where viral infection produces a genome-wide decrease of Cohesin levels⁴. Interestingly, this analysis revealed that, in order to fit HiC data in infected cells, the reduction of extruders is coupled with a reduction of interaction affinity (around 15−20%) between binders and chromatin (Supplementary Fig. 6d), which affects chromatin spatial localization and contributes to the general weakening of intra-TADs contacts (Fig. 2d, upper panels) observed in infected genomes. This is quantitatively shown in the Log2 FC (SARS-CoV-2/Mock) of contact maps (Fig. 2e, upper panel) and contact probabilities (Fig. 2e, bottom panel), which exhibits a very good agreement with experimental data (Pearson r = 0.82, Methods). To check whether such changes in chromatin architecture are peculiar of SARS-CoV-2 or if they are observed in other coronaviruses, we repeated the above-described analysis using HiC data in cells infected by the human coronavirus (HCoV) OC43⁴, which causes common cold. Again, we were able to fit the average contact probability with high accuracy (chi-square test p-val=1). Intriguingly, we find that the best model describing HCoV-OC43 infection is analogous to the Mock case, with same affinity and a light increase of average distance between extruders (i.e., as observed in SARS-CoV-2, but to a lesser extent), as shown by the best fitting parameters (Supplementary Fig. 6e) and in full agreement with the experimental reports⁴. Therefore, this suggests that the above discussed re-arrangements, caused by alteration of phase-separation properties, are specifically induced by SARS-CoV-2 and are not observed in other viruses. Taking advantage of MD simulations, we produced an example of 3D structure representing the average TAD in Mock (Fig. 2f, left panel and Supplementary Movie 1) and SARS-CoV-2 infected (Fig. 2f, right panel and Supplementary Movie 2) conditions, providing an effective and realistic summary of the architectural re-arrangements occurring within and between TADs after the infection. Microscopy experiments could be a possible strategy to observe this structural effect.

Next, to investigate the impact of combined extruders and affinity variation on chromatin compartmentalization, we generalized the above discussed model of TADs by including also A and B compartments (Supplementary Fig. 7a, Methods). When the number of extruders is lowered (we considered ∼4-fold reduction), compartment affinities kept fixed, TADs are weakened (p = 10⁻⁴⁶, one-sided Mann-Whitney U test) and compartmentalization is strengthened (Supplementary Fig. 7b−d Methods), consistent with experimental observation in which depletion of Cohesin increases compartment strength^25,26. Interestingly, if the same decrease of extruders is coupled with a decrease of the homo-typic affinities, either intra-TAD contacts (p = 10⁻⁹⁷, one-sided Mann−Whitney U test) and compartmentalization strength are reduced (Supplementary Fig. 7b−d), in agreement with HiC data from SARS-CoV-2 infected cells. Overall, those simulations suggest that SARS-CoV-2 viral infection specifically affects genome organization by altering fundamental physical mechanisms, including loop-extrusion and phase-separation, that shape chromatin structure.

Structural re-arrangements of interferon response genes (IFN) loci

Next, to understand how the above discussed structural re-arrangement within TADs may affect gene regulation, we modeled real genomic regions relevant in case a viral infection occurs. Specifically, we considered genomic loci containing interferon (IFN) response genes, i.e., genes typically upregulated upon interferon stimulus and that are commonly expressed as response to a viral infection¹⁷. Importantly, it has been shown that in severe Covid syndromes such genes are not properly expressed^1,27 with consequent alteration in the immunological response of host cell. We considered as first case of study the genomic region spanning 400 kb around the DDX58 gene (chr9: 32300000-32700000 bp, hg19 assembly, Fig. 3). The DDX58 locus exhibits the typical re-arrangements caused by SARS-CoV-2 infection, as in Mock case the DDX58 gene is contained in a well-defined domain limited by convergent CTCF sites (Fig. 3a), whereas in the infected case a general weakening of intra-TAD interactions is observed, although CTCF peaks are mainly unchanged (Fig. 3b). Analogous observations hold for another IFN locus, containing the cluster of IFIT genes (chr10: 90900000-91290000 bp) (Supplementary Fig. 8a, b). To quantitatively investigate such re-arrangements, we employed the above-described polymer model combining loop-extrusion and chromatin-protein interactions¹⁶, using experimental CTCF ChIP-seq data⁴ to set the probabilities and the positions of the anchor points for extruders¹⁵ and HiC data to optimize the types and the positions of the binding sites¹¹ (Supplementary Fig. 8a). To this aim, we employed the PRISMR algorithm¹¹, which infers from the input HiC contact map the number of types of binding sites and their best arrangement along the polymer to fit the input data (Methods). In the DDX58 locus, the algorithm returned 4 types of binding sites (Fig. 3c, d), while in the IFIT locus 5 types have been found (Supplementary Fig. 8c,d). Taking advantage of the results obtained for the polymer model calibrated to simulate the average chromatin behavior at TAD level, we were able to generate, by MD simulations, ensembles of 3D structures accurately capturing the differences in the DDX58 locus between Mock and SARS-CoV-2 conditions, as shown by the simulated contact maps (Fig. 3c, d) highly correlated with experimental data (Pearson r > 0.9, distance corrected r’=0.67, Methods). In addition, the model correctly captures the different contact probability decay (Supplementary Fig. 9b), as shown by the Log2 FC curve (Supplementary Fig. 9c, Pearson r = 0.81). Analogous results were found for the polymer model of the IFIT locus, which returns highly correlated contact maps (Supplementary Fig. 8c, d) and similar contact probability decays (Supplementary Fig. 9d, e). Finally, examples of 3D structures taken from MD simulations (Supplementary Data 1) visually highlight the above-discussed architectural differences, with the DDX58 and IFIT loci organized in distinct, well-defined regions in Mock (Fig. 3e and Supplementary Fig. 8e) while they tend to be less localized and more intermingled in SARS-CoV-2 (Fig. 3f and Supplementary Fig. 8f).

**Fig. 3: Structural re-arrangements of IFN DDX58 locus.**

Single cell 3D structures result highly variable in SARS-CoV-2 infected condition

The different 3D structures observed in Mock and SARS-CoV-2 prompted us to investigate in more detail the above-discussed architectural differences at the single cell level. To this aim, polymer models offer a powerful tool as they allow to build ensembles of independent 3D structures that mimic single-cell variability¹⁶, experimentally observed e.g., by MERFISH microscopy method²⁸. Therefore, leveraging on such feature, we analyzed the population of 3D structures in Mock (Fig. 4a, upper panel) and SARS-CoV-2 (Fig. 4a, bottom panel) models. First, we focused on the DDX58 promoter and its validated enhancer⁴ (Fig. 4a). By visual inspection of these 3D structures in both conditions, it emerges that DDX58 promoter and the enhancer tend to be closer in space in Mock with respect to the infected condition, in agreement with HiC data. The distributions of 3D distances between the DDX58 promoter and its enhancer (Fig. 4b) confirmed this observation, as in Mock it exhibits a lower mean than the infected case (one-sided t test p = 10⁻²⁵⁹). Interestingly, the distribution results also more variable in infected cells (st. dev. in SARS-CoV-2 ∼30% higher than in Mock), suggesting that the mis-regulation of this gene upon infection is also due to a loss of contact specificity and supporting the scenario by which the viral action changes the binding pattern through alteration of Cohesin and other factors, which in turn causes a general loss of structural coherence in the population of 3D structures. Next, we focused on the architecture of the entire locus and considered the polymer size and shape descriptors²⁹ (Methods). Again, we find that the estimated volume distribution (Methods) is more variable in SARS-CoV-2 (Fig. 4c upper panel, st. dev. ∼30% higher). Conversely, the average anisotropy distribution (Fig. 4c, bottom panel), which measures how asymmetrically the polymer is distributed in space, results lower in SARS-CoV-2 population. Analogous results are found for a-sphericity, another shape descriptor (Methods) measuring the deviation from a spherical geometry. Those results are consistent with the results of the previous section, where we observed increased inter-TADs contacts and less localization observed which make the polymer more homogeneous and spherical in SARS-CoV-2 model.

**Fig. 4: Single cell 3D structures result more variable in SARS-CoV-2 infected condition.**

Next, we investigated whether the infected model may exhibit differences on higher-order contacts. To this aim, we focused on the cluster of IFIT genes, where we considered the probability of three-way contacts^30,31 using as point of view IFIT3 gene, located in the center of the IFIT TAD (Fig. 4d, Methods). We find that in SARS-CoV-2 model three-way contacts result consistently reduced (Fig. 4d) although weak, long-range events appear. By fixing the enhancer 1 (E1) as other point of view we generated a virtual three-way profile involving E1 and IFIT3 (Fig. 4e), which clearly highlights specific three-way contacts, as the triplet involving E1-IFIT3-E3 (arrow in Fig. 4e) whose frequency in Mock case results statistically higher than in control triplets (p = 3*10^-9, one-sided Mann−Whitney U test, Methods). In addition, the frequency of this triplet is reduced in SARS-CoV-2 model (p = 7*10^-5, one-sided Mann-Whitney U test, Methods). This suggests that the mis-regulation may also be due to an alteration of contact network within the regulatory hub, consistent with other recent observations whereby the olfactory hubs are disrupted/perturbed after SARS-CoV-2 infection⁵. Finally, examples of 3D structures of the IFIT locus in Mock (Fig. 4f, left panel) and SARS-CoV-2 (Fig. 4f, right panel) conditions provide a visual summary of the discussed results.

Time dynamics of 3D contacts is highly variable in SARS-CoV-2 infected condition

Next, we investigated the mechanism leading to the different structural variability observed in Mock and SARS-CoV-2 models. To this aim, we considered the population of independent time trajectories (Methods) of the polymer and analyzed the dynamics in both conditions (Fig. 5a) at equilibrium (Methods). We focused again on the DDX58-enhancer distance (Fig. 5b) and the above-discussed polymer shape descriptors, i.e., anisotropy (Supplementary Fig. 10a, left panel) and a-sphericity (Supplementary Fig. 10a, right panel). As expected, the distance trajectories in the Mock model appear fluctuating around average values lower than the SARS-CoV-2 model, as also confirmed by the distributions of the average distance over different time trajectories (Fig. 5c, upper panel, t test p = 10^-15, Methods). Same analysis for anisotropy (Supplementary Fig. 10b, left panel) and a-sphericity (Supplementary Fig. 10b, right panel) reveals instead a specular behavior, in agreement with the observations of the previous section. Interestingly, it emerges also that the time trajectories in SARS-CoV-2 model are more fluctuating, as shown by the distribution of the standard deviations of the distance in time (Fig. 5c, lower panel). For the shape descriptors Mock and SARS-CoV-2 models exhibit similar deviations from the average value during time (Supplementary Fig. 10c). Analogously, multiple co-localization events (named co-occurrences, Methods) in IFIT locus, involving IFIT3 and two enhancers, tend to be less frequent in time in SARS-CoV-2 model dynamics (Supplementary Fig. 10d). These results suggest that SARS-CoV-2 could affect the stability of contacts between regulatory elements. To support this conclusion, we analyzed in more detail the DDX58-enhancer distance time dynamics by considering shorter time scales at higher time resolution (Fig. 5d, Methods). We generated time trajectories to follow a smooth evolution of gene-enhancer distance, in Mock (Supplementary Movie 3) and SARS-CoV-2 (Supplementary Movie 4) models. In this way, we were able to estimate a contact time τ, i.e., how long the gene and the enhancer spend in contact (Fig. 5e, Methods). Importantly, we find that the distribution of contact times tends to be significantly lower in SARS-CoV-2 model (Fig. 5f, t test p = 2*10^-4), with an approximately 1.5-fold reduction of the average contact time. Analogously, we considered the distribution of time intervals between contacts (Supplementary Fig. 10e) and found that its average exhibits an approximately 1.6-fold increase in SARS-CoV-2 condition (one-sided t test p = 3*10^-4), suggesting that alteration of either contact times and frequencies similarly contribute to the changes in the HiC map observed in infected condition.

**Fig. 5: Time dynamics of 3D contacts is more variable in SARS-CoV-2.**

Taken together, those results point toward a scenario where the mis-regulation of IFN genes observed in SARS-CoV-2 could be imputed to a decreased contact stability between genes and their regulatory elements. It is worth to stress that, although in-silico generated, the distance dynamics obtained by these polymer models represent a good proxy of real trajectories, as shown in recent studies³² and are therefore suitable quantities for experimental testing through e.g., live cell imaging³³.

Chromatin re-arrangements in SARS-CoV-2 infection correlate with a combination of changes of CTCF and histone marks

Next, to understand the link between the architectural re-arrangements encoded in HiC data and molecular factors, we investigated the relationship between binding sites and epigenetics marks, such as CTCF and histone modifications. In this way, we could assign a biological identity to the binding sites found from HiC data^11,34 and mechanistically interpret the changes in such associations occurring upon viral infection. To this aim, we made a cross-correlation analysis (Methods) between the binding site profiles of the model and different available epigenetic marks at DDX58 (Fig. 6) and IFIT (Supplementary Fig. 11) loci, in Mock and SARS-CoV-2 conditions. In Mock, we find (Fig. 6a, right panel) a clear, strong correlation between CTCF and RAD21 with binding site type #1, likely highlighting an important role for LE mechanism in shaping the central domain containing the DDX58 gene, but we also observe a significative correlation with RNAPolII (RPB1) and H3K4me3, in agreement with the view of a combinatorial action of different factors in shaping chromatin organization³⁴. In addition, it emerges a clear association between the flanking binding sites (#3 and #4) to H3K27me3 and H3K9me3 respectively (Fig. 6b, left panel). In SARS-CoV-2 model the distribution of binding sites exhibits, in general, a similar profile (Fig. 6a, left panel) but a richer pattern of (less strong) correlations is found (Fig. 6b, right panel). In particular, we could identify the most significant changes in such correlations by using a control set of randomly permuted polymers (Methods) and found that they involve CTCF (p = 0.047) and RAD21 (p = 0.065, generally reduced), which become associated with multiple types (#1 and #3) as well as H3K27ac (p = 0.033), which exhibits a general reduction too⁴. Analogous considerations hold for IFIT locus where changes in correlations involve CTCF, RPB1 and H3K4me3 (Supplementary Fig. 11), although they result much less significant (p > 0.1). In general, those results support the proposed mechanism⁴ by which an alteration of LE activity upon infection coupled with changes in the epigenetic signatures of activity produces an altered expression of IFN genes with a consequent poor response to the infection.

**Fig. 6: Chromatin re-arrangements correlates with a combination of changes of CTCF and histone marks at DDX58 locus.**

Discussion

In this work, we investigated how SARS-CoV-2 infection alters the 3D organization of chromatin in the host cell at multiple length scales, ranging from few kilobases to several Mbs and involving different structural entities, as A/B compartments, TADs and gene-enhancer loops. To this aim, we employed models from polymer physics and MD simulations widely used to study chromatin organization^13,18,35. We showed that a simple block copolymer including just homo-typic and hetero-typic interactions is overall able to describe the A-compartment weakening and A-B mixing detected from HiC data in SARS-CoV-2 infected cells, by remodulating A-A affinities in an unbalanced A/B compartment model. Of course, more complicated descriptions of compartmentalization are possible and could include other mechanisms known to play a role for chromatin structure, such as interaction with nuclear envelope^19,36. At TAD level, we find that a combined reduction of loop extrusion activity (modeled as a reduction of extruders) together with an alteration of phase-separation properties (modeled as a reduction of chromatin-protein affinities) potentially explain the weakening of intra-TAD interactions observed in HiC data⁴. Interestingly, a model calibrated from HiC data in host cells infected with virus⁴ HCoV-OC43, another human coronavirus causing common cold, has a slightly reduced loop-extrusion activity with respect the Mock case but keeps unchanged protein affinities with chromatin, suggesting that the capacity of altering this phase-separation properties is a peculiar feature of SARS-CoV-2 model and it is not triggered by defense mechanisms of the host cell. In addition, a model including TADs and A/B compartments confirmed this scenario, as simultaneous alteration of loop-extrusion and phase-separation can lead to intra-TAD weakening and a general decrease of compartmentalization strength, as observed in SARS-CoV-2 infected condition. We then investigated the link between chromatin re-arrangement and the regulation of genes involved in the antiviral response (IFN genes) which are mis-regulated upon SARS-CoV-2 infection¹. Polymer models of genomic loci containing DDX58 and IFIT genes highlighted a higher degree of variability in the ensemble of single-molecule conformations of SARS-CoV-2 models. This variability is in turn related to a noisier and less stable time of contact dynamics, suggesting that SARS-CoV-2 infection reduces specificity and structural stability of regulatory contacts. Analysis of epigenetic association with the polymer models reveals changes with factors not only limited to Cohesin and CTCF, consistent with the above depicted scenario where SARS-CoV-2 infection alters multiple physical mechanisms shaping chromatin organization of the host cell.

In order to understand the molecular causes leading to the above-discussed re-arrangements, by means of direct or undirect mechanisms, it would be interesting to integrate in the polymer model the existence of specific molecular factors encoded by the virus known to perturb the host cell, as highlighted by recent experiments showing that viral proteins can alter the cell epigenome² (ORF8) or interact with other proteins of the infected cell³⁷. In this regard, it is worth to mention that other viruses are capable of re-structuring genome organization through the transcription of their proteins, such as NS1 from influenza A virus (IAV)³⁸. The above outlined strategy, based on polymer models combined with experimental data, could be relevant to test the effects of specific proteins on the physical mechanisms shaping chromatin architecture (e.g., phase-separation) and therefore be helpful in the identification of molecular targets for therapeutics purposes.

In general, exploring the link between viral infection and chromatin architecture can be extremely insightful to understand virus action on host cell at the level of gene regulation. To this aim, polymer models turn out to be valuable tool as they offer an unbiased, predictive approach to connect different aspects relevant for genome organization and function³⁹, including single-cell variability, dynamics between regulatory elements and research of therapeutic targets.

Methods

We use polymer physics models to study chromatin re-organization of host cell genome infected by SARS-CoV-2. We consider recently published HiC data⁴ in control condition, i.e., not infected human A549 cells expressing ACE2 (referred to as Mock) and in human A549 cells expressing ACE2 at 24-hour post SARS-CoV-2 infection.

Polymer model of A/B compartment

To simulate A and B compartment we employed the Strings and Binders Switch¹⁸ (SBS) model, in which a chromatin filament is modeled by a string made of \(N\) beads that can interact with different, specific binding factors populating the surrounding environment. We used a polymer made of \(N=\)1000 beads, divided in equally sized blocks of 75 beads, schematically colored in green and red (Fig. 1) and representing, without loss of generality, A and B compartment respectively. Assuming, e.g., a genomic content of 100 kb per bead, the polymer represents a region of 100 Mb divided in 12 compartments large 7.5 Mb each, in line with average size of A/B compartments⁴⁰. Homo-typic affinities E_A-A and E_B-B between binding sites and cognate binders, which mediate intra-compartment interactions (i.e., between A-A and B-B), are taken in the range 3.2−3.4\({K}_{B}T\). In Fig. 1 and Supplementary Figs. 1 and 3 we show affinities normalized with respect the background hetero-typic interaction (A-B and B-A), which is taken E_A-B = 3.1\({K}_{B}T\) and kept constant in all the simulations for sake of simplicity. To test the generality of our results, we considered also models with constant homo-typic affinities (E_A-A = E_B-B, for sake of simplicity) and variable hetero-typic affinity E_A-B taken in the range 3.0−3.2\({K}_{B}T\). As before, in Supplementary Figs. 2 and 4 we show affinities normalized with respect E_A-B. Finally, binder concentration is taken above coil-globule transition threshold¹³, so to ensure phase-separation of compartments.

Polymer model of real chromosomes

To simulate chromosome 11 (Supplementary Fig. 5), we used the 1st eigenvector from the Principal Component Analysis (PCA) applied to HiC data in Mock condition at 100 kb resolution, using the function eigs_cis from cooltools package²³. GC content was used to identify A and B compartments. We used a polymer made of \(N=\)1353 beads, having 100 kb of genomic content. Beads of type A or type B were assigned based on the compartment profile, as shown in Supplementary Fig. 5a. When the eigenvector was not defined, A/B beads were assigned by simple interpolation around those sites (Supplementary Fig. 5a). As before, homo-typic affinities E_A-A and E_B-B are taken in the range 3.2-3.4\({K}_{B}T\) with hetero-typic E_A-B = 3.1\({K}_{B}T\) kept fixed.

Polymer model of TADs

To simulate polymer models of TADs we considered again the above mentioned SBS model combined with loop extrusion^14,15 (LE), following a previously described implementation¹⁰. Specifically, we use a simple homopolymer made of \(N=\)1000 beads with one type of binder, as shown in Fig. 2b. Bead-binder interaction affinity is taken in the range 3.1−3.8\({K}_{B}T\), binder concentration is accordingly taken high enough to ensure coil-globule phase-transition and, as before, it is kept constant for sake of simplicity. Anchor points for the loop extruding factors (LEfs) are all bi-directional (i.e., forward and reverse) and are regularly placed along the polymer every 120 beads, occurring with a probability equal to 0.5, as shown in Fig. 2c. We assume a 5 kb genomic content per bead, so to obtain an average TAD size ∼600 kb, i.e., similar to the average TAD size measured in Mock HiC data⁴. Average distances among extruders (we refer to as LEf separation in Supplementary Fig. 6d, e), proportional to the inverse of their total number, is taken in the range 60−500 kb, consistent with previous reports¹⁵. Extruders lifetime, which is in turn related to the processivity (i.e., the average length of an extruded loop, see Molecular Dynamics simulation details), is taken high enough to allow the formation of TADs and loops (500 kb) in the contact maps and kept constant for sake of simplicity. Results were not significantly affected by changes of this parameter. Analogously, polymer model including TADs and A/B compartments (Supplementary Fig. 7) is made of \(N=\)1000 beads with four equally sized compartments in the sequence A-B-A-B (250 beads each), with 5 TADs in each compartment (50 beads each). By assuming 10 kb of genomic content for each bead, we simulate approximately 10 Mb. Again, homo-typic affinities E_A-A and E_B-B are taken in the range 3.2−3.4\({K}_{B}T\), hetero-typic affinity E_A-B = 3.1\({K}_{B}T\) kept fixed. In this case, we considered only balanced affinities (E_A-A = E_B-B) for sake of simplicity. Loop extrusion parameters are similar to the above described model of TADs, with average distance among extruders taken in the range 100−1000 kb (Supplementary Fig. 7a) and similar processivity.

Polymer model of interferon response genes DDX58 and IFIT

To simulate DDX58 (chr9:32300000-32700000, hg19) and IFIT (chr10:90900000-91290000, hg19) loci, we used the previously described hybrid model (SBS + LE), informed with experimental data to find the binding sites along the polymer, as schematically shown in Supplementary Fig. 9a. Binding sites have been obtained with the PRISMR algorithm¹¹, using as input HiC data in Mock and SARS-CoV-2 conditions at 5 kb resolution. In general, starting from a contact map of a generic genomic locus, this algorithm returns the minimum number of different types of binding sites (represented by different colors) and their position along the chain in order to best explain the input data. This occurs through an iterative Simulated Annealing Monte Carlo optimization procedure that minimizes a cost function made of two terms: the first one takes into account the difference between HiC and model predicted contact matrices, while the second is proportional to the total number of model binding sites so to penalize the presence of too many of them¹¹. In this way, the model optimizes the similarity with the input HiC data, while avoiding overfitting. Here, 4 types of binding sites for DDX58 (Fig. 3) and 5 types for IFIT (Supplementary Fig. 8) locus have been found, in line with similar polymer models of real loci¹⁰, and an inert type not shown in the diagrams for sake of simplicity. As the single bead contains 500 bp, we used polymers of \(N=\) 900 beads for DDX58 locus and \(N=\) 880 beads for IFIT locus (we added inert tails of 50 beads on both sides to control boundary effects). Homo-typic bead-binder attractive interactions were taken in the range 2.3−2.9\({K}_{B}T\), SARS-CoV-2 model simulated with lower affinity with respect the Mock model, consistently with the results obtained in the previously discussed model of TADs. In addition, a general, constant hetero-typic interaction is also used¹⁶. Anchor points have been defined using CTCF peaks from ChIP-seq data⁴ binned at 500 bp resolution (i.e., the size of a single polymer bead). The presence of an anchor point occurs with a probability proportional to the height of the signal. Orientations of anchor points (Fig. 3 and Supplementary Fig. 8) have been assigned using the FIMO tool in the MEME suite (https://meme-suite.org/meme/) fed with CTCF binding motif (JASPAR database)^10,11. If multiple matches occurred within the 500 bp window, the most likely was taken (i.e., the FIMO hit with lowest p value). Processivity was taken in the range 150−400 kb to ensure the formation of the loops in the maps (Fig. 3 and Supplementary Fig. 8). Separation among extruders is taken in the range 50−100 kb, with SARS-CoV-2 case simulated with halved density with respect to the Mock case, again consistently with the results found from the previously described model of TADs.

Molecular dynamics simulations details

All previously described polymer models have been explored using classical Molecular Dynamics simulations⁴¹. In general, chromatin fiber is a standard bead on a string chain and binders are simple spherical particles, both with same diameter \({\sigma }=\)1 and mass \(m=\)1, expressed in dimensionless units. Excluded volume effects of beads and binders are taken into account using a truncated, purely repulsive Lennard-Jones (LJ) potential⁴¹, with energy unit \({K}_{B}T\), \(T\) temperature and \({K}_{B}\) Boltzmann constant. Consecutive beads are linked by FENE bonds⁴¹, with maximum length \({R}_{0}=\)1.6\({\sigma }\) and spring constant \({K}_{{FENE}}=\)30\({K}_{B}T/{{\sigma }}^{2}\). Bead-binder attractive interactions are modeled using a short-range, truncated LJ potential: \({V}_{{LJ}}(r)=4{\varepsilon }[{(\frac{{\sigma }}{r})}^{12}-{(\frac{{\sigma }}{r})}^{6}-{(\frac{{\sigma }}{{r}_{{{{{\mathrm{int}}}}}}})}^{12}+{(\frac{{\sigma }}{{r}_{{{{{\mathrm{int}}}}}}})}^{6}]\) for r < r_int and 0 otherwise, where r is the distance between particle centers and ε, sampled in the range (8−12)\({K}_{B}T\), regulates the interaction intensity. Specific parameters are: \({r}_{{{{{\mathrm{int}}}}}}=\)1.3\({\sigma }\) for compartment and TAD models (Figs. 1 and 2), \({r}_{{{{{\mathrm{int}}}}}}=\)2.5\({\sigma }\) for specific interaction in DDX58 and IFIT loci models (Fig. 3 and Supplementary Fig. 8). Unless differently stated, we always show binding affinities corresponding to the minimum of \({V}_{{LJ}}\). Length scales are mapped in physical units (Figs. 4 and 5) through the relation¹⁸ \({{\sigma }=\left(\frac{g}{G}\right)}^{1/3}D\) where \(D\) is nuclear diameter (7 μm), \(G\) is the nuclear genomic content (6.6 Gbp) and \(g\) is genomic content of a single bead (500 bp), which returns \({\sigma }{\sim }\)30 nm for the models of DDX58 and IFIT loci, in line with previous estimates from analogous polymer models³⁰.

The system evolves according the Langevin equation⁴² with standard parameters⁴¹, i.e., friction coefficient \({\zeta }=\)0.5 and temperature \(T=\)1, in dimensionless units. We used an integration timestep \({dt}=\)0.01. Integration has been performed with a Velocity Verlet algorithm using the publicly available HOOMD software⁴³. Simulations are performed in a cubic box (linear size \(L=\)50\({\sigma }\) in real loci models) with boundary periodic conditions to take into account finite size effects. For each parameter setting we performed up to 30 independent simulations. Polymer configurations are initialized as standard Self-Avoiding-Walk (SAW) states, binders are randomly located in the simulation box and then equilibrated up to 5*10⁷ timesteps. Configurations have been sampled up to the equilibrium sampling frequency every 5*10⁴ timesteps, except for the simulations shown in Fig. 5d, e, f, where frames were sampled every 10³ timesteps. Quantities obtained from the entire population of single-molecule 3D configurations (Fig. 4b, c) are shown as histograms. Analogously, histograms of averages and standard deviations of time trajectories, shown in Fig. 5c and Supplementary Fig. 10b−d, are computed over 30 independent trajectories. Timescales shown in Fig. 5 are estimated by using the relation¹³ τ = η(6πσ³/ε), where ε = 1K_BT is the energy scale and \(\eta\) the viscosity; assuming \(T=\)300 K and \(\eta=\)0.2 cP, we obtain an estimated timescale \(\tau {\sim }\)0.5 ms, again in line with previous studies³⁰.

Loop extrusion process is implemented largely following previous descriptions¹⁵ and is integrated in the above-described MD simulations, in a model combining both phase-separation and loop extrusion mechanisms. Basically, loop extruding factors are modeled as harmonic springs with elastic constant \({K}_{{spring}}=\)10\({K}_{B}T/{{\sigma }}^{2}\) and equilibrium distance \({r}_{{eq}}=\)1.1\({\sigma }\). Extruding factors slide along the polymer every 500 MD timesteps by moving the spring from the bead pair (\(i\), \(j\)) to (\(i\) − 1, \(j\) + 1). Extruders can stochastically detach from the polymer with a rate \({k}_{{off}}\), which is related to the processivity through the relation⁴⁴ \({proc}{=2g/k}_{{off}}\), \(g\) the above defined genomic content per bead. When an extruder detaches, a new one is replaced along the polymer in a random position, so to keep a constant number of extruders. An extruder halts its motion when it meets oppositely directed anchor points or when it meets another extruder during the sliding¹⁵, since they cannot pass through each other.

The codes of the above-described models, i.e., performing MD simulations of A/B compartments, TADs and real genomic loci, where SBS and LE are combined, have been adapted from the software¹⁰ at GitHub link (https://github.com/ehsanirani/PhaseSeparation-LoopExtrusion-MD) and are available as Supplementary Software 1.

Analysis of contact maps, saddle-plots and contact probability

Contact maps were computed from the ensemble of 3D polymer structures by setting a distance threshold \(A\) and defining a contact if d_i,j < Aσ, where d_i,j is the Euclidean distance between beads \(i\) and \(j\) and \(A\) taken is taken the range 2−3.5. For each polymer conformation, we calculate the associated contact map and then aggregate all the maps of the ensemble of structures for a fixed choice of parameters. All the simulated maps correspond to the entire polymer, except Fig. 1c, Supplementary Figs. 1a, 2 and 3a, where an average among three consecutive sub-matrices is shown for presentation purposes. Analogously, triplet matrices (Fig. 4d) are computed from a simple generalization of the pairwise calculation³⁰. We first fix a specific point of view (i.e., the gene IFIT3, Fig. 4d) and identify it on the polymer (e.g., bead \(i\)). Then, from each polymer conformation we call a triple contact of bead \(i\) with other beads (e.g., \(j\) and \(k\)) if their mutual Euclidean distance is lower than the threshold or, more formally, if d_i,j& d_j,k & d_k,I < 5σ. Then, we iterate over all possible \(j\) and \(k\) indexes to obtain a triplet matrix of single polymer conformation. Those matrices are then aggregated to generate a triplet frequency matrix. Statistical significance of the triplet frequency involving E1-IFIT3-E3 (Fig. 4e) is estimated by comparing the distribution of triple contacts from the population of single conformations with the distribution of control triplets located 100 kb downstream the IFIT3 promoter and preserving the relative genomic distance. Saddle-plots (Fig. 1e, Supplementary Figs. 1b, 3b, 4a, 5e) have been computed using the cooltools package²³ of the cooler tool to analyze HiC data²². Briefly, we first converted the simulated maps in cool format using the create_cooler function, then we called A/B compartments and then used the saddle function with default number of bins (i.e., 50). Analogously, we performed the same analysis on HiC data⁴ (80 kb resolution) in Mock and SARS-CoV-2 infected conditions to generate the Log2-FC matrix in Fig. 1e. Best polymer model for A/B compartments in Fig. 1d was found by considering linear combinations of simulated saddle-plots and minimizing the sum of the entry-by-entry square difference between the model and experimental saddle-plots. As we considered combinations of four matrices, 2 with unbalanced (i.e., E_A-A < E_B-B) and 2 with balanced (i.e., E_A-A = E_B-B) homo-typic affinities, the procedure finds the best four coefficients (Fig. 2d, bottom panels), their sum constrained to 1. We verified that by minimizing other quantities (e.g., the χ²) analogous results were found. Similarly, best model in Supplementary Fig. 4a was found by considering combinations of two matrices with different hetero-typic affinities E_A-B and balanced homo-typic affinities E_A-A = E_B-B (specifically, we combined models with E_A-A/E_A-B = 1.06 and 1.08). On the same line, best model for chromosome 11 (Supplementary Fig. 5b, c, d, e) is a combination of two matrices with different homo-typic affinities (one balanced E_A-A = E_B-B and one unbalanced E_A-A < E_B-B, hetero-typic E_A-B kept fixed). In this case, we could fit simulated and experimental 1st eigenvector profile (Supplementary Fig. 5b, c), which returns the same best coefficient (Supplementary Fig. 5d, e) obtained by performing the above-described fit using saddle-plots. Compartment strength (Supplementary Fig. 7c) has been computed using the saddle_strength function from the cooltools package²³. Contact probabilities shown in Fig. 2d and Supplementary Fig. 6c were computed from the previously defined contact maps by taking the average value of each diagonal. Curves are then multiplied by a coefficient (equal in Mock and SARS-CoV-2 models) to map the simulated values into the experimental range. Best model for TADs was found by considering the best linear combination of contact probabilities for different model parameters (i.e., average LEf separation and interaction affinity) and then minimizing the \({\chi }^{2}\) with experimental curve in the range 15 kbp−2.5 Mb (Mock or SARS-CoV-2 conditions), so to take into account the wide range of variability of the contact probability. To test the best description in terms of affinities, we consider combinations of four curves, two with affinities 3.1 and 3.8\({K}_{B}T\) (LEf separation fixed) and two with same affinities but with no LEfs (Supplementary Fig. 6d and 6e). Therefore, the fit returns four coefficients, their sum constrained to 1, representing the amount of each curve in the best model.

Gene-enhancer distance, shape descriptors and structural variability

3D distances between DDX58 gene and its enhancer E has been simply obtained by calculating the Euclidean 3D distance from a 3D structure. Smoothing of 3D distance trajectories shown in Fig. 5d and e has been done with a standard 1^st order polynomial computed by use of signal.savgol_filter function from the Python package scipy. Shape descriptors were computed using standard formula used in polymer physics field. We first computed the gyration tensor \({{{{{\bf{G}}}}}}\)²⁹, whose entries are \({{{{{{\rm{G}}}}}}}_{{{{{{\rm{\alpha }}}}}},{{{{{\rm{\beta }}}}}}}=\frac{1}{N}({\sum }_{i}^{N}({x}_{\alpha,i}-{x}_{\alpha,{CM}})({x}_{\beta,i}-{x}_{\beta,{CM}}))\), where \({{{{{\rm{\alpha }}}}}},{{{{{\rm{\beta }}}}}}\in \{0,1,2\}\) are component indexes, \({{{{{{\boldsymbol{x}}}}}}}_{{{{{{\boldsymbol{i}}}}}}}\) is the vector position of bead \(i\), \({{{{{{\boldsymbol{x}}}}}}}_{{{{{{\boldsymbol{CM}}}}}}}\) is the vector position of the polymer center of mass and \(N\) number of polymer beads. Then, by diagonalizing this tensor, we obtained the three eigenvalues λ₁, λ₂ and λ₃, sorted in ascending order. Anisotropy (Fig. 4c and Supplementary Fig. 10a−c) is defined as²⁹ \(1-3({\lambda }_{1}{\lambda }_{2}+{\lambda }_{2}{\lambda }_{3}+{\lambda }_{3}{\lambda }_{1})/{({\lambda }_{1}+{\lambda }_{2}+{\lambda }_{3})}^{2}\) and reflects the symmetry of a polymer conformation. Analogously, asphericity (Supplementary Fig. 10a−c) is defined as²⁹ \(({\lambda }_{1}-({\lambda }_{2}+{\lambda }_{3})/{2})\) and measures the deviation from a spherical symmetry. Volume of a polymer conformation is estimated by first numerically computing a convex hull from the 3D coordinates of the polymer by use of the spatial.ConvexHull function from the Python package scipy and then converting this value in physical volume units through the previously estimated length scale.

Epigenetics signature of binding sites

To investigate the biological nature of the model binding sites, we compared their genomic locations with available epigenetic marks. To this aim, cross correlation analysis (Fig. 6 and Supplementary Fig. 11) has been performed as previously described^11,30. Epigenetics data (Fig. 6a and Supplementary Fig. 11a, left and right bottom panels) are taken from ref. ⁴ and have been first binned at 5 kb resolution in order to match the HiC resolution used to infer the binding site profiles. Then we computed the Pearson correlation between each specific binding site and epigenetic profiles, i.e., between the number of binding sites of a specific type (represented by a color in the left and right upper panels of Fig. 6a and Supplementary Fig. 11a) and the epigenetic signal in the corresponding 5 kb bins. Significance of these correlations has been estimated with a random control model generated by bootstrapping 10,000 times the original binding sites position along the locus and re-calculating the correlations. We symmetrically set the bottom 15th percentile and top 85th percentile as significance thresholds, although different thresholds led to similar results. The results are collected in a matrix (Fig. 6b and Supplementary Fig. 11b) where each element is the significant correlation between a specific type and epigenetic mark pair, or zero if the correlation is not significant. Typically, each type correlates with a combination of epigenetic marks, rather than with a specific one. Analogously, p-values of the changes in correlation with epigenetic tracks between Mock and SARS-CoV-2 has been estimated by comparing the differences with a control distribution of changes obtained by randomly bootstrapping the original binding sites (one-sided computation on a population of 1000 permutations). The top 4 most significant changes (i.e., the lowest p-values) were highlighted in Fig. 6.

Statistics and reproducibility

No statistical method was used to predetermine sample size. No data were excluded from the analyses. The experiments were not randomized. For all boxplots, the centre lines represent medians; box limits indicate the 25th and 75th percentiles; and whiskers extend 1.5 times the interquartile range (IQR) from the 25th and 75th percentiles. Mann−Whitney U test and t-test were commonly used to compare distributions; p < 0.05 was considered significant (*p < 0.05; **p < 0.01; ***p < 0.001).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Published HiC data⁴ and ChIP-seq data⁴ used in this work are available at the Gene Expression Omnibus (GEO) database with accession number GSE179184 CTCF binding motif is available from the JASPAR database (matrix profile MA0139.1). Polymer configurations of DDX58 locus are provided as Supplementary Data 1. The polymer structures generated for the IFIT locus and Chr11 will be available from the authors upon request. Please contact Andrea M. Chiariello at andreamaria.chiariello@infn.it. Requests of these data will be answered within approximately two weeks. Source data are provided with this paper.

Code availability

Codes used to perform simulations presented in this paper are available in the Supplementary Software 1 folder. The software used for Molecular Dynamics simulations is HOOMD, version 2.9.6. Full information and additional documentation are available at the github link: https://github.com/ehsanirani/PhaseSeparation-LoopExtrusion-MD¹⁰.

References

Carvalho, T., Krammer, F. & Iwasaki, A. The first 12 months of COVID-19: a timeline of immunological insights. Nat. Rev. Immunol. 21, 245–256 (2021).
Article CAS PubMed PubMed Central Google Scholar
Kee, J. et al. SARS-CoV-2 disrupts host epigenetic regulation via histone mimicry. Nature 610, 381–388 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Ho, J. S. Y. et al. TOP1 inhibition therapy protects against SARS-CoV-2-induced lethal inflammation. Cell 184, 2618–2632.e17 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wang, R. et al. SARS-CoV-2 restructures host chromatin architecture. Nat. Microbiol. 8, 679–694 (2023).
Article CAS PubMed PubMed Central Google Scholar
Zazhytska, M. et al. Non-cell-autonomous disruption of nuclear architecture as a potential cause of COVID-19-induced anosmia. Cell 185, 1052–1064.e12 (2022).
Article CAS PubMed PubMed Central Google Scholar
Kempfer, R. & Pombo, A. Methods for mapping 3D chromosome architecture. Nat. Rev. Genet. 21, 207–226 (2020).
Article CAS PubMed Google Scholar
Misteli, T. The self-organizing genome: principles of genome architecture and function. Cell 183, 28–45 (2020).
Article CAS PubMed PubMed Central Google Scholar
Bianco, S. et al. Computational approaches from polymer physics to investigate chromatin folding. Curr. Opin. Cell Biol. 64, 10–17 (2020).
Article CAS PubMed Google Scholar
Brackey, C. A., Marenduzzo, D. & Gilbert, N. Mechanistic modeling of chromatin folding to understand function. Nat. Methods 178, 767–775 (2020).
Article Google Scholar
Conte, M. et al. Loop-extrusion and polymer phase-separation can co-exist at the single-molecule level to shape chromatin folding. Nat. Commun. 131, 1–13 (2022).
Google Scholar
Bianco, S. et al. Polymer physics predicts the effects of structural variants on chromatin architecture. Nat. Genet. 50, 662–667 (2018).
Article CAS PubMed Google Scholar
Jost, D., Carrivain, P., Cavalli, G. & Vaillant, C. Modeling epigenome folding: formation and dynamics of topologically associated chromatin domains. Nucleic Acids Res. 42, 9553–9561 (2014).
Article CAS PubMed PubMed Central Google Scholar
Chiariello, A. M., Annunziatella, C., Bianco, S., Esposito, A. & Nicodemi, M. Polymer physics of chromosome large-scale 3D organisation. Sci. Rep. 6, 29775 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Sanborn, A. L. et al. Chromatin extrusion explains key features of loop and domain formation in wild-type and engineered genomes. Proc. Natl Acad. Sci. USA 112, E6456–E6465 (2015).
Article CAS PubMed PubMed Central Google Scholar
Fudenberg, G. et al. Formation of chromosomal domains by loop extrusion. Cell Rep. 15, 2038–2049 (2016).
Article CAS PubMed PubMed Central Google Scholar
Conte, M. et al. Polymer physics indicates chromatin folding variability across single-cells results from state degeneracy in phase separation. Nat. Commun. 11, 3289 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Schneider, W. M., Chevillotte, M. D., Rice, C. M. & Interferon-Stimulated Genes: a complex web of host defenses. Annu. Rev. Immunol. 32, 513–545 (2014).
Article CAS PubMed PubMed Central Google Scholar
Barbieri, M. et al. Complexity of chromatin folding is captured by the strings and binders switch model. Proc. Natl Acad. Sci. 109, 16173–16178 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Falk, M. et al. Heterochromatin drives compartmentalization of inverted and conventional nuclei. Nature 570, 395–399 (2019).
Shi, G., Liu, L., Hyeon, C. & Thirumalai, D. Interphase human chromosome exhibits out of equilibrium glassy dynamics. Nat. Commun. 91, 1–13 (2018).
Google Scholar
Shin, S., Shi, G. & Thirumalai, D. From effective interactions extracted using Hi-C data to chromosome structures in conventional and inverted nuclei. PRX Life 1, 013010 (2023).
Article Google Scholar
Abdennur, N. & Mirny, L. A. Cooler: scalable storage for Hi-C data and other genomically labeled arrays. Bioinformatics 36, 311–316 (2020).
Article CAS PubMed Google Scholar
Open2C, Abdennur, N., Abraham, S., Fudenberg, G., Flyamer, I. M., Galitsyna, A. A. et al. Cooltools: Enabling high-resolution Hi-C analysis in Python. PLoS Comput. Biol. 20, e1012067 (2024).
Hildebrand, E. M. & Dekker, J. Mechanisms and functions of chromosome compartmentalization. Trends Biochem. Sci. 45, 385–396 (2020).
Article CAS PubMed PubMed Central Google Scholar
Schwarzer, W. et al. Two independent modes of chromatin organization revealed by cohesin removal. Nature 551, 51–56 (2017).
Article ADS PubMed PubMed Central Google Scholar
Rao, S. S. P. et al. Cohesin loss eliminates all loop domains. Cell 171, 305–320.e24 (2017).
Article CAS PubMed PubMed Central Google Scholar
Blanco-Melo, D. et al. Imbalanced host response to SARS-CoV-2 drives development of COVID-19. Cell 181, 1036–1045.e9 (2020).
Article CAS PubMed PubMed Central Google Scholar
Bintu, B. et al. Super-resolution chromatin tracing reveals domains and cooperative interactions in single cells. Science 362, eaau1783 (2018).
Article ADS PubMed PubMed Central Google Scholar
Arkin, H. & Janke, W. Gyration tensor based analysis of the shapes of polymer chains in an attractive spherical cage. J. Chem. Phys. 138, 054904 (2013).
Article ADS PubMed Google Scholar
Chiariello, A. M. et al. A dynamic folded hairpin conformation is associated with α-Globin activation in erythroid cells. Cell Rep. 30, 2125–2135.e5 (2020).
Article CAS PubMed Google Scholar
Oudelaar, A. M. et al. Single-allele chromatin interactions identify regulatory hubs in dynamic compartmentalized domains. Nat. Genet. 50, 1744–1751 (2018).
Article CAS PubMed PubMed Central Google Scholar
Chiariello, A. M., Corberi, F. & Salerno, M. The interplay between phase separation and gene-enhancer communication: a theoretical study. Biophys. J. 119, 873–883 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Gabriele, M. et al. Dynamics of CTCF- and cohesin-mediated chromatin looping revealed by live-cell imaging. Science 376, 476–501 (2022).
Article ADS Google Scholar
Esposito, A. et al. Polymer physics reveals a combinatorial code linking 3D chromatin architecture to 1D chromatin states. Cell Rep. 38, 110601 (2022).
Article CAS PubMed Google Scholar
Brackley, C. A., Taylor, S., Papantonis, A., Cook, P. R. & Marenduzzo, D. Nonspecific bridging-induced attraction drives clustering of DNA-binding proteins and genome organization. Proc. Natl Acad. Sci. USA 110, E3605–E3611 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Ringel, A. R. et al. Repression and 3D-restructuring resolves regulatory conflicts in evolutionarily rearranged genomes. Cell 185, 3689–3704.e21 (2022).
Article CAS PubMed PubMed Central Google Scholar
Gordon, D. E. et al. A SARS-CoV-2 protein interaction map reveals targets for drug repurposing. Nature 583, 459–468 (2020).
Heinz, S. et al. Transcription elongation can affect genome 3D structure. Cell 174, 1522–1536.e22 (2018).
Article CAS PubMed PubMed Central Google Scholar
Dekker, J. et al. The 4D nucleome project. Nature 549, 219–226 (2017).
Lieberman-Aiden, E. et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Kremer, K. & Grest, G. S. Dynamics of entangled linear polymer melts: a molecular-dynamics simulation. J. Chem. Phys. 92, 5057–5086 (1990).
Article ADS CAS Google Scholar
Allen, M. P. & Tildesley, D. J. Computer Simulation of Liquids (Oxford Science Publications) SE - Oxford science publications. Oxford Univ. Press (1989).
Anderson, J. A., Glaser, J. & Glotzer, S. C. HOOMD-blue: a Python package for high-performance molecular dynamics and hard particle Monte Carlo simulations. Comput. Mater. Sci. 173, 109363 (2020).
Article CAS Google Scholar
Buckle, A., Brackley, C. A., Boyle, S., Marenduzzo, D. & Gilbert, N. Polymer Simulations of Heteromorphic Chromatin Predict the 3D Folding of Complex Genomic Loci. Mol. Cell 72, 786–797.e11 (2018).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

A.M.C. acknowledges “Programma per il Finanziamento della Ricerca di Ateneo Linea B” (FRA) 2020, University of Naples Federico II, CINECA ISCRA Grant ID PhaSSep - HP10C8JWU7. M.N. acknowledges support from the National Institutes of Health Common Fund 4D Nucleome Program grant 5 1UM1HG011585-03, EU H2020 Marie Curie ITN n.813282, PNRR MUR M4C2 CN00000041 “National Center for Gene Therapy and Drugs based on RNA Technology” NextGenerationEU CUP E63C22000940007, MUR PRIN 2022 2022R8YXMR, and computer resources from INFN, CINECA, ENEA CRESCO/ ENEAGRID88 and Ibisco at the University of Naples.

Author information

These authors contributed equally: Andrea M. Chiariello, Alex Abraham.

Authors and Affiliations

Dipartimento di Fisica, Università degli Studi di Napoli Federico II, and INFN Napoli, Complesso Universitario di Monte Sant’Angelo, 80126, Naples, Italy
Andrea M. Chiariello, Alex Abraham, Simona Bianco, Andrea Esposito, Andrea Fontana, Mattia Conte & Mario Nicodemi
Dipartimento di Ingegneria Elettrica e delle Tecnologie dell’Informazione - DIETI, Università degli Studi di Napoli Federico II, and INFN Napoli, Via Claudio 21, 80125, Naples, Italy
Francesca Vercellone
Berlin Institute for Medical Systems Biology at the Max Delbruck Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
Mario Nicodemi

Authors

Andrea M. Chiariello
View author publications
You can also search for this author in PubMed Google Scholar
Alex Abraham
View author publications
You can also search for this author in PubMed Google Scholar
Simona Bianco
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Esposito
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Fontana
View author publications
You can also search for this author in PubMed Google Scholar
Francesca Vercellone
View author publications
You can also search for this author in PubMed Google Scholar
Mattia Conte
View author publications
You can also search for this author in PubMed Google Scholar
Mario Nicodemi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.M.C and M.N. designed the project. A.M.C. and A.A. developed the modeling part; A.M.C., A.A., S.B., A.E., and A.F. ran the computer simulations; S.B and A.E. are other equal contributing authors; A.M.C., A.A., S.B., A.E., F.V., A.F. performed data analyses. A.M.C. wrote the manuscript with input from the other authors.

Corresponding authors

Correspondence to Andrea M. Chiariello or Mario Nicodemi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Software 1

Supplementary Movie 1

Supplementary Movie 2

Supplementary Movie 3

Supplementary Movie 4

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chiariello, A.M., Abraham, A., Bianco, S. et al. Multiscale modelling of chromatin 4D organization in SARS-CoV-2 infected cells. Nat Commun 15, 4014 (2024). https://doi.org/10.1038/s41467-024-48370-6

Download citation

Received: 27 July 2023
Accepted: 29 April 2024
Published: 13 May 2024
DOI: https://doi.org/10.1038/s41467-024-48370-6

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.