Introduction

Robustness, the ability to maintain biological function in the face of perturbation, is considered a fundamental property of evolvable complex systems1. Micro-organisms achieve robustness to environmental perturbations (for example changes in nutrient levels, pH or temperature) through a large-scale re-arrangement of gene regulation known as the stress response. In broad terms, this response consists of shifting resources away from growth and proliferation and towards protective mechanisms2,3. These transient changes are accompanied by longer term effects: exposure to stress results in higher tolerance against future perturbation for both the same and different types of stress4. While this cross-protection effect between environmental stresses is well documented, less is known about the relationship between the stress response and robustness to genetic perturbations (‘mutational robustness’).

There is some evidence to suggest environmental and mutational robustness are mediated via similar mechanisms: a gene’s contribution to mutational robustness correlates with its effect on environmental robustness5. Cross-protection could thus extend from environmental perturbations to genetic perturbations. Indeed, exposure to stress has been shown to decrease the phenotypic effects of some mutations in Caenorhabditis elegans6. In Saccharomyces cerevisiae, a systematic study of single-gene deletions found that the average effect of gene deletion on growth rate was smaller in a number of stress-inducing environments than under standard conditions, suggesting that the stress response may indeed protect against the effects of mutation7. Similar results have also been reported in Escherichia coli8. Furthermore, as exposure to stress is associated with an increased rate of mutation9, an increase in mutational robustness could be beneficial to stressed cells. Additionally, there is evidence to suggest that stress acts as a driver of evolutionary innovation2. Although it seems somewhat counter intuitive, increased mutational robustness can, under some circumstances, promote greater evolvability10. It is therefore interesting to ask whether exposure to stress also increases mutational robustness.

Network models are well established tools for studying robustness in complex systems11. A network is a representation of some entities of interest (nodes) and the relationships between them (edges). Network robustness is typically studied by removing a proportion of a network’s nodes (or edges) and measuring the change in some network property or properties relating to overall network connectivity. The tacit assumption is that these properties capture how well the network is able to perform its function.

In physical protein interaction networks (where edges indicate protein binding), there is evidence to suggest that node removal captures the consequences of loss of function mutations: high degree nodes have been shown to (a) considerably disrupt measures of network connectivity when removed from the network and (b) correspond to genes likely to be essential12. To some extent, this centrality-lethality relationship may be accentuated by biases in interactome mapping: high-throughput protein interaction detection techniques have been found to favour highly expressed and highly conserved proteins13, both of which are also likely features of essential proteins. Indeed, the centrality-lethality correlation has been found to be missing in high quality physical interaction networks mapped using techniques less likely to exhibit these types of biases14. Nevertheless, the effect is unlikely to be entirely caused by bias because it has been documented in a wide range of networks15, including co-expression networks (see below)16, which are not susceptible to the same mapping biases as physical interaction networks.

Much of the emphasis in the literature, particularly for work on real-world networks, has been on comparing random node failure to targeted removal of central nodes. In general terms, complex real-world networks have been found to be resilient against random failure, but vulnerable in the face of targeted attack17,18,19. Work on random node failure has focused almost entirely on average behaviour: to our knowledge, only a single study has discussed the range of the damage level that can be caused by random node removal. Trajanovski et al. used repeated realisations of random node removal to generate a probability density function for the damage in a number of theoretical network models and real-world networks20. The authors reported that two networks with a similar average robustness may differ considerably in the range of the expected network damage. Trajanovski et al. did not focus in detail on the shape of the probability density function. Given the stochasticity of biological processes, we speculated that the shape of this distribution may have substantial consequences in terms of the cell’s ability to function and survive.

In this work, we use fission yeast (Schizosaccharomyces pombe) co-expression networks previously derived from expression data from different yeast strains before and after exposure to oxidative stress21, to study the effect of stress on mutational robustness. A co-expression network represents similarities in genes’ patterns of expression. As genes involved in the same function are likely to be co-regulated and therefore co-expressed, these networks capture functional associations between proteins22. Because they are not affected by the same biases in interactome mapping as physical interaction networks, co-expression networks provide a valuable complementary approach. This is particularly pertinent when studying the effects of different cellular environments or conditions: there is little data mapping physical interactions under different conditions whereas condition-specific co-expression data is available. Co-expression networks therefore provide a unique opportunity to study condition dependent changes in protein network topology.

Results

Damage distribution shape is different in SF and ER networks

First we compared the damage to scale-free (SF) Barabasi-Albert graphs23 and Erdős-Rényi (ER) random graphs under random node removal. We examined damage to the network, measured in terms of decrease in efficiency (a network property capturing the global connectivity of the network - see Methods), after removal of an increasing proportion of the nodes in random order. To produce an estimate of the damage probability density function (or ‘damage distribution’), we repeated this procedure 500 times (500 ‘realisations’) (Fig. 1). As reported previously, SF networks are, on average, less damaged by node removal. Importantly, however, the shape of the damage distribution is different, with greater variance and skewness in the SF network. (Skewness is a measure of the asymmetry of a distribution where a positive value indicates the data is spread to the left of the mean and is given by s = E(x − μ)33 where μ and σ are the mean and standard deviation of x and E(t) represents the expected value of t).

Figure 1
figure 1

Damage to the network (measured as decrease in efficiency) in response to removal of an increasing proportion of the nodes in a SF and ER network.

Each dotted line represents one realisation of random node removal, with a total of 500 realisation for each network. The solid lines represent average behaviour across realisations. The difference in average damage (ER minus SF) is shown in green, corresponding to the y-axis on the right.

For example, at 10% node removal, the mean damage is 0.23 vs 0.24, the variance is 4.3 × 10−4 vs 6.5 × 10−5 and the skewness is 0.99 vs 0.10 (SF and ER, respectively). This pattern is preserved up to almost complete breakdown of the network (see Supplementary Table 1 for details). In order to verify that this behaviour is not a property of these particular network realisations, we repeated the 500 removal realisations in 100 networks of each type (Fig. 2).

Figure 2
figure 2

Distribution of the damage to the network (measured in terms of decrease in efficiency) after removal of 10% of the nodes for 500 realisations of random node removal for SF and ER networks.

Each dotted line corresponds to the distribution of damage for a single network. The vertical lines indicate average damage across the 100 networks and 500 realizations.

Removal of essential genes causes greater loss of efficiency than removal of non-essential genes

Using node removal to model loss of function mutation assumes that the change in network properties (in our case network efficiency) reflects the phenotypic consequences of the mutation. To verify this assumption for the co-expression networks used in this study, we compared the loss of efficiency caused by removal of essential and non-essential genes from the pre-stress network, using a set of essential (n = 1092) and non-essential (n = 3428) genes downloaded from PomBase24. Removal of essential genes did indeed cause greater average damage than removal of non-essential genes (4.89 × 10−4 vs 4.31 × 10−4, p = 0.0047, Wilcoxon rank-sum), confirming node removal can indeed be used as a model for loss of function mutation.

Damage distribution shape is different in yeast co-expression networks after exposure to stress

To study the effect of stress on the mutational robustness of fission yeast, we examined the damage caused by random node removal in the co-expression networks before and after exposure to oxidative stress (Fig. 3). Across 500 realisations, the average damage was smaller after exposure to stress ( as estimated by bootstrapping; see Methods; 95% CI [0, 3.7 × 10−5]), suggesting stress increases mutational robustness.

Figure 3
figure 3

Damage in response to removal of an increasing proportion of the nodes in fission yeast co-expression networks before and after exposure to stress.

The top panel (a) shows results for the full simulation (up to removal of all nodes), while the bottom panel (b) focuses on the removal of a smaller number of nodes (up to 2%). Damage is measured as the decrease in network efficiency. Each dotted line represents one realisation of random node removal, with a total of 500 realisation for each network. The solid lines represent average behaviour across realisations. The difference in average damage (pre-stress minus post-stress) is shown in green, corresponding to the y-axis on the right. The co-expression networks were derived from expression data from different yeast strains before and after exposure to oxidative stress21.

The effect of stress on the mean damage, although significant, is small in numerical terms (see Supplementary Tables 2 and 3). However, as with the ER and SF network models, there is a clear difference in the shape of the damage distribution: as illustrated in Fig. 4, both the variance and skewness (from removal of 1% of the nodes to removal of 50% of the nodes) of the distribution are greater after exposure to stress. Thus, after exposure to stress, while most of the time random loss of function mutations have a smaller effect on the network than prior to the stress, they occasionally lead to catastrophic damage.

Figure 4
figure 4

Variance (left) and skewness (right) of the distribution of damage to the network (i.e. decrease in efficiency) in response to random node removal.

This distribution arises from 500 realizations depicted in Fig. 3. The top panels (a,b) show results for the full simulation (up to removal of all nodes), while the bottom panels (c,d) focus on the removal of a smaller number of nodes (up to 2%). These changes in the damage distribution indicate that while on average, loss of function mutations have a smaller effect after exposure to stress, they will occasionally lead to catastrophic failure in the post-stress network.

For the removal of a small number of nodes (2 and 6, corresponding to 0.03% and 0.1% of the nodes in the network), the skewness of the distribution was greater prior to the stress. While this could reflect a genuine difference in the shape of the damage distribution when removing a smaller number of nodes, this behaviour could also result from uncertainty related to the relatively small number of realisations (n = 500): if nodes associated with catastrophic damage when removed are relatively rare, 500 realisations may not be enough to fully explore the damage distribution for removal of a small number of nodes. In order to address this, we examined the effect of removing a single node, for all nodes in the network. This replicated our earlier results: the average damage is smaller after exposure to stress (4.1 × 10−4 vs 3.9 × 10−4 before and after stress, respectively), while the variance and skewness both increase (3.1 × 10−7 vs 4.8 × 10−7 for variance and 2.0 vs 3.5 for skewness).

Stress-induced changes are not explained by a change in degree distribution

The different robustness of the ER and SF networks are often explained in terms of their degree distributions17. However, the pre- and post-stress networks’ degree distributions appear quite similar (see Supplementary Figure 1). We therefore sought to establish whether the changes in the degree distribution were enough to explain the change to the damage distribution.

We generated control networks with identical expected degree distributions to the co-expression networks (see Methods), but otherwise differing network structure and examined the shape of the damage distribution for 500 realisations of removing 10% of the nodes in these control networks (see Supplementary Figure 2). Averaging across 20 control networks, the mean damage was 0.215 and 0.211 (pre- and post-stress, respectively), the variance 1.17 × 10−4 and 1.68 × 10−4 and skewness 0.03 and −0.01. Thus, based on the change in degree distribution alone, we would expect a smaller increase in variance and a decrease in skewness. We conclude that the changes observed in the shape of the co-expression network damage distribution are not explained by the change in degree distribution.

Damage distribution shape has functional consequences

To illustrate the potential consequences of the differences in the shape of the damage distribution for the cell, we assume a cell’s survival probability depends on the permissiveness of its environment. In unchallenging environments, the cell has a high damage tolerance: the network can sustain a large amount of damage before the survival of the cell is impaired. In difficult environments, on the other hand, damage tolerance is lower: a smaller amount of network damage will impair the cell’s survival. As proof of concept, we created a simple model in which the cell only survives if the network damage is below threshold t (Fig. 5). The overall probability of survival r thus depends on the damage distribution p(d):

Figure 5
figure 5

Illustration of how different network structures may be beneficial in different environments.

The left hand panel (a) shows the damage distribution p(d) for 10% node removal for the pre- and post-stress networks. The damage tolerance t, capturing the level of network damage the cell can sustain before dying is also illustrated. The right hand panel (b) shows the relationship between overall survival probability and the damage tolerance for both networks. For high damage tolerance (i.e. unchallenging environments where the cell is able to survive despite a high level of network damage), the pre-stress network has a higher overall survival probability than the post-stress network because of the occasional catastrophic failures of the post-stress network. However, if the damage tolerance is low, the pre-stress network performs better than the post-stress network. Thus, the shape of the damage distribution can have important consequences for the cell.

As shown in Fig. 5, the optimal damage distribution depends on the damage tolerance: at high damage tolerance (unchallenging environments), the pre-stress network has a greater overall probability of survival because in the post-stress cells, the occasional catastrophic failures fall beyond the tolerated damage. At low damage tolerance (difficult environments) however, the post-stress network performs better than the pre-stress network because very low damage is more common in the post-stress network. This illustrates how the shape of the damage distribution can have meaningful consequences for the cell’s ability to function and survive.

Discussion

Our network model of mutational robustness suggests that exposure to stress makes cells more resistant to loss of function mutations. Indeed, it is known that various forms of stress increase resistance to U.V. radiation and other DNA-damaging agents in both prokaryotic and eukaryotic single-celled organisms25. This effect is generally attributed to changes in the cell’s DNA repair mechanism. Our results, however, raise the possibility that stress-induced protection from DNA-damaging agents could instead arise from increased tolerance to DNA damage conferred by wider changes to the cell’s functional organisation. This idea is supported by the observation that in budding yeast (Saccharomyces cerevisiae), few, if any, of the genes necessary for DNA repair are induced by exposure to DNA-damaging agents (including oxidative stress)26. This suggests the increased resistance to further insults produced by exposure to these stressors is mediated by mechanisms other than improved DNA repair.

The difference in average damage from node removal, though statistically significant, is relatively small. There is, however, a clear change in the shape of the damage distribution: the post-stress distribution is skewed towards low damage and has a longer tail at higher damage values. In other words, most of the time the post-stress network experiences a relatively low level of damage while occasionally succumbing to very high damage levels.

We find that changes in the degree distribution of the co-expression networks after exposure to stress do not fully explain the change in damage distributions. A significant body of previous work suggests that other properties, such as clustering coefficient, degree-degree correlation and local structure (short loops and graphlets), provide further insight into the biological function of the network. For example, degree-degree correlation has been shown to play a role in network tolerance to node removal in both network models27 and protein networks28,29. It is therefore possible that stress-induced changes in these other properties could predict the change in damage distribution.

We have previously documented changes in the modular structure of these co-expression networks in response to stress: sets of genes become more tightly co-regulated but co-regulation between these modules is decreased21. Whether this increase in network modularity accounts for the increase in robustness to node removal is unclear: while there is evidence pointing to a negative association between modularity and robustness in dynamic network models30,31, the relationship between modularity and static network models remains less well studied. Our previous work also suggested a stress-induced decrease in centrality of genes involved in various cellular processes such as ion transport and metabolism - the biological interpretation of the change in damage distribution upon node removal may therefore relate to changes in the importance of these cellular processes.

As discussed previously, exposure to stress is associated with a higher rate of mutation9. Increased mutational robustness would therefore allow cells to tolerate the higher frequency of deleterious mutations but also enable stressed cells to explore a greater range of new, potentially adaptive, phenotypes. Furthermore, exposure to stress is associated with higher levels of heterogeneity within a population of cells, making it more likely for at least part of the population to survive a change in conditions (‘bet hedging’)2. It is therefore possible that the increased variance of the damage distribution relates to the increase in heterogeneity under stressed conditions.

We have also demonstrated that the change in shape of the damage distribution can have functional consequences for the cell: which network has the better overall chance of survival depends on the level of network damage tolerated before the survival of the cell is impaired. This suggests another possible interpretation for the change in network behaviour: it is tempting to speculate that in stress-inducing environments survival may indeed be impaired at a lower level of network damage because of the additional burden imposed by the stress. In other words, it is plausible that cells in stressed environments have lower damage tolerance. The change in the damage distribution may therefore represent a strategy optimising overall survival probability of the cell in face of random mutations. Whether or not this interpretation is correct, we have demonstrated the importance of considering the shape of the distribution of damage caused by random node removal instead of simply focusing on worst-case or average behaviour.

Methods

Damage measure

Damage to the network was measured in terms of decrease in network efficiency e, given by:

with ρ indicating the length of the shortest path between nodes i and j. The change in network efficiency in response to node removal was measured by the normalized change in efficiency , where ef is the efficiency after removal of fraction f of the nodes. Adopting the approach taken by Trajanovski et al.20, removal of a node from the network was modelled by deletion of all the node’s edges instead of removal of the node itself. Thus, when computing ef, n remains equal to the number nodes in the original network. Using this approach, the damage measure (df) is constrained between 0 (no damage) and 1 (fully disconnected network).

Theoretical network models

Resilience to node removal was compared in scale-free (SF) Barabasi-Albert graphs23 and Erdős-Rényi (ER) random graphs. Network generation, node removal and path calculations were all implemented using the NetworkX package for Python.

SF networks were generated according to the preferential attachment model: the network is initialized with m nodes and grown one node at the time, until a network with n nodes is reached. Each new node attaches to m of the existing nodes. The probability of attaching to existing node ii) is proportional to the degree of i (, where ki is the degree of i).

ER networks were generated by initializing a network with n nodes and connecting each pair of nodes with probability p.

In the work presented here, we used values n = 1000 and m = 2, giving a SF network with 1000 nodes and 1996 edges. This corresponds to a p of 0.004 for ER network generation. Because ER network generation is a probabilistic process, there was slight variation in the number of edges in the ER network.

Co-expression networks

Gene co-expression networks were constructed using gene expression data from different genetic strains in fission yeast, before and after exposure to oxidative stress (0.5 mM hydrogen peroxide, H2O2). Spearman correlation coefficients were computed across the genetic variants for each gene pair, before and after exposure to stress. To generate the networks, 4 × 104 gene pairs with the highest significant (p < 0.05) correlation coefficients were considered connected, yielding an unweighted network. All nodes, even those with no edges were included in the network when modelling node removal. More details on network generation can be found in Lehtinen et al.21.

Control networks

In order to study the effect of a network’s degree distribution on the shape of the damage distribution, we generated control networks with identical expected degree distribution but otherwise differing network structure. For a network with degree sequence W = (w1, w2, ..., wn), control networks were built by assigning an edge between nodes i and j with probability when i ≠ j and pij = 0 when i = j. The control networks were generated using the python module NetworkX.

Statistical testing

Bootstrapping was used to estimate a p-value for the difference in damage to the two networks. Bootstrapping was chosen because the non-normality and difference in shape of the damage distributions violate the assumptions of tests such as the t-test or the Wilcoxon rank-sum test.

As we were interested in the damage for all fractions of nodes removed, we used the difference in total damage across the sampled percentages as the measure of interest. The pre-stress and post-stress realisations were randomly re-shuffled 105 times and the measure of interest computed for each permutation, giving a null distribution from which the p-value was estimated. The confidence interval for the p-value was calculated using the binomial distribution. For comparison, p-values from the Wilcoxon rank-sum test are included in Supplementary Table 2 and 3.

Modelling the effects of the damage distribution

The overall probability of survival r was modelled as a function of the damage distribution p(d) (as estimated from the 500 node removal realisations, for 10% node removal) and a damage tolerance threshold t:

Additional Information

How to cite this article: Lehtinen, S. et al. Co-Expression Network Models Suggest that Stress Increases Tolerance to Mutations. Sci. Rep. 5, 16726; doi: 10.1038/srep16726 (2015).