Dynamic patterns of information flow in complex networks

Harush, Uzi; Barzel, Baruch

doi:10.1038/s41467-017-01916-3

Article
Open access
Published: 19 December 2017

Uzi Harush¹ &
Baruch Barzel¹

Nature Communications volume 8, Article number: 2181 (2017)

Abstract

Although networks are extensively used to visualize information flow in biological, social and technological systems, translating topology into dynamic flow continues to challenge us, as similar networks exhibit fundamentally different flow patterns, driven by different interaction mechanisms. To uncover a network’s actual flow patterns, here we use a perturbative formalism, analytically tracking the contribution of all nodes/paths to the flow of information, exposing the rules that link structure and dynamic information flow for a broad range of nonlinear systems. We find that the diversity of flow patterns can be mapped into a single universal function, characterizing the interplay between the system’s topology and its dynamics, ultimately allowing us to identify the network’s main arteries of information flow. Counter-intuitively, our formalism predicts a family of frequently encountered dynamics where the flow of information avoids the hubs, favoring the network’s peripheral pathways, a striking disparity between structure and dynamics.

Introduction

Recent years have witnessed major advances in our ability to map the structure of many natural and man-made complex systems^{1,2,3,4,5}, from social networks⁶ and infrastructure systems^{7, 8} to sub-cellular interaction networks. This mapping has uncovered several universal characteristics, observed across networks of vastly different domains, such as the small-world phenomenon¹¹ or the commonly observed fat-tailed degree^{12, 13} and weight^{14, 15} distributions. Our ultimate goal, however, is to translate these structural characteristics into functional predictions pertaining to the system’s dynamic behavior^{16,17,18}. For instance, we wish to use the topology of the gene regulatory network to gain insight into the functional pathways along which genetic information is transmitted^{19, 20}, or to translate the social network topology into predictions on the propagation of influence through social ties^{21, 22}. The problem is that information flow is not determined solely by the static network topology, but also by the nonlinear dynamics characterizing the interactions between the nodes^{18, 23}. Hence, the same network may exhibit fundamentally different patterns of information flow under different dynamics: epidemic spread, ecological interactions, or genetic regulation.

To observe these patterns we employ here a perturbative approach, a fundamental tool to uncover information propagation²⁴, specifically applicable in the context of network dynamics^{17, 18, 25, 26}. We then analytically track the propagation of signals between nodes, identifying the main pathways through which these signals penetrate the network. Our results show that despite the diversity of potential interaction mechanisms, the patterns of information flow are governed by universal laws that can be directly linked to the system’s microscopic dynamics.

Results

Quantifying information flow

We consider a system of N components (nodes) linked via a weighted and directed network A _ij. Each node is characterized by a time dependent activity x _i(t), i = 1, …, N, whose meaning depends on the specific application: for instance, the concentration of a protein in a cellular network, the abundance of a species in an ecological network or the probability of infection of an individual in a social network. The system’s dynamics is driven by^{18, 27}

$$\frac{{{\mathrm{d}}x_i}}{{{\mathrm{d}}t}} = M_0\left( {x_i} \right) + \mathop {\sum}\limits_{j = 1}^N A_{ij}M_1\left( {x_i} \right)M_2\left( {x_j} \right),$$

(1)

where the first term on the r.h.s. accounts for i’s self-dynamics, and the second term captures the impact of i’s interacting partners. By appropriately selecting the nonlinear functions M = (M ₀(x), M ₁(x), M ₂(x)), Eq. (1) provides a rather general description of complex system dynamics, including frequently used models to describe the behavior of social^{21, 28,29,30}, biological^{25, 31,32,33} and technological^{34, 35} systems (Table 1). Note that in (1) the weighted link A _ij represents the rate of incoming influence from x _j to x _i, hence A _ij = A _i←j, a directed link outgoing from j, incoming to i.
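To make the structure of Eq. (1) concrete, the following is a minimal Python sketch that integrates it to a steady state for user-supplied M ₀, M ₁ and M ₂. The function names (`rhs`, `steady_state`), the forward-Euler scheme, the step size and the three-node test network are illustrative assumptions and are not taken from the paper or from the authors' Matlab code. The example uses the regulatory form of Eqs. (12) and (13) with a = 1 and h = 2.

```python
from typing import Callable, List

Vector = List[float]
Matrix = List[List[float]]
Func = Callable[[float], float]


def rhs(x: Vector, A: Matrix, M0: Func, M1: Func, M2: Func) -> Vector:
    """Right-hand side of Eq. (1): M0(x_i) + sum_j A_ij * M1(x_i) * M2(x_j)."""
    m2 = [M2(xj) for xj in x]
    return [
        M0(xi) + M1(xi) * sum(a_ij * m for a_ij, m in zip(row, m2))
        for xi, row in zip(x, A)
    ]


def steady_state(A: Matrix, M0: Func, M1: Func, M2: Func, x0: Vector,
                 dt: float = 0.01, tol: float = 1e-10,
                 max_steps: int = 200_000) -> Vector:
    """Relax Eq. (1) with forward-Euler steps until max |dx_i/dt| < tol.

    The integrator and its parameters are assumptions chosen for brevity.
    """
    x = list(x0)
    for _ in range(max_steps):
        dx = rhs(x, A, M0, M1, M2)
        if max(abs(d) for d in dx) < tol:
            break
        x = [xi + dt * di for xi, di in zip(x, dx)]
    return x


# Hypothetical 3-node network: A[i][j] is the weight of the link j -> i.
A = [[0.0, 2.0, 2.0],
     [2.0, 0.0, 2.0],
     [2.0, 2.0, 0.0]]

# Regulatory dynamics of Eq. (13) with a = 1, h = 2.
x_star = steady_state(
    A,
    M0=lambda x: -x,
    M1=lambda x: 1.0,
    M2=lambda x: x**2 / (1.0 + x**2),
    x0=[2.0, 2.0, 2.0],
)
print(x_star)
```

For this symmetric toy network every node obeys x = 4x²/(1 + x²), whose stable nonzero root is 2 + √3, which gives a simple sanity check on the integrator.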

Table 1 Network dynamics

We can track the propagation of a signal through the system (1) by following how a local perturbation in the steady-state activity of a source node n impacts the activities of all remaining nodes in the system, giving rise to the linear response matrix^{17, 18}

$$G_{mn} = \left| {\frac{{{\mathrm{d}}x_m{\mathrm{/}}x_m}}{{{\mathrm{d}}x_n{\mathrm{/}}x_n}}} \right| = \left| {\frac{{{\mathrm{d}}{\mathrm{log}}\,x_m}}{{{\mathrm{d}}{\mathrm{log}}\,x_n}}} \right|.$$

(2)

The terms of G _mn capture the level of information spread from the source n to a specific target node m. Summing over all targets, we obtain the total capacity of information distributed from n throughout the network as

$$Z_n = \mathop {\sum}\limits_{m = 1}^N G_{mn},$$

(3)

capturing the cumulative response of the system to the signal dx _n.

Consider the contribution of an intermediate node i to Z _n: first the signal dx _n reaches i, then i responds by shifting its own activity by dx _i, in effect creating a new signal that helps propagate dx _n to the rest of the network. If we now artificially set dx _i = 0, we freeze i’s activity, forcing it to remain unperturbed, and hence preventing it from propagating the signal dx _n onward. The result is $Z_n^{\left\{ i \right\}}$, capturing the level of information spread from n under the freezing of x _i, effectively blocking all flow of information n → m via pathways that pass through i (Fig. 1a). More generally, we can freeze the flow through an entire network path, Π = {i, A _ij, j, A _jk, k, …}, in which case we block the flow of information through a sequence of nodes and links, providing $Z_n^{\Pi}$. This allows us to quantitatively evaluate the contribution of Π to the flow from the source n as

$${\cal F}_n^{\Pi} = \frac{{Z_n - Z_n^{\Pi}}}{{Z_n}},$$

(4)

capturing the fraction of Z _n that was mediated through the Π pathway. Averaging over all n we obtain Π’s overall flow

$${\cal F}_{\Pi} = \frac{1}{N}\mathop {\sum}\limits_{n = 1}^N {\cal F}_n^{\Pi},$$

(5)

quantifying, systematically, the contribution of each pathway to the spread of signals (G _mn) throughout the network. In case ${\cal F}_{\Pi} \ll 1$, Π’s contribution to the flow of information in the system (1) is marginal; if, however, ${\cal F}_{\Pi} \to 1$, then almost all information flows through Π.
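The perturb-and-freeze procedure of Eqs. (2)–(5) can be sketched numerically as follows. This is a hedged illustration, not the authors' implementation. The assumptions are the regulatory dynamics of Eq. (12) with a = 1 and h = 2, a hypothetical three-node chain, a 1% perturbation that is held fixed on the source, forward-Euler relaxation, and an average in Eq. (5) taken over sources n ≠ i only, since freezing the source itself is not specified in the text. All function names are invented for this sketch.

```python
def relax(A, x0, pinned, a=1.0, h=2.0, dt=0.01, tol=1e-10, max_steps=200_000):
    """Euler relaxation of Eq. (12); nodes listed in `pinned` are held fixed."""
    x = list(x0)
    n_nodes = len(x)
    for _ in range(max_steps):
        m2 = [xj**h / (1.0 + xj**h) for xj in x]
        dx = [
            0.0 if i in pinned
            else -x[i]**a + sum(A[i][j] * m2[j] for j in range(n_nodes))
            for i in range(n_nodes)
        ]
        if max(abs(d) for d in dx) < tol:
            break
        x = [xi + dt * di for xi, di in zip(x, dx)]
    return x


def total_response(A, x_ss, n, frozen=(), alpha=0.01):
    """Z_n of Eq. (3); nodes in `frozen` stay at their unperturbed values."""
    x0 = list(x_ss)
    x0[n] = x_ss[n] * (1.0 + alpha)          # permanent perturbation of the source
    x = relax(A, x0, pinned={n, *frozen})
    # G_mn of Eq. (2): relative response of m per relative perturbation of n.
    G = [abs((x[m] - x_ss[m]) / x_ss[m]) / alpha for m in range(len(x))]
    return sum(G)


def node_flow(A, x_ss, i):
    """F_i of Eqs. (4)-(5) with Pi = {i}, averaged over sources n != i (assumption)."""
    fractions = []
    for n in range(len(x_ss)):
        if n == i:
            continue
        Z = total_response(A, x_ss, n)
        Z_frozen = total_response(A, x_ss, n, frozen=(i,))
        fractions.append((Z - Z_frozen) / Z)
    return sum(fractions) / len(fractions)


# Hypothetical undirected chain 0 - 1 - 2 with weight 3 on each link.
A = [[0.0, 3.0, 0.0],
     [3.0, 0.0, 3.0],
     [0.0, 3.0, 0.0]]

x_ss = relax(A, [3.0, 3.0, 3.0], pinned=set())
flows = [node_flow(A, x_ss, i) for i in range(3)]
print(flows)
```

In this toy chain the middle node carries every path between the two ends, so its flow comes out larger than that of either end node, which is the qualitative behavior the freezing construction is meant to capture.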

To place our proposed measure of flow (5) in context we emphasize the distinction between influence and flow. Most often, network components (nodes, links, pathways) are ranked according to their dynamic impact on the network, e.g., seeking the most influential nodes³⁶. In the context of our current formalism, such impact is captured by the magnitude of Z _n (3), namely the response of the system to n’s perturbation. However, most of the time a network component is not the source of information, but rather the mediator of the information that constantly travels between arbitrary locations on the network. For example, when a single gene n out of N ~ 10⁴ is perturbed, that gene is the only source of information, whereas the role of all remaining genes is to propagate n’s signal, supporting flow as mediators, not as sources. Hence ${\cal F}_{\Pi}$, designed to capture the efficiency of a pathway as a “pipe” rather than a source of information flow, provides a crucial, overlooked, metric of the ongoing dynamic role continuously played by all network components.

Observing the patterns of flow

To observe the diverse patterns of flow exhibited by (1) we constructed a set of model and empirical networks, capturing systems from a broad range of scientific domains, including weighted scale-free networks with scale-free weights (SF1, undirected; SF2, directed); protein interactions from human and yeast cells (Human PPI³⁷ and Yeast PPI⁹); two online social networks (UCIonline³⁸ and Epoch³⁹) and a bipartite ecological network, capturing plant-pollinator relationships (ECO1, ECO2⁴⁰). We then implemented six different types of frequently used dynamic models M, capturing diverse forms of interaction mechanisms: the susceptible-infected-susceptible model^{21, 28,29,30} for epidemic spreading $\left( {\Bbb E} \right)$, biochemical interactions via mass-action-kinetics^{31, 41} $\left( {\Bbb B} \right)$, mutualistic dynamics in ecology⁴² $\left( {\Bbb M} \right)$, population dynamics^{32, 35} $\left( {\Bbb P} \right)$, and genetic regulation as captured by the Michaelis–Menten model^{33, 43} (${\Bbb R}_1$ and ${\Bbb R}_2$), all summarized in Table 1.

For each system we measured the flow through all nodes and edges, ${\cal F}_i$ and ${\cal F}_{ij}$, respectively. For ${\cal F}_i$ we selected Π = {i} in (5), a path including a single node, and for ${\cal F}_{ij}$ we repeated the calculation with Π = {A _ij}, freezing sequentially all edges. Hence, we obtain the contribution of all individual nodes $\left( {{\cal F}_i} \right)$ and edges $\left( {{\cal F}_{ij}} \right)$ to the flow of information in the system. We find that the patterns of flow exhibit an extremely high level of diversity across the different systems, as expressed by the distinct size distribution of nodes (or width of edges) across the twenty-four layouts presented in Fig. 2. For instance, in Fig. 2a–f we show the flow patterns obtained by applying different dynamics (M) to the same network (SF1). It shows that despite the fact that A _ij remains the same, the dynamic patterns of flow are highly distinctive. For ${\Bbb P}$ and ${\Bbb R}_1$ information flow is dominated by a few selected central nodes. In contrast, under ${\Bbb E}$ and ${\Bbb B}$ the same network exhibits a distributed flow, with almost all nodes equally contributing to the spread of information. Finally, ${\Bbb M}$ and ${\Bbb R}_2$ show yet another pattern of information flow, with a seemingly random scatter of flow hubs spread throughout the network. Such diversity is also observed for SF2 (Fig. 2g–l), or for the empirical networks, where the same topology exhibits profoundly different flow patterns, depending on the system’s dynamics (Fig. 2m–x). Hence, the patterns of flow are a consequence not just of the topology, but of the intricate interplay between this topology and the system’s interaction dynamics (Fig. 1b, c). Taken together, the twenty-four networks of Fig. 2, demonstrate a highly diverse set of flow patterns, illustrating the extreme challenge in predicting the dynamics of information flow in complex systems.

Predicting the system’s flow patterns

To understand the origins of the observed flow patterns we derive ${\cal F}_i$’s dependence on the network’s degree distribution, by linking it with the in and out weighted degrees of all nodes, $S_{i,{\mathrm{in}}} = \mathop {\sum}\nolimits_{j = 1}^N A_{ij}$ and $S_{i,{\mathrm{out}}} = \mathop {\sum}\nolimits_{j = 1}^N A_{ij}^{\top}$. We show in Supplementary Note 1 that, on average, information flow scales with a node’s in/out-degree as

$${\cal F}_i \sim S_{i,{\mathrm{out}}}S_{i,{\mathrm{in}}}^{\omega - 1},$$

(6)

where the scaling exponent ω is fully determined by the system’s dynamics M. To understand the contribution of M = (M ₀(x), M ₁(x), M ₂(x)) we link ω in Supplementary Note 1 to the Hahn series expansion

$$M_2\left( {W^{ - 1}(x)} \right) = \mathop {\sum}\limits_{\Gamma (n)} C_nx^{\Gamma (n)},$$

(7)

where W(x) = −M ₁(x)/M ₀(x) and W ⁻¹(x) denotes its inverse function. The Hahn⁴⁴ expansion (7) is a generalization of the Taylor expansion to allow for both negative and real powers; the powers Γ(n) represent a well-ordered set in ascending order with n, namely Γ(0) represents the leading power in the expansion of M ₂(W ⁻¹(x)), Γ(1) > Γ(0) is the next power and so on. The exponent ω in (6) can be linked directly to the system’s dynamics via (7) as

$$\omega = \left\{ {\begin{array}{*{20}{c}} {1 - \Gamma (0)} & {\Gamma (0) \ne 0} \\ {1 - \Gamma (1)} & {\Gamma (0) = 0} \end{array}} \right.,$$

(8)

hence ω is determined by the leading non-vanishing exponent in (7). While the specific value of ω depends on the dynamic model M (${\Bbb P},{\Bbb R}_1$, etc.) the formula (8) to extract it from a given model is universal, providing a step-by-step method for constructing the flow in (6). An explicit example is shown in Methods.
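The case distinction in Eq. (8) is small enough to write out directly. The sketch below assumes the ascending powers Γ(0) < Γ(1) < … of expansion (7) are already known and passed in as a list. The function name `omega_from_powers` is hypothetical, and exact fractions are used only to avoid rounding.

```python
from fractions import Fraction


def omega_from_powers(powers):
    """Apply Eq. (8): omega = 1 - Gamma(0) if Gamma(0) != 0, else 1 - Gamma(1).

    `powers` lists the exponents of expansion (7) in ascending order.
    """
    gamma0 = Fraction(powers[0])
    if gamma0 != 0:
        return 1 - gamma0
    return 1 - Fraction(powers[1])


# Regulatory dynamics from Methods: Gamma(0) = 0 and Gamma(1) = h/a.
omega_R1 = omega_from_powers([0, Fraction(1, 3)])  # a = 1, h = 1/3
omega_R2 = omega_from_powers([0, 2])               # a = 1, h = 2
print(omega_R1, omega_R2)
```

With the Methods values this reproduces ω = 2/3 for ${\Bbb R}_1$ and ω = −1 for ${\Bbb R}_2$.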

Equations (6–8) represent our first analytical result, exposing the rules that govern the flow of information in a complex network. The scaling exponent ω helps us link between the system’s structure (S _i,in, S _i,out) and its dynamic patterns of information flow $\left( {{\cal F}_i} \right)$, providing the connection we seek between the system’s topology and its actual observed flow patterns (Fig. 1d). In other words, Eq. (6) helps us translate topological characteristics, such as the weighted in/out degrees, into dynamic insights pertaining to the flow of information, thus addressing a fundamental challenge of network science^{17, 45}.

Most importantly, our formalism predicts that the diversity observed in Fig. 2 is, in fact, rooted in a deep universality, expressed by the mapping of structure to dynamics that appears in (6). To test this, we revisit the “zoo” of twenty-four diverse flow patterns presented in Fig. 2 and confront the observed flow through all nodes ${\cal F}_i$ with our universal prediction ${\cal F}_i^{{\mathrm{Th}}} = S_{i,{\mathrm{out}}}S_{i,{\mathrm{in}}}^{\omega - 1}$, taking for each system the relevant A _ij and the appropriate value of ω, as predicted by (8). Strikingly, we find in Fig. 3a that despite their diverse and unpredictable behavior, all layouts of Fig. 2 collapse onto the universal linear plot (solid line) predicted by (6). This collapse demonstrates the predictive power of our formalism, taking a set of fundamentally different systems, from gene regulation to online social networks, cast on extremely diverse networks, and showing that they are all driven by similar rules of information flow, encapsulated within the universal relationship (6).
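The theoretical quantity ${\cal F}_i^{{\mathrm{Th}}}$ used in this comparison depends only on the weight matrix and on ω, so it can be sketched in a few lines. The helper names and the four-node star network below are illustrative assumptions. The sketch also assumes every node has S _i,in > 0, and it returns values only up to the unspecified prefactor implied by the scaling relation (6).

```python
def weighted_degrees(A):
    """Return (S_in, S_out): row sums and column sums of A, where A[i][j] is j -> i."""
    n = len(A)
    s_in = [sum(A[i][j] for j in range(n)) for i in range(n)]
    s_out = [sum(A[j][i] for j in range(n)) for i in range(n)]
    return s_in, s_out


def predicted_node_flow(A, omega):
    """F_i^Th = S_out,i * S_in,i**(omega - 1), following Eq. (6) up to a prefactor."""
    s_in, s_out = weighted_degrees(A)
    return [so * si ** (omega - 1) for si, so in zip(s_in, s_out)]


# Hypothetical undirected star: node 0 is the hub, nodes 1-3 are leaves.
star = [[0, 1, 1, 1],
        [1, 0, 0, 0],
        [1, 0, 0, 0],
        [1, 0, 0, 0]]

flow_P = predicted_node_flow(star, 5 / 3)   # omega quoted for the P dynamics
flow_E = predicted_node_flow(star, 0)       # omega quoted for E and B
flow_M = predicted_node_flow(star, -1)      # omega quoted for M and R2
print(flow_P, flow_E, flow_M)
```

On this toy star the hub's predicted flow exceeds the leaves' for ω = 5/3, equals it for ω = 0, and falls below it for ω = −1.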

To gain a better grip on the mapping of (6) we focus specifically on the flow patterns of SF2 (original layouts appearing in Fig. 2g–l and presented again for convenience in Fig. 3c–h). In Fig. 3i–n we present the collapse plot of ${\cal F}_i$ vs. ${\cal F}_i^{{\mathrm{Th}}}$, this time only for SF2, showing each dynamics separately. Once again we observe the derived universality, in which all data collapses along the theoretically predicted solid lines. However, the important point here is that now we can observe how the role of all nodes changes across the different dynamics, as expressed through their location in each of the six plots. For instance, in ${\Bbb P}$, where ω = 5/3, Eq. (6) predicts that nodes with high S _i,in contribute more to the flow, hence occupying the top right quadrant of Fig. 3i, as noted by the direction of the red arrow. In contrast, for ${\Bbb R}_1$ (ω = 2/3, Fig. 3j) the flow negatively scales with S _i,in, concentrating the high in-degree nodes toward the bottom left quadrant, thus capturing the qualitative difference in flow patterns across the different models, as predicted by our theory. The effect becomes more dramatic as ω is decreased in ${\Bbb E},{\Bbb B},{\Bbb M}$ and ${\Bbb R}_2$, pushing the in-hubs further towards the limit of small ${\cal F}_i$ (bottom left), as symbolized by the length of the red arrows (Fig. 3k–n).

Next, we seek a similar universality to the one observed for ${\cal F}_i$ that can capture edge flow ${\cal F}_{ij}$. To observe this we show that, on average, the contribution of the A _ij link to the propagation of information follows (Supplementary Note 1)

$${\cal F}_{ij} \sim A_{ij}S_{i,{\mathrm{out}}}S_{i,{\mathrm{in}}}^{\xi - 1}S_{j,{\mathrm{in}}}^\xi ,$$

(9)

where ξ = ω − 1 and ω is taken from (8). Hence ${\cal F}_{ij}$, associated with the link from j to i (A _i←j) depends linearly on the link weight and on its target’s outgoing weighted degree S _i,out, a rather expected interpretation of topology into information flow. The role of i and j’s in-degrees, however, is more complex, affected also by the system’s dynamics through ξ. To test this prediction we measured the i,j-flow, ${\cal F}_{ij}$, through all links in the networks of Fig. 2. Once again, in Fig. 3b, we observe that the seemingly random behavior observed in Fig. 2 hides a deep universality, in which all systems, despite their diverse topology/dynamics, condense around the predicted linear plot ${\cal F}_{ij} \sim {\cal F}_{ij}^{{\mathrm{Th}}}$, where ${\cal F}_{ij}^{{\mathrm{Th}}}$ is taken from our analytically predicted (9). The specific results obtained from SF2 are expanded in Fig. 3o–t, also indicating the roles of large S _i,in (red arrows) and large S _j,in (blue arrows) in each system.
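The edge-level prediction of Eq. (9) can be evaluated in the same way as the node-level one. The sketch below is an illustration under assumptions: the function name and the star network are hypothetical, all in-degrees are assumed positive, and the output is defined only up to a constant prefactor. As a consistency check, for a network with S _i,in = S _i,out = S _i the expression should reduce to A _ij(S _i S _j)^ξ.

```python
def predicted_edge_flow(A, omega):
    """F_ij^Th = A_ij * S_out,i * S_in,i**(xi - 1) * S_in,j**xi with xi = omega - 1.

    A[i][j] is the weight of the link j -> i; only existing links are returned.
    """
    xi = omega - 1
    n = len(A)
    s_in = [sum(A[i][j] for j in range(n)) for i in range(n)]
    s_out = [sum(A[j][i] for j in range(n)) for i in range(n)]
    return {
        (i, j): A[i][j] * s_out[i] * s_in[i] ** (xi - 1) * s_in[j] ** xi
        for i in range(n)
        for j in range(n)
        if A[i][j] != 0
    }


# Hypothetical undirected star: hub 0 with weighted degree 3, leaves with degree 1.
star = [[0, 1, 1, 1],
        [1, 0, 0, 0],
        [1, 0, 0, 0],
        [1, 0, 0, 0]]

edge_flows = predicted_edge_flow(star, omega=-1)   # xi = -2
print(edge_flows[(0, 1)], (3 * 1) ** (-2))
```

Both printed numbers agree, since S _i · S _i^(ξ−1) · S _j^ξ = (S _i S _j)^ξ whenever in- and out-degrees coincide.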

The results of Fig. 3a, b and their expansion in Fig. 3i–t expose an extremely robust universality, sustained across multiple orders of magnitude and across diverse networks and dynamics. Together, they reveal an intricate balance: on the one hand, different systems exhibit highly distinct flow patterns, e.g., the different roles of in/out hubs in Fig. 3i–t. Yet, at the same time, all this richness, enabled by the topology/dynamics interplay, indeed the “zoo” of flow patterns observed in Fig. 2, is shown to originate from two universal, analytically predictable sources, Eqs. (6) and (9).

Universality classes of flow

To obtain a deeper understanding of the implications of the derived universality we now focus on undirected networks, namely networks where all links are bi-directional (A _ij ≠ 0 ⇔ A _ji ≠ 0), but not necessarily weight-symmetric, hence potentially A _ij ≠ A _ji. For randomly distributed weights, such networks have, on average, $S_{i,{\mathrm{in}}} \sim S_{i,{\mathrm{out}}} \equiv S_i$, which in (6) and (9) provide (Supplementary Note 1)

$${\cal F}_i \sim S_i^\omega$$

(10)

$${\cal F}_{ij} \sim A_{ij}\left( {S_iS_j} \right)^\xi .$$

(11)

These scaling relationships predict three highly distinctive dynamic universality classes:

Degree driven flow (ω > 0, Fig. 4a–f, red). In case ω > 0 the flow ${\cal F}_i$ in (10) increases with the weighted degree S _i, indicating that the flow of information is dominated by the high degree nodes. The greater is ω, the more pronounced is the effect and hence the more dominant is the role of the hubs. Equation (8) predicts that ${\Bbb P}$ and ${\Bbb R}_1$ belong to this class with ω = 5/3 and ω = 2/3, respectively. This analytical prediction is perfectly confirmed in Fig. 4a, d on the weighted scale-free network SF1 (circles).

Homogeneous flow (ω = 0, Fig. 4g–l, green). In case ω = 0 we have ${\cal F}_i$ independent of S _i, hence the contribution of the hubs to the flow of information is, on average, identical to that of the peripheral nodes. This represents homogeneous flow, where all nodes contribute nearly equally to the flow of information, independent of the network’s often heterogeneous degree distribution. Using Eq. (8) we predict that ${\Bbb E}$ and ${\Bbb B}$ belong to this class. Indeed, Fig. 4g, j indicates that despite the three orders of magnitude diversity in the weighted degrees S _i, their contribution to the flow is largely homogeneous.

Degree-averting flow (ω < 0, Fig. 4m–r, blue). For ${\Bbb M}$ and ${\Bbb R}_2$, Eq. (8) predicts ω = −1 < 0, indicating that ${\cal F}_i$ decreases with S _i. Hence, strikingly, for this class of dynamics information flow tends to avoid the hubs, being dominated mainly by the majority of low degree nodes. Such counter-intuitive flow patterns, which favor the peripheral nodes, represent a highly unexpected outcome of prediction (8), and yet they are fully supported by the results presented in Fig. 4m, p, where ${\cal F}_i$ is in fact inversely proportional to S _i. These results, which defy the natural interpretation of topology to dynamics, highlight the importance of our formalism as well as its predictive strength, allowing us to expose such unique patterns of information flow.
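The three classes above follow from the sign of ω alone, which the short sketch below encodes. The ω values in the lookup table are the ones quoted in the text for the six models. The function name, the dictionary and its short keys are illustrative assumptions.

```python
from fractions import Fraction

# Exponents quoted in the text for the six dynamic models of Table 1.
OMEGA_BY_MODEL = {
    "P": Fraction(5, 3),
    "R1": Fraction(2, 3),
    "E": Fraction(0),
    "B": Fraction(0),
    "M": Fraction(-1),
    "R2": Fraction(-1),
}


def flow_class(omega):
    """Map the scaling exponent of Eq. (10) to its universality class."""
    if omega > 0:
        return "degree-driven"
    if omega == 0:
        return "homogeneous"
    return "degree-averting"


classes = {model: flow_class(w) for model, w in OMEGA_BY_MODEL.items()}
print(classes)
```

An exact comparison with zero is reasonable here because the exponents are stored as fractions. For exponents estimated from data a tolerance around ω = 0 would be needed.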

Our formalism further predicts that ω and ξ, and consequently the three universality classes, are fully determined by the dynamics M through (8), independent of the network topology A _ij. Hence we implemented all six dynamic models (Table 1) on the relevant networks from Fig. 2. We also included several additional canonical model networks, such as an Erdős–Rényi network, and scale-free networks with binary (SF3) and normally distributed (SF4) weights (in addition to SF1, which features scale-free distributed weights). We find that despite the diversity of the examined networks, the behavior of ${\cal F}_i$ and ${\cal F}_{ij}$ consistently exhibits the universal scaling predicted by (10) and (11), across all examined networks (Fig. 4).

Centralized vs. peripheral information flow. The analysis above helps us uncover the main arteries of information flow in a complex network, quantifying the contribution of each node/link, and hence of all pathways to the flow of information, as emerges from the interplay between the system’s topology (A _ij, S _i) and its interaction dynamics (M, ω, ξ). To visualize this we used the scale-free SF1, presented in Fig. 2a–f, this time using a hub-central layout, in which the hubs (large S _i) are located at the center, and the low degree nodes (low S _i) tend to the periphery. For the degree-driven ${\Bbb P}$ and ${\Bbb R}_1$ we observe a centralized information flow, in which the cross-talk between all nodes is primarily mediated by the hubs located at the core of the network (Fig. 4c, f, red). As predicted, the effect is more pronounced for the ${\Bbb P}$ dynamics, where ω is larger. Using the same network with the same layout, the homogeneous ${\Bbb E}$ and ${\Bbb B}$ exhibit a non-centralized flow pattern, in which all nodes/pathways participate equally in spreading information (Fig. 4i, l, green). Finally, the degree-averting ${\Bbb M}$ and ${\Bbb R}_2$ show peripheral flow, in which information favors the longer, decentralized pathways that traverse through the exterior low degree nodes (Fig. 4o, r, blue).

Taken together, these distinct flow patterns, all obtained from the same network (SF1), illustrate the potential disparity between the static network topology and the actual dynamic pathways of information flow. Indeed, flow sometimes condenses around the hubs (red), distributes evenly across nodes (green), favors the network periphery (blue), or follows any other pattern within (6) and (9), as dictated by ω and ξ. Therefore to truly utilize networks as the tool they are designed to be—for visualizing the flow of information—one must use our analytically derived (6)–(11) to translate the network topology into actual pathways of information flow.

Additional flow patterns

At the heart of our analytical formalism lies Eq. (1), whose universal structure covers a broad range of steady-state dynamics, as captured in Table 1 and demonstrated in Figs. 2–4. To expand the applicability of our formalism, we now turn to two systems that extend beyond the boundaries of (1), and use numerical analysis to observe their flow patterns.

Epidemic spreading. The concept of dynamic flow can help us understand, and hence mitigate, the spread of epidemics, a most pertinent threat to our global health⁴⁶. Indeed, to design efficient immunization strategies, we must identify the nodes with the highest contribution to the flow of information (or viruses). To observe this, we implement the susceptible-infected-recovered model, in which each node can be in one of three states, S, I, or R, representing a generalization of (1) to account for multidimensional activities x _i(t). Freezing each node, we find that ${\cal F}_i \sim S_i$, representing a degree driven flow (Fig. 5a, red). This suggests that the optimal mitigation strategy is to immunize the hubs—a rather expected result. However, measuring the flow at later times, we find that the role of the hubs diminishes, indicated by the receding flow curve for large S _i in Fig. 5b (green), up to a point where ${\cal F}_i$ sharply decreases with S _i, entering a rather extreme state of degree-averting flow (Fig. 5c, blue). Hence disease spreading exhibits an evolving flow pattern, being degree driven at the early stages of the contagion and degree averting as the system approaches the pandemic state. The reason for this transition is that the well-connected hubs become infected, and hence non-susceptible, at the early stages of the spread, at which point they cease to contribute to the viral flow (Fig. 5d).

To test these evolving flow patterns in an empirical setting, we used air-traffic data⁴⁶, capturing the international mobility of 7 × 10⁶ individuals per day over the course of 3 years between N = 1,292 major international airports. Indeed, we find that flow evolves over time, condensing around different nodes at different stages of the contagion (Fig. 5e–j). These findings are crucial for developing efficient mitigation strategies based on air-traffic interventions, such as immunizing or quarantining passengers at strategic air routes. Such interventions help reduce the spread of disease at the price of negatively impacting the mobility of people and goods, a burden, which may significantly impact the global economy. To minimize the damage, we seek optimal mitigation strategies, which employ minimal intervention. Our analysis suggests that hub-immunization, the commonly assumed strategy, is only effective at the early stages of the spread. As the spread unfolds the dynamic flow diffuses towards the peripheral pathways.

In a broader perspective, such time dependent flow patterns expose the limited predictive power offered by the static topology, which remains unchanged in time. Our formalism, on the other hand, was able to uncover the time evolving flow patterns, providing crucial insights on the dynamic nature of disease propagation, as well as practical implications on its mitigation. For the detailed analysis of this system see Supplementary Note 5.

Metabolism. As our final example we analyze information flow in Glycolysis (Fig. 6a)⁴⁷, a well-mapped metabolic pathway that consumes glucose (triangle) to form the energy-rich ATP molecule (pentagon). This biochemical sequence can be accurately modeled via mass-action-kinetics (Fig. 6b), giving rise to a rather rich module structure, including third and fourth order reactions, that help us extend our analysis beyond pairwise dynamics. Instead of i,j links, we now have modules that represent chemical reactions, grouping together interacting substrates and catalysts (large gray circles), whose reactions generate flux (arrows) that links each module to its product molecules.

In this system, information flows from the input glucose to the output ATP, hence, by perturbing the glucose levels (a signal), we can measure the contribution of all reactants (nodes) or reactions (modules) to the flow, by sequentially freezing each node/module, and tracking the consequent changes in ATP production (response). The resulting flow patterns, shown in Fig. 6c–f expose a balance of positive (blue) and negative (red) flows, indicating that although some nodes/modules enhance the spread, others mitigate it, by negatively contributing to the flow. This balanced picture illustrates the role of metabolism as a regulatory process, intended to sustain the desired output levels (ATP) in the face of environmental perturbations (glucose signal), achieved by restricting the efficiency of information flow. Interestingly, our flow analysis naturally distinguishes between substrates and catalysts, the latter showing extremely low ${\cal F}_i$ (Fig. 6c, e). This finding is supported by empirical observations, that biochemical outputs are highly insensitive to changes in enzyme concentration⁴⁸. For the detailed analysis of this system see Supplementary Note 6.

Discussion

From neuronal signals to gene regulation, complex networks function by enabling the flow of information between nodes. Understanding the rules that govern this flow is a crucial step toward establishing a theory of network dynamics. Our approach here is to separate the contribution of the topology (A _ij) from the dynamics (M, ω, ξ), allowing us to efficiently translate topological characteristics (S _i,in/out, A _ij) into dynamic predictions (${\cal F}_i$, ${\cal F}_{ij}$). This will potentially enable us to leverage the vast amounts of data collected in recent years on the topology of real networks, into an understanding of their actual flow patterns. For instance, here we have shown that degree heterogeneity, a ubiquitous characteristic observed in almost all real networks¹, translates into one of three classes of flow: hubs may either dominate information flow (red), have no impact on the flow (green) or have a marginal role, effectively being the “shock-absorbers” of the network’s signal propagation (blue).

Our derivations are exact for a random A _ij with arbitrary degree/weight distributions, and under the assumption of small perturbations. We further establish their robustness when these assumptions are violated in Supplementary Note 4, confronting our predictions against large perturbations or non-random characteristics of A _ij, such as clustering C and degree-correlations Q ⁴⁹. We find that extreme levels of C or Q may result in a systematic decrease in ω, representing a reduction in the role of the hubs. This occurs due to the prevalence of loops in these limits, providing alternative pathways for the signals to bypass the well-connected nodes, a purely topological effect, observed independently of the dynamics. Still, even with these minor deviations in the precise values of ω or ξ, our macro-scale qualitative classification of flow patterns (degree-driven, homogeneous, degree-averting) remains unaffected, representing an intrinsic characteristic of the system’s internal mechanisms M, which is highly insensitive to microscopic discrepancies.

In a broader perspective, our predicted universality indicates that the macroscopic flow patterns of complex systems are controlled by only a few relevant parameters of the system’s microscopic dynamics, in this case the leading powers of the expansion (7). Such disparity between the unlimited microscopic degrees of freedom, and the restricted set of macroscopic behaviors lays the basis for a statistical mechanics theory of network dynamics, allowing us to systematically translate a complex system’s microscopic description, in terms of A _ij and M, to its anticipated large-scale dynamic behavior, e.g., centralized vs. peripheral flow.

Methods

Example: flow in regulatory dynamics

Our formalism provides a step-by-step procedure to translate the topology A _ij into dynamic flow ${\cal F}$, through the exponents ω and ξ. As an example we consider gene regulatory dynamics ${\Bbb R}$, where (Supplementary Note 2)

$$\frac{{{\mathrm{d}}x_i}}{{{\mathrm{d}}t}} = - x_i^a + \mathop {\sum}\limits_{j = 1}^N A_{ij}\frac{{x_j^h}}{{1 + x_j^h}},$$

(12)

with a, h > 0. The contribution of all paths to the flow is governed by the exponents ω and ξ, which we now extract analytically in three steps. First, we break the dynamics into the three components of M, providing

$$M_0(x) = - x^a,\quad M_1(x) = 1,\quad M_2(x) = \frac{{x^h}}{{1 + x^h}}.$$

(13)
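The decomposition in (13) can be checked numerically. The sketch below is illustrative only: it assumes the dynamics take the form dx_i/dt = M_0(x_i) + Σ_j A_ij M_1(x_i) M_2(x_j), which reproduces Eq. (12) for the components in (13). The function names, the three-node toy network and the forward-Euler integrator are our own assumptions and are not taken from DynamicFlow.m.

```python
# Illustrative sketch: all names and the toy network below are hypothetical.

def m0(x, a):
    """Self-dynamics M_0(x) = -x^a."""
    return -(x ** a)

def m1(x):
    """M_1(x) = 1: the incoming signal is independent of the target's state."""
    return 1.0

def m2(x, h):
    """Activation M_2(x) = x^h / (1 + x^h)."""
    return x ** h / (1.0 + x ** h)

def rhs(x, A, a, h):
    """Right-hand side assembled from M_0, M_1, M_2 (assumed general form)."""
    n = len(x)
    return [
        m0(x[i], a) + m1(x[i]) * sum(A[i][j] * m2(x[j], h) for j in range(n))
        for i in range(n)
    ]

def direct_rhs(x, A, a, h):
    """Right-hand side of Eq. (12), written out directly."""
    n = len(x)
    return [
        -(x[i] ** a) + sum(A[i][j] * x[j] ** h / (1.0 + x[j] ** h) for j in range(n))
        for i in range(n)
    ]

def relax(x, A, a, h, dt=0.01, steps=5000):
    """Forward-Euler relaxation toward a steady state (a simple assumed integrator)."""
    x = list(x)
    for _ in range(steps):
        dx = rhs(x, A, a, h)
        x = [xi + dt * di for xi, di in zip(x, dx)]
    return x

# Hypothetical toy network: an undirected path on three nodes.
A = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
x0 = [0.5, 1.0, 2.0]

# Steady state for the parameters of R_1 (a = 1, h = 1/3).
x_steady = relax(x0, A, 1.0, 1.0 / 3.0)
```

Here `rhs` and `direct_rhs` should agree to floating-point precision, and `rhs(x_steady, ...)` should be close to zero once the relaxation has converged.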

We then construct the power series (7): first writing $W(x) = -M_1(x)/M_0(x) = x^{-a}$; then inverting it to obtain $W^{-1}(x) = x^{-1/a}$; finally, using (7) we construct the power series as

$$M_2\left( {W^{ - 1}(x)} \right) = \frac{{x^{ - \frac{h}{a}}}}{{1 + x^{ - \frac{h}{a}}}} = 1 - x^{\frac{h}{a}} + O\left( {x^{2\frac{h}{a}}} \right).$$

(14)
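As a sanity check on (14), one can compare the exact expression with its two-term truncation at small x, and estimate the leading non-zero power from a log-log slope. The following is a minimal sketch under our own naming; the helper functions and sample points are assumptions made for illustration.

```python
import math

def m2_of_w_inverse(x, a, h):
    """Exact M_2(W^{-1}(x)), with W^{-1}(x) = x^(-1/a) and M_2(y) = y^h / (1 + y^h)."""
    y = x ** (-1.0 / a)
    return y ** h / (1.0 + y ** h)

def two_term_truncation(x, a, h):
    """First two terms of the series in Eq. (14): 1 - x^(h/a)."""
    return 1.0 - x ** (h / a)

def truncation_error(x, a, h):
    """Absolute difference between the exact expression and the truncation."""
    return abs(m2_of_w_inverse(x, a, h) - two_term_truncation(x, a, h))

def estimate_gamma1(a, h, x_small=1e-3, x_large=1e-2):
    """Log-log slope of 1 - M_2(W^{-1}(x)) between two small x values.

    For small x the deviation from 1 scales as x^(h/a), so the slope
    approximates the exponent Gamma(1) = h/a.
    """
    d_small = 1.0 - m2_of_w_inverse(x_small, a, h)
    d_large = 1.0 - m2_of_w_inverse(x_large, a, h)
    return (math.log(d_small) - math.log(d_large)) / (math.log(x_small) - math.log(x_large))
```

The truncation error should stay below $x^{2h/a}$, consistent with the $O(x^{2h/a})$ remainder in (14), and `estimate_gamma1(1.0, 2.0)` should return a value close to 2.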

From (14) we extract the leading powers of $M_2(W^{-1}(x))$ as Γ(0) = 0 and Γ(1) = h/a. We next use Γ(n) in (8) to predict ω and ξ = ω − 1: here, since Γ(0) = 0, Eq. (8) predicts

$$\omega = 1 - \Gamma (1) = 1 - \frac{h}{a},$$

(15)

yielding ω = 2/3 for ${\Bbb R}_1$ (a = 1, h = 1/3), a degree-driven dynamics, and ω = −1 for ${\Bbb R}_2$ (a = 1, h = 2), a degree-averting dynamics. Hence, on the same network, a change in the dynamics (the value of h) leads to fundamentally different flow patterns. A detailed analysis of all dynamics in Table 1 appears in Supplementary Note 2.
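The arithmetic of (15) for the two regulatory examples can be reproduced with exact fractions. This sketch covers only the case Γ(0) = 0 treated above. The function names are hypothetical, and placing the class boundary at ω = 0 (with ω = 0 labeled homogeneous) is an assumption inferred from the three flow classes named in the text.

```python
from fractions import Fraction

def omega_regulatory(a, h):
    """Eq. (15): omega = 1 - Gamma(1) = 1 - h/a (applies here because Gamma(0) = 0)."""
    return 1 - Fraction(h) / Fraction(a)

def xi_from_omega(omega):
    """xi = omega - 1, as stated in the text."""
    return omega - 1

def flow_class(omega):
    """Qualitative class from the sign of omega (boundary at 0 is an assumption)."""
    if omega > 0:
        return "degree-driven"
    if omega < 0:
        return "degree-averting"
    return "homogeneous"

# The two examples from the text: R_1 (a = 1, h = 1/3) and R_2 (a = 1, h = 2).
omega_r1 = omega_regulatory(1, Fraction(1, 3))
omega_r2 = omega_regulatory(1, 2)
```

Passing `h` as a `Fraction` (rather than a float such as `1/3`) keeps the result exact, so `omega_r1` is exactly 2/3 and `omega_r2` is exactly −1, matching the values quoted above.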

Data availability

We make our code DynamicFlow.m (Matlab) available with this submission. The code accepts a user-defined network and dynamics and provides the dynamic flow patterns as output, together with the scaling relationships reported throughout the paper. Specifically, the code allows users to reproduce all results presented in the paper. All empirical networks used in this work are publicly available online.

References

1. Caldarelli, G. Scale-free networks: complex webs in nature and technology (Oxford University Press, New York, 2007).
2. Dorogovtsev, S. N. & Mendes, J. F. F. Evolution of networks: from biological nets to the Internet and WWW (Oxford University Press, Oxford, 2003).
3. Strogatz, S. H. Exploring complex networks. Nature 410, 268–276 (2001).
4. Helbing, D., Jost, J. & Kantz, H. (eds) Networks and complexity. Networks and Heterogeneous Media (NHM) 3, 185–411 (AIMS, Springfield, MO, 2008).
5. Newman, M. E. J. Networks: an introduction (Oxford University Press, New York, 2010).
6. Palla, G., Derényi, I., Farkas, I. & Vicsek, T. Uncovering the overlapping community structure of complex networks in nature and society. Nature 435, 814–818 (2005).
7. Pastor-Satorras, R. & Vespignani, A. Evolution and structure of the Internet: A statistical physics approach (Cambridge University Press, Cambridge, 2004).
8. Cohen, R. & Havlin, S. Complex networks: Structure, robustness and function (Cambridge University Press, New York, NY, 2010).
9. Yu, H. et al. High-quality binary protein interaction map of the yeast interactome network. Science 322, 104–110 (2008).
10. Milo, R. et al. Network motifs: Simple building blocks of complex networks. Science 298, 824–827 (2002).
11. Watts, D. J. & Strogatz, S. H. Collective dynamics of ‘small-world’ networks. Nature 393, 440–442 (1998).
12. Barabási, A.-L. & Albert, R. Emergence of scaling in random networks. Science 286, 509–512 (1999).
13. Albert, R. & Barabási, A.-L. Statistical mechanics of complex networks. Rev. Mod. Phys. 74, 47–97 (2002).
14. Almaas, E., Kovács, B., Vicsek, T., Oltvai, Z. N. & Barabási, A.-L. Global organization of metabolic fluxes in the bacterium Escherichia coli. Nature 427, 839–843 (2004).
15. Barrat, A., Barthélemy, M., Pastor-Satorras, R. & Vespignani, A. The architecture of complex weighted networks. Proc. Natl Acad. Sci. USA 101, 3747–3752 (2004).
16. Barrat, A., Barthélemy, M. & Vespignani, A. Dynamical Processes on Complex Networks (Cambridge University Press, Cambridge, 2008).
17. Barzel, B. & Biham, O. Quantifying the connectivity of a network: the network correlation function method. Phys. Rev. E 80, 046104–046115 (2009).
18. Barzel, B. & Barabási, A.-L. Universality in network dynamics. Nat. Phys. 9, 673–681 (2013).
19. Holter, N. S., Maritan, A., Cieplak, M., Fedoroff, N. V. & Banavar, J. R. Dynamic modeling of gene expression data. Proc. Natl Acad. Sci. USA 98, 1693–1698 (2001).
20. Bornholdt, S. Boolean network models of cellular regulation: prospects and limitations. J. R. Soc. Interface 5, S85–S94 (2008).
21. Barthélémy, M., Barrat, A., Pastor-Satorras, R. & Vespignani, A. Velocity and hierarchical spread of epidemic outbreaks in scale-free networks. Phys. Rev. Lett. 92, 178701 (2004).
22. Leskovec, J., McGlohon, M., Faloutsos, C., Glance, N. & Hurst, M. Patterns of cascading behavior in large blog graphs. In Proceedings of the Seventh SIAM International Conference on Data Mining, April 26–28, 2007, Minneapolis, Minnesota, USA, 551–556 (2007); see http://www.citeulike.org/user/applewu/article/8569539
23. Gao, J., Barzel, B. & Barabási, A.-L. Universal resilience patterns in complex networks. Nature 530, 307–312 (2016).
24. Kubo, R., Toda, M. & Hashitsume, N. Statistical Physics II. Nonequilibrium Statistical Mechanics (Springer-Verlag, Heidelberg, 1991).
25. Maslov, S. & Ispolatov, I. Propagation of large concentration changes in reversible protein-binding networks. Proc. Natl Acad. Sci. USA 104, 13655–13660 (2007).
26. Barzel, B. & Barabási, A.-L. Network link prediction by global silencing of indirect correlations. Nat. Biotechnol. 31, 720–725 (2013).
27. Barzel, B., Liu, Y.-Y. & Barabási, A.-L. Constructing minimal models for complex system dynamics. Nat. Commun. 6, 7186 (2015).
28. Barthélémy, M., Barrat, A., Pastor-Satorras, R. & Vespignani, A. Dynamical patterns of epidemic outbreaks in complex heterogeneous networks. J. Theor. Biol. 235, 275–288 (2005).
29. Hufnagel, L., Brockmann, D. & Geisel, T. Forecast and control of epidemics in a globalized world. Proc. Natl Acad. Sci. USA 101, 15124–15129 (2004).
30. Dodds, P. S. & Watts, D. J. A generalized model of social and biological contagion. J. Theor. Biol. 232, 587–604 (2005).
31. Voit, E. O. Computational Analysis of Biochemical Systems (Cambridge University Press, New York, NY, 2000).
32. Novozhilov, A. S., Karev, G. P. & Koonin, E. V. Biological applications of the theory of birth-and-death processes. Brief. Bioinform. 7, 70–85 (2006).
33. Karlebach, G. & Shamir, R. Modelling and analysis of gene regulatory networks. Nat. Rev. Mol. Cell Biol. 9, 770–780 (2008).
34. Gardiner, C. W. Handbook of Stochastic Methods (Springer-Verlag, Berlin, 2004).
35. Hayes, J. F. & Ganesh Babu, T. V. J. Modeling and Analysis of Telecommunications Networks (John Wiley & Sons, Inc, Hoboken, NJ, 2004).
36. Kitsak, M. et al. Identification of influential spreaders in complex networks. Nat. Phys. 6, 888–893 (2010).
37. Rual, J. F. et al. Towards a proteome-scale map of the human protein-protein interaction network. Nature 437, 1173–1178 (2005).
38. Opsahl, T. & Panzarasa, P. Clustering in weighted networks. Soc. Networks 31, 155–163 (2009).
39. Eckmann, J.-P., Moses, E. & Sergi, D. Entropy of dialogs creates coherent structures in e-mail traffic. Proc. Natl Acad. Sci. USA 101, 14333–14337 (2004).
40. Robertson, C. Flowers and insects lists of visitors of four hundred and fifty three flowers (Carlinville, Carlinville, IL, 1929).
41. Barzel, B. & Biham, O. Binomial moment equations for stochastic reaction systems. Phys. Rev. Lett. 106, 150602–150605 (2011).
42. Holling, C. S. Some characteristics of simple types of predation and parasitism. Can. Entomol. 91, 385–398 (1959).
43. Alon, U. An Introduction to Systems Biology: Design Principles of Biological Circuits (Chapman & Hall, London, 2006).
44. Schmetterer, L. & Sigmund, K. (eds) Hans Hahn Gesammelte Abhandlungen Band 1/Hans Hahn Collected Works Volume 1 (Springer, Vienna, 1995).
45. Albert, R. & Barabási, A.-L. Statistical mechanics of complex networks. Rev. Mod. Phys. 74, 47–97 (2002).
46. Brockmann, D. & Helbing, D. The hidden geometry of complex, network-driven contagion phenomena. Science 342, 1337–1342 (2013).
47. Nelson, D. L., Lehninger, A. L. & Cox, M. M. Lehninger principles of biochemistry (Macmillan, New York, NY, 2008).
48. Schaaff, I., Jürgen, H. & Zimmermann, K. F. Overproduction of glycolytic enzymes in yeast. Yeast 5, 285–290 (1989).
49. Newman, M. E. J. Assortative mixing in networks. Phys. Rev. Lett. 89, 208701–208704 (2002).

Author information

Authors and Affiliations

Department of Mathematics, Bar-Ilan University, Ramat-Gan, 52900, Israel
Uzi Harush & Baruch Barzel

Contributions

Both authors designed and carried out the research. U.H. conducted the derivations, data analysis, and numerical simulations. B.B. is the lead writer of the paper.

Corresponding author

Correspondence to Baruch Barzel.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Software 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Harush, U., Barzel, B. Dynamic patterns of information flow in complex networks. Nat Commun 8, 2181 (2017). https://doi.org/10.1038/s41467-017-01916-3

Received: 05 March 2017
Accepted: 25 October 2017
Published: 19 December 2017
DOI: https://doi.org/10.1038/s41467-017-01916-3
