Two universal physical principles shape the power-law statistics of real-world networks

Lorimer, Tom; Gomez, Florian; Stoop, Ruedi

doi:10.1038/srep12353

Download PDF

Article
Open access
Published: 23 July 2015

Two universal physical principles shape the power-law statistics of real-world networks

Tom Lorimer¹,
Florian Gomez¹ &
Ruedi Stoop¹

Scientific Reports volume 5, Article number: 12353 (2015) Cite this article

4907 Accesses
18 Citations
9 Altmetric
Metrics details

Subjects

Abstract

The study of complex networks has pursued an understanding of macroscopic behaviour by focusing on power-laws in microscopic observables. Here, we uncover two universal fundamental physical principles that are at the basis of complex network generation. These principles together predict the generic emergence of deviations from ideal power laws, which were previously discussed away by reference to the thermodynamic limit. Our approach proposes a paradigm shift in the physics of complex networks, toward the use of power-law deviations to infer meso-scale structure from macroscopic observations.

Emergent stability in complex network dynamics

Article 20 April 2023

Deciphering the generating rules and functionalities of complex networks

Article Open access 25 November 2021

Uncovering the hidden structure of small-world networks

Article Open access 19 March 2024

Introduction

A recent seminal discovery elucidated that in nature a simple physical principle often rules the growth of ‘random networks’. The so called preferential attachment (‘the rich get richer’) rule leads to complex networks that have properties contrasting those predicted from classical random network theory^1,2,3,4. A fundamental universality principle of physics must be held responsible for this change of paradigm. The preferential attachment principle expresses in our interpretation that for the formation of ensembles, attractive forces that are generally valid over decades of spatial extensions are required (that in physics may involve, e.g., mass, charge). It is this principle that generates the celebrated power laws observed in the distribution of mesoscopic network indicators, such as network degree, connectivity weight^5,6,7,8, or neuronal avalanche size^9,10,11. A second fundamental universality principle of physics that is active at the same time, has, however, passed unnoticed so far. It is the fact that real-world connectivity requires space and that this space is limited. The question that we address in our work is what the traces of this principle will be, during network formation and regarding the final network. This question has not been answered so far.

Generic network building algorithm

To study this question, we consider a novel generic network building algorithm (our ‘primary model’) that implements both principles at the most basic level as follows. We start from a connected network of N₀ nodes. With probability p, an ‘outside’ node, from a finite set of available nodes, is added; alternatively, with probability 1 − p, an attempt is made to construct an ‘inside’ edge (see below). If an outside node is added, the new node connects to the network by m edges, where the target nodes are sampled according to their degree k (i.e. ∝ k), following preferential attachment. For an inside edge, two nodes are independently chosen using preferential attachment. If the two chosen nodes are not identical and not already connected, an edge is established, which expresses the second fundamental principle in terms of an ‘edge saturation’ (at a level defined by p and m, implemented right from the start of the network’s growth). The process stops if the set of available nodes is depleted. The algorithm generates undirected topological networks of arbitrary size, void of loops and multiple-edges; examples will be discussed later. Fig. 1 shows the stereotypical degree distribution obtained in this way, exhibiting an extended power-law part of the distribution terminated by a hump (that, upon the network’s growth, moves towards larger degrees, until the process is stopped by node depletion. The details of how this happens in time are outlined in our ‘Statistical modelling’ section).

Network properties

While we observe a wide-spread activity to find power-law distributions in all areas of physics, we emphasize that based on the fundamental ingredients necessary in the network building process, only in rare cases will neat power laws be found. Examples of experimental data with the deviations that our key principles predict are shown in Fig. 2. While our real-world examples are often related to biology (mostly because of the great availability of the underlying data and because of the greater simplicity of the examples), all of our arguments are immediately transferable to physical situations where previous analysis has generally stopped at the preferential attachment level. Our analysis now provides guidelines for inferring from macroscopic measurements the microscopic properties that dominate network growth (cf. Fig. 3, where the ‘humpiness’ of the distribution P(k) was evaluated as the deviation from the power law p(k) excluding the hump, as (P(k) − p(k))/p(k)). This provides an important input for the modelling of real world systems (see, e.g., the Drosophila network example discussed below). By superposition of prototypes with different p and m parameters, more general hump structures can be generated (Fig. 2). This mechanism provides an as yet unexplored link between the macro- and meso-scales that can be invaluable for both the modelling and the further analysis of real-world systems.

In contrast to preferential attachment networks (cf.¹²), a network generated according to the two fundamental physical principles embodied in our primary model, will not necessarily be sparse (this would imply a power-law exponent >2, cf. Fig. 1). Moreover, Dorogovtsev and Mendes’ modified preferential attachment algorithm with its double regimes of power-law behaviour⁷ also deviates from the fundamental principles that we have worked out. Their model uses a second internal linking process that is always successful in making new connections. In our case it is exactly the edge connection failures (by edge saturation) that define the network structure. Whereas the rate of internal linking in their algorithm accelerates with the network size, our approach does not share this property. The network structures that we obtain depend primarily on parameter p; the obtained distributions are generally unaffected by the network’s initial condition (in contrast to Refs 13, 14, 15), as long as the initial network size N₀ is sufficiently smaller than the final network size. Previous authors have also studied network shaping by edge depletion¹⁶. Their algorithm can also produce scale free networks with exponent <2, but excludes saturation and thus does not show the characteristic hump termination discussed here.

The modelling of biological networks containing a small number of nodes only, is a particular challenge. The example of Drosophila’s courtship network, a network that is built on observable irreducible acts of body language^17,18 (cf. Figs 4 and 5) illustrates that our approach also successfully masters this challenge (a further discussion of this example is given towards the end of the paper).

Statistical modelling

To better understand how the statistical properties and in particular, saturation, emerge from the model, we focus on a semi-analytical growth description, in which the natural time step t is the addition of one node to the network. The degree distribution from a network growth algorithm is usually determined from a differential equation that describes the rate of addition of new edges to a given node, as a function of the time s at which the node has joined the network¹⁹, i.e. For our algorithm, the topological constraint on the addition of inside edges implies that can not be determined analytically from the single node information f(k, s, t), but requires the full pairwise connection information of the network encoded in the adjacency matrix at time t, A_t, i.e. To work around this complication, we make the following ansatz. We suppose that the probability of failure while trying to add an inside edge (i, j) to an already chosen node i, can be expressed by a mean field ‘saturation’ function F(k, t) in terms of the degree k of node i. Furthermore, suppose that the total number of edges present in the network at time t can be approximated by K(t). F(k, t) is then defined as the average probability of a node with degree k, to be already connected to a second node j chosen with P ∝ k_j. Thus,

where F_i(t) is the probability that node i with degree k_i, is already connected to node j. F_i(t) has then the form

where k_i(t) accounts for the case where node i would be chosen twice and the second term is the degree-weighted sum over the nodes to which node i is already connected (E(t) denotes the network’s set of edges).

Using this approximation, we can express our algorithm by the rate of addition of new edges to a node of degree k(s, t) as

In this case, the network grows out from a connected network of N₀ nodes, with k(s, s) ≈ m as the initial condition. The first term on the right hand side of Eq. (3) describes the increase in k due to connection to outside nodes and the second term describes the addition of inside edges. The whole equation has been rescaled by (cancelling the p in the first term’s numerator) such that t corresponds to the number of nodes in the network. As can be easily seen from Eq. (3), our growth algorithm provides two well-known limiting cases. For p = 1 we retrieve the preferential attachment growth process⁴. For p = 0, the network will not add nodes and must asymptotically become a clique of size N₀. In between, for p << 1, the second term dominates, which renders the network more dense and produces the large deviation from power-law structure in the distribution tail.

To demonstrate the validity of our mean-field approximation, we compare the node degree evolution obtained from a 4^th order Runge-Kutta integration of Eq. (3) using our approximation for F(k, t) (see below), against the averaged result from 10³ realizations of the primary model. As the result, an approximate power law scaling clearly emerges at early evolution stage and an upper bound to the envelope of node degrees emerges for longer evolution time t necessary to attain larger network sizes (cf. Fig. 6, where the results of the semi-analytical description are based on exponents and prefactors from an approximation of the results of Fig. 7a via Eq. (4)). F(k, t) has a very regular behavior in both variables (k, t) (Fig. 7a) and is accompanied by a node degree distribution P(k) as found for our primary model (Fig. 7b). Over a large range, we can approximate F(k, t) by a power law for small k and by a second power law at large k:

where k_c ~ t^λ and the fractional term for k > k_c simply makes F(k, t) continuous at k_c. The exponents α, β, γ, λ will vary according to the choice of algorithm parameter p, where 0 < λ < 1: i.e. 1 < k_c < t. In accordance with Fig. 7a, the following observations can be made: First, γ < β (the exponent of the power law fit decreases as k crosses k_c). Second, F(t − 1, t) = 1, since t − 1 is the maximum possible node degree at time t (achieved in Fig. 7a for t = 25 only). Similarly, as p → 0, F(k, t) → 1 (the network will tend toward a clique, where all possible connections already exist). When p = 1, F(k, t) ceases to be relevant. Finally, for any p ∈ (0,1), as t → ∞, F(k, t) → 0, since the number of inside edges added at each time-step approximates a constant value, so the network becomes increasingly sparse.

We can use F(k, t) to infer the generated unnormalized degree probability distribution, N(k, t) as follows. Starting from the continuity equation, we may write

where is given by Eq. (3) and the Kronecker delta function has been included to account for the addition of outside nodes. By differentiating Eq. (3), we notice that Eq. (5) contains the product of k and the derivative of the saturation function F:

where , . The form of F(k, t) implies that a sharp change should occur in the solutions of Eq. (6) around k_c. Indeed, a comparison between P(k, t) and F(k, t) (Fig. 7) supports this suggestion. Thus, we hold the properties of the saturation function F(k, t) responsible for the form of the deviation of P(k, t) from the ideal power law.

Discussion

Examples of edge saturation network growth emerge from the fundamental situation where the state of a physical system is described by a symbol and where time acting on the states leads to a description in terms of a language (symbolic dynamics and formal languages^{20,21,22,23,24,25,26,27}, natural languages). Starting with a finite number of N₀ states, observations of the system in time yield sequences of states, that define links on a graph between nodes (states), which implies that more important or more versatile nodes will have more links. During the refinement of this description, two processes may occur: 1) adjacencies are established between previously unconnected nodes (preferentially between more versatile ones); 2) a new node is added and connected preferentially to already highly connected nodes. Evidently, in many networks there will, however, be a limitation on the number of edges that can be hosted by a given node.

The Drosophila courtship body language of 37 fundamental behavioural states^17,18 and its network is an example of such a process. The states are fundamental in the sense that each act could, from the view of the physics of body motion, be followed by any other act. Some transitions, however, are generally not taken, leading to edges missing. Well-defined connected sub-networks characterize a chosen courtship partner’s class, according to which protagonists can be distinguished (male, female (virgin, mature, mated), fruitless). Within these bounds, courtship exploits the available expression space, corroborating the view that it might advertise individual properties of the sender into the eyes of a courtship partner^18,28. To compare our network growth algorithm with the data from male-female interaction, we grow the network until the number of nodes (symbols) is depleted, with p chosen so that on average the number of edges matches that of the courtship network. A comparison -without further fitting- exhibits that the two degree distributions match extremely well and that the proposed generating algorithm is very specific (Fig. 5).

Our paradigm may also appear in the guise of an equilibrium condition in the following sense. Complex networks in physics or in biology are often constrained to maintain some ‘average’ conditions. As soon as (possibly: self-enhancing) node interaction sets in, this needs to be balanced by homeostasis, i.e. a competitive, counter-balancing mechanism that weakens other connections of the same node to the network⁸. In the neural networks domain, a closely related principle is known as ‘Hebbian learning’²⁹. Self-organized Hebbian-learning³⁰ in the super-paramagnetic³¹ phase of ensembles has been proven a reliable and efficient way of clustering that does away with convexity requirements of cluster borders³². A very similar approach has also been used as a synchronization model for coupled oscillators, where the oscillators’ struggle to synchronize is expressed by competing connection strengths w_ij that evolve according to the dynamical update rule ⁸, where s_ij measures the pairwise oscillator synchrony. The resulting distribution of w_ij has been shown to tend for intermediate coupling strengths towards a hump-terminated power-law (cf. Fig. 2a). This dynamical law expresses the limited resources available for the local wiring around each node, which in our model is encoded in the probability p ruling the edge saturation. We envisage that also avalanche distributions of the typical form of Fig. 2a could be understood similarly¹¹.

Many interesting real-world phenomena dwell on the mesoscale. In social networks, the largest scale is relevant, e.g., for the study of disease and rumour spreading, but more subtle social dynamics happens within the community structures^33,34. Our results suggest that a large class of systems can be formulated as growing along simple principles, similar and in addition to preferential attachment. The sets of m, p parameters needed to recover an experimental distribution, i.e. the violation of the ideal power law on the macroscopic scale, provides us with an insight about the local mesoscale structures present in the network. In this way, starting from non-ideal power law distributions of complex networks, an avenue opens towards the identification and understanding of interesting mesoscale real-world phenomena in physics.

Additional Information

How to cite this article: Lorimer, T. et al. Two universal physical principles shape the power-law statistics of real-world networks. Sci. Rep. 5, 12353; doi: 10.1038/srep12353 (2015).

References

Boccaletti, S., Latora, V., Moreno, Y., Chavez, M. & Hwang, D.-U. Complex networks: structure and dynamics. Phys. Rep . 424, 175 (2006).
Article MathSciNet ADS Google Scholar
Albert, R. & Barabási, A.-L. Statistical mechanics of complex networks. Rev. Mod. Phys. 74, 47 (2002).
Article MathSciNet ADS Google Scholar
Cohen, R. & Havlin, S. Complex networks: structure, robustness and function . (Cambridge University Press, Cambridge, England, 2010).
Barabási, A.-L. & Albert, R. Emergence of scaling in random networks. Science 286, 509 (1999).
Article MathSciNet ADS Google Scholar
Amaral, L. A. N., Scala, A., Barthélémy, M. & Stanley, H. E. Classes of small-world networks. Proc. Natl. Acad. Sci. USA . 97, 11149 (2000).
Article CAS ADS Google Scholar
Mossa, S., Barthélémy, M., Stanley, H. E. & Amaral, L. A. N. Truncation of power law behavior in “scale-free” network models due to information filtering. Phys. Rev. Lett. 88, 138701 (2002).
Article ADS Google Scholar
Dorogovtsev, S. N. & Mendes, J. F. F. Language as an evolving word web. Proc. R. Soc. Lond. B 268, 2603 (2001).
Article CAS Google Scholar
Assenza, S., Gutiérrez, R., Gómez-Gardeñes, J., Latora, V. & Boccaletti, S. Emergence of structural patterns out of synchronization in networks with competitive interactions. Sci. Rep . 1, 99 (2011).
Article CAS ADS Google Scholar
Eurich, C. W., Herrmann, J. M. & Ernst, U. A. Finite-size effects of avalanche dynamics. Phys. Rev. E 66, 066137 (2002).
Article ADS Google Scholar
Levina, A., Herrmann, J. M. & Geisel, T. Dynamical synapses causing self-organized criticality in neural networks. Nat. Phys . 3, 857 (2007).
Article CAS Google Scholar
de Arcangelis, L., Lombardi, F. & Herrmann, H. J. Criticality in the brain. J. Stat. Mech . 3, P03026 (2014).
Article MathSciNet Google Scholar
Del Genio, C. I., Gross, T. & Bassler, K. E. All scale-free networks are sparse, Phys. Rev. Lett. 107, 178701 (2011).
Article ADS Google Scholar
Dorogovtsev, S. N., Mendes, J. F. F. & Samukhin, A. N. Size-dependent degree distribution of a scale-free growing network. Phys. Rev. E 63, 062101 (2001).
Article CAS ADS Google Scholar
Guimaraes, P. R., de Aguiar, M. A. M., Bascompte, J., Jordano, P. & dos Reis S. F. Random initial condition in small Barabasi-Albert networks and deviations from the scale-free behavior. Phys. Rev. E 71, 037101 (2005).
Article ADS Google Scholar
Waclaw, B. & Sokolov, I. M. Finite-size effects in Barabási-Albert growing networks. Phys. Rev. E 75, 056114 (2007).
Article MathSciNet CAS ADS Google Scholar
Schneider, C. M., de Arcangelis, L. & Herrmann, H. Scale-free networks by preferential depletion. Euro. Phys. Lett . 95, 16005 (2011)
Article ADS Google Scholar
Stoop, R. & Arthur, B. Periodic orbit analysis demonstrates genetic constraints, variability and switching in Drosophila courtship behavior. Chaos 18, 023123 (2008).
Article ADS Google Scholar
Stoop, R. & Joller, J. Mesocopic Comparison of Complex Networks Based on Periodic Orbits. Chaos 21, 016112 (2011).
Article CAS ADS Google Scholar
Dorogovtsev, S. N. & Mendes, J. F. F. Evolution of Networks . (Oxford University Press, Oxford, 2003).
Grassberger, P. & Kantz, H. Generating partitions for the dissipative Hénon map. Phys. Lett. A 113, 235 (1985).
Article MathSciNet ADS Google Scholar
Cvitanović, P., Gunaratne, G. H. & Procaccia, I. Topological and metric properties of Hénon-type strange attractors. Phys. Rev. A 38, 1503 (1988).
Article MathSciNet ADS Google Scholar
Bai-Lin, H. Elementary Symbolic Dynamics and Chaos in Dissipative Systems (World Scientific, Singapore, 1989).
Stoop, R. Bivariate thermodynamic formalism and anomalous diffusion. Phys. Rev. E 49, 4913 (1994).
Article CAS ADS Google Scholar
Stoop, R. & Parisi, J. Evaluation of probabilistic and dynamical invariants from finite symbolic substrings-comparison between two approaches. Physica D 58, 325 (1992).
Article MathSciNet ADS Google Scholar
Stoop, R. Phase transitions in the approximated and asymptotic generalized entropy spectrum of a nonhyperbolic system. Phys. Rev. A 46, 7450 (1992).
Article CAS ADS Google Scholar
Lai, Y.-C., Bollt, E. & Grebogi, C. Communicating with chaos using two-dimensional symbolic dynamics. Phys. Lett. A 255, 75 (1999).
Article CAS ADS Google Scholar
Klages, R. Microscopic chaos, fractals and transport in non-equilibrium statistical mechanics (World Scientific, Singapore, 2007).
Stoop, R., Nüesch, P., Stoop, R. L. & Bunimovich, L. A. At grammatical faculty of language, flies outsmart men. PLoS ONE 8, e70284 (2013).
Article CAS ADS Google Scholar
Hebb, D. The Organization of Behavior (Wiley & Sons, New York, 1949).
Landis, F., Ott, T. & Stoop, R. Hebbian self-organizing integrate-and-fire networks for data clustering. Neur. Comp . 22, 273 (2010).
Article MathSciNet Google Scholar
Ott, T. et al. Sequential superparamagnetic clustering for unbiased classification of high-dimensional chemical data. J. Chem. Inf. Comput. Sci. 44, 1358 (2004).
Article CAS Google Scholar
Gomez, F., Stoop, R. L. & Stoop, R. Universal dynamical properties preclude standard clustering in a large class of biochemical data. Bioinformatics 30, 2486 (2014).
Article CAS Google Scholar
Girvan, M. & Newman, M. E. J. Community structure in social and biological networks. Proc. Natl. Acad. Sci. USA . 99, 7821 (2002).
Article MathSciNet CAS ADS Google Scholar
Palla, G., Derényi, I., Farkas, I. & Vicsek, T. Uncovering the overlapping community structure of complex networks in nature and society. Nature 435, 814 (2005).
Article CAS ADS Google Scholar
Yanai, I., Camacho, C. J. & DeLisi, C. Predictions of gene family distributions in microbial genomes: evolution by gene duplication and modification. Phys. Rev. Lett. 85, 2641 (2000).
Article CAS ADS Google Scholar

Download references

Acknowledgements

This work was supported by the Swiss National Science Foundation (Grant 200021-153542/1 to R.S.).

Author information

Authors and Affiliations

Institute of Neuroinformatics and Institute of Computational Science, University of Zurich and ETH Zurich, Winterthurerstrasse 190, Zurich, 8057, Switzerland
Tom Lorimer, Florian Gomez & Ruedi Stoop

Authors

Tom Lorimer
View author publications
You can also search for this author in PubMed Google Scholar
Florian Gomez
View author publications
You can also search for this author in PubMed Google Scholar
Ruedi Stoop
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.S. and T.L. designed the research, T.L. and F.G. carried out the analysis, R.S. wrote the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Lorimer, T., Gomez, F. & Stoop, R. Two universal physical principles shape the power-law statistics of real-world networks. Sci Rep 5, 12353 (2015). https://doi.org/10.1038/srep12353

Download citation

Received: 23 April 2015
Accepted: 26 June 2015
Published: 23 July 2015
DOI: https://doi.org/10.1038/srep12353

This article is cited by

Fingerprints of a second order critical line in developing neural networks
- Karlis Kanders
- Hyungsub Lee
- Ruedi Stoop
Communications Physics (2020)
Understanding cancer complexome using networks, spectral graph theory and multilayer framework
- Aparna Rai
- Priodyuti Pradhan
- Sarika Jalan
Scientific Reports (2017)
Assortativity and leadership emerge from anti-preferential attachment in heterogeneous networks
- I. Sendiña-Nadal
- M. M. Danziger
- S. Boccaletti
Scientific Reports (2016)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.