Abstract
The Erdős-Rényi (ER) random graph G(n, p) analytically characterizes the behaviors in complex networks. However, attempts to fit real-world observations need more sophisticated structures (e.g., multilayer networks), rules (e.g., Achlioptas processes), and projections onto geometric, social, or geographic spaces. The p-adic number system offers a natural representation of hierarchical organization of complex networks. The p-adic random graph interprets n as the cardinality of a set of p-adic numbers. Constructing a vast space of hierarchical structures is equivalent for combining number sequences. Although the giant component is vital in dynamic evolution of networks, the structure of multiple big components is also essential. Fitting the sizes of the few largest components to empirical data was rarely demonstrated. The p-adic ultrametric enables the ER model to simulate multiple big components from the observations of genetic interaction networks, social networks, and epidemics. Community structures lead to multimodal distributions of the big component sizes in networks, which have important implications in intervention of spreading processes.
Similar content being viewed by others
Introduction
Number theory has a reputation of “unreasonable effectiveness.” Perhaps the most famous example is the Fibonacci numbers, which are closely related to golden ratio, plant growth (Phyllotaxis), and DNA patterns. Over the past years, a few developments have begun to unfold the potentials of number theory in understanding complex networks1,2. This work employs the p-adic numbers for modeling complex networks based on the Erdős-Rényi (ER) random graph. The p-adic number system gives an extension of the ordinary arithmetic of rational numbers (\({\mathbb {Q}}\)) in a way distinct from the common extension of \({\mathbb {Q}}\) to real numbers (\({\mathbb {R}}\)) and complex numbers (\({\mathbb {C}}\)). The p-adic number system can be applied in various scientific fields. One groundbreaking application in physics is the p-adic AdS/CFT3. Khrennikov et al.4 developed p-adic wavelet for modeling reaction-diffusion dynamics. Applications in biology include the models for hierarchical structures of protein5 and genetic code6.
Networks, behaviors, and health
There is growing interest on a unified theory of complex networks from various fields7,8,9. The formalism commenced with graph theory in mathematics. The emergence of giant component is essential for the evaluation of random graphs10,11,12. Network analysis revealed hidden structures in social and economic systems13,14. Exponential random graph15 and stochastic blockmodels16,17 were developed. Many statistical mechanics in physics, e.g., percolation18 and time series19, are modeled with complex networks. Network models are closely associated with epidemiology20,21 and public health22. Chains of affection23 is a classical work revealing the network structure as a critical factor in public health. The spatiotemporal dynamics20 is essential for understanding the epidemic processes.
The complex networks also play an essential role in the microscopic scale. Numerous researches in biology rely on the curation and archival storage of protein, genetic, and chemical interactions for all major model organism species and humans (e.g. BioGRID and STRING database). The disease network24,25, protein-protein interaction network, and gene network26 contributed to the network approaches to life science27. In the near future, the network advances in biology, such as drug targeting28,29 and network medicine30, might be critical for improving our health.
Erdős–Rényi model and extensions
The classical ER model has inspired the continuous development of a rich spectrum of sophisticated models. There are three major approaches:
-
Creating more complex structures such as multilayer networks31,32 and multiplex networks33,34. Complex networks often exhibit community or module structures7.
-
Manipulating the rules for constructing edges. The Achlioptas process picks two candidate edges each time18 for competitive graph-evolution. ER process, Bohman and Frieze process, and product rule process were compared with one another for analyzing explosive percolation35.
-
Projecting graphs onto social, geometric, or geographic structures. Major developments in that aspect include the hyperbolic networks36, spatial preferential attachment37, inhomogeneous random graph38,39, and spatial networks30.
Findings
Common attempts to modeling the hierarchical structures7 are reflected by various notions including subpopulation, subgraph, mixing pattern, community, and module. We postulate that hierarchical structures are naturally encrypted in a standard graph. Consequently, imposing additional structures onto a graph to enrich its behavior is not always necessary. The key is the p-adic absolute value40. According to Ostrowski’s theorem, every non-trivial absolute value on \({\mathbb {Q}}\) is either the usual real absolute value or the p-adic absolute value. Various hierarchical structures can be represented by p-adic integers as nodal indices. The network topology and relative strengths between connections are unified as p-adic distances between numbers.
The p-adic random graph (PARG), probably the simplest model of inhomogeneous networks, offers a flexible method to simulate various observations in complex networks, especially the phenomenon of multiple big components. Degree distribution is a key property for distinguishing random, free-scale, and small world networks. However, PARG indicates that two random graphs with identical degree distribution may produce significantly different component sizes.
We fit PARG to the component size distribution of the genetic interaction networks, and also to the joint distributions of big components in COVID-19 outbreaks. The experiments imply that the community structures are responsible for the multimodal distributions of the sizes of big components. The largest or the second largest component could be more stable at (multiple) specific sizes. Therefore, maintaining a local peak could be valuable for intervening the spreading processes.
p-adic random graph
In contrast to the celebrated ER network, another early prototype of random graph, the Rado graph, has been rarely revisited. The Rado graph employs the binary number system to encode the graph edges, using Ackermann coding of hereditarily finite sets. PARG explores the fundamental nature of integers to encode the probability of connecting a pair of nodes. The p-adic number system extends the ordinary arithmetic of rational numbers3,40. Our PARG model will focus on the p-adic metric on nonnegative integers. An integer’s r-adic (picking a prime number r) absolute value is the reciprocal of the largest power of r that divides it. For example, \(|40|_2=1/8\) (let r=2, then the 2-adic absolute value of 40 equals 1/8), \(|40|_3=1\), \(|40|_5=1/5\). Such absolute value is the most significant example of ultrametrics40. Each node is naturally associated with an integer, i.e., its index in the ordered set of nodes, or an arbitrary (unique) integer can be assigned to each node. The probability of connecting any pair of nodes i, j is proportional to the p-adic closeness between the two nodal indices \(v_i, v_j\) :
for \(i,j\in [1,n],\; i<j\) (assuming \(i<j \Leftrightarrow v_i<v_j\)). \(p^*\) is a constant as the probability in the ER sense. When comparing ER and PARG, we normalize the PARG probability so that \(\sum _{ij} p_{ij} =n(n-1) p^*/2\). As a result, the number of edges in ER equals that in PARG. The p-adic distances encode a hierarchical structure, as shown in the circular tree-map in Fig. 1a. The distances between any pair of numbers from the same small circle and the same big circle are 1/9 and 1/3, respectively. If the two numbers are from different big circles, their distance is 1.
The digit format of a p-adic number is intuitive, for example, \(201_3=2\cdot 3^0 +0\cdot 3^1+1\cdot 3^2=11\). One can construct a set of integers (as the nodal indices) in their digit formats by
where r is the chosen prime number. We call the model a full PARG if \(u_0=u_1=\cdots = u_m=r\) (i.e., \(n= r^{m+1}\)), otherwise, we call it a regular PARG (i.e., \(n= \prod _i u_i\)). The nodal indices can also be arbitrary digits under a given prime r, which leads to a general PARG. The expression (2) facilities the enumeration of hierarchical structures. For example, \(3.2.3_3\) fully describes a hierarchical structure as shown in Fig. 1c. More examples are in illustrated in Fig. 1. A notation such as \(G(3.2.3_3, p^*)\) fully specifies a PARG.
PARG implements a Bernoulli process on all pairs from a set of p-adic numbers. Let \(p_k\) represent the probability of a randomly chosen node with degree k. In the ER model, \(p_k\) follows a Binomial distribution. It becomes a Poisson distribution in the limit of large n. By contrast, n in PARG equals the number of individuals in observations. Because of the symmetric connectivity in regular PARG, \(p_k\) can be obtained from the degree distribution of one node:
where \(\sum _{i=0}^m d_i =n-1\) holds. \(d_i\) denotes the number of links (from the chosen node) with probability \(r^ip^*\) according to (1-2). The numbers \(\alpha _0,\alpha _1,\cdots ,\alpha _m\) denote all combinations satisfying \(k=\sum _{i=0}^m \alpha _i\). Numeric computing of (3) indicates that \(p_k\) in PARG and that in ER are almost identical. The two models can have the same degree distribution and the same number of edges. However, there could be significant differences in the size distributions of big components in PARG and ER (Fig. 2, Supplementary Note 1, Table S1 and S2). The largest components in PARG may exhibit multimodal distribution due to the hierarchical structure. This implies that certain sizes of (multiple) big components are more statistically stable than other sizes.
Sizes of big components
The relative sizes of multiple big components in complex networks were much less studied compared to the studies carried out on the giant component12,41. Analytical methods11,42 and generating functions10 have been widely employed for analyzing the component sizes. Rather than let \(n\rightarrow \infty\), n in PARG is equal to the number of relevant individuals in observations. As a result, the sizes of simulated components are similar to that in ground truth. When n is finite, numeric random realizations are suited to evaluate the various probabilities about component sizes (see “Methods” section).
When n is fixed, fitting ER model to empirical data only involves the single parameter p, while PARGs involve two kinds of parameters: the probability p and the hierarchical structures represented by (2). The configuration space of (2) is very vast, even when \(n<1000\), so we opted for ad hoc heuristics to choose the hierarchical structures that fit relatively well to observations. The heuristics includes: (1) scaling the distances. For example, \(7.5.6_r\) with \(r=7, 11, 13,\cdots\) represent the same hierarchical structure, though the distances between the levels are scaled. (2) Flat vs. deep hierarchy. For instance, both \(16.16_{17}\) and \(2.2.2.2.2.2.2.2_2\) refer to 256 nodes. The former is made of 16 groups (each contains 16 nodes), while the latter’s structure looks like a high tree.
Based on the hierarchical structures in PARG, the following experiments analyze multiple big components, especially \(5|C_2|>|C_1|\), as static structures observed in networks. The topics range from microscopic networks, such as biological networks26,28, to macroscopic networks, such as epidemics21,43.
Genetic interaction networks
The essential role of genetic interaction networks plays in biology has been lately revealed25. The essential genetic interaction network of yeast genes (theCellmap.org) contains 1,261 mutant strains. Their interactions have been characterized by Pearson correlation coefficient (PCC). Genes with highly correlated genetic interaction profiles (PCC>0.4) form clusters of specific pathways or protein complexes26. We set three PCC thresholds above 0.4 to obtain three graphs with distinct big components, as shown in Fig. 3g–i. We count the components sizes falling into the predefined intervals of Fibonacci numbers (\(b_{i+1}=b_i+b_{i-1}, b_0=3\)). Let \(\theta _i\) be the number of simulated components whose sizes fall between \(b_i\) and \(b_{i+1}\), and \(\theta _i^*\) be that in ground truth. The error of a random realization is given by
The averaged error from many random realizations yields a relatively accurate evaluation. ER and PARG with distinct values of np lead to different errors (Fig. 3a–c). Equipped with (configurable) hierarchical structures, PARGs fit better to observations than ER. The component size distributions of the best fits are shown in Fig. 3d−f.
Protein-protein interaction
Exploring the protein interaction networks of proteins poses a major challenge in biomedicine. Protein-protein interaction (PPI) is crucial to understanding cellular pathways and human diseases44. The following experiment creates a graph from a set of 408 S. cerevisiae protein complexes as45. The graph nodes represent individual proteins from these complexes. An edge is constructed between two nodes (proteins) if they belong to the same complex. The graph includes 1,628 nodes and 11,249 edges.
We define the similar metric \(S_{duo}\) and \(S_{tri}\) (see “Methods” section) to compare the simulated component sizes with the ground truth. PARG fits better (\(S_{tri}\)=0.307) than the ER model (\(S_{tri}\)=0.112) to the PPI network, as shown in Fig. 4. The simulation data can be found in supplementary Table S3 and S4. It means that the chosen PARG has a higher probability that the sizes of its big components resemble those in the PPI network.
Instrumental resource street network
There has been growing attention to the impact of social networks on health46. For example, homeless youth is an active research field, including analysis and interventions. Social networks of homeless youth47,48 are vital for understanding and intervening the observed phenomena.
This experiment involves a social network about employment services utilization among homeless youth. The original research49 queried 136 homeless youth in Los Angeles in 2008. Four distinct networks were constructed from the same population, according to instrumental, emotional, employment services use, and sociometric relationship respectively. Only the instrumental network (\(|C_1^*|\)=30, \(|C_2^*|\)=13) satisfies \(5|C_2^*|>|C_1^*|\), i.e., the second largest component is large enough. Regarding the similarity metric \(S_{duo}\), the ER model attains the maximum similarity (\(S_{duo}\)=0.155) at \(p=1.08/n\); while \(G(10.14_{353}, 1.75/n )\) has a much higher similarity (\(S_{duo}\) =0.316). The community structure in the PARG corresponds to the social or geographical networks of the homeless youth; although the map between the two is still elusive.
We also fit the hyperbolic networks36 and the Achlioptas process35 to the observed component sizes (\(|C_1^*|\)=30, \(|C_2^*|\)=13). Details can be found in “Methods” section. The random hyperbolic graph50,51 reaches the maximum similarity (\(S_{duo}\)=0.266) when \(C=-9.3\), \(\alpha =10\), \(D=0.0684R\). The Achlioptas process with product rule (PR) has the maximum similarity (\(S_{duo}\)=0.221) when the number of edges is equal to 100. So PARG outperforms the other two models in this case.
Spreading of coronavirus
Coronavirus52 has spread among many Chinese cities since the end of January 2020. The incubation period53 and possible mild symptoms54 made the prevention more complicated. Social networking sites (or local officials) reported traces of infected people. Relationships between the infected (and those who had close contacts with them) were also investigated. From a point of view of networks, the cities exhibited three distinct patterns: 1) No big components. Shenzen reported 416 confirmed cases by February 20, 2020. The largest cluster has only 9 people. 2) A giant component. Xinyu reported 110 confirmed cases by February 10, 2020. The giant component consists of 52 cases, nearly half of the infected population. The second largest component has only six cases. 3) Multiple big components. Tianjin reported 136 confirmed cases by February 3, 2020. The first, second, and third largest clusters contain 44, 17 and 11 cases, respectively, which are related to a huge department store, the railway, and a residential area, respectively.
We focus on Tianjin’s infection network (Supplementary Note 1, Table 5), which consists of multiple big components. A graph is created to visualize the relationship between the infected when the outbreak was around its peak, as shown in Fig. 5. The simulation data can be found in Supplementary Table S6 and S7. The similarity metric \(S_{tri}\) could be biased when \(|C_3^*|\) is quite smaller than \(|C_1^*|\), so the similarity metric \(T_{tri}\) (see “Methods” section) is employed in this case to fit the models to the observed clusters, as shown in Fig. 6. The PARG \(G(8.17_{59}, 1.49/n)\) has the highest similarity \(T_{tri}\)=0.00916.
We also compared the results of the random hyperbolic graph and the Achlioptas process with that of PARG. The random hyperbolic graph reaches the maximum similarity (\(T_{tri}\)=0.0150) when \(C=-9.65\), \(\alpha =1\), \(D=0.04294R\). The Achlioptas process with Bohman Frieze (BF) rule has the maximum similarity (\(T_{tri}\)=0.00825) when the number of edges is equal to 93. So the random hyperbolic graph fits best to this case.
A later investigation indicates that the 11 cases (in yellow, Fig. 5) are probably related to the department store as well. In this new perspective, the two big clusters form the largest component of 55 nodes. We employ the metric \(T_{duo}\) to fit modes to this new observation (\(|C_1^*|\)=55, \(|C_2^*|\)=17), as shown in Fig 7. The simulation data can be found in Supplementary Table S8 and S9. The great variety of hierarchical structures enable PARG to fit relatively well to observations from distinct perspectives.
Discussion
The size distributions of big components in complex networks are attributed to the structure of physical world; the behaviors of agents (nodes) and the information transmitting between them; and the observer (how to look at the events). The ER model offered prominent findings of component sizes in networks, however, it rarely fits the joint distribution P(\(|C_1|\approx |C_1^*|\), \(|C_2|\approx |C_2^*|\), \(\cdots\)) in real observations. A successful strategy is introducing inhomogeneous structures or selective rules (for constructing edges) to the random graph to increase its versatility. PARG probably provides the simplest way to fully describe a hierarchical structure in an ER-like model.
PARG interprets the n in ER model as the cardinality of a set of natural numbers (nodal indices). Consequently, the probability p can be weighted by distances between the nodal indices. In number theory (Ostrowski’s theorem), any non-trivial definition of absolute value on \({\mathbb {Q}}\) is either the conventional one or the p-adic absolute value. So, the p-adic ultrametric reveals the natural hierarchical structures hidden in any graph with indexed nodes. PARG blurs the boundaries between the topology approaches (e.g., multiplex networks) and the geometric approaches (e.g., hyperbolic networks).
In our PARG approach, n denotes the number of observed individuals, whereas in previous ER-like models \(n\rightarrow \infty\). The limit of n facilities analytical approaches10,11, while a relatively small finite n is convenient for numerical random realizations. Random graph theories such as explosive percolation35 and synchronization55, deeply revealed the dynamics of the emerged components. By contrast, this work explores the sizes of resultant (static) components from observations or simulations. The results imply that the proportions between the big component sizes are closely associated with the hierarchical structures of complex networks. This implication is in contrast to previous emphasis10,27,56 on degree distribution as the fingerprint of network structures.
We fit PARGs and other random graphs to observations from various types of networks. The PARG outperforms the ER model and the Achlioptas process. The random hyperbolic model fits better than PARG to certain cases but worse than PARG to other cases. The simulations of PARG show that the size of big components, e.g., \(P(|C_1|=x)\) and \(P(|C_2|=x)\), can exhibit multimodal distribution (e.g., Figs. 2b and 7(b)) due to their modular structures. In the case of multimodal distribution, the first peak of \(P(|C_2|=x)\) could be very close to the last peak of \(P(|C_1|=x)\). Thus, one may distinguish the major mode from the minor modes when investigating the giant component. One present challenge in network epidemic modeling57 is designing network-based interventions. Current strategies include targeting high-degree nodes or central nodes. The multimodal distribution of \(P(|C_i|=x)\) has implications in controlling the spreading processes in networks. Since \(P(|C_i|=x)\) has more than one local peak, it might be possible to predict and maintain the big components’ growth around a local peak.
Methods
Random graph implementation
All random graphs are created through random experiments implemented in the Java programming language (Java 1.8 with Eclipse IDE). Given a connecting probability p (in the ER context), an edge is included in the graph if
where rand stands for a random number between 0 and 1, generated by the method nextDouble() from the java.util.Random class. The method generates a stream of pseudorandom numbers via linear congruential generator (LCG) with modulus \(2^{48}\).
A disjoint-set data structure (Union-Find algorithm) is employed to find all components in graph. N random realizations of ER or PARG yield an ensemble of binary numbers
for \(x=1,2,\cdots , n\). \(|C_i^j|\) denotes the size of the ith largest component in the jth realization. Then one can evaluate the size distribution of the big components by
One can choose an a prior function to measure the similarity between the simulated components sizes and the observed sizes. For example, Poisson distribution can be used to define the similarity between \(|C_i^j|\) and the observation :
or one can use the normal distribution:
where s is a constant (typically \(s=0.1\), so that 10% deviation from the truth results in 1/2 similarity). Simulations in this work employed the later definition. The probability distribution of \(|C_2|\) under the condition \(|C_1|\approx |C_1^*|\) is given by
Likewise, \(P(|C_3|=x \; \big |\; |C_1| \approx |C_1^*| ) =\frac{1}{N} \sum _{j=1}^N \Psi _1^j \delta _{3x}^j\).
We define the objective function
for a random graph to measure whether its first two/three largest component sizes are close to that in observation. When \(|C_2^*|\) or \(|C_3^*|\) is much smaller than \(|C_1^*|\), the following metric would be more appropriate:
Genetic interaction network of yeast genes
The data of yeast genes is from https://thecellmap.org/ costanzo2016/. The networks in Fig. 3g–i are drawn from the data of the Essential \(\times\) Essential network, which involves 1,261 mutant strains. So, each network in Fig. 3g–i consists of 1261 nodes. The PPC values between the mutant strains are obtained from the genetic interaction profile similarity matrices. An edge is included in the graph if the corresponding PPC value is above the predefined threshold. The graphs are projected onto squares using our Java program, as shown in Fig. 3g–i. In a dynamical process, the agents (nodes) push away from each other, while each edge drags the two end nodes into a fixed range. The color (hue) of edges indicates the size of the relevant component.
Protein network of yeast genes
The data of S. cerevisiae protein complexes is obtained from the additional File 1 of45. We programmed a Java application to read the table, construct the 1,628 nodes (proteins) and 11,249 edges (a pair of nodes belong to a same complex), and find the components via Union-Find algorithm.
Random hyperbolic graph
Our implementation follows the formulation in50,51. \(R=2\ln n + C\) denotes the radius of the disc. The probability density for the radial coordinate r of a point \((r, \phi )\) is given by
We use inverse transform sampling to generate the radii of the points, i.e., \(r= \frac{1}{a} arcosh (1+ \cosh (\alpha R) x -x)\) where \(x\in (0,1)\) denotes a random number from the uniform distribution. Our experiments generate x by nextDouble() in Java. For each pair of nodes u, v, a link is added to the graph if \(d(u,v) < D\) where \(D \in (0,R]\) is a constant and d(u, v) denotes the distance between the two points in the hyperbolic space.
Achlioptas process
At each time step of the Achlioptas process18,35, two edges \(e_1\) and \(e_2\) compete for addition. Suppose \(e_1\) involves two components of size \(|C_a|,|C_b|\); \(e_2\) involves two components of size \(|C_c|,|C_d|\), we consider three types of competing rules:
-
add \(e_1\) if \(|C_a|+|C_b| < |C_c|+|C_d|\), add \(e_2\) otherwise.
-
product rule (PR): add \(e_1\) if \(|C_a||C_b| < |C_c||C_d|\), add \(e_2\) otherwise.
-
Bohman Frieze (BF) rule: add \(e_1\) if \(|C_a|=|C_b| =1\), add \(e_2\) otherwise.
Our experiment treats the number of edges (added to the graph) as the parameter.
Conclusions
A number theory approach to random graph is proposed. A set of n random numbers generates an n by n adjacency matrix whose binary elements follow the probability (1). Thus, a hierarchical structure is implemented through ultrametrics40,58. The simplicity of the digit form (2) for hierarchical structures of random graph facilitates the enumeration of different setting of clusters and hierarchies. In contrast to mapping complex structures or rules from real world to random graph, our PARG approach explores the complex structures in numbers2,59 which might be rich enough for modeling complicated observations. An alternative point of view suggests that a plain graph can unfold a hidden hierarchical structure, based on an indispensable definition of absolute value.
The proposed PARG model is more abstract than multilayer networks, multiplex networks, and social-geographical models, but more concrete than ER-like models without community structures. Therefore, a future framework of research would consist of two interconnected levels: (1) Searching for hyper parameters, such as hierarchical structures, in PARG, given empirical data and (2) Constructing the ad hoc realistic models, for example, the biomolecular environments in cell or the social-geographical structures in a city, which the PARG model is projected onto.
References
Shekatkar, S. M., Bhagwat, C. & Ambika, G. Divisibility patterns of natural numbers on a complex network. Sci. Rep. 5, 14280 (2015).
Yan, X.-Y., Wang, W.-X., Chen, G.-R. & Shi, D.-H. Multiplex congruence network of natural numbers. Sci. Rep. 6, 1–8 (2016).
Gubser, S. S., Knaute, J., Parikh, S., Samberg, A. & Witaszczyk, P. p-adic AdS/CFT. Commun. Math. Phys. 352, 1019–1059 (2017).
Khrennikov, A., Oleschko, K. & López, M. d. J. C. Application of p-adic wavelets to model reaction–diffusion dynamics in random porous media. J. Fourier Anal. Appl. 22, 809–822 (2016).
Avetisov, V. & Bikulov, A. Protein ultrametricity and spectral diffusion in deeply frozen proteins. Biophys. Rev. Lett. 3, 387–396 (2008).
Dragovich, B. & Mišić, N. Ž. p-adic hierarchical properties of the genetic code. Biosystems (2019).
Newman, M. E. Communities, modules and large-scale structure in networks. Nat. Phys. 8, 25–31 (2012).
Vespignani, A. Modelling dynamical processes in complex socio-technical systems. Nat. Phys. 8, 32–39 (2012).
Cimini, G. et al. The statistical physics of real-world networks. Nat. Rev. Phys. 1, 58–71 (2019).
Newman, M. E., Strogatz, S. H. & Watts, D. J. Random graphs with arbitrary degree distributions and their applications. Phys. Rev. E 64, 026118 (2001).
Bollobás, B. & Riordan, O. An old approach to the giant component problem. J. Comb. Theory B 113, 236–260 (2015).
Tishby, I., Biham, O., Katzav, E. & Kühn, R. Revealing the microstructure of the giant component in random graph ensembles. Phys. Rev. E 97, 042318 (2018).
Gao, J., Zhang, Y.-C. & Zhou, T. Computational socioeconomics. Phys. Rep. (2019).
Lambiotte, R., Rosvall, M. & Scholtes, I. From networks to optimal higher-order models of complex systems. Nat. Phys. 15, 313–320 (2019).
Robins, G., Pattison, P., Kalish, Y. & Lusher, D. An introduction to exponential random graph (p*) models for social networks. Soc. Netw. 29, 173–191 (2007).
Karrer, B. & Newman, M. E. Stochastic blockmodels and community structure in networks. Phys. Rev. E 83, 016107 (2011).
Abbe, E. Community detection and stochastic block models: recent developments. J. Mach. Learn. Res. 18, 6446–6531 (2017).
Achlioptas, D., D’Souza, R. M. & Spencer, J. Explosive percolation in random networks. Science 323, 1453–1455 (2009).
Donner, R. V. et al. Recurrence-based time series analysis by means of complex network methods. Int. J. Bifurc. Chaos 21, 1019–1046 (2011).
Pastor-Satorras, R., Castellano, C., Van Mieghem, P. & Vespignani, A. Epidemic processes in complex networks. Rev. Mod. Phys. 87, 925 (2015).
Stegehuis, C., Van Der Hofstad, R. & Van Leeuwaarden, J. S. Epidemic spreading on complex networks with community structures. Sci. Rep. 6, 1–7 (2016).
Luke, D. A. & Harris, J. K. Network analysis in public health: history, methods, and applications. Annu. Rev. Public Health 28, 69–93 (2007).
Bearman, P. S., Moody, J. & Stovel, K. Chains of affection: The structure of adolescent romantic and sexual networks. Am. J. Sociol. 110, 44–91 (2004).
Goh, K.-I. et al. The human disease network. Proc. Natl. Acad. Sci. USA 104, 8685–8690 (2007).
Huttlin, E. L. et al. Architecture of the human interactome defines protein communities and disease networks. Nature 545, 505–509 (2017).
Costanzo, M. et al. A global genetic interaction network maps a wiring diagram of cellular function. Science 353, aaf1420 (2016).
Barabasi, A.-L. & Oltvai, Z. N. Network biology: Understanding the cell’s functional organization. Nat. Revi. Genet. 5, 101–113 (2004).
Yıldırım, M. A., Goh, K.-I., Cusick, M. E., Barabási, A.-L. & Vidal, M. Drug-target network. Nat. Biotechnol. 25, 1119–1126 (2007).
Csermely, P., Korcsmáros, T., Kiss, H. J., London, G. & Nussinov, R. Structure and dynamics of molecular networks: A novel paradigm of drug discovery: A comprehensive review. Pharmacol. Ther. 138, 333–408 (2013).
Barthélemy, M. Spatial networks. Phys. Rep. 499, 1–101 (2011).
Salehi, M. et al. Spreading processes in multilayer networks. IEEE Trans. Netw. Sci. Eng. 2, 65–83 (2015).
De Domenico, M., Granell, C., Porter, M. A. & Arenas, A. The physics of spreading processes in multilayer networks. Nat. Phys. 12, 901–906 (2016).
Lee, K.-M., Min, B. & Goh, K.-I. Towards real-world complexity: an introduction to multiplex networks. Eur. Phys. J. B 88, 48 (2015).
Kleineberg, K.-K., Boguná, M., Serrano, M. Á. & Papadopoulos, F. Hidden geometric correlations in real multiplex networks. Nat. Phys. 12, 1076–1081 (2016).
D’Souza, R. M. & Nagler, J. Anomalous critical and supercritical phenomena in explosive percolation. Nat. Phys. 11, 531–538 (2015).
Krioukov, D., Papadopoulos, F., Kitsak, M., Vahdat, A. & Boguná, M. Hyperbolic geometry of complex networks. Phys. Rev. E 82, 036106 (2010).
Jacob, E. et al. Spatial preferential attachment networks: Power laws and clustering coefficients. Ann. Appl. Probab. 25, 632–662 (2015).
Söderberg, B. General formalism for inhomogeneous random graphs. Phys. Rev. E 66, 066121 (2002).
Bringmann, K., Keusch, R. & Lengler, J. Geometric inhomogeneous random graphs. Theor. Comput. Sci. 760, 35–54 (2019).
Dragovich, B., Khrennikov, A. Y., Kozyrev, S., Volovich, I. & Zelenov, E. p-adic mathematical physics: the first 30 years. p-Adic Number. Ultramet. Anal. Appl. 9, 87–121 (2017).
Berchenko, Y., Artzy-Randrup, Y., Teicher, M. & Stone, L. Emergence and size of the giant component in clustered random graphs with a given degree distribution. Phys. Rev. Lett. 102, 138701 (2009).
Molloy, M. & Reed, B. The size of the giant component of a random graph with a given degree sequence. Comb. Probabi. Comput. 7, 295–305 (1998).
Danon, L. et al. Networks and the epidemiology of infectious disease. Interdiscip. Perspecti. Infect. Dis. 2011, (2011).
Snider, J. et al. Fundamentals of protein interaction network mapping. Mol. Syst. Biol. 11, (2015).
Vlasblom, J. & Wodak, S. J. Markov clustering versus affinity propagation for the partitioning of protein interaction graphs. BMC Bioinf. 10, 99 (2009).
Smith, K. P. & Christakis, N. A. Social networks and health. Annu. Rev. Sociol. 34, 405–429 (2008).
Wenzel, S. et al. Social networks of homeless youth in emerging adulthood. J. Youth Adolesc. 41, 561–571 (2012).
De la Haye, K. et al. Who is supporting homeless youth? predictors of support in personal networks. J. Res. Adolesc. 22, 604–616 (2012).
Barman-Adhikari, A. & Rice, E. Social networks as the context for understanding employment services utilization among homeless youth. Eval. Prog. Plan. 45, 90–101 (2014).
Gugelmann, L., Panagiotou, K. & Peter, U. Random hyperbolic graphs: degree sequence and clustering. In International Colloquium on Automata, Languages, and Programming, 573–585 (Springer, 2012).
Friedrich, T. & Krohmer, A. Cliques in hyperbolic random graphs. In 2015 IEEE Conference on Computer Communications (INFOCOM), 1544–1552 (IEEE, 2015).
Guan, W.-J. et al. Clinical characteristics of coronavirus disease 2019 in China. N. Engl. J. Med. 382, 1708–1720 (2020).
Lauer, S. A. et al. The incubation period of coronavirus disease 2019 (covid-19) from publicly reported confirmed cases: estimation and application. Ann. Intern. Med. (2020).
Huang, C. et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. The Lancet 395, 497–506 (2020).
Boccaletti, S. et al. Explosive transitions in complex networks’ structure and dynamics: Percolation and synchronization. Phys. Rep. 660, 1–94 (2016).
Liu, Y.-Y., Slotine, J.-J. & Barabási, A.-L. Controllability of complex networks. Nature 473, 167–173 (2011).
Pellis, L. et al. Eight challenges for network epidemic models. Epidemics 10, 58–62 (2015).
Hughes, B. Trees, ultrametrics, and noncommutative geometry. Pure Appl. Math. Q. 8, 221–312 (2012).
García-Pérez, G., Serrano, M. Á. & Boguná, M. Complex architecture of primes and natural numbers. Phys. Rev. E 90, 022806 (2014).
Acknowledgements
This research is supported by National Key Research and Development Program of China (2019YFD1100805).
Author information
Authors and Affiliations
Contributions
L.H. introduced the number theory. H.H. made the compute programs wrote the main manuscript text.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hua, H., Hovestadt, L. p-adic numbers encode complex networks. Sci Rep 11, 17 (2021). https://doi.org/10.1038/s41598-020-79507-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-020-79507-4
This article is cited by
-
p-adic Cellular Neural Networks
Journal of Nonlinear Mathematical Physics (2022)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.