Abstract
Investigation of divisibility properties of natural numbers is one of the most important themes in the theory of numbers. Various tools have been developed over the centuries to discover and study the patterns in the sequence of natural numbers in the context of divisibility. In the present paper, we study the divisibility of natural numbers using the framework of a growing complex network. In particular, using tools from the field of statistical inference, we show that the network is scale-free but has a non-stationary degree distribution. Along with this, we report a new kind of similarity pattern for the local clustering, which we call “stretching similarity”, in this network. We also show that characteristics like the average degree, global clustering coefficient and assortativity coefficient of the network vary smoothly with the size of the network. Using analytical arguments we estimate the asymptotic behavior of the global clustering coefficient and the average degree, which is validated using numerical analysis.
Introduction
The study of complex networks has become a very important part of many disciplines like information^{1}, technology^{2}, social sciences^{3}, ecology^{4} and biology^{5,6,7}. The characterization of the structure of real networks is an indispensable part of this study. Despite being random, real networks show certain statistical properties which set them apart from their completely random mathematical counterparts. This hints at underlying organizing principles which shape the structures of real networks^{8}. In particular, many real networks are scale-free, which means that the distribution of the degrees of their nodes follows a power law^{8,9}. The density of triangles is another important characteristic of networks, measured using a quantity called the clustering coefficient. Empirical studies show that real networks are highly clustered compared to completely random mathematical models like the Erdős–Rényi graph^{8,9}.
In the present paper, we report an analysis of a particular deterministic network that resembles real networks in many aspects. This network consists of the natural numbers 1, 2, 3, … as nodes, and if a given number divides another, then their corresponding nodes are connected by an undirected link. The network thus constructed, though deterministic, can be studied on an equal footing with random networks because the irregular distribution of primes makes the divisibility relations themselves irregular. It is helpful to view this network as a growing network where nodes are added one at a time. A similar network with composite numbers as nodes has already been studied^{10}. Also, a directed network of natural numbers based on divisibility, which includes only the multiples in the pattern, has been reported by Dinghua et al.^{11}. A bipartite structure separating composite and prime numbers, with weighted links between them based on divisibility, has been analyzed by García-Pérez et al.^{12}.
In the present work we consider a more general setup where we put all the natural numbers on a complex network with their divisibility relations as the underlying deterministic rule of connections. Here the network is undirected, with links to both divisors and multiples. Using tools from statistical inference, we confirm that this network is scale-free and show that the average degree, global clustering coefficient and assortativity coefficient vary smoothly with the size of the network. This is surprising in view of the fact that the distribution of primes is quite irregular in the sequence of natural numbers. We provide analytical results for the asymptotic behavior of the average degree and global clustering coefficient of this network. In particular, we show that the global clustering coefficient of this network decays to zero whereas the average degree increases logarithmically. We also report an interesting and novel similarity exhibited by the local clustering coefficients of nodes in this network, which we call “stretching similarity”.
The rest of the paper is organized as follows. In the next section we describe the construction of the network and show that the network is scale-free. We then describe the existence of stretching similarity in this network. Finally we show the behavior of the average degree, global clustering coefficient and assortativity coefficient as a function of the size of the network and analytically obtain the asymptotic trends for the average degree and clustering.
Results
Construction of the network and its scaling properties
The nodes of the present network are natural numbers 1, 2, 3, … and there is a link between two nodes if either divides the other. We avoid self-links and all the links are undirected. Since the sequence of natural numbers has a natural ordering, it is helpful to view this network as a growing network with the addition of a new node at each discrete time as follows:
At time t = 1 the network starts with a single node n = 1, and at every subsequent time t a node with the number n = t is added to the network.
This node connects to all the existing nodes whose numbers divide it.
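The growth rule above can be sketched in a few lines of Python. This is a minimal illustration for small sizes, not the implementation used for the reported results; the function name `grow_divisibility_network` and the adjacency-set representation are assumptions made here.

```python
def grow_divisibility_network(n_max):
    """Grow the network one node at a time; return adjacency sets for nodes 1..n_max."""
    adj = {}
    for t in range(1, n_max + 1):
        adj[t] = set()
        # The new node t links to every existing node that divides it.
        # Proper divisors of t never exceed t // 2.
        for d in range(1, t // 2 + 1):
            if t % d == 0:
                adj[t].add(d)
                adj[d].add(t)
    return adj


adj = grow_divisibility_network(16)
print(sorted(adj[12]))                         # neighbors of node 12 at N = 16
print(sum(len(v) for v in adj.values()) // 2)  # total number of links
```

The trial-division loop costs on the order of N^2 operations, so for large N a sieve over multiples of each node would be the more practical choice.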
The network thus constructed is shown in Fig. 1 at two different times t = 16 and t = 32, which correspond to networks of size N = 16 and N = 32 respectively. To find the distribution of degrees of this network, we grow the network till the size reaches N = 2^{25} = 33,554,432. The resulting distribution shown in Fig. 2 seems to follow a power law asymptotically. Using the method of maximum likelihood we find that the scaling index α ~ 2. We establish the existence of a power law in the distribution (and hence the fact that this network is scale-free) using the approach described in Clauset et al.^{13} (see Methods). We also study the scaling behavior of the local clustering coefficient with degree. The local clustering coefficient of a node is defined as the fraction of the possible edges among its neighbors that are actually present. For node i with degree k_{i} this can be written as^{14}:
$$c_{i} = \frac{2E_{i}}{k_{i}(k_{i}-1)} \tag{1}$$
where E_{i} is the actual number of edges among the neighbors of node i.
In Fig. 3 we show the dependence of the local clustering coefficient of nodes in the network on the degree. It can be seen that the asymptotic behavior is compatible with a power law with exponent −1. This behavior is similar to the one usually observed in real networks^{14}.
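The local clustering coefficient can be computed directly from its definition for small networks. The sketch below is illustrative only; the helper names `divisibility_network` and `local_clustering`, and the convention of assigning 0 to nodes with fewer than two neighbors, are assumptions made here.

```python
from itertools import combinations


def divisibility_network(n_max):
    """Adjacency sets of the divisibility network on nodes 1..n_max."""
    adj = {i: set() for i in range(1, n_max + 1)}
    for d in range(1, n_max + 1):
        for multiple in range(2 * d, n_max + 1, d):
            adj[d].add(multiple)
            adj[multiple].add(d)
    return adj


def local_clustering(adj, i):
    """Fraction of pairs of neighbors of i that are themselves linked."""
    nbrs = adj[i]
    k = len(nbrs)
    if k < 2:
        return 0.0  # assumed convention for nodes with fewer than two neighbors
    links = sum(1 for u, v in combinations(nbrs, 2) if v in adj[u])
    return 2.0 * links / (k * (k - 1))


adj = divisibility_network(16)
for node in (1, 6, 8):
    print(node, len(adj[node]), local_clustering(adj, node))
```

For N = 16, node 8 has neighbors 1, 2, 4 and 16, which all divide one another, so its clustering coefficient is 1, while the hub node 1 has a much smaller value.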
Stretching similarity of local clustering
We now discuss an interesting behavior that sets the network of natural numbers apart from other complex networks. In the network presented here, each node has an identity, namely the number attached to it, and this defines a natural order on the nodes. This means that we can study various properties of nodes as a function of their labels. This is not possible for other networks because no such unique labeling exists for their nodes. Here we specifically consider the local clustering coefficient of nodes and study its behavior as a function of the node index. We find that the clustering coefficient c_{i} of node i varies seemingly irregularly. However, when c_{i} is plotted against i, a global pattern is seen. In Fig. 4 we show this pattern for three different network sizes. For better visualization, the plots are shown only for relatively small network sizes. From the figure, it is clear that the global pattern of the local clustering coefficient gets stretched as the size of the network increases, such that the nature of the pattern remains the same. We call this new kind of similarity “stretching similarity”; it seems to be a unique feature of this network, not so far reported for any other complex network. We note from the plots in Fig. 4 that for a network of size N some discontinuous vertical steps occur approximately at the values N/2, N/3, N/4, …. Also, we observe a band of numbers with clustering coefficient 1 between N/3 and N/2, and these numbers correspond to the prime numbers and prime powers in that range. This can be seen by the following argument. Consider any prime number p in the interval (N/3, N/2). On the lower side, it is connected only to 1, while on the upper side it is connected only to its multiples. However, every number in this range has only one multiple, 2p, up to N. Thus, the three numbers 1, p, 2p form a triangle and hence the clustering coefficient of p must be 1.
A similar argument for prime powers in this range shows that they also have clustering coefficient 1. There is another band of numbers with clustering coefficient exactly 0 between N/2 and N, which are also prime numbers. This is because all the primes in this range are connected only to 1, making their clustering 0.
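The two bands described above can be checked numerically on a small network. The following is a sketch under stated assumptions: N = 600 is an arbitrary illustrative size, the helper names are hypothetical, and nodes with a single neighbor are assigned clustering 0 by convention.

```python
from itertools import combinations


def divisibility_network(n_max):
    adj = {i: set() for i in range(1, n_max + 1)}
    for d in range(1, n_max + 1):
        for multiple in range(2 * d, n_max + 1, d):
            adj[d].add(multiple)
            adj[multiple].add(d)
    return adj


def local_clustering(adj, i):
    nbrs = adj[i]
    k = len(nbrs)
    if k < 2:
        return 0.0  # assumed convention for degree-1 nodes
    links = sum(1 for u, v in combinations(nbrs, 2) if v in adj[u])
    return 2.0 * links / (k * (k - 1))


def is_prime(n):
    return n > 1 and all(n % d for d in range(2, int(n ** 0.5) + 1))


N = 600  # arbitrary illustrative size
adj = divisibility_network(N)
band_one = [p for p in range(N // 3 + 1, N // 2 + 1) if is_prime(p)]
band_zero = [p for p in range(N // 2 + 1, N + 1) if is_prime(p)]

print(all(local_clustering(adj, p) == 1.0 for p in band_one))   # primes in (N/3, N/2]
print(all(local_clustering(adj, p) == 0.0 for p in band_zero))  # primes in (N/2, N]
print(local_clustering(adj, 289))  # 289 = 17**2, a prime power in (N/3, N/2]
```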
Now we discuss the local clustering coefficient for the composite numbers between N/2 and N. For a vertex n, the only neighbors are the proper divisors of n i.e. m such that 1 ≤ m < n and n is divisible by m.
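Let n = p_{1}^{j_{1}} p_{2}^{j_{2}} ⋯ p_{k}^{j_{k}} be the factorization of n as a product of (distinct) prime powers. The fundamental theorem of arithmetic states that such a factorization is unique up to a reordering of the primes p_{i}. It can be observed that every divisor m of n is of the form m = p_{1}^{a_{1}} p_{2}^{a_{2}} ⋯ p_{k}^{a_{k}} where 0 ≤ a_{i} ≤ j_{i} for every 1 ≤ i ≤ k.
Any two nodes m = p_{1}^{a_{1}} ⋯ p_{k}^{a_{k}} and m′ = p_{1}^{a′_{1}} ⋯ p_{k}^{a′_{k}} such that m < m′ are adjacent to each other if and only if a_{i} ≤ a′_{i} for all i.
Thus the clustering coefficient of n in the network of size N is given by,
$$c_{n} = \frac{2\left[\prod_{i=1}^{k}\binom{j_{i}+2}{2} - 2\prod_{i=1}^{k}(j_{i}+1) + 1\right]}{D\,(D-1)} \tag{2}$$
where
$$D = \prod_{i=1}^{k}(j_{i}+1) - 1 \tag{3}$$
is the number of proper divisors of n, that is, the degree of node n.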
From the above expression it follows that the value of c_{n} depends only on the number of distinct prime factors of n and the powers j_{i} which appear in the prime factorization of n, but not on the actual primes which appear there. Thus for any given j_{1}, j_{2}, …, j_{k}, the value c_{n} is constant for every n in the range N/2 < n ≤ N such that n = p_{1}^{j_{1}} p_{2}^{j_{2}} ⋯ p_{k}^{j_{k}} for some set of k distinct primes p_{1}, p_{2}, …, p_{k}. This explains the occurrence of horizontal dotted lines in the plot of local clustering coefficients.
Similarly, the clustering coefficients for other n can be computed, and it can be observed that they depend on the powers and the number of distinct prime factors of n as well as on the range to which n belongs, that is, on the integer r such that N/(r + 1) < n ≤ N/r. Here one also has to consider the number of multiples of n in the range 1, 2, …, N. This leads to possibly different values of the clustering coefficients. This explains the occurrence of demarcated regions like N/2 to N, N/3 to N/2, N/4 to N/3 etc. in the plot of local clustering coefficients. For any N there will be a sufficient number of primes in the range [1, N/2] and choices for j_{i} such that the pattern of horizontal lines between N/2 and N remains the same. Also, the demarcated regions have similar structures. This provides a possible explanation for the observed stretching similarity in the clustering coefficients as N is changed (Fig. 4).
We also observe an interesting pattern when we plot the difference Δc_{i} = c_{i+1} − c_{i} as a function of i in Fig. 5. We find that this pattern is symmetric about Δc = 0, which can be quantified by finding the local density of values in the plot (see Methods). With increasing size of the network, this pattern also shows stretching similarity.
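The claim that c_{n} depends only on the exponents can be illustrated with a short check. The sketch below counts the links among the proper divisors of n from the exponents alone and compares the result with a brute-force count; it applies only to N/2 < n ≤ N, where the neighbors of n are exactly its proper divisors. The counting argument in the comments and all function names are assumptions made for this illustration.

```python
from math import comb, prod


def prime_exponents(n):
    """Exponents j_1, ..., j_k in the prime factorization of n (trial division)."""
    exps, p = [], 2
    while p * p <= n:
        if n % p == 0:
            j = 0
            while n % p == 0:
                n //= p
                j += 1
            exps.append(j)
        p += 1
    if n > 1:
        exps.append(1)
    return exps


def clustering_from_exponents(exps):
    degree = prod(j + 1 for j in exps) - 1  # number of proper divisors of n
    if degree < 2:
        return 0.0
    # Pairs of divisors with m | m', minus pairs with m == m',
    # minus pairs whose larger member is n itself.
    links = prod(comb(j + 2, 2) for j in exps) - 2 * prod(j + 1 for j in exps) + 1
    return 2.0 * links / (degree * (degree - 1))


def clustering_brute_force(n):
    divs = [m for m in range(1, n) if n % m == 0]
    if len(divs) < 2:
        return 0.0
    links = sum(1 for a in divs for b in divs if a < b and b % a == 0)
    return 2.0 * links / (len(divs) * (len(divs) - 1))


# 106 = 2 * 53 and 187 = 11 * 17 share the exponent pattern (1, 1).
print(clustering_from_exponents(prime_exponents(106)),
      clustering_from_exponents(prime_exponents(187)))
```

Both numbers in the usage line are products of two distinct primes, so they fall on the same horizontal line of the plot even though their prime factors differ.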
Topological characteristics of the network
In the present section, we discuss how three of the most important quantities, namely the average degree, global clustering coefficient and assortativity coefficient, vary with the size of the network.
Average degree
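Here we derive an approximate expression for the average degree of the network as a function of its size. By definition, the average degree of the network is given by:
$$\langle k \rangle = \frac{2m}{n} \tag{4}$$
where m is the total number of edges in the network and n is the size of the network. The value of m is also equal to the sum of the elements in the lower (or upper) triangular part of the adjacency matrix. To find this sum, we interpret the second index of the element A_{ij} of the adjacency matrix to be a divisor of the first index if A_{ij} = 1. In other words, let A_{ij} = 1 if and only if i > j and j | i. Then the sum of the elements in the j^{th} column of the lower triangular part of the matrix is equal to the number of integers of the form kj with k ≥ 2 and kj ≤ n. However, whenever j > n/2, all the entries in the j^{th} column of the lower triangular part of A are zero. Let ⌊x⌋ denote the greatest integer ≤ x. Then m is given by:
$$m = \sum_{j=1}^{\lfloor n/2 \rfloor}\left(\left\lfloor \frac{n}{j} \right\rfloor - 1\right) = \sum_{j=1}^{n}\left\lfloor \frac{n}{j} \right\rfloor - \sum_{j=\lfloor n/2 \rfloor+1}^{n}\left\lfloor \frac{n}{j} \right\rfloor - \left\lfloor \frac{n}{2} \right\rfloor \tag{5}$$
It is well known that the first term on the right satisfies an estimate as follows^{15}:
$$\sum_{j=1}^{n}\left\lfloor \frac{n}{j} \right\rfloor = n\ln n + (2\gamma - 1)\,n + O(\sqrt{n}) \tag{6}$$
where γ is the Euler–Mascheroni constant. Also we observe that:
$$\sum_{j=\lfloor n/2 \rfloor+1}^{n}\left\lfloor \frac{n}{j} \right\rfloor = n - \left\lfloor \frac{n}{2} \right\rfloor \tag{7}$$
since every term in this sum equals 1. From Eqs (4, 5, 6, 7), it follows that:
$$\langle k \rangle = 2\ln n + 4(\gamma - 1) + O(n^{-1/2}) \tag{8}$$
Since γ ≈ 0.5772, in the limit of large n, we get,
$$\langle k \rangle \approx 2\ln n - 1.69 \tag{9}$$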
This means that the average degree of the network increases logarithmically with the size, and this variation is plotted in Fig. 6 (solid line) using Eq. (9). We also calculate the average degree numerically by growing the network up to N = 2^{25}; the results, shown by solid dots in Fig. 6, agree closely with the analytic expression (9). Since the average degree of the network increases with size, the degree distribution of the network is not stationary, although, as shown in the previous section, the network is scale-free at each stage (see Methods).
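The logarithmic growth can be checked numerically without building the network, since node j has ⌊n/j⌋ − 1 proper multiples up to n. The sketch below compares the exact average degree with the asymptotic form 2 ln n + 4(γ − 1) that follows from Dirichlet's divisor estimate; the function names and the chosen sizes are assumptions made for illustration.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant


def average_degree_exact(n):
    """2m/n, where m counts the pairs (j, multiple of j) with multiple <= n."""
    m = sum(n // j - 1 for j in range(1, n + 1))
    return 2.0 * m / n


def average_degree_asymptotic(n):
    """Leading behavior 2 ln n + 4(gamma - 1) for large n."""
    return 2.0 * math.log(n) + 4.0 * (EULER_GAMMA - 1.0)


for exponent in (10, 15, 20):
    n = 2 ** exponent
    print(n, average_degree_exact(n), average_degree_asymptotic(n))
```

The two columns are expected to approach each other as n grows, because the correction term is of order n^{-1/2}.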
Global clustering coefficient
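The global clustering coefficient of the network quantifies the density of closed triplets in the network. A connected triplet is a set of 3 nodes connected to each other by at least 2 links. A closed triplet is a set of 3 nodes connected to each other by exactly 3 links. A triangle in the network counts as three closed triplets (one centered at each node of the triangle). The global clustering coefficient of the network is then defined as:
$$C = \frac{3 \times \text{number of triangles}}{\text{number of connected triplets}} \tag{10}$$
We estimate the number of triangles T_{n} in the network using the following strategy. Let us fix a vertex i and calculate the number of triangles in which i is the smallest vertex. The number i has ⌊n/i⌋ − 1 proper multiples in the range [1, n]. Each of them is of the form ki where 2 ≤ k ≤ ⌊n/i⌋, and each such ki in turn has ⌊n/(ki)⌋ − 1 proper multiples in the range, each of which completes a triangle with i and ki. Thus, T_{n} is given by:
$$T_{n} = \sum_{i=1}^{n}\sum_{k=2}^{\lfloor n/i \rfloor}\left(\left\lfloor \frac{n}{ki} \right\rfloor - 1\right) \tag{11}$$
Using the integral approximation for the above:
$$T_{n} \approx \int_{1}^{n/2}\int_{2}^{n/x}\left(\frac{n}{xy} - 1\right)dy\,dx \tag{12}$$
The above is bounded by,
$$A\,n(\ln n)^{2} + B\,n\ln n \tag{13}$$
Here A and B are constants. Hence we see that:
$$T_{n} = O\!\left(n(\ln n)^{2}\right) \tag{14}$$
In particular,
$$\lim_{n\to\infty}\frac{T_{n}}{n^{2}} = 0$$
Let U(n) be the number of connected triplets in the network after the n^{th} stage. Then U(n) is given by:
$$U(n) = \sum_{i=1}^{n}\binom{k_{i}}{2} = \frac{n}{2}\left(\langle k^{2}\rangle - \langle k\rangle\right) \tag{15}$$
Since we have observed (Fig. 2) that the degree distribution of the network follows a power law k^{−α} with α ~ 2, we see that the proportion p(k) of nodes with degree k is ~k^{−2}.
Thus, the expectation of the variable k^{2} satisfies:
$$\langle k^{2}\rangle = \sum_{k=1}^{n-1}k^{2}\,p(k) \sim \sum_{k=1}^{n-1}1 \sim n \tag{16}$$
Hence we see that
$$U(n) \sim n^{2} \tag{17}$$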
From Eqs (10), (14) and (17), the global clustering coefficient decays to zero as the network size goes to infinity. We verify this by numerically computing the global clustering coefficient, as shown in Fig. 7a. However, we note that the Watts–Strogatz clustering coefficient C_{WS} of the network (defined as the average of the local clustering coefficients over all the nodes of the network^{16}) does not decay to zero and instead approaches a constant value ~0.6. This is clear from Fig. 4, since the pattern repeats with stretching similarity as the network size increases. To the best of our knowledge, there is no other network in which C_{WS} saturates to a high nonzero value while the global clustering coefficient decays to 0.
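The decay of the global clustering coefficient can be reproduced for moderate sizes by counting triangles and connected triplets directly. The sketch below is illustrative only: it counts each triangle once through its smallest vertex i and middle vertex ki, and counts connected triplets as the sum over nodes of k_{i}(k_{i} − 1)/2. The function names and sizes are assumptions made here.

```python
def triangles(n):
    """Chains i | ki | l*ki with k, l >= 2 and l*ki <= n, one per triangle."""
    return sum(n // (k * i) - 1
               for i in range(1, n + 1)
               for k in range(2, n // i + 1))


def connected_triplets(n):
    """Sum over nodes of C(k_i, 2), with k_i the degree of node i."""
    divisor_count = [0] * (n + 1)
    for d in range(1, n + 1):
        for multiple in range(d, n + 1, d):
            divisor_count[multiple] += 1
    total = 0
    for i in range(1, n + 1):
        # degree = proper divisors of i + proper multiples of i up to n
        k_i = (divisor_count[i] - 1) + (n // i - 1)
        total += k_i * (k_i - 1) // 2
    return total


def global_clustering(n):
    return 3.0 * triangles(n) / connected_triplets(n)


for n in (100, 1000, 10000):
    print(n, global_clustering(n))
```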
Assortativity coefficient
The correlation of degrees in the network is an important quantifier of the network structure^{17}. If in a network the high degree nodes tend to connect to low degree nodes (i.e. if the network has negative degree correlations), then the network is said to be disassortative in structure, whereas if nodes of similar degree tend to connect to each other, the network is said to be assortative. Most real networks other than social networks are disassortative^{17}, and this has been explained using the fact that the disassortative state is the most likely state of scale-free networks^{18}. The assortative/disassortative nature of networks can be quantified using the assortativity coefficient^{9}:
$$r = \frac{\sum_{ij}\left(A_{ij} - k_{i}k_{j}/2m\right)k_{i}k_{j}}{\sum_{ij}\left(k_{i}\delta_{ij} - k_{i}k_{j}/2m\right)k_{i}k_{j}} \tag{18}$$
where k_{i} is the degree of the i^{th} node, A_{ij} is the (i, j)^{th} element of the adjacency matrix, m is the total number of edges in the network and δ_{ij} is the Kronecker delta.
In Fig. 7b we show the dependence of r on the size of the network; in spite of the irregularity in the divisibility pattern, r varies smoothly with n. It can be seen that r always remains negative, implying that the network is disassortative, though asymptotically it seems to approach the value 0. The disassortative nature of the network of natural numbers is understandable from the following argument. For any link in this network, one end of the link is a divisor (node A) and the other is a multiple (node B). Hence node A is also connected to all the nodes which are multiples of B, but the reverse is not true. This means that the degree of node A tends to be much higher than the degree of node B for a given size of the network, giving a negative value for the overall correlation coefficient.
We also find that all the important statistical properties of the network, like stretching similarity, degree distribution, clustering-degree correlation etc., are very robust to the removal of even the biggest hubs like the numbers 1, 2, 3, …. This shows that the global divisibility pattern of natural numbers does not depend only on a few nodes but instead is built by contributions from all the nodes (see Methods).
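The sign of the degree correlation can be checked with a short computation. The sketch below evaluates the standard degree assortativity coefficient in an algebraically equivalent form based on the sums S_1 = Σ k_i, S_2 = Σ k_i^2, S_3 = Σ k_i^3 and S_e = Σ_{ij} A_{ij} k_i k_j, namely r = (S_1 S_e − S_2^2)/(S_1 S_3 − S_2^2). The function names and the network size are assumptions made for illustration, and the formula is undefined for regular graphs, where the denominator vanishes.

```python
def degree_assortativity(edges):
    """Degree assortativity r of an undirected graph given as a list of (u, v) pairs."""
    degree = {}
    for u, v in edges:
        degree[u] = degree.get(u, 0) + 1
        degree[v] = degree.get(v, 0) + 1
    s1 = sum(degree.values())                  # sum of k_i, equal to 2m
    s2 = sum(k ** 2 for k in degree.values())
    s3 = sum(k ** 3 for k in degree.values())
    # Each undirected link appears twice in the double sum over i and j.
    se = 2 * sum(degree[u] * degree[v] for u, v in edges)
    return (s1 * se - s2 ** 2) / (s1 * s3 - s2 ** 2)


def divisibility_edges(n):
    """Links (d, multiple) of the divisibility network on 1..n."""
    return [(d, multiple)
            for d in range(1, n + 1)
            for multiple in range(2 * d, n + 1, d)]


print(degree_assortativity(divisibility_edges(1000)))
```

As a sanity check, a star graph, in which one hub connects to otherwise unconnected leaves, gives r = −1 with this formula.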
Discussion
The network of natural numbers constructed using divisibility relations resembles real networks in many characteristics like degree distribution, clustering and degree correlations. We show how insights into the divisibility patterns of natural numbers can be obtained using the framework of complex networks, where we consider both composite and prime numbers in a single undirected network with links generated using both multiples and divisors. Some of the interesting results that we get are the scale-free nature of the network with a non-stationary distribution and the existence of stretching similarity. We validate the existence of a power law in the distribution and estimate the corresponding power-law index using rigorous techniques from statistical inference advocated by Clauset et al.^{13}. We find that the average degree of the network grows logarithmically with the size of the network and we derive an analytic formula for its asymptotic behaviour. We also find that the global clustering coefficient of the network decays to 0 while the average clustering coefficient C_{WS} saturates to a high value. All these results are validated by extensive numerical calculations for networks up to size 2^{25}.
We also find that there exists a pattern in the local clustering coefficients that reflects universality in the organization of natural numbers in terms of their prime constituents. We observe that this pattern has a stretching similarity which is a reflection of the nature of prime factorization of natural numbers. Also, the behavior of characteristics like average degree, global clustering and assortativity coefficients for this network vary quite smoothly and hence may help us to understand better the divisibility relations between natural numbers. In conclusion, the work presented here describes an interesting perspective on the divisibility relations of natural numbers and has potential to become an important tool in the investigation of the properties of natural numbers.
Methods
Establishing the scalefree nature of the network
The shape of the degree distribution of the network in Fig. 2 hints at the existence of an asymptotic power law in the distribution (p(k) ∝ k^{−α} for k ≥ k_{min}). However, visual inspection to find k_{min}, and least-squares fits and related methods to find the exponent α of the power law, are known to produce very poor estimates^{19}. Hence we use the method of maximum likelihood on the degree sequence of the network to find the scaling index α of the power-law distribution^{13}. For this, we initially assume that the sequence is drawn from a distribution that follows a power law k^{−α} for all k ≥ k_{min}. To find this k_{min}, we use the approach proposed by Clauset et al.^{20}. The idea behind this method is to choose as k_{min} the value of k that makes the probability distribution of the data and the best-fit power-law model as similar as possible above k_{min}, where the Kolmogorov–Smirnov statistic is used as the distance between the two distributions^{13}. After finding k_{min} using this method, the best estimate of the scaling exponent α is given by:
$$\alpha \simeq 1 + N\left[\sum_{i=1}^{N}\ln\frac{k_{i}}{k_{min} - \frac{1}{2}}\right]^{-1} \tag{19}$$
where k_{i}, i = 1, …, N are values of k such that k_{i} ≥ k_{min}. For the network of size 2^{15}, the value α is obtained here as ~2.
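A maximum-likelihood estimate of this kind takes only a few lines. The sketch below assumes the approximate estimator for discrete data given by Clauset et al., α ≈ 1 + N [Σ ln(k_i/(k_min − 1/2))]^{−1}; whether this exact variant was used for the reported results is an assumption, as are the function names, the network size and the cutoff k_min = 10, which is chosen arbitrarily here rather than by the Kolmogorov–Smirnov procedure.

```python
import math


def alpha_mle(degrees, k_min):
    """Approximate discrete power-law MLE using only the values k >= k_min."""
    tail = [k for k in degrees if k >= k_min]
    return 1.0 + len(tail) / sum(math.log(k / (k_min - 0.5)) for k in tail)


def degree_sequence(n):
    """Degrees of nodes 1..n in the divisibility network."""
    deg = [0] * (n + 1)
    for d in range(1, n + 1):
        for multiple in range(2 * d, n + 1, d):
            deg[d] += 1
            deg[multiple] += 1
    return deg[1:]


degrees = degree_sequence(2 ** 12)
# k_min = 10 is an arbitrary illustrative cutoff, not a fitted value.
print(alpha_mle(degrees, k_min=10))
```

The printed estimate depends strongly on the cutoff, which is why k_{min} is selected by a separate goodness-of-fit step in practice.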
To validate the existence of the power law, we use the approach described in Clauset et al.^{13}. In this approach we generate many synthetic data sets from a true power-law distribution and measure how far they fluctuate from the power-law form. We then compare the results with similar measurements on the observed data. If the observed data set is much further from the power-law form than the synthetic ones, the power law is rejected. The p-value is defined as the fraction of the synthetic distances that are larger than the empirical distance. A large p-value is indicative of the existence of a power law in the data. In the present work we calculate the p-values for three different sizes of the network: N = 256, 512, 1024. For this, we generate 2500 synthetic data sets, which gives p-values accurate up to two decimal places as 0.62, 0.95 and 0.98 respectively. The existence of a power-law degree distribution for this network is thus supported by the fact that the p-values rapidly approach 1 as the network size increases.
The distribution in Fig. 2 is plotted with logarithmic binning, with the successive bin sizes equal to successive powers of 2, and the count in each bin is normalized by dividing it by the bin width. The same strategy is used to show the dependence of the local clustering coefficient c(k) on degree k in Fig. 3.
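The binning described above can be sketched as follows. The bins are taken to be [1, 2), [2, 4), [4, 8), …, so that the width of each bin equals its left edge; this choice of bin edges and the function name are assumptions made for illustration, and the input is assumed to consist of positive integers such as node degrees.

```python
def log_binned_density(values):
    """Counts in bins [1,2), [2,4), [4,8), ... divided by the bin width."""
    counts = {}
    for v in values:
        if v >= 1:
            # Left edge of the bin: the largest power of 2 not exceeding v.
            start = 1 << (int(v).bit_length() - 1)
            counts[start] = counts.get(start, 0) + 1
    # The bin starting at 2**b has width 2**b.
    return [(start, counts[start] / start) for start in sorted(counts)]


print(log_binned_density([1, 2, 3, 4, 5, 6, 7]))
```

In the usage line each bin is completely filled, so every normalized count equals 1.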
Symmetry in difference of successive local clustering coefficients
To establish the global symmetry of the difference in local clustering values Δc around the horizontal axis Δc = 0 (Fig. 5) for a network of size 2^{N}, we calculate the local density of points in the plot. For this, we divide the horizontal axis into 2^{7} = 128 cells and the vertical axis into 200 cells. The whole plot then gets divided into pixels of dimension 0.01 × 2^{N−7}. We define the density ρ(x, y) of a particular pixel (x, y) as the ratio of the number of points present in the pixel to the maximum number that can be there, which is equal to 2^{N−7} (all the points on the y-axis with difference less than 0.01 are considered the same, so the vertical dimension of each pixel is just 1). For each x we calculate the absolute difference between the corresponding pixels on each side of the line Δc = 0. If the pattern is symmetric then these absolute differences are expected to be small. We calculate the average of such differences as:
$$\phi(x) = \frac{1}{100}\sum_{y=1}^{100}\left|\rho(x, y) - \rho(x, -y)\right| \tag{20}$$
where y = 1, …, 100 indexes the cells above the line Δc = 0 and −y the corresponding cells below it.
In Fig. 8 we show ϕ(x) as a function of x and as is clear from the figure, all ϕ values are very close to 0 confirming that the pattern is indeed symmetric.
Removal of hubs from the network
To test the robustness of the various statistical properties of the network against the removal of hubs, we simulated the network of natural numbers while removing the numbers 1 to 4 step by step. When the number 1 is removed from the network, all the prime numbers between N/2 and N become isolated, and these remain the only isolated nodes. This means that in this case the network consists of a giant component along with many isolated nodes. We find that such a removal does not affect the degree distribution and the clustering-degree correlation much, and qualitatively the network remains scale-free with the same power-law index as for the original network. The other properties like average degree, clustering coefficients and assortativity do change to some extent upon this removal but qualitatively remain the same. The plot of the degree distributions after removing hubs is shown in Fig. 9.
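The effect of removing node 1 can be illustrated on a small network. The sketch below rebuilds the network without a given set of nodes and lists the isolated nodes that result; the `removed` parameter, the helper names and the size N = 100 are assumptions made for this illustration.

```python
def divisibility_network(n_max, removed=()):
    """Adjacency sets of the divisibility network on 1..n_max without the removed nodes."""
    removed = set(removed)
    adj = {i: set() for i in range(1, n_max + 1) if i not in removed}
    for d in adj:
        for multiple in range(2 * d, n_max + 1, d):
            if multiple in adj:
                adj[d].add(multiple)
                adj[multiple].add(d)
    return adj


def is_prime(n):
    return n > 1 and all(n % d for d in range(2, int(n ** 0.5) + 1))


N = 100  # arbitrary illustrative size
adj = divisibility_network(N, removed=(1,))
isolated = sorted(node for node, nbrs in adj.items() if not nbrs)
upper_primes = [p for p in range(N // 2 + 1, N + 1) if is_prime(p)]
print(isolated == upper_primes)
print(isolated)
```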
Additional Information
How to cite this article: Shekatkar, S. M., Bhagwat, C. and Ambika, G. Divisibility patterns of natural numbers on a complex network. Sci. Rep. 5, 14280; doi: 10.1038/srep14280 (2015).
References
1. Broder, A. et al. Graph structure in the web. Comput. Netw. 33, 309–320 (2000).
2. Faloutsos, M., Faloutsos, P. & Faloutsos, C. On power-law relationships of the internet topology. Comput. Commun. Rev. 29, 251–262 (1999).
3. Amaral, L. A. N., Scala, A., Barthélémy, M. & Stanley, H. E. Classes of small-world networks. Proc. Natl. Acad. Sci. USA 97, 11149–11152 (2000).
4. Huxham, M., Beaney, S. & Raffaelli, D. Do parasites reduce the chances of triangulation in a real food web? Oikos 76, 284–300 (1996).
5. Jeong, H., Tombor, B., Albert, R., Oltvai, Z. N. & Barabási, A.-L. The large-scale organization of metabolic networks. Nature 407, 651–654 (2000).
6. Jeong, H., Mason, S., Barabási, A.-L. & Oltvai, Z. N. Lethality and centrality in protein networks. Nature 411, 41–42 (2001).
7. White, J. G., Southgate, E., Thompson, J. & Brenner, S. The structure of the nervous system of the nematode Caenorhabditis elegans. Phil. Trans. R. Soc. London 314, 1–340 (1986).
8. Albert, R. & Barabási, A.-L. Statistical mechanics of complex networks. Rev. Mod. Phys. 74, 47–97 (2002).
9. Newman, M. E. J. Networks: An Introduction (Oxford University Press, New York, 2010).
10. Zhou, T., Wang, B. H., Hui, P. M. & Chan, K. P. Topological properties of integer networks. Physica A 367, 613–618 (2006).
11. Dinghua, S. & Huijie, Z. Natural number network and the prime number theorem. Complex Systems and Complexity Science 7, 3813 (2010).
12. García-Pérez, G., Serrano, M. & Boguñá, M. Complex architecture of primes and natural numbers. Phys. Rev. E 90, 022806 (2014).
13. Clauset, A., Shalizi, C. R. & Newman, M. E. J. Power-law distributions in empirical data. SIAM Review 51, 661–703 (2009).
14. Ravasz, E. & Barabási, A.-L. Hierarchical organization in complex networks. Phys. Rev. E 67, 026112 (2003).
15. Apostol, T. M. Introduction to analytic number theory (Springer-Verlag, New York-Heidelberg, 1976).
16. Watts, D. J. & Strogatz, S. H. Collective dynamics of ‘small-world’ networks. Nature 393, 440–442 (1998).
17. Newman, M. E. J. Assortative mixing in networks. Phys. Rev. Lett. 89, 208701(1–4) (2002).
18. Johnson, S., Torres, J. J., Marro, J. & Muñoz, M. A. Entropic origin of disassortativity in complex networks. Phys. Rev. Lett. 104, 108702(1–4) (2010).
19. Goldstein, M. L., Morris, S. A. & Yen, G. G. Problems with fitting to the power-law distribution. Eur. Phys. J. B 41, 255–258 (2004).
20. Clauset, A., Young, M. & Gleditsch, K. S. On the frequency of severe terrorist events. J. Conflict Resolution 51, 58–87 (2007).
Acknowledgements
S.M.S. is supported by a Senior Research Fellowship from the University Grants Commission, New Delhi, India. C.B. is supported by the DST-INSPIRE faculty scheme, award number [IFA11MA05]. The authors acknowledge Joel Ornstein for making the Python implementations of some of the methods used in this paper available to us.
Author information
Affiliations
Indian Institute of Science Education and Research, Pune, 411008, India
Snehal M. Shekatkar, Chandrasheel Bhagwat & G. Ambika
Contributions
S.M.S. proposed the idea and performed the numerical simulations. C.B. derived the results analytically. G.A. supervised the study. All authors discussed the results and prepared the manuscript.
Competing interests
The authors declare no competing financial interests.
Corresponding author
Correspondence to G. Ambika.
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/