Introduction

Complex networks arise from natural and social phenomena such as the Internet, the protein interactions, the collaborations in research, and the social relationships. Readers are referred to Watts-Strogatz’s1 small-world network model and Barabási-Albert’s2 scale-free network model, and Newman’s review3 and book4, etc.

In this paper, we revisit the fractality of complex network by investigating three dimensions dB5, dball6 and df7 with respect to minimum box-covering, minimum ball-covering and average volume of balls. The compact box burning algorithm (CBB)8,9 and random ball-covering algorithm6 are proposed to calculate dB and dball respectively. However the minimum box-covering problem and minimum ball-covering problem are NP-complete, which are proved rigorously in Theorem 1 and Proposition 2 respectively. The NP-completeness implies that the CBB algorithm and the random ball-covering algorithm do not have high performance, then we suggest some algorithms to improve the random ball-covering algorithm. For the third dimension df, we obtain an efficient algorithm: the random ball-volume algorithm. When do the three dimensions coincide? To answer this question, we introduce the notion of Ahlfors regularity of networks and prove that dB = dball = df (Theorem 2) if networks are Ahlfors regular. Then for Ahlfors regular networks, the random ball-volume algorithm is efficient to obtain the above three fractal dimensions.

Fractal dimensions and covering problems

Song, Havlin and Makse5 reveal that many real networks have self-similarity and fractality, and Gallos, Song, Havlin and Makse give a review of fractality of complex networks10. The algorithms to numerically calculate the fractal dimension of complex networks have been proposed: For example, the CBB algorithm8,9 is applied to calculate the fractal dimension of complex networks through the minimum box-covering; Kim, Goh, Kahng and Kim11 improve the CBB algorithm to investigate the fractal scaling property in scale-free networks; Zhou, Jing and Sornette12 propose the edge-covering box algorithm; Gao, Hu and Di6 give the minimum ball-covering approach to calculate the fractal dimension of complex networks.

Recall some notation. Considering a network as a graph G = (V, E) equipped with geodesic distance d, we let an l-box A denote a subset of V such that the geodesic distance of any two points in the subset is less than l, an l-ball centered at x0 the subset . Let Nl be the smallest number of l-boxes needed to cover V, and Bl the smallest number of l-balls needed to cover V. Suppose that

where dB is the fractal dimension defined by Song, Havlin and Makse5, and dball is defined by Gao, Hu and Di6.

For box-covering, Song, Gallos, Havlin and Makse9 point out that the minimum l-box-covering problem is NP-complete for any l ≥ 2. On the other hand, for ball-covering, which is far from box-covering in graph theory, we have

Theorem 1. The minimum l-ball covering problem is NP-complete for any l ≥ 2.

Ball-covering algorithms

Due to the NP-completeness, for finding the feasible solution of minimum ball-covering problem, we can apply the usual random ball-covering algorithm (RBC)6: when l is fixed, in each time t, we randomly choose one node xt in the vertex set Vt−1 remained in time (t − 1), and obtain Vt by cutting all nodes in .

In the RBC algorithm we give a random sorting for nodes in Vt−1 and take the first node. Moreover, given some function , we can sort these nodes according to the values of function f.

Given a function , suppose we sort nodes according to values of f in nondecreasing order: If f is the degree function, we can obtain degree-order ball-covering algorithm (DOBC); If and, we obtain volume-order ball-covering algorithm (VOBC).

For a function , assume we sort nodes according to values of g in nonincreasing order, we propose the following greedy algorithm:

  1. 1

    Assume that such that .

  2. 2

    Set and the sorting of nodes in Vt inherits from V0 = V.

When , we obtain the volume-greedy ball-covering algorithm (VGBC). Let g(x) = deg(x), we have the degree-greedy ball-covering algorithm (DGBC).

In the point of view on fractal geometry, the box dimension is independent of the geometric shapes of covering, such as ball or box. It is easy to check that Bl ≤ Nl ≤ Bl/2, hence  ≤  ≤  ≈  ≈ . By the above estimate, when the diameter of network is large enough to insure that l can be taken large enough, we have

Proposition 1. The fractal dimensions dB and dball w.r.t. the box covering and ball covering respectively are the same.

However, for real networks with small-world effect, we can not take l large enough, and the upper bound of error is not small enough. On the other hand, we only find the feasible solutions of minimum covering problems due to their NP-completeness. See the following example.

Example 1. Through above 5 algorithms (Fig. 1), we calculate dball for the WWW network (Table 1).

Figure 1
figure 1

Slopes exist w.r.t. 5 algorithms for the WWW network: (a) RBC, (b) DGBC, (c) DOBC, (d) VGBC, (e) VOBC.

Table 1 dball w.r.t. 5 algorithms for the WWW network.

In Table 1, the value of the RBC algorithm is exactly the value dball = 4.2 by Gao, Hu and Di6. Note that Song, Havlin, and Makse5 obtain that dB = 4.1.

For the WWW network, we also compare the above 5 algorithms (Fig. 2). It seems that the VGBC algorithm is the best and the performance of the RBC is the worst and close to the VOBC.

Figure 2
figure 2

Comparison of 5 algorithms for the WWW network.

Random ball-volume algorithm

Based on Shanker’s work13, Guo and Cai7 investigate the power law between the average volume of balls and the their radii. Given a network, let p(l) be the average cardinality of nodes in a ball with radius l, suppose that

We call df the volume dimension. Please also see generalized volume dimension14 by Wei et al.

We will discuss the volume dimension df related to average ball-volume and propose the random ball-volume algorithm for networks. Compared with the minimum box-covering algorithm and the minimum ball-covering algorithm, we have the following algorithm to calculate the average volume of ball with size l approximately.

Random ball-volume algorithm (RBV) (for fixed size l):

  1. 1

    Randomly take a node x in the network.

  2. 2

    Calculate the volume ν(B(x, l)).

  3. 3

    Repeat the steps 1–2 and obtain average volume of random l-balls.

For the WWW network, using the RBV algorithm we obtain df = 5.833 (Fig. 3).

Figure 3
figure 3

RBV for the WWW network.

Ahlfors regularity of networks

Fractal geometry and fractal network have deep connection. We can generate complex network models from self-similar fractals. For example Andrade et al.15 and Zhou et al.16 discuss Apollonian networks generated from Apollonian fractal, Zhang et al.17,18,19 construct evolving networks modeled from Sierpinski gasket by taking the line segments as nodes. Besides Zhang et al.20 construct the networks produced from Vicsek fractals, Liu et al.21 and Chen et al.22 explore some Koch networks related to Koch curves, Song et al.23 study complex networks modeled on Platonic solids, Chen et al.24 investigate networks generated by Sierpinski tetrahedron.

In this paper, we try to find out the farther connection between the fractal networks and fractal geometry. Recall some classical result on fractal dimension. We find out that many dimension results have measure versions. Suppose μ is a Borel (finite) measure supported on a compact subset E, denoted by spt . For any , let the lower local dimension of μ at point x be defined by . A classical result25,26 on Hausdorff dimension dimH (·) is

That means for Hausdorff dimension, we have the corresponding measure version. When replacing by , we obtain packing dimension dimP (·)25,26. We always have , where is upper box dimension. A reasonable case is and there is a suitable measure μ such that , or we can pose the Ahlfors regularity assumption on the measure

where c is an independent constant.

We give a natural measure on a graph G = (V, E). For , we let ν(Ω) be the cardinality of Ω, which is called the volume of Ω. We say that {Gt}t is a family of growing networks, i.e., , which means the node set of Gt+1 contains node set of Gt, and neighbors of Gt are still neighbors of Gt+1. When {Gt}t is growing, we let νt(Ω) denote cardinality of , where Vt is the node set of Gt.

Remark 1. When taking as the sum of degrees of nodes in Ω, Wei et al.14 obtain the generalized volume dimension.

Definition 1. Given s > 0, if , we call the network an Ahlfors s-regular network. When {Gt}t is growing, we call {Gt}t Ahlfors s-regular networks, if there is an independent constant c such that for all , r < diam(Vt) and t,

When the diameter of network is large enough, we have

Theorem 2. df = dball = dB = s if the network or growing networks are Ahlfors s-regular.

When the networks are regular, we can use RBV algorithm to obtain their fractal dimensions efficiently.

Ahlfors regular trees

Now, we obtain a rule (rule 1) of generating Ahlfors s-regular networks and growing trees in Figs 4 and 5. We have in Fig. 6. By embedding the self-similar tree into the self-similar fractal in , we find that the volume of ball in the tree is comparable with the (self-similar) measure of ball in plane, then we can obtain

Figure 4
figure 4

Rule 1.

Figure 5
figure 5

G1, G2, G3 of growing trees w.r.t. rule 1.

Figure 6
figure 6

Fractal dimensions of G5: (a) CBB, (b) RBC, (c) DGBC, (d) DOBC, (e) RBV.

Theorem 3. The growing self-similar trees defined above are Ahlfors s-regular with s = log 5/log 3. Therefore, we have

We also have rule 2 and growing trees in Figs 7 and 8. For this self-similar tree with respect to rule 2, we have .

Figure 7
figure 7

Rule 2.

Figure 8
figure 8

G1, G2 of growing trees w.r.t. rule 2.

Fix an infinite sequence of 1 and 2 such that exists. We can construct a family of growing networks as follows by induction: for time t, we take rule 1 if xt = 1, else take rule 2. For example, if the sequence is , we obtain our growing networks G1, G2, G3 as in Fig. 9. This is a family of deterministic growing networks.

Figure 9
figure 9

The first three steps according to an infinite sequence .

Then we can generate a Moran tree with mixed rules. For this Moran tree without self-similarity, we have . We also obtain random growing networks, for each time t, we can choose rule 1 in probability p and rule 2 in probability 1 − p.

The rest of paper is organized as follows. Section 2 is devoted to the rigourous proofs on the NP-completeness of minimum ball-covering problem (Theorem 2) and minimum box-covering problem (Proposition 2). Section 3 is the preliminary on the Ahlfors regularity of fractal geometry, including covering inequality and self-similar fractal. In this section, we also recall the fact that the open set condition of self-similar fractal implies the Ahlfors regularity of fractal measure. Replacing the fractal measures by the cardinalities of subsets of networks, we obtain the Ahlfors regularity of networks. In Section 4, we prove Theorem 2 by using covering inequality shown in Section 2, and obtain Ahlfors regularity of a class of self-similar network (Theorem 3) by constructing bilipschitz mappings from a self-similar fractal, satisfying the open set condition, to self-similar networks, and estimating the cardinalities of balls of graph from the Ahlfors regularity of the fractal measure.

NP-completeness of minimum covering problems

Recall some notation of computer science. For an alphabet , let be the set of finite strings of elements of , and the class of functions from into defined by one-tape Turing machine which operate in polynomial time.

Definition 2. Let L and M be languages. Then (L is reduced to M) if there is a function such that . We say that some language is NP-complete , if for all .

The concept of NP-completeness was introduced in 1971 by Cook27. In Cook’s theorem, he proved that the Boolean satisfiability problem is NP-complete.

In 1972, Karp28 proved that several other problems were also NP-complete. For example, we give the following two in Karp’s 21 NP-complete problems.

  1. 1

    Clique covering problem

  2. 2

    Input: graph G = (V, E), positive integer k

  3. 3

    Property: V is the union of k or fewer cliques, where a clique is a subset of vertices of G such that its induced subgraph is complete.

  4. 4

    Set covering problem

  5. 5

    Input: universe U and a family S of subsets of U, positive integer k

  6. 6

    Property: there is a set covering of size k or less, where a set covering is a subfamily of sets whose union is U.

  7. 7

    In 1992, Kann29 proved that the set covering problem, which is NP-complete, can be reduced to the following dominating set problem (hence it’s also NP-complete).

  8. 8

    Dominating set problem

  9. 9

    Input: graph G = (V, E), positive integer k

  10. 10

    Property: there is a dominating set of k or fewer nodes, where a dominating set is a subset D of V such that every vertex not in D is adjacent to at least one member of D.

  11. 11

    In this section, we will show the following two problems are NP-completes.

  12. 12

    l-ball-covering problem

  13. 13

    Input: graph G = (V, E), positive integer k

  14. 14

    Property: V is the union of k or fewer l-balls.

  15. 15

    l-box-covering problem

Input: graph G = (V, E), positive integer k

Property: V is the union of k or fewer l-boxes.

Proof of Theorem 1

If l = 2, then l-ball-covering problem is exactly the dominating set problem, which is NP-complete.

If l = 3, given a undirected graph G = (V, E) as in Fig. 10, we construct a new graph in polynomial time w.r.t. the size of G.

  1. 1

    For any , we insert a median point z (in red) in the edge with degree 2 in , i.e., in we have x ~ z, z ~ y and x, y are not neighbors in .

  2. 2

    We add a Hub (in blue) to connect all median points.

  3. 3

    Insert sub-median-point (in yellow) for every edge between one median point (in Step I) and Hub.

  4. 4

    We construct a leaf node (in pink) and the median point (in green) between leaf node and the Hub.

Figure 10
figure 10

The reduction process for l = 3.

We have the following

Claim 1. There is a dominating set of k or fewer nodes in G if and only if is the union of (k + 1) or fewer 3-balls.

To verify this claim, we notice the following facts.

  1. a

    For any nodes , in their geodesic distance .

  2. b

    The subset of all nodes not in V is a 3-ball centered at the Hub.

  3. c

    The geodesic distance between the pink node and any node in V is 5, that means any 3-ball can not contain the pink node and any node of V simultaneously.

  4. d

    For any 3-ball D with , we can find a node u in such that

Suppose is the minimum dominating set of G and there is a minimum 3-ball covering of . We only need to show that

In fact, we have . It follows from the fact (a) that

for any i ≤ s. Applying the fact (b), we see that there exists a 3-ball covering with (s + 1) balls. Hence

On the other hand, considering the minimum 3-ball covering , by fact (d), we obtain a dominating set of G, where . Therefore, . Since the pink point must belong to some ball , by fact (c), we have . Therefore we have

Then (1) follows from (2) and (3). Then Theorem 1 is proved for l = 3.

For l ≥ 4, we have the similar construction during reduction. In fact, we insert (l − 2) median points into each edge of G, add a Hub to connect all median points, insert (l − 2) sub-median-point for every edge between one median point and Hub. Finally, we construct the leaf node and connect it to the Hub, insert (l − 2) the median point between leaf node and the Hub. See Fig. 11 for l = 4.

Figure 11
figure 11

The reduction process for l = 4.

Remark 2. To prove one problem is NP-complete, we always find a reduction from a known NP-complete problem to our problem. On the other hand, we can always construct a reduction from our (NP) problem to a known NP-complete problem due to the definition of NP-completeness.

We give a proof of the following fact which is pointed out by Song, Gallos, Havlin and Makse9.

Proposition 2. For any fixed size l, the l-box-covering problem is NP-complete.

Proof. If l = 2, l-box-covering problem is exactly the clique covering problem, which is NP-complete.

If l = 3, given a undirected graph G = (V, E), as in Fig. 12, we construct a new graph G′ = (V′, E′) in polynomial time with respect to the size of G.

  1. 1

    For any , we insert a median point z (in red) in the edge (x, y) with degree 2, i.e., x ~ z, z ~ y and x, y are not neighbors in G′.

  2. 2

    We add a Hub (in blue) to connect all median points.

  3. 3

    We construct a leaf node (in pink) adjacent to the Hub.

Figure 12
figure 12

The reduction process for l = 3.

We have the following

Claim 2. V is the union of k or fewer cliques if and only if Vis the union of (k + 1) or fewer 3-boxes.

To verify this claim, we notice the following facts.

  1. i

    For any nodes , in G′ their geodesic distance .

  2. ii

    The subset of nodes not in V is a 3-box.

  3. iii

    The geodesic distance between leaf node (in pink) and any node in V is 3.

Suppose is a family of cliques of G such that s is minimal one. Suppose there is a minimum 3-box covering of G′. We only need to show that

In fact, we have . It follows from the fact (i) that Ai is a 3-box in G′ for any i ≤ s. Applying the fact (ii), we see that there exists a 3-box covering with (s + 1) boxes. Hence

On the other hand, it follows from fact (i) that is a family of cliques in G where . Therefore, . We also notice that if the pink point belongs to some , by fact (iii), we have . Therefore we have

Then (4) follows from (5) and (6).

For l ≥ 4, we have the similar construction during reduction. See Fig. 13 for l = 4.□

Figure 13
figure 13

The reduction process for l = 4.

Covering inequality, self-similar fractal and Moran fractal

Covering and packing on metric space

Given a compact metric space (X, d), let a δ-ball centered at x0 be an open ball , and a δ-cube a cube of Euclidean space with side length δ, a δ-box B is a subspace of X such that its diameter less than δ, i.e., d(x, y) < δ for all x, . Denote

We recall an elementary inequality26 which is important in this paper. We give the proof for the self-containedness of this paper.

Lemma 1. .

Proof. Suppose is a packing family of δ-balls, we conclude that . Otherwise, suppose , for any , we have d(y, yi) ≥ d(y, xi) − d(yi, xi) ≥ 2δ − δ = δ. That means is empty for any i. Now, we obtain a new packing family of , which is a contradiction. Therefore, we have , and thus we have Pδ ≥ B2δ.

Assume is a packing family of δ-balls, then d(xi, xj) ≥ δ for all . Notice that on Euclidean space, we have d(xi, xj) ≥ 2δ for all . Suppose there is a minimum covering of δ/2-balls . Now, every δ/2-ball contains at most one points in since the diameter of a δ/2-ball is less than δand d(xi, xj) ≥ δ for all . On the other hand, every xi must be contained in some δ/2-ball. Therefore, we obtain Pδ ≤ Bδ/2.□

We also have

By the above inequalities, the classical result25,26 on box dimension is that

In fact, in the above formula, we take upper box dimension or lower box dimension when the limit does not exist.

Self-similar set on Euclidean space

Let be a self-similar set30 on a Euclidean space , where Si is a similarity with ratio ri, i.e., for all x, . In fact, where , and Ri is orthogonal. That means any similarity is the compositions of homothety, translation and orthogonal transformation.

We say that the open set condition (OSC) holds if there exists a non-empty open set V such that

Let and the probability vector . According to ref. 30, there is a unique Borel measure μ (self-similar measure) satisfying . When the OSC holds, Hutchinson30 obtained that dimHK = dimBK = s, and there is a constant C ≥ 1 such that for all and r ≤ |K| (the diameter of K),

A compact set E is said to be Ahlfors s-regular26, if there is a Borel measure μ supported on E satisfying (8). That means the self-similar set satisfying the OSC is Ahlfors regular.

Self-similar fractals

We introduce a special self-similar fractal on (Figs 14 and 15). Let

Figure 14
figure 14

The first two steps of self-similar fractal (model 1).

Figure 15
figure 15

Step 4 of self-similar fractal (model 1).

where orthogonal. matrixes , , .

Let V be the interior of polygon with vertexes (0, 0), (1/3, 1/3), (2/3, 1/3), (1, 0), (4/9, −1/9) and (5/9, −1/9). Then (7) holds for m = 5 (Fig. 16).

Figure 16
figure 16

OSC holds.

Taking , we give a self-similar fractal of model 2 (Figs 17 and 18).

Figure 17
figure 17

The first two steps of self-similar fractal of model 2.

Figure 18
figure 18

Step 4 of self-similar fractal of model 2.

Then the OSC also holds. Let E1, E2 be the self-tree of models 1 and 2 respectively. Then

Moran fractal and random fractal

Fix an infinite sequence of 1 and 2, we can generate a Moran fractal with mixed model. If xt = 1 then we take model 1, else we take model 2. Let

If exists, then the corresponding fractal has fractal dimension

An interesting fact is that this is a deterministic fractal without self-similarity. This is a Moran fractals31.

An alternative is a random fractal such that for each time t, we can choose model 1 in probability p and model 2 in probability 1 − p. Then we obtain the above dimension almost surely.

Ahlfors regularity of networks

Proof of Theorem 2

By the definition of Ahlfors regularity, we have df = s.

Suppose . Since the network is covered by Bl balls of radius l, that means

On the other hand, we have Pl/2 packing balls of radius l/2, which implies

That means

here we use the inequality B2δ ≤ Pδ ≤ Bδ/2 in Lemma 1. Therefore,

which implies , i.e., dball = dB = s.

Proof of Theorem 3

Let A = (0, 0) and B = (1, 0). Let and .

Remark 3. One node may have distinct codings and if . We also notice that each node has three codings at most.

Two different nodes x, are neighbors if and only if there exists a word such that

Let dt be the geodesic distance on Gt.

We denote if there is a constant d > 0 independent of the index i such that d−1bi ≤ ai ≤ dbi.

Now, we will prove the following important

Lemma 2. There is a constant c > 0 independent of t such that

Proof. Suppose

where . Notice that

where . Without loss of generality, we assume that .

Case 1. If is empty, then and , and

Then (9) follows in this case.□

Case 2. If is non-empty, we may assume that without loss of generality.

For D = (1/3, 0), let . Then there exists (Fig. 16) such that θ ≥ θ0 (>0). Now,

which implies

We also have . Therefore, we have

On the other hand,

by the tree structure. It follows from (10) and (11) that we only need to verify (9) for the pairs (y1, D) and (D, y2). By the self-similarity, now we only need to prove the case when .

Without loss of generality, let y1 = A and where . Then

and

then (9) follows.

Since the OSC holds, then the self-similar measure μ with respect to the vector (1/5, 1/5, 1/5, 1/5, 1/5) is Ahlfors s-regular for s = log 5/log 3.

It follows from the above lemma and Remark 3 that

where #Vt = 5t + 1. Therefore, we have

Notice that the constant in (12) is independent of t. Now, the growing networks {Gt}t are Ahlfors s-regular.

Conclusion

We focus on the NP-completeness of minimum ball-covering problem, propose some heuristic ball-covering algorithms such as GOBC, GDBC, VOBC and VGBC, and compare these algorithms with usual RBC algorithm. Inspired by the notion of measure on fractal, a natural measure on the finite graph is obtained such that the measure of every subset is the cardinality of subset. Based on this measure, we revisit the volume dimension df and propose the random ball-volume algorithm, which has performance better than the above five minimum covering algorithms due to the NP-completeness. Applying the notion of Ahlfors regularity from fractal geometry, we prove that dB = dball = df = s if the network is Ahlfors s-regular. Finally, we investigate the Ahlfors regularity of a class of self-similar trees and random trees which come from the self-similar fractals and Moran fractals respectively. Although we only prove Theorem 3 for self-similar tree of model 1, but our approach can be applied to many self-similar trees, Moran tree and random trees. Essentially, our approach is to embed our networks into a self-similar (or Moran) fractal (on Euclidean space) satisfying the open set condition, using the Ahlfors regularity of corresponding self-similar (or Moran) measure, we can estimate the volume of balls in networks.

Additional Information

How to cite this article: Wang, L. et al. On the Fractality of Complex Networks: Covering Problem, Algorithms and Ahlfors Regularity. Sci. Rep. 7, 41385; doi: 10.1038/srep41385 (2017).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.