Deployment of check-in nodes in complex networks

Jiang, Zhong-Yuan; Ma, Jian-Feng

doi:10.1038/srep40428

Download PDF

Article
Open access
Published: 11 January 2017

Deployment of check-in nodes in complex networks

Zhong-Yuan Jiang¹ &
Jian-Feng Ma^1,2

Scientific Reports volume 7, Article number: 40428 (2017) Cite this article

1175 Accesses
7 Citations
1 Altmetric
Metrics details

Subjects

Abstract

In many real complex networks such as the city road networks and highway networks, vehicles often have to pass through some specially functioned nodes to receive check-in like services such as gas supplement at gas stations. Based on existing network structures, to guarantee every shortest path including at least a check-in node, the location selection of all check-in nodes is very essential and important to make vehicles to easily visit these check-in nodes, and it is still remains an open problem in complex network studies. In this work, we aim to find possible solutions for this problem. We first convert it into a set cover problem which is NP-complete and propose to employ the greedy algorithm to achieve an approximate result. Inspired by heuristic information of network structure, we discuss other four check-in node location deployment methods including high betweenness first (HBF), high degree first (HDF), random and low degree first (LDF). Finally, we compose extensive simulations in classical scale-free networks, random networks and real network models, and the results can well confirm the effectiveness of the greedy algorithm. This work has potential applications into many real networks.

A route pruning algorithm for an automated geographic location graph construction

Article Open access 02 June 2021

Christoph Schweimer, Bernhard C. Geiger, … Derek Groen

Optimizing target nodes selection for the control energy of directed complex networks

Article Open access 22 October 2020

Hong Chen & Ee Hou Yong

Search graph structure and its implications for multi-graph constrained routing and scheduling problems

Article Open access 01 September 2022

Michal Weiszer, Edmund K. Burke & Jun Chen

Introduction

The advent of complex network^1,2,3 theory has had a significant impact on the network and data science⁴ over the course of past 20 years. People’s daily life deeply relies on kinds of artificial networks such as city road networks, highway networks, power grids, communication networks, and virtual networks such as WWW (World Wide Web), social networks, and so on. A wide range of research topics aim to solve the challenges that many real networks face. As discussed in our previous work⁵, a portion of nodes in many complex networks have special functions such gas stations in road networks and highway networks supplying for check-in like services. In air transportation, for the convenience of passengers and resource locations, e.g. maintenance crews, it is very important to locate the hub nodes of an airline⁶. In IT infrastructure, we may want to allocate specific functions to critical nodes or driver nodes⁷, for instance, the nodes that control the Internet traffic in the search for viruses. In interdependent networks (e.g. power grids and communication networks), a fraction of critical nodes may result in the collapse of whole interdependent network⁸, such as the largest blackout of the power gird and the outages of the Internet. In social science, for security purpose, many “inside” agents are need to intercept all communications⁹ in a network of terrorists. In food web¹⁰, the predation relation can be also considered as check-in like service, and mining the key species whose disappearance may lead to large scale species extinction is a very critical problem. These nodes with special functions can be called check-in nodes, and objects that flow in networks need to finish check-in like services at the check-in nodes. For instance, vehicles often have to pass through gas stations to get gas supplement. Then two aspects of this problem should be considered:

1
Efficient routing strategies. With a portion of predesigned locations (perhaps randomized ones) of gas stations, designing efficient routes for all vehicles is very essential and important to alleviate traffic congestion, save gas fuel and time consumption of drivers. Our work⁵ tried to explore a possible check-in based routing framework for this problem. Definitely, many previous routing optimization methods including the efficient routing¹¹, optimal routing¹², global dynamic routing¹³, incremental routing¹⁴ and hybrid routing¹⁵ can be referenced. For simplicity and without loss of generality, here we employ the classical shortest path routing method for path discovery.
2
Optimal deployment of check-in node locations. With a given number of check-in nodes, which positions are the optimal ones that can achieve the highest profits to citizens and governors? To our best knowledge, it is still an open problem in complex network research.

In other words, with minimum number of check-in nodes, we aim to maximize the profits of the whole network in this work. There are several aspects which need to be clarified clearly for this problem:

1
Clear problem definition and evaluation metric. The problem of check-in node deployment should be clear and a metric should be defined to accurately evaluate performance for check-in node deployment methods.
2
Efficient check-in node deployment method. Currently, to our best knowledge, the check-in node deployment method research is open, and there is a lack of deep study.
3
Evaluations. To verify the effectiveness of proposed methods, extensive simulations must be composed in both classical complex network models (e.g. scale-free network model and random network model) and real network models.

In the following section, we will first show the results of this work. Then we introduce the proposed algorithms and the employed network models. Finally, we close this work with a conclusion.

Results

Here we first define the check-in node deployment problem. Given a network which might be directed or undirected, assuming the shortest path routing protocol is employed, every shortest path between any pair of source and destination must include at least a check-in node to receive the check-in like services. Then the minimum number of check-in nodes (MNCN) which can guarantee every shortest path including at least a check-in node can be employed to evaluate the performance of a check-in node deployment method.

This problem can be converted into the set cover problem¹⁶ (see details in Methods section) and solved by employing greedy algorithm¹⁶ (GA). To compare with GA, other 4 check-in node position selection methods (see details in Methods section) including high betweenness first (HBF), high degree first (HDF), random and low degree first (LDF) are discussed.

Given a set of locations for check-in nodes , the cover rate of all shortest paths f (see details in Methods section) can be employed to evaluate the effectiveness of check-in node location selection methods.

We first investigate the evolution of cover rate f as a function of the number of check-in nodes in BA¹⁷ scale-free networks and ER¹⁸ random networks in Fig. 1(a,b) respectively. One can see that the GA achieves the highest f. With the same number of check-in nodes, for instance, 150 check-in nodes in Fig. 1(a,b), f under the five methods appears to be GA > HFB > HDF > Random > LDF in both two types of networks. The HBF and HDF appear to be a bit lower than the GA, but very near. The LDF is the worst, because under the shortest path routing, paths trend to pass through the nodes with high degrees. Therefore, with the same number of check-in nodes, the number of the shortest paths that passing through check-in nodes of low degrees is very small, resulting in low f. With increasing number of check-in nodes, the f increases under the 5 methods. When the number of check-in nodes goes beyond a critical value, the f gets its maximum value of 1.0. Then the MNCN can be efficiently achieved and represented by the critical value.

**Figure 1: Evolution of cover rate f as a function of the number of check-in nodes under the five different check-in nodes selection methods.**

In Fig. 2, we investigate the comparisons of different location selection methods in the two types of classical network models. In Fig. 2(a), based on the GA method, with the same network size and average degree, the robustness of BA¹⁷ networks appears to be better. It is related to the network structure, and in BA¹⁷ networks, most of the shortest path pass through a fraction of hub nodes. Meanwhile, the betweenness distribution of all nodes in ER¹⁸ network is much even, and more check-in nodes are needed. Similarly, under the HDF and HBF methods, the results are very similar to GA. Under the random selection, the effects are almost the same in two network models. In Fig. 2(e), under the LDF method, the ER¹⁸ network appears to achieve better performance. It is also related to the network structure. The degree distribution of ER¹⁸ networks is more even than BA¹⁷ networks.

**Figure 2: Comparisons of different methods in the two classical networks.**

In Fig. 3, we investigate the evolution of minimum number of check-in nodes (MNCN) under the 5 methods in BA¹⁷ scale-free networks and ER¹⁸ random networks of different network sizes. With increasing network size, the MNCN increases. Because the larger the network size, the higher the number of the shortest paths appears, and more check-in nodes are needed. Under all network sizes, the GA method can achieve the lowest MNCN, and it can confirm the effectiveness of GA method.

**Figure 3: Evolution of MNCN as a function of the network size under the five different check-in nodes selection methods.**

In Fig. 4, we investigate the comparisons of all methods in the two network models. We can see that it is very obvious that under the GA, HBF, and HDF, the BA¹⁷ network models appear to have smaller MNCN, namely higher efficiency than the ER¹⁸ networks. The effects are almost the same under the Random and LDF methods.

So far, we can say the GA can achieve very good results when compared with all other methods. However, we may want to compare the results with the optimal solution which has been proved to be NP-hard¹⁶. Here we set network size N = 20, average degree 〈k〉 = 4. We run the simulations on many BA¹⁷ and ER¹⁸ networks on a PC of Intel(R) Core(TM) i5-3470 CPU @3.2 GHz 3.2 GHz, RAM 4.0 GB. In Table 1, the results show that the average MNCN = 9 for both GA and optimal solution in BA¹⁷ networks, and average MNCN = 11 for both GA the optimal in ER¹⁸ networks. But the computational cost of the optimal solution is about 4800 and 12000 times more than GA in BA¹⁷ networks and ER¹⁸ networks respectively. The results are very amazing, especially for MNCN for both GA and optimal solution. For this special problem, the GA can achieve very good results.

Table 1 The comparisons MNCN and Computational cost under GA and the Optimal methods in several small network models.

Full size table

In Table 2, we evaluate MNCN for many real networks which are widely used in previous research papers. One can see that the GA method can efficiently locate the check-in nodes than other 4 methods.

Table 2 The comparisons of minimum number of check-in nodes (MNCN) under different check-in node selection methods in the many real and classical networks.

Full size table

Discussion

In this work, assuming a portion of nodes were designated as check-in ones to supply check-in services for vehicles or network objects, we aimed to find efficient locations for these check-in nodes to achieve every shortest path including at least a check-in node. By carefully analyzing this problem, we transformed it into a set cover problem which has proved to be NP-complete, and proposed to use the greedy algorithm¹⁶ to find a cover. To verify the effectiveness of greedy algorithm¹⁶, we discussed other four heuristic location selection methods including high betweenness first, high degree first, random, and low degree first. To compare these methods, extensive simulations were done in BA¹⁷ scale-free networks and ER¹⁸ random networks. We investigated evolution of cover rate as functions of network sizes and average degrees, and found that with increasing network size and average degree the minimum number of check-in nodes which can guarantee every shortest path including at least a check-in node increases. Moreover, we employed these methods into many real network models. All the results can well confirm the effectiveness of the greedy algorithm for set cover problem. We compare the results of the greedy algorithm¹⁶ with the optimal results, and found that the GA method can achieve better network robustness with low computational cost. The results of this work can be employed for check-in node location selections in many potential real networks. In reality, other factors such as traffic density, source and destination distributions, and routing methods should also be comprehensively considered to efficiently solve the real challenges in complex networks. Moreover, network resilience is a very important topic in network science. In epidemic processes^19,20, it has been found that the epidemic processes are drastically affected by the first two moments of the degree distribution²¹. Can these methods be employed into these network processes and enhance the other network resilience measures? In our future work, we will continue the research topic and share the results soon.

Methods

Algorithms

As shown in Fig. 5(a), a simple directed network with 5 nodes. The shortest path routing is employed. If many shortest paths exist between a source and destination pair, one of them is used randomly. For instance in Fig. 5(a), the shortest path from node 1 to 5 might be P_1,5 = {1, 2, 5} or P_1,5 = {1, 3, 5}, and we randomly select P_1,5 = {1, 3, 5}. Figure 5(b) shows all the shortest paths in the network.

**Figure 5: An example for the problem.**

In order to find the minimum number of check-in nodes, we first collect the shortest paths which pass through a given node. As shown in Fig. 5(c), node 1 has 6 shortest paths including this node, denoted by a set S₁.

In fact, the MNCN problem can be described from another perspective. Given every S_i of node i in the network, find the minimum number of sets that can cover all the shortest paths in the network, namely finding a cover J (J ⊂ V) with minimum |J| that can achieve , where and V is the set of all nodes in the network. Then it is converted into the classical Set Cover problem²² which has been proved to be NP-complete and can be approximately solved by greedy algorithm¹⁶ described as follows.

Algorithm 1: Greedy algorithm (GA):

Step 0. Set J = .

Step 1. If S_i = for all i then stop: J is a cover. Otherwise, find a subscript j maximizing |S_j| and proceed to Step 2.

Step 2. Add j to J, replace each S_i by S_i − S_j and return to Step 1.

As shown in Fig. 5, by employing the greedy algorithm, the cover J = {3, 1, 2}, namely the minimum number of check-in nodes MNCN equal to |J| = 3.

The above greedy algorithm¹⁶ can find an approximate cover J. In the greedy algorithm process, at each step, we find the set which can maximize the number of included shortest paths. Finally, there is a set sequence J. However, in real check-in demands, it is not necessary to cover all paths, and we may want to only cover a large portion of them with minimum check-in nodes. Given a subset of V, denoted as , here we employ a cover rate metric to evaluate the covered portion of all the shortest paths, described as

Under the greedy algorithm¹⁶, when , . However, the scales of real networks are very large, and it is very difficult to emulate all shortest paths in the network and calculate the set cover. Is there any simple and heuristic algorithm to achieve an approximate cover rate f with small number of check-in nodes? Most of real networks can be modeled by the scale-free network model¹⁷, in which many nodes with the highest degrees are considered as central nodes. Moreover, the betweenness centrality²³ of a node v is defined as the number of shortest paths passing through the node and be used to evaluate the importance of node in the network. Inspired by these heuristic information, in the following parts, we will employ several check-in node selection methods as baselines to compare with the greedy algorithm.

In general, the betweenness of a node directly represents the number of shortest path passing through the node, so the betweenness information based method can be described as follows.

Algorithm 2: High betweenness first (HBF).

Step 0. Sort the betweenness of all nodes in descend order.

Step 1. Given the number of check-in nodes, select the top nodes in the descend order.

Step 2. Calculate the .

In HBF, the betweenness of every node must be calculated first. Though the fast algorithm²⁴ can be used, it is still consuming huge computation resource especially for large scale networks. Meanwhile, getting the node degree information is relatively simple, and the degree information is also very efficient in evaluating node importance. Moreover, in complex networks, betweenness of a node is strong correlated to its degree. A node of high degree often has large betweenness²³. Therefore, here we propose a degree based check-in node deployment method.

Algorithm 3: High degree first (HDF)

Step 0. Sort the degrees of all nodes in descend order.

Step 1. Given the number of check-in nodes, select the top nodes in the descend order.

Step 2. Calculate the .

In HDF, the node degree in employed. Sometimes, the network structure might not be known to us, and no heuristic information can be used. Then the random location deployment mechanism can be simply used.

Algorithm 4: Random

Step 0. Given the number of check-in nodes, randomly select the nodes in the network.

Step 1. Calculate the .

Opposite to HDF, as discussed in our previous work⁵, if the check-in nodes are selected as the nodes of the lowest degrees, the network traffic capacity²⁵ will be remarkably reduced. Here, we assume the nodes of the lowest degrees are set as the check-in nodes, and compare the results with other methods.

Algorithm 5: Low degree first (LDF)

Step 0. Sort the degrees of all nodes in ascend order.

Step 1. Given the number of check-in nodes, select the top nodes in the ascend order.

Step 2. Calculate the .

Moreover, in order to compare the results with optimal solution, here we try to obtain optimal by emulating all possible sets of check-in nodes. Described as follows:

Step 0. Assuming .

Step 1. Find all combinations .

Step 2. For each combination, if then is the result, else , go to Step 1.

Network models

To verify the effectiveness of above check-in node selection methods, the network structure is the basic. In this work, the used network models include two categories: BA¹⁷ scale free networks, ER¹⁸ random networks and real network models.

The BA¹⁷ scale-free network model which is constructed by two general rules: (1) Growth; (2) Preferential attachment. Starting from m₀ fully connected nodes, a new node with m (m ≤ m₀) edges are added to the existing network, and the other end of every new edge is connected to an old node preferentially proportional to the degree of the old node.

Another classical network model is the ER¹⁸ random graph. The network generation is simple. Initially, beginning with N isolated nodes, a pair of nodes is connected by a probability p.

Additional Information

How to cite this article: Jiang, Z.-Y. and Ma, J.-F. Deployment of check-in nodes in complex networks. Sci. Rep. 7, 40428; doi: 10.1038/srep40428 (2017).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Boccaletti, S., Latora, V., Moreno, Y., Chavez, M. & Hwang, D.-U. Complex networks: Structure and dynamics. Physics reports 424, 175–308 (2006).
Article ADS MathSciNet Google Scholar
Albert, R. & Barabási, A.-L. Statistical mechanics of complex networks. Reviews of modern physics 74, 47 (2002).
Article ADS MathSciNet Google Scholar
Newman, M. E. The structure and function of complex networks. SIAM review 45, 167–256 (2003).
Article ADS MathSciNet Google Scholar
Zanin, M. et al. Combining complex networks and data mining: why and how. Physics Reports 635, 1–44 (2016).
Article ADS MathSciNet Google Scholar
Jiang, Z. Y., Ma, J. F. & Shen, Y. L. Check-in based routing strategy in scale-free networks. Physica A: Statistical Mechanics and its Applications 468, 205–211 (2017).
Article ADS Google Scholar
Jaillet, P., Song, G. & Yu, G. Airline network design and hub location problems. Location science 4, 195–212 (1996).
Article Google Scholar
Liu, Y.-Y., Slotine, J.-J. & Barabási, A.-L. Controllability of complex networks. Nature 473, 167–173 (2011).
Article CAS ADS Google Scholar
Buldyrev, S. V., Parshani, R., Paul, G., Stanley, H. E. & Havlin, S. Catastrophic cascade of failures in interdependent networks. Nature 464, 1025–1028 (2010).
Article CAS ADS Google Scholar
Kamijo, S., Matsushita, Y., Ikeuchi, K. & Sakauchi, M. Traffic monitoring and accident detection at intersections. IEEE transactions on Intelligent transportation systems 1, 108–118 (2000).
Article Google Scholar
Dunne, J. A., Williams, R. J. & Martinez, N. D. Food-web structure and network theory: the role of connectance and size. Proceedings of the National Academy of Sciences 99, 12917–12922 (2002).
Article CAS ADS Google Scholar
Yan, G., Zhou, T., Hu, B., Fu, Z.-Q. & Wang, B.-H. Efficient routing on complex networks. Physical Review E 73, 046108 (2006).
Article ADS Google Scholar
Danila, B., Yu, Y., Marsh, J. A. & Bassler, K. E. Optimal transport on complex networks. Physical Review E 74, 046106 (2006).
Article ADS Google Scholar
Ling, X., Hu, M.-B., Jiang, R. & Wu, Q.-S. Global dynamic routing for scale-free networks. Physical Review E 81, 016113 (2010).
Article ADS Google Scholar
Jiang, Z.-Y. & Liang, M.-G. Incremental routing strategy on scale-free networks. Physica A: Statistical Mechanics and its Applications 392, 1894–1901 (2013).
Article ADS MathSciNet Google Scholar
Jiang, Z.-Y., Ma, J.-F. & Jing, X. Enhancing traffic capacity of scale-free networks by employing hybrid routing strategy. Physica A: Statistical Mechanics and its Applications 422, 181–186 (2015).
Article ADS Google Scholar
Chvatal, V. A greedy heuristic for the set-covering problem. Mathematics of operations research 4, 233–235 (1979).
Article MathSciNet Google Scholar
Barabási, A.-L. & Albert, R. Emergence of scaling in random networks. science 286, 509–512 (1999).
Article ADS MathSciNet Google Scholar
Erdös, P. & Rényi, A. On the evolution of random graphs. Publ. Math. Inst. Hung. Acad. Sci 5, 43 (1960).
MathSciNet MATH Google Scholar
Graham, M. & House, T. Dynamics of stochastic epidemics on heterogeneous networks. Journal of Mathematical Biology 68, 1583–1605 (2014).
Article MathSciNet Google Scholar
Pastorsatorras, R., Castellano, C., Mieghem, P. V. & Vespignani, A. Epidemic processes in complex networks. Review of Modern Physics 87, 120–131 (2015).
MathSciNet Google Scholar
Li, K., Zhang, H., Fu, X., Ding, Y. & Small, M. Epidemic threshold determined by the first moments of network with alternating degree distributions. Physica A: Statistical Mechanics and Its Applications 419, 585–593 (2015).
Article ADS MathSciNet Google Scholar
Karp, R. M. Reducibility among combinatorial problems. In Complexity of computer computations, 85–103 (Springer, 1972).
Newman, M. E. Scientific collaboration networks. ii. shortest paths, weighted networks, and centrality. Physical review E 64, 016132 (2001).
Article CAS ADS Google Scholar
Brandes, U. A faster algorithm for betweenness centrality*. Journal of mathematical sociology 25, 163–177 (2001).
Article Google Scholar
Guimerà, R., Díaz-Guilera, A., Vega-Redondo, F., Cabrales, A. & Arenas, A. Optimal network topologies for local search with congestion. Physical Review Letters 89, 248701 (2002).
Article ADS Google Scholar

Download references

Acknowledgements

The authors are grateful to the anonymous reviewers for their valuable comments and suggestions. This work is partly supported by the National Natural Science Foundation of China (No. 61502375), the Natural Science Basis Research Plan in Shaanxi Province of China (No. 2016JQ6046), the Fundamental Research Funds for the Central Universities (No. 20101156150), the National High Technology Research and Development Program (863 Program) (No. 2015AA016007), and the China 111 Project (No. B16037).

Author information

Authors and Affiliations

School of Cyber Engineering, Xidian University, Xi’an, 710071, Shaanxi, China
Zhong-Yuan Jiang & Jian-Feng Ma
School of Computer Science and Technology, Xidian University, Xi’an, 710071, Shaanxi, China
Jian-Feng Ma

Authors

Zhong-Yuan Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Jian-Feng Ma
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Z.Y.J. and J.F.M. designed the research. Z.Y.J. performed numerical simulation and analyzed data. Z.Y.J. and J.F.M. wrote the paper. All authors reviewed the paper.

Corresponding author

Correspondence to Zhong-Yuan Jiang.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Jiang, ZY., Ma, JF. Deployment of check-in nodes in complex networks. Sci Rep 7, 40428 (2017). https://doi.org/10.1038/srep40428

Download citation

Received: 06 July 2016
Accepted: 02 December 2016
Published: 11 January 2017
DOI: https://doi.org/10.1038/srep40428

This article is cited by

Finding Key Node Sets in Complex Networks Based on Improved Discrete Fireworks Algorithm
- Fengzeng Liu
- Bing Xiao
- Hao Li
Journal of Systems Science and Complexity (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.