Abstract
Controlling large natural and technological networks is an outstanding challenge. It is typically neither feasible nor necessary to control the entire network, prompting us to explore target control: the efficient control of a preselected subset of nodes. We show that the structural controllability approach used for full control overestimates the minimum number of driver nodes needed for target control. Here we develop an alternate ‘k-walk’ theory for directed tree networks, and we rigorously prove that one node can control a set of target nodes if the path length to each target node is unique. For more general cases, we develop a greedy algorithm to approximate the minimum set of driver nodes sufficient for target control. We find that degree-heterogeneous networks are target controllable with higher efficiency than homogeneous networks and that the structure of many real-world networks is suitable for efficient target control.
Introduction
In the past decade, network science has offered deep insights into the structure and dynamics of complex networked systems^{1,2,3,4,5,6,7}. Yet, we continue to lack tools to efficiently control the dynamics of complex networks. According to control theory^{8,9,10}, a dynamic system is controllable if suitable external inputs can move the internal state of the system from any initial state to an arbitrary accessible final state in a finite time interval^{11}. Recently, we introduced an analytical framework to study the controllability of complex networks via a combination of tools from network science, control theory and statistical physics^{12}. By mapping the structural controllability problem^{13} to a maximum matching problem^{14}, we were able to identify a minimum set of driver nodes of size N_{D}, sufficient to control the whole network^{12,15}. For engineered systems, such as the autopilot system of an airplane, full control is essential. However, many biological, technological and social systems are massive in size and complexity, hence it is neither feasible nor necessary to control the full network. Instead, it is more realistic and sufficient to achieve target control, that is, to control a subset of target nodes (or a subsystem) that are essential for the system’s mission pertaining to a selected task. While potential applications of target control have been developed in domain-specific areas in biology^{16}, chemical engineering^{17}, epidemics^{18} and economic networks^{19}, a principled approach to identify a minimum set of driver nodes sufficient for the targeted control of an arbitrary complex network remains an open problem.
We study two distinct schemes for choosing the subset of nodes we wish to control. In the random scheme, a fraction f of nodes are chosen uniformly at random. In the local scheme, the chosen nodes form a connected component, capturing a well-defined local network neighbourhood. To develop an efficiency measure of target control, we compare P_{D}, the minimum number of driver nodes needed to control the fraction f of target nodes, with fN_{D}, the corresponding fraction f of the N_{D} driver nodes needed for full control. If P_{D} is less than fN_{D}, then target control is more efficient than full control for the given f. We also establish the overall target control efficiency by considering the integrated efficiency across the entire range of 0<f≤1. For both scale-free (SF)^{20} and Erdős–Rényi (ER) networks^{21,22}, we find that in general, local target control is more efficient than random target control. More surprisingly, we find that degree-heterogeneous networks, such as SF networks, have higher specific and overall target control efficiency than degree-homogeneous networks, for both random and local schemes, when the average degree of the network is large. In contrast, for full control, degree-homogeneous networks, such as ER networks, are more efficient than the corresponding degree-heterogeneous networks^{12}. Finally, we apply our methods to real data, confirming that many real-world networks display high efficiency of target control.
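The two selection schemes can be illustrated with a short sketch. The text does not specify how the connected set of the local scheme is grown, so the breadth-first growth from a random seed (ignoring link direction) used below is an assumption, as are the function names and the ring network used as an example.

```python
import random
from collections import deque

def random_targets(nodes, f, rng):
    """Random scheme: choose a fraction f of the nodes uniformly at random."""
    ordered = sorted(nodes)
    size = max(1, round(f * len(ordered)))
    return set(rng.sample(ordered, size))

def local_targets(neighbours, f, rng):
    """Local scheme (assumed variant): grow a connected set by breadth-first
    search from a random seed, treating links as undirected.
    If the seed's component is smaller than the requested size, the whole
    component is returned."""
    ordered = sorted(neighbours)
    size = max(1, round(f * len(ordered)))
    seed = rng.choice(ordered)
    chosen, queue = {seed}, deque([seed])
    while queue and len(chosen) < size:
        node = queue.popleft()
        for nb in neighbours[node]:
            if nb not in chosen and len(chosen) < size:
                chosen.add(nb)
                queue.append(nb)
    return chosen

# Hypothetical example: a ring of 10 nodes, with 30% of the nodes as targets.
ring = {i: [(i - 1) % 10, (i + 1) % 10] for i in range(10)}
rng = random.Random(1)
rand_set = random_targets(ring, 0.3, rng)
loc_set = local_targets(ring, 0.3, rng)
```

On the ring, the local scheme returns three consecutive nodes, whereas the random scheme returns three nodes scattered anywhere along the ring.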
Results
Model
The dynamics of most real systems is driven by nonlinear processes. However, the dynamical rules of real-world networks are so diverse that writing a general dynamical equation that captures them all is plainly impossible. Moreover, for many networks, especially biological networks, we do not even know the dynamical rules. Thus, before we explore the fully nonlinear dynamical problem, we have to understand the impact of the topological characteristics on linear control, which naturally serves as a prerequisite of the nonlinear controllability problem. Furthermore, the controllability of nonlinear systems is often structurally similar to and determined by the system’s linearized dynamics^{23}. Indeed, a basic starting point for exploring the controllability of any nonlinear system is the study of the linearized version of the nonlinear dynamical system. Therefore, we start with the canonical linear time-invariant dynamics.
$$\dot{x}(t)=Ax(t)+Bu(t),\qquad y(t)=Cx(t),\qquad (1)$$
where x(t)∈ℝ^{N}, u(t)∈ℝ^{M} and y(t)∈ℝ^{S} represent the system’s state, input and output vectors, respectively. A∈ℝ^{N×N}, B∈ℝ^{N×M} and C∈ℝ^{S×N} denote the state, input and output matrices, respectively. A captures the wiring diagram of the system; B identifies the nodes that are controlled by an external controller and u is the time-dependent input applied to the nodes in B; C is the output matrix identifying the target nodes we want to control. For a network with N nodes, we are interested in controlling a target node set 𝒞={c_{1}, c_{2},... c_{S}} of size S. We set the output matrix C=[I(c_{1}),I(c_{2}),... I(c_{S})], where I(i) denotes the ith row of an N × N identity matrix I. The system (A, B, C) is said to be target controllable with respect to a given target node set 𝒞 if there exists a time-dependent input vector u(t)=(u_{1}(t),... u_{M}(t))^{T} that can drive the state of the target nodes to any desired final state in finite time. Target controllability can be viewed as a special type of output controllability, and the system (A, B, C) is target controllable if and only if the dimension of the output controllable subspace d(A, B, C) satisfies
$$d(A,B,C)=\operatorname{rank}\left[CB,\,CAB,\,CA^{2}B,\,\ldots,\,CA^{N-1}B\right]=S,\qquad (2)$$
representing the mathematical condition for target controllability^{24}. Note that when S=N, C is the identity matrix and (2) reduces to the Kalman condition for full controllability^{11}. If (2) is satisfied, for initial condition x(0)=0, we can compute the optimal input vector u(t) that reaches the desired final state y_{f} of the target nodes at time t_{f} with minimum energy cost (that is, minimizing ∫_{0}^{t_{f}}‖u(t)‖^{2}dt) (ref. 24):
$$u(t)=B^{T}e^{A^{T}(t_{f}-t)}C^{T}\left(CWC^{T}\right)^{-1}y_{f},\qquad (3)$$
where $W=\int_{0}^{t_{f}}e^{At}BB^{T}e^{A^{T}t}\,\mathrm{d}t$ (Supplementary Note 1).
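The rank condition for target controllability, rank[CB, CAB, ..., CA^{N−1}B]=S, can be checked numerically for small examples. The sketch below is an illustration and not the method used in the paper: it assigns random integer weights to the links as a stand-in for the independent free parameters of structural control theory, and computes the rank exactly with rational arithmetic. The function names, the 0-based node labels and the two three-node example networks are assumptions made for illustration.

```python
from fractions import Fraction
import random

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def matrix_rank(M):
    """Exact rank by Gaussian elimination over the rationals."""
    M = [[Fraction(v) for v in row] for row in M]
    rank = 0
    for col in range(len(M[0])):
        pivot = next((r for r in range(rank, len(M)) if M[r][col] != 0), None)
        if pivot is None:
            continue
        M[rank], M[pivot] = M[pivot], M[rank]
        for r in range(rank + 1, len(M)):
            factor = M[r][col] / M[rank][col]
            M[r] = [a - factor * b for a, b in zip(M[r], M[rank])]
        rank += 1
        if rank == len(M):
            break
    return rank

def is_target_controllable(n, links, drivers, targets, seed=0):
    """Test rank[CB, CAB, ..., CA^(n-1)B] == number of targets.
    links: (source, dest) pairs; a link source -> dest sets A[dest][source]."""
    rng = random.Random(seed)
    A = [[0] * n for _ in range(n)]
    for source, dest in links:
        A[dest][source] = rng.randint(1, 10**6)   # random nonzero weight
    B = [[0] * len(drivers) for _ in range(n)]
    for col, node in enumerate(drivers):
        B[node][col] = 1                          # one input per driver node
    rows = [[] for _ in targets]                  # rows selected by C
    power = B                                     # holds A^k B
    for _ in range(n):
        for i, t in enumerate(targets):
            rows[i].extend(power[t])
        power = matmul(A, power)
    return matrix_rank(rows) == len(targets)

# Hypothetical examples with a single input on node 0.
path_ok = is_target_controllable(3, [(0, 1), (1, 2)], [0], [1, 2])  # path 0->1->2
star_ok = is_target_controllable(3, [(0, 1), (0, 2)], [0], [1, 2])  # star 0->1, 0->2
```

In the path, the two targets sit at different distances from the input and the test succeeds; in the star, both targets sit at distance one, their rows are linearly dependent and the test fails.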
In some ways the target control problem is more difficult than the full control problem. Full control has a graphical condition, which can be easily checked by exactly mapping the controllability problem to the maximum matching problem. Target control lacks such an exact mapping. Therefore, to solve the target control problem in the single-input case, we develop a new approach that we call the ‘k-walk theory’ (Fig. 1; Supplementary Fig. 1; Supplementary Note 2). This theory is based on the principle that a node can control a target set of nodes provided that the length of the path from the control node to each target node in the set is unique. Using the k-walk theory, we can identify all sets of nodes that can be controlled by the given node. We show that the k-walk theory is more powerful than the standard approach as it can find controllable subsystems that the structural control theory misses (Fig. 1; Supplementary Note 2). We rigorously prove that the k-walk theory is correct for directed tree-like networks, that is, networks with no loops (Supplementary Note 2).
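The unique-path-length condition is straightforward to check on a directed tree. The following sketch computes the path length from a control node to every node by breadth-first search and tests whether the targets all sit at distinct distances. The tree, the node labels and the function names are hypothetical and serve only as an illustration of the condition stated above.

```python
from collections import deque

def path_lengths(children, root):
    """Path length from root to every reachable node of a directed tree."""
    depth = {root: 0}
    queue = deque([root])
    while queue:
        node = queue.popleft()
        for child in children.get(node, []):
            if child not in depth:
                depth[child] = depth[node] + 1
                queue.append(child)
    return depth

def satisfies_k_walk(children, root, targets):
    """True if every target is reachable from root and no two targets
    lie at the same path length from root."""
    depth = path_lengths(children, root)
    if any(t not in depth for t in targets):
        return False
    lengths = [depth[t] for t in targets]
    return len(set(lengths)) == len(lengths)

# Hypothetical directed tree: 1 -> 2, 1 -> 3, 2 -> 4, 2 -> 5, 4 -> 6.
tree = {1: [2, 3], 2: [4, 5], 4: [6]}
chain_like = satisfies_k_walk(tree, 1, [1, 2, 4, 6])  # lengths 0, 1, 2, 3
siblings = satisfies_k_walk(tree, 1, [2, 3])          # both at length 1
```

Here node 1 satisfies the condition for the target set {1, 2, 4, 6} but not for {2, 3}, whose members are both one step away from it.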
Although powerful, the k-walk theory is only applicable to the single-input case. For networks that require >1 control input, we formulate the target control problem in graph theoretic terms, allowing us to develop a greedy algorithm (GA) that offers a good approximation to the minimum set of inputs sufficient for target control (Fig. 2). We rigorously prove that the input set selected by our algorithm can indeed control all target nodes (Fig. 3; Supplementary Note 3). Both the k-walk theory and the GA are based on the structural control theory, that is, the system parameters are either fixed at zero or are independent free parameters. This approach has led to several recent advances in network control^{19,25,26,27,28,29,30,31,32,33,34}. Structural control theory, in general, relies on the canonical linear time-invariant dynamics of the system as discussed in the Methods.
Previous studies based on the structural control theory shed some light on how to determine the upper bound of the minimum number of driver nodes for target controllability^{35,36}. We can use the maximum matching method to find the minimum set of driver nodes (N_{D} nodes) for full control of the network (Fig. 2a–c). This results in N_{D} disjoint ‘cacti’ (Supplementary Note 3), each denoting the control region of a particular driver node under one specific maximum matching. We can then count the minimal number of disjoint cacti that cover all target nodes, which yields an upper bound of P_{D} for target control (labelled UB in Fig. 4; see Fig. 2 and Supplementary Note 3 for details). The lower bound of P_{D} (labelled LB in Fig. 4) can also be obtained using maximum matching. We consider the maximum matching in a bipartite graph connecting on one side (1) the target nodes (S nodes), and on the other (2) the nodes that can reach the target nodes via (3) the edges among them. If a target node is unmatched, we must drive it directly, and the number of unmatched target nodes is the lower bound of P_{D}. Taking Fig. 2 as an example, the lower bound of P_{D} is given by the first iteration of the GA; it equals 1 for the target nodes {1, 2, 4, 6, 7}, because in the first iteration only node 1 is unmatched.
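The lower-bound calculation can be sketched with a textbook augmenting-path matching routine. The example network below is hypothetical (it is not the network of Fig. 2), and the function names are chosen for illustration; any maximum bipartite matching algorithm, such as Hopcroft–Karp^{14}, could be substituted.

```python
def maximum_matching(right_nodes, in_neighbours):
    """Augmenting-path bipartite matching.
    Returns a dict mapping each matched right-side node to its left-side partner."""
    match_of_left = {}  # left-side node -> right-side node

    def try_match(right, visited):
        for left in in_neighbours.get(right, ()):
            if left in visited:
                continue
            visited.add(left)
            # Take a free left node, or re-route the node currently using it.
            if left not in match_of_left or try_match(match_of_left[left], visited):
                match_of_left[left] = right
                return True
        return False

    for right in right_nodes:
        try_match(right, set())
    return {right: left for left, right in match_of_left.items()}

def lower_bound(links, targets):
    """Number of target nodes left unmatched by a maximum matching between
    the targets (right side) and the nodes pointing to them (left side)."""
    in_neighbours = {}
    for source, dest in links:
        in_neighbours.setdefault(dest, []).append(source)
    matched = maximum_matching(sorted(targets), in_neighbours)
    return len(targets) - len(matched)

# Hypothetical directed network given as (source, dest) links.
links = [(1, 2), (1, 3), (2, 4), (2, 5), (4, 6)]
lb_chain = lower_bound(links, {1, 2, 4, 6})       # only node 1 has no incoming link
lb_star = lower_bound([(1, 2), (1, 3)], {1, 2, 3})  # node 1 can match only one of 2, 3
```

In the first example only node 1 is unmatched, so the lower bound is 1; in the second, node 1 and one of its two children remain unmatched, giving a lower bound of 2.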
Target controllability
To quantify the efficiency of target control for a specific fraction f, we define the target controllability parameter α_{D}≡P_{D}/N_{D} (Methods section) and investigate it for both random and local control schemes, as shown in Fig. 4a,d, respectively. Note that for f=1, we have α_{D}=1 as target control reduces to full control. In Fig. 4 the black lines denote the neutral condition (α_{D}=f) and serve as a benchmark because they mean that controlling an f fraction of target nodes requires an f fraction of the driver nodes needed for full control, that is, P_{D}=fN_{D}. In each figure, UB denotes the upper bound of the minimum number of driver nodes needed to control an f fraction of target nodes, predicted by the structural control theory (Supplementary Note 3). Likewise, GA denotes the results of the greedy algorithm and LB (Supplementary Note 3) denotes the lower bound, which is obtained by the first step of the GA (Methods section). Figure 4b shows the results of random target control of ER networks. Here the GA curve is above the neutral line (α_{D}>f), which indicates that target control is less efficient than the neutral expectation. Figure 4c shows the results of random target control on SF networks, where the GA curve is almost at the neutral line (α_{D}≈f), which indicates that target control is as efficient as the neutral expectation. If we apply the local target control scheme to an ER network, we observe reduced efficiency for f<0.5 (minus symbol in Fig. 4e) and enhanced efficiency for f>0.5 (plus symbol in Fig. 4e). Figure 4f shows the case of local target control on SF networks, where target control is more efficient than the neutral expectation (that is, α_{D}<f).
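The comparison with the neutral line amounts to a few lines of arithmetic. The sketch below uses hypothetical driver counts, and the function name and tolerance are illustrative choices.

```python
def compare_with_neutral(p_d, n_d, f, tol=1e-9):
    """Classify target control at fraction f relative to the neutral line.
    p_d: drivers needed for the targets; n_d: drivers needed for full control."""
    alpha_d = p_d / n_d
    if alpha_d < f - tol:
        return "more efficient"   # below the neutral line, alpha_D < f
    if alpha_d > f + tol:
        return "less efficient"   # above the neutral line, alpha_D > f
    return "neutral"

# Hypothetical network with N_D = 10, controlling half of the nodes (f = 0.5).
below = compare_with_neutral(2, 10, 0.5)
on_line = compare_with_neutral(5, 10, 0.5)
above = compare_with_neutral(8, 10, 0.5)
```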
In general, we find that the local target control scheme requires fewer driver nodes when compared with the random target control scheme. This can be explained as follows. The GA helps us obtain an approximately minimal set of driver nodes sufficient to control the target nodes. Yet, the size of the controllable subsystem (or equivalently, the dimension of the controllable subspace) can be larger than the size of the target node set. In other words, we may actually be able to control a larger subsystem than necessary.
In Supplementary Fig. 3, we show that the random target control scheme has larger controllable subsystems than the local target control scheme. This is true for any fraction f of target nodes. Hence the local control scheme is more efficient than the random control scheme. Furthermore, we find that α_{D} is robust to changes in network size as shown in Supplementary Fig. 6. Finally, since the hubs play an important role in complex networks, we study the case of controlling an f fraction of the highest in- and out-degree nodes as target nodes (Supplementary Fig. 5). In general, we find that controlling the high-degree nodes reduces the minimum number of driver nodes.
Target control efficiency
The above observations raise a fundamental question: what kind of network topology is more efficient for target control and which control scheme (random or local) facilitates target control for a general f fraction of target nodes? To address this question, we define the overall target control efficiency in the Methods. The overall target control efficiency of two model networks (ER and SF) is shown in Fig. 5, as obtained from simulations. When ‹k›=0 (a network with N isolated nodes), we have α_{D}=f for every f, so the efficiency is neutral. That is, if we wish to control an f fraction of nodes we need the same fraction of the driver nodes as in the case of full control. We observe a peak of target control efficiency for SF networks with degree exponent γ∈[2, 3], which is the range of γ relevant for many real-world networks^{37,38} (Fig. 5a,c).
This optimal target control efficiency is likely due to the presence of the hubs. Indeed, starting from an ER network, we can fix the number of links and rewire the network to increase the degree of a preselected node, turning it into a hub. We find that the target control efficiency increases as we develop a hub (Supplementary Fig. 4; Supplementary Note 4), explaining the increase of efficiency as we lower γ from γ>3 to γ≈2.5.
Overall, Fig. 5a suggests that many real-world networks may have been optimized for efficient target controllability. Interestingly, we find that networks that are easier to fully control, that is, those with large average degree ‹k› or large degree exponent γ^{12}, do not show high efficiency for target control. Moreover, SF networks with 2≤γ≤3, which are harder to fully control than ER networks, have high random target control efficiency (Fig. 5c). Compared with ER networks, an SF network has lower local target control efficiency when the average degree ‹k› is small but higher local target control efficiency when the average degree ‹k› is large (Fig. 5d). Figure 5d also indicates the existence of a critical ‹k›_{c} so that when ‹k›>‹k›_{c} SF networks are more efficient than ER networks for target control. In general, sparse and heterogeneous networks display high efficiency for target control, especially in the local scheme.
Target control of real networks
Next we apply the tools developed above to several real networks divided into seven categories, which are chosen for their diversity in applications and scope (Fig. 6; Table 1). We find that networks from the same category display similar target control efficiency (Fig. 6a,d). Interestingly, only networks with large average degree (‹k›>‹k›_{c}>2.4) are more efficient than their randomized counterparts with respect to the local target control scheme, that is, their local target control efficiency exceeds that of the randomized networks (with values indicated in italics in Table 1, column ‹k›, and in Fig. 6f). This is in agreement with simulations (Fig. 5d). We observe that local target control markedly increases the control efficiency of neuronal networks (Fig. 6e). The opposite result is obtained for the power grid, where random target control is more efficient than local control (Fig. 6b,e). This implies that the structure of the power grid is not optimized to facilitate local target control. For all gene networks, target control efficiency is close to 0, indicating that they have evolved towards a topology that has comparable control efficiency within any subsegment. Also, in the random target control scheme, the comparison between real networks and randomized networks supports our earlier conclusion: real networks are optimized to have high random target control efficiency, because almost all of them are more efficient than their randomized counterparts. The two counterexamples are the trust networks, which, however, are extremely small, with only 32 and 67 nodes. The target control efficiency of real networks is summarized in Table 1.
Discussion
In this work, we studied the target controllability of complex networks. We developed a new theoretical approach, the k-walk theory, to identify the controllable subgraph that one node can control, and a GA to identify an approximately minimum set of driver nodes to control a specified target set of nodes. We studied both random and local target control schemes and analysed how the network topology impacts the target control efficiency. The GA proposed in this work is efficient in finding the driver nodes for target control when the network structure is completely known. In reality, we have very incomplete maps of many real-world networks, and the full controllability of networks with missing links has been recently addressed in ref. 39. Note that if we add the missing links back, they will only enhance the target controllability of the system, unless the link weights satisfy some particular algebraic constraints, which is a typical zero-measure event.
Our results raise several open questions: what higher-order network characteristics (such as communities, degree correlations) determine target controllability and target control efficiency? How can we design the control inputs to steer the target nodes towards a desired final state? How can we apply the GA to study the target control of link dynamics?
Finally, as discussed in the Methods section, our formulation considers the canonical linear time-invariant dynamics of a system. Extending these findings to networks with fully nonlinear dynamics may require connecting our current work with complementary approaches that consider the basins of attraction that describe the steady-state properties of nonlinear dynamics^{10,40}. Answering these questions can significantly improve our understanding of the control principles of complex systems.
Methods
Greedy algorithm
The GA we introduce here is based, in part, on iterating the procedure for the lower-bound calculation to ultimately approximate the minimum set of driver nodes for target control (Fig. 2d,e). The lower bound of P_{D} can be calculated as follows: (1) Construct a bipartite graph in which the right side consists of all the target nodes and the left side consists of all the nodes that can reach the target nodes. A left-side node is linked to a right-side node if the corresponding directed link exists in the original network. (2) Find a maximum matching in this bipartite graph. The number of unmatched right-side nodes yields the lower bound of P_{D}. The greedy algorithm works as follows. (1) Initialize the driver set to be the set of unmatched nodes found in the lower-bound calculation. (2) Identify the left-side nodes that are matched to the right-side nodes, let this node set be the new right side and construct the new bipartite graph. (3) Calculate a maximum matching in the updated bipartite graph and add the unmatched right-side nodes to the driver set. (4) Repeat (2) and (3) until all nodes have been matched or are in the driver set. In Fig. 2d,e, we offer a specific example and the details of the GA. The proof of its sufficiency for target controllability is provided in Supplementary Fig. 2 and Supplementary Note 2. Figure 2e can also be obtained by the k-walk theory, because the length of the path from node 1 to each target node in {1, 2, 4, 6, 7} is unique. Note that if the GA converges after only one iteration, we obtain the exact minimum number of driver nodes for target control.
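The iteration described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the matching routine is a textbook augmenting-path algorithm, the loop is capped at N rounds (an assumption made here so that the sketch always terminates), and target sets that are perfectly matched in every round, such as targets lying on a directed cycle, are not handled and yield an empty driver set. The example networks are hypothetical.

```python
def maximum_matching(right_nodes, in_neighbours):
    """Augmenting-path bipartite matching.
    Returns a dict mapping each matched right-side node to its left-side partner."""
    match_of_left = {}  # left-side node -> right-side node

    def try_match(right, visited):
        for left in in_neighbours.get(right, ()):
            if left in visited:
                continue
            visited.add(left)
            if left not in match_of_left or try_match(match_of_left[left], visited):
                match_of_left[left] = right
                return True
        return False

    for right in right_nodes:
        try_match(right, set())
    return {right: left for left, right in match_of_left.items()}

def greedy_drivers(links, targets):
    """Iterate the lower-bound matching: unmatched right-side nodes become
    drivers, matched left-side nodes become the next right side."""
    in_neighbours = {}
    nodes = set(targets)
    for source, dest in links:
        in_neighbours.setdefault(dest, []).append(source)
        nodes.update((source, dest))
    drivers = set()
    right = set(targets)
    for _ in range(len(nodes)):          # assumed cap: at most N rounds
        if not right:
            break
        matched = maximum_matching(sorted(right), in_neighbours)
        drivers |= right - set(matched)  # unmatched nodes are driven directly
        right = set(matched.values())    # matched partners form the new right side
    return drivers

# Hypothetical examples.
path_drivers = greedy_drivers([(1, 2), (2, 3), (3, 4)], {2, 4})  # path 1->2->3->4
star_drivers = greedy_drivers([(1, 2), (1, 3)], {2, 3})          # star 1->2, 1->3
```

For the path, the sketch returns the single driver node 1, in agreement with the k-walk condition (the targets lie at path lengths 1 and 3 from node 1). For the star, both targets lie at the same distance from node 1, and the sketch returns two drivers: node 1 and one of its children.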
A direct illustration of our algorithm’s utility is given in Fig. 3, where we consider control of the network shown in Fig. 2. To control the entire network, we need at least three driver nodes. Indeed, as we show in Fig. 3a,b, using these three nodes we can move the state of all nodes to the desired final state x_{i}=10. But if we just want to control a subsystem {1, 2, 4, 6, 7} (highlighted in red in Fig. 2e), the GA predicts that we need a single driver node, node 1. Indeed, as we show in Fig. 3c,d, we can now move the state of the target nodes to the desired final state through an input to node 1 only. The nodes outside of the target list take arbitrary values as we do not control them.
Target control efficiency
We define the overall target control efficiency of an arbitrary network as
$$\eta\equiv 2\int_{0}^{1}\left[f-\alpha_{D}(f)\right]\mathrm{d}f,\qquad (4)$$
denoting the efficiency of the random and local target control schemes by η^{rand} and η^{loc}, respectively. For example, for η=0 the overall network efficiency is neutral, that is, to control an f fraction of target nodes we need fN_{D} driver nodes. When η<0 (or η>0), the network is less (or more) efficient than the neutral expectation. Furthermore, −1≤η≤1, so η=1 corresponds to the most efficient case and η=−1 to the least efficient case. One example of the least efficient case is when only one driver node is needed to control the whole network. If we control any fraction of nodes, we still need one driver node, thus η=−1 (observed for Food Web and Neuronal networks in Table 1). Note that target controllability for a specific fraction, α_{D}(f), depends on the fraction f of target nodes, but the overall target control efficiency, η, is a property of the whole network, independent of the fraction of target nodes.
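Given a sampled curve α_{D}(f), the overall efficiency can be estimated by numerical integration. The sketch below assumes that the efficiency is twice the signed area between the neutral line α_{D}=f and the measured curve, a normalisation under which the neutral case gives 0 and the single-driver case described above gives −1. The trapezoidal rule, the function name and the sample curves are illustrative choices.

```python
def overall_efficiency(fractions, alphas):
    """Trapezoidal estimate of 2 * integral over f in [0, 1] of (f - alpha_D(f)).
    fractions: increasing sample points f; alphas: alpha_D at those points."""
    area = 0.0
    for i in range(len(fractions) - 1):
        gap_left = fractions[i] - alphas[i]
        gap_right = fractions[i + 1] - alphas[i + 1]
        area += 0.5 * (gap_left + gap_right) * (fractions[i + 1] - fractions[i])
    return 2.0 * area

# Hypothetical curves sampled at 101 evenly spaced fractions.
fs = [i / 100 for i in range(101)]
eta_neutral = overall_efficiency(fs, fs)                   # alpha_D = f everywhere
eta_single_driver = overall_efficiency(fs, [1.0] * 101)    # N_D = 1, so alpha_D = 1
```

The neutral curve gives an efficiency of 0, and the single-driver curve gives −1 up to rounding error.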
Additional information
How to cite this article: Gao, J. et al. Target control of complex networks. Nat. Commun. 5:5415 doi: 10.1038/ncomms6415 (2014).
References
1. Dorogovtsev, S. N. & Mendes, J. F. Evolution of Networks: From Biological Nets to the Internet and WWW (Physics) Oxford University Press (2003).
2. Albert, R. & Barabási, A.-L. Statistical mechanics of complex networks. Rev. Mod. Phys. 74, 47–97 (2002).
3. Cohen, R. & Havlin, S. Complex Networks: Structure, Robustness and Function Cambridge University Press (2010).
4. Newman, M. E. J. Networks: An Introduction Oxford University Press (2010).
5. Song, C., Havlin, S. & Makse, H. A. Self-similarity of complex networks. Nature 433, 392–395 (2005).
6. Barrat, A., Barthelemy, M. & Vespignani, A. Dynamical Processes on Complex Networks Cambridge University Press (2008).
7. Gallos, L. K., Song, C., Havlin, S. & Makse, H. A. Scaling theory of transport in complex biological networks. Proc. Natl Acad. Sci. USA 104, 7746–7751 (2007).
8. Dorf, R. C. Modern Control Systems Addison-Wesley Longman Publishing Co. Inc. (1991).
9. Boyd, S. P., El Ghaoui, L., Feron, E. & Balakrishnan, V. Linear Matrix Inequalities in System and Control Theory Vol. 15, SIAM (1994).
10. Cornelius, S. P., Kath, W. L. & Motter, A. E. Realistic control of network dynamics. Nat. Commun. 4, 1942 (2013).
11. Kalman, R. E. Mathematical description of linear dynamical systems. J. Soc. Indus. Appl. Math. Ser. A 1, 152 (1963).
12. Liu, Y.-Y., Slotine, J.-J. & Barabási, A.-L. Controllability of complex networks. Nature 473, 167–173 (2011).
13. Lin, C.-T. Structural controllability. IEEE T. Automat. Contr. 19, 201–208 (1974).
14. Hopcroft, J. E. & Karp, R. M. An n^{5/2} algorithm for maximum matchings in bipartite graphs. SIAM J. Comput. 2, 225–231 (1973).
15. Nepusz, T. & Vicsek, T. Controlling edge dynamics in complex networks. Nat. Phys. 8, 568–573 (2012).
16. Kuchtey, J., Fulton, S. A., Reba, S. M., Harding, C. V. & Boom, W. H. Interferon-αβ mediates partial control of early pulmonary Mycobacterium bovis bacillus Calmette-Guérin infection. Immunology 118, 39–49 (2006).
17. Baldea, M. & Daoutidis, P. Model reduction and control of reactor-heat exchanger networks. J. Process Contr. 16, 265–274 (2006).
18. Cohen, R., Havlin, S. & ben-Avraham, D. Efficient immunization strategies for computer networks and populations. Phys. Rev. Lett. 91, 247901 (2003).
19. Galbiati, M., Delpini, D. & Battiston, S. The power to control. Nat. Phys. 9, 126–128 (2013).
20. Barabási, A.-L. & Albert, R. Emergence of scaling in random networks. Science 286, 509–512 (1999).
21. Erdős, P. & Rényi, A. On random graphs. I. Publ. Math. 6, 290–297 (1959).
22. Erdős, P. & Rényi, A. On the evolution of random graphs. Publ. Math. Inst. Hung. Acad. Sci. 5, 17–61 (1960).
23. Slotine, J.-J. & Li, W. Applied Nonlinear Control Prentice-Hall (1991).
24. Murota, K. & Poljak, S. Note on a graph-theoretic criterion for structural output controllability. IEEE T. Automat. Contr. 35, 939–942 (1990).
25. Ruths, J. & Ruths, D. Control profiles of complex networks. Science 343, 1373–1376 (2014).
26. Wu, F.-X., Wu, L., Wang, J., Liu, J. & Chen, L. Transittability of complex networks and its applications to regulatory biomolecular networks. Sci. Rep. 4, 4819 (2014).
27. Liu, Y.-Y., Slotine, J.-J. & Barabási, A.-L. Observability of complex systems. Proc. Natl Acad. Sci. USA 110, 2460–2465 (2013).
28. Wang, W.-X., Ni, X., Lai, Y.-C. & Grebogi, C. Optimizing controllability of complex networks by minimum structural perturbations. Phys. Rev. E 85, 026115 (2012).
29. Yan, G., Ren, J., Lai, Y.-C., Lai, C.-H. & Li, B. Controlling complex networks: how much energy is needed? Phys. Rev. Lett. 108, 218703 (2012).
30. Liu, Y.-Y., Slotine, J.-J. & Barabási, A.-L. Control centrality and hierarchical structure in complex networks. PLoS ONE 7, e44459 (2012).
31. Pósfai, M., Liu, Y.-Y., Slotine, J.-J. & Barabási, A.-L. Effect of correlations on network controllability. Sci. Rep. 3, 1067 (2013).
32. Sun, J. & Motter, A. E. Controllability transition and nonlocality in network control. Phys. Rev. Lett. 110, 208701 (2013).
33. Yuan, Z., Zhao, C., Di, Z., Wang, W.-X. & Lai, Y.-C. Exact controllability of complex networks. Nat. Commun. 4, 2447 (2013).
34. Jia, T. et al. Emergence of bimodality in controlling complex networks. Nat. Commun. 4, 2002 (2013).
35. Chan, B. Y. & Shachter, R. D. In Proceedings of the Eighth International Conference on Uncertainty in Artificial Intelligence 25–32 Morgan Kaufmann Publishers Inc. (1992).
36. Blackhall, L. & Hill, D. J. On the structural controllability of networks of linear systems (2nd IFAC Workshop on Distributed Estimation and Control in Networked Systems) 245–250 (2010).
37. Albert, R., Jeong, H. & Barabási, A.-L. Error and attack tolerance of complex networks. Nature 406, 378–382 (2000).
38. Caldarelli, G. Scale-Free Networks: Complex Webs in Nature and Technology Oxford University Press (2007).
39. Slotine, J.-J. & Liu, Y.-Y. Complex networks: the missing link. Nat. Phys. 8, 512–513 (2012).
40. Lai, Y.-C. National Science Review Oxford University Press (2014).
Acknowledgements
We gratefully acknowledge support from the US Army Research Laboratory and the US Army Research Office under Cooperative Agreement W911NF0920053 and MURI award W911NF1310340, The John Templeton Foundation ID #51977, as well as the Defense Threat Reduction Agency Basic Research Grant No. HDTRA11010100. We thank Tao Jia, Jean-Jacques Slotine and Gang Yan for discussions.
Author information
Contributions
All authors designed and did the research. J.G. analysed the empirical data and did the analytical and numerical calculations. A.-L.B., R.M.D’S. and Y.-Y.L. were the lead writers of the manuscript.
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Supplementary Figures 1–6, Supplementary Notes 1–4 and Supplementary References. (PDF 771 kb)
Rights and permissions
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/
About this article
Cite this article
Gao, J., Liu, Y., D'Souza, R. et al. Target control of complex networks. Nat Commun 5, 5415 (2014). https://doi.org/10.1038/ncomms6415