Ranking in interconnected multilayer networks reveals versatile nodes

De Domenico, Manlio; Solé-Ribalta, Albert; Omodei, Elisa; Gómez, Sergio; Arenas, Alex

doi:10.1038/ncomms7868

Article
Published: 23 April 2015

Ranking in interconnected multilayer networks reveals versatile nodes

Nature Communications volume 6, Article number: 6868 (2015) Cite this article

16k Accesses
322 Citations
39 Altmetric
Metrics details

Subjects

Abstract

The determination of the most central agents in complex networks is important because they are responsible for a faster propagation of information, epidemics, failures and congestion, among others. A challenging problem is to identify them in networked systems characterized by different types of interactions, forming interconnected multilayer networks. Here we describe a mathematical framework that allows us to calculate centrality in such networks and rank nodes accordingly, finding the ones that play the most central roles in the cohesion of the whole structure, bridging together different types of relations. These nodes are the most versatile in the multilayer network. We investigate empirical interconnected multilayer networks and show that the approaches based on aggregating—or neglecting—the multilayer structure lead to a wrong identification of the most versatile nodes, overestimating the importance of more marginal agents and demonstrating the power of versatility in predicting their role in diffusive and congestion processes.

You have full access to this article via your institution.

Download PDF

Improving microbial phylogeny with citizen science within a mass-market video game

Article Open access 15 April 2024

Persistent interaction patterns across social media platforms and over time

Article Open access 20 March 2024

Principal component analysis

Article 22 December 2022

Introduction

A comprehensive definition of centrality has driven the interest of sociologists for several decades^1,2,3, although, intriguingly, the result is not unique. Many different measures, depending on the application of interest, have defined centrality in terms of activity, control, communicability or independence⁴. Centrality measures are useful to identify proteins crucial for the survival of the cell⁵, design optimal network topologies for local search with congestion⁶, design efficient ways in which to engineer the structure of the network⁷, identify influential spreaders⁸, drive the network towards a desired state⁹, or mitigate the cascading failures of technological networks¹⁰ and identify potential drug targets in a signalling network of human cancer¹¹. Although probably, the most famous, by use, measure of centrality is PageRank¹², the ranking measure operating behind the universal search engine of Google.

It has been a common practice, in network theory, to assume that nodes are linked by a single type of static edge that encapsulates their interactions, although in a myriad of scenarios this assumption oversimplifies the complexity of the network. Accounting for different types of interactions between nodes can be nowadays correctly analysed in the framework of multilayer networks^{13,14,15,16,17}. A schematic of an interconnected multilayer network is shown in Fig. 1 (see Supplementary Note 1 for additional real-world examples and Supplementary Fig. 1 for additional synthetic examples). Neglecting the existence of multiple relationships between nodes, or aggregating such relationships to a single weighted network, alters the topological and dynamical properties of the full system^15,18,19,20 and the importance of the nodes with respect to the whole structure^{21,22,23,24,25}.

Despite of their ubiquity, multilayer networks are still poorly understood. Here we exploited a recent mathematical grounded formalism that uses a tensorial representation of multilayer networks to determine their most central nodes with respect to specific established definitions. In the case of multilayer networks, the corresponding representation is a rank-4 tensor encoding a directed, weighted, connection between node i from layer α to any other node j in any other layer β (see Supplementary Note 2 for details about notation and on the tensorial nature of adjacency tensors). Although it is difficult to represent such a four-dimensional object, it can be thought as composed by two-dimensional slices along the third dimension, representing intralayer connections between nodes within the same layer α, together with two-dimensional slices along the fourth dimension, representing interlayer connections between nodes laying on different layers. This topology and the dynamics of processes on top of it make multilayer networks unique entities with new structural and dynamical properties to be unveiled. It is worth remarking that the monoplex adjacency tensor can be interpreted as a linear transformation which, given a vector (or 1—form) representing a node, returns another vector (or 1—form) with the set of their adjacent nodes. Thus, the only acceptable representation for the monoplex adjacency object is a 1—covariant and 1—contravariant tensor. Likewise, the multilayer adjacency tensor transforms a node in one layer into the set of adjacent nodes, keeping also the information of which layer they belong to, thus a 2—covariant and 2—contravariant tensor is needed. Our results show that calculating the centrality of nodes in each network of the multilayer structure separately or aggregating the information to a single network inevitably leads to misleading results. The tensorial formulation of multilayer networks allows to overcome such limitations and to generalize widely adopted centrality measures (see Supplementary Note 3) capturing the importance of nodes in real interconnected topologies, such as social, transportation and biological networks. We demonstrated that our framework provides new insights in empirical networks, which are inherently multilayer, and that accounting for the interconnected structure is an essential requirement to identify key actors (versatile nodes) in systems exhibiting complex relationships. Moreover, versatility is a good predictor for diffusive and congestion processes in multilayer networks.

Results

Eigenvector versatility

Focusing on ranking the nodes of the multilayer network according to their central role, we have revisited several definitions and adapted them to the new framework. One of the widest adopted measures of centrality in networks is based on an iterative procedure assigning to each node a score that is the sum of the scores of its neighbours. Mathematically, this is equivalent to calculate the largest eigenvalue and the corresponding eigenvector of the adjacency matrix. In the case of interconnected networks, a formally similar procedure is introduced (see Supplementary Note 4) to calculate the leading eigentensor Θ_iα of as the solution of the tensorial equation¹⁴

where λ₁ is the largest eigenvalue. Here, Θ_iα encodes the eigenvector versatility of each node (i) in each layer (α) when accounting for the whole interconnected structure. The versatility of each node is obtained by aggregating over layers the centrality of each node in each layer computed using the full multilayer structure, by θ_i=Θ_iαu^α, where u^α is the rank-1 tensor with all components equal to 1. The choice of this aggregation corresponds to a maximum entropy principle, a reasonable choice when no specific criteria about the importance of layers is considered. When other information is considered, and used to weight layers, it is possible to obtain specific weighted aggregations as discussed in Solá et al.²².

PageRank versatility

Google’s PageRank centrality¹² is a variant of the previous definition, and corresponds to the steady-state solution of the master equation of a random walk where the walker jumps to a neighbour with rate r and teleport to any other node in the network with another rate r′. We also extended this concept to PageRank versatility of interconnected multilayer networks, where the teleportation might occur to any other node in any layer, and we directly validated our theoretical predictions against simulations (see Supplementary Note 3 for details). Similarly, other measures such as hub/authority, Katz and shortest-path-based (betweenness) versatilities have been described using the tensorial formalism presented above (see full description in Supplementary Note 3). It is worth noting that, in general, the calculation of centrality in each layer separately and its subsequent aggregation may lead to misleading results, because the nonlinear competition between layers is difficult to be accounted for a posteriori. A representative example is shown in Fig. 2 (see Supplementary Fig. 1 and Supplementary Note 5 for additional examples).

**Figure 2: Versatility compared with centrality.**

Versatility of nodes in empirical multilayer networks

To show the results of our study, first we considered biologists, chemists, computer scientists, economists, inventors, mathematicians, philosophers and physicists in Wikipedia, and we built an interconnected multilayer network with 5,513 nodes where each layer represents a discipline and two people are connected if a hyperlink exists between their pages. The disciplines for each individual have been determined from the listed pages curated by the community on Wikipedia, whereas intra- and interlayer links are created as follows. First, we build the aggregated hyperlink network, regardless of the layer(s) each node belongs to: this network is directed and weighted by the number of hyperlinks between two web pages. We discard all nodes having total degree smaller than 4, regardless of the strength of the corresponding links, and we focus our attention only on nodes belonging to the giant connected component of the resulting network to build the multilayer. For example, if there exists a hyperlink between two scientists (a and b) and these happen to satisfy three disciplines simultaneously (physicists, philosophers and chemists), then we make three intralayer directed edges between a and b, one for each layer, uniformly distributing the weight of the hyperlink on each edge. If a and b do not share at least one layer, then directed interlayer edges are made between all pair of layers where a and b exist, with the weight of the original hyperlink uniformly distributed among such edges. Finally, interlayer edges between all nodes’ replicas are created and assigned weight 1.

The ranking obtained from PageRank versatility is shown for some top nodes in Table 1 and is compared with the result of using the equivalent centrality measure in the aggregated version of the network. Note that Edward Osborne Wilson, the father of sociobiology, and Harold Clayton Urey, Nobel Prize in Chemistry, well known for theories on the development of organic life from non-living matter, who played a significant role in the development of the atom bomb,gained many positions with respect to the aggregated as well as, for instance, Kurt Gödel, one of the greatest logicians of all time, with impact on several different disciplines, from pure mathematics to physics and philosophy. Interestingly, our procedure captured the versatility of people generally recognized as trans-disciplinary, with outstanding contributions to different areas. This is the case of visionary people like Leonardo da Vinci (ranked 118 by PageRank), Italian genius who lived in the fifteenth century, and Leó Szilárd (480), who patented the idea of a nuclear reactor with Enrico Fermi and conceived fundamental tools in experimental research as the electron microscope, the linear accelerator and the cyclotron, who gained thousands of positions in the multilayer because of their relevance and their links to relevant people in multiple subjects. Versatility is successfully captured in Milton Friedman as well, contributing to economics, statistics, international finance, risk and insurance, and microeconomic theory, in Hilary Putnam, a computer scientist and mathematician with outstanding contributions in philosophy of mind, of mathematics and of science, and in Charles Stark Draper, an engineer and scientist who invented inertial navigation and founded the Massachusetts Institute of Technology’s Instrumentation Laboratory responsible for designing the guidance computer of NASA’s Apollo missions. In the aggregated network, the importance of bridging different subjects can not be accounted for and, as expected, very important, but not necessarily versatile, names are top ranked, Immanuel Kant (1), Aristotle (2), Plato (3), Thomas Aquinas (6), Isaac Newton (7) and Albert Einstein (9). We have performed similar analysis on the co-authorship network obtained from the papers published in journals of the American Physical Society between 2005 and 2009, from scientists working in European institutions (data provided by APS on request, https://publish.aps.org/datasets). We considered also a sub-sample of two online social networks, Twitter and Instagram, and we built the multilayer network of 13,297 nodes, where two users are linked by directed and unweighted edges if they follow each other (see Supplementary Note 6).

Table 1 PageRank Ranking.

Full size table

We also considered a transportation network (see ref. 26 for details about this data set and how it has been obtained) shown in Fig. 1b, 450 nodes and 37 layers, and found relevant differences between betweenness versatility and centrality in multilayer and aggregated networks, respectively. London airports are rather central in the aggregated network, although they become less important in the multilayer network, because they have many connections distributed on a few airlines. For the opposite reason, airports like Brussels and Paris Charles de Gaulle, less central in the aggregated, become versatile, because their flights are operated by almost all airlines (see Supplementary Tables 6–8). It is worth noting that, in general, there is not a linear relation between versatility and the number of layers where a node exists, because versatility also depends on the contribution of each node to its centrality per layer. For completeness, the rankings distributions are compiled in Supplementary Fig. 2 for all data sets. Centrality measures in this context play a crucial role in spreading processes, from epidemic transmission to delays’ propagation through airports²⁷. We have explored the use of versatility to understand the role of nodes in substantial dynamical scenarios. Using the multilayer airport network above, we simulate ensembles of random walkers departing from each airport separately and calculate their coverage¹⁵ at time τ, defined as the fraction of nodes that have been visited up to time τ. We use the coverage at time τ=1,000 as a proxy for the size of an hypothetical epidemic spreading²⁸ starting in an airport. In the absence of empirical data about the flow between different airports, it is difficult to assess the physical time scale of our simulations. We choose τ=1,000 as a good trade-off between the initial stage of the diffusion (τ≤100), where the dynamics is still very local and there is no difference between considering the multilayer network or its aggregation, and the final stage of the diffusion (τ≥10,000), where, conversely, the coverage is almost 100%, because diffusive agents had enough time to hit almost all airports in the network, with no difference between multilayer and aggregate networks. See Supplementary Note 7 for further details. We rank airports by their coverage and use PageRank versatility and PageRank centrality in the multilayer and aggregated networks, respectively, to predict it. The results are shown in Fig. 3a,b and put in evidence that PageRank versatility outperforms the predicting capabilities of the standard PageRank centrality obtained from the aggregated network. We have also considered another dynamical process that models airplane traffic on the airport network. The model is an extension of ref. 6 to multilayer networks. The traffic is simulated by injecting, at each time step, ρ airplanes at each airport with random destination. During the following time steps, airplanes travel to its destination using shortest routes over the multiplex structure. Each airport in each layer is attached with a queue where the airplanes wait to be routed. Airports will route airplanes considering its arrival time (first-in-first-out strategy). To simulate the physical constraints of the airports, each airport is assumed to have a limited routing capacity η (for the sake of simplicity we have considered the same value for all airports). Given a sufficiently large ρ, one or more airports will achieve a congested state. In that situation, the congested airports will not be able to handle the incoming traffic and the amount of airplanes waiting to be routed will increase proportional to time. Here, we analyse the ordering at which the airports arrive to congestion and we show how the betweenness versatility is a better predictor to this ordering than the betweenness centrality. See Supplementary Note 7 for further details.

**Figure 3: Versatility as a predictor of nodes’ role in multilayer network dynamics.**

We used betweenness versatility and betweenness centrality to predict the congestion ranking. We show the results in Fig. 3c,d, putting in evidence again that the versatility predictor outperforms the predictor obtained from the aggregated network.

Discussion

In summary, we have developed a framework to compute any centrality measure in the context of multilayer interconnected complex networks. These measures reveal the versatility of nodes according to a given definition. The versatility is proved to be a good descriptor of dynamical aspects on multilayer structures than can not be achieved considering the aggregation of layers into a single network. Versatility is a promising descriptor in the exploratory analysis of any categorized data set.

Additional information

How to cite this article: De Domenico, M. et al. Ranking in interconnected multilayer networks reveals versatile nodes. Nat. Commun. 6:6868 doi: 10.1038/ncomms7868 (2015).

References

Katz, L. A new status index derived from sociometric analysis. Psychometrika 18, 39–43 (1953).
Article Google Scholar
Bonacich, P. Factoring and weighting approaches to status scores and clique identification. J. Math. Sociol. 2, 113–120 (1972).
Article Google Scholar
Freeman, L. C. A set of measures of centrality based on betweenness. Sociometry 40, 35–41 (1977).
Article Google Scholar
Freeman, L. C. Centrality in social networks conceptual clarification. Soc. Networks 1, 215–239 (1979).
Article Google Scholar
Jeong, H., Mason, S. P., Barabási, A. -L. & Oltvai, Z. N. Lethality and centrality in protein networks. Nature 411, 41–42 (2001).
Article ADS CAS Google Scholar
Guimerà, R., Díaz-Guilera, A., Vega-Redondo, F., Cabrales, A. & Arenas, A. Optimal network topologies for local search with congestion. Phys. Rev. Lett. 89, 248701 (2002).
Article ADS Google Scholar
Guimerà, R., Mossa, S., Turtschi, A. & Amaral, L. N. The worldwide air transportation network: anomalous centrality, community structure, and cities' global roles. Proc. Natl. Acad. Sci. 102, 7794–7799 (2005).
Article ADS MathSciNet Google Scholar
Kitsak, M. et al. Identification of influential spreaders in complex networks. Nat. Phys. 6, 888–893 (2010).
Article CAS Google Scholar
Liu, Y. -Y., Slotine, J. -J. & Barabási, A. -L. Controllability of complex networks. Nature 473, 167–173 (2011).
Article ADS CAS Google Scholar
Simonsen, I., Buzna, L., Peters, K., Bornholdt, S. & Helbing, D. Transient dynamics increasing network vulnerability to cascading failures. Phys. Rev. Lett. 100, 218701 (2008).
Article ADS Google Scholar
Cornelius, S. P., Kath, W. L. & Motter, A. E. Realistic control of network dynamics. Nat. Commun. 4, 1942 (2013).
Article ADS Google Scholar
Brin, S. & Page, L. In Proceedings of the Seventh International Conference on World Wide Web 7, 107–117Elsevier Science Publishers B. V. (1998).
Buldyrev, S. V., Parshani, R., Paul, G., Stanley, H. E. & Havlin, S. Catastrophic cascade of failures in interdependent networks. Nature 464, 1025–1028 (2010).
Article ADS CAS Google Scholar
De Domenico, M. et al. Mathematical formulation of multi-layer networks. Phys. Rev. X 3, 041022 (2013).
Google Scholar
De Domenico, M., Solé-Ribalta, A., Gómez, S. & Arenas, A. Navigability of interconnected networks under random failures. Proc. Natl. Acad. Sci. 111, 8351–8356 (2014).
Article ADS MathSciNet CAS Google Scholar
Kivelä, M. et al. Multilayer networks. J. Complex Networks 2, 203–271 (2014).
Article Google Scholar
Boccaletti, S. et al. The structure and dynamics of multilayer networks. Phys. Rep. 544, 1–122 (2014).
Article ADS MathSciNet Google Scholar
Mucha, P. J., Richardson, T., Macon, K., Porter, M. A. & Onnela, J. -P. Community structure in time-dependent, multiscale, and multiplex networks. Science 328, 876–878 (2010).
Article ADS MathSciNet CAS Google Scholar
Gómez, S. et al. Diffusion dynamics on multiplex networks. Phys. Rev. Lett. 110, 028701 (2013).
Article ADS Google Scholar
Radicchi, F. & Arenas, A. Abrupt transition in the structural formation of interconnected networks. Nat. Phys. 9, 717–720 (2013).
Article CAS Google Scholar
Kolda, T. G., Bader, B. W. & Kenny, J. P. In 2013 IEEE 13th International Conference on Data Mining 242–249IEEE Computer Society (2005).
Solá, L. et al. Eigenvector centrality of nodes in multiplex networks. Chaos 3, 033131 (2013).
Article ADS Google Scholar
Halu, A., Mondragon, R. J., Panzarasa, P. & Bianconi, G. Multiplex pagerank. PLoS ONE 8, e78293 (2013).
Article ADS CAS Google Scholar
Solé-Ribalta, A., De Domenico, M., Gómez, S. & Arenas, A. In Proceedings of the 2014 ACM Conference on Web Science 149–155ACM (2014).
Battiston, F., Nicosia, V. & Latora, V. Structural measures for multiplex networks. Phy. Rev. E 89, 032804 (2014).
Article ADS Google Scholar
Cardillo, A. et al. Emergence of network features from multiplexity. Sci. Rep. 3, 1344 (2013).
Article CAS Google Scholar
Fleurquin, P., Ramasco, J. J. & Eguiluz, V. M. Systemic delay propagation in the US airport network. Sci. Rep. 3, 1159 (2013).
Article ADS CAS Google Scholar
Maki, D. P. & Thompson, M. Mathematical models and applications Prentice-Hall (1973).

Download references

Acknowledgements

A.A., M.D.D., S.G. and A.S. were supported by the European Commission FET-Proactive project PLEXMATH (grant number 317614) and the Generalitat de Catalunya 2009-SGR-838. A.A. also acknowledges financial support from the ICREA Academia and the James S. McDonnell Foundation. S.G. and A.A. were supported by FIS2012-38266. E.O. is supported by DIM 2011–Région Île-de-France.

Author information

Authors and Affiliations

Departament d’Enginyeria Informàtica i Matemàtiques, Universitat Rovira i Virgili, Tarragona, 43007, Spain
Manlio De Domenico, Albert Solé-Ribalta, Elisa Omodei, Sergio Gómez & Alex Arenas
LaTTiCe-CNRS, École Normale Supérieure, Paris, 75005, France
Elisa Omodei
Institut des Systèmes Complexes - Paris Île-de-France (ISC-PIF), Paris, 75013, France
Elisa Omodei
IPHES, Institut Català de Paleoecologia Humana i Evolució Social, Tarragona, 43007, Spain
Alex Arenas

Authors

Manlio De Domenico
View author publications
You can also search for this author in PubMed Google Scholar
Albert Solé-Ribalta
View author publications
You can also search for this author in PubMed Google Scholar
Elisa Omodei
View author publications
You can also search for this author in PubMed Google Scholar
Sergio Gómez
View author publications
You can also search for this author in PubMed Google Scholar
Alex Arenas
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.A., M.D.D., A.S. and S.G. designed the study. M.D.D., E.O. and A.S. performed experiments. A.A. and M.D.D. wrote the paper. All the authors contributed equally to the revision of the manuscript.

Corresponding authors

Correspondence to Manlio De Domenico or Alex Arenas.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures 1-6, Supplementary Tables 1-17, Supplementary Notes 1-7 and Supplementary References (PDF 1968 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

De Domenico, M., Solé-Ribalta, A., Omodei, E. et al. Ranking in interconnected multilayer networks reveals versatile nodes. Nat Commun 6, 6868 (2015). https://doi.org/10.1038/ncomms7868

Download citation

Received: 17 October 2014
Accepted: 07 March 2015
Published: 23 April 2015
DOI: https://doi.org/10.1038/ncomms7868

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.