Introduction

Social preferences contribute to the formation of social networks1,2. In particular, preferences proportional to the degree of a node (i.e., the number of links connected to a node), referred to as preferential attachment, simulate the scale-free property of the degree distribution in large-scale evolving networks and serves as a key concept for understanding their formation and the phenomena that occur on them3,4,5,6,7,8. While local mechanisms (e.g., link selection9, copying10,11, or optimization models12,13,14) sometimes explain the generation of a preference proportional to the degree of a node, a naive human perception of this preference is that of “a person with many friends”.

However, since resources on a network propagate and have an influence beyond direct human relationships, preferences for others are not limited to direct neighbours. In fact, it has been reported that obesity, smoking, and happiness are influenced not only by direct friends but also by friends of friends and even friends of friends of friends15,16,17. According to well-known social theories, weak tie theory and structural hole, which represent the importance of infrequent and non-redundant relationships (i.e. connections to different clusters), suggest that human relationships beyond direct neighbours are important channels for acquiring novel information and that individuals who occupy such structural positions within a network have an advantage in such achievements as employment or promotion18,19,20. In addition, the success of the PageRank algorithm shows that people prefer information on web pages linked by web pages with high importance, rather than those simply linked by a large number of links, which suggests the importance of indirect sources in preferences21,22.

Several studies have shown that preferential attachment associated with indirect links beyond direct neighbours links (the sum of the number of the links that neighbours have23, the number of links within a second degree of separation24, and the betweenness centrality25) in an evolving network forms the heavy-tailed degree distribution, the mechanisms of which can complement conventional degree-based preferential attachment through comparison with the structural and temporal characteristics of real networks. Other kinds of models, which are indirectly associated to the preference based on indirect links as links with new nodes are formed through neighbouring nodes, have also been explored26,27,28,29. However few models are available that investigate theoretically the structural characteristics of networks that social preferences based on indirect links form, and little is known about the social effects of such network structures.

In this study, we investigate the mechanism of structural formation of an evolving network that includes indirect resources as social preferences and its expected social effects. For this purpose, we propose a new mathematical model of evolving networks with a preferential attachment based on utility. Utility is the physical, economic, and psychological effect obtained from various resources, and is usually described as a function consisting of terms of costs and benefits30,31. And, from the perspective of network, the utility can be obtained from nodes reachable on the network. We introduce a utility function that considers utility not only from directly connected nodes but also from indirectly connected nodes beyond the directly connected nodes32,33,34,35. In this utility function, the benefit term of a node that can be obtained from other nodes is designed according to the degree of separation from that node, and, in the case of direct connections, the cost term is designed additionally for the establishment and maintenance of the relationship (see Method for details).

In what follows, we first present our analysis of the temporal networks that a utility-based preferential attachment model evolves under various conditions by adjusting the cost and benefit terms of a utility function. From a macroscopic perspective, our analysis takes into account the structural characteristics of a network and the growth of utility in the entire network; microscopically, we take into account the growth patterns of the direct and indirect links around an individual node through mathematical analysis and numerical simulations. Furthermore, our investigation of growth patterns of direct and indirect links around an individual node in real networks, especially social networks, confirms that the growth approximates the estimation of the preferential attachment model based on indirect utility.

Our results suggest that the utility-based preferential attachment model can provide a foundation for understanding the structural and functional evolution of large-scale social networks. In particular, we show that preferential attachment based on indirect utility stably forms a converged and hierarchical structure. This stability provides robust evidence of emerging mutual growth among adjacent nodes in social networks.

Results

Illustrative explanation of the utility-based preferential attachment model

Our analysis of the utility-based preferential attachment model takes into consideration the utility within the 2nd degree of separation. The illustration in Fig. 1 serves to explain the preferential attachment of this model. A new node \(j\) connects to an existing node \(i\) with the preferential attachment probability \({\Pi }_{i}^{[2]}\left({t}_{j}\right)\) at time \({t}_{j}(>{t}_{i})\). \({\Pi }_{i}^{[2]}\left(t\right)\) is proportional to the utility \({u}_{i}^{[2]}\left(t\right)={k}_{i}^{\left[1\right]}\left(t\right)({b}_{1}-c)+{k}_{i}^{\left[2\right]}\left(t\right){b}_{2}\), where \({k}_{i}^{\left[l\right]}\left(t\right) (l=\mathrm{1,2})\) is the number of links at the \(l\)th degree of separation from node \(i\), \({b}_{l}(l=\mathrm{1,2})\) is a nonnegative-valued function representing the benefit of node \(i\) obtained from a node within the \(l\)th degree of separation, \(c\) is a positive-valued parameter representing the cost of node \(i\) to connect with a node within the \(1\)st degree of separation. Notably, node \(i\) incurs no cost for establishing and maintaining indirect links.

Figure 1
figure 1

Preferential attachment of the utility-based preferential attachment model. A new node \(j\) connects to an existing node \(i\) with the preferential attachment probability \({\Pi }_{i}^{[2]}\left({t}_{j}\right)\) at time \({t}_{j}(>{t}_{i})\). \({\Pi }_{i}^{[2]}\left(t\right)\) is proportional to the utility \({u}_{i}^{[2]}\left(t\right)={k}_{i}^{\left[1\right]}\left(t\right)({b}_{1}-c)+{k}_{i}^{\left[2\right]}\left(t\right){b}_{2}\), where \({k}_{i}^{\left[l\right]}\left(t\right) (l=\mathrm{1,2})\) is the number of direct links with the \(l\)th degree of separation from a node \(i\), \({b}_{l}(l=\mathrm{1,2})\) and \(c\) are parameters which associate the weight of benefits and costs, respectively (see the “Methods” section). In this example, node \(i\) is directly connected to nodes \({i}_{1}\) and \({i}_{2}\), i.e., \({k}_{i}^{\left[1\right]}\left({t}_{j}\right)=2\). Therefore, node \(i\) must pay \(2c\) as the cost for the direct links to obtain \(2{b}_{1}\) from nodes \({i}_{1}\) and \({i}_{2}\) as the direct benefit. Thus, the direct utility is described as \(2({b}_{1}-c)\). Also, node \(i\) obtains \(4{b}_{2}\) as the indirect benefit because it connects indirectly to four nodes, \({i}_{{1}_{1}}\), \({i}_{{1}_{2}}\), \({i}_{{2}_{1}}\), and \({i}_{{2}_{2}}\), i.e., \({k}_{i}^{\left[2\right]}\left({t}_{j}\right)=4\). Thus, the indirect utility becomes \(4{b}_{2}\).

To investigate the effect of indirect utility on the structure and growth of evolving networks, we compared four special cases of the utility-based preferential attachment model, models UD, UI, UM+, UM- with a random-attachment model, UR. In every model, a conventional growth rule is used such that a new node joins at each step and makes connections with an existing node with a specific preferential attachment probability. The UD model takes into consideration only the direct utility, i.e., \({\Pi }_{i}^{[2]}\left(t\right)\propto {k}_{i}^{[1]}(t)\). The UI model takes into consideration only the indirect utility, i.e., \({\Pi }_{i}^{[2]}\left(t\right)\propto {k}_{i}^{[2]}(t)\). The UM+ and UM- models take into consideration both the direct and indirect utilities. The UM+ model represents a case in which the direct benefit exceeds the cost (\({b}_{1}=c+1, {b}_{2}=1\)), i.e., \({\Pi }_{i}^{[2]}\left(t\right)\propto {k}_{i}^{\left[1\right]}\left(t\right)+{k}_{i}^{\left[2\right]}\left(t\right)\), whereas the UM- model represents a case in which the cost exceeds the direct benefit (\({b}_{1}=c-1, {b}_{2}=1\)), i.e., \({\Pi }_{i}^{[2]}\left(t\right)\propto {-k}_{i}^{\left[1\right]}\left(t\right)+{k}_{i}^{\left[2\right]}\left(t\right)\). The attachment probability of the UR model is the same for all of the existing nodes in the network.

The effect of indirect utility on the overall structure and growth of evolving networks

Figure 2 shows the effect of indirect utility on the structure of networks formed by the UR, UD, UI, UM+, and UM- models. Figure 2a illustrates the networks formed by the UR, UD, and UI models at around \(t=5000\). A node with more direct links than the surrounding nodes, hereafter referred to as a “local hub”, is marked in red, and other nodes are marked in blue as the distance from a local hub increases. The distance is defined as the number of steps taken to find a local hub by recursively searching the neighbour with the most direct links. In the network formed by the UR model, all of the nodes grow evenly, while, in the networks formed by each of the UD and UI models, the growth of a hub with a large number of direct links is conspicuous, resulting in a heavy-tailed degree distribution. In particular, the network formed by the UI model has a converged and hierarchical structure in which many nodes grow sequentially around a very large local hub.

Figure 2
figure 2

Effect of indirect utility on the structure of the networks formed by the utility-based preferential attachment model. (a) An illustration of a network formed by the respective UR, UD, and UI models at \(t=5000\). The red colour represents the local hub, which is a node with \({k}_{i}^{\left[1\right]}\) greater than the surrounding nodes, and it is displayed in blue gradually according to the distance from the nearest local hub. In UI model, a converged and hierarchical structure forms in which many nodes grow sequentially around a very large local hub. (b,c) The distribution \(P(k^{[1]})\) and the growth of the ratio \(\overline{{k}^{\left[2\right]}}(t)(\equiv {\sum }_{{{j}}=1}^{{{t}}}{{{k}}}_{{{j}}}^{\left[2\right]}({{t}})/{\sum }_{{{j}}=1}^{{{t}}}{{{k}}}_{{{j}}}^{\left[1\right]}({{t}}))\) in the utility-based preferential attachment models. The solid line represents the average of 20 numerical simulations (\(t=100,000\)), and the error bar denotes the standard deviation.

The degree distribution, \(P({k}^{[1]})\) of UR (blue line in Fig. 2b) and UD (orange line in Fig. 2b) shows the exponential and the scale-free property, respectively, which is well known as characteristics of the random attachment model and the degree-based preferential attachment model in evolving networks3. Meanwhile, the UI, UM+, and UM- models have a bump property in the degree distribution (green, red, and purple plots). These models commonly have an indirect utility in preference, and it is known that including 2nd degree of separation in preference result in time dependent degree distributions, and gradual collapse of its scaling24. As shown in the UI model of Fig. 2a, the bump property of the degree distribution is associated with the converged and hierarchical structure, and it can be seen that it is a common structure formed by indirect utility.

The growth of the indirect links as shown in Fig. 2c also confirms the difference between the models. Here, \(\overline{{k }^{\left[2\right]}}\left(t\right)\) is defined as \(\overline{{k }^{\left[2\right]}}\left(t\right)\equiv {\sum }_{j=1}^{t}{k}_{j}^{\left[2\right]}\left(t\right)/{\sum }_{j=1}^{t}{k}_{j}^{\left[1\right]}\left(t\right)\). Because \({\sum }_{j=1}^{t}{k}_{j}^{\left[1\right]}\left(t\right)\) can be calculated as \({\sum }_{j=1}^{t}{k}_{j}^{\left[1\right]}\left(t\right)\approx 2t\) in each model, the difference among these models results from \({\sum }_{j=1}^{t}{k}_{j}^{\left[2\right]}\left(t\right)\), which represents indirect benefits among overall network. As Fig. 2c shows, the growth pattern of \(\overline{{k }^{\left[2\right]}}\left(t\right)\) for each model is distinct. The UR model shows that the growth pattern of \(\overline{{k }^{\left[2\right]}}\left(t\right)\) asymptotically converges to a constant, and the UD model shows that the growth pattern of \(\overline{{k }^{\left[2\right]}}\left(t\right)\) is asymptotically proportional to \(\mathrm{ln}\sqrt{{\sum }_{j=1}^{t}{k}_{j}^{\left[1\right]}\left(t\right)}\)(\(\approx \mathrm{ln}\sqrt{2t}\)) (see Supplementary Information S1 for mathematical analysis). \(\mathrm{The growth of }\overline{{k }^{\left[2\right]}}\left(t\right)\) in the UI, UM+, and UM- models is lower than that in the UD model from the early stage until around \(t=\mathrm{5,000}\). However, since \(\overline{{k }^{\left[2\right]}}\left(t\right)\) in these models afterwards grows faster, \(\overline{{k }^{\left[2\right]}}\left(t\right)\) becomes larger than in the UD model.

The UM-, UI, and UM+ models have the same indirect utility coefficient as 1, but the direct utility coefficients differ as − 1, 0, and 1 (see Eqs. (S29), (S33), and (S37)). Interestingly, the value of \(\overline{{k }^{\left[2\right]}}\left(t\right)\) increases in the order of the UM-, UI, and UM+ models as time passes. These differences results from the balance between direct utility and indirect utility in preference. And, it is counterintuitive that higher \(\overline{{k }^{\left[2\right]}}\) is observed in the UM- model which penalizes the number of direct links \({k}_{i}^{\left[1\right]}\), since attaching to a node with a high \({k}_{i}^{\left[1\right]}\) is advantageous for the immediate growth of \(\overline{{k }^{\left[2\right]}}\), which means \(\Delta \overline{{k }^{\left[2\right]}}=2{k}_{i}^{\left[1\right]}\) where \(i\) denotes the node attached by a new node. Thus, the structure formed by the preference for \({k}_{i}^{\left[2\right]}\) contributes more to the growth of \(\overline{{k }^{\left[2\right]}}\) than the preference for \({k}_{i}^{\left[1\right]}\) as the network grows up.

The effect of the indirect utility on the growth of the respective nodes in evolving networks

In any case, to understand the rapid growth of \(\overline{{k }^{\left[2\right]}}\left(t\right)\) in the UI, UM+, and UM- models at a late stage, it is necessary to consider the growth mechanism of the network formed by the indirect utility that the three models have in common. Figure 3a shows a possible growth mechanism of the network formed by the UI model, in which the preferential attachment probability is proportional to the indirect utility. As can be seen on the left, when new node \(j\) attaches to node \(i\) in the existing network, \({k}_{i}^{\left[1\right]}\) increases by one in what is hereafter referred to as “direct growth” from a perspective of node \(i\). At the same time, each \({k}_{{i}_{n}}^{\left[2\right]}\) of existing nodes \({i}_{n}\) adjacent to node \(i\) increases by one. This increases the preference of nodes \({i}_{n}\), since \({\Pi }_{{i}_{n}}^{[2]}\) is proportional to \({k}_{{i}_{n}}^{\left[2\right]}\) in the UI model. The figure on the right explains the case where a new node \(j\) attaches to \({i}_{3}\) which is one of the nodes \({i}_{n}\). This attachment corresponds to “indirect growth” from the point of view of node \(i\) and increases \({k}_{{i}_{3}}^{\left[1\right]}\) and \({k}_{i}^{\left[2\right]}\) by one, and increases the preference of node \(i\). In this way, a specific node \(i\) and adjacent nodes \({i}_{n}\) grow mutually by alternating direct growth and indirect growth. Such growth is hereafter referred to as a “mutual growth”.

Figure 3
figure 3

A growth mechanism of the UI model and the relationship between direct and indirect growth in a network formed by the UR, UD, and UI models. (a) A growth mechanism for the networks formed by the UI model. If new node \(j\) attached to \(i\). When a new node \(j\) is attached to node \(i\), \({k}_{i}^{\left[1\right]}\) and \({k}_{{i}_{n}}^{\left[2\right]}\) increase, which is reflected in the indirect utility of node \({i}_{n}\), increasing the preference for node \({i}_{n}\). Similarly, When a new node \(j\) is attached to node \({i}_{n}\), \({k}_{{i}_{n}}^{\left[1\right]}\) and \({k}_{i}^{\left[2\right]}\) increase, which is reflected in the indirect utility of node \(i\), increasing the preference for node \(i\). Therefore, a node \(i\) and neighbouring nodes \({i}_{n}\) mutually grow through increase each other's preference, and the relationship is established as \({k}_{i}^{\left[2\right]}(t)\propto {({k}_{i}^{\left[1\right]}(t))}^{1.5}\) (see Supplementary Information S2). (b) A set of heat maps between \(\Delta {k}_{i}^{\left[1\right]}(t)\) and \(\Delta {k}_{i}^{\left[2\right]}(t)\), each of which shows the distribution of the direct and indirect growth of nodes in an evolving network simulated numerically by the UR, UD, and UI models. The horizontal and vertical arrows denote the orientation of the direct and indirect growth. The number of nodes in each area in the heat map is the average value of 20 numerical simulations (\(t=\mathrm{100,000}\)). The dashed lines are estimates based on mathematical analysis (see Supplementary Information S2). Areas with no growth in direct and indirect growth are excluded for convenience on log scale.

Figure 3b shows the patterns of direct growth and indirect growth of each node in an evolving network numerically simulated by the UR, UD, and UI models, respectively. Direct growth and indirect growth are set to \(\Delta {k}_{i}^{\left[1\right]}(t)\equiv {k}_{i}^{\left[1\right]}(t)-{{k}_{i}^{\left[1\right]}}_{int}\) and \(\Delta {k}_{i}^{\left[2\right]}(t)\equiv {k}_{i}^{\left[2\right]}(t)-{{k}_{i}^{\left[2\right]}}_{int}\) to eliminate the initial value dependency that occurs when node \(i\) first enters the network, and the colour of the heat map indicates the number of nodes in that region. As Fig. 3b indicates, the UI model shows the most positive correlation between \({k}_{i}^{\left[1\right]}(t)\) and \({k}_{i}^{\left[2\right]}(t)\) in these models. In the UR model direct and indirect growth of nodes are relatively insignificant, and In the UD model, high indirect growth is observed in nodes with high direct growth, but no clear correlation across nodes as a whole can be seen. Our mathematical analysis in Supplementary Information S2 demonstrates that the scale of \({k}_{i}^{\left[2\right]}(t)\) of nodes grown considerably in the UR, UD, and UI models is approximately \({({k}_{i}^{\left[1\right]}(t))}^{2}\), \({k}_{i}^{\left[1\right]}(t)\mathrm{log}{k}_{i}^{\left[1\right]}(t)\), and \({({k}_{i}^{\left[1\right]}\left(t\right))}^{1.5}\), respectively.

In particular, in the UI model, two patterns of growth mechanism appear according to the direction of mutual growth (see Supplementary Information S2 for details). A node in which mutual growth of \({k}_{i}^{\left[1\right]}\left(t\right)\) and \({k}_{i}^{\left[2\right]}\left(t\right)\) dominates the growth mechanism leads the growth of outer areas (\({k}_{i}^{\left[l\right]}\left(t\right)\) (\(l>2\))) and becomes the centre of growth. This growth is hereafter referred to as “active growth”. However, if the influence by the outer area \({k}_{i}^{\left[3\right]}\left(t\right)\) is dominant in the growth mechanism, mutual growth occurs in the opposite direction to the above, where \({k}_{i}^{\left[3\right]}\left(t\right)\) leads to the growth of \({k}_{i}^{\left[2\right]}\left(t\right)\) and then \({k}_{i}^{\left[2\right]}\left(t\right)\) leads to the growth of \({k}_{i}^{\left[1\right]}\left(t\right)\). This growth is hereafter referred to as “passive growth”. Therefore, excluding external influences \({k}_{i}^{\left[l\right]}\left(t\right)\) (\(l>2\)), that is, considering only the growth of \({k}_{i}^{\left[1\right]}\left(t\right)\) and \({k}_{i}^{\left[2\right]}\left(t\right)\) of node \(i\), the indirect preferential attachment leads a scaling exponent of 1.5 (i.e. \({k}_{i}^{\left[2\right]}\left(t\right)\sim {({k}_{i}^{\left[1\right]}\left(t\right))}^{1.5}\)). Thus, mutual growth by the indirect preferential attachment is ideally expected to have a scaling exponent of 1.5, but a scaling exponent greater than 1.5 appear in the case of passive growth, which is attracted to external growth (the UI model in Fig. 3b). These distinct growth patterns, active growth and passive growth explain the nodes that are the centre of growth and the nodes that propagate the growth in the network structure, respectively, and explain how the local mechanism of mutual growth between direct growth and indirect growth (Fig. 3a) leads to the emergence of a converged and hierarchical structure that the growth of nodes propagate outward from the centre (UI model in Fig. 2a).

The effect of indirect utility on the structure and growth of nodes in real social networks

The mathematical analysis and numerical simulations just described indicate that the mutual growth between direct growth and indirect growth with a scaling exponent of 1.5 in the networks formed by the preferential attachment based on indirect utility. However, observing the mutual growth in real networks and interpreting it as growth by indirect utilities can be challenging, since even from a perspective of the utility-based preferential attachment model, the benefit and cost coefficients can vary depending on the environment in which the network is formed, and also there can be nodes with various preferences in evolving networks. In this regard, our mathematical analysis indicates that in the utility-based preferential attachment model with a preference for indirect utility (e.g., UM+ and UM- models), a node with active growth forms the scaling exponent of 1.5, regardless of the benefit and cost coefficients (see Supplementary Information S3). Further, we confirm the dominant effect of indirect utility in a network grown by sequentially attaching nodes with preferential attachments of UR, UD, and UI models, and also confirm the scaling exponent of 1.5 in sufficiently grown nodes (see Supplementary Information S4). These robust results support the assumption that there is a preference based on indirect utility in the evolving mechanism of the network when the scale of mutual growth in sufficiently grown nodes is close to 1.5, from the point of view of the utility-based preferential attachment model.

To confirm the existence of a preference based on indirect utility in social networks, we investigated the scaling exponent of the indirect growth \({\Delta k}_{i}^{\left[2\right]}\left(t\right)\) of each node with respect to the direct growth \({\Delta k}_{i}^{\left[1\right]}\left(t\right)\) in two real temporal social networks, YouTube and Facebook, which was well observed over time36,37. The data on the YouTube network records the friendships that grew for 165 days, and the data on the Facebook network records the comment activities on wall pages of New Orleans users for 850 days.

Figure 4a,b show the results of tracking \(\Delta {k}_{i}^{\left[1\right]}\left(t\right)\) and \({\Delta k}_{i}^{\left[2\right]}\left(t\right)\) of each node in the YouTube and Facebook networks, respectively. As confirmed in the numerical simulation and mathematical analysis, the 1.5 scale due to mutual growth is evident in the sufficiently grown nodes of active growth. Therefore, we coloured the top 50 nodes (blue) in order of high \(\Delta {k}_{i}^{\left[1\right]}\), and local hubs (red) with high \(\Delta {k}_{i}^{\left[1\right]}\) compared to neighbouring nodes that are likely to be in an active growth state. The dashed lines represent the growth of the nodes with scaling exponents of 1.5 in the relationship between \(\Delta {k}_{i}^{\left[1\right]}\left(t\right)\) and \({\Delta k}_{i}^{\left[2\right]}\left(t\right)\). In the YouTube network, the scaling exponent of the local hubs has a value lower than 1.5 while, in the Facebook network, the value is close to 1.5. The top 50 nodes in each social network show a trend similar to that of the scaling exponent for the local hubs. This result indicates the possibility that some indirect utility relates deeply to the major mechanism of the growth of the nodes in the Facebook network whereas other mechanisms may explain the growth of nodes in the YouTube network.

Figure 4
figure 4

Relationship between the direct and indirect growth of nodes in real temporal social networks. (a,b) The results of tracking the direct growth, \(\Delta {k}_{i}^{\left[1\right]}\left(t\right)\), and indirect growth, \({\Delta k}_{i}^{\left[2\right]}\left(t\right)\), of each node in the YouTube and Facebook networks. Local hubs (red) are nodes with \({k}_{i}^{\left[1\right]}\) larger than the neighbouring nodes in the final state, and Top50 (blue) corresponds to 50 nodes in the order of higher \({k}_{i}^{\left[1\right]}\) in the final state. The dashed line represents the relationship \({k}_{i}^{\left[2\right]}(t)\propto {({k}_{i}^{\left[1\right]}(t))}^{1.5}\) which is the theoretical estimate of the mutual growth mechanism (see Supplementary Information S2). (c,d) Heat maps in the relationship between \(\Delta {k}_{i}^{\left[1\right]}(t)\) and \(\Delta {k}_{i}^{\left[2\right]}(t)\) which shows the distribution of the direct and indirect growth of nodes in the YouTube and Facebook networks in the final state (same with Fig. 3b).

Figure 4c,d present heat maps in the relationship between \(\Delta {k}_{i}^{\left[1\right]}\) and \(\Delta {k}_{i}^{\left[2\right]}\) of each node entering the YouTube and Facebook networks, respectively. Figure 4d clearly shows that the indirect links in the Facebook network grow with a scaling exponent of 1.5, a result similar to that of the UI model (Fig. 3b). However, no sign of such growth is observable in the YouTube network shown in Fig. 4c. This result suggests that there are mutual growths between the direct and indirect growths in the Facebook network.

Consideration of the meaning of the links in each network is necessary to interpret these results. The friendship links observed on the YouTube network require the mutual consent of each user. However, since this relationship is not publicly revealed to others, its indirect utility may be difficult to observe. In the Facebook network, by contrast, the comments on a user’s wall are publicly exposed to others. Therefore, posting a comment on the wall naturally increases a user’s exposure to others and the probability that they will visit and comment on the wall. The formation of links triggered by this indirect exposure can be seen as preferential attachment based on indirect utility and suggests the possibility that the direct and indirect growth of nodes affect each other mutually.

Next, we consider the scaling exponent of \(\Delta {k}_{i}^{\left[2\right]}\) of each node with respect to \(\Delta {k}_{i}^{\left[1\right]}\) in real static social networks using various categories of datasets. Unlike temporal networks, static networks cannot directly observe the scaling exponent of \(\Delta {k}_{i}^{\left[2\right]}\) of each node with respect to \(\Delta {k}_{i}^{\left[1\right]}\) over time, but we roughly approximate the growth scale using the fact that, as Figs. 3b and 4 show, the influence of the scale exponent becomes dominant in the local hubs. Specifically, \(\Delta {k}_{i}^{\left[2\right]}\propto {(\Delta {k}_{i}^{\left[1\right]})}^{\alpha }\) is set estimated roughly as \({k}_{i}^{\left[2\right]}={({k}_{i}^{\left[1\right]})}^{\alpha }\) for the local hubs satisfying \({k}_{i}^{\left[1\right]}>50\). From the Stanford large network dataset collection, we select 34 undirected and unweighted networks, the categories of which are classified as autonomous systems (3 networks), collaboration networks (5 networks), social networks (23 networks), and Wikipedia (3 networks)38. The selection criteria targeted categories grouped into the same type, and only cases where the category contained three or more undirected and unweighted network samples were selected (see “Methods” for details).

Figure 5a shows the averaged value of \(\alpha\) calculated from the local hubs of each of the various networks. The social networks in social media (deezer, facebook, physical location-based online social media, Last.fm), and collaboration networks between researchers show a large value of \(\alpha\) while the networks of Wikipedia, autonomous systems, and some social networks (github, twitch) show a small value of \(\alpha\). We are unable to completely explain the complex growth mechanism of these real networks, but the preferential attachment based on indirect utility provides a partial explanation for values of \(\alpha\) ranging from 1 to 1.5. In the networks in which social interactions take place, as described above, local hubs have a value of \(\alpha\) close to 1.5, indicating that the local hubs in the social networks grow not alone but together with the surrounding nodes and suggesting that preferential attachment based on indirect utility may have contributed to this growth.

Figure 5
figure 5

Averaged scaling exponent of the indirect growth of local hubs with respect to the direct growth and the relationship among the scaling exponent, cluster coefficient, and degree assortativity coefficient in real static networks. (a) The average scaling exponent \(\alpha\) calculated from the local hubs in each of 34 undirected and unweighted networks for which the categories are classified as autonomous systems (3 networks), collaboration networks (5 networks), social networks (23 networks), and Wikipedia (3 networks). (b–d) The correlation between scaling exponent \(\alpha\) and the degree assortativity coefficient \(r\), between the cluster coefficient \(C\) and \(r\), and between \(\alpha\) and \(C\). The grey line represents the correlation of the entire network, and the blue line represents the correlation except for networks with exceptionally large values of \(r\) and \(C\). The value in the legend is a Pearson’s correlation coefficient. The colour of the dot is the same as the categories identified in (a).

Further, we investigated the relationship between the scaling exponent \(\alpha\) and two representative indicators of the network structure, the “assortativity coefficient” and “clustering coefficient”. The assortativity coefficient \(r\) indicates a correlation between the degrees of a node and the neighbouring nodes across the network39. The clustering coefficient \(C\) indicates the average of the ratio of closed triplets among the possible triplets for each node in the whole network. Previous studies indicate that there is a correlation between \(r\) and \(C\)40,41. As Fig. 5c shows, there is a strong correlation between \(r\) and \(C\), with a Pearson’s correlation coefficient of 0.77. However, after exclusion of the three networks with exceptionally large values of \(r\), \(C\) decreases to 0.36. By contrast, as Fig. 5b clearly shows, with the exclusion of the three networks, the Pearson’s correlation coefficient between \(r\) and \(\alpha\) increases from 0.44 to 0.82. The three networks taking very large values of \(r\) and \(C\) suggest that the clustering of the nodes around the hubs is reflected in the large values of \(r\). Meanwhile, as Fig. 5d shows, there is no significant relationship between \(\alpha\) and \(C\).

It is well known that the relationship between \(r\) and \(C\) appears as a positive correlation in social networks39. And, it has been suggested that community structure and clustering can have a positive effect on degree assortativity40,41,42. However, the results reported above indicate that the effect of \(\alpha\) can explain the positivity of \(r\) in a different, even better direction from the effect of \(C\). Accordingly, the mutual growth pattern by the preferential attachment based on indirect utility may be an important growth mechanism of the positive degree assortativity in social networks.

Discussion

Preferential attachment has become an important mechanism for understanding the evolution of social networks3,4,5,6,7,8. However, compared to the interest in the mechanism by which such a preference can occur in a physical sense9,10,11,12,13,14, there has been a lack of interest in a framework that deals with preferences in a cognitive sense, especially regarding indirect utility15,16,17,18,19,20,21,22. In this study, we have introduced the utility-based preferential attachment model by focusing on social preference in human cognition, investigating in particular the effect of the preference attachment based on indirect utility on the structure and growth of evolving networks. Our numerical simulations and mathematical analysis demonstrated that, in evolving networks, converged and hierarchical structures where the growths of nodes propagate outward from the centre are stably emerged through the preferential attachment based on indirect utility. In addition, we show that this growth mechanism promotes the mutual growth of direct growth and indirect growth with a scaling exponent of 1.5 from a perspective of each node, and observe the mutual growth patterns in real networks, especially social networks.

The main finding of our numerical simulations and mathematical analyses is the converged and hierarchical structures (see Fig. 2a) formed by the influx of nodes that prefer nodes with high indirect utility. This structure is a robust outcome confirmed not only in a model taking into consideration only indirect utility (the UI model), but also in models taking into consideration both the direct and indirect utilities (e.g., the UM+ and UM- models\(,\) see Supplementary Information S3), and a model in which nodes with different preferences are mixed (see Supplementary Information S4). Our work reveals how preferential attachment based on indirect utility forms a microscopic mutual growth mechanism (see Fig. 3a), and how this mechanism differentiates into active growth and passive growth, and leads to the emergence of the converged and hierarchical structure. Therefore, in a society where there is a continuous influx of people who prefer indirect utility, this structure can emerge deterministically.

In the utility-based preferential attachment model, utility serves not only as to preferences for the attachment to nodes in the network, but also as the utility that each node can obtain from the network. In an evolving network, the total indirect utility increases in proportion to the number of direct links of the node attached by a new node. Thus, the preference proportional to the number of direct links explicitly means the preference for increasing indirect utility. However, despite the absence of an explicit relationship to increasing indirect utility, the preferential attachment based on indirect utility rapidly increases the indirect utility of the whole network, exceeding the total indirect utility of the network formed by the preferential attachment based on direct utility over time (see Fig. 2c). The rapid growth is an outcome not due to a preference of nodes for indirect utility per se, but due to the mechanism of mutual growth between adjacent nodes in the network based on the preference and the converged and hierarchical structure that emerged from it. This increasing pattern of utility is likely the underlying mechanism by which people increase utility in large-scale evolving networks such as cities43,44, especially considering that the number of direct connections of people can be finite under cognitive constraints45,46.

In the conventional preferential attachment models, the growth of a node depended on the elapsed time after entering the network3 or on a fitness of individual nodes7. Therefore, it is common not to associate preference with the growth of a node, and preference has been understood as an intrinsic property of each node or as physical mechanisms9,10,11,12,13,14. However, in the preferential attachment based on indirect utility, attachment by preference and outcome by attachment are dependent mutually. For example, a node attached to a location near centre area of a converged and hierarchical structure grows faster than nodes that do not. And, if the outcome of attachment by a preference leads to greater growth, a feedback loop that reinforces the preference can form. The analysis in a network model grown by sequentially attaching nodes with preferential attachments of the UR, UD, and UI models also supports that nodes with a preference for indirect utility are more advantageous for the growth of direct link and indirect link than nodes with no preference or nodes with preference for direct utility (see Supplementary Information S4). Therefore, preferences for indirect utility are not just an intrinsic property, but a property that can be stably formed by preferential attachment and feedback by its outcomes. It is meaningful to consider this dependent growth and the possibility of forming indirect preferences as its background in social networks. There is dependent growth in the formation of networks between people. For example, popularity among peers in adolescence has a contagious effect47,48, and research on weak tie theory and structural holes shows that links that expand indirect connections could be a source of growth18,19,20. Under these environmental conditions, people do not simply choose nodes that are random or have a high probability of being connected. People adaptively and strategically evaluate the utility of the nodes with which to form connections, and also pursue their own growth through the choice. Our model shows that this dependent growth mechanism can emerge simply from a preferential attachment based on indirect utility.

The strong correlation between the scaling exponent of indirect growth and degree assortativity (Fig. 5b) opens up new possibilities for positive degree assortativity appeared in evolving social networks40. The conventional interpretations of the origin of degree assortativity have been focused on the clustering effect40,41,42,49, since the more closed the open-triad is, the more clusters with similar degree are formed. However, the degree of a node adjacent to node \(i\) includes not only the links of clusters formed with other nodes adjacent to node \(i\), but also the links formed on outside of the nodes adjacent to node \(i\). The scaling exponent of indirect growth of a node can be described as an indicator independent from the clustering coefficient in that it excludes the link between the adjacent nodes of node \(i\). The fact that this indicator correlates strongly with the degree assortativity indicates that the mutual growth mechanism arising from the preferential attachment based on indirect utility may contributes to the positivity of the degree assortativity, which is an important characteristic of social networks.

One of the major limitations of the utility-based preferential attachment model is that preferential attachment requires computing the utility information of all nodes in the network. The assumption that a new node can explore preferences for all nodes is unrealistic, and conventional preferential attachment models have supported feasibility through models based on local mechanisms9,10,11,12,13,14. For the preferential attachment based on indirect utility, a local mechanism can be devised in which a link is formed after moving randomly to a node with a shortest distance of 2 from a point selected randomly so that effects of indirect utility can be obtained. In this case, the mutual growth mechanism around a node, which is a local growth mechanism, would work identically with the utility-based preferential attachment model, but the size or formation dynamics of the macroscopic structure may be different. For example, our numerical simulations form converged and hierarchical structures around very few dominant active growth nodes, but if a local growth mechanism is applied, the structures centred on more nodes with active growth can be formed throughout the network. Also, preferences for indirect utility can be considered even in connections between existing nodes for realistic extensions. A recent study shows that selection to increase betweenness centrality forms an ultra-small world network through a game theoretical framework50. Preferences for indirect utility are likely to make a similar contribution, as it could be a practical way to increase betweenness centrality. With these extensions, future studies can construct more realistic models. In addition, the definition of the local hub (i.e., a node with higher degree than neighbouring nodes) introduced to estimate the active growth state needs to be refined. Future studies can attempt to increase the observation accuracy of mutual growth through some strategies such as removing nodes with a larger degree among neighbouring nodes, or nodes showing unusual growth patterns. Moreover, the growth mechanism may not always be stable throughout the entire process of network formation. For example, though it is rough, in the case of the YouTube friendship network (see Fig. 4c), the growth pattern of the nodes with prominent growth (Top50) appears to change into a growth pattern with different scaling exponent around \({k}^{\left[1\right]}\approx 100\). In this way, the growth mechanism of real networks may vary depending on the growth scale of nodes, and future studies can consider these heterogeneous growth patterns. Lastly, in this study, we defined the most basic form of utility function to be linearly proportional to the number of directly connected nodes and indirectly connected nodes. However, the model may need to be adjusted depending on the context of the network under investigation. For example, if it is a type of network where the number of human relationships is limited by the cognitive limits of the nodes, such as Dunbar's number45,46, a non-linear function in which utility is saturated as the number of nodes increases can be considered. Depending on the nature of the utility or the perception of the utility, there may be cases where the benefit or cost is fixed and not proportional to the number of links51. Future research could attempt to investigate the variation and robustness of network structural evolution and mutual growth scaling by adjusting the model in different contexts.

In summary, our study showed growth mechanisms and structural features that emerge from the influx of nodes that prefer indirect utility. These results contribute to the understanding of evolving mechanisms of social networks, and the methodologies can be widely applied to the investigation of microscopic and macroscopic growth patterns of evolving networks.

Methods

Utility-based preferential attachment model

We propose as a mathematical model of evolving networks the utility-based preferential attachment model

$$\frac{d}{dt}{k}_{i}^{\left[1\right]}\left(t\right)=m{\Pi }_{i}^{[n]}\left(t\right),$$
(1)

where

$${\Pi }_{i}^{[n]}\left(t\right)=\frac{{u}_{i}^{\left[n\right]}\left(t\right)}{{\sum }_{j=1}^{t}{u}_{j}^{\left[n\right]}\left(t\right)},$$
(2)
$${u}_{i}^{[n]}\left(t\right)={\sum }_{l=1}^{n}{k}_{i}^{\left[l\right]}\left(t\right){b}_{i(l)}-{k}_{i}^{\left[1\right]}\left(t\right){c}_{i}.$$
(3)

Here, \({\Pi }_{i}^{[n]}\left(t\right)\) is the preferential attachment probability normalized as

$${0\le\Pi }_{i}^{[n]}\left(t\right)\le 1 \quad \mathrm{for}\,\, \forall t\left(\ge {t}_{i}\right).$$
(4)

\({u}_{i}^{[n]}\left(t\right)\) is the utility function of node \(i\), which was attached to a node on the existing network at time \({t}_{i}\), at time \(t (\ge {t}_{i})\). \({u}_{i}^{[n]}\left(t\right)\) represents the utility that node \(i\) obtains from all of the nodes within the nth degree of separation on the network. The first term on the right side of this function,

$${b}_{i}^{[n]}\left(t\right)\equiv {\sum }_{l=1}^{n}{k}_{i}^{\left[l\right]}\left(t\right){b}_{i(l)},$$
(5)

denotes the benefit of node \(i\) obtained from all of the nodes within the nth degree of separation from, and the second term,

$${c}_{i}^{\left[1\right]}(t)\equiv {k}_{i}^{\left[1\right]}\left(t\right){c}_{i},$$
(6)

denotes the cost that node \(i\) needs to pay to directly connect to the nodes with the 1st degree of separation. Here, \({c}_{i}\) is a parameter with a positive value representing the cost that node \(i\) needs to pay to connect to a node with the 1st degree of separation from node \(i\). \({k}_{i}^{\left[l\right]}\left(t\right)\) is the number of links within the lth degree of separation from node \(i\), which was attached to a node on the existing network at time \({t}_{i}\), at time \(t (\ge {t}_{i})\). \({b}_{i(l)}\) denotes a function with a positive value representing the benefit of node \(i\) obtained from a node with the lth degree of separation. Also, \(m\) is the number of links attaching a new node to existing nodes.

\({u}_{i}^{[n]}\left(t\right)\) is also rewritten as

$${u}_{i}^{[n]}\left(t\right)={u}_{i}^{\left[n\right]\left[D\right]}\left(t\right)+{u}_{i}^{\left[n\right]\left[I\right]}\left(t\right),$$
(7)

where

$${u}_{i}^{\left[n\right]\left[D\right]}\left(t\right)\equiv {k}_{i}^{\left[1\right]}\left(t\right)\left({b}_{i(1)}-{c}_{i}\right),$$
(8)
$${u}_{i}^{\left[n\right]\left[I\right]}\left(t\right)\equiv {\sum }_{l=2}^{n}{k}_{i}^{\left[l\right]}\left(t\right){b}_{i(l)}.$$
(9)

Here, \({u}_{i}^{\left[n\right]\left[D\right]}\left(t\right)\) is the direct utility term, that is, the utility of node \(i\) obtained from all of the nodes with the 1st degree of separation, and \({u}_{i}^{\left[n\right]\left[I\right]}\left(t\right)\) is the indirect utility term, that is, the utility of node \(i\) obtained from all of the nodes within the range between the 2nd and nth degrees of separation.

The utility-based preferential attachment models with \({\varvec{n}}=2,\) including the UD, UI, UM+, and UM- models

In what follows, we set \({c}_{i}\) for arbitrary node \(i\) to the same value \(c\) and \({b}_{i(l)}\) for arbitrary node \(i\) to the same function \({b}_{(l)}\) for simplicity. Notably, \({b}_{(l)}\) is written as simply \({b}_{l}\). We restrict the distance between nodes here to \(n=2\). Thus, the utility-based preferential attachment model is reduced to

$$\frac{d}{dt}{k}_{i}^{\left[1\right]}\left(t\right)=m{\Pi }_{i}^{[2]}\left(t\right),$$
(10)

where

$${\Pi }_{i}^{[2]}\left(t\right)=\frac{{u}_{i}^{\left[2\right]}\left(t\right)}{{\sum }_{j=1}^{t}{u}_{j}^{\left[2\right]}\left(t\right)},$$
(11)
$$\begin{aligned} u_{i}^{\left[ 2 \right]} \left( t \right) &= u_{i}^{\left[ 2 \right]\left[ D \right]} \left( t \right) + u_{i}^{\left[ 2 \right]\left[ I \right]} \left( t \right) \nonumber \\ &\quad = g\left( {c,b_{1} } \right)k_{i}^{\left[ 1 \right]} \left( t \right) + b_{2} k_{i}^{\left[ 2 \right]} \left( t \right), \end{aligned}$$
(12)
$$g\left(c,{b}_{1}\right)\equiv {b}_{1}-c.$$
(13)

Then, \({\Pi }_{i}^{[2]}\left(t\right)\) is described as

$${\Pi }_{i}^{[2]}\left(t\right)=\frac{g(c,{b}_{1}){k}_{i}^{\left[1\right]}\left(t\right)+{b}_{2}{k}_{i}^{\left[2\right]}\left(t\right)}{{\sum }_{j=1}^{t}\left\{g(c,{b}_{1}){k}_{j}^{\left[1\right]}\left(t\right)+{b}_{2}{k}_{j}^{\left[2\right]}\left(t\right)\right\}},$$
(14)

and Eqs. (11) and (12) show that the utility-based preferential attachment model with \(n=2\) is described as

$$\frac{d}{dt}{k}_{i}^{\left[1\right]}\left(t\right)=m\frac{g\left(c,{b}_{1}\right){k}_{i}^{\left[1\right]}\left(t\right)+{b}_{2}{k}_{i}^{\left[2\right]}\left(t\right)}{{\sum }_{j=1}^{t}\left\{g\left(c,{b}_{1}\right){k}_{j}^{\left[1\right]}\left(t\right)+{b}_{2}{k}_{j}^{\left[2\right]}\left(t\right)\right\}},$$
(15)
$$\frac{d}{dt}{k}_{i}^{\left[2\right]}\left(t\right)=m\frac{{b}_{2}{\left({k}_{i}^{\left[1\right]}\left(t\right)\right)}^{2}+\left\{g\left(c,{b}_{1}\right)-{b}_{2}\right\}{k}_{i}^{\left[1\right]}(t)+g\left(c,{b}_{1}\right){k}_{i}^{\left[2\right]}\left(t\right)+{b}_{2}{k}_{i}^{[3]}(t)}{{\sum }_{j=1}^{t}\left\{g\left(c,{b}_{1}\right){k}_{j}^{\left[1\right]}\left(t\right)+{b}_{2}{k}_{j}^{\left[2\right]}\left(t\right)\right\}}$$
(16)

(see Supplementary Information S3). This model is hereafter referred to as the “utility-based preferential attachment model” for simplicity. In this study, we analyse the case of \(m=1\).

The UD model represents the situation satisfying \({u}_{i}^{\left[2\right]}\left(t\right)={u}_{i}^{\left[2\right]\left[D\right]}\left(t\right)\), which is realized in the condition \({b}_{2}=0\), and the substitution of this condition into Eqs. (13) and (14) shows that \({\Pi }_{i}^{\left[2\right]}\left(t\right)\) becomes

$${\Pi }_{i}^{\left[2\right]}\left(t\right)=\frac{{k}_{i}^{\left[1\right]}\left(t\right)}{{\sum }_{j=1}^{t}{k}_{j}^{\left[1\right]}\left(t\right)}.$$
(17)

This is the same preferential attachment probability as the BA model, and this means that considering only direct utility in the preference, the utility-based preferential attachment model is reduced to the BA model. Then, the substitution of \({b}_{2}=0\) into Eqs. (15) and (16) shows that the UD model is mathematically described as

$$\frac{d}{dt}{k}_{i}^{\left[1\right]}\left(t\right)=\frac{{k}_{i}^{\left[1\right]}\left(t\right)}{{\sum }_{j=1}^{t}{k}_{j}^{\left[1\right]}\left(t\right)},$$
(18)
$$\frac{d}{dt}{k}_{i}^{\left[2\right]}\left(t\right)=\frac{{{k}_{i}^{\left[1\right]}\left(t\right)+k}_{i}^{\left[2\right]}\left(t\right)}{{\sum }_{j=1}^{t}{k}_{j}^{\left[1\right]}\left(t\right)}.$$
(19)

The UI model represents the situation satisfying \({u}_{i}^{\left[2\right]}\left(t\right)={u}_{i}^{\left[2\right]\left[I\right]}\left(t\right)\), which is realized in the condition \({b}_{1}=c\), and substituting this condition into Eqs. (13) and (14) indicates that \({\Pi }_{i}^{\left[2\right]}\left(t\right)\) becomes

$${\Pi }_{i}^{\left[2\right]}\left(t\right)=\frac{{k}_{i}^{\left[2\right]}\left(t\right)}{{\sum }_{j=1}^{t}{k}_{j}^{\left[2\right]}\left(t\right)}.$$
(20)

Then, the substitution of \({b}_{1}=c\) into Eqs. (15) and (16) shows that the UI model is mathematically described as

$$\frac{d}{dt}{k}_{i}^{\left[1\right]}\left(t\right)=\frac{{k}_{i}^{\left[2\right]}\left(t\right)}{{\sum }_{j=1}^{t}{k}_{j}^{\left[2\right]}\left(t\right)},$$
(21)
$$\frac{d}{dt}{k}_{i}^{\left[2\right]}\left(t\right)=\frac{{\left({k}_{i}^{\left[1\right]}\left(t\right)\right)}^{2}-{k}_{i}^{\left[1\right]}\left(t\right)+{k}_{i}^{\left[3\right]}(t)}{{\sum }_{j=1}^{t}{k}_{j}^{\left[2\right]}(t)}.$$
(22)

The UM+ model represents the situation satisfying \({u}_{i}^{\left[2\right]}\left(t\right)={u}_{i}^{\left[2\right]\left[D\right]}\left(t\right)+{u}_{i}^{\left[2\right]\left[I\right]}\left(t\right)={k}_{i}^{\left[1\right]}\left(t\right)+{k}_{i}^{\left[2\right]}\left(t\right)\), which is realized in the condition \({b}_{1}=c+1, {b}_{2}=1\), and the substitution of this condition into Eqs. (13) and (14) shows that \({\Pi }_{i}^{\left[2\right]}\left(t\right)\) becomes

$${\Pi }_{i}^{\left[2\right]}\left(t\right)=\frac{{k}_{i}^{\left[1\right]}\left(t\right)+{k}_{i}^{\left[2\right]}\left(t\right)}{{\sum }_{j=1}^{t}\left({k}_{i}^{\left[1\right]}\left(t\right)+{k}_{j}^{\left[2\right]}\left(t\right)\right)}.$$
(23)

This is the same preferential attachment probability as the K2 model. Then, the substitution of \({b}_{1}=c+1, {b}_{2}=1\) into Eqs. (15) and (16) shows that the UM+ model is mathematically described as

$$\frac{d}{dt}{k}_{i}^{\left[1\right]}\left(t\right)=\frac{{k}_{i}^{\left[1\right]}\left(t\right)+{k}_{i}^{\left[2\right]}\left(t\right)}{{\sum }_{j=1}^{t}\left({k}_{i}^{\left[1\right]}\left(t\right)+{k}_{j}^{\left[2\right]}\left(t\right)\right)},$$
(24)
$$\frac{d}{dt}{k}_{i}^{\left[2\right]}\left(t\right)=\frac{{\left({k}_{i}^{\left[1\right]}\left(t\right)\right)}^{2}+{k}_{i}^{\left[2\right]}(t)+{k}_{i}^{\left[3\right]}(t)}{{\sum }_{j=1}^{t}\left({k}_{i}^{\left[1\right]}\left(t\right)+{k}_{j}^{\left[2\right]}\left(t\right)\right)}.$$
(25)

The UM- model represents the situation satisfying \({u}_{i}^{\left[2\right]}\left(t\right)={u}_{i}^{\left[2\right]\left[D\right]}\left(t\right)+{u}_{i}^{\left[2\right]\left[I\right]}\left(t\right)=-{k}_{i}^{\left[1\right]}\left(t\right)+{k}_{i}^{\left[2\right]}\left(t\right)\), which is realized in the condition \({b}_{1}=c-1, {b}_{2}=1\), and the substitution of this condition into Eqs. (13) and (14) shows that \({\Pi }_{i}^{\left[2\right]}\left(t\right)\) becomes

$${\Pi }_{i}^{\left[2\right]}\left(t\right)=\frac{-{k}_{i}^{\left[1\right]}\left(t\right)+{k}_{i}^{\left[2\right]}\left(t\right)}{{\sum }_{j=1}^{t}\left(-{k}_{i}^{\left[1\right]}\left(t\right)+{k}_{j}^{\left[2\right]}\left(t\right)\right)}.$$
(26)

Then, the substitution of \({b}_{1}=c-1, {b}_{2}=1\) into Eqs. (15) and (16) shows that the UM- model is mathematically described as

$$\frac{d}{dt}{k}_{i}^{\left[1\right]}\left(t\right)=\frac{-{k}_{i}^{\left[1\right]}\left(t\right)+{k}_{i}^{\left[2\right]}\left(t\right)}{{\sum }_{j=1}^{t}\left(-{k}_{i}^{\left[1\right]}\left(t\right)+{k}_{j}^{\left[2\right]}\left(t\right)\right)},$$
(27)
$$\frac{d}{dt}{k}_{i}^{\left[2\right]}\left(t\right)=\frac{{\left({k}_{i}^{\left[1\right]}\left(t\right)\right)}^{2}-2{k}_{i}^{\left[1\right]}\left(t\right)-{k}_{i}^{\left[2\right]}(t)+{k}_{i}^{\left[3\right]}(t)}{{\sum }_{j=1}^{t}\left(-{k}_{i}^{\left[1\right]}\left(t\right)+{k}_{j}^{\left[2\right]}\left(t\right)\right)}.$$
(28)

The criteria for selecting a static social network in Stanford large network dataset collection

The Stanford large network dataset collection38 divides network datasets into 23 categories. Our selection criteria are as follows: (1) a category in which the properties of nodes can be grouped into the same type, (2) containing at least three network samples to discuss general properties of the category, (3) an unweighted and undirected network like our mathematical model, and (4) a network in which the local hub's degree has grown sufficiently as in the analysis criteria (\({k}_{i}^{\left[1\right]}>50\)). The categories that met our criteria are “autonomous systems”, “social networks”, “location-based online social networks”, “collaboration networks”, and “Wikipedia”, of which “social networks” and “location-based online social networks” are classified into “social networks” types together. Two networks are excluded as exceptions. “ego-Facebook” network in the “social networks” category is not of interest because it is a dataset about the ego network, and “as-Skitter” network in “autonomous systems” is excluded because it is too large to analyse. As a result, 34 networks met our criteria: autonomous systems (3 networks), collaboration networks (5 networks), social networks (23 networks), and Wikipedia (3 networks).