The Collective Direction of Attention Diffusion

Wang, Cheng-Jun; Wu, Lingfei; Zhang, Jiang; Janssen, Marco A.

doi:10.1038/srep34059

Download PDF

Article
Open access
Published: 28 September 2016

The Collective Direction of Attention Diffusion

Cheng-Jun Wang¹,
Lingfei Wu²,
Jiang Zhang³ &
…
Marco A. Janssen^2,4

Scientific Reports volume 6, Article number: 34059 (2016) Cite this article

2233 Accesses
8 Citations
2 Altmetric
Metrics details

Subjects

Abstract

We find that the flow of attention on the Web forms a directed, tree-like structure implying the time-sensitive browsing behavior of users. Using the data of a news sharing website, we construct clickstream networks in which nodes are news stories and edges represent the consecutive clicks between two stories. To identify the flow direction of clickstreams, we define the “flow distance” of nodes (L_i), which measures the average number of steps a random walker takes to reach the ith node. It is observed that L_i is related with the clicks (C_i) to news stories and the age (T_i) of stories. Putting these three variables together help us understand the rise and decay of news stories from a network perspective. We also find that the studied clickstream networks preserve a stable structure over time, leading to the scaling between users and clicks. The universal scaling behavior is confirmed by the 1,000 Web forums. We suggest that the tree-like, stable structure of clickstream networks reveals the time-sensitive preference of users in online browsing. To test our assumption, we discuss three models on individual browsing behavior, and compare the simulation results with empirical data.

Anomalous structure and dynamics in news diffusion among heterogeneous individuals

Article 20 May 2019

Xiaochen Wang, Yueheng Lan & Jinghua Xiao

Unraveling the Origin of Social Bursts in Collective Attention

Article Open access 13 March 2020

Manlio De Domenico & Eduardo G. Altmann

Realistic modelling of information spread using peer-to-peer diffusion patterns

Article 28 August 2020

Bin Zhou, Sen Pei, … H. Eugene Stanley

Introduction

In theory, information can have an infinite number of copies. This presents challenges for predicting the duration of information diffusion and the number of copies to be generated at each step. To address this problem, scholars investigate the competition for limited attention between information pieces^1,2,3,4,5. A widely used model is called “clickstream networks”, in which nodes are information pieces and edges represent the successive clicks between nodes created by users. This model has been applied to uncover the hidden structure of semantic spaces², create high-resolution human knowledge maps¹, and analyze the diffusion of Internet memes³.

In previous studies on clickstream networks, a rarely mentioned topic is the direction of clickstreams. We assume that in an online system with many users, the randomness in the browsing behavior of individual users may cancel out with each other, giving rise to a system-level order that reveals the collective preference of users. Therefore, we should be able to identify the direction of clickstream diffusion at the collective level, which is an innovation compared to earlier literature. To verify our assumption, we analyze two datasets, the news story voting records from a news sharing website and the thread browsing records from 1,000 Chinese Web forums. We define “flow distance” L_i, a network-based metric, to detect the direction of clickstreams. L_i measures the average number of steps a random walker takes to reach the ith node in clickstream networks. It is similar to “effective distance” proposed in ref. 6, but overcomes the latter’s limitation⁷ by considering all possible paths between two nodes and not just the shortest path. By sorting nodes in order of L_i we retrieve the global direction of clickstreams in the network from the complex local interactions between nodes; clickstreams are generally transported from nodes associated with low values of L_i to nodes who have high values of L_i. Meanwhile, we find two variables characterizing the properties of nodes, the clicks to nodes C_i and the age of nodes T_i, are related with L_i. In the news sharing website, putting together these three variables gives a comprehensive understanding of the life cycle of news: as time goes by, the flow distance of news stories increases due to a lack of novelty and visibility, leading to the decline of clicks to these stories. The constant replacement of old news by the most recent news gives rise to a stable, tree-like structure of clickstream networks, in which the latest news stories are always located near the “root” and the earlier stories occupy the “leaf” positions. A systematic investigation of 1,000 Web forums not only confirms the tree structure of clickstream networks, but also predicts that, this structure is related with the scaling between users and clicks, which has been widely observed in online systems^5,8. We suggest that the tree-like, stable structure of clickstream networks reveals the time-sensitive preference of users in online browsing. To test our assumption, we discuss three models on individual browsing behavior. In these models users generally visit webpages in reverse chronological order, but different models allow users to repeatedly visit pages in different ways. By comparing the mechanisms that allow users revisit the newest, the oldest, and the middle-age stories, respectively, we find that the model in which users revisit middle-age webpages presents the properties most similar to the real systems.

Materials and Methods

Data Sources

We analyze two datasets of web browsing activities, including DIGG and TIEBA. Digg (http://digg.com/) is a news sharing website. On this website, users submit new stories and vote for them. In particular, headlines of news are displayed on the homepage and sub-category pages (e.g., technology, entertainment, sports, etc.). To view a news story, a user clicks the headline and opens a new webpage that displays the full story. If the user likes the story, he gives a thumbs-up (also called “digg”) for that story. The stories will appear on the homepage if they obtain a critical mass of votes (diggs) quickly enough. Due to the limited space of the homepage, old stories are constantly replaced by new stories. The DIGG dataset under study includes 3 × 10⁶ votes to 3553 news stories created by 1.4 × 10⁴ users in a month⁴. Tieba (http://tieba.baidu.com/) is the largest Chinese web forum system managed by the Chinese search engine company, Baidu. On this platform, users can create new forums (“bar”) with custom, unique names. After a forum is created, users may start a new thread, post a reply in a particular thread, or click the headline of a thread to open a new page and read the content. Different from Digg, Tieba does not calculate the popularity of threads (news stories). Threads are displayed in reverse chronological order by the time of the latest reply. As a result, while most of threads are removed away from the homepage due to a lack of novelty, a few threads may sit on the homage for a long time. Out data set contains the thread browsing records on the top 1,000 forums created by more than 1 × 10⁷ users in 24 hours. Both DIGG and TIEBA are anonymous datasets and we do not have access to the personally identifiable information of users.

Constructing Clickstream Networks

To obtain clickstream networks we split dataset into chunks of equal time span (a hour in TIEBA and a day in DIGG) and count the number of successive visits w_ij to two pages i and j within the given time period. For each w_ij we add a weighted, directed edge pointing from i to j. After all edges are included in the network, we apply a technique called “network balancing”. We add two artificial nodes “source” and “sink” and connect them to the existing nodes such that weighted in-degree (the sum of weights over inbound edges) equals weighted out-degree (the sum of weights over outbound edges) on each of the existing nodes⁹. These two artificially added nodes represent the “environment” of an online community, namely, other online communities and/or the offline world. Including the environment in the analysis allows us to investigate the complete clicking paths of users. All users come from the environment (“source”), enter into the system to click a sequence of webpages, and then leave from the system and back to the environment (“sink”). This information is particularly important if we want to analyze the transition probability of users between webpages. Note that besides the aforementioned method, there are also other methods for constructing the temporal paths¹⁰.

Figure 1 presents two example clickstream networks, in which nodes are webpages and edges represent the successive clicks between webpages. Figure 1A gives a very simple network to illustrate that we can calculate finite flow distance for nodes on loops. In Fig. 1B we present a more complex network to illustrate the principle of clickstream conservation after networks are balanced. Clickstreams conservation means that (1) the total number of users entering into the system equals to the total number of users leaving the system; and (2) The total number of clicks created in the system equals the sum of clicks over all nodes. The clickstreams conservation can be expressed as

**Figure 1: Two example clickstream networks in which nodes are webpages and edges represent the successive visits between webpages.**

and

in which N is the number of nodes in the network. F and V represents the total number of users and clicks, respectively. D_i is the number of clicks transported to sink from the ith node. It also indicates the number of users leaving the system from the ith node. C_i is the total number of clicks generated on the ith node. It equals the weighted in-degree or the weighted out-degree of node i.

Calculating Flow Distance L_i

We define flow distance L_i as the average number of steps (edges) a user takes from source to reach the ith node. In Fig. 1 we show the flow distance of nodes using text in red. L_i can be calculated using either an iterative method or a matrix-based method. For the iterative calculation, we firstly set the initial values of L_i as one unit for all nodes. Starting for any arbitrarily selected node i, we calculate the average path length of all its upstream nodes j, weighted by the normalized form of w_ji (which is divided by the weighted in-degree of i). We then obtain and continue to calculate the flow distance of the downstream nodes of i. The process goes on until the values of flow distance converge on all nodes. The iterative calculation is simple and fast in practice and is particularly powerful in handling large datasets. A similar method is used to calculate the PageRank values of webpages in large hyperlink networks¹¹.

The iterative method is very useful in empirical data processing, but it does not give a formal definition of L_i. Therefore, we derive the matrix-based definition of L_i based on a Markov model of clickstreams as follows. Firstly we obtain a weight matrix F from a clickstream network and normalize F by column to derive a new matrix M, whose element M_ji represents the probability that a user visiting node i comes from its upstream neighbor j. For the convenience of calculation, we transpose M to M^T such that , and let the sum of each row in M^T equals 1. We can write L_i as:

Eq. 3 holds because in order to reach node i, a user has to visit one of its upstream neighbors j, creating a path of length 1 + L_j with probability . The only exception occurs when the random walker jumps to i directly from source, generating a path of length 1 with probability . To solve Eq. 3 we use the condition that the sum of each row in M^T equals 1 and rewrite it as

which tells us that, the flow distance from source to node i equals 1 plus the expected value of the flow distances from source to its upstream nodes j. Therefore, the previously discussed iterative calculation of L_i is proved to be correct. The matrix form of Eq. 4 reads

in which I is an all-ones vector. Eq. 5 can be solved as

in which A is an identity matrix.

Results

The Life Cycle of News

In the age of information overload, news has to compete for the limited attention of users, i.e., attract clicks, in order to stay visible in the virtual world. This competition exists throughout the whole life cycle of news, leading to the constantly replacement of old news by the latest news⁴. In the analyzed Digg system, a news story has to obtain a critical mass of votes (diggs) in a short period (which is defined by the news ranking algorithm of Digg) in order to appear on the homepage. And sitting on the homepage will help it attract more votes quickly. But the increase of votes will gradually slow down and saturate due to the lack of novelty and visibility (when its position on the homepage is taken by the latest news). Besides the popularity ranking algorithm, other applications such as the interface displaying friends’ preferences may also influence users’ decision on popular stories to promote¹². Therefore, the rise and decay of news is caused by the content switching mechanism of users, which reflects the preference of users moderated by the website algorithms designed to facilitate news voting.

We investigate the life cycle of news in DIGG and find that the daily clicks to news stories generally increase and reach a maximum value on the second day of news submission, and then decay over a relatively long time period. To quantify the decay period we write clicks C as a function of time T and find that ln(−ln(C)) scales to ln(T) linearly, or

in which k and ω are constant parameters. Eq. 7 is called stretched exponential function or Kohlrausch-Williams-Watts function⁴. The parameter ω determines the decay rate of clicks. When ω > 0, clicks decrease slower than exponential function and faster than power law over time⁴. In Fig. 2A we estimate ω = 0.4 using the data of the ten most popular categories of news. Interestingly, while the decay of clicks generally follows Eq. 7, different categories of news have different decay rates. In particular, among the top ten categories, the attention to politics news has the fastest decay, and the interest to health news lasts the longest, as shown in Fig. 2B.

**Figure 2: The temporal evolution of clicks.**

To better understand the temporal evolution of clicks to news, we construct clickstream networks to observe how users switch between news stories systematically and investigate whether the observed decay of attention is related with the change of the position of news in these networks. We trace the individual news voting streams of users on a daily basis and aggregate them to obtain daily networks in which nodes represent news stories and edges show the clickstreams between stories. As introduced in Materials and Methods, we add two artificial nodes, “source” and “sink” to represent the “environment” of clickstream networks, which could be other online communities and/or the offline world. Including the environment in the analysis allows us to investigate the complete clicking paths of users.

We find that the diffusion of clickstreams has a direction at the network level, revealing the collective preference of users in favor of new stories against old ones. In particular, we define a network-based metric, “flow distance” L_i, to measure the expected number of steps (edges) a random walker takes from source to reach the ith story (see Materials and Methods for details). By sorting nodes in order of L_i we retrieve the global direction of clickstreams; they are generally transported from nodes associated with low values of L_i to nodes who have high values of L_i. As shown by Fig. 3, the flow of clickstreams forms a directed, tree-like structure, and the distribution of clicks on L_i seems to be time-invariant, even though the positions of old nodes are constantly taken by new nodes.

**Figure 3: Four daily DIGG clickstream trees.**

It is observed that the flow distance of news scales to time sub-linearly, satisfying the function

in which m and ω′ are constant parameters. In Fig. 4A we plot ln(L) against ln(T) in the log-log axes and estimate that ω′ = 0.35 using the OLS regression. The 95% Confidence Interval of ω′ is [0.25, 0.42]. According to the property of power functions, ω < 1 means that the flow distance of news goes up rapidly at first and then the increasing speed becomes slower as time goes by.

**Figure 4: The relations between news age, flow distance, and the number of received clicks.**

We find that the cumulative number of clicks W increases with flow distance monotonically, forming a S-shaped growth curve. While other functions are also available, we use the Gompertz function to model the growth curve. This is because the Gompertz function (and its derivative) is both simple in math and also widely used to characterize the growth dynamics of complex systems¹³.

in which α₁ and β₁ are constant parameters. α₁ determines the horizontal position of the midpoint of the function (where W(L) = 0.5) and β₁ affects how steeply the function rises as it passes through its midpoint. The shape of W(L) characterizes the clickstream production behavior of the system. Increasing the value of α₁ will move the curve to the right, increasing the fraction of users of long surfing paths. These two parameters are estimated to be α₁ = 159.46 and β₁ = 0.34 in Fig. 4B.

From Eq. 9 we know that the function of clicks on flow distance should be the differential equation of the Gompertz function, that is,

As shown by Fig. 4B, the number of clicks increases with flow distance at first and then decreases with it. The turning point appears at the location L ≈ 4, which corresponds to T ≈ 1 according to Fig. 4A. This observation is consistent with Fig. 2, which shows that the daily clicks to news increase and reach a maximum value on the second day of news submission (T = 0 for the first day and T = 1 for the second day in Fig. 2), and then decay over a relatively long time period. As we focus on the decay trend of clicks, we only need to analyze the behavior of Eq. 9 and Eq. 10 when T ≥ 1 and L ≥ 4. Considering the empirical values of α₁ and β₁, we know that when L ≥ 4, the first term in the exponent of Eq. 10 is approximately 0, giving

Putting together Eq. 8 and Eq. 11 leads to

Comparing Eq. 12 with Eq. 7, we predict that ω′ ≈ ω. In empirical data analysis, we find that the value ω = 0.4 lies in the 95% Confidence Interval of ω′, that is, [0.25, 0.42]. This means that the studied clickstream networks preserve a stable daily structure that allows the prediction of clicks to news stories by their positions in networks.

The Scaling of Clickstream Networks

In the last section we constructed clickstream network and analyzed the relationship between three variables of news stories, including age (T), the distance from source (L), and clicks (C). We find that due to the existence of a stable clickstream structure, we can predict the clicks to news stories from their position in clickstream networks. To further investigate the observed stable structure of clickstream networks, we will analyze the TIEBA dataset, which includes the clicking records generated by more than 1 × 10⁷ users in 24 hours on 1,000 web forums. If the clickstream networks always preserve the similar structure over time, we should be able to obtain a robust relationship between users and clicks, as discussed in^5,8,14. Our analysis is presented as follows.

We have shown that the cumulative number of clicks W(L) increases with flow distance L, forming a S-shaped curve that can be fitted by the Gompertz function, as described by Eq. 2 and Fig. 4B. We define another quantity U(L), that is, the fraction of users leaving the system from nodes of a flow distance smaller than L. We find that U(L) also has an S-shape, i.e., we have:

Putting Eq. 9 and Eq. 13 together, we achieve a more comprehensive understanding on the metabolism of clickstreams in online systems. We fit these two equations using the clickstream networks collected from the EXO (which is the name of a Korean band) forum and find that the values of β₁/β₂ and α₁/α₂ do not fluctuate a lot over time, supporting our assumption on the time-invariant structure of clickstream networks. In particular, the mean of β₁/β₂ is 0.96 and the mean of α₁/α₂ is 1.17. The standard deviations (SD) of both variables are 0.08, which are very small compared to the means.

In Fig. 5A we show the relationship between U(L) (the upper bound of bands) and W(L) (the lower bound of bands) in each hour. Generally, these two curves have the similar S-shape, but are separated by a gap between them. As α in the Gompertz function determines the location of curve on the horizontal axis, the value of α₁/α₂ controls the width of the gap. In particular, α₁ determines the mode value of surfing lengths at the individual level (see the dashed blue curve in Fig. 5A for example) and α₂ determines the mode value of the surfing lengths aggregated at the webpage level (see the dashed green curve in Fig. 5A for example). If we fix the value of α₁ and compare two online systems A and B, in which α₂ is greater in B than in A, we can infer that while a majority of users have a similar surfing length in two systems, there are more users who have long surfing paths in system B, increasing the mode value of surfing length at the webpage level. In other words, while most users leave system B within a few steps, a few users visit a lot of threads, generating long surfing paths. Therefore, the gap between U(L) and W(L) reflects the inequality of click contribution among users. And the larger the gap is, the more unequal the click contributions are^15,16,17.

**Figure 5: The scaling of clickstream networks.**

Using the condition that β₁/β₂ ≈ 1 and α₁/α₂ > 1, we combine Eq. 9 and Eq. 13 and derive

Note that when L reaches the maximum value, W(L) and U(L) equals the total number of clicks and users in the network. Considering Eq. 1 and Eq. 2, we have

and

Therefore, Eq. 14 reads

which predicts that within each network, W(L) always scales to U(L) super-linearly. This prediction is supported by Fig. 5B. Meanwhile, we find that equation

also holds (see Fig. 5C). Putting Eq. 17 and Eq. 18 together we have

or,

If we treat ψ as random noise, Eq. 20 allows us to predict γ from α₁/α₂. This is non-trivial because the value of α₁/α₂ can be obtained by analyzing a single, randomly selected hourly network, whereas γ can only be obtain through a collection of networks over many hours. To verify our assumption, we systematically investigate the 1,000 forums in the TIEBA dataset and find that Eq. 20 is supported by the empirical data (Fig. 5D).

To summarize, our analysis shows that directed, tree-like structure of clickstream networks, which remains stable over time, leads to the scaling between user and clicks. In particular, the gap between the increase of clicks and users with flow distance, which reflects the inequality of click contribution among users, determines the super-linear scaling γ of the total number of clicks against user population.

Conclusions and Discussions

The previous empirical analysis in this paper reveals the time-sensitive nature of news and thread browsing activities at the collective level. However, the content switching mechanism at the individual level still remains unclear. Proposing a universal mechanism for individual browsing goes beyond the scope of the current paper, but we would like to present some preliminary results towards this direction to inspire further studies. We assume that users visit webpages in reverse chronological order, but in different models users return to previously visited nodes following different rules. In particular, users revisit the newest, the oldest, and the middle-age webpages in three models, respectively. We compare three models on individual browsing behavior.

Panel A in Fig. 6 gives the schematic representations of the three models. All the three models have two parameters, the way-back searching probability p and the number of clicks N. The indexed nodes show the webpages sorted in reverse chronological order. In particular, node t₁ represents the newest webpage and node t_i represent the webpage that was added into the system i time steps ago. We assume that a typical user starts browsing by visiting t₁ and then continues by clicking earlier nodes. For each step of browsing, we assume the probability of visiting the next page is p, and the probability of returning back to previously visited pages is 1 − p. Meanwhile, starting from t₁, a user either continue to visit t₂ with probability p or return to t₁ again with probability 1 − p. As for the returning rules, Model 1 only allows one-step return from t_i to t_{i −1}. Model 2 assumes that users always return from t_i to t₁. Model 3 assumes that users return from t_i to the webpage in the mid-range between t₁ and t_i, i.e., the i/2th page. In Fig. 6B we compared the simulation results for p = 0.5 and N = 1000 between three models. We find that Model 1, also called “bounded random walk model”¹⁸, allows users to explore very old webpages, whereas Model 2 and 3 only allow users to explore the newest pages. Therefore, Model 2 and 3 give time-sensitive dynamics that is more similar to that in real systems. In Fig. 7 we systematically compare the relationships between clicks C, age T, and flow distance L across three models. It is observed that Model 3 presents more similar dynamics to empirical findings than the other two models.

**Figure 7: The dynamics of three Web browsing models.**

In summary, we analyze the directed, tree-like structure of attention diffusion. We develop a network-based metric flow distance L to detect the direction of attention flow and suggest that, by putting together L, clicks C, and age T we are able to quantify the competition for attention throughout the entire life cycles of news, which are relevant to their positions in the clickstream networks. The structure of clickstream networks is found to be stable over time, leading to the scaling between users and clicks. Finally, we compare three time-sensitive individual browsing models and find that the model in which users revisit the middle-age webpages gives rise to the dynamics that is most similar to real systems.

Additional Information

How to cite this article: Wang, C.-J. et al. The Collective Direction of Attention Diffusion. Sci. Rep. 6, 34059; doi: 10.1038/srep34059 (2016).

References

J. Bollen et al. Clickstream data yields high-resolution maps of science. Plos One 4(3), e4803 (2009).
Article ADS Google Scholar
C. Cattuto, V. Loreto & L. Pietronero . Semiotic dynamics and collaborative tagging. Proceedings of the National Academy of Sciences 104(5), 1461–1464 (2007).
Article CAS ADS Google Scholar
L. Weng, A. Flammini, A. Vespignani & F. Menczer . Competition among memes in a world with limited attention. Scientific Reports 2 (2012).
F. Wu & B. A. Huberman . Novelty and collective attention. Proceedings of the National Academy of Sciences 104(45), 17599–17601 (2007).
Article CAS ADS Google Scholar
L. Wu, J. Zhang & M. Zhao . The metabolism and growth of web forums. Plos one, 9(8), e102646 (2014).
Article ADS Google Scholar
D. Brockmann & D. Helbing . The hidden geometry of complex, network-driven contagion phenomena. Science 342(6164), 1337–1342 (2013).
Article CAS ADS Google Scholar
L. Wang & X. Li . Spatial epidemiology of networked metapopulation: An overview. Chinese Science Bulletin 59(28), 3511–3522 (2014).
Article ADS Google Scholar
L. Wu & J. Zhang . Accelerating growth and size-dependent distribution of human online activities. Physical Review E 84(2), 026113 (2011).
Article ADS Google Scholar
M. Higashi . Extended input-output flow analysis of ecosystems. Ecological Modelling 32(1), 137–147 (1986).
Article Google Scholar
Y. Zhang, L. Wang, Y.-Q. Zhang & X. Li. Towards a temporal network analysis of interactive wifi users. Europhysics Letters 98(6), 68002 (2012).
Article ADS Google Scholar
S. Brin & L. Page . The anatomy of a large-scale hypertextual web search engine. Computer networks and ISDN systems 30(1), 107–117 (1998).
Article Google Scholar
K. Lerman & R. Ghosh . Information contagion: An empirical study of the spread of news on digg and twitter social networks. ICWSM 10, 90–97 (2010).
Google Scholar
P. Waliszewski & J. Konarski . A mystery of the gompertz function. In Fractals in biology and medicine pages 277–286. Springer (2005).
J. Zhang & L. Wu . Allometry and dissipation of ecological flow networks. Plos one 8(9), e72525 (2013).
Article CAS ADS Google Scholar
B. A. Huberman, P. L. Pirolli, J. E. Pitkow & R. M. Lukose . Strong regularities in world wide web surfing. Science 280(5360), 95–97 (1998).
Article CAS ADS Google Scholar
B. A. Huberman, D. M. Romero & F. Wu . Crowdsourcing, attention and productivity. Journal of Information Science 35(6), 758–765 (2009).
Article Google Scholar
L. Wu . The accelerating growth of online tagging systems. The European Physical Journal B-Condensed Matter and Complex Systems 83(2), 283–287 (2011).
Article CAS ADS Google Scholar
J. Nicolau . Stationary processes that look like random walks ? the bounded random walk process in discrete and continuous time. Econometric Theory 18(01), 99–118 (2002).
Article MathSciNet Google Scholar
E. M. Reingold & J. S. Tilford . Tidier drawings of trees. Software Engineering, IEEE Transactions on (2), 223–228 (1981).

Download references

Acknowledgements

L.W. acknowledges the financial support for this work from the National Science Foundation, grant number 1210856. CJ.W. acknowledges the financial support for this work from the National Social Science Foundation of China, grant number 15CXW017, the China Postdoctoral Science Foundation, grant number 2015M571722, and the Fundamental Research Funds for the Central Universities, grant number 2062015008.

Author information

Authors and Affiliations

Computational Communication Collaboratory, School of Journalism and Communication, Nanjing University, Nanjing, 210093, P.R. China
Cheng-Jun Wang
Center for Behavior, Institutions and the Environment, Arizona State University, Tempe, 85281, AZ, USA
Lingfei Wu & Marco A. Janssen
School of Systems Science, Beijing Normal University, Beijing, 100875, P.R. China
Jiang Zhang
School of Sustainability, Arizona State University, Tempe, 85281, AZ, USA
Marco A. Janssen

Authors

Cheng-Jun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Lingfei Wu
View author publications
You can also search for this author in PubMed Google Scholar
Jiang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Marco A. Janssen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.W. proposed the idea and led the study, C.-J.W., L.W. and J.Z. performed the data analysis and did the analytical work, L.W. and M.A.J. prepared the manuscript.

Corresponding author

Correspondence to Lingfei Wu.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Wang, CJ., Wu, L., Zhang, J. et al. The Collective Direction of Attention Diffusion. Sci Rep 6, 34059 (2016). https://doi.org/10.1038/srep34059

Download citation

Received: 29 September 2015
Accepted: 05 September 2016
Published: 28 September 2016
DOI: https://doi.org/10.1038/srep34059

This article is cited by

The Hidden Flow Structure and Metric Space of Network Embedding Algorithms Based on Random Walks
- Weiwei Gu
- Li Gong
- Jiang Zhang
Scientific Reports (2017)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.