Exploiting Information Diffusion Feature for Link Prediction in Sina Weibo

Li, Dong; Zhang, Yongchao; Xu, Zhiming; Chu, Dianhui; Li, Sheng

doi:10.1038/srep20058

Download PDF

Article
Open access
Published: 28 January 2016

Exploiting Information Diffusion Feature for Link Prediction in Sina Weibo

Dong Li^1,2,
Yongchao Zhang¹,
Zhiming Xu¹,
Dianhui Chu² &
…
Sheng Li¹

Scientific Reports volume 6, Article number: 20058 (2016) Cite this article

2951 Accesses
24 Citations
2 Altmetric
Metrics details

Subjects

Abstract

The rapid development of online social networks (e.g., Twitter and Facebook) has promoted research related to social networks in which link prediction is a key problem. Although numerous attempts have been made for link prediction based on network structure, node attribute and so on, few of the current studies have considered the impact of information diffusion on link creation and prediction. This paper mainly addresses Sina Weibo, which is the largest microblog platform with Chinese characteristics and proposes the hypothesis that information diffusion influences link creation and verifies the hypothesis based on real data analysis. We also detect an important feature from the information diffusion process, which is used to promote link prediction performance. Finally, the experimental results on Sina Weibo dataset have demonstrated the effectiveness of our methods.

Similarity-based link prediction in social networks using latent relationships between the users

Article Open access 18 November 2020

An information theoretic approach to link prediction in multiplex networks

Article Open access 24 June 2021

Evaluating link prediction by diffusion processes in dynamic networks

Article Open access 25 July 2019

Introduction

With the rapid development of networking sites (e.g., Twitter Facebook and Sina Weibo), online social networks have drawn substantial attention. In online social networks, users can not only make friends, but also be able to seek and share information. So, online social networks support social interaction and information diffusion among users.

In these online social networks, link prediction is a critical task that not only offers insights into the factors behind creation of individual social relationship but also plays an essential role in the whole network growth. Link prediction can be applied in many fields including user recommendation (e.g., Sina Weibo introduced friend suggestions function “people you might be interested in”), community detection, network growth modeling and so on.

Link prediction has attracted extensive research attention. Nowell and Kleinberg¹ proposed an array of methods for link prediction using network topology. They modeled a social network as a homogeneous graph, in which, each node represents a user and each link denotes a social relationship. Hasan et al.² and Brzozowski et al.³ considered link prediction as a classification problem, they separately compared the effective of different types of features. Random walk methods^4,5 and probability graphical methods^6,7 also have been studied respectively in link prediction task. Some other researchers attempted to explore time feature⁸, offline events⁹, place features¹⁰ and spatial proximity¹¹ to predict links.

Although there is so much interesting research related with link prediction, all above work ignores the impact of information diffusion process on the link creation and prediction. In online social networks, when a user observes that his neighbors share or repost a piece of information, the user will be influenced to consider whether to share or repost the information, which leads to information diffusion. Information diffusion allows users to receive or observe information that is beyond the scope of their social cycles. Furthermore, this phenomenon will influence the creation of new links. For example, there are there users u, v and w in Sina Weibo (a biggest microblog media in China, currently has 500 million users). At the beginning, user u follows user v, user v follows user w and user u does not follow user w. After user v reposts a piece of information released by user w, user u will observe the information. If user u finds that the observed information is valuable or if user u is interested in user w himself in real life, then user u may decide to follow user w.

In current studies, some work has noted the above problem. Zhou et al.¹², Myers et al.¹³, Weng et al.¹⁴ and Antoniades et al.¹⁵ analyzed the relation between information diffusion and link creation and Zhou et al. also proposed a visibility-based model for link prediction. Farajtabar et al.¹⁶ did not analyze the influence of information diffusion on link creation, but directly explored information diffusion process to predict links. However, all above studies are based on the USA’s Twitter or Meme. China has the largest number of internet users in this world and social media in China usually has strong Chinese characteristics. Especially, Sina Weibo is the biggest microblog media in China, current studies^17,18 have presented differences between Sina Weibo and Twitter in many dimensions (e.g., access behavior, syntactic content analysis, temporal behavior). Our work is the first to explore the influence of information diffusion on links formation from a quantitative perspective and to explore information diffusion for link prediction in Sina Weibo. Specifically, the contributions of this paper are as following:

Considering Sina Weibo, we introduce the hypothesis that information diffusion impacts link creation and use examples to explain rationality of our hypothesis.
We detect an important feature (observation number) from information diffusion process. Statistical analysis results in Sina Weibo dataset show that the detected feature is related to following relation creation, which verifies and supports our proposed hypothesis.
We combine diffusion feature with network topology features for link prediction and conduct various experiments on Sina Weibo dataset. Experiments results validate the diffusion feature is indeed helpful for promoting performance of link prediction.

The rest of this paper is organized as follows: Section 2 discusses related works; Section 3 introduces Sina Weibo dataset we collected; Section 4 introduces and verifies our hypothesis. Section 5 presents experimental results that validate the effectiveness of our methodology; Finally, Section 6 concludes.

Related Work

Link prediction is one of the core tasks in social networks research. The basic link prediction method is based on the local neighborhood structures, as surveyed by Liben-Nowell and Kleinberg^1,19. Clauset et al.²⁰ presented a general technique for inferring hierarchical structure from network data. Yin et al.²¹ analyzed link structures in Twitter and proposed a novel structure-based personalized link prediction model. Random walk method is a variation of PageRank. Backstrom et al.⁴ proposed a supervised random walk algorithm which combines information from the network structure with node and edge level attributes to estimate the strength of social links. Yin et al.⁵ modeled social networks as heterogeneous graphs and applied a random walk algorithm on them to calculate link proximity.

Hasan et al.² considered link prediction as a classification problem. They compared different classes of supervised learning algorithms based on proximity features, topological features and aggregated features. Brzozowskiet al.³ also compared the effective of different types of features including user preference, user behavior and network topology. Lichtenwalter et al.²² presented a classification framework which employed their PropFlow as a feature. Wang et al.⁶ proposed a local probabilistic model for link prediction that used Markov Random Field (MRF), an undirected graphical model. Kashima et al.⁷ proposed a probabilistic model of network evolution which can be used for link prediction. Erims et al.²³ coupled tensor factorization framework to predict links. Dong et al.²⁴ tried to find missing links by convex nonnegative matrix factorization with block detection.

Zhou et al.¹² found that exposing the same user multiple times does not necessarily increase the probability a new link will form and they also proposed a visibility-based model for link prediction. Myers et al.¹³, Weng et al.¹⁴ and Antoniades et al.¹⁵ also focused on the same problem, but got a different conclusion that that repeated exposure to contents posted by a user increases the probability of following that user. Farajtabar et al.¹⁶ proposed a model for simulating diffusion and network events form the co-evolutionary dynamics which can be used to predict links. However, all these studies are based on the USA’s Twitter or Meme, our work attempts to explore the influence of information diffusion on link creation in Sina Weibo with strong Chinese characteristics.

In recent years, the studies on link prediction also have evolved over various aspects. One of the main aspects among these is to consider the time factor, which can be named as time-aware link prediction^8,25,26. Besides, Leskovec et al.²⁷ employed a logistic regression model to predict positive and negative links in online social networks. Song et al.²⁸ explored the scalability of the proposed solutions for link prediction.

Dataset

Sina Weibo is the most popular microblog media in China, which contains more than 500 million users as of 2015. In Sina Weibo, users can follow other users. For example, user u follows user v, we say that user u is a follower of user v and user v is a followee of user u. If user u and user v both follow each other, we consider them mutual friends. We define followers(u) as the set of user u’s followers, define followee(u) as the set of user u’s followees and define friends(u) as the set of user u’s mutual friends. Sina Weibo enables users to post messages of up to 140 characters. Posting messages can contain text, pictures, videos, links and hashtag. Similar to Twitter, Sina Weibo allows users to repost or forward someone else’s messages to their followers.

The dataset we need should contain social network structure data and messages data released by users in the social network. Firstly, we select 114 seed users that are related to the internet field. Then we collect users followed by seed users and the follow relations among all these users. Secondly, we collect messages published or forwarded by all these users from 11/07/2011 to 11/28/2011. About each message, we collect id of the message, id of user posting the message, flag marking whether this message is reposted, id of reposted message and id of user posting the reposted message. Finally, we collected 30270 users, 7694408 relations, 6054000 messages with related information. The social network we collected can be modeled as a directed graph, nodes and edges in the graph correspond to users and follow relations in the social network, respectively. The statistics on the graph are summarized in Table 1.

Table 1 The statistics on the network graph in Sina Weibo dataset.

Full size table

Figure 1 shows the in-degree distribution and out-degree distribution of users in the social network. In Fig. 1, x-axis represents in-degree or out-degree of users, y-axis represent the count of users. Each node in Fig. 1 indicates the count of users with a specific in-degree or out-degree. The in-degree distribution and out-degree distribution both exhibit heavy-tailed distributions. According to our calculations, we find that 6.0% of users receive more in-links than the other 94.0% of users and 5.4% of users receive more out-links than the other 94.6% of users. We also find that 10% of users receive about 65.7% of in-links and 10% of users receive about 69.2% of out-links.

Information Diffusion and Link Prediction

In this section, we first introduce the hypothesis that information diffusion process impacts links creation, then use examples to explain rationality of our hypothesis. Next, we do statistics and analysis on collected Sina Weibo dataset to verify our hypothesis.

Users’ sharing or reposting behaviors create an information diffusion process that allow users to receive information from outside of their own social cycles. Our hypothesis is as follows: when one user receives or observes a piece of information released by an unrelated user, he may be interested in the information content or the user releasing the information, in that case, he may try to create a new social relation with the unrelated user. For two unrelated users, u and v, as user u observes more information that is released by user v, the probability that a new social relation from user u to user v will be created increases. We believe that the features detected from the information diffusion process will be helpful in predicting links.

Here, we take two examples to explain our hypothesis. Because of the copyright issues, we can not use real examples from Sina Weibo directly in this paper. Alternatively, we design and draw two examples presented in Fig. 2 following the format of Sina Weibo. The user names in examples are distributed by ourselves and do not correspond to any real user in Sina Weibo. Readers can find some similar real examples based on our examples in Sina Weibo.

We assume that there is a user u in Sina Weibo. Figure 2 presents two messages that user u observed on his homepage, which represent two situations in which user u may follow another unrelated user. From Fig. 2(a), we can notice that the message was released by “LoveNBA” and Tom, Jerry and Blake reposted the message. Thus, the message was spread to user u according to the line of “LoveNBA → Jerry → Tom → Blake → u”. This message is related with basketball and user u is also very interested in basketball. He liked this message, then he viewed the homepage of “LoveNBA” and found some other interesting messages. At last, he may decide to follow user “LoveNBA”. In Fig. 2(b), the message was released by “Larry” who is an expert on machine learning and was spread to user u according to the line of “Larry → Jack → Rain → Lucy → Tim → u”. In daily life of user u, he is always concerned about machine learning and Mr. Larry, but he do not know Larry has opened an account in Sina Weibo. Information diffusion makes him observe the account of “Larry”, then he thinks this account is important to him and may decide to follow user “Larry”.

Information diffusion process makes one user can observe other unrelated users. For two unrelated users, u and v, we assume D(u, v) as the number of user u observing user v. When one of user u’s followees reposts one message released by user v, the value of observation number D(u, v) will increase by 1. The value of observation number D(u, v) is equal to the total number of behaviors of reposting user v’s messages performed by all followees of user u. Observation number is an important feature detected from information diffusion process, so here it is called as a diffusion feature. We think observation number is related to links creation and prediction. For verifying our hypothesis, we analyze the relationship between observation number and follow probability based on Sina Weibo dataset.

Firstly, we make statistics of observation number of all user pairs in Sina Weibo dataset. We get statistic results that the observation number of 87.1% of user pairs is no more than 20 times and the observation number of 97.5% of user pairs is no more than 100 times.

Secondly, we count the number of user pairs where a follow relation is created and the number user pairs where no follow relation is created for each observation number. We thereby calculate the follow probability for each observation number. Based on statistics about observation number in first step, we present Figs 3 and 4 whose maximum observation numbers are set as 20 and 100 respectively. In Fig. 3, the X axis represents observation number and the Y axis represents follow probability. For representing global trend changing, we set granularity of X axis as 10 in Fig. 4. The Y axis represents the average follow probability of each 10 observation numbers. For example, the first column means that the average following probability between user pairs whose observation number is in the range of 1 to 10 is 0.014. From Figs 3 and 4, we can easily find that the probability of follow relation created is increasing with the increase of observation number. Comparing with similar studies in Twitter, our conclusion is contrary to the conclusion in¹², however, is same with conclusions in^13,14,15. We consider that Zhu et al.¹² only examined the relationship between how often a user v received a specific user @CARightToKnow and the average probability that v followed @CARightToKnow in return. However, we test all user pairs who might form a new relationship. This may lead that Zhu et al. got a biased result.

Above analysis shows that information diffusion process does promote positively relation creation. Based on above explanation and analysis, we can conclude that our hypothesis is reasonable. Because information diffusion is an important driver of social relations creation, we believe that the diffusion feature (observation number) is definitely helpful in link prediction task. The diffusion feature can be combined with some other features for promoting performance of link prediction.

Experiment

In this section, we combine the diffusion feature and topological features together for link prediction and conduct various experiments to compare combination methods with the methods using topological features. Experiment results on Sina Weibo dataset can verify whether the diffusion feature is helpful to improve link prediction results.

Experiment Setup

Prediction Setting

In our experiments, we use the Sina Weibo dataset described in section 3. We split the collected Sina Weibo dataset into training dataset and testing dataset. For each user in testing dataset, we remove half follow links and the prediction task is then to use the pruned networks and training data to find the missing links.

Comparison Methods

Topological features are the most commonly used features which are detected from network topology structure^1,19,20. In our experiment, we adopt three topological features from Sina Weibo network: mutual followers similarity, mutual followees similarity and mutual friends similarity (The definitions of followers, followees and friends can be found in section “Dataset”). Here, we take user u and user v as examples to explain three topological features. Mutual followers similarity between user u and user v is equal to . The numerator is the size of the overlap of the follower sets between u and v and the denominator is the size of the union of the follower sets between two users which is used to normalized the numerator (mutual followers count). Mutual followees similarity and mutual friends similarity between two users are equal to , separately. The numerators and denominators of these two features are similar to these of mutual followers similarity, so we do not repeat them any more in here. Because these three topological features are decimal, we also normalize the diffusion feature for combing different features together easily.

We compare the method combing diffusion feature and topological features with other methods using single feature or combine three topological features in link prediction task. If the combination method performs better, we can conclude that diffusion feature is a helpful feature to predict links. Specifically, Following lists the methods we evaluate and compare in our experiments.

Mutual followers similarity (method 1)
Mutual followees similarity (method 2)
Mutual friends similarity (method 3)
Diffusion feature (method 4)
Mutual followers similarity + Mutual followees similarity + Mutual friends similarity (method 5)
Mutual followers similarity + Mutual followees similarity + Mutual friends similarity + Diffusion feature (method 6)

In our experiment, we consider link prediction task as a ranking problem. In methods 1–4, we directly assume feature values as ranking scores. Because methods 5–6 make use of multiple features, so we need combine these features together to calculate ranking scores. Here, we adopt a linear combination method. We take method 6 as an example to explain, the calculation method of the ranking score is as following:

where sim1, sim2, sim3 and sim4 separately correspond to four features and w1, w2, w3 and w4 separately correspond to the weights of four features. In this paper, we adopt the analytic hierarchy process to calculate the weight of each feature. After getting ranking scores, we select top K links based on ranking scores as prediction results. We evaluate the performance of different methods in terms of Precision, Recall, F1-Measure at top K predicted links. The values of K are set to be 5, 10, 15, 20, 25 and 30 separately.

where TP stands for true positives, FN stands for false negatives, FP stands for false positive and TN stands for true negative. Under this presupposition, we denote TP as the number of links which were truly created by users and our prediction method gives the same predicting results, FP as the number of links which were not truly created by users but our prediction method predicts links will be created, TN as the number of links which were not truly created by users and our prediction method gives the same predicting results, FN as the number of links which were truly created by users but our prediction method predicts links will not be created.

Experiment Results

Figures 5, 6, 7 present the performances of these different methods by three different metrics on Sina Weibo dataset. We first analyze results of methods which make use of topological features. In the prediction task, the method 2 using mutual followees similarity performs better than the method 1 using mutual followers similarity, the method 3 using mutual friends similarity performs better than the method 2 using mutual followees similarity. Method 5, which combines three topological features, achieves better performance than do methods 1–3, each of which use a single topological feature.

The method 4 using diffusion feature can get similar performance with methods 1–3 using single topological feature. While the method 6 integrating diffusion feature and topological features together achieves best performance in our experiment. This result shows that diffusion feature is indeed a machine helpful feature in link perdition task.

To quantify the extent of fluctuations around the average, we also compute standard deviations and draw error bar for each plot in Figs 5, 6, 7. We can find that, error bars become shorter with the increase of K value in Fig. 5, however, become longer with the increase of K value in Figs 6 and 7. We also notice that, error bars of combination methods (method 5, 6) are longer than these of methods (methods 1–4) using a single feature at same K value. Because precision values become larger, Recall and F1-measure values become smaller, with the increase of K values. The evaluation metrics of combination methods are also bigger than these of methods using a single feature. So, we analyze that the changes of evaluation metric values may impact the sizes of error bars. Next, we compare two combination methods in terms of error bar and find that error bars of method 6 is shorter than these of method 5 at same K value. This means that the diffusion feature is helpful for getting a more smooth prediction results.

Here we should highlight that, the aid of combing diffusion feature with topological features in our experiment is to prove that diffusion feature can be used with other features together to promote prediction performance. This does not mean that diffusion feature only can be combined with topological features. In contrast, diffusion feature can be combined with any other features (e.g., user preference features, user behavior features) under any machine learning model.

Conclusions

In recent years, online social networks have undergone a significant growth and attracted much attention. In these online social networks, link prediction is a critical task which has widespread application scenarios. In this paper, we mainly focus on Sina Weibo and point that information diffusion process affects new links creation. We detect a new feature from diffusion process and combine it with topological features for link prediction. Experimental results on Sina Weibo dataset show that new method performs better than the methods which only use topological features.

In the future, we will try to adopt classifier models to combine the diffusion feature with other features and also will explore more helpful features for link prediction. Because the limitation of Sina Weibo APIs, our current dataset does not contain time information of link creation, so our current work is to predict whether links will be created but not to predict when link will be create. We will attempt to collect time-related datasets and then explore the problem that how to make use of information diffusion to predict time-related links.

Additional Information

How to cite this article: Li, D. et al. Exploiting Information Diffusion Feature for Link Prediction in Sina Weibo. Sci. Rep. 6, 20058; doi: 10.1038/srep20058 (2016).

References

Liben-Nowell, D. & Kleinberg, J. The link prediction problem for social networks. Proc. 12ed ACM international Conference on Information and Knowledge Management, New Orleans, Louisiana, USA. New York: ACM Press, 556–559 (2003, November 2–8).
AlHasan, M. et al. Link prediction using supervised learning. SDM'06: Workshop on Link Analysis, Counter-terrorism and Security, Bethesda, Maryland, USA. Philadelphia, USA: Society for Industrial and Applied Mathematics, 19104–2688 (2006, April 22).
Brzozowski, M. & Romero, D. Who Should I Follow? Recommending People in Directed Social Networks. Proc. 5th International AAAI Conference on Weblogs and Social Media, Barcelona, Spain. Menlo Park: AAAI Press 57–66 (2011, July 17–21).
Backstrom, L. & Leskovec, J. Supervised random walks: predicting and recommending links in social networks. Proc. 4th ACM international conference on Web search and data mining, Hong Kong, China. New York: ACM Press, 635–644 (2011, February 9–12).
Yin, Z., Gupta, M. & Weninger, T. LINKREC: a unified framework for link recommendation with user attributes and graph structure. Proc. 19th international conference on World wide web, Raleigh, North Carolina, USA. New York: ACM Press, 1211–1212 (2010, April 26–30).
Wang. C., Satuluri, V. & Parthasarathy, S. Local Probabilistic Models for Link Prediction. Proc. 7th IEEE International Conference on Data Mining, Omaha NE, USA. Boston: IEEE Computer Society, 322–331 (2007, October 28–31).
Wang, C., Satuluri, V. & Parthasarathy, S. A Parameterized Probabilistic Model of Network Evolution for Supervised Link Prediction. Proc. 6th IEEE International Conference on Data Mining, Hong Kong, China. Boston: IEEE Computer Society, 1163–1168 (2006, December 18–22).
Tylenda, T., Angelova, R. & Bedathur, S. Towards time-aware link prediction in evolving social networks. Proc. 3rd Workshop on Social Network Mining and Analysis, Paris, France. New York: ACM Press (2009, June 28-July 1).
Gomez Rodriguez, M. & Rogati, M. Bridging offline and online social graph dynamics. Proc. 21st ACM international conference on Information and knowledge management, Maui, Hawaii. New York: ACM Press, 2447–2450 (2012, October 29-November 2).
Scellato, S., Noulas, A., Mascolo CBrzozowski, M . & Romero, D. Exploiting place features in link prediction on location-based social networks. Proc. 17th ACM SIGKDD international conference on Knowledge discovery and data mining, San Diego, CA, USA. New York: ACM Press, 1046–1054 (2011, August 21–24).
Wang, D. et al. Human mobility, social ties and link prediction. Proc. 17th ACM SIGKDD international conference on Knowledge discovery and data mining, San Diego, CA, USA. New York: ACM Press, 1100–1108 (2011, August 21–24).
Zhu, L. & Lerman, K. A visibility-based model for link prediction in social media. Proc. 6th Sixth ASE International Conference on Social Computing, Stanford, CA, USA. NC, USA: ASE (2013, May 27–31).
Myers, S. A. & Leskovec, J. The bursty dynamics of the twitter information network. Proc. 23rd international conference on World Wide Web, Seoul, Korea. New York: ACM Press, 913–924 (2014, April 7–11).
Weng, L. et al. The role of information diffusion in the evolution of social networks. Proc. 19th ACM SIGKDD international conference on Knowledge discovery and data mining, Chicago, IL, USA. New York: ACM Press, 356–364 (2013, August 11–14).
Antoniades, D. & Dovrolis, C. Co-evolutionary dynamics in social networks: A case study of Twitter. Proc. 10th International Conference on Signal-Image Technology and Internet-Based Systems. Marrakech, Morroco. Boston: IEEE Computer Society, 361–368 (2014, November 23–27).
Farajtabar, M. et al. Co-evolutionary Dynamics of Information Diffusion and Network Structure. Proc. 23rd international conference on World Wide Web, Firenze, Italy. New York: ACM Press, 619–620 (2015, May 18–22).
Chen, S., Zhang, H., Lin, M. & Lv, S. Comparison of microblogging service between Sina Weibo and Twitter. Proc. 3rd International Conference on Computer Science and Network Technology, Dalian, China. Boston: IEEE Computer Society, 2259–2263 (2013, October 12–13).
Gao, Q., Abel, F., Houben, G. J. & Yu, Y. A comparative study of users’ microblogging behavior on Sina Weibo and Twitter. Proc. 20th International Conference on User Modeling, Adaptation and Personalization, Montreal, Canada. Berlin: Springer, 88–101 (2012, July 16–20).
Liben-Nowell, D. & Kleinberg, J. The link-prediction problem for social networks. J. Am. Soc. Inf. Sci. Technol. 58(7), 1019–1031 (2007).
Article Google Scholar
Clauset, A., Moore, C. & Newman, M. Hierarchical structure and the prediction of missing links in networks. Nature. 453(7191), 98–101 (2008).
Article ADS CAS PubMed Google Scholar
Yin, D., Hong, L. & Davison, B. Structural link analysis and prediction in microblogs. Proc. 20th ACM international conference on Information and knowledge management, Glasgow, Scotland, UK. New York: ACM Press, 1163–1168 (2011, October 24–28).
Lichtenwalter, R., Lussier, J . & Chawla, N. New perspectives and methods in link prediction. Proc. 16th ACM SIGKDD international conference on Knowledge discovery and data mining, Washington, DC, USA. New York: ACM Press, 243–252 (2010, July 24–28).
Ermis, B., Acar, E. & Cemgil, A. T. Link prediction in heterogeneous data via generalized coupled tensor factorization. Data Mining and Knowledge Discovery. 29(1), 203–236 (2015).
Article MathSciNet Google Scholar
Dong, E., Li, J. & Xie, Z. Link Prediction via Convex Nonnegative Matrix Factorization on Multiscale Blocks. Journal of Applied Mathematics (2014).
Ahmed, A. & Xing, E. P. Recovering time-varying network of dependencies in Social and biological studies. PNAS. 106(29), 11878–11883 (2006).
Article ADS Google Scholar
Bliss, C. A., Frank, M. R., Danforth, C. M. & Dodds, P. S. An evolutionary algorithm approach to link prediction in dynamic social networks. Journal of Computational Science. 5(5), 750–764 (2014).
Article MathSciNet Google Scholar
Leskovec, J., Huttenlocher, D. & Kleinberg J. Predicting positive and negative links in online social networks. Proc. 19th international conference on World wide web, Raleigh, North Carolina USA. New York: ACM Press, 641–650 (2010, April 26–30).
Song, H. H., Cho, T. W., Dave, V., Zhang, Y. & Qiu, L. Scalable proximity Estimation and Link Prediction in Online Social Networks. Proc. 9th ACM SIGCOMM conference on Internet measurement conference, Chicago, IL, USA. New York: ACM Press, 322–335 (2009, November 4–6).

Download references

Acknowledgements

This work is supported by the Natural Science Foundation of China (No. 61173074) and the major science and technology project of Shandong Province (No. 2015ZDXX0201B02).

Author information

Authors and Affiliations

School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China
Dong Li, Yongchao Zhang, Zhiming Xu & Sheng Li
School of Computer Science and Technology, Harbin Institute of Technology (Weihai), Weihai, Shandong, China
Dong Li & Dianhui Chu

Authors

Dong Li
View author publications
You can also search for this author in PubMed Google Scholar
Yongchao Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zhiming Xu
View author publications
You can also search for this author in PubMed Google Scholar
Dianhui Chu
View author publications
You can also search for this author in PubMed Google Scholar
Sheng Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.L., Y.Z., Z.X. and S.L. designed the research; D.L. and Y.Z. performed the experiments; D.L. analyzed the data, prepared the figures and wrote the paper; D.C. and D.L. performed new experiments and modified the paper in the revision. All authors reviewed the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Li, D., Zhang, Y., Xu, Z. et al. Exploiting Information Diffusion Feature for Link Prediction in Sina Weibo. Sci Rep 6, 20058 (2016). https://doi.org/10.1038/srep20058

Download citation

Received: 17 August 2015
Accepted: 22 October 2015
Published: 28 January 2016
DOI: https://doi.org/10.1038/srep20058

This article is cited by

TRTCD: trust route prediction based on trusted community detection
- Elaheh Golzardi
- Amir Sheikhahmadi
- Alireza Abdollahpouri
Multimedia Tools and Applications (2023)
An automatic crisis information recognition model based on BP neural networks
- Li Yang
- Huihui Guo
- Jiaxue Wang
Journal of Ambient Intelligence and Humanized Computing (2023)
Evaluating link prediction by diffusion processes in dynamic networks
- Didier A. Vega-Oliveros
- Liang Zhao
- Lilian Berton
Scientific Reports (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.