Inferring microRNA-disease association by hybrid recommendation algorithm and unbalanced bi-random walk on heterogeneous network

Yu, Dong-Ling; Ma, Yuan-Lin; Yu, Zu-Guo

doi:10.1038/s41598-019-39226-x


Article
Open access
Published: 21 February 2019


Scientific Reports volume 9, Article number: 2474 (2019)


Abstract

A growing body of research indicates that microRNAs (miRNAs) play indispensable roles in the pathogenesis of diseases. Detecting miRNA-disease associations by experimental techniques is expensive and time-consuming. Hence, it is important to develop reliable and accurate computational methods for exploring potential disease-related miRNAs. In this work, we develop a novel method (BRWHNHA) to uncover potential miRNAs associated with diseases based on a hybrid recommendation algorithm and unbalanced bi-random walk. We first integrate the Gaussian interaction profile kernel similarity into the miRNA functional similarity network and the disease semantic similarity network. Then we calculate the transition probability matrix of the bipartite network by using the hybrid recommendation algorithm. Finally, we adopt unbalanced bi-random walk on the heterogeneous network to infer undiscovered miRNA-disease relationships. We tested BRWHNHA on 22 diseases with five-fold cross-validation, where it achieved reliable performance with an average AUC (area under the ROC curve) of 0.857 and per-disease AUC values ranging from 0.807 to 0.924. BRWHNHA thus significantly improves the performance of inferring potential miRNA-disease associations compared with previous methods. Moreover, the case studies on lung neoplasms and prostate neoplasms also illustrate that BRWHNHA is superior to previous prediction methods and is more advantageous in exploring potential disease-related miRNAs. All source code can be downloaded from https://github.com/myl446/BRWHNHA.


Introduction

MicroRNAs (miRNAs) are a class of short non-coding RNAs (21–25 nt)^1,2,3. As important transcriptional regulatory factors, miRNAs are widely involved in the biological processes of disease-related gene regulation, which are closely related to human multi-gene diseases^4,5,6,7. Increasing evidence has demonstrated that miRNAs play a critical role in the emergence and development of diseases^8,9. Hence, revealing disease-associated miRNAs is an efficient way to accelerate the understanding of disease pathology at the molecular level^10,11,12,13.

As detecting miRNA-disease associations by experimental techniques is expensive and time-consuming, many computational methods for predicting the relationships between miRNAs and diseases have been proposed. For example, Jiang et al.¹⁴ proposed a computational approach that infers potential miRNA-disease associations using the hypergeometric distribution; for a given disease, they prioritized all human miRNAs. In addition, Jiang et al.¹⁵ further improved the calculation of the concordance score between a miRNA and a given disease. Chen et al.¹⁶ first presented a computational prediction method based on global network similarity, named RWRMDA, which predicts novel human miRNA-disease associations by random walk on the miRNA functional similarity network. Then, Xuan et al.¹⁷ developed a reliable prediction method based on random walk. They assigned different weights to the transition matrix of miRNAs depending on whether the miRNAs are associated with the given disease, in order to exploit the prior information of nodes and the various ranges of topologies. They also extended the walk to a miRNA-disease bipartite network to predict candidate miRNAs, especially for diseases without any known related miRNAs. Furthermore, Chen et al.¹⁸ developed a prediction method named WBSMDA, which infers miRNA-disease associations by integrating miRNA functional similarity, disease semantic similarity, the known miRNA-disease associations, and the Gaussian interaction profile kernel similarity into a heterogeneous network. WBSMDA can deal not only with new diseases without any known associated miRNAs, but also with new miRNAs without any known associated diseases. In 2016, Zeng et al.¹⁹ reviewed methods for predicting disease-miRNA associations based on biological interaction networks. After comparing these methods in detail, they pointed out the current challenges in predicting disease-miRNA associations.

In 2017, Liu et al.²⁰ further proposed a method to explore potential disease-related miRNAs by integrating multiple biological data sources. In recent years, recommendation system algorithms have been successfully applied in many fields. Chen et al.²¹ presented a hybrid approach for miRNA-disease association prediction (HAMDA) based on hybrid recommendation methods, which combines available biological data with network-based inference. However, like the methods mentioned above, it prioritizes miRNAs using only the same-layer neighbor nodes of miRNAs and diseases, rather than making use of the different structural and topological characteristics among the subgraphs of heterogeneous networks. Luo et al.²² proposed an effective prediction model that uses unbalanced bi-random walk to improve prediction performance. They fully exploited the different topologies and structures of the miRNA similarity network and the disease similarity network. This method improved prediction performance, but ignored the prior information and the topological structure of the bipartite network. In 2018, Zeng et al.²³ found that heterogeneous miRNA-disease networks yield better predictions than single disease similarity networks, miRNA similarity networks, or known disease-gene association networks, and therefore adopted a structural perturbation method to improve the prediction accuracy of miRNA-disease associations.

We believe that the topological and structural features of the heterogeneous network contain important information that is useful for discovering more reliable miRNA-disease associations. In the present work, we develop an efficient computational method based on a hybrid recommendation approach and unbalanced bi-random walk, called BRWHNHA (Bi-random Walk on Heterogeneous Network based on Hybrid Approach). It exploits the characteristics of nodes and the topological structure of the known miRNA-disease associations by using the hybrid recommendation approach, and takes advantage of the different topological structures of the miRNA and disease similarity networks by adopting bi-random walk on the heterogeneous network. The hybrid recommendation algorithm adds virtual edges to the heterogeneous network by calculating the transition matrix of the bipartite network, so that the unbalanced bi-random walk on the new heterogeneous network can find potential disease-related miRNAs more efficiently. To validate the prediction ability of BRWHNHA, we adopted five-fold cross-validation and compared BRWHNHA with MIDPE¹⁷, HAMDA²¹, and BRWH²². The average AUC of BRWHNHA is 2.13%, 0.69%, and 2.20% higher than those of the three methods, respectively. In the case studies on lung neoplasms and prostatic neoplasms, 49 and 46 of the top 50 predicted associations, respectively, were confirmed. This further demonstrates the ability of BRWHNHA to discover potential disease-associated miRNAs.

Results

To evaluate the effectiveness of BRWHNHA in exploring undiscovered associations between miRNAs and diseases, we compared BRWHNHA with MIDPE¹⁷, HAMDA²¹, and BRWH²² by five-fold cross-validation, repeated 100 times, on the dataset obtained by Luo and Xiao²². For a given disease, we randomly divided the known related miRNAs into five subsets of equal size. In each round, we used one subset as the testing set and the other four subsets as the training set. After five rounds, we calculated the average AUC value. To reduce false positives, we recalculated the miRNA similarity and obtained a brand-new similarity matrix in each round of prediction. Then we calculated the probability of association between the given disease and each miRNA by BRWHNHA. Finally, all candidate miRNAs were ranked by association probability. The higher the miRNAs in the testing set were ranked, the better the performance. Because most diseases have only a few experimentally confirmed miRNA associations, the performance of prediction methods cannot be accurately evaluated on them. Hence, following Luo and Xiao²², we tested only the 22 diseases associated with at least 60 miRNAs. We show precision-recall curves only for breast neoplasms and lung neoplasms. In addition, we analyzed the effect of the parameters on the performance of BRWHNHA.
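The per-disease evaluation described above can be sketched roughly as follows. This is a minimal illustration, not the authors' released code: the function names `five_fold_splits` and `rank_auc` are hypothetical, and the AUC is computed as the fraction of (held-out miRNA, candidate miRNA) pairs in which the held-out miRNA receives the higher score, with ties counted as one half.

```python
import numpy as np

def five_fold_splits(known_idx, n_folds=5, seed=0):
    """Randomly partition the known miRNA indices of one disease into folds."""
    rng = np.random.default_rng(seed)
    shuffled = rng.permutation(np.asarray(known_idx))
    return np.array_split(shuffled, n_folds)

def rank_auc(scores, test_idx, candidate_idx):
    """Probability that a held-out miRNA outscores an unlabeled candidate.

    scores        : 1-D array of predicted association scores, one per miRNA
    test_idx      : indices of held-out (known but hidden) miRNAs
    candidate_idx : indices of miRNAs with no known association
    """
    pos = scores[test_idx][:, None]
    neg = scores[candidate_idx][None, :]
    # Each pair contributes 1 if the held-out miRNA ranks higher, 0.5 on a tie.
    return float(((pos > neg) + 0.5 * (pos == neg)).mean())
```

In a full run, the scores would be recomputed for every fold after hiding that fold's associations, and the per-fold AUC values averaged over the five rounds and the repeated random splits.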

Performance evaluation

The novelty of BRWHNHA is that the transition probability matrix of the bipartite network is calculated by the hybrid recommendation algorithm before the bi-random walk on the heterogeneous network is carried out. Without the hybrid recommendation algorithm, the average AUC is 83.55%, which is 2.14 percentage points lower than with it. Therefore, it is important to construct the transition probability matrix by the hybrid recommendation algorithm: the prediction accuracy is improved by exploiting the prior information and topological structure of the bipartite network. The same heterogeneous network was used for MIDPE¹⁷, HAMDA²¹ and BRWH²². The best parameters reported in the original papers were adopted: α = 0.9 and γ = 0.8 for MIDPE; σ = 0.7 and ρ = 0.8 for HAMDA; and λ = 0.6, α = 0.4, r = 2 and l = 1 for BRWH.

As illustrated in Table 1, the average AUC values of MIDPE, HAMDA, BRWH and BRWHNHA over the 22 diseases are 83.55%, 85.00%, 83.49% and 85.69%, respectively. BRWHNHA performed best, with an average AUC 2.13%, 0.69% and 2.20% higher than those of the other three methods. Moreover, BRWHNHA is superior to MIDPE and BRWH for all 22 diseases. Although HAMDA achieves a higher AUC than BRWHNHA for 7 of the 22 diseases, BRWHNHA obtains better performance for most diseases. Since HAMDA repeatedly uses the known miRNA-disease association data in the measurement of miRNA similarity, it may overestimate the results. The ROC curves of BRWHNHA and the other three methods corresponding to the maximum AUC value over the 100 repetitions of five-fold cross-validation are shown in Supplementary Fig. S1 in the Additional file.

Table 1 Predicting outcomes for MIDPE, HAMDA, BRWH and BRWHNHA by the five-fold cross-validation.


In Fig. 1, we compare BRWHNHA with the other three methods using the precision-recall curves of breast neoplasms and lung neoplasms based on five-fold cross-validation. Each precision-recall curve was obtained by measuring recall and precision at the top k positions (k = 10, 20, …, 100). The results show that our method achieves the highest precision and recall in the top 20. Moreover, as k increases, the precision of BRWHNHA decreases while the recall increases. This suggests that the associations ranked in the top positions have a higher probability of being true miRNA-disease associations. We also assessed the statistical significance of the difference in predictive ability between BRWHNHA and the other three methods by paired t-tests. The P-values are listed in Table 2. BRWHNHA achieves better performance than MIDPE, HAMDA and BRWH at the significance level of 0.05.
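As a small illustration of how precision and recall at the top k positions can be measured from a ranked candidate list, consider the following sketch. The function name `precision_recall_at_k` and its arguments are illustrative assumptions rather than part of the published implementation.

```python
import numpy as np

def precision_recall_at_k(scores, positives, ks):
    """Return (k, precision, recall) for each cutoff k.

    scores    : 1-D array of predicted scores, one per candidate miRNA
    positives : indices of the held-out true miRNAs
    ks        : iterable of cutoffs, e.g. range(10, 101, 10)
    """
    order = np.argsort(-scores, kind="stable")   # best-scored candidate first
    hits = np.cumsum(np.isin(order, positives))  # true miRNAs found so far
    n_pos = len(positives)
    return [(k, hits[k - 1] / k, hits[k - 1] / n_pos) for k in ks]
```

Because the number of hits can only grow with k while the denominator of precision grows by one at every step, recall is non-decreasing in k and precision tends to fall once most true miRNAs have been retrieved, which matches the trend described above.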

Table 2 Pairwise comparison between BRWHNHA and another method by paired t-test on the AUC of prediction.


We also compared our method with SPM on the dataset used by Zeng et al.²³. In five-fold cross-validation on five subsets of equal size, our method performs slightly better than SPM in most cases (comparison results are not shown here).

Effect of parameters in BRWHNHA

Four parameters, λ, α, r and l, are explored in our method. The parameter λ is the hybridization parameter that mediates between the two resource distribution processes, HeatS and ProbS, and the parameter α controls the consistency between the predicted candidate miRNA-disease associations and the known associations. The parameters r and l are the maximal numbers of random walk steps in the miRNA similarity network and the disease similarity network, respectively. We varied λ and α from 0 to 1 in increments of 0.1, and r and l from 0 to 5 steps in increments of 1. Then, we calculated the average AUC in the framework of five-fold cross-validation. Table 3 shows the effects of λ, α, r and l on the cross-validation results on the miRNA-disease association dataset. BRWHNHA achieves the best performance when λ = 0.6, α = 0.4, r = 2 and l = 1.
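The parameter sweep described above amounts to an exhaustive grid search. A minimal sketch is given below, assuming a user-supplied callable `evaluate(lam, alpha, r, l)` that returns the average cross-validated AUC; both `grid_search` and `evaluate` are hypothetical names, and nothing here reproduces the values in Table 3.

```python
import itertools
import numpy as np

def grid_search(evaluate, lambdas, alphas, rs, ls):
    """Return the parameter tuple (lam, alpha, r, l) with the highest score."""
    best_params, best_score = None, -np.inf
    for params in itertools.product(lambdas, alphas, rs, ls):
        score = evaluate(*params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score

# Grids matching the ranges stated in the text.
lambdas = np.round(np.arange(0.0, 1.01, 0.1), 1)  # 0.0, 0.1, ..., 1.0
alphas = np.round(np.arange(0.0, 1.01, 0.1), 1)   # 0.0, 0.1, ..., 1.0
steps = range(0, 6)                                # 0, 1, ..., 5 walk steps
```

With these grids the search evaluates 11 × 11 × 6 × 6 = 4356 parameter combinations, each requiring a full cross-validation run, which is one reason the number of parameters is noted as a limitation of the method.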

Table 3 Effects of parameters λ, α, r, l on prediction performance of BRWHNHA.


Case study

To further validate the ability of BRWHNHA to discover potential associations between miRNAs and diseases, we conducted two case studies, on lung neoplasms and prostatic neoplasms. All known miRNA-disease associations released in June 2014 were regarded as the training set, and all other miRNA-disease pairs formed the set of candidate associations. The prediction results for lung neoplasms and prostatic neoplasms were checked against the relevant literature and two important public databases: dbDEMC²⁴ and MiR2Disease²⁵.

Lung cancer is one of the malignant tumors with the highest morbidity and mortality, and it is among the greatest threats to human health and life. Over the past 50 years, many countries have reported significant increases in the incidence and mortality of lung cancer. The top 50 predicted miRNAs associated with lung cancer are shown in Table 4. Among the top 20 and top 50 potential lung neoplasm-associated miRNAs, 20 and 49, respectively, were confirmed by dbDEMC, MiR2Disease or the literature. Although no database or literature directly confirms the association of hsa-mir-200 with lung neoplasms, the mir-200 family, which includes five members (miRNA-200a, miRNA-200b, miRNA-200c, miRNA-429 and miRNA-141), is associated with lung neoplasms in dbDEMC, so we have reason to believe that it is related to the disease. In addition, we list the potential miRNAs ranked from 51 to 100 in Supplementary Table S2 (Additional file 1).

Table 4 The first 50 potential miRNAs associated with lung neoplasms predicted by BRWHNHA.


Prostate neoplasms are an important class of malignant tumors in male patients. There are usually no clinical symptoms in the early stage, and most patients admitted with prostate neoplasms are currently in the late stage²⁶. Therefore, early diagnosis is an urgent problem. Much evidence has confirmed a link between miRNAs and prostate neoplasms, and regulating the expression of related miRNAs could be therapeutically useful for the treatment of prostate neoplasms^27,28. In the case study for prostate neoplasms, 18 of the top 20 and 46 of the top 50 predicted miRNAs were verified by dbDEMC, MiR2Disease or the literature (shown in Table 5). The remaining four, hsa-mir-302f, hsa-mir-1915, hsa-mir-4257 and hsa-mir-1286, are not supported by dbDEMC, MiR2Disease or the literature. We also list the potential miRNAs ranked from 51 to 100 in Supplementary Table S3 (Additional file 1).

Table 5 The first 50 potential miRNAs associated with prostate neoplasms predicted by BRWHNHA.


Conclusion

Taking full account of the different topological and structural characteristics of the heterogeneous network is a challenging and meaningful task in prioritizing potential disease-related miRNAs. In this paper, we first adopted an effective measurement, which is also suitable for miRNAs and diseases without known miRNA-disease associations, to estimate the similarity of miRNAs and of diseases. Then, we presented the BRWHNHA method, based on a hybrid recommendation algorithm and unbalanced bi-random walk, to predict potential disease-associated miRNAs. We made full use of the prior information and topological structure of the bipartite network by calculating its transition probability matrix with the hybrid recommendation algorithm. In addition, we exploited the topologies and structures of the miRNA similarity network (MMS) and the disease similarity network (DDS) at different levels by adopting unbalanced bi-random walk on the heterogeneous network. To assess the performance of BRWHNHA, we compared it with MIDPE, HAMDA and BRWH on the dataset obtained by Luo and Xiao²². The results indicate that BRWHNHA has the best prediction ability among these methods: its average AUC was 2.13%, 0.69% and 2.20% higher than those of MIDPE, HAMDA and BRWH, respectively. Furthermore, in the case studies on lung neoplasms and prostatic neoplasms, 49 and 46 of the top 50 predicted miRNA-disease associations, respectively, were confirmed by recently published literature and the dbDEMC and MiR2Disease databases. The results show that BRWHNHA can be used as an effective method to explore potential associations between miRNAs and diseases.

Nevertheless, BRWHNHA has a limitation that should be addressed in future work: many parameters need to be set in this method, so a more effective approach is needed to find the optimal parameters.

Methods

The measurement of disease semantic similarity and miRNA functional similarity

As described in category C of the MeSH descriptors, the relationships among diseases can be represented as a directed acyclic graph (DAG). A disease K can be represented as DAG(K) = (K, T(K), E(K))²², where T(K) is the set containing K and all of its ancestor nodes, and E(K) is the set of all direct edges from parent nodes to child nodes in the subgraph, as shown in Fig. 2. For two diseases d_i and d_j, we use the disease semantic similarity measurement DSS(d_i, d_j) defined by Luo and Xiao²².

Based on the assumption that miRNAs with similar functions are more likely to be associated with similar diseases and vice versa^29,30, we adopt the miRNA functional similarity measurement MFS(m_i, m_j) for two miRNAs m_i and m_j, which was proposed by Wang et al.³¹.

Gaussian interaction profile kernel similarity

The Gaussian interaction profile kernel similarity is also based on the assumption that miRNAs with similar functions are more likely to be associated with similar diseases and vice versa. Let $A=(a_{ij})_{n_m\times n_d}$ be the adjacency matrix of the miRNA-disease association network MD, where n_m and n_d denote the numbers of miRNAs and diseases, respectively. The Gaussian interaction profile kernel similarity is calculated from the known miRNA-disease associations²¹. Let the binary vector IP(d_i) indicate whether disease d_i is associated with each miRNA; in other words, IP(d_i) is the ith column of A. The Gaussian interaction profile kernel similarity between two diseases d_i and d_j is calculated as:

$$DGS(d_i,d_j)=\exp\left(-r_d\,\parallel IP(d_i)-IP(d_j)\parallel^2\right)$$

(1)

where $r_d=r'_d/\left(\tfrac{1}{n_d}\sum_{i=1}^{n_d}\parallel IP(d_i)\parallel^2\right)$ is the kernel bandwidth and $r'_d$ is a new bandwidth parameter used to normalize r_d (e.g., $r'_d=1$, as in refs^32,33).

For two miRNAs m_i and m_j, with IP(m_i) denoting the ith row of A, the Gaussian interaction profile kernel similarity is calculated as:

$$MGS(m_i,m_j)=\exp\left(-r_m\,\parallel IP(m_i)-IP(m_j)\parallel^2\right)$$

(2)

where $r_m=r'_m/\left(\tfrac{1}{n_m}\sum_{i=1}^{n_m}\parallel IP(m_i)\parallel^2\right)$ is the kernel bandwidth and $r'_m$ is a new bandwidth parameter used to normalize r_m.
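As a rough illustration of Eqs (1) and (2), the following NumPy sketch computes the Gaussian interaction profile kernel for a set of interaction profiles stored as rows. The function name `gip_kernel` and the argument `bandwidth_prime` (standing for r'_d or r'_m) are illustrative assumptions and are not taken from the published code.

```python
import numpy as np

def gip_kernel(profiles, bandwidth_prime=1.0):
    """Gaussian interaction profile kernel between the rows of `profiles`.

    profiles        : 2-D binary array, one interaction profile per row
    bandwidth_prime : the r' parameter; the effective bandwidth is r' divided
                      by the mean squared norm of the profiles
    """
    profiles = np.asarray(profiles, dtype=float)
    sq_norms = (profiles ** 2).sum(axis=1)
    mean_sq = sq_norms.mean()
    # If every profile is empty, all distances are zero and the kernel is all ones.
    bandwidth = bandwidth_prime / mean_sq if mean_sq > 0 else 0.0
    # Squared Euclidean distance between every pair of profiles.
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * profiles @ profiles.T
    sq_dists = np.maximum(sq_dists, 0.0)  # guard against tiny negative round-off
    return np.exp(-bandwidth * sq_dists)
```

With A of shape n_m × n_d as defined above, the miRNA kernel would be obtained as `gip_kernel(A)` (rows of A) and the disease kernel as `gip_kernel(A.T)` (columns of A).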

Integrated similarity for miRNAs and diseases

A new disease similarity matrix can be obtained by integrating the disease semantic similarity and the disease Gaussian interaction profile kernel similarity²¹. For two diseases d_i and d_j, the integrated disease similarity is defined as follows:

$$D_S(d_i,d_j)=\begin{cases}DSS(d_i,d_j), & DSS(d_i,d_j)\ne 0\\ DGS(d_i,d_j), & \text{otherwise}\end{cases}$$

(3)

The integrated similarity between miRNAs m_i and m_j can be defined as follows:

$$M_S(m_i,m_j)=\begin{cases}MFS(m_i,m_j), & MFS(m_i,m_j)\ne 0\\ MGS(m_i,m_j), & \text{otherwise}\end{cases}$$

(4)
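Eqs (3) and (4) apply the same element-wise rule: keep the semantic or functional similarity where it is non-zero, and fall back to the Gaussian kernel value otherwise. A minimal sketch, with the hypothetical helper name `integrate_similarity`, could look like this:

```python
import numpy as np

def integrate_similarity(primary, gaussian):
    """Element-wise fallback: use `primary` where non-zero, else `gaussian`.

    primary  : DSS (diseases) or MFS (miRNAs) similarity matrix
    gaussian : DGS or MGS matrix of the same shape
    """
    primary = np.asarray(primary, dtype=float)
    gaussian = np.asarray(gaussian, dtype=float)
    return np.where(primary != 0, primary, gaussian)
```

The effect is to fill in the sparse semantic or functional similarity matrix so that pairs lacking such information still receive a similarity score.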

Hybrid recommendation algorithm

A bipartite network MD(M, D, E) is constructed from the experimentally confirmed miRNA-disease associations, where M represents all miRNA nodes, D represents all disease nodes, and E represents all edges in MD. The adjacency matrix A is defined as follows:

$$a_{ij}=\begin{cases}1, & m_i\ \text{is associated with}\ d_j\\ 0, & \text{otherwise}\end{cases}$$

(5)

Zhou et al.³⁴ proposed the hybrid recommendation algorithm, which combines the heat spreading (HeatS) algorithm and the probabilistic spreading (ProbS) algorithm through a hybridization parameter λ that balances the diversity of HeatS and the accuracy of ProbS. For a given disease, HeatS and ProbS both work by assigning each miRNA an initial resource, represented by the vector f (where f_i is the resource possessed by miRNA m_i), which is redistributed through the transformation $f^{\prime} =W\,\ast \,f$. A miRNA that possesses more resource after redistribution is more likely to be associated with the given disease. Fig. 3 illustrates the processes of the HeatS and ProbS algorithms.

HeatS is defined as follows:

$${W}_{ij}^{H}=\frac{1}{k({m}_{i})}\,{\sum }_{l=1}^{{n}_{d}}\,\frac{{a}_{il}{a}_{jl}}{k({d}_{l})}$$

(6)

$$f^{\prime} ={W}^{H}\ast f$$

(7)

ProbS is defined as follows:

$${W}_{ij}^{P}=\frac{1}{k({m}_{j})}\,{\sum }_{l=1}^{{n}_{d}}\,\frac{{a}_{il}{a}_{jl}}{k({d}_{l})}$$

(8)

$$f^{\prime} ={W}^{P}\ast f$$

(9)

Hybrid recommendation algorithm is defined as follows:

$${W}_{ij}^{H+P}=\frac{1}{k{({m}_{i})}^{1-\lambda }k{({m}_{j})}^{\lambda }}\,{\sum }_{l=1}^{{n}_{d}}\,\frac{{a}_{il}{a}_{jl}}{k({d}_{l})}$$

(10)

$$f^{\prime} ={W}^{H+P}\ast f$$

(11)

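Here k(x) denotes the degree of node x in the bipartite graph MD(M, D, E). Setting λ = 0 in Eq. (10) recovers HeatS (Eq. (6)), and setting λ = 1 recovers ProbS (Eq. (8)).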
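The hybrid weight matrix of Eq. (10) can be computed with a few matrix operations. The sketch below is one possible NumPy formulation under the definitions above; the function name `hybrid_weights` is hypothetical, and entries involving miRNAs or diseases of degree zero are set to zero, which is an assumption made here to avoid division by zero.

```python
import numpy as np

def hybrid_weights(A, lam):
    """Hybrid HeatS/ProbS weight matrix W^{H+P} of shape (n_m, n_m).

    A   : binary miRNA-disease adjacency matrix of shape (n_m, n_d)
    lam : hybridization parameter (0 gives HeatS, 1 gives ProbS)
    """
    A = np.asarray(A, dtype=float)
    k_m = A.sum(axis=1)  # degree of each miRNA
    k_d = A.sum(axis=0)  # degree of each disease
    inv_k_d = np.divide(1.0, k_d, out=np.zeros_like(k_d), where=k_d > 0)
    # overlap[i, j] = sum over l of a_il * a_jl / k(d_l)
    overlap = (A * inv_k_d) @ A.T
    denom = np.outer(k_m ** (1.0 - lam), k_m ** lam)
    return np.divide(overlap, denom, out=np.zeros_like(overlap), where=denom > 0)
```

For a bipartite network without isolated nodes, the rows of the matrix sum to one when λ = 0 and the columns sum to one when λ = 1, which offers a quick sanity check on an implementation.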

Our method BRWHNHA

In this paper, we present the BRWHNHA method, based on a hybrid recommendation algorithm and unbalanced bi-random walk, to predict potential disease-associated miRNAs. Luo et al.²² found that most of the nodes in DDS and MMS are isolated, and that the sparsity of the disease semantic similarity and miRNA functional similarity affects prediction performance. To overcome this disadvantage of the data, the similarity of each disease pair is estimated by integrating the disease semantic similarity and the disease Gaussian interaction profile kernel similarity, and the similarity of each miRNA pair is estimated by integrating the miRNA functional similarity and the miRNA Gaussian interaction profile kernel similarity. Then, the bipartite miRNA-disease network (MD) is constructed, whose edges are the known associations between miRNAs and diseases released by HMDD in June 2014. The transition probability matrix of MD is obtained by applying the hybrid recommendation algorithm to the bipartite network. Then, unbalanced bi-random walk is carried out on the heterogeneous network that includes DDS, MMS and MD. Finally, for a given disease, all candidate miRNAs are ranked according to the resulting probability matrix; the higher a miRNA is ranked, the more likely it is to be associated with the given disease. The flowchart of potential miRNA-disease association prediction with BRWHNHA is shown in Fig. 4. The two most important steps are as follows.

Step 1 (Calculate the transition probability matrices of DDS, MMS and MD): The transition probability matrix $M={(M(i,j))}_{{n}_{m}\times {n}_{m}}$ of MMS is constructed as:

$$M(i,j)=\begin{cases}\dfrac{M_S(i,j)}{\sum_{k=1}^{n_m}M_S(k,j)}, & \sum_{k=1}^{n_m}M_S(k,j)\ne 0\\ 0, & \text{otherwise}\end{cases}$$

(12)

Similarly, $D={(D(i,j))}_{{n}_{d}\times {n}_{d}}$ is the transition probability matrix of DDS:

$$D(i,j)=\begin{cases}\dfrac{D_S(i,j)}{\sum_{k=1}^{n_d}D_S(k,j)}, & \sum_{k=1}^{n_d}D_S(k,j)\ne 0\\ 0, & \text{otherwise}\end{cases}$$

(13)
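Eqs (12) and (13) both divide each column of a similarity matrix by its column sum, leaving all-zero columns at zero. A minimal sketch of this normalization, with the hypothetical helper name `column_normalize`, is:

```python
import numpy as np

def column_normalize(S):
    """Divide every column of S by its sum; all-zero columns stay zero."""
    S = np.asarray(S, dtype=float)
    col_sums = S.sum(axis=0, keepdims=True)
    return np.divide(S, col_sums, out=np.zeros_like(S), where=col_sums != 0)
```

Applied to the integrated similarity matrices, `column_normalize(M_S)` and `column_normalize(D_S)` would give the transition probability matrices M and D, in which every non-empty column sums to one.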

Based on the hybrid recommendation algorithm, each miRNA node m_i is assigned an initial level of resource f(m_i) = 1 or 0, depending on whether the miRNA is associated with the given disease. The resources of all miRNA nodes are redistributed via the transition matrix of the hybrid recommendation algorithm, and the transition probability matrix of MD is calculated as:

$${P}_{A}={W}^{H+P}\ast A.$$

(14)

Step 2 (Implement unbalanced bi-random walk on the heterogeneous network): Because MMS and DDS have different topological characteristics, we introduce two parameters, r and l, as the maximum numbers of random walk steps on MMS and DDS, respectively.

$$MMS:\ P_{t\text{-}M}=(1-\alpha )\ast M\ast P_{t-1}+\alpha \ast P_A$$

(15)

$$DDS:\ P_{t\text{-}D}=(1-\alpha )\ast P_{t-1}\ast D+\alpha \ast P_A$$

(16)

$$P_t=\begin{cases}\dfrac{P_{t\text{-}M}+P_{t\text{-}D}}{2}, & t\le r,\ t\le l\\ P_{t\text{-}M}, & t\le r,\ t>l\\ P_{t\text{-}D}, & t>r,\ t\le l\end{cases}$$

(17)

Here α denotes a decay factor between 0 and 1. The matrix P_A, the transition probability matrix of the bipartite network MD obtained by the hybrid recommendation algorithm, controls the prior probability in the iterative process, and the walk is initialized with P₀ = P_A/sum(P_A). After several iterations, P_t is taken as the steady-state probability matrix between miRNAs and diseases. For a given disease, we rank all candidate miRNAs by these probabilities. In this way, BRWHNHA utilizes the topological information of all parts of the heterogeneous network: MMS, DDS and MD.
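The iteration in Eqs (15) to (17) can be sketched as follows. This is an illustrative reading of the equations rather than the authors' implementation: the function name `unbalanced_birandom_walk` is hypothetical, and the loop is assumed to stop after max(r, l) steps, since Eq. (17) does not define P_t once t exceeds both r and l.

```python
import numpy as np

def unbalanced_birandom_walk(M, D, P_A, alpha, r, l):
    """Unbalanced bi-random walk on the miRNA-disease heterogeneous network.

    M     : (n_m, n_m) transition matrix of the miRNA similarity network
    D     : (n_d, n_d) transition matrix of the disease similarity network
    P_A   : (n_m, n_d) prior matrix from the hybrid recommendation step
    alpha : decay factor in [0, 1]
    r, l  : maximum numbers of walk steps on MMS and DDS, respectively
    """
    P_A = np.asarray(P_A, dtype=float)
    total = P_A.sum()
    P = P_A / total if total > 0 else P_A.copy()  # P_0 = P_A / sum(P_A)
    for t in range(1, max(r, l) + 1):  # assumed stopping rule, see lead-in
        walks = []
        if t <= r:  # walk on the miRNA similarity network, Eq. (15)
            walks.append((1 - alpha) * M @ P + alpha * P_A)
        if t <= l:  # walk on the disease similarity network, Eq. (16)
            walks.append((1 - alpha) * P @ D + alpha * P_A)
        P = sum(walks) / len(walks)  # Eq. (17): average when both walks are active
    return P
```

For a given disease with column index j, candidate miRNAs could then be ranked with `np.argsort(-P[:, j])`, after excluding miRNAs already known to be associated with that disease.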

Data Availability

All data generated or analyzed during this study are included in this article [Additional file 2, Additional file 3, Additional file 4]. (The data we used was downloaded from the paper of Luo and Xiao²²).

References

Lee, R. C., Feinbaum, R. L. & Ambros, V. The c. elegans heterochronic gene lin-4 encodes small rnas with antisense complementarity to lin-14. Cell 75, 843–854 (1993).
Article CAS Google Scholar
Wang, C., Wei, L., Guo, M. & Quan, Z. Computational approaches in detecting non-coding rna. Current Genomics 14, 371 (2013).
Article CAS Google Scholar
Wei, L. et al. Improved and promising identification of human micrornas by incorporating a high-quality negative set. IEEE/ACM Transactions on Computational Biology and Bioinformatics 11, 192–201 (2014).
Article Google Scholar
Mitra et al. Identifying transcription factor and microrna mediated synergetic regulatory networks in lung cancer. BMC Bioinformatics 14, A14 (2013).
Article Google Scholar
Cheng, A. M., Byrom, M. W., Shelton, J. & Ford, L. P. Antisense inhibition of human mirnas and indications for an involvement of mirna in cell growth and apoptosis. Nucleic Acids Research 33, 1290–1297 (2005).
Article CAS Google Scholar
Miska, E. How micrornas control cell division, differentiation and death. Current Opinion Genetics Development 15, 563–568 (2005).
Article CAS Google Scholar
Xu, P., Guo, M. & Hay, B. A. Micrornas and the regulation of cell death. TRENDS Genetics 20, 617–624 (2004).
Article CAS Google Scholar
Wu, D. et al. ncrdeathdb: A comprehensive bioinformatics resource for deciphering network organization of the ncrnamediated cell death system. Autophagy 11, 1917–1926 (2015).
Article CAS Google Scholar
Li, Y., W., Y. & Zhuang, L. Connect the dots: a systems level approach for analyzing the mirnar -mediated cell death network. Autophagy 9, 436–439 (2013).
Article CAS Google Scholar
Kahraman, M. et al. Microrna in diagnosis and therapy monitoring of early-stage triple-negative breast cancer. Scientific Reports 8, 11584 (2018).
Article ADS Google Scholar
Markou, A. et al. Prognostic value of mature microrna-21 and microrna-205 overexpression in non-small cell lung cancer by quantitative real-time rt-pcr. Clinical Chemistry 54, 1696–1704 (2008).
Article CAS Google Scholar
Miller, TylerE. et al. Microrna-221/222 confers tamoxifen resistance in breast cancer by targeting p27kip1. Journal of Biological Chemistry 283, 29897–29903 (2008).
Article CAS Google Scholar
Weinberg, M. S. & Wood, M. J. A. Short non-coding rna biology and neurodegenerative disorders. novel disease targets and therapeutics. Human Molecular Genetics 18, R27–R39 (2009).
Article CAS Google Scholar
Jiang, Q. et al. Prioritization of disease micrornas through a human phenome-micrornaome network. BMC Systems Biology 4, 1–9 (2010).
Article Google Scholar
Jiang, Q., Hao, Y., Wang, G., Zhang, T. & Wang, Y. Weighted networkbased inference of human microrna-disease associations. Fifth International Conference on Frontier of Computer Science and Technology August 18–22, 431–435 (2010).
Chen, X., Liu, M. X. & Yan, G. Y. Rwrmda: Predicting novel human microrna-disease associations. Molecular Biosystems 8, 2792–2798 (2012).
Article CAS Google Scholar
Xuan, P. et al. Prediction of potential disease-associated micrornas based on random walk. Bioinformatics 31, 1805–1815 (2015).
Article CAS Google Scholar
Chen, X. et al. Wbsmda: Within and between score for mirna-disease association prediction. Scientific Reports 6, 21106 (2016).
Article ADS CAS Google Scholar
Zeng, X., Zhang, X. & Zou, Q. Integrative approaches for predicting microrna function and prioritizing disease-related microrna using biological interaction networks. Briefings in Bioinformatics 17, 193 (2016).
Article CAS Google Scholar
Liu, Y., Zeng, X., He, Z. & Zou, Q. Inferring microrna-disease associations by random walk on a heterogeneous network with multiple data sources. IEEE/ACM Transactions on Computational Biology and Bioinformatics (2017).
Chen, X., Niu, Y. W., Wang, G. H. & Yan, G. Y. Hamda: hybrid approach for mirna-disease association prediction. Journal of Biomedical Informatics 76, 50–58 (2017).
Article CAS Google Scholar
Luo, J. & Qiu, X. A novel approach for predicting microrna-disease associations by unbalanced bi-random walk on heterogeneous network. Journal of Biomedical Informatics 66, 194–203 (2017).
Article Google Scholar
Zeng, X., Liu, L., Lv, L. & Zou, Q. Prediction of potential disease-associated micrornas using structural perturbation method. Bioinformatics PP, 1–1 (2018).
Google Scholar
Yang, Z. et al. Dbdemc: a database of (2010) differentially expressed mirnas in human cancers. BMC Genomics 11, 1–8 (2010).
Article Google Scholar
Jiang et al. mir2disease: a manually curated database for microrna deregulation in human disease. Nucleic Acids Research 37(Database issue), D98–104 (2008).
PubMed PubMed Central Google Scholar
McGuire, S. World cancer report 2014. geneva, switzerland: World health organization, international agency for research on cancer, who press, 2015. Advances in Nutrition 7, 418–419 (2016).
Article Google Scholar
Hart, M. et al. The protooncogene erg is a target of microrna mir-145 in prostate cancer. Febs Journal 280, 2105–2116 (2013).
Article CAS Google Scholar
Ueno, K. et al. MicroRNA-183 is an oncogene targeting Dkk-3 and SMAD4 in prostate cancer. British Journal of Cancer 108, 1659–1667 (2013).
Lu, M. et al. An analysis of human microRNA and disease associations. PLoS One 3, e3420 (2008).
Bandyopadhyay, S., Mitra, R., Maulik, U. & Zhang, M. Q. Development of the human cancer microRNA network. Silence 1, 6 (2010).
Wang, J. Z. et al. A new method to measure the semantic similarity of GO terms. Bioinformatics 23, 1274–1281 (2007).
Chen, X. & Yan, G.-Y. Novel human lncRNA-disease association inference based on lncRNA expression profiles. Bioinformatics 29, 2617–2624 (2013).
Chen, X. et al. NLLSS: predicting synergistic drug combinations based on semi-supervised learning. PLoS Computational Biology 12, e1004975 (2016).
Zhou, T. et al. Solving the apparent diversity-accuracy dilemma of recommender systems. Proceedings of the National Academy of Sciences of the United States of America 107, 4511–4515 (2010).


Acknowledgements

This project was supported by the National Natural Science Foundation of China (Grant No. 11871061); the Collaborative Research Project for Overseas Scholars (including Hong Kong and Macau) of the National Natural Science Foundation of China (Grant No. 61828203); the Chinese Program for Changjiang Scholars and Innovative Research Team in University (PCSIRT) (Grant No. IRT_15R58); the Research Foundation of Education Commission of Hunan Province of China (Grant No. 17K090); and the Innovation Project of Hunan Province of China (Grant No. Cx2016B252).

Author information

Yuan-Lin Ma and Dong-Ling Yu contributed equally.

Authors and Affiliations

Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Xiangtan, Hunan 411105, P.R. China
Dong-Ling Yu, Yuan-Lin Ma & Zu-Guo Yu
School of Electrical Engineering and Computer Science, Queensland University of Technology, Brisbane, Q4001, Australia
Zu-Guo Yu

Authors

Dong-Ling Yu
Yuan-Lin Ma
Zu-Guo Yu

Contributions

D.L.Y. and Y.L.M. contributed to the conception and design of the study and developed the method. D.L.Y. implemented the algorithms and analyzed the data and results. Z.G.Y. proposed the ideas and supervised the project. D.L.Y. wrote the manuscript. All authors discussed the results, reviewed the manuscript, and approved the final version.

Corresponding author

Correspondence to Zu-Guo Yu.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1

Additional file 2

Additional file 3

Additional file 4

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Yu, DL., Ma, YL. & Yu, ZG. Inferring microRNA-disease association by hybrid recommendation algorithm and unbalanced bi-random walk on heterogeneous network. Sci Rep 9, 2474 (2019). https://doi.org/10.1038/s41598-019-39226-x


Received: 18 September 2018
Accepted: 18 January 2019
Published: 21 February 2019
DOI: https://doi.org/10.1038/s41598-019-39226-x
