Literature DB >> 30792474

Inferring microRNA-disease association by hybrid recommendation algorithm and unbalanced bi-random walk on heterogeneous network.

Dong-Ling Yu1, Yuan-Lin Ma1, Zu-Guo Yu2,3.   

Abstract

More and more research works have indicated that microRNAs (miRNAs) play indispensable roles in exploring the pathogenesis of diseases. Detecting miRNA-disease associations by experimental techniques in biology is expensive and time-consuming. Hence, it is important to propose reliable and accurate computational methods to exploring potential miRNAs related diseases. In our work, we develop a novel method (BRWHNHA) to uncover potential miRNAs associated with diseases based on hybrid recommendation algorithm and unbalanced bi-random walk. We first integrate the Gaussian interaction profile kernel similarity into the miRNA functional similarity network and the disease semantic similarity network. Then we calculate the transition probability matrix of bipartite network by using hybrid recommendation algorithm. Finally, we adopt unbalanced bi-random walk on the heterogeneous network to infer undiscovered miRNA-disease relationships. We tested BRWHNHA on 22 diseases based on five-fold cross-validation and achieves reliable performance with average AUC of 0.857, which an area under the ROC curve ranging from 0.807 to 0.924. As a result, BRWHNHA significantly improves the performance of inferring potential miRNA-disease association compared with previous methods. Moreover, the case studies on lung neoplasms and prostate neoplasms also illustrate that BRWHNHA is superior to previous prediction methods and is more advantageous in exploring potential miRNAs related diseases. All source codes can be downloaded from https://github.com/myl446/BRWHNHA .

Entities:  

Year:  2019        PMID: 30792474      PMCID: PMC6385311          DOI: 10.1038/s41598-019-39226-x

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

MicroRNAs (miRNAs) are a class of short non–coding RNAs (21–25 nt)[1-3]. As an important transcriptional regulatory factor, miRNAs are widely involved in the biological procedures of disease-related gene regulation, which is closely related to human multi-gene diseases[4-7]. Increasing evidences have demonstrated that miRNAs play a critical role in the emergence and development of diseases[8,9]. Hence, revealing miRNAs associated diseases is an efficient way to accelerate the acquaintance about disease pathology at the molecular level[10-13]. As detecting miRNA-disease associations by experimental techniques is expensive and time-consuming, many effective computational methods about the prediction of the relationship between miRNAs and diseases have been proposed. For example, Jiang et al.[14] proposed a computational approach to infer potential miRNA-disease associations by hypergeometric distribution. For a given disease, they priorited the entire human miRNAs. In addition, Jiang et al.[15] further improved the calculation of concordance score between a miRNA and a given disease. Chen et al.[16] firstly presented a prediction computational method named RWRMDA based on global network similarity, to predict novel human miRNA-disease associations by adopting the method of random walk on network of miRNA functional similarity. Then, Xuan et al.[17] developed a reliable prediction method based on random walk, they assigned different weights to transition matrix of miRNAs depending on whether they are associated with given diseases to exploit the prior information of nodes and the various ranges of topologies. And they extended the walk on a miRNA-disease bipartite network to predict candidates miRNAs, specially for the diseases without any known related miRNAs. Furthermore, Chen et al.[18] developed a novel prediction method named WBSMDA for inferring miRNA-disease based on integrating miRNA functional similarity, disease semantic similarity, the known miRNA-disease associations, and the Gaussian interaction profile kernel similarity into heterogeneous network. WBSMDA not only could deal with new diseases without any known associated miRNAs, but also could handle new miRNAs without any known associated disease. In 2016, Zeng et al.[19] conducted a review on methods for predicting disease and miRNA associations based on biological interaction networks. After detailed comparing these methods, they pointed out the current challenges in predicting disease and miRNA correlations. Liu et al.[20] further proposed a method to explore potential miRNAs related to diseases by integrating multiple biology data in 2017. In recent years, recommendation system algorithms have been successfully applied in many fields. Chen et al.[21] presented a hybrid approach for miRNA-disease association prediction (HAMDA) method based on hybrid recommendation methods, which combined available biology data and network-based inference methods. However, just like the above mentioned methods, they only prioritized miRNAs by utilizing the same layers neighbor nodes of miRNAs and diseases rather than making use of the different structural and topological characteristics among subgraphs of heterogeneous networks. Luo et al.[22] proposed a novel effective prediction model that use unbalanced bi-random walk to improve performance of prediction. They fully exploited the different topological and structural of miRNA similarity networks and disease similarity network. This method improved prediction performance, but ignored the prior information and the respective topological structural of bipartite network. Zeng et al.[23] found that heterogeneous miRNA-disease networks perform better on prediction than single disease similarity networks, miRNA similarity networks, and the known disease-gene association networks in 2018. So they adopted a method of structural perturbation to improve the prediction accuracy of miRNA-disease association. We believe that the topological and structural features of heterogeneous network contain important information which is useful for discovering more reliable miRNA-disease associations. In present work, we develop an efficient computational method based on hybrid recommendation approach and unbalanced bi-random walk, called BRWHNHA (Bi-random Walk on Heterogeneous Network based on Hybrid Approach), which exploits the characteristic of nodes and the topological structural of the known miRNA-disease association by using hybrid recommendation approach, and taken advantage of the different topological structural between similarity networks of miRNA and disease by adopting bi-random walk on heterogeneous network. The hybrid recommendation algorithm adds some virtual edges to heterogeneous networks by calculating the transition matrix of bipartite network, so that the unbalanced bi-random walk on the new heterogeneous network can find potential miRNAs related to diseases more efficiently. To validate the prediction ability of BRWHNHA, we adopted five-fold cross-validation and compared BRWHNHA with MIDPE[17], HAMDA[21], and BRWH[22]. The average AUC is 2.13%, 0.69%, and 2.20% higher than the three methods. The case studies on lung neoplasms and prostatic neoplasms, and in the top 50 predicted associations, there are 49 and 46 real associations, respectively. It further demonstrates the ability of BRWHNHA in discovering potential miRNAs associated with disease.

Results

To evaluate the prediction effectiveness of BRWHNHA in exploring undiscovered association between miRNAs and diseases, we compared BRWHNHA with MIDPE[17], HAMDA[21], and BRWH[22] by five-fold cross-validation with repeating 100 times on the dataset obtained by Luo and Xiao[22]. For a given disease, we randomly divided the known-related miRNAs into five subsets with equal size. For each round, we used one subset as testing set and other four subsets as training set. After 5 rounds, we calculated the average AUC value. In order to reduce false positive, we recalculated miRNAs similarity and obtained a bran-new similarity matrix in each round of prediction. Then we calculated the probability of association between the given disease and miRNAs by BRWHNHA. Finally, all candidate miRNAs were ranked by association probability. The higher the miRNAs in testing set were ranked, the better the performance. As the most of diseases only have a few association with miRNAs that have been proved, the performance of the prediction methods can not be accurately evaluated. Hence we only tested the 22 diseases associated with at least 60 miRNAs as Luo and Xiao[22]. We only showed recall-precision curve of breast neoplasms and lung neoplasms. In addition, we analyzed effect of parameters on performance of BRWHNHA.

Performance evaluation

In this study, the novelty of BRWHNHA was to calculate the transition probability matrix of bipartite network by using hybrid recommendation algorithm, and then a bi-random walk on heterogeneous network based on hybrid approach was adopted. The average AUC is 83.55% without using hybrid recommendation algorithm, which was 2.14% less than using hybrid recommendation algorithm. Therefore, it is important to construct the transition probability matrix by hybrid recommendation algorithm. The prediction accuracy was actually improved by exploring the prior information and topological structure of bipartite networks. The same heterogeneous network was used on MIDPE[17], HAMDA[21] and BRWH[22]. The best parameters of α = 0.9 and γ = 0.8 for MIDPE, σ = 0.7 and ρ = 0.8 for HAMDA λ = 0.6, α = 0.4, r = 2, l = 1 for BRWH were adopted as reported in original papers. As illustrated in Table 1, the average AUC values of MIDPE, HAMDA, BRWHA and BRWHNHA in 22 diseases are 83.55%, 85.00%, 83.49% and 85.69% respectively. BRWHNHA performed the best with AUC 2.13%, 0.69% and 2.20% higher than other three methods. Moreover, BRWHNHA is superior to MIDPE and BRWH in all measurements for 22 diseases. Although HAMDA achieves higher AUC than BRWHNHA in 7 out of 22 diseases, but BRWHNHA obtains better performance in most of diseases. Since HAMDA repeatedly uses the known miRNAs-disease association data in the measurement of miRNAs similarity, it maybe overestimate the results. ROC curves of BRWHNHA and other three methods corresponding to the maximum AUC value in five-fold cross-validation at 100 times have shown in Supplementary Fig. S1 in Additional file.
Table 1

Predicting outcomes for MIDPE, HAMDA, BRWH and BRWHNHA by the five-fold cross-validation.

Diseases nameNumber of related miRNAsThe average AUC
MIDPE[17]HAMDA[21]BRWH[22]BRWHNHA
Breast Neoplasms2020.8130.8210.8120.855
Carcinoma, Hepatocellular2140.7770.7910.7760.805
Carcinoma, Non-Small-Cell Lung950.8590.8670.8570.874
Carcinoma, Renal Cell1070.8140.8220.8120.828
Carcinoma, Squamous Cell800.8740.8860.8750.882
Colonic Neoplasms780.8550.8660.8530.872
Colorectal Neoplasms1470.8190.8380.8240.848
Endometriosis620.8150.8500.8140.841
Esophageal Neoplasms740.7940.7930.7930.825
Glioblastoma960.8050.8250.8010.839
Glioma710.8680.8780.8630.874
Head and Neck Neoplasms640.8720.8860.8700.881
Heart Failure1200.8020.8100.8000.807
Leukemia, Myeloid, Acute640.8570.8680.8520.858
Lung Neoplasms1320.9060.9190.9060.924
Medulloblastoma620.8010.8070.7980.811
Melanoma1410.8230.8380.8250.849
Ovarian Neoplasms1140.8660.9020.8870.903
Pancreatic Neoplasms990.9050.9070.9020.911
Prostatic Neoplasms1180.8330.8570.8310.864
Stomach Neoplasms1730.7820.8020.7830.832
Urinary Bladder Neoplasms920.8420.8670.8330.867
Predicting outcomes for MIDPE, HAMDA, BRWH and BRWHNHA by the five-fold cross-validation. In Fig. 1, we compared BRWHNHA with other three methods in the recall-precision curves of breast neoplasms and lung neoplasms based on five-fold cross-validation. The precision-recall curve was obtained by measuring recall and precision at positions of top k (k = 10, 20, …, 100). The results show that our method achieves the highest precision and recall in the top 20. Moreover, with the increase of k value, the precision of BRWHNHA decreases, but the recall increases. It suggestes that the associations ranked in top position have higher probability of being potential miRNA-disease associations. We also compared the statistical significance of the difference in predictive ability between BRWHNHA and other three methods by paired t-tests. The P-values are listed in Table 2. Obviously, BRWHNHA achieves better performance than MIDPE, HAMDA, BRWHA at the significance level of 0.05.
Figure 1

Recall-precision curves of breast neoplasm and lung neoplasm by five-fold cross-validation.

Table 2

Pairwise comparison between BRWHNHA and another method by paired t-test on the AUC of prediction.

MethodMIDPE[17]HAMDA[21]BRWH[22]
P-value2.41E-071.71E-024.00E-08
Recall-precision curves of breast neoplasm and lung neoplasm by five-fold cross-validation. Pairwise comparison between BRWHNHA and another method by paired t-test on the AUC of prediction. We also compared our method with SPM on the dataset used by Zeng et al.[23]. We found that our method performs slightly better than SPM on five subsets with equal size in five-fold cross-validation in most cases (comparison results are not shown here).

Effect of parameters in BRWHNHA

There are four parameters λ, α, r and l explored in our method. The parameter λ is the hybridization parameter to mediate between HeatS algorithm and ProbS algorithm different kinds of resource distribution processes, and parameter α plays the role to control the consistence between the predicted candidate miRNA-disease associations and the known associations. The parameters of r and l are the numbers of maximal random walk steps in miRNA similarity network and disease similarity network, respectively. We set various values of λ and α ranging from 0 to 1, the step length was 0.1. r and l were taken to be between 0 step to 5 steps, the step length was 1. Then, we calculated average AUC in the framework of five-fold cross-validation. Table 3 shows the effects of λ, α, r and l on the cross validation result in miRNA-disease association dataset. It can be observed that BRWHNHA achieves the best performance, when λ = 0.6, α = 0.4, r = 2, l = 1.
Table 3

Effects of parameters λ, α, r, l on prediction performance of BRWHNHA.

α = 0.4, r = 2, l = 1λ = 0.6, r = 2, l = 1
λ Average AUC α Average AUC
00.74865400.804929
0.10.8142600.10.844848
0.20.8445240.20.853413
0.30.8526760.30.856242
0.40.8552200.4 0.856818
0.50.8560810.50.855095
0.6 0.856818 0.60.853919
0.70.8564180.70.850477
0.80.8561460.80.848039
0.90.8558240.90.847697
10.85541510.847478
λ = 0.6, α = 0.4, l = 1λ = 0.6, α = 0.4, r = 2
r Average AUC l Average AUC
00.84626700.856710
10.8527311 0.856818
2 0.856818 20.853561
30.85681430.846865
40.85655540.846626
50.85637150.846505
Effects of parameters λ, α, r, l on prediction performance of BRWHNHA.

Case study

To further validate efficiency of BRWHNHA for discovering the potential associations between miRNAs and diseases, we conducted two case studies of Lung neoplasms and Prostatic neoplasms here. All known miRNA-disease associations released in June 2014 were regarded as training sets, and the set of candidate associations formed by all other associations. The prediction results of Lung neoplasms and Prostatic neoplasms were confirmed based on relevant literatures and two important public database: dbDEMC[24] and MiR2Disease[25]. Lung cancer is one of the malignant tumors with the highest morbidity and mortality, and it is the greatest health and life threat to human. Over the past 50 years, many countries have reported significant increases in the incidence and mortality of lung cancer. The first 50 predicted miRNA associated with lung cancer were shown in Table 4. As a result, among the top 20 and 50 potential Lung neoplasms associated miRNAs, 20 and 49 were confirmed by dbDEMC database, MiR2Disease and literature. Though there is no database or literature that proved the miRNA (hsa-mir-200) relevance to lung neoplasms, the mir-200 family, which includes 5 members (miRNA-200a, miRNA-200b, miRNA-200c, miRNA-429, and miRNA-141), is associated with Lung neoplasms in dbDEMC, so we have reason to believe that it is related to the disease. In addition, we also listed the potential miRNAs from top 51 to top100 (Supplementary Table S2 in Additional file 1).
Table 4

The first 50 potential miRNAs associated with lung neoplasms predicted by BRWHNHA.

RankmiRNAsEvidenceRankmiRNAsEvidence
1hsa-mir-20bDB26hsa-mir-302fDB
2hsa-mir-15bDB27hsa-mir-1258DB
3hsa-mir-34bDB28hsa-mir-1305DB
4hsa-mir-21DB, MD29hsa-mir-140DB, MD
5hsa-mir-200bDB, MD30hsa-mir-106aDB, MD
6hsa-mir-29bDB, MD31hsa-mir-219DB, MD
7hsa-mir-146aDB, MD32hsa-mir-1827PMID:21676885
8hsa-mir-7iDB33hsa-mir-675DB
9hsa-mir-1236DB34hsa-mir-485DB
10hsa-mir-30eDB, MD35hsa-mir-105DB
11hsa-let-200aDB, MD36hsa-mir-92bDB
12hsa-mir-885DB37hsa-mir-1323DB
13hsa-let-147bDB38hsa-mir-135aDB
14hsa-let-10aDB39hsa-mir-98DB, MD
15hsa-mir-198DB, MD40hsa-mir-137DB
16hsa-let-100DB41hsa-mir-27aDB
17hsa-mir-212DB, MD42hsa-mir-235DB
18hsa-mir-181dDB43hsa-mir-450bDB
19hsa-mir-217DB44hsa-mir-495DB
20hsa-mir-204DB, MD45hsa-mir-18aDB, MD
21hsa-mir-374bDB46hsa-mir-1915DB
22hsa-mir-133aDB47hsa-mir-101DB, MD
23hsa-mir-200Unconfirmed48hsa-mir-23aDB
24hsa-mir-106bDB49hsa-mir-1207DB
25hsa-mir-194DB50hsa-mir-337DB

*The databases dbDEMC and mir2disease are respectively represented as DB and MD.

The first 50 potential miRNAs associated with lung neoplasms predicted by BRWHNHA. *The databases dbDEMC and mir2disease are respectively represented as DB and MD. Prostate neoplasms is an important malignant tumor in male patients. There are usually no clinical symptoms in the early stage. Currently, most of the patients admitted by prostate neoplasms are in the late stage[26]. Therefore, early diagnosis is an urgent problem. There are many evidences that have confirmed a link between miRNA and prostate neoplasms, and it could be therapeutically useful for the treatment of prostate neoplasms by regulating the expression of related miRNAs[27,28]. As a result of the case study for prostate neoplasms, 18 out of the top-20 and 46 out of the top-50 predicted miRNAs of prostate neoplasms were verified by dbDEMC, MiR2Disease and literature (shown in Table 5). However, hsa-mir-302f, hsa-mir-1915, hsa-mir-4257 and hsa-mir-1286 are not included in dbDEMC, MiR2Disease and literature. We also listed the potential miRNAs from top 51 to top100 (Supplementary Table S3 in Additional file 1).
Table 5

The first 50 potential miRNAs associated with prostate neoplasms predicted by BRWHNHA.

RankmiRNAsEvidenceRankmiRNAsEvidence
1hsa-mir-154DB26hsa-mir-137PMID:26461474
2hsa-mir-189DB27hsa-mir-134DB
3hsa-mir-19aDB28hsa-mir-30aDB, MD
4hsa-mir-29bDB, MD29hsa-mir-138PMID:28741117
5hsa-mir-21DB, MD30hsa-mir-302aDB
6hsa-mir-199bDB, MD31hsa-mir-450bDB
7hsa-mir-141DB, MD32hsa-mir-495DB
8hsa-mir-10aDB, MD33hsa-mir-26aDB, MD
9hsa-mir-181DB34hsa-mir-20aDB, MD
10hsa-mir-885DB35hsa-mir-139DB
11hsa-let-7dDB, MD36hsa-mir-107DB
12hsa-mir-149DB, MD37hsa-mir-148bDB
13hsa-let-7eDB38hsa-mir-2355DB
14hsa-let-7fDB, MD39hsa-mir-302bDB
15hsa-mir-195DB, MD40hsa-mir-337DB
16hsa-let-7gDB, MD41hsa-mir-624DB
17hsa-mir-302fUnconfirmed42hsa-mir-214DB, MD
18hsa-mir-23bDB, MD43hsa-mir-205DB, MD
19hsa-mir-199aDB, MD44hsa-mir-1286Unconfirmed
20hsa-mir-1915Unconfirmed45hsa-mir-340DB
21hsa-mir-1258DB46hsa-mir-1275DB
22hsa-mir-675DB47hsa-mir-136DB
23hsa-mir-4257Unconfirmed48hsa-mir-218DB, MD
24hsa-mir-18aDB49hsa-mir-498DB, MD
25hsa-mir-101DB, MD50hsa-mir-301bPMID:29744254

*The databases dbDEMC and mir2disease are respectively represented as DB and MD.

The first 50 potential miRNAs associated with prostate neoplasms predicted by BRWHNHA. *The databases dbDEMC and mir2disease are respectively represented as DB and MD.

Conclusion

Taking full account of the different topological and structural characteristics of heterogeneous network is a very challenging and meaningful task in prioritizing potential disease-related miRNAs. In this paper, we first adopted an effective measurement, which is suitable for miRNAs and diseases without known miRNA-disease associations, to estimate the similarity of miRNAs and diseases. Then, we presented a BRWHNHA method based on hybrid recommendation algorithm and unbalanced bi-random walk to predict potential diseases associated miRNAs. We made full use of the prior information and topological structural by calculating the transition probability matrix of bipartite network in using hybrid recommendation algorithm, in addition, we fully exploited the topologies and structures of miRNA similarity network(MMS) and disease similarity network(DDS) in the different lever by adopting unbalanced bi-random walk on heterogeneous network. To assess the performance of BRWHNHA, we compared BRWHNHA with MIDP, HAMDA and BRWH on the dataset obtained by Luo and Xiao[22]. The results indicate that BRWHNHA has the best prediction ability among these methods, the average AUC was 2.13%, 0.69% and 2.20% higher than MIDP, HAMDA and BRWH, respectively. Furthermore, case studies on lung neoplasms and prostatic were employed to further identify the performance evaluation of BRWHNHA, which the top 49 out of 50 and 46 out of 50 predicted miRNA-disease associations respectively were confirmed by recently published literature and databases of dbDEMC and MiR2Disease. The results show that BRWHNHA can be used as an effective and important method to explore the potential association between miRNAs and diseases. Nevertheless, there is a limitation on our BRWHNHA that should be improved in future study. That is there are many parameters need to be set in this method. So a more effective method need be adopted to find the optimal parameters.

Methods

The measurement of disease semantic similarity and miRNA functional similarity

As described in the category C of MeSH descriptor, the disease relationships can be regarded as a directed acyclic graph structure. The disease K can be represented as DAG(K) = (K, T(K), E(K))[22], where T(K) represents the set of all the ancestor nodes of disease K and disease K, E(K) represents the set of all direct edges from parent nodes to child nodes in the subgraph, as shown in Fig. 2. For two diseases d and d, the disease semantic similarity measurement DSS(d, d) is defined by Luo and Xiao[22].
Figure 2

Hierarchical DAG graph of Lung neoplasms.

Hierarchical DAG graph of Lung neoplasms. Based on the assumption that miRNAs with similar functions are more likely to be associated with similar diseases and vice versa[29,30], the miRNA function similarity measurement MFS(m, m) for two miRNAs m and m is adopted, which proposed by Wang et al.[31].

Gaussian interaction profile kernel similarity

The Gaussian interaction profile kernel similarity also is based on the assumption that miRNAs with similar functions are more likely to be associated with similar diseases and vice versa. Let be the adjacency matrix of MD, and n denotes the number of miRNAs and n the number of diseases, respectively. The Gaussian interaction profile kernel similarity is calculated by the known miRNA-disease associations[21], so let IP(d) binary vector indicate whether disease d is associated with each miRNA, in other words, IP(d) is the ith column of A. The Gaussian interaction profile kernel similarity between two diseases d and d is calculated as:where is the kernel bandwidth and is a new bandwidth parameter (e.g. ( as[32,33]) to normalize r. For two miRNAs m and m, the Gaussian interaction profile kernel similarity is calculated as:where is the kernel bandwidth and is a new bandwidth parameter to normalize r.

Integrated similarity for miRNAs and diseases

A new disease similarity matrix can be obtained by integrated disease semantic similarity and the disease Gaussian interaction profile kernel similarity[21]. For two diseases d and d, the new diseases similarity can be defined as follows: The integrated similarity between miRNAs m and m can be defined as follows:

Hybrid recommendation algorithm

A binary network MD(M, D, E) is constructed by experimentally confirmed miRNA-disease association, where D represents all diseases nodes, M represents all miRNAs nodes, and E represents all edges in MD. The adjacency matrix A is defined as follows: Zhou et al.[34] proposed the hybrid recommendation algorithm, which combined the heat spreading (HeatS) algorithm and probabilistic spreading (ProbS) algorithm by incorporating the hybridization parameter λ to balance the accuracy of HeatS and the diversity of ProbS. For a given disease, HeatS and ProbS both work by assigning miRNA an initial resource represented by the vector f (where f is the resource possessed by miRNA m), which was redistributed though the transformation . The miRNA that possess more resource is more likely associated the given disease. In Fig. 3, the visualization process of HeatS algorithm and ProbS algorithm is presented.
Figure 3

The HeatS (a–c) and ProbS (d–f) algorithms at work on the bipartite miRNA-disease network. Disease are shown as green squares and dark green squares is a given disease, miRNAs are shown as red circles.

The HeatS (a–c) and ProbS (d–f) algorithms at work on the bipartite miRNA-disease network. Disease are shown as green squares and dark green squares is a given disease, miRNAs are shown as red circles. HeatS is defined as follows: ProbS is defined as follows: Hybrid recommendation algorithm is defined as follows: k(x) denotes the degree of nodes x in bipartite graph MD(M, D, E).

Our method BRWHNHA

In this paper, we present a BRWHNHA method based on hybrid recommendation algorithm and unbalance bi-random walk to predict potential diseases associated miRNAs. Luo et al.[22] found that most of the nodes in DDS and MMS are isolated, and the sparsity of disease semantic similarity and miRNA functional similarity effect the prediction performance. To overcome this disadvantages in data, the similarity is estimated for each disease pair via integrating disease semantic similarity and disease Gaussian interaction profile kernel similarity, as well as miRNA pair is estimated via integrating miRNA function similarity and miRNA Gaussian interaction profile kernel similarity. Then, the bipartite miRNA-disease network (MD) is constructed, where edges in the miRNA-disease network are the known associations between miRNAs and diseases that were released by HMDD in June 2014. The transition probability matrix of MD is obtained by using hybrid recommendation algorithm in bipartite networks. Then, unbalance bi-random walk is carried out in heterogeneous network that includes DDS, MMS and MD. Finally, for a given disease, all candidate miRNAs will be ranked according to transition probability matrix, and the higher the rank, the more likely it is to be associated with the given disease. Flowchart of potential miRNA-disease association prediction based on the computational model of BRWHNHA is shown in Fig. 4. The most important 2 steps is:
Figure 4

Flowchart of potential miRNA-disease association prediction based on the computational model of BRWHNHA.

Flowchart of potential miRNA-disease association prediction based on the computational model of BRWHNHA. Step 1 (Calculate the transition probability matrix of DDS, MMS and MD): The transition probability matrix of MMS is constructed as: Similarly, is the transition probability matrix of the DDS: Based on hybrid recommendation algorithm, the miRNA node m is assigned an initial lever of resource f(m) = 1, or 0 depending on whether the miRNA is associated with given disease. All the resource of miRNA nodes redistributed via the transition matrix of hybrid recommendation algorithm, and transition probability matrix of MD is calculated as: Step 2 (Implement unbalance bi-random walk in heterogeneous network): Because of the different topological characteristics between MMS and DDS, in these two networks, we introduce two parameters of l and r as the biggest step random walk on MMS and DDS.α denotes a decay factor ranging from 0 and 1. The matrix P is used to control the prior probability of the iterative process and is the transition probability matrix of the bipartite network G obtained by the recommendation algorithm. P is a transition probability matrix, and P0 = P/sum(P). After several iterations, P is the steady-state probability matrix between miRNAs and diseases. For a given disease, we ranked all the candidate miRNAs based on the probability. In BRWHNHA algorithm, we effectively utilize the topological information of heterogeneous networks, including: MMS, DDS and MD. Additional file 1 Additional file 2 Additional file 3 Additional file 4
  32 in total

Review 1.  MicroRNAs and the regulation of cell death.

Authors:  Peizhang Xu; Ming Guo; Bruce A Hay
Journal:  Trends Genet       Date:  2004-12       Impact factor: 11.639

Review 2.  How microRNAs control cell division, differentiation and death.

Authors:  Eric A Miska
Journal:  Curr Opin Genet Dev       Date:  2005-10       Impact factor: 5.578

3.  A new method to measure the semantic similarity of GO terms.

Authors:  James Z Wang; Zhidian Du; Rapeeporn Payattakool; Philip S Yu; Chin-Fu Chen
Journal:  Bioinformatics       Date:  2007-03-07       Impact factor: 6.937

4.  Solving the apparent diversity-accuracy dilemma of recommender systems.

Authors:  Tao Zhou; Zoltán Kuscsik; Jian-Guo Liu; Matús Medo; Joseph Rushton Wakeling; Yi-Cheng Zhang
Journal:  Proc Natl Acad Sci U S A       Date:  2010-02-22       Impact factor: 11.205

Review 5.  Short non-coding RNA biology and neurodegenerative disorders: novel disease targets and therapeutics.

Authors:  Marc S Weinberg; Matthew J A Wood
Journal:  Hum Mol Genet       Date:  2009-04-15       Impact factor: 6.150

6.  MicroRNA-221/222 confers tamoxifen resistance in breast cancer by targeting p27Kip1.

Authors:  Tyler E Miller; Kalpana Ghoshal; Bhuvaneswari Ramaswamy; Satavisha Roy; Jharna Datta; Charles L Shapiro; Samson Jacob; Sarmila Majumder
Journal:  J Biol Chem       Date:  2008-08-15       Impact factor: 5.157

7.  Prognostic value of mature microRNA-21 and microRNA-205 overexpression in non-small cell lung cancer by quantitative real-time RT-PCR.

Authors:  Athina Markou; Emily G Tsaroucha; Loukas Kaklamanis; Marianthi Fotinou; Vassilis Georgoulias; Evi S Lianidou
Journal:  Clin Chem       Date:  2008-08-21       Impact factor: 8.327

8.  miR2Disease: a manually curated database for microRNA deregulation in human disease.

Authors:  Qinghua Jiang; Yadong Wang; Yangyang Hao; Liran Juan; Mingxiang Teng; Xinjun Zhang; Meimei Li; Guohua Wang; Yunlong Liu
Journal:  Nucleic Acids Res       Date:  2008-10-15       Impact factor: 16.971

9.  Antisense inhibition of human miRNAs and indications for an involvement of miRNA in cell growth and apoptosis.

Authors:  Angie M Cheng; Mike W Byrom; Jeffrey Shelton; Lance P Ford
Journal:  Nucleic Acids Res       Date:  2005-03-01       Impact factor: 16.971

10.  An analysis of human microRNA and disease associations.

Authors:  Ming Lu; Qipeng Zhang; Min Deng; Jing Miao; Yanhong Guo; Wei Gao; Qinghua Cui
Journal:  PLoS One       Date:  2008-10-15       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.