Literature DB >> 34178973

Predicting miRNA-Disease Association Based on Modularity Preserving Heterogeneous Network Embedding.

Wei Peng^1,2, Jielin Du¹, Wei Dai^1,2, Wei Lan³.

Abstract

MicroRNAs (miRNAs) are a category of small non-coding RNAs that profoundly impact various biological processes related to human disease. Inferring the potential miRNA-disease associations benefits the study of human diseases, such as disease prevention, disease diagnosis, and drug development. In this work, we propose a novel heterogeneous network embedding-based method called MDN-NMTF (Module-based Dynamic Neighborhood Non-negative Matrix Tri-Factorization) for predicting miRNA-disease associations. MDN-NMTF constructs a heterogeneous network of disease similarity network, miRNA similarity network and a known miRNA-disease association network. After that, it learns the latent vector representation for miRNAs and diseases in the heterogeneous network. Finally, the association probability is computed by the product of the latent miRNA and disease vectors. MDN-NMTF not only successfully integrates diverse biological information of miRNAs and diseases to predict miRNA-disease associations, but also considers the module properties of miRNAs and diseases in the course of learning vector representation, which can maximally preserve the heterogeneous network structural information and the network properties. At the same time, we also extend MDN-NMTF to a new version (called MDN-NMTF2) by using modular information to improve the miRNA-disease association prediction ability. Our methods and the other four existing methods are applied to predict miRNA-disease associations in four databases. The prediction results show that our methods can improve the miRNA-disease association prediction to a high level compared with the four existing methods.

Entities: CellLine Chemical Disease Species

Keywords: disease; heterogeneous network embedding; matrix factorization; miRNA; miRNA-disease association prediction

Year: 2021 PMID： 34178973 PMCID： PMC8223753 DOI： 10.3389/fcell.2021.603758

Source DB: PubMed Journal: Front Cell Dev Biol ISSN： 2296-634X

Introduction

MicroRNA (miRNA) is a category of small endogenous single-stranded non-coding RNA molecules with about 22 nucleotides in length. They play an essential role in regulating gene expression and complex gene regulatory networks by repressing target mRNAs expression at the post-transcriptional level (Bartel, 2004; Meister and Tuschl, 2004). Studies show that about 60% of human protein-coding genes are targeted by miRNAs, where the 5′ region of miRNA binds to 3′ UTR of the target mRNAs (Friedman et al., 2009). With the rapid development of biotechnology, increasing research has demonstrated that miRNAs play crucial roles at multiple stages of many critical biological processes such as early cell growth, development, proliferation, differentiation, tumor invasion, and apoptosis (Ambros, 2003). Furthermore, studies have shown that abnormality and dysregulations of disease-related miRNAs may cause human diseases (Garzon et al., 2010). Therefore, inferring the potential miRNA-disease association is of great benefit to studying human diseases, such as disease prevention, disease diagnosis, and drug development. As we all know, discovering the miRNA-disease associations through traditional biological experiments is a time-consuming and labor-intensive process. Instead, computational models would serve as a low-cost, and high-efficiency way of predicting miRNA-disease associations. Previous researches observe that similar miRNAs tend to associate with the same diseases and similar diseases are highly likely related to the same miRNAs. Hence, many computational methods construct disease similarity network and miRNA similarity network and infer miRNA-disease associations based on the associations between or within the disease or miRNAs (Peng et al., 2016, 2017; Zou et al., 2016; Huang et al., 2019). Xuan et al. (2013) construct a miRNA similarity network according to the degree of two miRNAs sharing similar disease and consider the k most similar neighbors of each miRNA to infer miRNA-disease associations. Chen X. et al. (2012) implement a random walk on the miRNA functional similarity network and explore the potential miRNA-disease associations from the global network information. Xuan et al. (2015) divide the miRNA nodes in the miRNA similarity network into two categories: the given disease-related and the given disease-unrelated nodes. They assign different transition weights to different types of nodes and implement random walk on the miRNA similarity network to predict miRNA-disease associations. Besides the single network, some researchers build a heterogeneous network that consists of miRNAs, diseases, and their inter and intro associations. Liu et al. (2017) construct the miRNA similarity network, disease similarity network and known miRNA-disease association network. After that, they run a random walk on the heterogeneous network to propagate information and exploit potential miRNA-disease associations. Considering the difference in the network structure of the miRNA similarity network and disease similarity network, Luo and Xiao (2017) use an unbalanced Bi-Random walk (called UBiRW) on the heterogeneous network of disease similarity network, miRNA functional similarity network and a known miRNA-disease association network to infer potential miRNA-disease associations. Zeng et al. (2016) enumerate all of the paths from miRNA/disease to disease/miRNA in the heterogeneous network, and the final score between a miRNA and a disease is a linear combination of their path scores. You et al. (2017) construct the heterogeneous network by integrating known human miRNA-disease associations, miRNA functional similarity, disease semantic similarity, and the Gaussian Interaction Profile (GIP) kernel similarity. After that, they do a depth-first search to find the paths between the miRNAs and diseases on the heterogeneous network. Then they filter the long paths and calculate the association of miRNA and disease by combining all their paths. Recently, a group of researchers proposes the network embedding-based method to predict miRNA-disease associations. The network embedding method designs an objective function and converts the network nodes into a low dimensional vector while maximally preserves the network structural information. Chen and Yan (2014) develop a regularized least square method to learn the latent vectors for miRNAs and diseases on the miRNA similarity network and disease similarity network. They combine the two vectors to give the final solution of predicting new miRNA-disease associations. Lan et al. (2016) construct multi-kernels to store the miRNA functional similarity network, miRNA sequence similarity network and disease semantic similarity network. Then they employ a Bayesian matrix factorization method to infer potential miRNA-disease associations by integrating these data sources. Yan et al. (2019) develop a dynamic neighborhood regularized logistic matrix factorization method called DNRLMF-MDA to learn representation vectors for miRNAs and diseases and predict potential miRNA-disease associations. Li et al. (2017) design an objective function to ensure the scores of known miRNA-disease association matrix are close to those in the predicted miRNA-disease association matrix. They utilize the matrix completion algorithm to update the matrix of known miRNA-disease associations and to predict the potential associations. Xiao et al. (2018) use a graph regularized non-negative matrix factorization framework (named GRNMF) to identify possible associations for all diseases simultaneously. Similarly, Chen’s group proposes two matrix completion-based methods, namely IMCMDA (Chen et al., 2018), and NCMCMDA (Chen et al., 2020) for miRNA-disease association prediction. The differences are IMCMDA uses inductive matrix completion for miRNA-disease association prediction, while NCMCMDA integrates neighborhood constraint in the course of matrix completion. The methods mentioned above have achieved great success in predicting miRNA-disease associations. However, there are still some shortcomings in these existing methods. Firstly, the single network-based methods only use the miRNA similarity or disease similarity network. They may ignore the relationship between diseases or miRNAs. Secondly, seldom heterogeneous network-based methods consider the miRNA/disease similarity network’s modular structure. Although some network embedding-based methods, i.e., DNRLMF-MDA, NCMCMDA, learn node representation only considering the constraint from part of neighbors, most of them ignore the modular information of miRNAs and diseases. Lu et al. (2008) constructed disease network by giving two diseases an edge if they share at least one common associated miRNA. Diseases cluster together, which suggests that some diseases form modules sharing similar associations at the miRNA level. Moreover, the disease-associated miRNAs show various dysfunctions, such as mutation, upregulation, deleted, and downregulation. On the other hand, groups of homologous miRNA belong to the same miRNA families. They might have similar functions, and therefore, their dysfunction would lead to a similar phenotype. By analyzing members in disease modules or miRNA modules, researchers found that most of the members in the miRNA module are related to the same disease, and the members in the disease modules are mostly related to the same miRNA too. Therefore, this finding can guide us to predict novel disease-related miRNAs. In this work, we propose a novel heterogeneous network embedding-based method for predicting miRNA-disease associations. We calculate the disease semantic similarity, diseases functional similarity, miRNA functional similarity, and compute the GIP kernel similarity of miRNAs and diseases. Then, we integrate these similarities and construct a heterogeneous network of the disease similarity network, miRNA functional similarity network and a known miRNA-disease association network. After that, we propose a Module-based Dynamic Neighborhood Non-negative Matrix Tri-Factorization (MDN-NMTF) to learn the latent vector representation for miRNAs and diseases in the heterogeneous network. Finally, the association probability is computed by the product of the latent miRNA and disease vectors. MDN-NMTF not only successfully integrates diverse biological information of miRNAs and diseases to predict miRNA-disease associations, but also considers the module properties of miRNAs and diseases in the course of learning vector representation, which can maximally preserve the heterogeneous network structural information and the network properties. Meanwhile, we also extend MDN-NMTF to a new version (called MDN-NMTF2) by using the modular information to improve the prediction ability of MDN-NMTF. Our methods, as well as the other four existing methods [DNRLMF-MDA (Yan et al., 2019), IMCMDA (Chen et al., 2018), UBiRW (Luo and Xiao, 2017), and GRNMF (Xiao et al., 2018)], are applied to predict miRNA-disease associations on four data sets. The prediction results show that compared with the four existing methods, our methods can improve the performance of miRNA-disease association prediction to a high level.

Materials

Four datasets (see Table 1), namely HMDD2.0-You (Li et al., 2014), HMDD2.0-Lan (Lan et al., 2016), HMDD2.0-Yan, and HMDD3.0[1], were used to evaluate our methods and the other existing methods. HMDD2.0-You, HMDD2.0-Lan, and HMDD2.0-Yan were from HMDD database version 2.0. The HMDD2.0-You dataset includes 495 miRNAs, 380 diseases and 5,424 miRNA-disease associations. The HMDD2.0-Lan dataset consists of 550 miRNAs, 329 diseases and 6,084 miRNA-disease associations. The HMDD2.0-Yan dataset includes 576 miRNAs, 356 diseases, and 6,391 miRNA-disease associations. The HMDD3.0 dataset came from HMDD database version 3.0, which involves 1,207 miRNAs, 894 diseases, and 18,732 miRNA-disease associations. To calculate the functional similarity of diseases, we extracted the functional similarity scores of gene-gene pairs from the HumanNet database that contains 16,243 genes and 476,399 associations (Lee et al., 2011). The disease-gene associations of HMDD2.0-You, HMDD2.0-Yan, and HMDD3.0 were obtained from the DisGeNET database, where includes 13,000 diseases, over 16,000 genes, and 380,000 disease-gene associations (Piñero et al., 2015). The disease-gene associations of HMDD2.0-Lan were downloaded from the SIDD database, containing 2,603 genes, 2,817 diseases, and 117,190 disease-gene associations (Cheng et al., 2013).

TABLE 1

The number of MiRNAs, diseases and miRNA-disease associations in four datasets.

Dataset	n_m	n_d	n_md
HMDD2.0-You	495	380	5424
HMDD2.0-Lan	550	329	6084
HMDD2.0-Yan	576	356	6391
HMDD3.0	1207	894	18732

The number of MiRNAs, diseases and miRNA-disease associations in four datasets.

Methods

The MDN-NMTF model aims to learn the representation vectors for miRNAs and diseases and to achieve better prediction for disease-related miRNAs. It can maximally maintain their features in original spaces, i.e., known miRNA-disease associations, miRNA similarity network structure, and disease similarity network structure. The preparing process of MDN-NMTF is broadly divided into four steps: building the networks, learning feature representation; reconstructing the miRNA-disease association network, predicting miRNA-disease associations (see Figure 1).

FIGURE 1

The flowchart of MDN-NMTF and MDN-NMTF2 to predict miRNA-disease association. The MDN-NMTF model takes four steps to predict miRNA-disease associations: building the similarity networks, learning feature representation for miRNA and diseases; reconstructing the miRNA-disease association network, predicting miRNA-disease associations. MDN-NMTF2 is an extended version of MDN-NMTF. It divides the miRNAs and diseases into several modules on the basis of the representation vectors learned by MDN-NMTF. MDN-NMTF2 calculates the similarity of two miRNAs or two diseases based on the module they belong to and infers the novel miRNA-disease associations from similar miRNAs or diseases in the same modules.

Disease Semantic Similarity

The disease semantic similarity between diseases is calculated using Mesh descriptors of diseases (Nelson et al., 2002). The disease terms can be represented as a direct acyclic graph (DAG), where nodes represent disease terms and edges represent the associations between diseases. The similarity between diseases can be calculated according to their common ancestors in the DAGs. Let DAG represent disease d, DAG = (T). T is the set composed of all parent disease nodes of d and itself, and E is the set of all edges between disease nodes within T. The formula to calculate the semantic value DV of diseases t and d is as follows: Where t is the set of all common ancestors of diseases d. Δ is the semantic contribution factor, whose value is between 0 and 1. We set the value of Δ to 0.5 in this study, similar to the values in (Yan et al., 2019). DS(d) is the semantic values of a disease d in DAG. The semantic similarity between disease d and disease d is as follows:

Disease Functional Similarity

Calculating disease functional similarity is based on the assumption that similar diseases target similar disease genes (Cheng et al., 2014). Therefore, given a pair of diseases d and d, the functional similarity is defined as: Where G = {g,g,…,gam} and G = {g,g,…,gbn} are two gene sets which associate with diseases d and d, respectively, and m and n are the numbers of genes in G and G, respectively. GFS(gai) denotes the functional similarity between gene g and the genes in G. It can be defined as below: Where FS(g, g) denotes the functional similarity between gene g and gene g, which is obtained from HumanNet dataset (Lee et al., 2011) in this work. In the same way, the GFS(gbj) can be computed.

MiRNA Functional Similarity

The miRNA functional similarity between two miRNAs m1 and m2 is calculated based on the semantic similarity of diseases to which they are related. It can be defined as follows: Where n1 and n2 are the number of diseases that are associated with miRNAs m1 and m2, respectively. DT1 and DT2 are the sets of diseases that are associated with miRNAs m1 and m2, respectively. MFS (dt1i) is the semantic similarity of disease dt and the diseases in DT, which is defined as below: Where D (dt) is the semantic similarity between diseases dt1i and dt2j.

GIP Kernel Similarity

It is observed that miRNAs with similar functions are more likely to be associated with similar diseases and vice versa. According to this observation, GIP kernel similarity is constructed to describe the miRNA similarity and disease similarity (Laarhoven et al., 2011). First, we defined a binary vector IP (m) to represent the interaction profile of miRNA m by observing whether or not there is a known association between miRNA m and every disease. Then, the GIP similarity between miRNA m and m can be calculated as: Where, γ controls the kernel bandwidth, which normalizes another bandwidth parameter γ by the average number of related miRNAs per disease. γ is defined as follows: Here is set to be 1 based on the previous study (Yan et al., 2019). n is the number of miRNAs. Thus, the GIP kernel similarity between disease d and d is defined as follows: Where γd is also set to 1 and n is the number of diseases.

Integrating Similarity for miRNAs and Diseases

Because not all miRNA-miRNA pairs have functional similarity, the GIP kernel similarity for miRNA is interpolated to the miRNA functional similarity to obtain the integrated similarity for miRNA. The final miRNA similarity matrix between miRNA m and miRNA m is defined as follows: Similarly, the final disease similarity between disease d and disease d is defined as follows:

Regularized by Dynamic Neighborhood

Similar to previous DNRLMF-MDA method (Yan et al., 2019), we only preserve the relationships between a miRNA or a disease and their closest neighbors, when projecting the miRNA or disease to their latent spaces. Let N(m) and N(d) denote the set of nearest neighbors of miRNA m and disease d, respectively. The numbers of nearest neighbors of the miRNAs are not fixed but are dynamically determined according to Eq. (15). For miRNA m, h(m) denotes its number of nearest neighbors, which can be as follows: Where ε is the control parameter. It is set to 0.56 via cross-validation. The rs(m) is a ranked vector based on the similarity between miRNA m and other miRNAs from high to low, and rs(m) is the lth most similar value. H integer ranges from 1 to the total number of m’s neighbors and l (the exponent of ε) is a dynamic variable integer to satisfy the constraint. Similarly, for disease d, its number of nearest neighbors (h(d)) also can be formulated as follows: Where rs(d) is a ranked vector based on the similarities between disease d and other diseases from high to low, and rs(d) is the lth most similar value. Let matrix A be the dynamic nearest neighborhood matrix of miRNAs, its element a is calculated as below: Similarity, let matrix B be the dynamic nearest neighborhood matrix of diseases, its element b can be calculated as below: In this work, we assume that if the two miRNAs or two diseases are the nearest neighbors in their original similarity networks, they should show similar representations in the corresponding latent spaces. Hence, the following two regularization terms are designed for miRNA and disease, respectively, which will be incorporated into the MDN-NMTF objective function. The regularization term for miRNAs can be defined as the following equation (Liu et al., 2016): Where Tr() is the trace of a matrix, and , in which D and are the diagonal matrices, whose diagonal elements are and , respectively. G represents the latent matrices of all miRNAs. Similarity, the regularization term for diseases can be defined as the following equation: Where , in which D and are the diagonal matrices, whose diagonal elements are and , respectively. And G represents the latent matrices of all diseases.

The MDN-NMTF Model

Let R ∈ ℝ and R ∈ ℝ denote the adjacency matrix of the miRNA similarity network and disease similarity network, respectively. The latent matrices of all miRNAs and diseases are represented as G ∈ ℝ and G ∈ ℝ, respectively. K ∈ ℝ denotes the association matrix between miRNA modules and disease modules. Let D ∈ ℝ be the matrix storing the known miRNA-disease associations. The MDN-NMTF learns the representation vectors for miRNAs (G) and disease (G) by optimizing the following objective function. In Eq. (21), the term of captures the intrinsic module structure within the original miRNA similarity matrix. Because the values in G record the modules the miRNAs belong to and S records the relationship of these modules. ⊙ is the Hadamard product. The term of indicates the miRNAs and diseases share similar relationship both in their original space and the latent space at the module level. We only want to use the known miRNA-disease information to learn their representation matrixes. Hence, let Y ∈ ℝ be a label weighted matrix [see Eq. (22)], where the elements of Y are set to 1 if the miRNA is known to associate with the disease. The elements of Y are set to 0.2 if the miRNA is known to not associate with the disease. Otherwise, the elements of Y are set to 0. Here, we set different weight for knowing to have or have no miRNA-disease associations. Because it is hard to prove that the miRNAs do not associate to certain diseases and some associations are temporarily not annotated due to the limitation of techniques. The terms of Tr (G) [see Eq. (19)] and Tr (G) [see Eq. (20)] are used to preserve the network structure of the original miRNA similarity network and disease similarity network, respectively. We introduce L and L to represent the dynamic neighborhood of miRNAs and diseases, respectively, (see section “The MDN-NMTF model”). Two terms of and are adopted to penalize the magnitudes of the G and G for avoiding overfitting. is relaxed the constraint to K’K = I. λ, λ, and λ are balance parameters of matrix tri-factorization. α and α are regularization term parameters. β and β are the dynamic neighborhood regularization parameters. ω is the k-constraint parameter. In this work, the values of k, k, λ, λ, λ, α, α, β, β, and ω are set to 200, 200, 0.001, 5, 0.1, 0.2, 0.8, 90, 1.5, and 160, respectively (Supplementary Table 1).

Computation of S, S, G, G, and K

To obtain the optimal solution of S, S, G, G, and K in the objective function of MDN-NMTF model Eq. (21), we take the partial derivative of the objective function with respect to S, S, G, G, and K, respectively. Following the Karush–Kuhn–Tucker (KKT) condition for the non-negativity of S, S, G, G, and K and setting the partial derivative equal to zero, we can update S, S, G, G, and K as follows. In this algorithm, ⊙ denotes the Hadamard product, and ÷ is entry-wise division for matrices. As shown in section “The MDN-NMTF model,” A and B are the dynamic nearest neighborhood matrix of miRNAs and diseases, respectively.

Predicting miRNA-Disease Associations

After getting the low-rank matrixes G, K, and G, we rebuild matrix D1 by the produce of the matrixes G, K, and G (D1 = GKG) to predict miRNA-disease associations. The elements in D1 denote the probability between miRNAs and diseases. Following is the pseudocode of MDN-NMTF algorithm. In the third step of the while loop, each update iteration replaces the zero value in the matrices with 10–9 to guarantee the constraint condition in Eq. (21). The convergence condition is that the difference between two objective functions in the iteration is less than 10–6 or the number of iterations reaches the maximum number of iterations of 1,000.

Predicting miRNA-Disease Associations With Modular Information

At the same time, we also extend MDN-NMTF to a new version (called MDN-NMTF2, see Algorithm MDN-NMTF2) by using the modular information to improve the miRNA-disease association prediction ability of MDN-NMTF. Since the factorized matrices G and G obtained from MDN-NMTF record the modules the miRNAs or diseases belong to. MDN-NMTF2 utilizes the G and G values to partition the miRNAs and diseases into different models. Given G ∈ ℝ and G ∈ ℝ, there are k miRNA modules and k disease modules. The elements with relatively large values of each column of G (G) is assigned to the members of the corresponding module. We calculate the threshold for each miRNA (i.e., each row g(i,⋅) of G) with: where , , t is a given threshold. Based on this rule, we determined miRNA m as the kth module member if the entries of g(i,k) are larger than Th (i). In the same way, the threshold for each disease [each row g(i,⋅) of G] can be calculated. According to the settings of Ma et al. (2020), we also set t = 1.5 to identify miRNA and disease modules with proper resolution. Then we calculate the similarity of two miRNAs based on the module they belong to. If two miRNAs m and m belong to the same miRNA module, their similarity in the kth miRNA module (ms) can be constructed as: The Sim(u, v) can be calculate as Eq.(30): Here, k represents the dimension of the vectors u and v. u and v represent the kth element of the vectors u and v. Similarly, if two diseases d and d belong to the same disease module, we can construct their similarity in the kth disease module (ds) as Based on the assumption that the miRNAs in the same modules are highly likely related to the same diseases, vice versa, we use Scorem(m,d) to represent the correlation score between the disease d and the miRNA m in the kth miRNA module. It can be calculated according to Eq. (32): Where D ∈ ℝ is the matrix storing the known miRNA-disease associations. Thus, let D be the miRNA-disease associations that are predicted based on miRNA modules, which can be defined as: Similarly, we can get the correlation score (Scored) in the kth disease module and predict miRNA-disease associations (D) based on disease modules as follows: Here, k and k, as shown in the previous section, denote the number of miRNA modules and disease modules. The final predicted miRNA-disease associations of MDN-NMTF2 can be calculated by: Here, ∗ denotes the Min-Max Normalization of the matrix. Following is the pseudocode of MDN-NMTF2.

Results

Performance Evaluation

To evaluate the performances of MDN-NMTF and MDN-NMTF2, we compared them with four state-of-the-art methods (DNRLMF-MDA, IMCMDA, UBiRW, and GRNMF). The UBiRW uses an unbalanced Bi-Random walk on the heterogeneous network to propagate information and to infer potential miRNA-disease associations. DNRLMF-MDA, IMCMDA, and GRNMF are three latest network embedding-based methods. DNRLMF-MDA adopts a dynamic neighborhood regularized logistic matrix factorization method to predict potential miRNA-disease associations. IMCMDA uses Inductive matrix completion for miRNA-disease association prediction. GRNMF infer possible associations for all disease by a graph regularized non-negative matrix factorization framework. Considering there is no available interaction observed for new diseases or miRNAs, GRNMF develops a preprocessing step to construct the miRNA-disease associations according to the neighbors’ information. We implemented cross-validation under two different settings to evaluate the performance of the proposed methods. The two different settings are 5-fold randomly zeroing and single-column zeroing. For 5-fold randomly zeroing cross-validation, all the known miRNA-disease associations are randomly and equally divided into five non-overlapping parts. In each round, one of the five parts is for testing and the corresponding values in matrix D are cleared as 0, and the other four parts are as positive samples for training. Note that the miRNA and disease similarity network should be recalculated in each round. Single-column zeroing is to clear all miRNA-disease associations of a particular column of diseases and take them as testing data, others as training sets, and finally sum all AUCs to get the mean value. We repeat the cross-validation 20 times on four different datasets and show the average values in the following sections. For HMDD2.0-You, HMDD2.0-Lan, HMDD2.0-Yan and HMDD3.0 datasets, we select the way illustrated in section “Materials” to calculate the miRNA similarity network and disease similarity network. To make the comparison fair, we tuned the parameters for every method to perform them the best in all of our experiments through randomly zeroing 5-fold cross-validation on HMDD2.0-You, HMDD2.0-Lan, and HMDD2.0-Yan. The detailed information, please see the online Supplementary Files (Supplementary Table 2).

Randomly Zeroing Cross-Validation

As we can see from Table 2, MDN-NMTF and MDN-NMTF2 possess the highest two performance among the four methods on all the four datasets in terms of AUC values. On HMDD2.0-You dataset, compared with other methods (DNRLMF-MDA: 0.9301 ± 0.0036, IMCMDA: 0.8285 ± 0.0068, UBiRW: 0.9196 ± 0.0036, and GRNMF: 0.9031 ± 0.0049), the AUC values of MDN-NMTF and MDN-NMTF2 achieve 0.9335 ± 0.0037 and 0.9354 ± 0.0035, respectively. On HMDD2.0-Yan dataset, the prediction performance of MDN-NMTF and MDN-NMTF2 are the best two because their AUC values are 0.9409 ± 0.0030 and 0.9424 ± 0.0033, compared with other methods (DNRLMF-MDA: 0.9384 ± 0.0031, IMCMDA: 0.8045 ± 0.0062, UBiRW: 0.9191 ± 0.0030, GRNMF: 0.9153 ± 0.0045). On HMDD2.0-Lan dataset, the AUC values of MDN-NMTF and MDN-NMTF2 are 0.9391 ± 0.0033 and 0.9415 ± 0.0033, which are both superior to the other results of DNRLMF-MDA (0.9369 ± 0.0030), IMCMDA (0.7216 ± 0.0072), UBiRW (0.9198 ± 0.0032), and GRNMF (0.9157 ± 0.0044). On HMDD3.0 dataset, the AUC values of the MDN-NMTF and MDN-NMTF2 are 0.9435 ± 0.0021 and 0.9467 ± 0.0020, which is better than that of DNRLMF-MDA method (0.9390 ± 0.0015), that of IMCMDA method (0.6572 ± 0.0052), that of UBiRW method (0.9280 ± 0.0016) and that of GRNMF method (0.9247 ± 0.0023). We observe that DNRLMF-MDA leads to the highest performance among the four existing methods. It may be the DNRLMF-MDA method adopts a dynamic neighborhood regularized logistic matrix factorization method to predict potential miRNA-disease associations. Both our MDN-NMTF and MDN-NMTF2 methods and the DNRLMF-MDA method utilize the dynamic neighborhood regularized restriction to construct the miRNA and disease feature vectors. We observe that our MDN-NMTF and MDN-NMTF2 methods outperform DNRLMF-MDA. It can be partially attributed to the high quality of miRNA and disease features extracted by our MDN-NMTF and MDN-NMTF2 method from the heterogeneous network under the consideration of the networks’ module properties. MDN-NMTF2 employs the miRNA and disease features extracted by MDN-NMTF method to partition miRNA and disease modules. It infers potential miRNA-disease associations by considering the miRNAs’ neighbors and diseases’ neighbors in the same modules, which makes the MDN-NMTF2 method achieves a clear improvement than the MDN-NMTF method when predicting the missing miRNA-disease associations.

TABLE 2

The AUC values for the models on different databases by randomly zeroing cross validation.

Methods	HMDD2.0-You	HMDD2.0-Yan	HMDD2.0-Lan	HMDD3.0
MDN-NMTF	0.9335 ± 0.0037	0.9409 ± 0.0030	0.9391 ± 0.0033	0.9435 ± 0.0021
MDN-NMTF2	0.9354 ± 0.0035	0.9424 ± 0.0033	0.9415 ± 0.0033	0.9467 ± 0.0020
DNRLMF-MDA	0.9301 ± 0.0036	0.9384 ± 0.0031	0.9369 ± 0.0030	0.9390 ± 0.0015
IMCMDA	0.8285 ± 0.0068	0.8045 ± 0.0062	0.7216 ± 0.0072	0.6572 ± 0.0052
UBiRW	0.9196 ± 0.0036	0.9191 ± 0.0030	0.9198 ± 0.0032	0.9280 ± 0.0016
GRNMF	0.9031 ± 0.0049	0.9153 ± 0.0045	0.9157 ± 0.0044	0.9247 ± 0.0023

The AUC values for the models on different databases by randomly zeroing cross validation. Besides, we test the performance of each method on 14 common diseases related to at least 110 miRNAs. Figure 2 illustrates the Receiver operating characteristics curves of each method on the 14 disease. Table 3 GRNMF lists the corresponding area under the curves (AUC). Both results show that MDN-NMTF outperforms the other four methods for all the 14 diseases.

FIGURE 2

TABLE 3

AUC values of MDN-NMTF and other four compared methods for the 14 diseases on HMDD2.0-Yan Dataset.

Disease name	MDN-NMTF	DNRLMF-MDA	IMCMDA	UBiRW	GRNMF
Breast neoplasms	0.8732	0.8274	0.8340	0.8194	0.8183
Non-small-cell lung carcinoma	0.8989	0.8748	0.8658	0.8614	0.8582
Renal cell carcinoma	0.8699	0.8089	0.7936	0.7592	0.7846
Glioblastoma	0.8740	0.8280	0.8414	0.8248	0.8336
Heart failure	0.8528	0.7797	0.7864	0.7962	0.8120
Hepatocellular carcinoma	0.8408	0.7585	0.7588	0.7916	0.7846
Lung neoplasms	0.9207	0.9077	0.8963	0.8922	0.8885
Melanoma	0.8898	0.8375	0.8211	0.8216	0.8251
Neoplasms	0.9555	0.9253	0.9227	0.9223	0.9264
Ovarian neoplasms	0.9262	0.8941	0.8863	0.8835	0.8885
Pancreatic neoplasms	0.9274	0.9035	0.8943	0.8886	0.9057
Prostatic neoplasms	0.8919	0.8623	0.8395	0.8184	0.8261
Stomach neoplasms	0.8641	0.8054	0.8164	0.8055	0.8071
Colorectal neoplasms	0.8899	0.8292	0.8350	0.8463	0.8425

The ROC curves of MDN-NMTF and other four methods for 14 diseases on HMDD2.0-Yan Dataset. The figure shows that the ROC curves of MDN-NMTF in 14 diseases are all higher than that of the other four methods. AUC values of MDN-NMTF and other four compared methods for the 14 diseases on HMDD2.0-Yan Dataset.

Single-Column Zeroing Cross Validation

It still is a challenging task to infer miRNA associations for a new disease. To assess whether the MDN-NMTF and MDN-NMTF2 methods can successfully predict related miRNA for new diseases, we perform single-column zeroing cross-validation. Table 4 lists the AUC values of different methods on four datasets. The AUC values of MDN-NMTF and MDN-NMTF2 still control the highest two in the four datasets. Compared to DNRLMF-MDA that has relatively better performance among the four existing methods, the MDN-NMTF method achieves 1.04% improvement on HMDD2.0-You dataset, 0.37% improvement on HMDD2.0-Yan dataset, 0.72% improvement on HMDD2.0-Lan dataset, and 1.18% improvement on HMDD3.0 dataset. The results prove that our methods considering the intrinsic module structure of miRNA and disease networks can extract the high quality of miRNA and disease features to predict related miRNAs for new diseases successfully. We observe that MDN-NMTF2 has a little lower performance than MDN-NMTF across the four datasets. It may be MDN-NMTF2 fails to infer the associations for new disease from the miRNA in the same modules.

TABLE 4

The AUC values of each method on four different datasets by single-column zeroing cross validation.

Methods	HMDD2.0-You	HMDD2.0-Yan	HMDD2.0-Lan	HMDD3.0
MDN-NMTF	0.8570 ± 0.1223	0.8482 ± 0.1265	0.8445 ± 0.1339	0.8917 ± 0.1108
MDN-NMTF2	0.8561 ± 0.1240	0.8473 ± 0.1292	0.8447 ± 0.1342	0.8896 ± 0.1142
DNRLMF-MDA	0.8482 ± 0.1355	0.8451 ± 0.1431	0.8385 ± 0.1487	0.8813 ± 0.1181
IMCMDA	0.8329 ± 0.1297	0.8214 ± 0.1290	0.8158 ± 0.1357	0.8781 ± 0.1308
UBiRW	0.8512 ± 0.1343	0.8403 ± 0.1356	0.8326 ± 0.1499	0.8794 ± 0.1341
GRNMF	0.7833 ± 0.1505	0.7504 ± 0.1618	0.7895 ± 0.1465	0.8245 ± 0.1502

The AUC values of each method on four different datasets by single-column zeroing cross validation.

Case Study

To further illustrate the performance of MDN-NMTF, we evaluate its miRNA prediction ability for some cancer types, such as Stomach Neoplasms (gastric Neoplasms) and Lymphoma. The dbDEMC database and miRCancer database are used as the benchmark datasets. Among cancer-related deaths worldwide, Stomach Neoplasms ranks the third. Increasing evidence indicates that many miRNAs interact with Stomach Neoplasms by regulating the related genes of Stomach Neoplasms. Table 5 demonstrates the top 50 predicted novel Stomach Neoplasms-related miRNAs predicted by MDN-NMTF on HMDD2.0-Yan dataset and the corresponding evidence. Table 5 shows 35 of the 50 miRNAs are validated by dbDEMC database and miRCancer database. The remaining 15 miRNAs are all found to be related to human diseases in the literature. miR-181b modulates multidrug resistance by targeting BCL2 in human cancer cell lines (Zhu et al., 2010). MicroRNA-125b affects the proliferation of gastric cancer cells (Yang et al., 2013). miR-15b and miR-16 modulate multidrug resistance by targeting BCL2 in human gastric cancer cells (Xia et al., 2008). miR-101-2, miR-125b-2, and miR-451a act as potential tumor suppressors in primary GCs as well as in GC-derived AGS cells (Riquelme et al., 2016). MicroRNA-181b targets cAMP-responsive element-binding protein 1 in gastric adenocarcinomas (Chen L. et al., 2012). Plasma miRNA-199a-3p and miRNA-151-5p are significantly elevated (p < 0.05) and are significantly reduced after surgery (p < 0.05) in gastric cancer patients (Li et al., 2012). Genomic loss of miR-486 regulates tumor progression and the OLFM4 antiapoptotic factor in gastric cancer (Oh et al., 2011). Lack of microRNA-101 causes E-cadherin functional deregulation through EZH2 upregulation in intestinal gastric cancer (Carvalho et al., 2012). Significant associations are found between hypermethylation of the hsa-miR-124a and tumor size, differentiation, lymphatic metastasis, and invasion depth (Pei et al., 2011). miR-103, miR-21, miR-145, miR-106b, miR-146a, and miR-148a separate node-positive from node-negative gastric cancers (Tchernitsa et al., 2010). miR-7 is a novel mechanism by which the inflammatory response promotes gastric tumorigenesis (Kong et al., 2012).

TABLE 5

Top 50 Related miRNAs of Stomach Neoplasms predicted by MDN-NMTF on HMDD2.0-Yan Dataset.

Top1-25 miRNA	Evidence	Top26-50 miRNA	Evidence
hsa-mir-21	dbDEMC, miRCancer	hsa-mir-199a-1	PMID:22956063
hsa-mir-214	dbDEMC, miRCancer	hsa-mir-22	dbDEMC, miRCancer
hsa-mir-200b	dbDEMC, miRCancer	hsa-mir-375	dbDEMC, miRCancer
hsa-mir-200c	miRCancer	hsa-mir-486	PMID:21415212
hsa-mir-182	dbDEMC, miRCancer	hsa-mir-106a	dbDEMC, miRCancer
hsa-mir-221	dbDEMC, miRCancer	hsa-mir-16-1	miRCancer
hsa-mir-181b-1	PMID:20162574	hsa-mir-222	dbDEMC, miRCancer
hsa-mir-148a	dbDEMC, miRCancer	hsa-mir-101-1	PMID:22450781
hsa-mir-34c	miRCancer	hsa-mir-10b	dbDEMC, miRCancer
hsa-mir-146b	miRCancer	hsa-mir-195	dbDEMC, miRCancer
hsa-mir-34a	dbDEMC, miRCancer	hsa-mir-141	dbDEMC, miRCancer
hsa-mir-125b-1	PMID:23128435	hsa-mir-101-2	PMID:26458815
hsa-mir-200a	dbDEMC, miRCancer	hsa-mir-146a	miRCancer
hsa-mir-31	dbDEMC, miRCancer	hsa-mir-199a-2	PMID:22956063
hsa-mir-145	dbDEMC, miRCancer	hsa-mir-106b	dbDEMC, miRCancer
hsa-mir-126	dbDEMC, miRCancer	hsa-mir-143	dbDEMC, miRCancer
hsa-mir-34b	miRCancer	hsa-mir-124-1	PMID:21365509
hsa-mir-16-2	PMID:18449891	hsa-mir-124-2	PMID:21365509
hsa-mir-125b-2	PMID:26458815	hsa-mir-103a-2	PMID:20726036
hsa-mir-107	dbDEMC, miRCancer	hsa-mir-130a	dbDEMC, miRCancer
hsa-mir-223	dbDEMC, miRCancer	hsa-mir-27b	dbDEMC, miRCancer
hsa-mir-183	dbDEMC, miRCancer	hsa-mir-155	dbDEMC, miRCancer
hsa-mir-27a	dbDEMC, miRCancer	hsa-mir-335	miRCancer
hsa-mir-25	dbDEMC, miRCancer	hsa-mir-151a	PMID:22956063
hsa-mir-181b-2	PMID:22539488	hsa-mir-7-1	PMID:22139078

Top 50 Related miRNAs of Stomach Neoplasms predicted by MDN-NMTF on HMDD2.0-Yan Dataset. Lymphoma is a type of cancer that begins in immune system cells. It is one of the top 10 deadly diseases. Table 6 shows the result of top 50 Lymphoma-related miRNAs detected by MDN-NMTF on the HMDD2.0-Yan dataset. It shows that 39 of the 50 miRNAs are validated by dbDEMC database and miRCancer database. The remaining 11 miRNAs are all found to be disease-related in the literature. The plasma miR-92a value could be a novel biomarker not only for diagnosis but also for monitoring lymphoma patients after chemotherapy (Ohyashiki et al., 2011). Compared with healthy canine peripheral blood mononuclear cells and normal lymph nodes, mir-181a shows a decreased expression level (Uhl et al., 2011). miR-26a is repressed by MYC (Sander et al., 2009). The down-regulation of miR-16, miR-26a, miR-101, miR-29c, and miR138 in the t(14;18)-negative FL (follicular lymphoma) subset is associated with profound mRNA expression changes of potential target genes involving cell cycle control, apoptosis and B-cell differentiation. miR-16 targets CHEK1 showing increased expression on the protein level in t(14;18)-negative FL, while reducing TCL1A expression, in line with a partial loss of the germinal center B-cell phenotype in this FL subset (Leich et al., 2011). mir-499a is deregulated hypermutations (Navarro et al., 2009). miR-24 is overexpressed (Gibcus et al., 2009). A distinct set of five microRNAs (miR-150, miR-550, miR-124a, miR-518b, and miR-539) is shown to be differentially expressed in gastritis as opposed to MALT lymphoma (Thorns et al., 2012). miR-125b-5p not only regulates tumor growth in vivo but also increases cellular resistance to proteasome inhibitors via modulation of MAD4 (Manfè et al., 2013).

TABLE 6

Top 50 Related miRNAs of Lymphoma predicted by MDN-NMTF on HMDD2.0-Yan Dataset.

Top1-25 miRNA	Evidence	Top26-50 miRNA	Evidence
hsa-mir-17	dbDEMC, miRCancer	hsa-mir-363	dbDEMC
hsa-mir-20a	dbDEMC, miRCancer	hsa-mir-150	dbDEMC, miRCancer
hsa-mir-155	dbDEMC, miRCancer	hsa-mir-126	dbDEMC
hsa-mir-18a	dbDEMC, miRCancer	hsa-mir-200b	dbDEMC
hsa-mir-19a	dbDEMC, miRCancer	hsa-mir-184	dbDEMC
hsa-mir-19b-1	miRCancer	hsa-mir-200a	dbDEMC
hsa-mir-92a-1	PMID:21383985	hsa-mir-499a	PMID:19690137
hsa-mir-15a	dbDEMC, miRCancer	hsa-mir-34a	dbDEMC
hsa-mir-146a	dbDEMC	hsa-mir-210	dbDEMC
hsa-mir-19b-2	miRCancer	hsa-mir-200c	dbDEMC
hsa-mir-16-1	miRCancer	hsa-mir-205	dbDEMC
hsa-mir-16-2	miRCancer	hsa-mir-145	dbDEMC
hsa-mir-21	dbDEMC, miRCancer	hsa-mir-24-1	PMID:19177201
hsa-mir-92a-2	PMID:21383985	hsa-mir-125b-1	dbDEMC
hsa-mir-181a-1	dbDEMC	hsa-mir-20b	dbDEMC
hsa-mir-181a-2	PMID:21910161	hsa-mir-125a	dbDEMC
hsa-mir-26a-2	dbDEMC	hsa-mir-124-1	PMID:22395483
hsa-mir-26a-1	PMID:19197161	hsa-mir-141	dbDEMC
hsa-mir-122	dbDEMC	hsa-mir-125b-2	PMID:23527180
hsa-mir-101-1	PMID:21960592	hsa-mir-18b	dbDEMC
hsa-mir-101-2	PMID:21960592	hsa-mir-138-2	dbDEMC
hsa-mir-342	dbDEMC	hsa-mir-29c	dbDEMC
hsa-mir-486	dbDEMC	hsa-mir-138-1	PMID:21960592
hsa-mir-203	dbDEMC	hsa-mir-708	dbDEMC
hsa-mir-223	dbDEMC, miRCancer	hsa-mir-143	dbDEMC

Top 50 Related miRNAs of Lymphoma predicted by MDN-NMTF on HMDD2.0-Yan Dataset.

Module Analysis

To probe why the modules help the MDN-NMTF2 to obtain better result, we analyze the miRNA or disease modules detected by MDN-NMTF2 on the dataset HMDD2.0-Yan (Supplementary Texts 3, 4). Table 7 lists the details of these modules. There are 127 miRNA modules with more than one member after removing the modules. The average size of these modules is 40. There are 142 disease modules with more than one member and their average size is 22. The average function similarity of the members in the miRNA modules was 0.4409, which was 113.20% higher than the average value of 0.2068 of the whole miRNA function similarity network (Supplementary Text 5). Similarly, the average function similarity of the disease modules was 0.0939, which was 160.11% higher than the average value of 0.0361 of the whole disease function similarity network (Supplementary Text 6). It suggests that the miRNA modules and disease modules detected by MDN-NMTF2 consist of members with similar functions. We also find that average 82% of miRNAs in the same module are related to the same disease (Supplementary Text 7). Figure 3 shows an example of miRNA module that consists of 36 miRNAs. All of these miRNAs are associated with Leukemia Myeloid Acute. On the other hand, 61% of the disease in the same module relate to the same miRNA (Supplementary Text 8). Figure 4 illustrates an example of disease module with 13 members. 12 of 13 diseases in the module relate to a common miRNA has-mir-124-1 that expresses in human embryonic stem cells. Hence, the MDN-NMTF2 infers miRNA-disease associations from miRNAs or diseases in the same modules, which helps it achieve better prediction results.

TABLE 7

miRNA modules and disease modules detected by MDN-NMTF2 on HMDD2.0-Yan Dataset.

Modules	NM	AvgSize	AvgSim	AvgPc
miRNA	127	40	0.4409	82.20%
disease	142	22	0.0939	61.28%

FIGURE 3

An example of miRNA module detected by MDN-NMTF2 on HMDD2.0-Yan Dataset. The figure shows that all 36 miRNAs in the module are related to Leukemia Myeloid Acute.

FIGURE 4

An example of disease module detected by MDN-NMTF2 on HMDD2.0-Yan Dataset. The figure shows that 12 of 13 diseases in the module are related to a miRNA has-mir-124-1.

miRNA modules and disease modules detected by MDN-NMTF2 on HMDD2.0-Yan Dataset. An example of miRNA module detected by MDN-NMTF2 on HMDD2.0-Yan Dataset. The figure shows that all 36 miRNAs in the module are related to Leukemia Myeloid Acute. An example of disease module detected by MDN-NMTF2 on HMDD2.0-Yan Dataset. The figure shows that 12 of 13 diseases in the module are related to a miRNA has-mir-124-1.

Conclusion

Inferring miRNA-disease associations is a crucial step to manifest principles of disease prevention, disease diagnosis and drug development. In this study, we have presented a novel method named MDN-NMTF to predict miRNA-disease associations. It constructs a heterogeneous network of disease similarity network, miRNA functional similarity network and a known miRNA-disease association network. After that, it learns the vector representation for miRNAs and diseases in the heterogeneous network by a matrix tri-factorization method under the constraint of the module structure and dynamic neighborhood. Finally, MDN-NMTF predicts novel miRNA-disease association probability by the product of the miRNA and disease latent vectors. At the same time, we also extend MDN-NMTF to a new version (called MDN-NMTF2) by using the modular information. Compared with the previous network propagation-based method, like UBiRW, MDN-NMTF, and MDN-NMTF2 project miRNAs and diseases to a latent space. It can successfully integrate diverse biological information of miRNAs and diseases to predict miRNA-disease associations. Compared with the network embedding-based methods, like DNRLMF-MDA, IMCMDA and GRNMF, and MDN-NMTF and MDN-NMTF2 consider the module properties of miRNAs and diseases in the course of learning vector representation, which can maximally preserves the heterogeneous network structural information and the network properties. In particular, MDN-NMTF2 not only considers the modularity in the feature learning process but also uses the miRNA module and disease module information when reconstructing the miRNA-disease association matrix. We test our methods and the other four existing methods on four different datasets by implementing randomly zero cross-validation and single-column zero cross-validation. The results show that our methods outperform the state-of-the-art methods not only on predicting the missing miRNA-disease associations but also on recommending related miRNA for new diseases.

Data Availability Statement

Publicly available datasets were analyzed in this study. This data can be found here: https://github.com/weiba/MDN-NMTF.

Author Contributions

WP and JD obtained and analyzed miRNA-related data, disease-related data, and miRNA-disease associations. WP, JD, WD, and WL designed the new method MDN-NMTF and analyzed the results. WP and JD drafted the manuscript together. WP, JD, WD, and WL participated in revising the draft. All authors have read and approved the manuscript.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Algorithm MDN-NMTF
Input: miRNA similarity R_m; disease similarity R_d; miRNA-disease association D; the parameters λ₁, λ₂, λ₃, α₁, α₂, β₁, β₂, and ω.
Output: G_m, G_d, S_m, S_d, K, and D1
1: Initialize matrices G_m, G_d, S_m, S_d, and K with random non-negative matrices, while S_m, S_d are symmetric matrices.
2: Calculate the dynamic neighbor matrices A and B by Eqs (17, 18), and then calculate the diagonal matrices D_m and D_d, and the Laplacian matrix L_m and L_d.
3: While objective function value in Eq. (21) not converge do
(1) Fix G_m, G_d, S_d, and K and update S_m with Eq. (23).
(2) Fix G_m, G_d, S_m, and K and update S_d with Eq. (24)
(3) Fix G_d, S_m, S_d, and K and update G_m with Eq. (25).
(4) Fix G_m, S_m, S_d, and K and update G_d with Eq. (26).
(5) Fix G_m, G_d, S_m, and S_d and update K with Eq. (27).
end while
4: Rebuild miRNA-disease association matrix D1 = G_mKG_d′.

Algorithm MDN-NMTF2
Input: miRNA similarity matrix R_m; disease similarity matrix R_d and miRNA-disease association matrix D; the parameters λ₁, λ₂, λ₃, α₁, α₂, β₁, β₂, and ω.
Output: D2
1: Get G_m, G_d, S_m, S_d, K, and D1 by the Algorithm MDN-NMTF
2: Determine miRNA m_i as the kth module member if the entries of g_m(i,k) are larger than Th (i) [see Eq. (28)]
3. Determine disease d_i as the kth module member if the entries of g_d(i,k) are larger than Th (i) [see Eq. (28)]
4. Calculate similarity of two miRNAs If they belong to the same miRNA module by Eq. (29)
5. Calculate D_m as the miRNA-disease associations that are predicted based on miRNA modules [see Eq. (33)]
6. Calculate similarity of two diseases If they belong to the same disease module by Eq. (31)
7. Calculate D_d as the miRNA-disease associations that are predicted based on disease modules [see Eq. (35)]
8: Calculate the final predicted miRNA-disease associations D2 = D1^∗+D_m^∗/2+D_d^∗/2.

51 in total

1. Gaussian interaction profile kernels for predicting drug-target interaction.

Authors: Twan van Laarhoven; Sander B Nabuurs; Elena Marchiori
Journal: Bioinformatics Date: 2011-09-04 Impact factor: 6.937

2. Predicting miRNA-disease association based on inductive matrix completion.

Authors: Xing Chen; Lei Wang; Jia Qu; Na-Na Guan; Jian-Qiang Li
Journal: Bioinformatics Date: 2018-12-15 Impact factor: 6.937

3. Inferring microRNA-disease associations by random walk on a heterogeneous network with multiple data sources.

Authors: Yuansheng Liu; Xiangxiang Zeng; Zengyou He; Quan Zou
Journal: IEEE/ACM Trans Comput Biol Bioinform Date: 2016-04-05 Impact factor: 3.710

4. Predicting MicroRNA-Disease Associations Based on Improved MicroRNA and Disease Similarities.

Authors: Wei Lan; Jianxin Wang; Min Li; Jin Liu; Fang-Xiang Wu; Yi Pan
Journal: IEEE/ACM Trans Comput Biol Bioinform Date: 2016-07-07 Impact factor: 3.710

5. Neighborhood Regularized Logistic Matrix Factorization for Drug-Target Interaction Prediction.

Authors: Yong Liu; Min Wu; Chunyan Miao; Peilin Zhao; Xiao-Li Li
Journal: PLoS Comput Biol Date: 2016-02-12 Impact factor: 4.475

6. [Role of miR-124a methylation in patients with gastric cancer].

Authors: Lei Pei; Jia-zeng Xia; Hong-yu Huang; Rong-rong Zhang; Lu-bin Yao; Liang Zheng; Bo Hong
Journal: Zhonghua Wei Chang Wai Ke Za Zhi Date: 2011-02

7. Genomic loss of miR-486 regulates tumor progression and the OLFM4 antiapoptotic factor in gastric cancer.

Authors: Hue-Kian Oh; Angie Lay-Keng Tan; Kakoli Das; Chia-Huey Ooi; Nian-Tao Deng; Iain Beehuat Tan; Emmanuel Beillard; Julian Lee; Kalpana Ramnarayanan; Sun-Young Rha; Nallasivam Palanisamy; P Mathijs Voorhoeve; Patrick Tan
Journal: Clin Cancer Res Date: 2011-03-17 Impact factor: 12.531

8. Benchmark of computational methods for predicting microRNA-disease associations.

Authors: Zhou Huang; Leibo Liu; Yuanxu Gao; Jiangcheng Shi; Qinghua Cui; Jianwei Li; Yuan Zhou
Journal: Genome Biol Date: 2019-10-08 Impact factor: 13.583

9. An analysis of human microRNA and disease associations.

Authors: Ming Lu; Qipeng Zhang; Min Deng; Jing Miao; Yanhong Guo; Wei Gao; Qinghua Cui
Journal: PLoS One Date: 2008-10-15 Impact factor: 3.240

10. SIDD: a semantically integrated database towards a global view of human disease.

Authors: Liang Cheng; Guohua Wang; Jie Li; Tianjiao Zhang; Peigang Xu; Yadong Wang
Journal: PLoS One Date: 2013-10-11 Impact factor: 3.240

2 in total

1. DNRLCNN: A CNN Framework for Identifying MiRNA-Disease Associations Using Latent Feature Matrix Extraction with Positive Samples.

Authors: Jiancheng Zhong; Wubin Zhou; Jiedong Kang; Zhuo Fang; Minzhu Xie; Qiu Xiao; Wei Peng
Journal: Interdiscip Sci Date: 2022-04-15 Impact factor: 2.233

2. Identifying Cancer Subtypes Using a Residual Graph Convolution Model on a Sample Similarity Network.

Authors: Wei Dai; Wenhao Yue; Wei Peng; Xiaodong Fu; Li Liu; Lijun Liu
Journal: Genes (Basel) Date: 2021-12-27 Impact factor: 4.096

2 in total