| Literature DB >> 26078775 |
Wen Dai1, Xi Liu1, Yibo Gao1, Lin Chen1, Jianglong Song1, Di Chen1, Kuo Gao2, Yongshi Jiang1, Yiping Yang1, Jianxin Chen2, Peng Lu1.
Abstract
There has been rising interest in the discovery of novel drug indications because of high costs in introducing new drugs. Many computational techniques have been proposed to detect potential drug-disease associations based on the creation of explicit profiles of drugs and diseases, while seldom research takes advantage of the immense accumulation of interaction data. In this work, we propose a matrix factorization model based on known drug-disease associations to predict novel drug indications. In addition, genomic space is also integrated into our framework. The introduction of genomic space, which includes drug-gene interactions, disease-gene interactions, and gene-gene interactions, is aimed at providing molecular biological information for prediction of drug-disease associations. The rationality lies in our belief that association between drug and disease has its evidence in the interactome network of genes. Experiments show that the integration of genomic space is indeed effective. Drugs, diseases, and genes are described with feature vectors of the same dimension, which are retrieved from the interaction data. Then a matrix factorization model is set up to quantify the association between drugs and diseases. Finally, we use the matrix factorization model to predict novel indications for drugs.Entities:
Mesh:
Year: 2015 PMID: 26078775 PMCID: PMC4452507 DOI: 10.1155/2015/275045
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Figure 1Strategy pipeline. Firstly, feature vectors of genes are extracted from gene interaction network. Then, feature vectors of the same rank are obtained for drugs and diseases from drug-gene interactions and disease-gene interactions, respectively. Next, a matrix factorization model is generated to reconstruct the known drug-disease associations. Finally, the estimated feature vectors of drugs and diseases are used to infer new drug-disease associations and predict novel drug indications.
Algorithm 1Algorithm for learning the matrix factorization model.
Figure 2The AUC results of models based on different penalization and learning rate. (a) Learning rate remains the same, while penalization coefficients increase twofold each time. (b) Penalization coefficient remains the same, while learning rate increases twofold each time.
Figure 3The AUC results of models based on different dimensions. Penalization coefficient and learning rate remain the same.
Figure 4The AUC results of models based on different folds of negative samples. Penalization coefficient and learning rate remain the same. The dimension of feature vectors is set as 32.
Performance of FV and MF.
| Our method | FV | MF | |
|
| |||
| AUC | 0.7508 | 0.5264 | 0.5778 |
AUC values obtained by 5 × 10-fold cross-validation.
| Pair prediction | Drug prediction | Disease prediction | |
|---|---|---|---|
| libFM | 0.7068 | 0.6637 | 0.349 |
| SVDFeature | 0.5699 | 0.6403 | 0.236 |
| libMF | 0.5868 | 0.6557 | 0.3173 |
| Our method | 0.7508 | 0.7216 | 0.532 |
Figure 5The AUC results of models based on different penalization coefficients. Learning rate remains the same. The dimension of feature vectors is set as 32.
AUC values obtained by 5 × 10-fold cross-validation.
| Pair prediction | Drug prediction | Disease prediction | |
|---|---|---|---|
| libFM | 0.8078 | 0.6226 | 0.3682 |
| SVDFeature | 0.7107 | 0.6069 | 0.3338 |
| libMF | 0.6388 | 0.6217 | 0.3878 |
| Our method | 0.8147 | 0.6573 | 0.5617 |