| Literature DB >> 31534955 |
Han-Jing Jiang1,2, Yu-An Huang3, Zhu-Hong You1,2.
Abstract
Computational drug repositioning, designed to identify new indications for existing drugs, significantly reduced the cost and time involved in drug development. Prediction of drug-disease associations is promising for drug repositioning. Recent years have witnessed an increasing number of machine learning-based methods for calculating drug repositioning. In this paper, a novel feature learning method based on Gaussian interaction profile kernel and autoencoder (GIPAE) is proposed for drug-disease association. In order to further reduce the computation cost, both batch normalization layer and the full-connected layer are introduced to reduce training complexity. The experimental results of 10-fold cross validation indicate that the proposed method achieves superior performance on Fdataset and Cdataset with the AUCs of 93.30% and 96.03%, respectively, which were higher than many previous computational models. To further assess the accuracy of GIPAE, we conducted case studies on two complex human diseases. The top 20 drugs predicted, 14 obesity-related drugs, and 11 drugs related to Alzheimer's disease were validated in the CTD database. The results of cross validation and case studies indicated that GIPAE is a reliable model for predicting drug-disease associations.Entities:
Mesh:
Year: 2019 PMID: 31534955 PMCID: PMC6732622 DOI: 10.1155/2019/2426958
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
General statistics on Fdataset and Cdataset.
| Datasets | Drugs | Diseases | Interactions |
|---|---|---|---|
| Cdataset | 663 | 409 | 2532 |
| Fdataset | 593 | 313 | 1933 |
Figure 1Structure of autoencoder.
Figure 2Structure of full-connected layer.
Figure 3Flowchart of GIPAE model to predict potential drug-disease associations.
Experimental results of 10-fold cross validation yielded by GIPAE on Fdataset.
| Test set | Acc. (%) | Pre. (%) | Recall (%) | F1-score (%) |
|---|---|---|---|---|
| 1 | 87.11 | 86.00 | 88.66 | 87.31 |
| 2 | 87.89 | 86.57 | 89.69 | 88.10 |
| 3 | 85.57 | 83.82 | 88.14 | 85.93 |
| 4 | 91.19 | 91.19 | 91.19 | 91.19 |
| 5 | 87.05 | 85.22 | 89.64 | 87.37 |
| 6 | 88.60 | 86.34 | 91.71 | 88.94 |
| 7 | 86.01 | 84.92 | 87.56 | 86.22 |
| 8 | 84.97 | 82.61 | 88.60 | 85.50 |
| 9 | 85.49 | 84.77 | 86.53 | 85.64 |
| 10 | 89.12 | 89.12 | 89.12 | 89.12 |
| Average |
|
|
|
|
Experimental results of the 10-fold cross validation yielded by GIPAE on Cdataset.
| Test set | Acc. (%) | Pre. (%) | Recall (%) | F1-score (%) |
|---|---|---|---|---|
| 1 | 88.19 | 89.11 | 87.01 | 88.05 |
| 2 | 92.91 | 92.25 | 93.70 | 92.97 |
| 3 | 88.93 | 86.89 | 91.70 | 89.23 |
| 4 | 90.91 | 89.35 | 92.89 | 91.09 |
| 5 | 90.12 | 90.44 | 89.72 | 90.08 |
| 6 | 90.12 | 88.89 | 91.70 | 90.27 |
| 7 | 92.49 | 90.87 | 94.47 | 92.64 |
| 8 | 90.91 | 89.66 | 92.49 | 91.05 |
| 9 | 88.54 | 88.84 | 88.14 | 88.49 |
| 10 | 92.09 | 91.44 | 92.89 | 92.16 |
| Average |
|
|
|
|
Figure 4Positive and negative sample probability distribution.
Figure 5(a) and (b) Shown are the ROC curves yielded by GIPAE using 10-fold cross validation on Fdataset and Cdataset, respectively.
AUC results yielded by different methods using 10-fold cross validation.
| Fdataset | Cdataset | |
|---|---|---|
| DrugNet [ | 0.778 | 0.804 |
| HGBI [ | 0.829 | 0.858 |
| KBMF [ | 0.915 | 0.928 |
| MBiRW [ | 0.917 | 0.933 |
| DRRS [ | 0.930 | 0.947 |
| GIPAE (the proposed method) | 0.933 | 0.960 |
Results yielded by SVM on Fdataset using 10-fold cross validation.
| Test set | Acc. (%) | Pre. (%) | Recall (%) | F1-score (%) |
|---|---|---|---|---|
| 1 | 78.87 | 79.79 | 77.32 | 78.53 |
| 2 | 77.83 | 78.12 | 77.32 | 77.72 |
| 3 | 80.93 | 81.91 | 79.38 | 80.63 |
| 4 | 78.50 | 78.06 | 79.27 | 78.66 |
| 5 | 81.87 | 81.22 | 82.90 | 82.05 |
| 6 | 80.31 | 80.00 | 80.83 | 80.41 |
| 7 | 79.02 | 79.17 | 78.76 | 78.96 |
| 8 | 80.57 | 78.37 | 84.46 | 81.30 |
| 9 | 79.02 | 79.17 | 78.76 | 78.96 |
| 10 | 76.17 | 77.01 | 74.61 | 75.79 |
| Average |
|
|
|
|
Results yielded by SVM on Cdataset using 10-fold cross validation.
| Test set | Acc. (%) | Pre. (%) | Recall (%) | F1-score (%) |
|---|---|---|---|---|
| 1 | 82.68 | 82.42 | 83.07 | 82.75 |
| 2 | 86.61 | 86.05 | 87.40 | 86.72 |
| 3 | 83.99 | 82.82 | 85.77 | 84.27 |
| 4 | 83.79 | 84.34 | 83.00 | 83.67 |
| 5 | 81.62 | 81.75 | 81.42 | 81.58 |
| 6 | 82.41 | 83.06 | 81.42 | 82.24 |
| 7 | 85.77 | 86.06 | 85.38 | 85.71 |
| 8 | 84.58 | 84.58 | 84.58 | 84.58 |
| 9 | 84.98 | 86.42 | 83.00 | 84.68 |
| 10 | 81.82 | 81.82 | 81.82 | 81.82 |
| Average |
|
|
|
|
Figure 6(a) and (b) Shown are the ROC curves yielded by SVM using 10-fold cross validation on Fdataset and Cdataset, respectively.
Top-20 drugs predicted by GIPAE to be associated with Obesity based on Fdatabase.
| Index | Drug name | Evidence | Index | Drug name | Evidence |
|---|---|---|---|---|---|
| 1 | Phendimetrazine | Confirmed | 11 | Almotriptan | N.A. |
| 2 | Ticlopidine | Confirmed | 12 | Thioguanine | N.A. |
| 3 | Sibutramine | Confirmed | 13 | Terfenadine | Confirmed |
| 4 | Nitroglycerin | Confirmed | 14 | Sodium tetradecyl sulfate | N.A. |
| 5 | Mesalazine | Confirmed | 15 | Salsalate | Confirmed |
| 6 | Flutamide | Confirmed | 16 | Ramipril | N.A. |
| 7 | Felodipine | Confirmed | 17 | Procainamide | N.A. |
| 8 | Diethylpropion | Confirmed | 18 | Pravastatin | Confirmed |
| 9 | Desipramine | Confirmed | 19 | Phentermine | Confirmed |
| 10 | Captopril | N.A. | 20 | Orlistat | Confirmed |
Top-20 drugs predicted by GIPAE to be associated with Alzheimer disease based on Fdatabase.
| Index | Drug name | Evidence | Index | Drug name | Evidence |
|---|---|---|---|---|---|
| 1 | Dopamine | Confirmed | 11 | Ranitidine | Confirmed |
| 2 | Methylergonovine | N.A. | 12 | Nizatidine | N.A. |
| 3 | Meperidine | N.A. | 13 | Nifedipine | Confirmed |
| 4 | Gemcitabine | Confirmed | 14 | Lithium | Confirmed |
| 5 | Betamethasone | Confirmed | 15 | Esomeprazole | N.A. |
| 6 | Pioglitazone | Confirmed | 16 | Ergocalciferol | Confirmed |
| 7 | Guanethidine | Confirmed | 17 | Dihydrocodeine | N.A. |
| 8 | Valrubicin | N.A. | 18 | Amitriptyline | Confirmed |
| 9 | Teriparatide | N.A. | 19 | Almotriptan | Confirmed |
| 10 | Rizatriptan | N.A. | 20 | Adenosine | N.A. |