| Literature DB >> 31747915 |
Han-Jing Jiang1,2,3, Zhu-Hong You4,5,6, Yu-An Huang7.
Abstract
BACKGROUND: In the process of drug development, computational drug repositioning is effective and resource-saving with regards to its important functions on identifying new drug-disease associations. Recent years have witnessed a great progression in the field of data mining with the advent of deep learning. An increasing number of deep learning-based techniques have been proposed to develop computational tools in bioinformatics.Entities:
Keywords: Convolutional Neural Networks; Random forest; Sigmoid kernel
Year: 2019 PMID: 31747915 PMCID: PMC6868698 DOI: 10.1186/s12967-019-2127-5
Source DB: PubMed Journal: J Transl Med ISSN: 1479-5876 Impact factor: 5.531
Fig. 1Flowchart of SKCNN model to predict potential drug–disease associations
General statistics on Fdataset and Cdataset
| Datasets | Drugs | Diseases | Interactions |
|---|---|---|---|
| Cdataset | 663 | 409 | 2532 |
| Fdataset | 593 | 313 | 1933 |
Fig. 2Convolution on features
Fig. 3Maximum pooling of features
Experimental results of tenfold cross-validation yielded by SKCNN on Fdataset
| Test set | Acc. (%) | Pre. (%) | Recall (%) | F1-score (%) |
|---|---|---|---|---|
| 1 | 89.69 | 92.31 | 86.60 | 89.36 |
| 2 | 87.37 | 90.06 | 84.02 | 86.93 |
| 3 | 88.66 | 90.32 | 86.60 | 88.42 |
| 4 | 88.86 | 90.76 | 86.53 | 88.59 |
| 5 | 88.86 | 89.06 | 88.60 | 88.83 |
| 6 | 89.64 | 91.80 | 87.05 | 89.36 |
| 7 | 90.93 | 95.40 | 86.01 | 90.46 |
| 8 | 89.38 | 91.30 | 87.05 | 89.12 |
| 9 | 91.45 | 91.67 | 91.19 | 91.43 |
| 10 | 90.67 | 93.85 | 87.05 | 90.32 |
| Average | 89.55 ± 1.15 | 91.65 ± 1.77 | 87.07 ± 1.75 | 89.28 ± 1.19 |
Experimental results of the tenfold cross-validation yielded by SKCNN on Cdataset
| Test set | Acc. (%) | Pre. (%) | Recall (%) | F1-score (%) |
|---|---|---|---|---|
| 1 | 90.35 | 92.18 | 88.19 | 90.14 |
| 2 | 93.11 | 95.82 | 90.16 | 92.90 |
| 3 | 89.13 | 90.91 | 86.96 | 88.89 |
| 4 | 92.09 | 95.32 | 88.54 | 91.80 |
| 5 | 89.53 | 91.32 | 87.35 | 89.29 |
| 6 | 91.50 | 91.67 | 91.30 | 91.49 |
| 7 | 91.30 | 91.97 | 90.51 | 91.24 |
| 8 | 91.50 | 93.03 | 89.72 | 91.35 |
| 9 | 93.87 | 93.02 | 94.86 | 93.93 |
| 10 | 91.50 | 91.67 | 91.30 | 91.49 |
| Average | 91.38 ± 1.39 | 92.69 ± 1.58 | 89.89 ± 2.21 | 91.25 ± 1.45 |
Fig. 4a, b The ROC curves yielded by SKCNN using tenfold cross validation on the Fdataset and Cdataset, respectively
Fig. 5AUC results yielded by different methods using tenfold cross validation
Results yielded by SVM on Fdataset using tenfold cross validation
| Test set | Acc. (%) | Pre. (%) | Recall (%) | F1-score (%) |
|---|---|---|---|---|
| 1 | 86.08 | 83.33 | 90.21 | 86.63 |
| 2 | 83.51 | 83.51 | 83.51 | 83.51 |
| 3 | 84.54 | 81.31 | 89.69 | 85.29 |
| 4 | 81.35 | 82.70 | 79.27 | 80.95 |
| 5 | 82.64 | 82.14 | 83.42 | 82.78 |
| 6 | 84.20 | 84.38 | 83.94 | 84.16 |
| 7 | 82.90 | 83.96 | 81.35 | 82.63 |
| 8 | 82.38 | 78.28 | 89.64 | 83.57 |
| 9 | 83.42 | 81.16 | 87.05 | 84.00 |
| 10 | 86.53 | 85.79 | 87.56 | 86.67 |
| Average | 83.76 ± 1.54 | 82.66 ± 1.98 | 85.56 ± 3.61 | 84.02 ± 1.70 |
| SKCNN | 89.55 ± 1.15 | 91.65 ± 1.77 | 87.07 ± 1.75 | 89.28 ± 1.19 |
Results yielded by SVM on Cdataset using tenfold cross validation
| Test set | Acc. (%) | Pre. (%) | Recall. (%) | F1-score. (%) |
|---|---|---|---|---|
| 1 | 86.61 | 88.43 | 84.25 | 86.29 |
| 2 | 89.17 | 90.95 | 87.01 | 88.93 |
| 3 | 84.39 | 87.50 | 80.24 | 83.71 |
| 4 | 90.51 | 92.18 | 88.54 | 90.32 |
| 5 | 85.77 | 89.52 | 81.03 | 85.06 |
| 6 | 86.17 | 89.96 | 81.42 | 85.48 |
| 7 | 87.15 | 89.50 | 84.19 | 86.76 |
| 8 | 87.35 | 89.54 | 84.58 | 86.99 |
| 9 | 85.97 | 89.57 | 81.42 | 85.30 |
| 10 | 87.35 | 88.57 | 85.77 | 87.15 |
| Average | 87.04 ± 1.66 | 89.57 ± 1.24 | 83.85 ± 2.63 | 86.60 ± 1.83 |
| SKCNN | 91.38 ± 1.39 | 92.69 ± 1.58 | 89.89 ± 2.21 | 91.25 ± 1.45 |
Fig. 6a, b The ROC curves yielded by SVM using tenfold cross validation on the Fdataset and Cdataset, respectively
Top-20 drugs predicted by SKCNN to be associated with obesity based on Fdatabase
| Index | Drug name | Evidence | Index | Drug name | Evidence |
|---|---|---|---|---|---|
| 1 | Vigabatrin | Confirmed | 11 | Fluoxymesterone | NA |
| 2 | Sumatriptan | Confirmed | 12 | Disulfiram | Confirmed |
| 3 | Sulindac | Confirmed | 13 | Carteolol | Confirmed |
| 4 | Paroxetine | Confirmed | 14 | Aspirin | Confirmed |
| 5 | Ofloxacin | Confirmed | 15 | Vincristine | Confirmed |
| 6 | Mesalazine | Confirmed | 16 | Triamcinolone | Confirmed |
| 7 | Mercaptopurine | NA | 17 | Terazosin | NA |
| 8 | Isoproterenol | Confirmed | 18 | Sildenafil | Confirmed |
| 9 | Hyoscyamine | Confirmed | 19 | Sertraline | Confirmed |
| 10 | Formoterol | NA | 20 | Salicyclic acid | NA |
Top-20 drugs predicted by SKCNN to be associated with asthma based on Fdatabase
| Index | Drug name | Evidence | Index | Drug name | Evidence |
|---|---|---|---|---|---|
| 1 | Methimazole | Confirmed | 11 | Quinidine | Confirmed |
| 2 | Famotidine | Confirmed | 12 | Quetiapine | Confirmed |
| 3 | Clonazepam | Confirmed | 13 | Pyridoxine | NA |
| 4 | Trimethoprim | NA | 14 | Propranolol | Confirmed |
| 5 | Triamcinolone | Confirmed | 15 | Propafenone | Confirmed |
| 6 | Timolol | Confirmed | 16 | Promethazine | Confirmed |
| 7 | Theophylline | Confirmed | 17 | Procainamide | Confirmed |
| 8 | Tetrabenazine | NA | 18 | Prednisolone | Confirmed |
| 9 | Tamoxifen | Confirmed | 19 | Praziquantel | Confirmed |
| 10 | Ropinirole | Confirmed | 20 | Pravastatin | Confirmed |