| Literature DB >> 35095495 |
Chu-Qiao Gao1, Yuan-Ke Zhou1, Xiao-Hong Xin1, Hui Min1, Pu-Feng Du1.
Abstract
Drug repositioning provides a promising and efficient strategy to discover potential associations between drugs and diseases. Many systematic computational drug-repositioning methods have been introduced, which are based on various similarities of drugs and diseases. In this work, we proposed a new computational model, DDA-SKF (drug-disease associations prediction using similarity kernels fusion), which can predict novel drug indications by utilizing similarity kernel fusion (SKF) and Laplacian regularized least squares (LapRLS) algorithms. DDA-SKF integrated multiple similarities of drugs and diseases. The prediction performances of DDA-SKF are better, or at least comparable, to all state-of-the-art methods. The DDA-SKF can work without sufficient similarity information between drug indications. This allows us to predict new purpose for orphan drugs. The source code and benchmarking datasets are deposited in a GitHub repository (https://github.com/GCQ2119216031/DDA-SKF).Entities:
Keywords: Laplacian regularized least squares; drug repositioning; drug–disease association; orphan drugs; similarity kernel fusion
Year: 2022 PMID: 35095495 PMCID: PMC8792612 DOI: 10.3389/fphar.2021.784171
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.810
FIGURE 1Flowchart of DDA-SKF. The model DDA-SKF included three steps: (1) three drug similarity kernels and two disease similarity kernels were calculated; (2) similarity kernel fusion (SKF) algorithm was used to integrate these similarities into two comprehensive similarity kernels; (3) Laplacian regularized least squares (LapRLS) framework was used to build the prediction model.
Performance comparison analysis using both the novel association test and novel drug test.
| Method | AUROC-A | AUPR-A | AUROC-D | AUPR-D |
|---|---|---|---|---|
| BNNR | 0.932 | 0.589 | 0.776 | 0.136 |
| DisDrugPred | 0.890 | 0.070 | 0.835 | 0.243 |
| DRRS | 0.929 | 0.140 | 0.765 | 0.114 |
| SCMFDD | 0.712 | 0.004 | 0.733 | 0.048 |
| MbiRW | 0.911 | 0.129 | 0.798 | 0.156 |
| DRIMC | 0.956 | 0.299 | 0.873 | 0.278 |
| DDA-SKF | 0.937 ± 0.0003 | 0.533 ± 0.0039 | 0.845 ± 0.0007 | 0.270 ± 0.0013 |
AUROC: area under receiver operating characteristics; AUPR: area under precision-recall.
The suffix “-A” indicates that the AUROC and AUPR are obtained using the novel association prediction.
The suffix “-D” indicates that the AUROC and AUPR are obtained using the novel drug prediction.
The performance values are taken from Yang et al. (2019a).
The performance values are taken from Zhang et al. (2020).
The value is represented as “average ± standard deviation” form for 5 times of repeated cross-validations.
Performance comparison analysis with PREDICT, NTSIM, and LRSSL.
| Method | Dataset | AUROC | AUPR | SEN | SPE | PRE | ACC | F1 | MCC |
|---|---|---|---|---|---|---|---|---|---|
| PREDICT | PREDICT | 0.902 | 0.151 | 0.341 | 0.993 | 0.091 | 0.992 | 0.144 | 0.163 |
| NTSIM | PREDICT | 0.921 | 0.338 | 0.368 | 0.999 | 0.462 | 0.998 | 0.402 | 0.407 |
| DDA-SKF | PREDICT | 0.929 ± 0.0015 | 0.497 ± 0.0043 | 0.455 ± 0.0072 | 0.996 ± 0.0002 | 0.565 ± 0.0120 | 0.991 ± 0.0002 | 0.504 ± 0.0051 | 0.503 ± 0.0054 |
| LRSSL | LRSSL | 0.825 | 0.179 | 0.217 | 0.999 | 0.199 | 0.998 | 0.202 | 0.203 |
| NTSIM | LRSSL | 0.902 | 0.269 | 0.308 | 0.999 | 0.376 | 0.999 | 0.338 | 0.337 |
| DDA-SKF | LRSSL | 0.931 ± 0.0011 | 0.382 ± 0.0051 | 0.395 ± 0.0084 | 0.997 ± 0.0002 | 0.467 ± 0.0138 | 0.994 ± 0.0002 | 0.427 ± 0.0037 | 0.426 ± 0.0041 |
AUROC: area under receiver operating characteristics.
AUPR: area under precision-recall.
SEN: sensitivity.
SPE: specificity.
PRE: precision.
ACC: accuracy.
F1: F1-score.
MCC: Matthew’s correlation coefficient.
The performance values are taken from Zhang et al.( 2018a).
The value is represented as “average ± standard deviation” form for 5 times of repeated cross-validations.
Top five candidate diseases for typical drugs.
| Drug | Disease | OMIM ID | Evidence |
|---|---|---|---|
| Vincristine | B-cell chronic lymphocytic leukemia | 151400 | NA |
| Vincristine | Small-cell carcinoma | 182280 | CTD |
| Vincristine | Mycosis fungoides | 254400 | CTD |
| Vincristine | Testicular neoplasms | 273300 | CTD |
| Vincristine | Urinary bladder neoplasms | 109800 | CTD |
| Cisplatin | Alveolar rhabdomyosarcoma | 268220 | NA |
| Cisplatin | Wilms’ tumor | 194070 | CTD |
| Cisplatin | Stomach neoplasms | 137215 | CTD |
| Cisplatin | Acute lymphoblastic leukemia | 247640 | NA |
| Cisplatin | Colorectal neoplasms | 114500 | CTD |
| Fluorouracil | Esophageal neoplasms | 133239 | CTD |
| Fluorouracil | Renal cell carcinoma | 144700 | CTD |
| Fluorouracil | Acute lymphoblastic leukemia | 247640 | NA |
| Fluorouracil | Prostatic neoplasms | 176807 | CTD |
| Fluorouracil | Acute myelocytic leukemia | 246470 | NA |
| Methotrexate | B-cell chronic lymphocytic leukemia | 151400 | CTD |
| Methotrexate | Neuroblastoma | 256700 | NA |
| Methotrexate | Wilms’ tumor | 194070 | NA |
| Methotrexate | Lung neoplasms | 211980 | CTD |
| Methotrexate | Glioma | 137800 | CTD |
| Paclitaxel | Mismatch repair cancer syndrome 1 | 276300 | NA |
| Paclitaxel | Prostatic neoplasms | 176807 | CTD |
| Paclitaxel | Testicular germ cell tumor | 273300 | CTD |
| Paclitaxel | Stomach neoplasms | 137215 | CTD |
| Paclitaxel | Cutaneous malignant melanoma, 1 | 155600 | NA |
NA: not available on the Comparative Toxicogenomics Database.
CTD: Available on the Comparative Toxicogenomics Database.
Top five candidate diseases for typical orphan drugs.
| Orphan drug | Disease | OMIM ID | Evidence |
|---|---|---|---|
| Celecoxib | Osteoarthritis susceptibility 2 | 140600 | CTD |
| Celecoxib | Osteoarthritis susceptibility 1 | 165720 | CTD |
| Celecoxib | Progressive pseudorheumatoid dysplasia | 208230 | NA |
| Celecoxib | Mitochondrial recessive ataxia syndrome | 607459 | NA |
| Celecoxib | Osteoarthritis susceptibility 3 | 607850 | CTD |
| Methotrexate | Mismatch repair cancer syndrome 1 | 276300 | NA |
| Methotrexate | Breast neoplasms | 114480 | CTD |
| Methotrexate | Acute lymphoblastic leukemia | 247640 | NA |
| Methotrexate | Autoimmune diseases | 109100 | CTD |
| Methotrexate | Pyogenic arthritis–pyoderma gangrenosum–acne | 604416 | NA |
| Doxorubicin | Mismatch repair cancer syndrome 1 | 276300 | NA |
| Doxorubicin | Acute lymphoblastic leukemia | 247640 | NA |
| Doxorubicin | Dohle bodies and acute leukemia | 223350 | NA |
| Doxorubicin | Breast neoplasms | 114480 | CTD |
| Doxorubicin | Acute myeloid leukemia | 601626 | CTD |
CTD: available on the Comparative Toxicogenomics Database.
NA: not available on the Comparative Toxicogenomics Database.
Top five candidate drugs for complex diseases.
| Disease | Drug | Evidence |
|---|---|---|
| Alzheimer’s disease | Pyridostigmine |
|
| Alzheimer’s disease | Benzatropine | NA |
| Alzheimer’s disease | Scopolamine |
|
| Alzheimer’s disease | Carbidopa |
|
| Alzheimer’s disease | Pramipexole |
|
| Parkinson’s disease | Biperiden |
|
| Parkinson’s disease | Levodopa |
|
| Parkinson’s disease | Bromocriptine |
|
| Parkinson’s disease | Trihexyphenidyl |
|
| Parkinson’s disease | Rivastigmine |
|
| Amyotrophic lateral sclerosis | Baclofen |
|
| Amyotrophic lateral sclerosis | Mexiletine |
|
| Amyotrophic lateral sclerosis | Colchicine |
|
| Amyotrophic lateral sclerosis | Ranolazine |
|
| Amyotrophic lateral sclerosis | Prilocaine | NA |
Evidence: Evidences from the literature.
NA: Evidences are not available.
FIGURE 2Performance of the single similarity and SKF in the drug subspace of the PREDICT dataset. (A) ROC curve and (B) PR curve.
FIGURE 3Performance of the single similarity and SKF in the disease subspace of the PREDICT dataset. (A) ROC curve and (B) PR curve.
Performance comparison between the multiple similarity fusion and every single similarity.
| Space | Similarity | AUROC | AUPR | SEN | SPE | PRE | ACC | F1 | MCC |
|---|---|---|---|---|---|---|---|---|---|
| Drug | Chemical | 0.778 ± 0.0017 | 0.092 ± 0.0001 | 0.158 ± 0.0178 | 0.991 ± 0.0021 | 0.164 ± 0.0256 | 0.983 ± 0.0019 | 0.159 ± 0.0010 | 0.151 ± 0.0029 |
| Drug | Functional | 0.804 ± 0.0010 | 0.112 ± 0.0024 | 0.221 ± 0.0078 | 0.989 ± 0.0008 | 0.173 ± 0.0049 | 0.981 ± 0.0007 | 0.194 ± 0.0010 | 0.186 ± 0.0011 |
| Drug | Association | 0.811 ± 0.0012 | 0.161 ± 0.0037 | 0.261 ± 0.0145 | 0.989 ± 0.0010 | 0.200 ± 0.0063 | 0.981 ± 0.0009 | 0.226 ± 0.0024 | 0.219 ± 0.0032 |
| Drug | SKF | 0.887 ± 0.0018 | 0.455 ± 0.0035 | 0.458 ± 0.0115 | 0.995 ± 0.0003 | 0.515 ± 0.0112 | 0.990 ± 0.0001 | 0.485 ± 0.0040 | 0.481 ± 0.0038 |
| Disease | Semantic | 0.741 ± 0.0015 | 0.108 ± 0.0021 | 0.185 ± 0.0205 | 0.991 ± 0.0026 | 0.188 ± 0.0297 | 0.983 ± 0.0023 | 0.184 ± 0.0044 | 0.177 ± 0.0059 |
| Disease | Association | 0.707 ± 0.0026 | 0.214 ± 0.0057 | 0.168 ± 0.0049 | 0.999 ± 0.0001 | 0.846 ± 0.0388 | 0.991 ± 0.0001 | 0.280 ± 0.0074 | 0.374 ± 0.0109 |
| Disease | SKF | 0.828 ± 0.0020 | 0.359 ± 0.0033 | 0.345 ± 0.0073 | 0.996 ± 0.0003 | 0.503 ± 0.0145 | 0.990 ± 0.0002 | 0.409 ± 0.0020 | 0.411 ± 0.0027 |
AUROC: area under receiver operating characteristics.
AUPR: area under precision-recall.
SEN: sensitivity.
SPE: specificity.
PRE: precision.
ACC: accuracy.
F1: F1-score.
MCC: Matthew’s correlation coefficient.
The value is represented as “average ± standard deviation” form for 5 times of repeated cross-validations.