| Literature DB >> 29914348 |
Wen Zhang1, Xiang Yue2, Weiran Lin2, Wenjian Wu3, Ruoqi Liu2, Feng Huang2, Feng Liu4.
Abstract
BACKGROUND: Drug-disease associations provide important information for the drug discovery. Wet experiments that identify drug-disease associations are time-consuming and expensive. However, many drug-disease associations are still unobserved or unknown. The development of computational methods for predicting unobserved drug-disease associations is an important and urgent task.Entities:
Keywords: Drug-disease associations; Similarity constrained matrix factorization
Mesh:
Substances:
Year: 2018 PMID: 29914348 PMCID: PMC6006580 DOI: 10.1186/s12859-018-2220-4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Two types of drug-disease association prediction methods. a Infer drug-disease associations without known associations; b Infer unobserved drug-disease associations based on known associations
The summary of SCMFDD-S dataset and SCMFDD-L dataset
| Dataset | Drugs | Diseases | Associations | Richness | Drug features | ||||
|---|---|---|---|---|---|---|---|---|---|
| Substructure | Target | Enzyme | Pathway | Drug Interactions | |||||
| SCMFDD-S | 269 | 598 | 18,416 | 11.4% | 881 | 623 | 247 | 465 | 2086 |
| SCMFDD-L | 1323 | 2834 | 49,217 | 1.31% | 881 | N.A. | N.A. | N.A. | N.A. |
Numbers for drug features represent the numbers of descriptors. For example, the PubChem Compound defines 881 types of substructure descriptors for compound substructures, and a drug has some substructures and is thus described by a subset of substructure descriptors. Richness is the ratio of association number vs drug-disease pair number. N.A. indicates that the information is not available
Fig. 2The basic idea of similarity constrained matrix factorization
Fig. 3The bipartite network and the association network
Fig. 4The influence of parameters on SCMFDD models. a the influnce of μ and λ b the influence of k
The performances of SCMFDD models based on different drug features
| AUPR | AUC | SN | SP | ACC | F | |
|---|---|---|---|---|---|---|
| Substructure | 0.2644 | 0.8737 | 0.3329 | 0.9795 | 0.9632 | 0.3130 |
| Target | 0.1947 | 0.8410 | 0.2751 | 0.9751 | 0.9575 | 0.2456 |
| Pathway | 0.2582 | 0.8706 | 0.3435 | 0.9771 | 0.9611 | 0.3079 |
| Enzyme | 0.2496 | 0.8671 | 0.3331 | 0.9768 | 0.9606 | 0.2990 |
| Drug interaction | 0.2638 | 0.8734 | 0.3505 | 0.9769 | 0.9611 | 0.3120 |
Fig. 5The influence of association exclusion criteria on data richness (a) and model performance (b)
Performance of PREDICT and SCMFDD on PREDICT Dataset
| Methods | AUPR | AUC | SN | SP | ACC | F |
|---|---|---|---|---|---|---|
| PREDICT | 0.1507 | 0.9020 | 0.3414 | 0.9929 | 0.9915 | 0.1437 |
| SCMFDD-Che-GS | 0.3141 | 0.9005 | 0.3663 | 0.9988 | 0.9974 | 0.3753 |
| SCMFDD-Che-Phen | 0.3153 | 0.9038 | 0.3678 | 0.9988 | 0.9974 | 0.3769 |
| SCMFDD-SE-GS | 0.3157 | 0.9082 | 0.3663 | 0.9988 | 0.9974 | 0.3753 |
| SCMFDD-SE-Phen | 0.3176 | 0.9109 | 0.3678 | 0.9988 | 0.9974 | 0.3769 |
| SCMFDD-GP-GS | 0.3210 | 0.9129 | 0.3720 | 0.9988 | 0.9975 | 0.3811 |
| SCMFDD-GP-Phen | 0.3224 | 0.9157 | 0.3714 | 0.9988 | 0.9975 | 0.3806 |
| SCMFDD-GO-GS | 0.3147 | 0.9035 | 0.3678 | 0.9988 | 0.9974 | 0.3769 |
| SCMFDD-GO-Phen | 0.3159 | 0.9065 | 0.3678 | 0.9988 | 0.9974 | 0.3769 |
| SCMFDD-GW-GS | 0.3249 | 0.9173 | 0.3389 | 0.9991 | 0.9977 | 0.3843 |
| SCMFDD-GW-Phen | 0.3284 | 0.9203 | 0.3776 | 0.9988 | 0.9975 | 0.3870 |
For drugs, Che Chemical fingerprints Similarity, SE Side Effect Similarity, GP Genes-Perlman Similarity, GO Genes- Ovaska Similarity, GW Genes-Waterman Similarity. For diseases, GS Gene Signature Similarity, Phen Phenotypic Similarity
Performance of TL-HGBI and SCMFDD on TL-HGBI Dataset
| Methods | AUPR | AUC | SN | SP | ACC | F |
|---|---|---|---|---|---|---|
| TL-HGBI | 0.0492 | 0.9584 | 0.1697 | 0.9999 | 0.9998 | 0.0840 |
| SCMFDD | 0.1500 | 0.9752 | 0.2136 | 0.9990 | 0.9990 | 0.0168 |
Performance of LRSSL and SCMFDD on Liang Dataset
| Methods | AUPR | AUC | SN | SP | ACC | F |
|---|---|---|---|---|---|---|
| LRSSL | 0.1789 | 0.8250 | 0.2167 | 0.9989 | 0.9979 | 0.2018 |
| SCMFDD-Che-Sem | 0.2518 | 0.9020 | 0.2799 | 0.9993 | 0.9985 | 0.3030 |
| SCMFDD-Dom-Sem | 0.2673 | 0.9228 | 0.2851 | 0.9993 | 0.9985 | 0.3088 |
| SCMFDD-Go-Sem | 0.2585 | 0.9210 | 0.2897 | 0.9993 | 0.9985 | 0.3137 |
For drugs, Che Chemical Similarity, Dom Protein Domains Similarity, Go Gene ontology Similarity. For diseases, Sem: Semantic Similarity
Fig. 6The number of confirmed associations in top predictions of PREDICT, LRSSL, SCMFDD. (a) For drugs, Che: Chemical Similarity, SE: Chemical Similarity, GP: Genes-Perlman Similarity, GO: Genes- Ovaska Similarity, GW: Genes-Waterman Similarity. For diseases, GS: Gene Signature Similarity, Phen: Phenotypic Similarity (b) For drugs, Che: Chemical Similarity, Dom: Protein Domains Similarity, Go: Gene ontology Similarity. For diseases, Sem: Semantic Similarity
Top 10 predicted diseases associated with Clozapine
| Index | Disease Name | Disease ID | Score | Evidence |
|---|---|---|---|---|
| 1 | Sleep Initiation and Maintenance Disorders | D007319 | 1 |
|
| 2 | Anxiety Disorders | D001008 | 0.9117 | N.A. |
| 3 | Inappropriate ADH Syndrome | D007177 | 0.7434 | A Case report [ |
| 4 | Stress Disorders, Post-Traumatic | D013313 | 0.7267 | Report [ |
| 5 | Parkinson Disease, Secondary | D010302 | 0.7179 | Review [ |
| 6 | Memory Disorders | D008569 | 0.7123 | An animal study [ |
| 7 | Status Epilepticus | D013226 | 0.6312 |
|
| 8 | Headache | D006261 | 0.6166 |
|
| 9 | Torsades de Pointes | D016171 | 0.5953 | N.A. |
| 10 | Attention Deficit Disorder with Hyperactivity | D001289 | 0.5913 | N.A. |
Scores are normalized by using ((score-min)/(max-min))
Top 10 predicted drugs associated with Alzheimer’s disease
| Index | Drug Name | Drug MeSH ID | DrugBank ID | PubChem CID | Score(normalized) | Evidence |
|---|---|---|---|---|---|---|
| 1 | Nitroprusside | D009599 | DB00325 | 11,963,622 | 1 | N.A. |
| 2 | Tamoxifen | D013629 | DB00675 | 2,733,526 | 0.7644 | N.A. |
| 3 | Olanzapine | C076029 | DB00334 | 4585 | 0.7269 | A clinical study [ |
| 4 | Sucralfate | D013392 | DB00364 | 70,789,197 | 0.7223 | N.A. |
| 5 | Levodopa | D007980 | DB01235 | 6047 | 0.6893 | An animal study [ |
| 6 | Malondialdehyde | D008315 | DB03057 | 10,964 | 0.6767 | A clinical study [ |
| 7 | Progesterone | D011374 | DB00396 | 5994 | 0.6695 | An animal study [ |
| 8 | Valproic Acid | D014635 | DB00313 | 3121 | 0.6625 | An animal study [ |
| 9 | Scopolamine Hydrobromide | D012601 | DB00747 | 3,000,322 | 0.6522 | N.A. |
| 10 | Ethanol | D000431 | DB00898 | 702 | 0.6402 | A clinical study [ |
Scores are normalized by using ((score-min)/(max-min))
Fig. 7Web Visualization of predictions for Clozapine a and predictions for Headache b