| Literature DB >> 31443472 |
Ping Xuan1, Yingying Song1, Tiangang Zhang2, Lan Jia1.
Abstract
Identifying new indications for existing drugs may reduce costs and expedites drug development. Drug-related disease predictions typically combined heterogeneous drug-related and disease-related data to derive the associations between drugs and diseases, while recently developed approaches integrate multiple kinds of drug features, but fail to take the diversity implied by these features into account. We developed a method based on non-negative matrix factorization, DivePred, for predicting potential drug-disease associations. DivePred integrated disease similarity, drug-disease associations, and various drug features derived from drug chemical substructures, drug target protein domains, drug target annotations, and drug-related diseases. Diverse drug features reflect the characteristics of drugs from different perspectives, and utilizing the diversity of multiple kinds of features is critical for association prediction. The various drug features had higher dimensions and sparse characteristics, whereas DivePred projected high-dimensional drug features into the low-dimensional feature space to generate dense feature representations of drugs. Furthermore, DivePred's optimization term enhanced diversity and reduced redundancy of multiple kinds of drug features. The neighbor information was exploited to infer the likelihood of drug-disease associations. Experiments indicated that DivePred was superior to several state-of-the-art methods for prediction drug-disease association. During the validation process, DivePred identified more drug-disease associations in the top part of prediction result than other methods, benefitting further biological validation. Case studies of acetaminophen, ciprofloxacin, doxorubicin, hydrocortisone, and ampicillin demonstrated that DivePred has the ability to discover potential candidate disease indications for drugs.Entities:
Keywords: diversity representation; drug–disease association; non-negative matrix factorization; projections of drug features; specific features of different drug views
Mesh:
Substances:
Year: 2019 PMID: 31443472 PMCID: PMC6747548 DOI: 10.3390/ijms20174102
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Two types of curves for evaluating the predicting performance of DivePred and other methods. (a) receiver operating characteristic (ROC) curves; (b) precision–recall (P–R) curves.
Area under ROC curve (AUC) values of 15 drugs using DivePred and other methods.
| Drug Name | AUC DivePred | TL_HGBI | MBiRW | LRSSL | SCMFDD |
|---|---|---|---|---|---|
| ampicillin |
| 0.751 | 0.932 | 0.962 | 0.895 |
| cefepime |
| 0.910 | 0.970 | 0.971 | 0.914 |
| cefotaxime |
| 0.917 | 0.929 | 0.950 | 0.953 |
| cefotetan |
| 0.808 | 0.918 | 0.948 | 0.848 |
| cefoxitin |
| 0.890 | 0.912 | 0.979 | 0.894 |
| ceftazidime |
| 0.845 | 0.931 | 0.936 | 0.922 |
| ceftizoxime | 0.797 | 0.960 | 0.961 | 0.923 |
|
| ceftriaxone | 0.907 | 0.945 | 0.898 |
| 0.811 |
| ciprofloxacin |
| 0.811 | 0.813 | 0.928 | 0.820 |
| doxorubicin |
| 0.487 | 0.921 | 0.727 | 0.460 |
| erythromycin |
| 0.827 | 0.887 | 0.918 | 0.764 |
| itraconazole |
| 0.445 | 0.877 | 0.845 | 0.730 |
| levofloxacin |
| 0.943 | 0.975 | 0.964 | 0.872 |
| moxifloxacin | 0.794 | 0.812 | 0.948 | 0.957 | 0.932 |
| ofloxacin |
| 0.902 | 0.943 | 0.904 | 0.774 |
| Average AUC |
| 0.683 | 0.837 | 0.838 | 0.726 |
The bold values indicate the higher AUCs.
Area under precision–recall curve (AUPR) values of 15 drugs using DivePred and other methods.
| Drug Name | AUPR DivePred | TL_HGBI | MBIRW | LRSSL | SCMFDD |
|---|---|---|---|---|---|
| ampicillin |
| 0.032 | 0.023 | 0.285 | 0.068 |
| cefepime |
| 0.163 | 0.315 | 0.625 | 0.054 |
| cefotaxime |
| 0.071 | 0.292 | 0.283 | 0.105 |
| cefotetan |
| 0.054 | 0.197 | 0.512 | 0.059 |
| cefoxitin |
| 0.151 | 0.394 | 0.286 | 0.065 |
| ceftazidime | 0.675 | 0.032 | 0.201 | 0.488 |
|
| ceftizoxime |
| 0.212 | 0.244 | 0.455 | 0.096 |
| ceftriaxone | 0.409 | 0.056 | 0.223 |
| 0.077 |
| ciprofloxacin |
| 0.082 | 0.118 | 0.280 | 0.064 |
| doxorubicin | 0.164 | 0.005 | 0.051 |
| 0.004 |
| erythromycin |
| 0.023 | 0.038 | 0.144 | 0.022 |
| itraconazole | 0.188 | 0.006 |
| 0.042 | 0.008 |
| levofloxacin | 0.504 | 0.136 | 0.071 |
| 0.098 |
| moxifloxacin |
| 0.049 | 0.065 | 0.384 | 0.088 |
| ofloxacin |
| 0.091 | 0.130 | 0.201 | 0.078 |
| Average AUC |
| 0.013 | 0.043 | 0.117 | 0.014 |
The bold values indicate the higher AUPRs.
Results of Wilcoxon test on DivePred and four other contrast methods for 763 drugs.
| TL_HGBI | MBiRW | LRSSL | SCMFDD | |
|---|---|---|---|---|
| 5.631 × 10−42 | 7.181 × 10−156 | 3.735 × 10−78 | 6.596 × 10−73 | |
| 1.332 × 10−21 | 2.635 × 10−32 | 1.562 × 10−16 | 8.452 × 10−29 |
Figure 2Average recall rates of all drugs at different top .
The top 15 related candidate diseases for acetaminophen, ciprofloxacin, doxorubicin, hydrocortisone, and ampicillin.
| Drug Name | Rank | Disease Name | Description | Rank | Disease Name | Description |
|---|---|---|---|---|---|---|
|
| 1 | Osteoarthritis | CTD | 9 | Arthritis | DrugBank |
| 2 | Arthritis, Rheumatoid | CTD | 10 | Pain, Postoperative | CTD | |
| 3 | Inflammation | CTD | 11 | Rheumatic Fever | PubChem | |
| 4 | Dysmenorrhea | inferred candidate by 1 literature | 12 | Arthritis, Gouty | CTD | |
| 5 | Arthritis, Juvenile Rheumatoid | DrugBank | 13 | Premenstrual Syndrome | DrugBank | |
| 6 | Gout | DrugBank | 14 | Menorrhagia | unconfirmed | |
| 7 | Spondylitis, Ankylosing | Clinicaltrials | 15 | Rheumatic Diseases | Clinicaltrials | |
| 8 | Bursitis | literature [ | ||||
|
| 1 | Salmonella Infections | CTD | 9 | Pyelonephritis | CTD |
| 2 | Streptococcal Infections | DrugBank | 10 | Bacterial Infections | CTD | |
| 3 | Bronchitis | CTD | 11 | Serratia Infections | DrugBank | |
| 4 | Pneumonia, Bacterial | CTD | 12 | Tuberculosis, Pulmonary | CTD | |
| 5 | Chlamydia Infections | CTD | 13 | Plague | CTD | |
| 6 | Gram-Negative Bacterial Infections | CTD | 14 | Brucellosis | PubChem | |
| 7 | Enterobacteriaceae Infections | CTD | 15 | Chlamydiaceae Infections | PubChem | |
| 8 | Soft Tissue Infections | CTD | ||||
|
| 1 | Leukemia, Myeloid, Acute | CTD | 9 | Rhabdomyosarcoma | CTD |
| 2 | Precursor Cell Lymphoblastic Leukemia-Lymphoma | CTD | 10 | Histiocytosis | Clinicaltrials | |
| 3 | Carcinoma, Non-Small-Cell Lung | PubChem | 11 | Trophoblastic Neoplasms | DrugBank | |
| 4 | Mycosis Fungoides | PubChem | 12 | Stomach Neoplasms | CTD | |
| 5 | Leukemia, Lymphocytic, Chronic, B-Cell | inferred candidate by 14 literatures | 13 | Hodgkin Disease | CTD | |
| 6 | Head and Neck Neoplasms | CTD | 14 | Melanoma | CTD | |
| 7 | Sarcoma, Kaposi | CTD | 15 | Leukemia, Myelogenous, Chronic, BCR-ABL Positive | DrugBank | |
| 8 | Leukemia, Lymphoid | CTD | ||||
|
| 1 | Asthma | CTD | 9 | Shock, Septic | CTD |
| 2 | Rhinitis, Allergic, Perennial | DrugBank | 10 | Acne Vulgaris | unconfirmed | |
| 3 | Dermatitis | PubChem | 11 | Rosacea | CTD | |
| 4 | Skin Diseases | CTD | 12 | Addison Disease | CTD | |
| 5 | Pruritus | PubChem | 13 | Hyperhidrosis | literature [ | |
| 6 | Keratosis | inferred candidate by 1 literature | 14 | Hematologic Diseases | inferred candidate by 1 literature | |
| 7 | Hypersensitivity | inferred candidate by 7 literatures | 15 | Pityriasis Rosea | unconfirmed | |
| 8 | Psoriasis | PubChem | ||||
|
| 1 | Proteus Infections | CTD | 9 | Osteomyelitis | Clinicaltrials |
| 2 | Streptococcal Infections | CTD | 10 | Impetigo | unconfirmed | |
| 3 | Septicemia | DrugBank | 11 | Serratia Infections | CTD | |
| 4 | Pneumonia, Bacterial | CTD | 12 | Peritonitis | CTD | |
| 5 | Bone Diseases, Infectious | PubChem | 13 | Bacterial Infections | CTD | |
| 6 | Staphylococcal Skin Infections | DrugBank | 14 | Enterobacteriaceae Infections | DrugBank | |
| 7 | Wound Infection | CTD | 15 | Cellulitis | CTD | |
| 8 | Pseudomonas Infections | PubChem |
Figure 3Representation of data from drugs and diseases from multiple sources and representation of drug–disease predictive association matrix . (a) Drug feature data sets from multiple sources; (b) four low-dimensional representation of drugs; (c) four affinity maps of the drugs were obtained by similarity calculation; (d) extract the similarity of the diseases and obtain the affinity map of the disease.