| Literature DB >> 32373595 |
Feng Huang1, Yang Qiu1, Qiaojun Li1,2, Shichao Liu1,3, Fuchuan Ni1,3.
Abstract
Identifying drug-disease associations is integral to drug development. Computationally prioritizing candidate drug-disease associations has attracted growing attention due to its contribution to reducing the cost of laboratory screening. Drug-disease associations involve different association types, such as drug indications and drug side effects. However, the existing models for predicting drug-disease associations merely concentrate on independent tasks: recommending novel indications to benefit drug repositioning, predicting potential side effects to prevent drug-induced risk, or only determining the existence of drug-disease association. They ignore crucial prior knowledge of the correlations between different association types. Since the Comparative Toxicogenomics Database (CTD) annotates the drug-disease associations as therapeutic or marker/mechanism, we consider predicting the two types of association. To this end, we propose a collective matrix factorization-based multi-task learning method (CMFMTL) in this paper. CMFMTL handles the problem as multi-task learning where each task is to predict one type of association, and two tasks complement and improve each other by capturing the relatedness between them. First, drug-disease associations are represented as a bipartite network with two types of links representing therapeutic effects and non-therapeutic effects. Then, CMFMTL, respectively, approximates the association matrix regarding each link type by matrix tri-factorization, and shares the low-dimensional latent representations for drugs and diseases in the two related tasks for the goal of collective learning. Finally, CMFMTL puts the two tasks into a unified framework and an efficient algorithm is developed to solve our proposed optimization problem. In the computational experiments, CMFMTL outperforms several state-of-the-art methods both in the two tasks. Moreover, case studies show that CMFMTL helps to find out novel drug-disease associations that are not included in CTD, and simultaneously predicts their association types.Entities:
Keywords: collective matrix factorization; drug-disease association; multi-task learning; predicting association type; similarity
Year: 2020 PMID: 32373595 PMCID: PMC7179666 DOI: 10.3389/fbioe.2020.00218
Source DB: PubMed Journal: Front Bioeng Biotechnol ISSN: 2296-4185
Figure 1Workflow of collective matrix factorization-based multi-task learning method (CMFMTL): A is the corresponding binary matrix for the therapeutic subnetwork; A is the corresponding binary matrix for non-therapeutic subnetwork; U ∈ ℝ and V ∈ ℝ are, respectively, the low-dimensional representations for drugs and diseases; R and R are coefficient matrices.
Figure 2Influence of parameters on the performance of CMFMTL involving two tasks: (A) shows the influence of α, β, λ on the AUPR score in task 1. (B) indicates the effect of k on the AUPR score in task 1. (C) illustrates the impact of α, β, λ on the AUPR score in task 2. (D) demonstrates the perturbation of k on the AUPR score in task 2.
Performances of Prediction Models in Task 1.
| CMFMTL | 0.2122 | 0.8898 | 0.2888 | 0.9926 | 0.2544 | 0.9866 | 0.2690 |
| CMFMTL-R | 0.1217 | 0.8543 | 0.2135 | 0.9905 | 0.1644 | 0.9839 | 0.1849 |
| TL-HGBI | 0.0444 | 0.7444 | 0.1265 | 0.9827 | 0.0624 | 0.9753 | 0.0808 |
| LRSSL | 0.0420 | 0.7341 | 0.1489 | 0.9745 | 0.0490 | 0.9674 | 0.0731 |
| DRRS | 0.1735 | 0.8893 | 0.2756 | 0.9917 | 0.2292 | 0.9856 | 0.2468 |
Performances of Prediction Models in Task 2.
| CMFMTL | 0.1838 | 0.8661 | 0.3091 | 0.9798 | 0.2091 | 0.9686 | 0.2473 |
| CMFMTL-R | 0.1465 | 0.8449 | 0.2623 | 0.9798 | 0.1812 | 0.9679 | 0.2139 |
| TL-HGBI | 0.0635 | 0.7469 | 0.1839 | 0.9653 | 0.0840 | 0.9523 | 0.1140 |
| LRSSL | 0.0606 | 0.7393 | 0.1812 | 0.9644 | 0.0801 | 0.9514 | 0.1106 |
| DRRS | 0.1150 | 0.8570 | 0.3105 | 0.9690 | 0.1454 | 0.9580 | 0.1979 |
Figure 3Top-N ranked recall and precision of all methods in two tasks: (A) shows the top-N ranked recall in task 1. (B) displays the top-N ranked recall in task 2. (C) demonstrates the top-N ranked precision in task 1. (D) illustrates the top-N ranked precision in task 2.
Top 10 Drug-Disease Associations Predicted by CMFMTL.
| Chloroquine | Bradycardia | −1 | Don Michael and Aiwazzadeh, |
| Chlorpromazine | Coma | −1 | N.A. |
| Risperidone | Anxiety disorders | 1 | Ravindran et al., |
| Clozapine | Headache | −1 | |
| Methotrexate | Neoplasms | 1 | |
| Valproic Acid | Fatigue | −1 | N.A. |
| Amitriptyline | Confusion | −1 | |
| Ibuprofen | Drug hypersensitivity | −1 | Nanau and Neuman, |
| Tamoxifen | Diarrhea | −1 | N.A. |
| Vincristine | Neoplasms | 1 |
N.A. means that the predicted association cannot be confirmed. Type 1 denotes the therapeutic associations and type −1 refers to non-therapeutic associations.
The updated process of CMFMTL.