| Literature DB >> 33868209 |
Yuanyuan Ma1, Lifang Liu2, Qianjun Chen3, Yingjun Ma4.
Abstract
Metabolites are closely related to human disease. The interaction between metabolites and drugs has drawn increasing attention in the field of pharmacomicrobiomics. However, only a small portion of the drug-metabolite interactions were experimentally observed due to the fact that experimental validation is labor-intensive, costly, and time-consuming. Although a few computational approaches have been proposed to predict latent associations for various bipartite networks, such as miRNA-disease, drug-target interaction networks, and so on, to our best knowledge the associations between drugs and metabolites have not been reported on a large scale. In this study, we propose a novel algorithm, namely inductive logistic matrix factorization (ILMF) to predict the latent associations between drugs and metabolites. Specifically, the proposed ILMF integrates drug-drug interaction, metabolite-metabolite interaction, and drug-metabolite interaction into this framework, to model the probability that a drug would interact with a metabolite. Moreover, we exploit inductive matrix completion to guide the learning of projection matrices U and V that depend on the low-dimensional feature representation matrices of drugs and metabolites: Fm and Fd . These two matrices can be obtained by fusing multiple data sources. Thus, Fd U and Fm V can be viewed as drug-specific and metabolite-specific latent representations, different from classical LMF. Furthermore, we utilize the Vicus spectral matrix that reveals the refined local geometrical structure inherent in the original data to encode the relationships between drugs and metabolites. Extensive experiments are conducted on a manually curated "DrugMetaboliteAtlas" dataset. The experimental results show that ILMF can achieve competitive performance compared with other state-of-the-art approaches, which demonstrates its effectiveness in predicting potential drug-metabolite associations.Entities:
Keywords: Vicus matrix; drug-metabolite association; graph regularization; human metabolites; logistic matrix factorization
Year: 2021 PMID: 33868209 PMCID: PMC8047063 DOI: 10.3389/fmicb.2021.650366
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Illustrative example of ILMF for predicting potential drug-metabolite associations. (A) Metabolite–metabolite, metabolite-drug, metabolite-microbe, metabolite-pathway association matrices, or correlation matrices; (B) Drug-metabolite, drug–drug association, or correlation matrices; (C,D) Based on Gaussian interaction profile kernel function, metabolite–metabolite similarity matrices, and drug–drug similarity matrices obtained from four metabolite association data and two drug association data, respectively; (E) The fused metabolite–metabolite similarity matrix by integrating four metabolite-related data with clusDCA; (F) The fused drug–drug similarity matrix by integrating two drug association data with clusDCA. Then, the local spectral matrix of metabolites (G) And the local spectral matrix of drugs (H) Can be obtained based on these two fused similarity matrices with Vicus; (I) The drug-metabolite association matrix; (J) The proposed ILMF model. Finally, ILMF outputs the predicted drug-metabolite interaction probability scores (K). Here, a solid line indicates known associations, a dotted line indicates predicted drug-metabolite associations obtained from ILMF.
The pseudocode of the ILMF algorithm.
| Input: The known association matrix |
| Output: The projection matrices, |
| 1. Compute metabolite–metabolite similarity matrices |
| 2. Compute the low-dimensional feature representational matrices of metabolites and drugs, |
| 3. Initialize |
| 4. For |
| 5. Update |
| 6. Until convergence conditions are satisfied |
| 7. End for |
| 8. Return |
The best performance of all methods on the “DrugMetaboliteAtlas” dataset.
| DTInet | 0.7430 | 0.2176 | 0.2951 |
| IMCMDA | 0.7913 | 0.3655 | 0.4345 |
| GRNMF | 0.9272 | 0.5847 | 0.5767 |
| ILMF– | 0.9223 | 0.5429 | 0.5662 |
| ILMF | 0.9402 | 0.6303 | 0.6052 |
FIGURE 2Performance of ILMF on “DrugMetaboliteAtlas” dataset with different values of λ and ϕ. (A) AUC versus λ and ϕ; (B) AUPR versus λ and ϕ.
FIGURE 3Performance of ILMF on “DrugMetaboliteAtlas” dataset with different values of c and r. (A) AUC versus c and r; (B) AUPR versus c and r.
Top 20 novel associations predicted by ILMF on the “DrugMetaboliteAtlas” dataset.
| 1 | C_HMG CoA reductase inhibitors-hydrophilic statin | TotPG | 0.9915 | C10AA03 (pravastatin) |
| 2 | M_Preparations inhibiting uric acid production | L.VLDL.FC | 0.9891 | M04AA01 (allopurinol) |
| 3 | M_Preparations inhibiting uric acid production | L.VLDL.P | 0.9881 | Unconfirmed |
| 4 | N_Benzodiazepine derivatives | UnsatDeg | 0.9755 | N03AE01 (clonazepam) |
| 5 | C_Angiotensin II antagonists-plain | XS.VLDL.FC | 0.9687 | Unconfirmed |
| 6 | C_Low-ceiling diuretics | XL.HDL.FC | 0.9625 | C03AA04 (chlorothiazide) |
| 7 | C_Low-ceiling diuretics | L.HDL.P | 0.9588 | C03AA03 (hydrochlorothiazide) |
| 8 | C_Low-ceiling diuretics | L.HDL.PL | 0.9553 | Unconfirmed |
| 9 | A_Insulins and analogs-fast-acting | FALen | 0.9525 | A10AB019 (insulin) |
| 10 | C_Low-ceiling diuretics | HDL.C | 0.9493 | Unconfirmed |
| 11 | B_Carbasalate calcium | ApoB | 0.9419 | Unconfirmed |
| 12 | C_Low-ceiling diuretics | HDL2.C | 0.9346 | Unconfirmed |
| 13 | C_Low-ceiling diuretics | UnsatDeg | 0.9334 | C03AA03 (hydrochlorothiazide) |
| 14 | C_Digoxin | S.VLDL.PL | 0.9247 | Unconfirmed |
| 15 | C_ACE inhibitors-plain | M.HDL.C | 0.9240 | C09AA01 (captopril) |
| 16 | C_HMG CoA reductase inhibitors-hydrophilic statin | S.HDL.CE | 0.9219 | C10AA03 (pravastatin) |
| 17 | C_Angiotensin II antagonists-plain | L.HDL.TG | 0.9212 | Unconfirmed |
| 18 | C_Fibrates | VLDL.D | 0.9192 | Unconfirmed |
| 19 | C_Angiotensin II antagonists-plain | PUFA | 0.9188 | C09CA01-08 |
| 20 | M_Preparations inhibiting uric acid production | XL.VLDL.PL | 0.9158 | Unconfirmed |
FIGURE 4Global view of the predicted drug-metabolite associations. Hierarchical clustering of the ILMF scores between 42 drugs and 150 metabolites. The color of each cell represents the ILMF score of a drug (row) and a metabolite (column), where red/blue indicates high/low ILMF scores.
FIGURE 5The sub-network consists of three drugs and six metabolites.
FIGURE 6The sub-network consists of two drugs and seven metabolites.