| Literature DB >> 35207606 |
Xueting Han1, Ruixia Xie2, Xutao Li1, Junyi Li1.
Abstract
Concurrent use of multiple drugs can lead to unexpected adverse drug reactions. The interaction between drugs can be confirmed by routine in vitro and clinical trials. However, it is difficult to test the drug-drug interactions widely and effectively before the drugs enter the market. Therefore, the prediction of drug-drug interactions has become one of the research priorities in the biomedical field. In recent years, researchers have been using deep learning to predict drug-drug interactions by exploiting drug structural features and graph theory, and have achieved a series of achievements. A drug-drug interaction prediction model SmileGNN is proposed in this paper, which can be characterized by aggregating the structural features of drugs constructed by SMILES data and the topological features of drugs in knowledge graphs obtained by graph neural networks. The experimental results show that the model proposed in this paper combines a variety of data sources and has a better prediction performance compared with existing prediction models of drug-drug interactions. Five out of the top ten predicted new drug-drug interactions are verified from the latest database, which proves the credibility of SmileGNN.Entities:
Keywords: drug–drug interaction prediction; graph neural network; knowledge graph; structural features; topological features
Year: 2022 PMID: 35207606 PMCID: PMC8879716 DOI: 10.3390/life12020319
Source DB: PubMed Journal: Life (Basel) ISSN: 2075-1729
Figure 1Two-dimensional graphs of the drug Leucovorin and its corresponding SMILES.
Figure 2Pretreatment methods of SMILES.
Figure 3KG construction.
Comparison of KEGG KG and PDD KG.
| KEGG | PDD | |
|---|---|---|
| Number of drugs | 11,174 | 1495 |
| The proportion of drugs with structural records | 13.96% | 72.37% |
| The density of the graph | 4.300 × 10−5 | 8.571 × 10−4 |
| Number of positive samples | 56,983 | 36,768 |
| Drug-drug interaction subgraph density | 9.128 × 10−4 | 3.292 × 10−2 |
Figure 4Extraction of topological features.
Figure 5SmileGNN model.
Experimental parameters.
| KEGG | PDD | |
|---|---|---|
| Batch size | 2048 | 1024 |
| Learning rate | 2 × 10−2 | 1 × 10−2 |
| GNN embed dimension | 32 | 64 |
Comparative analysis of the new model and several classical models.
| Model | The Data Source | ACC | AUC |
|---|---|---|---|
| DeepDDI | KEGG | 0.8217 | 0.8987 |
| Decagon | STITCH, etc. | -- | 0.8720 |
| KGNN | KEGG | 0.8834 | 0.9422 |
| SmileGNN | KEGG | 0.8936 | 0.9521 |
Comparison of the performance of SmileGNN and KGNN on datasets.
| Dataset | Model | Aggregator Type | Average Accuracy | Average AUC | Average F1-Score |
|---|---|---|---|---|---|
| KEGG | KGNN | sum | 0.8801 | 0.9390 | 0.8851 |
| concat | 0.8834 | 0.9422 | 0.8881 | ||
| neigh | 0.8642 | 0.9267 | 0.8690 | ||
| Average | 0.8759 | 0.9360 | 0.8807 | ||
| SmileGNN | sum | 0.8888 | 0.9467 | 0.8943 | |
| concat | 0.8936 | 0.9521 | 0.8957 | ||
| neigh | 0.8744 | 0.9329 | 0.8788 | ||
| Average | 0.8856 | 0.9439 | 0.8896 | ||
| PDD | KGNN | sum | 0.8920 | 0.9542 | 0.8947 |
| concat | 0.8970 | 0.9576 | 0.8995 | ||
| neigh | 0.8896 | 0.9518 | 0.8919 | ||
| Average | 0.8929 | 0.9545 | 0.8954 | ||
| SmileGNN | sum | 0.9040 | 0.9618 | 0.9056 | |
| concat | 0.9065 | 0.9642 | 0.9084 | ||
| neigh | 0.9000 | 0.9613 | 0.9018 | ||
| Average | 0.9035 | 0.9624 | 0.9053 |
Different aggregation methods on the PDD dataset.
| ACC | AUC | F1-Score | |
|---|---|---|---|
| sum | 0.9095 | 0.9647 | 0.9070 |
| concat | 0.9056 | 0.9618 | 0.9040 |
Figure 6Influence of the drug structure characteristic dimension on model performance. (a) Influence of different dimension on ACC. ACC increases with the increasement of drug structural feature dimension. (b) Influence of different dimension on AUC. AUC reaches the highest when drug structural feature dimension is 64. (c) Influence of different dimension on F1-Score. F1-Score increases with the increasement of drug structural feature dimension.
New DDIs.
| Drug1 | Drug2 | Score | Whether You Can Query DDI in DrugBank |
|---|---|---|---|
| DB00437 | DB09322 | 0.999964 | 0 |
| DB00450 | DB00768 | 0.999917 | 0 |
| DB00437 | DB00959 | 0.999854 | 0 |
| DB00660 | DB01656 | 0.999831 | 1 |
| DB00722 | DB01039 | 0.999817 | 1 |
| DB00437 | DB00633 | 0.999764 | 1 |
| DB00346 | DB01173 | 0.999618 | 1 |
| DB04908 | DB05521 | 0.999571 | 1 |
| DB00475 | DB00820 | 0.999542 | 0 |
| DB00040 | DB00564 | 0.999236 | 0 |
The corresponding drug names of drugs in new DDIs.
| Drug1 | Drug1 Name | Drug2 | Drug2 Name |
|---|---|---|---|
| DB00437 | Allopurinol | DB09322 | Zinc sulfate |
| DB00450 | Droperidol | DB00768 | Olopatadine |
| DB00437 | Allopurinol | DB00959 | Methylprednisolone |
| DB00660 | Metaxalone | DB01656 | Roflumilast |
| DB00722 | Lisinopril | DB01039 | Fenofibrate |
| DB00437 | Allopurinol | DB00633 | Dexmedetomidine |
| DB00346 | Alfuzosin | DB01173 | Orphenadrine |
| DB04908 | Flibanserin | DB05521 | Telaprevir |
| DB00475 | Chlordiazepoxide | DB00820 | Tadalafil |
| DB00040 | Glucagon | DB00564 | Carbamazepine |