| Literature DB >> 29297371 |
Wen Zhang1, Xiang Yue2, Feng Liu2, Yanlin Chen3, Shikui Tu4, Xining Zhang5.
Abstract
BACKGROUND: Drug side effects are one of main concerns in the drug discovery, which gains wide attentions. Investigating drug side effects is of great importance, and the computational prediction can help to guide wet experiments. As far as we known, a great number of computational methods have been proposed for the side effect predictions. The assumption that similar drugs may induce same side effects is usually employed for modeling, and how to calculate the drug-drug similarity is critical in the side effect predictions.Entities:
Keywords: Drug side effects; Linear neighborhood similarity; Missing side effects
Mesh:
Year: 2017 PMID: 29297371 PMCID: PMC5751767 DOI: 10.1186/s12918-017-0477-2
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Fig. 1Predicting side effects of new drugs (a) and predicting missing side effects of approved drugs (b)
Details about benchmark datasets
| Dataset | #drug | #side effect | #substructure | #target | #transporter | #enzyme | #pathway | # indication |
|---|---|---|---|---|---|---|---|---|
| Pauwels’s dataset | 888 | 1385 | 881 | N.A | N.A | N.A | N.A | N.A |
| Mizutani’s dataset | 658 | 1339 | 881 | 1368 | N.A | N.A | N.A | N.A |
| Liu’s dataset | 832 | 1385 | 881 | 786 | 72 | 111 | 173 | 869 |
| SIDER 4 dataset | 1080 | 2260 | 881 | 1050 | 96 | 160 | 268 | 2537 |
N.A means unavailable information
Fig. 2A unified frame of predicting side effects of drugs by using linear neighborhood similarity
Fig. 3Drug association profiles defined on known side effects
Fig. 4AUPR scores of different similarity-based models based on Liu’s data under different conditions. a~f demonstrate the performances of models based on different drug features in Liu’s data. LN-200: the models based on the LN similarity and 200 neighbors
5-CV performances of prediction models on Liu’s dataset
| Data | Methods | AUC | AUPR | Hamming Loss | Ranking Loss | One Error | Coverage | Average Precision |
|---|---|---|---|---|---|---|---|---|
| Enzyme | LNSM | 0.8898 | 0.4187 | 0.0473 | 0.0821 | 0.1659 | 846.3846 | 0.4696 |
| Pathway | LNSM | 0.8886 | 0.4273 | 0.0470 | 0.0776 | 0.1647 | 814.6298 | 0.4932 |
| Target | LNSM | 0.8991 | 0.4708 | 0.0452 | 0.0690 | 0.1538 | 792.3726 | 0.5216 |
| Transporter | LNSM | 0.8896 | 0.4147 | 0.0477 | 0.0817 | 0.1611 | 849.3161 | 0.4762 |
| Treatment | LNSM | 0.9013 | 0.4836 | 0.0446 | 0.0710 | 0.1262 | 806.8558 | 0.5232 |
| Substructure | LNSM | 0.8944 | 0.4538 | 0.0459 | 0.0714 | 0.1490 | 803.5228 | 0.5184 |
| All data | LNSM-SMI | 0.8986 | 0.5053 | 0.0435 | 0.0670 | 0.1154 | 789.8486 | 0.5476 |
Fig. 5AUPR scores of LNSM-MSE models using different similarities for SEAD task
Fig. 6AUPR scores of models based on different features for SEAD task (neighbor number = 600)
Fig. 7The visualization of parameters and AUPR scores of LNSM-MSE
Performances of our methods and other state-of-the-art methods
| Dataset | Method | AUC | AUPR | Hamming Loss | Ranking Loss | One Error | Coverage | Average Precision |
|---|---|---|---|---|---|---|---|---|
| Pauwels’s dataset | Pauwels’s method | 0.8827 | 0.3883 | 0.0577 | 0.0827 | 0.1779 | 832.7827 | 0.4616 |
| LNSM | 0.8941 | 0.4491 | 0.0444 | 0.0713 | 0.1633 | 790.9471 | 0.5126 | |
| Mizutani’s dataset | Mizutani’s method | 0.8665 | 0.4107 | 0.0557 | 0.0888 | 0.1854 | 862.9757 | 0.4795 |
| LNSM | 0.8946 | 0.4624 | 0.0499 | 0.0746 | 0.1581 | 805.8875 | 0.5170 | |
| Liu’s dataset | Liu’s method | 0.8850 | 0.2514 | 0.0721 | 0.0927 | 0.9291 | 837.4579 | 0.2610 |
| FS-MLKNN | 0.9034 | 0.4802 | 0.0524 | 0.0703 | 0.1202 | 795.9435 | 0.5134 | |
| LNSM-SMI | 0.8986 | 0.5053 | 0.0435 | 0.0670 | 0.1154 | 789.8486 | 0.5476 |
Performances of different methods in the independent test
| Method | AUC | AUPR | Hamming Loss | Ranking Loss | One Error | Coverage | Average Precision |
|---|---|---|---|---|---|---|---|
| Liu’s method | 0.8772 | 0.1766 | 0.0421 | 0.1150 | 0.9870 | 1587.5663 | 0.1816 |
| FS-MLKNN | 0.8722 | 0.3109 | 0.0373 | 0.1038 | 0.1851 | 1535.9223 | 0.3649 |
| LNSM-SMI | 0.8786 | 0.3465 | 0.0291 | 0.0969 | 0.2013 | 1488.2977 | 0.3906 |
Performances of LNSM-MSE and benchmark methods evaluated by 5-CV
| Dataset | Methods | AUPR | AUC | SN | SP | Precision | Accuracy | F |
|---|---|---|---|---|---|---|---|---|
| Pauwels’s dataset | Liu’s method | 0.345 | 0.920 | 0.643 | 0.950 | 0.400 | 0.934 | 0.493 |
| Cheng’s method | 0.588 | 0.922 | 0.587 | 0.975 | 0.547 | 0.955 | 0.566 | |
| RBMBM | 0.612 | 0.941 | 0.605 | 0.977 | 0.579 | 0.958 | 0.592 | |
| INBM | 0.641 | 0.934 | 0.608 | 0.979 | 0.605 | 0.961 | 0.607 | |
| LNSM-MSE | 0.671 | 0.948 | 0.629 | 0.980 | 0.625 | 0.963 | 0.627 | |
| Mizutani’s dataset | Liu’s method | 0.366 | 0.918 | 0.637 | 0.948 | 0.418 | 0.930 | 0.505 |
| Cheng’s method | 0.599 | 0.923 | 0.593 | 0.973 | 0.560 | 0.951 | 0.576 | |
| RBMBM | 0.619 | 0.939 | 0.614 | 0.974 | 0.581 | 0.954 | 0.597 | |
| INBM | 0.646 | 0.932 | 0.616 | 0.976 | 0.605 | 0.956 | 0.611 | |
| LNSM-MSE | 0.676 | 0.944 | 0.627 | 0.979 | 0.635 | 0.959 | 0.631 | |
| Liu’s dataset | Liu’s method | 0.278 | 0.907 | 0.669 | 0.930 | 0.341 | 0.917 | 0.452 |
| Cheng’s method | 0.592 | 0.922 | 0.589 | 0.974 | 0.550 | 0.954 | 0.569 | |
| RBMBM | 0.616 | 0.941 | 0.608 | 0.976 | 0.581 | 0.957 | 0.594 | |
| INBM | 0.641 | 0.934 | 0.607 | 0.979 | 0.606 | 0.959 | 0.606 | |
| LNSM-MSE | 0.673 | 0.948 | 0.631 | 0.979 | 0.624 | 0.962 | 0.628 |