| Literature DB >> 32033537 |
Konstantinos Pliakos1,2, Celine Vens3,4.
Abstract
BACKGROUND: Computational prediction of drug-target interactions (DTI) is vital for drug discovery. The experimental identification of interactions between drugs and target proteins is very onerous. Modern technologies have mitigated the problem, leveraging the development of new drugs. However, drug development remains extremely expensive and time consuming. Therefore, in silico DTI predictions based on machine learning can alleviate the burdensome task of drug development. Many machine learning approaches have been proposed over the years for DTI prediction. Nevertheless, prediction accuracy and efficiency are persisting problems that still need to be tackled. Here, we propose a new learning method which addresses DTI prediction as a multi-output prediction task by learning ensembles of multi-output bi-clustering trees (eBICT) on reconstructed networks. In our setting, the nodes of a DTI network (drugs and proteins) are represented by features (background information). The interactions between the nodes of a DTI network are modeled as an interaction matrix and compose the output space in our problem. The proposed approach integrates background information from both drug and target protein spaces into the same global network framework.Entities:
Keywords: Drug-target networks; Interaction prediction; Network reconstruction; Tree-ensembles; multi-output prediction
Mesh:
Substances:
Year: 2020 PMID: 32033537 PMCID: PMC7006075 DOI: 10.1186/s12859-020-3379-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Illustration of a (bi-partite) DPI interaction network
The drug-protein networks (DPN) used in the experimental evaluation are presented
| | | | | | | |
|---|---|---|---|
| 54×26 | 54−26 | 90/1404 (6.4 | |
| 223×95 | 223−95 | 635/21185 (3 | |
| 210×204 | 210−204 | 1476/42840 (3.4 | |
| 445×664 | 445−664 | 2926/295480 (1 |
AUROC results for the compared methods
| AUROC | |||||
|---|---|---|---|---|---|
| Data | |||||
| NR | 0.787* | 0.851* | 0.807* | 0.794* | |
| GR | 0.857* | 0.867* | 0.842* | 0.847* | |
| IC | 0.780* | 0.792 | 0.737* | 0.783* | |
| E | 0.827* | 0.777* | 0.815* | 0.794* | |
| Avg | 0.813 | 0.822 | 0.800 | 0.805 | |
| NR | 0.614* | 0.747* | 0.667* | 0.525* | |
| GR | 0.846* | 0.861* | 0.776* | 0.800* | |
| IC | 0.931* | 0.949* | 0.887* | 0.909* | |
| E | 0.924* | 0.940* | 0.904* | 0.906* | |
| Avg | 0.829 | 0.874 | 0.809 | 0.785 | |
| NR | 0.676 | 0.634* | 0.683 | 0.554* | 0.469* |
| GR | 0.792* | 0.800* | 0.475* | 0.630* | |
| IC | 0.719* | 0.731 | 0.466* | 0.649* | |
| E | 0.785* | 0.749* | 0.490* | 0.682* | |
| Avg | 0.733 | 0.741 | 0.496 | 0.608 | |
AUPR results for the compared methods
| AUPR | |||||
|---|---|---|---|---|---|
| Data | |||||
| NR | 0.480 | 0.444* | 0.436 | 0.467 | |
| GR | 0.334 | 0.329 | 0.312 | 0.324 | |
| IC | 0.317* | 0.213* | 0.307 | ||
| E | 0.314 | 0.316 | 0.255* | 0.353 | |
| Avg | 0.364 | 0.352 | 0.304 | 0.363 | |
| NR | 0.424* | 0.485* | 0.338* | 0.384* | |
| GR | 0.504* | 0.406* | 0.324* | 0.365* | |
| IC | 0.791* | 0.798 | 0.724* | 0.779* | |
| E | 0.785 | 0.784 | 0.735* | 0.752* | |
| Avg | 0.626 | 0.621 | 0.530 | 0.570 | |
| NR | 0.160 | 0.151 | 0.100 | 0.080* | |
| GR | 0.156 | 0.156 | 0.042* | 0.079* | |
| IC | 0.229 | 0.227 | 0.041* | 0.198* | |
| E | 0.214 | 0.218 | 0.016* | 0.190* | |
| Avg | 0.191 | 0.189 | 0.050 | 0.137 | |
Fig. 2The prediction setting of a DTI network
Fig. 3Illustration of a bi-clustering tree along with the corresponding interaction matrix that is partitioned by that tree. Let ϕ and ϕ be the features of the row and column instances, respectively