| Literature DB >> 31660848 |
Konstantinos Pliakos1,2, Celine Vens3,4.
Abstract
BACKGROUND: Network inference is crucial for biomedicine and systems biology. Biological entities and their associations are often modeled as interaction networks. Examples include drug protein interaction or gene regulatory networks. Studying and elucidating such networks can lead to the comprehension of complex biological processes. However, usually we have only partial knowledge of those networks and the experimental identification of all the existing associations between biological entities is very time consuming and particularly expensive. Many computational approaches have been proposed over the years for network inference, nonetheless, efficiency and accuracy are still persisting open problems. Here, we propose bi-clustering tree ensembles as a new machine learning method for network inference, extending the traditional tree-ensemble models to the global network setting. The proposed approach addresses the network inference problem as a multi-label classification task. More specifically, the nodes of a network (e.g., drugs or proteins in a drug-protein interaction network) are modelled as samples described by features (e.g., chemical structure similarities or protein sequence similarities). The labels in our setting represent the presence or absence of links connecting the nodes of the interaction network (e.g., drug-protein interactions in a drug-protein interaction network).Entities:
Keywords: Biomedical networks; Interaction prediction; Multi-label classification; Network inference; Tree-ensembles
Year: 2019 PMID: 31660848 PMCID: PMC6819564 DOI: 10.1186/s12859-019-3104-y
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Illustration of a (bi-partite) DPI interaction network
Fig. 2The prediction setting of an interaction network
Fig. 3A description of the two learning approaches. Left the global single output and right the local multiple output approach
Fig. 4Illustration of a bi-clustering tree along with the corresponding interaction matrix that is partitioned by that tree. Let ϕ and ϕ be the features of the row and column instances respectively
Fig. 5Illustration of the labeling strategy that is followed. Prediction of an interaction between a new row instance and a column instance included in learning
The datasets used in the evaluation procedure
|
| | | | | | |
|---|---|---|---|
| ERN | 1164×154 | 445−445 | 3293/179256 (1.8 |
| SRN | 1821×113 | 1685−1685 | 3663/205773 (1.7 |
| DPI-E | 664×445 | 664−445 | 2926/295480 (1 |
| DPI-IC | 204×210 | 204−210 | 1476/42840 (3.4 |
| DPI-GR | 95×223 | 95−223 | 635/21185 (3 |
| DPI-NR | 26×54 | 26−54 | 90/1404 (6.4 |
| CPIv3.1 | 2154×2458 | 2154−2458 | 138513/5294532 (2.6 |
| CPIv4 | 2154×2458 | 2154−2458 | 258618/5294532 (4.9 |
AUPR and AUROC results for the compared methods. The tree-ensemble setting is the ERT
| AUPR | |||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
| ern | 0.397 | 0.397 | 0.404 | 0.043 | 0.041 | 0.043 | 0.048 | 0.047 | 0.035 |
| dpie | 0.645 | 0.638 | 0.626 | 0.303 | 0.294 | 0.309 | 0.175 | 0.163 | 0.179 |
| dpii | 0.544 | 0.535 | 0.541 | 0.327 | 0.326 | 0.33 | 0.073 | 0.07 | 0.074 |
| dpig | 0.239 | 0.24 | 0.234 | 0.345 | 0.329 | 0.318 | 0.084 | 0.083 | 0.073 |
| dpin | 0.385 | 0.362 | 0.395 | 0.507 | 0.506 | 0.513 | 0.106 | 0.105 | 0.106 |
| srn | 0.157 | 0.158 | 0.17 | 0.028 | 0.03 | 0.028 | 0.022 | 0.024 | 0.018 |
| Avg |
| 0.388 |
|
| 0.254 | 0.257 |
| 0.082 | 0.081 |
| AUROC | |||||||||
| ern | 0.845 | 0.849 | 0.856 | 0.603 | 0.594 | 0.602 | 0.729 | 0.721 | 0.645 |
| dpie | 0.873 | 0.865 | 0.87 | 0.825 | 0.835 | 0.815 | 0.719 | 0.713 | 0.684 |
| dpii | 0.824 | 0.82 | 0.824 | 0.793 | 0.789 | 0.8 | 0.582 | 0.566 | 0.54 |
| dpig | 0.662 | 0.654 | 0.659 | 0.854 | 0.85 | 0.848 | 0.655 | 0.658 | 0.601 |
| dpin | 0.625 | 0.61 | 0.614 | 0.786 | 0.777 | 0.78 | 0.578 | 0.572 | 0.535 |
| srn | 0.794 | 0.796 | 0.807 | 0.544 | 0.54 | 0.532 | 0.551 | 0.568 | 0.497 |
| Avg | 0.771 | 0.766 |
| 0.734 | 0.731 |
|
| 0.633 | 0.584 |
Best values appear in boldface
AUPR and AUROC results for the compared methods. The tree-ensemble setting is the RF
| AUPR | |||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
| ern | 0.399 | 0.386 | 0.404 | 0.049 | 0.047 | 0.055 | 0.065 | 0.052 | 0.052 |
| dpie | 0.613 | 0.607 | 0.6 | 0.32 | 0.302 | 0.323 | 0.175 | 0.155 | 0.167 |
| dpii | 0.518 | 0.5 | 0.496 | 0.341 | 0.324 | 0.342 | 0.065 | 0.068 | 0.07 |
| dpig | 0.233 | 0.226 | 0.219 | 0.35 | 0.318 | 0.329 | 0.085 | 0.077 | 0.069 |
| dpin | 0.39 | 0.333 | 0.367 | 0.502 | 0.481 | 0.495 | 0.105 | 0.1 | 0.095 |
| srn | 0.149 | 0.133 | 0.168 | 0.028 | 0.032 | 0.025 | 0.023 | 0.023 | 0.018 |
| Avg |
| 0.364 | 0.376 |
| 0.251 | 0.262 |
| 0.079 | 0.079 |
| AUROC | |||||||||
| ern | 0.836 | 0.846 | 0.857 | 0.602 | 0.645 | 0.61 | 0.763 | 0.732 | 0.642 |
| dpie | 0.831 | 0.87 | 0.868 | 0.819 | 0.826 | 0.819 | 0.736 | 0.712 | 0.675 |
| dpii | 0.792 | 0.817 | 0.814 | 0.808 | 0.799 | 0.801 | 0.579 | 0.573 | 0.529 |
| dpig | 0.574 | 0.692 | 0.655 | 0.853 | 0.863 | 0.855 | 0.639 | 0.641 | 0.589 |
| dpin | 0.511 | 0.661 | 0.583 | 0.75 | 0.775 | 0.774 | 0.59 | 0.567 | 0.505 |
| srn | 0.812 | 0.779 | 0.806 | 0.518 | 0.569 | 0.532 | 0.558 | 0.558 | 0.496 |
| Avg | 0.726 |
| 0.764 | 0.725 |
| 0.732 |
| 0.631 | 0.573 |
Best values appear in boldface
AUPR and AUROC results for the compared methods
| AUPR | |||||||||
|
|
|
|
|
|
|
| |||
| ern | 0.397 | 0.401 | 0.378 | 0.043 | 0.03 | 0.032 | 0.048 | 0.029 | 0.029 |
| dpie | 0.645 | 0.489 | 0.635 | 0.303 | 0.217 | 0.233 | 0.175 | 0.047 | 0.122 |
| dpii | 0.544 | 0.338 | 0.542 | 0.327 | 0.245 | 0.294 | 0.073 | 0.075 | 0.054 |
| dpig | 0.239 | 0.168 | 0.197 | 0.345 | 0.277 | 0.294 | 0.084 | 0.033 | 0.06 |
| dpin | 0.385 | 0.373 | 0.351 | 0.507 | 0.476 | 0.48 | 0.106 | 0.079 | 0.06 |
| srn | 0.157 | 0.126 | 0.133 | 0.028 | 0.032 | 0.032 | 0.022 | 0.019 | 0.02 |
| Avg |
| 0.316 | 0.373 |
| 0.213 | 0.228 |
| 0.047 | 0.058 |
| AUROC | |||||||||
| ern | 0.845 | 0.861 | 0.842 | 0.603 | 0.552 | 0.549 | 0.729 | 0.579 | 0.571 |
| dpie | 0.873 | 0.832 | 0.823 | 0.825 | 0.823 | 0.729 | 0.719 | 0.571 | 0.602 |
| dpii | 0.824 | 0.749 | 0.773 | 0.793 | 0.777 | 0.767 | 0.582 | 0.569 | 0.533 |
| dpig | 0.662 | 0.527 | 0.528 | 0.854 | 0.815 | 0.835 | 0.655 | 0.472 | 0.508 |
| dpin | 0.625 | 0.622 | 0.553 | 0.786 | 0.8 | 0.807 | 0.578 | 0.532 | 0.423 |
| srn | 0.794 | 0.832 | 0.8 | 0.544 | 0.532 | 0.505 | 0.551 | 0.493 | 0.518 |
| Avg |
| 0.737 | 0.72 |
| 0.717 | 0.699 |
| 0.536 | 0.526 |
Best values appear in boldface