| Literature DB >> 23566263 |
Junkyu Lee1, Seongsoon Kim, Sunwon Lee, Kyubum Lee, Jaewoo Kang.
Abstract
BACKGROUND: Most previous Protein Protein Interaction (PPI) studies evaluated their algorithms' performance based on "per-instance" precision and recall, in which the instances of an interaction relation were evaluated independently. However, we argue that this standard evaluation method should be revisited. In a large corpus, the same relation can be described in various different forms and, in practice, correctly identifying not all but a small subset of them would often suffice to detect the given interaction.Entities:
Mesh:
Year: 2013 PMID: 23566263 PMCID: PMC3618211 DOI: 10.1186/1472-6947-13-S1-S7
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Figure 1Dependency relations for the example sentences.
Relation keywords
| Activation | activate, increase, induce, initiate, stimulate, promote, catalyze, up-regulate, coactivate, potentiate, precipitate, reactivate |
|---|---|
| Deactivation | block, decrease, down-regulate, inactivate, inhibit, reduce, repress, suppress, interfere, antagonize, degrade, obstruct |
| Creating bond | associate, link, dimerize, heterodimerize, crystalize, methylate, phosphorylate, assemble, polymerize, bound, bond, oligomerize, glycosylate, bind, complex, form, conjugate, acetylate, couple |
| Breaking bond | cleave, demethylate, dephosphorylate, sever, unbind, depolymerize, dissociate, deacetylate, deglycosilate, disassemble |
| Signaling | mediate, modulate, participate, regulate, control, interact, react, contact, response, encode, recognize, stabilize, destabilize, target |
Per-relation and per-instance performance evaluation results
| Min #Inst. per Relation | #Uniq. Relations | #Instances | Methods | Per-Relation | Per-Instance | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pos | Neg | Pos | Neg | TP | FP | Precision | Recall | F-Score | TP | FP | Precision | Recall | F-Score | ||
| 1 | 618 | 2,312 | 1,000 | 4,834 | ours | 132 | 7 | 0.214 | 0.349 | 153 | 10 | 0.153 | 0.263 | ||
| hybrid [ | 477 | 344 | 0.581 | 694 | 584 | 0.5431 | |||||||||
| SST-PT [ | 195 | 102 | 0.657 | 0.316 | 0.427 | 217 | 170 | 0.5612 | 0.2172 | 0.3132 | |||||
| 2 | 197 | 695 | 579 | 2,827 | ours | 80 | 3 | 0.406 | 0.571 | 101 | 5 | 0.174 | 0.294 | ||
| hybrid [ | 183 | 164 | 0.527 | 389 | 327 | 0.543 | |||||||||
| SST-PT [ | 103 | 71 | 0.592 | 0.523 | 0.555 | 131 | 113 | 0.537 | 0.226 | 0.318 | |||||
| 3 | 89 | 314 | 363 | 1,835 | ours | 51 | 2 | 0.573 | 67 | 3 | 0.185 | 0.310 | |||
| hybrid [ | 88 | 98 | 0.473 | 0.640 | 250 | 236 | 0.514 | ||||||||
| SST-PT [ | 56 | 47 | 0.544 | 0.629 | 0.583 | 85 | 79 | 0.518 | 0.234 | 0.322 | |||||
| 4 | 51 | 181 | 249 | 1,349 | ours | 35 | 2 | 0.686 | 50 | 3 | 0.201 | 0.331 | |||
| hybrid [ | 49 | 67 | 0.422 | 0.587 | 160 | 170 | 0.485 | ||||||||
| SST-PT [ | 35 | 30 | 0.538 | 0.686 | 0.603 | 62 | 54 | 0.534 | 0.249 | 0.340 | |||||
| 5 | 28 | 107 | 157 | 984 | ours | 23 | 1 | 0.821 | 37 | 1 | 0.236 | 0.380 | |||
| hybrid [ | 28 | 44 | 0.389 | 0.560 | 105 | 105 | 0.500 | ||||||||
| SST-PT [ | 21 | 15 | 0.583 | 0.75 | 0.656 | 37 | 30 | 0.552 | 0.236 | 0.331 | |||||
Figure 2Per-relation and per-instance evaluation results with varying "min #instance per relation".
Performance improvement of baselines through pipelining our approach
| TP | FP | F | |||
|---|---|---|---|---|---|
| 694 | 584 | 0.543 | 0.694 | 0.609 | |
| 713 | 585 | 0.549 | 0.713 | 0.621 | |
| 217 | 170 | 0.561 | 0.217 | 0.313 | |
| 322 | 178 | 0.644 | 0.322 | 0.429 |
Performance of the two-tier extraction system: our rule-based system in the first tier
| Min #Inst. per Relation | #Uniq. Relation | #Instances | Methods | Per-Relation | Per-Instance | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pos | Neg | Pos | Neg | TP | FP | Precision | Recall | F-Score | TP | FP | Precision | Recall | F-Score | ||
| ours | 132 | 7 | 0.214 | 0.349 | 153 | 10 | 0.153 | 0.263 | |||||||
| +hybrid | 477 | 326 | 0.594 | 701 | 555 | 0.558 | |||||||||
| +SST-PT | 231 | 112 | 0.673 | 0.374 | 0.481 | 287 | 167 | 0.632 | 0.287 | 0.395 | |||||
| 1 | 618 | 2,312 | 1,000 | 4,834 | +SVM | 152 | 44 | 0.776 | 0.246 | 0.374 | 172 | 67 | 0.720 | 0.172 | 0.278 |
| +NB | 247 | 254 | 0.493 | 0.400 | 0.442 | 285 | 410 | 0.410 | 0.285 | 0.336 | |||||
| +DT | 155 | 40 | 0.795 | 0.251 | 0.382 | 180 | 67 | 0.729 | 0.180 | 0.289 | |||||
| +kNN | 215 | 226 | 0.488 | 0.348 | 0.406 | 242 | 310 | 0.438 | 0.242 | 0.312 | |||||
| ours | 80 | 3 | 0.406 | 0.571 | 101 | 5 | 0.174 | 0.294 | |||||||
| +hybrid | 183 | 151 | 0.548 | 395 | 310 | 0.560 | |||||||||
| +SST-PT | 127 | 62 | 0.672 | 0.645 | 0.658 | 186 | 102 | 0.646 | 0.321 | 0.429 | |||||
| 2 | 197 | 695 | 579 | 2,827 | +SVM | 94 | 36 | 0.723 | 0.477 | 0.575 | 116 | 63 | 0.648 | 0.200 | 0.306 |
| +NB | 113 | 110 | 0.507 | 0.574 | 0.538 | 151 | 196 | 0.435 | 0.261 | 0.326 | |||||
| +DT | 90 | 21 | 0.811 | 0.457 | 0.585 | 112 | 45 | 0.713 | 0.193 | 0.304 | |||||
| +kNN | 129 | 138 | 0.483 | 0.655 | 0.556 | 163 | 235 | 0.410 | 0.282 | 0.334 | |||||
| ours | 51 | 2 | 0.573 | 0.718 | 67 | 3 | 0.185 | 0.310 | |||||||
| +hybrid | 88 | 94 | 0.484 | 0.650 | 257 | 222 | 0.537 | ||||||||
| +SST-PT | 70 | 38 | 0.648 | 0.787 | 0.711 | 129 | 58 | 0.690 | 0.355 | 0.469 | |||||
| 3 | 89 | 314 | 363 | 1,835 | +SVM | 58 | 22 | 0.725 | 0.652 | 0.687 | 78 | 32 | 0.709 | 0.215 | 0.330 |
| +NB | 59 | 54 | 0.522 | 0.663 | 0.584 | 86 | 89 | 0.491 | 0.237 | 0.320 | |||||
| +DT | 53 | 5 | 0.914 | 0.596 | 70 | 8 | 0.897 | 0.193 | 0.318 | ||||||
| +kNN | 66 | 85 | 0.437 | 0.742 | 0.550 | 97 | 153 | 0.388 | 0.267 | 0.316 | |||||
| ours | 35 | 2 | 0.686 | 50 | 3 | 0.201 | 0.331 | ||||||||
| +hybrid | 51 | 63 | 0.447 | 0.618 | 171 | 150 | 0.533 | ||||||||
| +SST-PT | 45 | 27 | 0.625 | 0.882 | 0.732 | 92 | 43 | 0.681 | 0.369 | 0.479 | |||||
| 4 | 51 | 181 | 249 | 1,349 | +SVM | 38 | 15 | 0.717 | 0.745 | 0.731 | 55 | 25 | 0.688 | 0.221 | 0.335 |
| +NB | 42 | 43 | 0.494 | 0.824 | 0.618 | 67 | 86 | 0.438 | 0.269 | 0.333 | |||||
| +DT | 36 | 5 | 0.878 | 0.706 | 0.783 | 54 | 8 | 0.871 | 0.217 | 0.347 | |||||
| +kNN | 44 | 56 | 0.440 | 0.863 | 0.583 | 74 | 104 | 0.416 | 0.297 | 0.347 | |||||
| ours | 23 | 1 | 0.821 | 37 | 1 | 0.236 | 0.380 | ||||||||
| +hybrid | 28 | 40 | 0.412 | 0.584 | 108 | 88 | 0.551 | ||||||||
| +SST-PT | 25 | 14 | 0.641 | 0.893 | 0.746 | 57 | 22 | 0.722 | 0.363 | 0.483 | |||||
| 5 | 28 | 107 | 157 | 984 | +SVM | 25 | 12 | 0.676 | 0.893 | 0.769 | 40 | 18 | 0.690 | 0.255 | 0.372 |
| +NB | 24 | 21 | 0.533 | 0.857 | 0.657 | 40 | 40 | 0.500 | 0.255 | 0.338 | |||||
| +DT | 23 | 4 | 0.852 | 0.821 | 0.836 | 37 | 8 | 0.822 | 0.236 | 0.367 | |||||
| +kNN | 26 | 31 | 0.456 | 0.929 | 0.612 | 45 | 51 | 0.469 | 0.287 | 0.356 | |||||
Performance of the "ours+SVM" model with incremental feature sets
| Min #Inst. per Relation | #Uniq. Relation | #Instances | Methods | Per-Relation | Per-Instance | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pos | Neg | Pos | Neg | TP | FP | Precision | Recall | F-Score | TP | FP | Precision | Recall | F-Score | ||
| 1 | 618 | 2,312 | 1,000 | 4,834 | ours+SVM | 152 | 44 | 0.776 | 172 | 67 | 0.720 | 0.172 | 0.278 | ||
| +dep. len. | 149 | 35 | 0.241 | 0.371 | 170 | 52 | 0.170 | 0.278 | |||||||
| +dist. | 150 | 39 | 0.794 | 0.243 | 0.372 | 171 | 63 | 0.731 | 0.171 | 0.277 | |||||
| +both | 151 | 38 | 0.799 | 0.244 | 173 | 58 | 0.749 | ||||||||
| 2 | 197 | 695 | 579 | 2,827 | ours+SVM | 94 | 36 | 0.723 | 0.575 | 116 | 63 | 0.648 | 0.200 | 0.306 | |
| +dep. len. | 94 | 32 | 117 | 53 | 0.688 | ||||||||||
| +dist. | 92 | 35 | 0.724 | 0.467 | 0.568 | 117 | 54 | 0.684 | |||||||
| +both | 94 | 32 | 116 | 51 | 0.200 | 0.311 | |||||||||
| 3 | 89 | 314 | 363 | 1,835 | ours+SVM | 58 | 22 | 78 | 32 | 0.709 | |||||
| +dep. len. | 57 | 22 | 0.722 | 0.640 | 0.679 | 76 | 31 | 0.209 | 0.323 | ||||||
| +dist. | 57 | 22 | 0.722 | 0.640 | 0.679 | 75 | 31 | 0.708 | 0.207 | 0.320 | |||||
| +both | 57 | 22 | 0.722 | 0.640 | 0.679 | 76 | 31 | 0.209 | 0.323 | ||||||
| 4 | 51 | 181 | 249 | 1,349 | ours+SVM | 38 | 15 | 0.717 | 55 | 25 | 0.688 | ||||
| +dep. len. | 36 | 13 | 0.706 | 0.720 | 52 | 19 | 0.209 | 0.325 | |||||||
| +dist. | 38 | 15 | 0.717 | 55 | 25 | 0.688 | |||||||||
| +both | 36 | 13 | 0.706 | 0.720 | 52 | 19 | 0.209 | 0.325 | |||||||
| 5 | 28 | 107 | 157 | 984 | ours+SVM | 25 | 12 | 0.676 | 0.769 | 40 | 18 | 0.690 | 0.372 | ||
| +dep. len. | 25 | 11 | 40 | 15 | |||||||||||
| +dist. | 25 | 12 | 0.676 | 0.769 | 40 | 18 | 0.690 | 0.372 | |||||||
| +both | 25 | 11 | 40 | 15 | |||||||||||
1Note that the result shown here is different from the ones reported in [6]. It may be due to the differences in SVM optimization parameters used for the experiments. We obtained the codes from the authors' web page at http://staff.science.uva.nl/ui/PPIs.zip and ran as is with the parameters: RBF kernel gamma - 0.0145; C = 9; Weka Cost-SensitiveClassifier optimization.
2In [20], the authors reported the macro-averaged precision, recall, and F-score, which are incomparable to other performance results. Following the general convention in PPI research, we compared the performance using the precision, recall and F-score computed with only positive class prediction results. The original implementation was not available. We implemented it on SVM-LIGHT-TK ver 1.2 obtained from http://disi.unitn.it/moschitti/Tree-Kernel.htm. The optimization parameters used are C = 8 and λ = 0.6 (as reported in [20])
Performance of the two-tier extraction system: our rule-based system in the second tier
| Min #Inst. per Relation | #Uniq. Relation | #Instances | Methods | Per-Relation | Per-Instance | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pos | Neg | Neg | TP | FP | Precision | Recall | F-Score | TP | FP | Precision | Recall | F-Score | |||
| 1 | 618 | 2,312 | 1,000 | 4,834 | ours | 132 | 7 | 0.214 | 0.349 | 153 | 10 | 0.153 | 0.263 | ||
| hybrid+ | 479 | 344 | 0.582 | 705 | 584 | 0.547 | |||||||||
| SST-PT+ | 257 | 108 | 0.704 | 0.416 | 0.523 | 322 | 178 | 0.644 | 0.322 | 0.429 | |||||
| SVM+ | 131 | 57 | 0.697 | 0.212 | 0.325 | 140 | 84 | 0.625 | 0.140 | 0.229 | |||||
| NB+ | 235 | 247 | 0.488 | 0.380 | 0.427 | 261 | 410 | 0.389 | 0.261 | 0.312 | |||||
| DT+ | 157 | 73 | 0.683 | 0.254 | 0.370 | 169 | 120 | 0.585 | 0.169 | 0.262 | |||||
| kNN+ | 219 | 260 | 0.457 | 0.354 | 0.399 | 237 | 366 | 0.393 | 0.237 | 0.296 | |||||
| 2 | 197 | 695 | 579 | 2,827 | ours | 80 | 3 | 0.406 | 0.571 | 101 | 5 | 0.174 | 0.294 | ||
| hybrid+ | 183 | 164 | 0.527 | 399 | 327 | 0.550 | |||||||||
| SST-PT+ | 132 | 73 | 0.644 | 0.670 | 0.657 | 203 | 118 | 0.632 | 0.351 | 0.451 | |||||
| SVM+ | 88 | 48 | 0.647 | 0.447 | 0.529 | 106 | 78 | 0.576 | 0.183 | 0.278 | |||||
| NB+ | 107 | 113 | 0.486 | 0.543 | 0.513 | 140 | 230 | 0.378 | 0.242 | 0.295 | |||||
| DT+ | 88 | 41 | 0.682 | 0.447 | 0.540 | 104 | 85 | 0.550 | 0.180 | 0.271 | |||||
| kNN+ | 125 | 151 | 0.453 | 0.635 | 0.529 | 152 | 250 | 0.378 | 0.263 | 0.310 | |||||
| 3 | 89 | 314 | 363 | 1,835 | ours | 51 | 2 | 0.573 | 67 | 3 | 0.185 | 0.310 | |||
| hybrid+ | 88 | 98 | 0.473 | 0.640 | 262 | 236 | 0.526 | ||||||||
| SST-PT+ | 71 | 48 | 0.597 | 0.798 | 0.683 | 136 | 82 | 0.624 | 0.375 | 0.468 | |||||
| SVM+ | 43 | 28 | 0.606 | 0.483 | 0.538 | 59 | 40 | 0.596 | 0.163 | 0.256 | |||||
| NB+ | 59 | 69 | 0.461 | 0.663 | 0.544 | 82 | 113 | 0.421 | 0.226 | 0.294 | |||||
| DT+ | 49 | 17 | 0.742 | 0.551 | 0.632 | 61 | 26 | 0.701 | 0.168 | 0.271 | |||||
| kNN+ | 59 | 86 | 0.407 | 0.663 | 0.504 | 84 | 151 | 0.357 | 0.231 | 0.281 | |||||
| 4 | 51 | 181 | 249 | 1,349 | ours | 35 | 2 | 0.686 | 50 | 3 | 0.201 | 0.331 | |||
| hybrid+ | 50 | 67 | 0.427 | 0.595 | 171 | 171 | 0.500 | ||||||||
| SST-PT+ | 45 | 32 | 0.584 | 0.882 | 0.703 | 98 | 57 | 0.632 | 0.394 | 0.485 | |||||
| SVM+ | 33 | 19 | 0.635 | 0.647 | 0.641 | 47 | 35 | 0.573 | 0.189 | 0.284 | |||||
| NB+ | 41 | 53 | 0.436 | 0.804 | 0.565 | 64 | 116 | 0.356 | 0.257 | 0.299 | |||||
| DT+ | 32 | 7 | 0.821 | 0.627 | 0.711 | 43 | 11 | 0.796 | 0.173 | 0.284 | |||||
| kNN+ | 39 | 61 | 0.390 | 0.765 | 0.517 | 59 | 118 | 0.333 | 0.237 | 0.277 | |||||
| 5 | 28 | 107 | 157 | 984 | ours | 23 | 1 | 0.821 | 37 | 1 | 0.236 | 0.380 | |||
| hybrid+ | 28 | 44 | 0.389 | 0.560 | 114 | 105 | 0.521 | ||||||||
| SST-PT+ | 25 | 16 | 0.610 | 0.893 | 0.725 | 66 | 31 | 0.680 | 0.420 | 0.519 | |||||
| SVM+ | 21 | 14 | 0.600 | 0.750 | 0.667 | 29 | 22 | 0.569 | 0.185 | 0.279 | |||||
| NB+ | 23 | 41 | 0.359 | 0.821 | 0.500 | 35 | 73 | 0.324 | 0.223 | 0.264 | |||||
| DT+ | 18 | 3 | 0.857 | 0.643 | 0.735 | 26 | 4 | 0.867 | 0.166 | 0.279 | |||||
| kNN+ | 24 | 38 | 0.387 | 0.857 | 0.533 | 35 | 68 | 0.340 | 0.223 | 0.269 | |||||