| Literature DB >> 19635172 |
Renata Kabiljo1, Andrew B Clegg, Adrian J Shepherd.
Abstract
BACKGROUND: The automated extraction of gene and/or protein interactions from the literature is one of the most important targets of biomedical text mining research. In this paper we present a realistic evaluation of gene/protein interaction mining relevant to potential non-specialist users. Hence we have specifically avoided methods that are complex to install or require reimplementation, and we coupled our chosen extraction methods with a state-of-the-art biomedical named entity tagger.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19635172 PMCID: PMC2723093 DOI: 10.1186/1471-2105-10-233
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The F-scores produced by ABNER and BANNER when applied to four corpora using sloppy matching criteria.
| Y | J | P | I | |
| ABNER(B) | 80.4 | 76.0 | 85.3 | 78.3 |
| BANNER | 85.0 | 77.5 | 89.7 | 83.4 |
Abbreviations are as follows: Y = Yapex; J = JNLPBA evaluation corpus; P = ProSpecTome; I = ImmunoTome; B = BioCreAtivE.
The F-scores produced by ABNER and BANNER when applied to four corpora using strict matching criteria.
| Y | J | P | I | |
| ABNER(B) | 54.2 | 60.8 | 62.0 | 54.0 |
| BANNER | 62.0 | 61.0 | 68.7 | 53.9 |
Abbreviations are as follows: Y = Yapex; J = JNLPBA evaluation corpus; P = ProSpecTome; I = ImmunoTome; B = BioCreAtivE.
The (P)recision, (R)ecall and (F)-measure scores for BANNER when applied to five GPI corpora.
| A | B | H | I | L | |
| P | 80.5 | 97.5 | 70.8 | 60.4 | 80.0 |
| R | 85.4 | 85.1 | 83.2 | 69.6 | 88.7 |
| F | 82.9 | 90.8 | 76.5 | 64.6 | 84.1 |
Corpus abbreviations are as follows: A = AIMed; B = BioInfer; H = HPRD50; I = IEPA; L = LLL.
The performance (precision, recall and F-score) of six GPI extraction methods when applied to five GPI corpora using gold-standard named entities.
| A | B | H | I | L | |
| AkanePPI(A) | (57.0) | 29.2 | 61.5 | 60.2 | 69.6 |
| AkanePPI(B) | 29.1 | (56.8) | 52.0 | 66.2 | 76.7 |
| RelEx | 40 | 39 | 76 | 74 | 82 |
| Baseline(K) | 22.8 | 24 | 54 | 44.8 | (53.9) |
| Baseline(C) | 17 | 13 | 38 | 41 | 50 |
| OpenDMAP | 61 | 62.3 | 77.3 | 87.5 | 100 |
| AkanePPI(A) | (74.0) | 31.8 | 44.2 | 32.5 | 23.8 |
| AkanePPI(B) | 52.9 | (85.4) | 55.8 | 51.3 | 40.2 |
| RelEx | 50 | 45 | 64 | 61 | 72 |
| Baseline(K) | 51.5 | 52.2 | 66.9 | 56.4 | (72) |
| Baseline(C) | 95 | 99 | 100 | 100 | 100 |
| OpenDMAP | 9.1 | 5.9 | 10.4 | 2.1 | 2.4 |
| AkanePPI(A) | (64.4) | 30.5 | 51.4 | 42.2 | 35.4 |
| AkanePPI(B) | 37.5 | (68.2) | 53.8 | 57.8 | 52.8 |
| RelEx | 44 | 41 | 69 | 67 | 77 |
| Baseline(K) | 31.6 | 32.9 | 59.7 | 49.9 | (61.6) |
| Baseline(C) | 29 | 23 | 55 | 58 | 66 |
| OpenDMAP | 15.9 | 10.8 | 18.4 | 4.1 | 4.8 |
The figures for RelEx and Baseline(C) are taken from Pyysalo et al. (2008). (Note that we use a simplified version of BioInfer compared to the one used in that paper, so the figures for this corpus are not completely comparable.) Figures are given in brackets where a corpus was used to develop a given method. Corpus abbreviations are as follows: A = AIMed; B = BioInfer; H = HPRD50; I = IEPA; L = LLL.
The effect of using the BANNER entity tagger compared to gold-standard entities on the performance (precision, recall and F-score) of AkanePPI trained on AIMed.
| (A) | B | H | I | L | |
| With gold-standard entities | 57.0 | 29.2 | 61.5 | 60.2 | 69.6 |
| With BANNER | 34.3 | 23.8 | 26.8 | 13.0 | 39.8 |
| Δ precision | 22.7 | 5.4 | 34.7 | 47.2 | 29.8 |
| With gold-standard entities | 74.0 | 31.8 | 44.2 | 32.5 | 23.8 |
| With BANNER | 64.2 | 30.2 | 41.1 | 24.5 | 27.4 |
| Δ recall | 9.8 | 1.7 | 3.1 | 8.0 | -3.7 |
| With gold-standard entities | 64.4 | 30.5 | 51.4 | 42.2 | 35.4 |
| With BANNER | 44.7 | 26.6 | 32.4 | 17.0 | 32.5 |
| Δ F-score | 19.7 | 3.8 | 19.0 | 25.2 | 3.0 |
Corpus abbreviations are as follows: A = AIMed; B = BioInfer; H = HPRD50; I = IEPA; L = LLL.
The effect of using the BANNER entity tagger compared to gold-standard entities on the performance (precision, recall and F-score) of AkanePPI trained on BioInfer.
| A | (B) | H | I | L | |
| With gold-standard entities | 29.1 | 56.8 | 52.0 | 66.2 | 76.7 |
| With BANNER | 32.3 | 49.5 | 35.1 | 17.8 | 50.7 |
| Δ precision | -3.2 | 7.3 | 16.9 | 48.4 | 26.0 |
| With gold-standard entities | 52.9 | 85.4 | 55.8 | 51.3 | 40.2 |
| With BANNER | 38.9 | 42.2 | 37.4 | 30.1 | 23.2 |
| Δ recall | 14.0 | 43.2 | 18.4 | 21.2 | 17.0 |
| With gold-standard entities | 37.5 | 68.2 | 53.8 | 57.8 | 52.8 |
| With BANNER | 35.3 | 45.5 | 36.2 | 22.4 | 31.8 |
| Δ F-score | 2.2 | 22.7 | 17.6 | 35.4 | 21.0 |
Corpus abbreviations are as follows: A = AIMed; B = BioInfer; H = HPRD50; I = IEPA; L = LLL.
The effect of using the BANNER entity tagger compared to gold-standard entities on the performance (precision, recall and F-score) of our simple baseline algorithm, Baseline(K).
| A | B | H | I | (L) | |
| With gold-standard entities | 22.8 | 24 | 54 | 44.8 | 53.9 |
| With BANNER | 18.4 | 23.5 | 32 | 20 | 43.8 |
| Δ precision | 4.4 | 0.5 | 22 | 24.8 | 10.1 |
| With gold-standard entities | 51.5 | 52.2 | 66.9 | 56.4 | 72 |
| With BANNER | 42.1 | 33.5 | 49.7 | 27.5 | 51.2 |
| Δ recall | 9.4 | 18.7 | 17.2 | 28.9 | 20.8 |
| With gold-standard entities | 31.6 | 32.9 | 59.7 | 49.9 | 61.6 |
| With BANNER | 25.6 | 27.6 | 38.9 | 23.1 | 47.2 |
| Δ F-score | 6 | 5.3 | 20.8 | 26.8 | 14.4 |
Corpus abbreviations are as follows: A = AIMed; B = BioInfer; H = HPRD50; I = IEPA; L = LLL.
Performance of the Whatizit protein interaction pipeline, Ppi method.
| A | B | H | I | L | |
| Precision | 73.8 | 58.6 | 83.3 | 25.0 | 1.0 |
| Recall | 4.4 | 1.4 | 3.2 | 0.3 | 1.3 |
| F-score | 8.3 | 2.7 | 6.1 | 0.6 | 2.5 |
Corpus abbreviations are as follows: A = AIMed; B = BioInfer; H = HPRD50; I = IEPA; L = LLL.
Performance of the Whatizit protein interaction pipeline, Co3 method.
| A | B | H | I | L | |
| Precision | 29.3 | 31.3 | 24.5 | 12.4 | 31.8 |
| Recall | 14.5 | 10.7 | 15.9 | 4.6 | 8.8 |
| F-score | 19.4 | 15.9 | 19.3 | 6.7 | 13.8 |
Corpus abbreviations are as follows: A = AIMed; B = BioInfer; H = HPRD50; I = IEPA; L = LLL.
Performance of the Whatizit protein interaction pipeline, Co method.
| A | B | H | I | L | |
| Precision | 21.8 | 16.2 | 21.4 | 10.2 | 25.3 |
| Recall | 52.1 | 49.8 | 49.7 | 42.1 | 31.4 |
| F-score | 30.7 | 24.4 | 29.9 | 16.4 | 28.0 |
Corpus abbreviations are as follows: A = AIMed; B = BioInfer; H = HPRD50; I = IEPA; L = LLL.