Makoto Miwa, Sophia Ananiadou.
Abstract
BACKGROUND: Biomedical event extraction has been a major focus of biomedical natural language processing (BioNLP) research since the first BioNLP shared task was held in 2009. Accordingly, a large number of event extraction systems have been developed. Most such systems, however, have been developed for specific tasks and/or incorporate task-specific settings, making their application to new corpora and tasks problematic without modification of the systems themselves. There is thus a need for event extraction systems that can achieve high levels of accuracy when applied to corpora in new domains, without the need for exhaustive tuning or modification, whilst retaining competitive levels of performance.
Year: 2015 PMID: 26201408 PMCID: PMC4511382 DOI: 10.1186/1471-2105-16-S10-S7
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1. Example of event structure in the CG task.
Figure 2. Example of configuration for the CG task. Some types are omitted for brevity.
Effect of instance generation generalisations and the stacking method on the PC development data set.
| Generation | Stacking | Recall | Precision | F-score (%) |
|---|---|---|---|---|
| × | × | 42.87 | 47.72 | 45.16 |
| ✓ | × | 43.37 | 46.42 | 44.84 |
| ✓ | ✓ | 43.59 | 48.77 | 46.04 |
Instance generation generalisations (Generation) and the stacking method (Stacking) are applied to the PC development data set.
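The F-scores in the table above can be checked against the recall and precision columns. The following is a minimal sketch, assuming the reported F-score is the balanced F1 (the harmonic mean of recall and precision); the function name `f_score` and the tolerance are illustrative choices, not taken from the paper.

```python
def f_score(recall: float, precision: float) -> float:
    """Balanced F-score (harmonic mean of recall and precision), in percent.

    Assumption: the tables report the balanced F1; this is not stated
    explicitly in the text above.
    """
    if recall + precision == 0:
        return 0.0
    return 2 * recall * precision / (recall + precision)


# (recall, precision, reported F-score) rows copied from the PC development table.
rows = [
    (42.87, 47.72, 45.16),  # no generation generalisations, no stacking
    (43.37, 46.42, 44.84),  # generation generalisations only
    (43.59, 48.77, 46.04),  # generation generalisations and stacking
]

for recall, precision, reported in rows:
    computed = f_score(recall, precision)
    # Recall and precision are already rounded to two decimals, so the
    # recomputed value may differ from the reported one in the last digit.
    print(f"R={recall} P={precision} computed={computed:.2f} reported={reported}")
```

Because the inputs are rounded, a small discrepancy in the second decimal place is expected and does not indicate an error in the table.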
Official best and second best scores on the CG and PC tasks.
| Task | System | Recall | Precision | F-Score (%) |
|---|---|---|---|---|
| CG | EventMine | | 55.82 | 52.09 |
| | TEES-2.1 | 48.76 | | |
| | NCBI | 38.28 | 58.84 | 46.38 |
| | RelAgent | 41.73 | 49.58 | 45.32 |
| PC | EventMine | | 53.48 | |
| | TEES-2.1 | 47.15 | | 51.10 |
Highest scores are shown in bold.
Recall / Precision / F-scores for event categories on the CG and PC tasks.
| Task | Category | EventMine Recall | EventMine Precision | EventMine F-score | TEES-2.1 Recall | TEES-2.1 Precision | TEES-2.1 F-score |
|---|---|---|---|---|---|---|---|
| CG | ANATOMY | 69.43 | 73.28 | 71.31 | 73.11 | 81.79 | |
| | PATHOL | 56.51 | 63.44 | 59.78 | 61.69 | 74.54 | |
| | MOLECUL | 72.03 | 73.53 | | 67.33 | 78.76 | 72.60 |
| | GENERAL | 48.74 | 58.26 | | 44.72 | 62.68 | 52.20 |
| | REGULAT | 37.00 | 43.02 | 39.79 | 37.17 | 51.21 | |
| | PLANNED | 40.05 | 40.98 | | 34.78 | 45.51 | 39.43 |
| | MOD | 22.85 | 43.44 | 29.95 | 24.89 | 57.07 | |
| PC | SIMPLE | 66.42 | 64.80 | | 60.40 | 67.87 | 63.92 |
| | NON-REG | 69.07 | 62.69 | | 61.16 | 65.74 | 63.37 |
| | REGULAT | 37.73 | 42.79 | | 35.17 | 44.76 | 39.39 |
| | MOD | 23.56 | 34.65 | 28.05 | 22.41 | 40.00 | |
Highest F-scores are shown in bold. We refer the reader to the papers of the tasks [1] for the details of the event categories.
Effect of the weighting and covariate shift methods on the development data sets.
| Task | Weighting | Covariate shift | Recall | Precision | F-score (%) |
|---|---|---|---|---|---|
| CG | × | × | 40.70 | 62.19 | 49.20 |
| | ✓ | × | 50.11 | 50.68 | 50.39 |
| | × | ✓ | 44.42 | 59.78 | 50.96 |
| | ✓ | ✓ | 48.83 | 54.10 | 51.33 |
| PC | × | × | 37.89 | 61.26 | 46.82 |
| | ✓ | × | 44.23 | 49.18 | 46.57 |
| | × | ✓ | 40.65 | 55.81 | 47.04 |
| | ✓ | ✓ | 42.73 | 52.13 | 46.97 |
Effect of the weighting (W) and covariate shift (CS) methods on the test data sets.
| Task | Category | -W -CS Recall | -W -CS Precision | -W -CS F-score (%) | +W +CS Recall | +W +CS Precision | +W +CS F-score (%) |
|---|---|---|---|---|---|---|---|
| CG | ANATOMY | 70.17 | 80.39 | 74.93 | 74.37 | 72.76 | 73.56 |
| | PATHOL | 60.54 | 75.96 | 67.38 | 67.05 | 67.31 | 67.18 |
| | MOLECUL | 58.75 | 81.24 | 68.19 | 73.56 | 72.81 | 73.18 |
| | GENERAL | 38.69 | 65.25 | 48.58 | 47.24 | 52.37 | 49.67 |
| | REGULAT | 28.52 | 52.72 | 37.02 | 38.35 | 41.36 | 39.80 |
| | PLANNED | 33.41 | 51.41 | 40.50 | 46.45 | 40.20 | 43.10 |
| | MOD | 9.28 | 67.24 | 16.30 | 26.92 | 45.93 | 33.95 |
| | TOTAL | 41.88 | 67.18 | 51.59 | 51.96 | 54.77 | 53.33 |
| PC | SIMPLE | 58.52 | 76.78 | 66.42 | 66.53 | 68.05 | 67.28 |
| | NON-REG | 60.27 | 72.65 | 65.88 | 67.93 | 62.83 | 65.28 |
| | REGULAT | 25.94 | 53.69 | 34.98 | 36.28 | 43.34 | 39.49 |
| | MOD | 5.75 | 38.46 | 10.00 | 22.41 | 39.53 | 28.61 |
| | TOTAL | 41.62 | 65.44 | 50.88 | 50.93 | 54.10 | 52.47 |
Stacking is not used for the PC task.