| Literature DB >> 22759459 |
Andreas Vlachos1, Mark Craven.
Abstract
BACKGROUND: Biomedical event extraction has attracted substantial attention as it can assist researchers in understanding the plethora of interactions among genes that are described in publications in molecular biology. While most recent work has focused on abstracts, the BioNLP 2011 shared task evaluated the submitted systems on both abstracts and full papers. In this article, we describe our submission to the shared task which decomposes event extraction into a set of classification tasks that can be learned either independently or jointly using the search-based structured prediction framework. Our intention is to explore how these two learning paradigms compare in the context of the shared task.Entities:
Mesh:
Year: 2012 PMID: 22759459 PMCID: PMC3384253 DOI: 10.1186/1471-2105-13-S11-S5
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The stages of our biomedical event extraction system.
Figure 2Prediction (top of each panel) and cost sensitive examples for trigger recognition actions (bottom of each panel) in the first two SEARN iterations.
R(ecall)/P(recision)/F(-score) on the development dataset.
| SEARN (R/P/F) | ||||||
|---|---|---|---|---|---|---|
| trigger | 52.82 | 66.76 | 58.98 | 83.65 | 29.78 | 43.92 |
| 46.23 | 79.03 | 58.34 | 63.63 | 71.82 | 67.48 | |
| 15.16 | 58.49 | 24.08 | 31.79 | 49.06 | 38.57 | |
| Event | 35.68 | 69.39 | 47.12 | 49.15 | 59.60 | 53.87 |
Each row reports the results for each stage of the event extraction decomposition, with the last row containing the overall event extraction performance.
Detailed results on the development data using independently learned classifiers and SEARN
| SEARN | ||||||
|---|---|---|---|---|---|---|
| 67.02 | 85.20 | 75.03 | 73.43 | 78.88 | 76.06 | |
| 34.81 | 90.16 | 50.23 | 48.73 | 70.00 | 57.46 | |
| 69.57 | 94.12 | 80.00 | 69.57 | 76.19 | 72.73 | |
| 71.17 | 86.81 | 78.22 | 81.08 | 90.91 | 85.71 | |
| 62.69 | 80.77 | 70.59 | 71.64 | 77.42 | 74.42 | |
| 62.64 | 85.66 | 72.36 | 70.49 | 78.95 | 74.48 | |
| 28.42 | 63.10 | 39.19 | 40.21 | 62.24 | 48.86 | |
| 54.02 | 81.78 | 65.06 | 62.86 | 75.67 | 68.67 | |
| 14.04 | 44.57 | 21.35 | 32.19 | 37.90 | 34.81 | |
| 22.02 | 55.00 | 31.45 | 40.24 | 46.42 | 43.11 | |
| 20.38 | 48.73 | 28.74 | 35.46 | 50.61 | 41.70 | |
| 20.26 | 51.81 | 29.13 | 37.63 | 45.91 | 41.36 | |
| TOTAL | 35.68 | 69.39 | 47.12 | 49.15 | 59.60 | 53.87 |
Figure 3Recall-precision points resulting from different parameter values for independent and SEARN.
Figure 4Recall-precision points obtained for SEARN using different weights for false positives and false negatives.
Detailed results on the test data using SEARN with domain adaptation
| Domain | abstracts+full | abstracts | full papers | ||
|---|---|---|---|---|---|
| 69.46 | 80.65 | 74.64 | 72.63 | 79.35 | |
| 44.83 | 62.40 | 52.17 | 53.11 | 48.28 | |
| 73.33 | 42.31 | 53.66 | 70.97 | 0.00 | |
| 81.62 | 88.82 | 85.07 | 83.08 | 90.53 | |
| 45.03 | 88.66 | 59.72 | 61.72 | 43.75 | |
| 65.22 | 79.78 | 71.77 | 70.32 | 75.80 | |
| 38.09 | 57.36 | 45.78 | 46.67 | 43.72 | |
| 58.75 | 75.23 | 65.98 | 65.27 | 67.87 | |
| 32.99 | 41.50 | 36.76 | 38.13 | 32.77 | |
| 40.82 | 51.62 | 45.59 | 44.69 | 47.36 | |
| 38.35 | 43.89 | 40.93 | 43.80 | 35.64 | |
| 38.97 | 48.05 | 43.04 | 43.33 | 42.45 | |
| TOTAL | 48.10 | 60.34 | 53.53 | 53.79 | 52.93 |