| Literature DB >> 27637866 |
Seung-Cheol Baek1,2, Jong C Park3.
Abstract
BACKGROUND: Current state-of-the-art approaches to biological event extraction train statistical models in a supervised manner on corpora annotated with event triggers and event-argument relations. Inspecting such corpora, we observe that there is ambiguity in the span of event triggers (e.g., "transcriptional activity" vs. 'transcriptional'), leading to inconsistencies across event trigger annotations. Such inconsistencies make it quite likely that similar phrases are annotated with different spans of event triggers, suggesting the possibility that a statistical learning algorithm misses an opportunity for generalizing from such event triggers.Entities:
Mesh:
Year: 2016 PMID: 27637866 PMCID: PMC5026771 DOI: 10.1186/s13326-016-0094-9
Source DB: PubMed Journal: J Biomed Semantics
Fig. 1Dependency Graphs of Example Sentences. The graphs are basic Stanford dependency analyses by the Charniak-Johnson parser with a self-trained biomedical parsing model. In (1) and (4), dashed arrows indicate inferred dependency relations based on conjunctions. In (3), dashed arrows indicate that corresponding dependency relations are naturally expected dependency relations, but are missed in the analysis generated by the parser
Fig. 2Event Graph with a Loop
Fig. 3Baseline algorithm
Fig. 4Informed EM algorithms
Fig. 5Comparison Between Multi-Label and Single-Label Statistical Models. Each point (x,y) indicates that a model trained by taking x rounds has an F-score of y. These models are trained using the baseline algorithm
Performance of multi-label and single-label statistical models. These models are trained using the baseline algorithm
| Single-label (R/P/F) | Multi-label (R/P/F) | |
|---|---|---|
| BEST | 46.8/67.0/55.1 | 47.3/67.7/55.7 |
| AVG. | 46.2/66.6/54.6 | 46.6/67.1/55.0 |
| (STD.) | (0.36/0.41/0.32) | (0.23/0.21/0.30) |
Fig. 6Comparison Between the Baseline and Pure EM Algorithm. Each point (x,y) indicates that a model trained by taking x rounds has an F-score of y
Best performance of informed EM models
|
|
|
|
|---|---|---|
| Without NOC | ||
| 0.1 | 48.0/68.2/56.3 | 47.6/68.3/56.1 |
| 0.2 | 47.6/68.6/56.2 | 47.4/68.5/56.0 |
| 0.3 | 47.7/68.8/56.3 | 47.3/67.5/55.7 |
| 0.4 | 47.1/67.8/55.6 | 47.6/67.7/55.9 |
| With NOC | ||
| 0.1 | 47.3/68.9/56.1 | 47.5/68.1/55.9 |
| 0.2 | 47.3/68.0/55.8 | 47.5/ |
| 0.3 |
| 47.2/68.1/55.8 |
| 0.4 | 46.8/68.9/55.8 | 47.3/67.7/55.7 |
The best figures are set in bold-face
Average performance of informed EM models
|
|
|
|
|---|---|---|
| Without NOC | ||
| 0.1 |
| 47.3/67.7/55.7 |
| (0.27/0.56/0.31) | (0.22/0.30/0.23) | |
| 0.2 | 47.1/68.0/55.7 | 47.1/68.1/55.7 |
| (0.35/0.86/0.42) | (0.22/0.21/0.16) | |
| 0.3 | 47.4/67.9/55.8 | 47.3/66.8/55.4 |
| (0.18/0.39/0.23) | (0.13/0.22/0.13) | |
| 0.4 | 46.7/67.5/55.2 | 47.0/67.7/55.5 |
| (0.38/0.52/0.21) | (0.35/0.23/0.30) | |
| With NOC | ||
| 0.1 | 46.9/68.0/55.5 | 47.1/67.6/55.5 |
| (0.23/0.39/0.26) | (0.15/0.23/0.16) | |
| 0.2 | 47.1/67.6/55.5 | 47.2/68.3/55.8 |
| (0.22/0.29/0.20) | (0.22/0.65/0.35) | |
| 0.3 | 47.6/68.0/ | 47.0/67.1/55.3 |
| (0.38/0.45/0.40) | (0.27/0.36/0.29) | |
| 0.4 | 46.5/ | 47.1/67.6/55.5 |
| (0.33/0.72/0.42) | (0.24/0.39/0.22) | |
The best figures are set in bold-face and the sample standard deviations are bracketed
p-values for informed EM models
|
|
|
|
|---|---|---|
| 0.1 | 3.32E-09/1.86E-04 | 1.03E-06/4.47E-06 |
| 0.2 | 9.98E-07/1.21E-08 | 3.58E-09/1.05E-08 |
| 0.3 |
| 4.38E-06/2.95E-03 |
| 0.4 | 4.37E-02/1.19E-04 | 2.50E-08/6.70E-07 |
The best figures are set in bold-face
Updated graphs for informed EM models
|
|
|
|
|---|---|---|
| 0.1 | 72/47 | 98/50 |
| 0.2 | 34/18 | 46/31 |
| 0.3 | 16/11 | 25/15 |
| 0.4 | 9/8 | 9/5 |
Distribution of types of 16 updates
| Description | Count |
|---|---|
| Adding events similar to existing ones | 7 |
| Adding missing but reasonable events | 4 |
| Shifting the mark of anchor words | 2 |
| Removing duplicated and inferred events | 2 |
| Wrongly adding an incorrect event | 1 |
| Total | 16 |