| Literature DB >> 26551594 |
Haibin Liu, Karin Verspoor, Donald C Comeau, Andrew D MacKinlay, W Wilbur.
Abstract
IN BIONLP-ST 2013: We participated in the BioNLP 2013 shared tasks on event extraction. Our extraction method is based on the search for an approximate subgraph isomorphism between key context dependencies of events and graphs of input sentences. Our system was able to address both the GENIA (GE) task focusing on 13 molecular biology related event types and the Cancer Genetics (CG) task targeting a challenging group of 40 cancer biology related event types with varying arguments concerning 18 kinds of biological entities. In addition to adapting our system to the two tasks, we also attempted to integrate semantics into the graph matching scheme using a distributional similarity model for more events, and evaluated the event extraction impact of using paths of all possible lengths as key context dependencies beyond using only the shortest paths in our system. We achieved a 46.38% F-score in the CG task (ranking 3rd) and a 48.93% F-score in the GE task (ranking 4th). AFTER BIONLP-ST 2013: We explored three ways to further extend our event extraction system in our previously published work: (1) We allow non-essential nodes to be skipped, and incorporated a node skipping penalty into the subgraph distance function of our approximate subgraph matching algorithm. (2) Instead of assigning a unified subgraph distance threshold to all patterns of an event type, we learned a customized threshold for each pattern. (3) We implemented the well-known Empirical Risk Minimization (ERM) principle to optimize the event pattern set by balancing prediction errors on training data against regularization. When evaluated on the official GE task test data, these extensions help to improve the extraction precision from 62% to 65%. However, the overall F-score stays equivalent to the previous performance due to a 1% drop in recall.Entities:
Mesh:
Year: 2015 PMID: 26551594 PMCID: PMC4642081 DOI: 10.1186/1471-2105-16-S16-S2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1ASM-based Event Extraction Framework.
Figure 2Event Pattern Induction Example.
Event pattern representation.
| Pattern ID | Pattern Description | Graph Representation | |||
|---|---|---|---|---|---|
| E1a | Pos. reg. | lead-20/VBP | Phosphorylation: phosphorylation-23/NN | Binding: ligation-6/NN | nsubj(lead-20/VBP, ligation-6/NN) prep_to(lead-20/VBP, phosphorylation-23/NN) |
| E1b | Pos. reg. | lead-20/VBP | Phosphorylation: phosphorylation-23/NN | Binding: ligation-6/NN | rcmod(ligation-6/NN, lead-20/VBP) prep_to(lead-20/VBP, phosphorylation-23/NN) |
| E1c | Pos. reg. | lead-20/VBP | Phosphorylation: phosphorylation-23/NN | prep_to(lead-20/VBP, phosphorylation-23/NN) | |
| E1d | Pos. reg. | lead-20/VBP | Binding: ligation-6/NN | nsubj(lead-20/VBP, ligation-6/NN) | |
| E1e | Pos. reg. | lead-20/VBP | Binding: ligation-6/NN | rcmod(ligation-6/NN, lead-20/VBP) | |
Figure 3ASM-based Event Extraction.
Figure 4Iterative Bottom-up Event Extraction Example.
ASM parameter setting in the 2013 GE task.
| Parameter | Value | Parameter | Value |
|---|---|---|---|
| 8 | 3 | ||
| 7 | 7 | ||
| 10 | 3 | ||
| 8 | 3 | ||
| 8 | 3 | ||
| 3 | 10 | ||
| 3 | 10 | ||
| 3 | 10 |
Statistics of BioNLP-ST 2013 GE dataset.
| Attributes Counted | Training | Development | Test |
|---|---|---|---|
| Full article segments | 222 | 249 | 305 |
| Proteins | 3,571 | 4,138 | 4,359 |
| Annotated events | 2,817 | 3,199 | 3,301(hidden) |
Performance using different parsers on the development set.
| Parser Type | Event pattern | Recall | Precision | F-score |
|---|---|---|---|---|
| Charniak | 2,923 | 47.01% | 66.01% | 54.91% |
| Stanford | 3,305 | 43.66% | 67.67% | 53.08% |
| Ensemble | 4,617 | 47.45% | 65.65% | 55.09% |
Performance of integrated DSM on development set.
| All Tokens | Recall | Precision | F-score |
|---|---|---|---|
| DSM 1 | 47.98% | 52.56% | 50.17% |
| DSM 3 | 48.68% | 35.07% | 40.77% |
| DSM 10 | 53.43% | 19.38% | 28.44% |
| DSM 1 | 48.06% | 54.22% | 50.95% |
| DSM 3 | 48.59% | 37.00% | 42.01% |
| DSM 10 | 53.35% | 24.65% | 33.72% |
Performance of using all-paths on development set.
| Path Type | Event Pattern | Recall | Precision | F-score |
|---|---|---|---|---|
| All-paths | 9,527 | 48.77% | 64.64% | 55.59% |
| Shortest paths | 4,617 | 47.45% | 65.65% | 55.09% |
Performance comparison on development set under various settings.
| System Setting | Event Pattern | Recall | Precision | F-score |
|---|---|---|---|---|
| 1 + 2 + 3 | 4,617 | 47.45% | 65.65% | 55.09% |
| 1 | 4,593 | 49.21% | 64.48% | 55.82% |
| 1 | 4,533 | 48.50% | 67.36% | 56.39% |
| 1 + 2 | 4,787 | 45.60% | 72.14% | 55.88% |
| 1 | 4,806 | 46.83% | 71.89% | 56.72% |
Performance comparison among top 8 systems in 2013 GE task.
| System | SVT | PTM | BIND | REG | TOTAL | ||
|---|---|---|---|---|---|---|---|
| EVEX | 76.59 | 65.37 | 42.88 | 38.41 | 45.44 | 58.03 | 50.97 |
| TEES 2.1 | 76.82 | 66.49 | 43.32 | 38.05 | 46.17 | 56.32 | 50.74 |
| BioSEM | 76.11 | 74.37 | 49.76 | 35.8 | 42.47 | 62.83 | 50.68 |
| 72.55 | 70.45 | 39.56 | 34.25 | 40.53 | 61.72 | 48.93 | |
| DlutNLP | 74.42 | 69.36 | 42.43 | 32.92 | 40.81 | 57 | 47.56 |
| HDS4NLP | 79.07 | 73.17 | 37.32 | 21.64 | 37.11 | 51.19 | 43.03 |
| NICTANLM | 64.66 | 53.64 | 31.61 | 29.63 | 36.99 | 50.68 | 42.77 |
| USheff | 64.86 | 55.68 | 37.7 | 30.18 | 31.69 | 63.28 | 42.23 |
Impact of 2011 data and ensemble pattern set in 2013 GE task.
| System Attribute | Recall | Precision | F-score |
|---|---|---|---|
| Ensemble 2013 + 2011 data | 40.53% | 61.72% | 48.93% |
| Ensemble 2013 data | 35.63% | 63.91% | 45.75% |
| Charniak 2013 data | 35.29% | 65.71% | 45.92% |
Performance comparison on GE test set under different settings.
| System Setting | SVT | PTM | BIND | REG | TOTAL | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 + 2 + 3 | 72.99 | 72.12 | 64.92 | 77.02 | 37.54 | 41.81 | 24.74 | 55.61 | 40.53 | 61.72 | 48.93 |
| 1' + 2 + 3' | 74.07 | 72.85 | 64.92 | 68.50 | 40.54 | 39.24 | 25.26 | 49.45 | 41.41 | 57.80 | 48.25 |
| 1' + 2'+ 3' | 68.55 | 81.57 | 59.69 | 77.03 | 31.23 | 49.76 | 26.18 | 53.69 | 39.32 | 64.74 | 48.93 |
Impact of 2011 and 2013 data on GE test set.
| Data Attribute | Recall | Precision | F-score |
|---|---|---|---|
| 2011 data | 35.60% | 66.09% | 46.27% |
| 2013 data | 35.90% | 66.02% | 46.51% |
Statistics of BioNLP-ST 2013 CG dataset.
| Attributes Counted | Training | Development | Test |
|---|---|---|---|
| Abstracts | 300 | 100 | 200 |
| Entities | 10,935 | 3,634 | 6,955 |
| Annotated events | 8,803 | 2,915 | 5,972 (hidden) |
Performance of all systems in 2013 CG task.
| Team | Recall | Precision | F-score |
|---|---|---|---|
| TEES-2.1 | 48.76% | 64.17% | 55.41% |
| NaCTeM | 48.83% | 55.82% | 52.09% |
| 38.28% | 58.84% | 46.38% | |
| RelAgent | 41.73% | 49.58% | 45.32% |
| UET-NII | 19.66% | 62.73% | 29.94% |
| ISI | 16.44% | 47.83% | 24.47% |