Halil Kilicoglu, Sabine Bergler.
Abstract
BACKGROUND: In recent years, biological event extraction has emerged as a key natural language processing task, aiming to address the information overload problem in accessing the molecular biology literature. The BioNLP shared task competitions have contributed considerably to this recent interest. The first competition (BioNLP'09) focused on extracting biological events from Medline abstracts in a narrow domain, while the theme of the latest competition (BioNLP-ST'11) was generalization, and a wider range of text types, event types, and subject domains was considered. We view event extraction as a building block in larger discourse interpretation and propose a two-phase, linguistically grounded, rule-based methodology. In the first phase, a general, underspecified semantic interpretation is composed from syntactic dependency relations in a bottom-up manner. The notion of embedding underpins this phase, which is informed by a trigger dictionary and argument identification rules. Coreference resolution is also performed at this step, allowing extraction of inter-sentential relations. The second phase constrains the resulting semantic interpretation according to the shared task specifications. We evaluated our general methodology on core biological event extraction and speculation/negation tasks in three main tracks of BioNLP-ST'11 (GENIA, EPI, and ID).
Year: 2012 PMID: 22759461 PMCID: PMC3384260 DOI: 10.1186/1471-2105-13-S11-S7
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
An overview of BioNLP-ST'11 tracks
| | GENIA | EPI | ID | BB | BI |
|---|---|---|---|---|---|
| Number of core events | 9 | 15 | 10 | 2 | 10 |
| Triggers annotated? | Y | Y | Y | N | N |
| Includes full-text? | Y | N | Y | N | N |
| Speculation/Negation? | Y | Y | Y | N | N |
Figure 1. The shared task pipeline. The biological event composition pipeline. The cylindrical boxes represent the resources used.
Figure 2. Atomic vs. embedding predications. Atomic and embedding predications extracted from the sentence "Stimulation of cells leads to a rapid phosphorylation of IκBα, which is presumed to be important for the subsequent degradation." from Medline abstract 7499266. The middle column in the predication rows shows the predications after the Composition phase, and the right column shows the event and event modification annotations after the Mapping phase. Note that e2 is not mapped to an event annotation, since its argument is semantically empty, whereas em6 is not mapped since its semantic type, TEMPORAL, is not relevant in the shared task context. The relevant syntactic dependency relations as well as the entities are illustrated.
Figure 3. Embedding predication categorization. Embedding predication categorization relevant to the shared task.
Embedding trigger dictionary entries
| Predicate | POS | Semantic Type | Polarity | Category Strength | Negative-raising |
|---|---|---|---|---|---|
| | VB | | | 1.0 | false |
| | JJ | | | 0.7 | false |
| | VB | | | 1.0 | false |
| | VB | | | 1.0 | false |
| | NN | | | 0.5 | false |
| | RB | | | 1.0 | false |
| | NN | | | 1.0 | false |
Several entries from the embedding trigger dictionary.
Application of intra-sentential transformation rules
| Fragment | Syntactic Dependencies | Embedding Relations |
|---|---|---|
Application of several intra-sentential transformation rules to the sentence fragments in the first column. The syntactic dependencies in the second column are the input to these rules and the embedding relations in the third column are the output.
Figure 4. An example embedding graph. A portion of the embedding graph associated with a Medline abstract (10089566). The sentence under consideration is "Our previous results show that recombinant gp41 (aa565-647), the extracellular domain of HIV-1 transmembrane glycoprotein, stimulates interleukin-10 (IL-10) production in human monocytes." Yellow circles represent surface elements bound to PROTEIN entities, green circles those bound to atomic predicates, and orange circles those bound to embedding predicates.
Argument identification rules
| Embedding Relation Type | POS | Inclusions | Exclusions | Argument Type |
|---|---|---|---|---|
| NN | - | Object | ||
| VB | - | - | Subject | |
| VB | - | - | Object | |
| VB | - | Object | ||
| NN | - | Adjunct | ||
Several argument identification rules. For a rule R:Q→A, where Q = 〈T,POS,IN,EX〉, column 1:T, column 2:POS, column 3: IN, column 4:EX, and column 5:A. Note that inclusion and exclusion constraints may apply to predicate categories, as well as to specific lemmas.
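The rule form R:Q→A with Q = 〈T,POS,IN,EX〉 can be illustrated with a small data structure. The following is only a sketch under assumptions, not the authors' implementation: the class `Rule`, the function `identify_argument`, the dependency labels (`agent`, `prep_of`), and the lemma `role` are hypothetical and chosen purely for illustration. An empty inclusion set is assumed to mean "no constraint", corresponding to the `-` entries in the table.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable, Optional


@dataclass(frozen=True)
class Rule:
    """One rule R: <T, POS, IN, EX> -> A (field names are illustrative)."""
    relation_type: str                        # T: embedding relation type
    pos: str                                  # POS of the predicate
    inclusions: FrozenSet[str] = frozenset()  # IN: empty means unconstrained
    exclusions: FrozenSet[str] = frozenset()  # EX: empty means nothing excluded
    argument_type: str = "Object"             # A: resulting logical argument


def identify_argument(rules: Iterable[Rule], relation_type: str, pos: str,
                      lemma: str, category: Optional[str] = None) -> Optional[str]:
    """Return the argument type of the first matching rule, or None."""
    # Constraints may name a predicate category or a specific lemma.
    keys = {lemma} | ({category} if category else set())
    for rule in rules:
        if rule.relation_type != relation_type or rule.pos != pos:
            continue
        if rule.inclusions and not (keys & rule.inclusions):
            continue
        if keys & rule.exclusions:
            continue
        return rule.argument_type
    return None


# Hypothetical rules; the labels and lemmas are NOT taken from the paper.
EXAMPLE_RULES = [
    Rule("agent", "VB", argument_type="Subject"),
    Rule("prep_of", "NN", exclusions=frozenset({"role"}), argument_type="Object"),
]

print(identify_argument(EXAMPLE_RULES, "agent", "VB", "induce"))
print(identify_argument(EXAMPLE_RULES, "prep_of", "NN", "role"))
```

Returning the first matching rule is a design assumption of this sketch; the source does not state how competing rules are prioritized.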
Polarity value composition
| Predicate polarity | Embedded polarity value | Composite polarity |
|---|---|---|
| neutral | positive | positive |
| neutral | negative | negative |
| negative | * | negative |
| positive | negative | negative |
| positive | * | positive |
The composition of polarity value of an embedding predication from polarity value of the predicate and embedded polarity value.
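The polarity table can be read as an ordered list of rules in which `*` matches any embedded polarity value. The sketch below assumes first-match semantics, so that the row `positive | negative` takes precedence over `positive | *`; the function name `compose_polarity` is hypothetical, and combinations the table does not list (for example neutral with neutral) are assumed to yield `None`.

```python
from typing import Optional

# Rows of the polarity composition table, in table order:
# (predicate polarity, embedded polarity value, composite polarity)
POLARITY_RULES = [
    ("neutral", "positive", "positive"),
    ("neutral", "negative", "negative"),
    ("negative", "*", "negative"),
    ("positive", "negative", "negative"),
    ("positive", "*", "positive"),
]


def compose_polarity(predicate: str, embedded: str) -> Optional[str]:
    """Return the composite polarity from the first matching table row."""
    for rule_predicate, rule_embedded, composite in POLARITY_RULES:
        if rule_predicate == predicate and rule_embedded in ("*", embedded):
            return composite
    # Assumption: combinations absent from the table are left undefined.
    return None


print(compose_polarity("positive", "negative"))
print(compose_polarity("negative", "positive"))
```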
Mapping from embedding predications to events
| Track | Predication Type | Polarity | Modality Value | Corresponding Event (Modification) Type |
|---|---|---|---|---|
| GENIA,ID | neutral | - | ||
| GENIA,ID,EPI | negative | - | ||
| EPI | positive | - | ||
| GENIA,ID,EPI | - | |||
| GENIA,ID,EPI | negative | - | ||
Constraints used in mapping from embedding predication types to event and event modification types.
Mapping logical arguments to semantic roles
| Logical Argument | Constrained To | Exclusions | Semantic Role |
|---|---|---|---|
| Object | - | Theme | |
| Subject | - | Cause | |
| Subject | - | Theme | |
| Object | - | Participant | |
| Object | - | Scope | |
Logical argument to semantic role mappings.
Official GENIA track results
| Event Class | Recall | Precision | F1-score | Rank |
|---|---|---|---|---|
| Localization | 39.27 | 90.36 | 54.74 | 7 |
| Binding | 29.33 | 49.66 | 36.88 | 7 |
| Gene_expression | 65.87 | 86.84 | 74.91 | 5 |
| Transcription | 32.18 | 58.95 | 41.64 | 9 |
| Protein_catabolism | 66.67 | 71.43 | 68.97 | 2 |
| Phosphorylation | 75.14 | 94.56 | 83.73 | 4 |
| EVT-TOTAL | 52.67 | 78.04 | 62.90 | 6 |
| Regulation | 33.77 | 42.48 | 37.63 | 3 |
| Positive_regulation | 35.97 | 47.66 | 41.00 | 7 |
| Negative_regulation | 36.43 | 43.88 | 39.81 | 5 |
| REG-TOTAL | 35.72 | 45.85 | 40.16 | 5 |
| Negation | 18.77 | 44.26 | 26.36 | 2 |
| Speculation | 21.10 | 38.46 | 27.25 | 1 |
| MOD-TOTAL | 19.97 | 40.89 | 26.83 | 2 |
| ALL-TOTAL | 43.55 | 59.58 | 50.32 | 5 |
Official GENIA track results, with the approximate span matching/approximate recursive matching evaluation criteria.
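The F1-scores in the table are consistent with the standard definition of F1 as the harmonic mean of recall and precision. A minimal check, assuming that definition, is sketched below; the function name `f1_score` is illustrative. Because the reported recall and precision are themselves rounded, a recomputed value may differ from the reported one in the last digit.

```python
def f1_score(recall: float, precision: float) -> float:
    """Harmonic mean of recall and precision, on the same scale as the inputs."""
    if recall + precision == 0:
        return 0.0
    return 2 * recall * precision / (recall + precision)


# (recall, precision, reported F1) copied from the table above, in percent.
REPORTED = {
    "Speculation": (21.10, 38.46, 27.25),
    "ALL-TOTAL": (43.55, 59.58, 50.32),
}

for name, (recall, precision, reported_f1) in REPORTED.items():
    computed = f1_score(recall, precision)
    print(f"{name}: computed {computed:.2f}, reported {reported_f1:.2f}")
```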
Official EPI and ID track results
| Track-Eval. Type | Recall | Precision | F1-score | Rank |
|---|---|---|---|---|
| EPI-FULL | 20.83 | 42.14 | 27.88 | 7 |
| EPI-CORE | 40.28 | 76.71 | 52.83 | 6 |
| ID-FULL | 49.00 | 40.27 | 44.21 | 4 |
| ID-CORE | 50.91 | 43.37 | 46.84 | 4 |
| ID-FULL-T | 45.26 | 53.18 | 48.90 | 4 |
| ID-CORE-T | 46.75 | 56.94 | 51.34 | 4 |
Official evaluation results for EPI and ID tracks. The FULL rows correspond to the primary evaluation criteria. ID-FULL-T and ID-CORE-T refer to the post-shared task scenario where ID triggers are drawn only from ID training data.
Coreference resolution on test sets
| System | Recall | Precision | F1-score |
|---|---|---|---|
| GENIA | 43.55 | 59.58 | 50.32 |
| GENIA + COREF | 44.45 | 58.92 | 50.67 |
| - Abstracts | 44.31 | 59.82 | 50.91 |
| - Full-text | 44.78 | 56.82 | 50.09 |
| EPI | 20.83 | 42.14 | 27.88 |
| EPI + COREF | 21.48 | 40.63 | 28.10 |
| ID | 49.00 | 40.27 | 44.21 |
| ID + COREF | 49.97 | 38.81 | 43.69 |
| ID-T | 45.26 | 53.18 | 48.90 |
| ID-T + COREF | 46.37 | 50.95 | 48.55 |
Event extraction performances after coreference resolution with the primary evaluation criteria.
GENIA Task 3 results based on gold event annotations
| Event Modification Type | Recall | Precision | F1-score |
|---|---|---|---|
| Negation | 49.31 (18.77) | 87.70 (44.26) | 63.13 (26.36) |
| Speculation | 65.70 (21.10) | 73.27 (38.46) | 69.28 (27.25) |
| MOD-TOTAL | 57.95 (19.97) | 78.47 (40.89) | 66.67 (26.83) |
Task 3 results when gold standard event annotations are provided to the system. Official results are duplicated in parentheses for reference.
Coreference resolution on GENIA development set
| System | Recall | Precision | F1-score |
|---|---|---|---|
| Base | 46.32 | 56.81 | 51.03 |
| Base + RELAT | 46.57 | 56.52 | 51.06 |
| Base + APPOS | 47.07 | 56.40 | 51.32 |
| Base + PRON | 46.76 | 56.28 | 51.08 |
| Base + DNP | 46.85 | 56.26 | 51.13 |
| Base + ALL | 47.98 | 55.77 | 51.62 |
Effect of different types of coreference resolution on event extraction performance on GENIA development set with the approximate span matching/approximate recursive matching evaluation criteria.