| Literature DB >> 32667950 |
Wenzhi Huang1,2, Junchi Zhang2, Donghong Ji1.
Abstract
Chinese information extraction is traditionally performed in the process of word segmentation, entity recognition, relation extraction and event detection. This pipelined approach suffers from two limitations: 1) It is prone to introduce propagated errors from upstream tasks to subsequent applications; 2) Mutual benefits of cross-task dependencies are hard to be introduced in non-overlapping models. To address these two challenges, we propose a novel transition-based model that jointly performs entity recognition, relation extraction and event detection as a single task. In addition, we incorporate subword-level information into character sequence with the use of a hybrid lattice structure, removing the reliance of external word tokenizers. Results on standard ACE benchmarks show the benefits of the proposed joint model and lattice network, which gives the best result in the literature.Entities:
Mesh:
Year: 2020 PMID: 32667950 PMCID: PMC7363078 DOI: 10.1371/journal.pone.0235796
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Example sentence from ACE05 dataset.
Preconditions of transition actions.
| Transitions | Preconditions of transition actions |
|---|---|
| LEFT-* | (λ ≠ |
| RIGHT-* | (λ ≠ |
| SHIFT | (λ ≠ |
| DUAL-SHIFT | (λ ≠ |
| DELETE | (∃ |
| ELEMENT-SHIFT | (λ = |
| ELEMENT-GEN | (λ = |
| ELEMENT-BACK | (λ = |
Fig 2Our encoder-decoder model for joint entities, relations and events extraction.
The influence of lattice LSTM results on the ACE2005 test set.
| Models | Entity Recognition | Relation Classification | ||||
|---|---|---|---|---|---|---|
| P | R | F1 | P | R | F1 | |
| Transition | 75.3 | 71.0 | 73.1 | 55.7 | 39.0 | 45.3 |
| Transition—BERT | 74.5 | 68.6 | 71.5 | 44.8 | 39.8 | 42.2 |
| Transition + Bigram | 78.1 | 70.7 | 74.2 | 58.0 | 41.4 | 45.5 |
| Transition + Lattice (Word) | 83.5 | 79.2 | 81.3 | 55.3 | 48.2 | 49.9 |
| Transition + Lattice (Subword) | 84.2 | 79.3 | 56.5 | 48.8 | ||
Joint model and pipeline model results on the ACE2005 test set.
| Models | Entity Recognition | Relation Classification | ||||
|---|---|---|---|---|---|---|
| P | R | F1 | P | R | F1 | |
| Lattice-transition-pipeline | 81.0 | 78.3 | 79.6 | 44.2 | 51.4 | 47.2 |
| Lattice-transition-joint | 84.2 | 79.3 | 56.5 | 48.8 | ||
Entity and relation results compared to previous systems on the ACE2005 test set.
| Models | Entity Recognition | Relation Classification | ||||
|---|---|---|---|---|---|---|
| P | R | F1 | P | R | F1 | |
| Word-Tree-Structure | 79.0 | 67.2 | 72.6 | 43.6 | 35.7 | 39.3 |
| Char-BERT-pipeline | 77.6 | 73.1 | 75.3 | 45.0 | 42.3 | 43.6 |
| Lattice-transition-joint | 84.2 | 79.3 | 56.5 | 48.8 | ||
Event trigger and argument role results on ACE2005 test set.
| Models | Event Trigger Detection | Argument Role Classification | ||||
|---|---|---|---|---|---|---|
| P | R | F1 | P | R | F1 | |
| Word-Tree-Structure | 67.7 | 58.3 | 62.6 | 49.2 | 39.8 | 44.5 |
| Char-BERT-pipeline | 62.3 | 68.9 | 65.4 | 48.5 | 51.5 | 50.0 |
| Lattice-transition-pipeline | 62.3 | 72.2 | 66.9 | 52.8 | 52.0 | 52.4 |
| Lattice-transition-joint | 65.5 | 72.5 | 56.2 | 59.2 | ||