| Literature DB >> 36166421 |
Wenzhi Huang1,2, Junchi Zhang2, Donghong Ji1.
Abstract
The task of event extraction consists of three subtasks namely entity recognition, trigger identification and argument role classification. Recent work tackles these subtasks jointly with the method of multi-task learning for better extraction performance. Despite being effective, existing attempts typically treat labels of event subtasks as uninformative and independent one-hot vectors, ignoring the potential loss of useful label information, thereby making it difficult for these models to incorporate interactive features on the label level. In this paper, we propose a joint label space framework to improve Chinese event extraction. Specifically, the model converts labels of all subtasks into a dense matrix, giving each Chinese character a shared label distribution via an incrementally refined attention mechanism. Then the learned label embeddings are also used as the weight of the output layer for each subtask, hence adjusted along with model training. In addition, we incorporate the word lexicon into the character representation in a soft probabilistic manner, hence alleviating the impact of word segmentation errors. Extensive experiments on Chinese and English benchmarks demonstrate that our model outperforms state-of-the-art methods.Entities:
Mesh:
Year: 2022 PMID: 36166421 PMCID: PMC9514653 DOI: 10.1371/journal.pone.0272353
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1Illustration of comparison of existing methods and our proposed method for an input Chinese sentence “军警两名士兵丧生(Two military soldiers were killed)”.
Fig 2Illustration of our multi-task framework for Chinese event extraction model.
Comparison results on ACE05-CN.
| Model | ACE05-CN | ||
|---|---|---|---|
| Entity | Trig-C | Arg-C | |
| Word-Tree-Joint | 81.2 | 58.4 | 39.5 |
| Word-NP-pipeline | 78.5 | 59.1 | 42.4 |
| Char-GRU-Joint | 83.4 | 59.6 | 45.6 |
| Char-BERT-pipeline | 87.2 | 61.6 | 45.6 |
| Char-Span-Joint | 87.8 | 62.7 | 46.7 |
| Char-Global-Joint | 88.5 | 65.6 | 52.0 |
| Transition-Joint | 88.0 | 63.4 | 47.3 |
| Lattice | 87.7 | 62.4 | 50.8 |
| SoftLexicon | 88.5 | 63.3 | 51.2 |
| MLAEE | 88.6 | 65.8 | 54.4 |
| MLAEE+REL |
|
|
|
* indicates relation annotations are used.
† and ‡ indicate statistical significance compared to Char-Global-Joint with p < 0.01 and p < 0.05, respectively.
Comparison results on ACE05-EN.
| Model | ACE05-EN | ||
|---|---|---|---|
| Entity | Trig-C | Arg-C | |
| Word-Tree-Joint | - | 69.6 | 50.1 |
| Char-GRU-Joint | 81.2 | 69.8 | 52.1 |
| Char-Span-Joint | 89.7 | 69.7 | 48.8 |
| Char-Global-Joint |
| 74.7 | 56.8 |
| Word-Transition-Joint | 88.1 | 73.8 | 55.3 |
| MLAEE | 89.3 | 74.2 | 55.9 |
| MLAEE+REL | 90.0 |
|
|
* indicates relation annotations are used.
† and ‡ indicate statistical significance compared to Char-Global-Joint with p < 0.01 and p < 0.05, respectively.
Ablation tests on ACE-CN.
| Settings | Trig-C | Arg-C | Entity |
|---|---|---|---|
| MLAEE | 65.8 | 54.4 | 88.6 |
| -SoftLexicon | 62.5 | 50.8 | 86.7 |
| -Label embedding | 63.3 | 51.2 | 87.3 |
| -BiLSTM | 64.3 | 52.6 | 87.8 |
| -BERT embedding | 61.2 | 48.6 | 85.2 |
† and ‡ indicate statistical significance compared to MLAEE with p < 0.01 and p < 0.05, respectively.
Fig 3t-SNE plot of joint label embeddings of entities, triggers and argument roles with varying numbers of training epochs.
(a) 3 epochs, (b) 10 epochs, and (c) 40 epochs.
Event prediction made by different models.
Gold C indicates the standard annotation. Words in bold and italics are correct triggers and arguments, respectively, while the underlined ones are incorrect.
| Gold | 信中明白的指出, 因为被警方通缉需要钱, 否则就要 |
| Char-Global-Joint: | 信中明白的指出, 因为被警方通缉需要钱, 否则就要 |
| MLAEE: | 信中明白的指出, 因为被警方通缉需要钱, 否则就要 |
| Gold | 但是会向 |
| Char-Global-Joint: | 但是会 |
| MLAEE: | 但是会向 |