| Literature DB >> 32028883 |
Lvxing Zhu1, Haoran Zheng2,3,4.
Abstract
BACKGROUND: Biomedical event extraction is a fundamental and in-demand technology that has attracted substantial interest from many researchers. Previous works have heavily relied on manual designed features and external NLP packages in which the feature engineering is large and complex. Additionally, most of the existing works use the pipeline process that breaks down a task into simple sub-tasks but ignores the interaction between them. To overcome these limitations, we propose a novel event combination strategy based on hybrid deep neural networks to settle the task in a joint end-to-end manner.Entities:
Keywords: Biomedical text; Deep learning Neural network; Event extraction
Mesh:
Year: 2020 PMID: 32028883 PMCID: PMC7006190 DOI: 10.1186/s12859-020-3376-2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1A sentence with visualized events, provided by BioNLP-ST2013
Statistics of datasets
| Dataset | Entity type | Entity | Event type | Event | Word | Document (training) | Document (development) | Document (test) |
|---|---|---|---|---|---|---|---|---|
| CG | 18 | 21683 | 40 | 17248 | 129878 | 300 | 100 | 200 |
| PC | 4 | 15901 | 23 | 12125 | 108356 | 260 | 90 | 175 |
| MLEE | 14 | 8291 | 28 | 6677 | 56588 | 131 | 44 | 87 |
Fig. 2The sequence of training loss within 100 epochs
Comparison of overall performance on CG, PC and MLEE task (test set)
| CG | PC | MLEE | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Methods | Recall | Precision | F1 Scores | Recall | Precision | F1 Scores | Recall | Precision | F1 Scores |
| RelAgent [ | 41.73 | 49.58 | 45.32 | - | - | - | - | - | - |
| NCBI [ | 38.28 | 58.84 | 46.38 | - | - | - | - | - | - |
| Zhou and Zhong [ | - | - | - | - | - | - | 59.19 | 55.76 | 57.41 |
| TEES [ | 48.76 | 64.17 | 55.41 | 47.15 | 55.78 | 51.10 | - | - | - |
| EventMine [ | 48.83 | 55.82 | 52.09 | 52.23 | 53.48 | 52.84 | 49.56 | 62.28 | 55.20 |
| Wang et al. [ | - | - | - | - | - | - | 56.23 | 60.65 | 58.31 |
| Li et al. [ | - | - | - | - | - | - | 53.61 | 67.23 | 59.65 |
| TEES CNN [ | 50.77 | 66.55 | 57.60 | 50.34 | 62.16 | 55.62 | - | - | - |
| Proposed | 51.91 | 65.81 | 58.04 | 50.65 | 61.95 | 55.73 | 55.02 | 66.08 | 60.05 |
Detailed performance comparison on CG
| Event Class | TEES | EventMine | NCBI | RelAgent | Proposed |
|---|---|---|---|---|---|
| Anatomical | 77.20 | 71.31 | 73.68 | 70.82 | 79.78 |
| Pathological | 67.51 | 59.78 | 54.19 | 48.14 | 68.46 |
| Molecular | 72.60 | 72.77 | 67.33 | 60.72 | 73.16 |
| General | 52.20 | 53.08 | 44.70 | 40.89 | 59.45 |
| Regulation | 43.08 | 39.79 | 29.21 | 35.58 | 44.31 |
| Planned Pro | 39.43 | 40.51 | 34.28 | 28.57 | 48.15 |
| Modification | 34.66 | 29.95 | 0.00 | 30.88 | 39.67 |
Performance comparison across variations of our method on development set
| CG | PC | MLEE | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Methods | Recall | Precision | F1 Scores | Recall | Precision | F1 Scores | Recall | Precision | F1 Scores |
| Proposed | 51.29 | 63.34 | 56.68 | 49.32 | 59.90 | 54.10 | 53.53 | 62.34 | 57.60 |
| Single-model | 48.34 | 55.73 | 51.77 | 48.43 | 52.32 | 50.30 | 49.96 | 55.64 | 52.65 |
| Single-pipeline-model | 47.68 | 54.80 | 51.00 | 47.49 | 52.95 | 50.07 | 48.43 | 53.73 | 50.94 |
| Combination-rule-single | 50.50 | 57.46 | 53.75 | 46.92 | 55.94 | 51.04 | 48.09 | 55.28 | 51.43 |
| Combination-rule-all | 54.51 | 52.05 | 53.25 | 50.21 | 47.69 | 48.92 | 52.51 | 48.77 | 50.57 |
| EE-probability | 49.16 | 64.22 | 55.69 | 46.74 | 60.60 | 52.77 | 48.43 | 63.72 | 55.03 |
| Zero-threshold | 50.74 | 62.24 | 55.90 | 48.90 | 57.06 | 52.66 | 53.02 | 62.30 | 57.29 |
| Without-CharCNN | 50.57 | 62.18 | 55.78 | 46.08 | 55.08 | 50.18 | 51.57 | 63.52 | 56.93 |
Statistics for the extraction errors in CG/PC/MLEE
| Corpus | Wrong T_Label | Wrong T_Span | Wrong Argu | Redundant Argu | Other Error | Total Error |
|---|---|---|---|---|---|---|
| CG | 5.09% | 12.11% | 9.57% | 5.65% | 3.25% | 35.67% |
| PC | 4.75% | 18.28% | 4.47% | 5.72% | 4.22% | 37.44% |
| MLEE | 2.38% | 15.72% | 5.48% | 6.38% | 3.97% | 33.93% |
* The statistics are derived by training method on training set and testing on development set of CG/PC/MLEE.
* The Wrong T_Label represents the event triggers with the wrong assigned label. The Wrong T_Span represents the range of the trigger words that were wrong (including detected triggers that do not exist in the gold standard). The Wrong Argu indicates that the event trigger was correctly detected but the arguments were wrongly assigned. Similarly, the Redundant Argu indicates that redundant arguments were assigned for correctly detected triggers.
Fig. 3The overall networks structure (CharCNN, BiLSTM, TR, RC, EE), some components are omitted for brevity(detailed structures are shown in Additional file 1: Figure S1–S3)
Fig. 4A simple example to illustrate the principle of CS