| Literature DB >> 31197355 |
Meizhi Ju1,2, Nhung T H Nguyen1,2, Makoto Miwa3,2, Sophia Ananiadou1,2.
Abstract
OBJECTIVE: This article describes an ensembling system to automatically extract adverse drug events and drug related entities from clinical narratives, which was developed for the 2018 n2c2 Shared Task Track 2.Entities:
Keywords: adverse drug event; electronic health record; information extraction; natural language processing; nested named entity recognition
Year: 2020 PMID: 31197355 PMCID: PMC6913208 DOI: 10.1093/jamia/ocz075
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1.An overview of our work. Intra-ensemble refers to the combination of different versions of the same model with different parameter settings. Inter-ensemble represents the combination of different models or different intra-ensembles.
Figure 2.An example of nested entities.
Figure 3.The architecture of the neural model. (a) is the model architecture while (b) is the composition of the word embeddings.
Statistics of the data set. Rare words are words that occur only once in the data. Unknown words refer to words that are not seen in the training set
| Item | Training | Development |
|---|---|---|
| Document | 242 | 61 |
| Entities | 41 171 | 9776 |
| Nest level 1 entity (flat entities) | 41 109 | 9760 |
| Nest level 2 entity | 61 | 16 |
| Nest level 3 entity | 1 | 0 |
| Polysemous entity | 47 | 13 |
| Textually nested entity | 15 | 3 |
| ADE | 785 | 174 |
| Dosage | 3401 | 820 |
| Drug | 13 109 | 3114 |
| Duration | 499 | 93 |
| Form | 5340 | 1311 |
| Frequency | 5075 | 1205 |
| Reason | 3105 | 750 |
| Route | 4479 | 996 |
| Strength | 5378 | 1313 |
| Unknown words /Unique words | – | 17.00% |
| Rare words /Unique words | 37.19% | 37.69% |
| EUNKs/All entities | – | 2.67% |
| ERAREs /All entities | 1.89% | 3.88% |
Abbreviations: ADE, adverse drug event; EUNKs, entities that contain unknown words, ERAREs, entities that contain rare words.
Performance of CRF and NN models on the development set. For each model, the best lenient metrics of precision, recall, and F-score are shown in bold
| Model | Precision | Recall | F-score |
|---|---|---|---|
|
| |||
| Baseline (Lexical and syntactic features) | 0.9525 | 0.8825 | 0.9162 |
| Baseline + word shape (ws) | 0.9527 | 0.8815 | 0.9157 |
| Baseline + dictionary features (df) | 0.9511 | 0.8829 | 0.9157 |
| Baseline + cluster features (cf) | 0.9504 | 0.8902 |
|
| Baseline + ws + df |
| 0.8821 | 0.9158 |
| Baseline + ws + cf | 0.9491 | 0.8898 | 0.9185 |
| Baseline + df + cf | 0.9494 |
| 0.9189 |
| Baseline + ws + df + cf | 0.9486 | 0.8900 | 0.9184 |
|
| |||
| Baseline (word + characters) | 0.9476 | 0.8995 | 0.9230 |
| Csub (characters + subword) |
| 0.9042 | 0.9266 |
| Wsub (word + subword) | 0.9496 | 0.9044 | 0.9264 |
| Wcsub (word + subword + characters) | 0.9498 |
|
|
|
| |||
| Inter-CRF | 0.9466 | 0.8935 | 0.9193 |
| Intra-csub |
| 0.8981 | 0.9306 |
| Intra-wsub | 0.9638 | 0.9013 | 0.9315 |
| Intra-wcsub | 0.9641 | 0.9010 | 0.9315 |
| Inter-NN | 0.9591 | 0.9084 |
|
| NN-CRF | 0.9401 |
| 0.9304 |
represents significance value at P < .05 with approximate randomization significance test.
Abbreviations: CRF, conditional random fields; NN, neural network.
Lenient performance on the test set with submission and inter-NN settings
| Entity type | Precision | Recall | F-score |
|---|---|---|---|
|
| |||
| Strength | 0.9815 | 0.9804 | 0.9810 |
| Frequency | 0.9788 | 0.9666 | 0.9727 |
| Route | 0.9662 | 0.9445 | 0.9552 |
| Drug | 0.9567 | 0.9533 | 0.9550 |
| Form | 0.9653 | 0.9436 | 0.9543 |
| Dosage | 0.9356 | 0.9433 | 0.9395 |
| Duration | 0.8875 | 0.7513 | 0.8138 |
| Reason | 0.7254 | 0.5470 | 0.6237 |
| ADE | 0.4697 | 0.1984 | 0.2790 |
| Overall (micro) | 0.9444 | 0.9073 | 0.9255 |
|
| |||
| Overall (micro) | 0.9599 | 0.8979 | 0.9278 |
Abbreviation: ADE, adverse drug event.
Figure 4.Statistics of CEs and SEs for our best individual and ensemble models on the development set.
Figure 5.Statistics of CEs and SEs for each category on the development set.
Figure 6.Percentage of category-wise extracted EUNKs (a) and ERAREs (b).
Fine-grained lenient matching statistics of individual and ensemble models on the development set
| Matching type | CRF (%) | Wcsub (%) | Inter-NN (%) | NN-CRF (%) |
|---|---|---|---|---|
| Strict | 95.73 | 95.02 | 95.62 | 95.51 |
| Includes | 2.71 | 3.17 | 2.83 | 2.75 |
| Is included | 1.53 | 1.76 | 1.50 | 1.69 |
| Partial overlap | 0.03 | 0.05 | 0.06 | 0.06 |