| Literature DB >> 35469206 |
Ran Jin1, Tengda Hou1, Tongrui Yu1, Min Luo2, Haoliang Hu1.
Abstract
Over the years, the explosive growth of drug-related text information has resulted in heavy loads of work for manual data processing. However, the domain knowledge hidden is believed to be crucial to biomedical research and applications. In this article, the multi-DTR model that can accurately recognize drug-specific name by joint modeling of DNER and DNEN was proposed. Character features were extracted by CNN out of the input text, and the context-sensitive word vectors were obtained using ELMo. Next, the pretrained biomedical words were embedded into BiLSTM-CRF and the output labels were interacted to update the task parameters until DNER and DNEN would support each other. The proposed method was found with better performance on the DDI2011 and DDI2013 datasets.Entities:
Mesh:
Year: 2022 PMID: 35469206 PMCID: PMC9034928 DOI: 10.1155/2022/3321296
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1An example of DNER and DNEN tasks.
Figure 2CNN used to extract a character-level representation of words.
Figure 3Structure of the ELMo model.
Figure 4The main architecture of the multi-DTR model.
Training and testing set in DDI2011.
| Set | Documents | Sentences | Drugs |
|---|---|---|---|
| Training | 435 | 4267 | 11260 |
| Final test | 144 | 1539 | 3689 |
| Total | 579 | 5806 | 14949 |
Numbers of the annotated entities in DDI2013 set.
| Type | Train | Test | ||||
|---|---|---|---|---|---|---|
| DrugBank | MedLine | Total | DrugBank | MedLine | Total | |
| Drug | 9901 (63%) | 1745 (63%) | 11646 (63%) | 180 (59%) | 171 (44%) | 351 (51%) |
| Brand | 1824 (12%) | 42 (1.5%) | 1866 (10%) | 53 (18%) | 6 (2%) | 59 (8%) |
| Group | 3901 (25%) | 324 (12%) | 4225 (23%) | 65 (21%) | 90 (24%) | 155 (23%) |
| Drug_n | 130 (1%) | 635 (23%) | 765 (4%) | 6 (2%) | 115 (30%) | 121 (18%) |
| Total | 15756 | 2746 | 18502 | 304 | 382 | 686 |
The parameters for our experiments.
| Layer | Hyperparameter | Value |
|---|---|---|
| CNN | Window size | 3 |
| Number of filters | 30 | |
|
| ||
| LSTM | State size | 200 |
| Initial state | 0.0 | |
| Peepholes | No | |
|
| ||
| Dropout | Dropout rate | 0.5 |
| Batch size | 10 | |
| Initial learning rate | 0.015 | |
| Gradient clipping | 5.0 | |
| Decay rate | 0.05 | |
| Labeling schema | BIO | |
| ELMo dim | 1024 | |
Results of experiment in DDI2011 and DDI2013.
| System | DDI2011 | DDI2013 | ||||
|---|---|---|---|---|---|---|
| Precision | Recall |
| Precision | Recall |
| |
| UMCC_DLS | - | - | - | 24.00 | 57.00 | 34.00 |
| Hettne | 66.91 | 71.42 | 69.09 | 59.41 | 56.32 | 57.82 |
| Tsuruoka | 68.42 | 72.39 | 70.34 | 62.24 | 58.17 | 60.12 |
| WBI | 89.53 | 88.42 | 88.97 | 76.70 | 88.42 | 74.80 |
| LASIGE | 87.02 | 82.51 | 84.70 | 78.00 | 56.00 | 65.19 |
| Yang | 81.44 | 81.50 | 81.46 | 76.54 | 74.40 | 75.45 |
| Zeng | 93.26 | 91.11 | 92.17 | 83.60 | 77.81 | 79.26 |
| Liu | - | - | - | 87.46 | 75.22 | 80.88 |
| Multi-DTR | 94.36 | 92.13 | 93.22 | 85.56 | 81.45 | 83.45 |
Experimental results of different entity types in DDI2013.
| Type | Precision | Recall |
|
|---|---|---|---|
| Drug | 86.52 | 81.68 | 84.03 |
| Brand | 89.46 | 78.51 | 83.62 |
| Group | 83.26 | 86.43 | 84.81 |
| Drug_n | 79.74 | 67.36 | 73.02 |
| Mico-average | 85.56 | 81.45 | 83.45 |
Performance comparison of each representations.
| System | DDI2011 | DDI2013 | ||||
|---|---|---|---|---|---|---|
| Precision | Recall | F1 | Precision | Recall | F1 | |
| ELMo | 88.46 | 87.74 | 88.09 | 82.14 | 79.24 | 81.68 |
| Char | 86.32 | 85.12 | 85.71 | 81.21 | 78.53 | 79.84 |
| Glove | 88.12 | 89.34 | 88.72 | 84.74 | 80.57 | 82.60 |
| ELMo + Char | 89.47 | 90.55 | 90.00 | 83.45 | 81.06 | 82.23 |
| Char + Glove | 90.14 | 88.42 | 89.24 | 83.32 | 80.64 | 81.95 |
| ELMo + Glove | 91.73 | 89.51 | 90.60 | 84.24 | 81.32 | 82.75 |
| ELMo + Glove + Char | 94.36 | 92.13 | 93.22 | 85.56 | 81.45 | 83.45 |
Figure 5Performance comparison of different optimization methods optimization.
Performance comparison using Dropout.
| Precision | Recall | F1 | ||
|---|---|---|---|---|
| DDI2011 | No | 92.73 | 91.11 | 91.91 |
| Yes | 94.36 | 92.13 | 93.22 | |
| Δ | +1.63 | +1.02 | +1.31 | |
|
| ||||
| DDI2013 | No | 83.52 | 79.71 | 81.04 |
| Yes | 85.56 | 81.45 | 83.45 | |
| Δ | +2.04 | +1.74 | +2.41 | |
Performance comparison of adopting multitask learning.
| Precision | Recall |
| ||
|---|---|---|---|---|
| DDI2011 | Single-task | 91.13 | 89.51 | 90.30 |
| Multitask | 94.36 | 92.13 | 93.22 | |
| Δ | +3.23 | +2.62 | +2.92 | |
|
| ||||
| DDI2013 | Single-task | 83.42 | 78.01 | 80.62 |
| Multitask | 85.56 | 81.45 | 83.45 | |
| Δ | +2.14 | +3.44 | +2.83 | |