| Literature DB >> 29297301 |
Wei Wang1, Xi Yang1, Canqun Yang1, Xiaowei Guo1, Xiang Zhang1, Chengkun Wu2.
Abstract
BACKGROUND: Drug-drug interaction extraction (DDI) needs assistance from automated methods to address the explosively increasing biomedical texts. In recent years, deep neural network based models have been developed to address such needs and they have made significant progress in relation identification.Entities:
Keywords: Data imbalance; Dependency tree; Long short term memory; Relation extraction
Mesh:
Year: 2017 PMID: 29297301 PMCID: PMC5751524 DOI: 10.1186/s12859-017-1962-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1The framework of our model
Fig. 2An example of the typed dependency representation and the corresponding dependency tree
Fig. 3LSTM memory block
An example of drug blinding
| Drug candidate | Sentence with drug blinding |
|---|---|
| ( | The CNS-depressant effect of |
| ( | The CNS-depressant effect of |
| ( | The CNS-depressant effect of |
The statistics of the DDI corpus
| Training set | Training set filtering | Test set | Test set filtering | |
|---|---|---|---|---|
| Negative | 23,371 | 17,297 | 4737 | 3335 |
| Advice | 1319 | 1315 | 302 | 301 |
| Effect | 1687 | 1677 | 360 | 357 |
| Mechanism | 826 | 821 | 221 | 221 |
| Int | 189 | 184 | 96 | 96 |
| Total | 27,792 | 21,294 | 5716 | 4310 |
| Ra. | 1:5.8 | 1:4.3 | 1:4.8 | 1:3.4 |
Note The Ra. denotes the ratio between positive instances and negative instances
The hyper parameters of our model
| Parameter | Description | Value |
|---|---|---|
|
| Dimension of word embedding | 100 |
|
| Dimension of distance embedding | 10 |
|
| The number of hidden units | 300 |
|
| The ratio of dropout | 0.7 |
|
| The L2 regularization | 0.001 |
|
| The learning rate of Adam optimizer | 0.01 |
|
| The ratio of under sampling | 0.5 |
|
| The ratio of oversampling | 0.5 |
|
| The times of oversampling | 6 |
Performance comparison of our models with baseline methods
| Models | DDI corpus | ||||||||
| DrugBank | MEDLINE | Overall | |||||||
| P | R | F | P | R | F | P | R | F | |
| DLSTM1 | 74.74 |
|
| 48.78 | 42.55 | 45.45 | 72.53 |
|
|
| DLSTM2 | 75.29 | 72.64 | 73.95 | 50.67 | 40.43 | 44.97 | 73.29 | 69.54 | 71.37 |
| B-LSTM | – | – | – | – | – | – | 75.97 | 65.57 | 70.39 |
| AB-LSTM | – | – | – | – | – | – | 67.85 | 65.98 | 66.90 |
| Joint AB-LSTM | – | – | – | – | – | – | 73.41 | 69.66 | 71.48 |
| CNN&DCNN | – | – | – | – | – | – |
| 64.66 | 70.81 |
| CNN |
| 66.74 | 71.52 |
|
|
| 75.72 | 64.66 | 69.75 |
| SCNN2 | – | – | – | – | – | – | 72.50 | 65.10 | 68.60 |
| SCNN1 | – | – | – | – | – | – | 69.10 | 65.10 | 67.00 |
| Kim | – | – | 69.80 | – | – | 38.20 | – | – | 67.00 |
| FBK-irst | 66.70 | 68.60 | 67.60 | 41.90 | 37.90 | 39.80 | 64.60 | 65.60 | 65.10 |
| UTurku | 73.80 | 53.50 | 62.00 | 59.30 | 16.80 | 26.20 | 73.20 | 49.90 | 59.40 |
| NIL_UCM | 56.60 | 57.90 | 57.30 | 35.70 | 15.80 | 21.90 | 53.50 | 50.10 | 51.70 |
| Models | PK DDI corpus | ||||||||
| – | – | – | – | – | – | P | R | F | |
| DLSTM1-multi | – | – | – | – | – | – |
|
|
|
| DLSTM1-single | – | – | – | – | – | – | 87.97 | 87.97 | 87.97 |
Class wise performance comparison of our models with baseline methods
| Models | Advice | Effect | Mechanism | Int | MAVG |
|---|---|---|---|---|---|
| DLSTM1 |
| 68.37 |
| 49.00 |
|
| DLSTM2 | 77.00 |
| 74.61 | 51.03 | 68.27 |
| CNN | 77.72 | 69.32 | 70.23 | 46.37 | 65.91 |
| Kim | 72.50 | 66.20 | 69.30 | 48.30 | 64.10 |
| FBK-irst | 69.20 | 62.80 | 67.90 |
| 64.80 |
| UTurku | 63.00 | 60.00 | 58.20 | 50.70 | 58.70 |
| NIL_UCM | 61.30 | 48.90 | 51.50 | 42.70 | 53.50 |
The effect of enhancements on performance
| Enhancement removed or replaced | P | R | F | △ |
|---|---|---|---|---|
| None | 72.53 | 71.49 | 72.00 | – |
| -DFS channel | 70.22 | 68.21 | 69.20 | −2.80 |
| -BFS channel | 73.52 | 65.23 | 69.13 | −2.87 |
| -DFS&BFS channels | 66.57 | 71.69 | 69.04 | −2.96 |
| -Negative instance filtering | 70.59 | 69.15 | 69.87 | −2.13 |
| -Train set sampling | 69.51 | 66.67 | 68.06 | −3.94 |
| #Bi-LSTM outputs concatenating | 70.94 | 66.87 | 68.85 | −3.15 |
Notes. △ denotes the corresponding F-score decrease percentage when an enhancement is removed or replaced
Fig. 4The distribution of DLSTM1’s predicted results for each DDI types. The vertical axis is the targeted type, while the horizontal axis is the predicted type. Point (X, Y) means the ratio, where X is predicted type and Y is targeted type. The sum of each row value equal to 1
Fig. 5The statistic and F-score of instances with different length in test data