| Literature DB >> 30010749 |
Abstract
In this paper, we explore the application of artificial neural network ('deep learning') methods to the problem of detecting chemical-protein interactions in PubMed abstracts. We present here a system using multiple Long Short Term Memory layers to analyse candidate interactions, to determine whether there is a relation and which type. A particular feature of our system is the use of unlabelled data, both to pre-train word embeddings and also pre-train LSTM layers in the neural network. On the BioCreative VI CHEMPROT test corpus, our system achieves an F score of 61.51% (56.10% precision, 67.84% recall).Entities:
Mesh:
Substances:
Year: 2018 PMID: 30010749 PMCID: PMC6044291 DOI: 10.1093/database/bay066
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Layers in pre-training network
| Layer | Type | Input(s) | Number of output neurons | Notes |
|---|---|---|---|---|
| Embedding | 300 | |||
| Embedding | 300 | |||
| Embedding | 300 | |||
| LSTM | 300 | |||
| LSTM | 300 | Reversed | ||
| concatenate | 600 | |||
| concatenate | 600 | |||
| TimeDistributed Dense | 300 | Activation is relu | ||
| TimeDistributed Dense | 1 | Activation is sigmoid. | ||
| TimeDistributed Dense | 300 | Activation is relu | ||
| TimeDistributed Dense | 1 | Activation is sigmoid. |
Figure 1.Pre-training network.
Layers in recognition network
| Layer | Type | Input(s) | Number of output neurons | Notes |
|---|---|---|---|---|
| Embedding | 300 | |||
| LSTM | 300 | |||
| LSTM | 300 | Reversed | ||
| Conv1D | 48 | Width = 3, activation is relu | ||
| Conv1D | 6 | Width = 3, activation is relu | ||
| concatenate | 652 | |||
| Bidirectional LSTM | 128 per direction, total 256 | |||
| GlobalMaxPooling1D | 256 | |||
| Dense | 6 | Activation is softmax |
Figure 2.Recognition network.
Results on development
| Run | Precision (%) | Recall (%) | |
|---|---|---|---|
| Full | 56.52 | 70.42 | 62.71 |
| No Phase 1 | 62.97 | 57.25 | 59.97 |
| PublicEmbeddings | 61.69 | 56.96 | 59.23 |
| Random | 45.05 | 50.66 | 47.70 |
Results on development
| Run | Precision (%) | Recall (%) | Best Epoch | |
|---|---|---|---|---|
| Full | 56.52 | 62.71 | 33 | |
| No Phase I | 62.97 | 57.25 | 59.97 | 15 |
| Unidirectional, 128 outputs, final state | 56.49 | 58.96 | 57.69 | 10 |
| Unidirectional, 256 outputs, final state | 50.71 | 62.33 | 55.93 | 24 |
| Unidirectional, 128 outputs, via Global MaxPooling | 61.70 | 60.88 | 61.28 | 6 |
| Conv1D | 60.04 | 28 | ||
| Conv1D with Phase I | 56.39 | 64.13 | 60.01 | 48 |
Numbers in boldface represent best results.
Results on development
| Sub-Epochs | Precision (%) | Recall (%) | Best Epoch | |
|---|---|---|---|---|
| 0 | 57.25 | 59.97 | 15 | |
| 5 | 61.37 | 64.67 | 62.97 | 19 |
| 10 | 61.69 | 64.88 | 63.24 | 12 |
| 15 | 59.38 | 68.29 | 36 | |
| 20 | 57.06 | 69.92 | 62.83 | 38 |
| 25 | 56.52 | 62.71 | 33 |
Numbers in boldface represent best results.
Results on development
| Run | Precision (%) | Recall (%) | ||
|---|---|---|---|---|
| 25 Sub-Epochs | 71.08 | 59.92 | 65.02 | 26 |
| 58.22 | 68.79 | 63.06 | 24 | |
| 15 Sub-Epochs | 68.87 | 62.58 | 65.58 | 15 |
| 63.05 | 66.25 | 64.61 | 15 | |
| No Phase I | 63.20 | 59.45 | 61.27 | 22 |
| 56.26 | 64.63 | 60.15 | 18 | |
| Conv1D, No Phase I | 65.86 | 59.50 | 62.52 | 19 |
| 55.94 | 68.42 | 61.56 | 19 |
For each run, the upper row is with thresholding, the lower is without.
Results on development and test
| Run | Precision (%) | Recall (%) | Best Epoch | |
|---|---|---|---|---|
| Development, no thresholding | 65.42 | 64.36 | 19 | |
| Test, no thresholding | 62.56 | 62.52 | 62.54 | |
| Development, thresholding | 63.87 | 67.17 | 64.91 | 34 |
| Test, thresholding | 62.97 | 62.20 | 62.58 |
Numbers in boldface represent best results.
Results
| Corpus | Precision (%) | Recall (%) | |
|---|---|---|---|
| Development | 56.52 | 70.42 | 62.71 |
| Test | 56.10 | 67.84 | 61.41 |
Confusion matrix for development data
| Actual | Predicted | |||||
|---|---|---|---|---|---|---|
| 26196 | 214 | 413 | 75 | 76 | 351 | |
| 163 | 287 | 82 | 4 | 4 | 9 | |
| 159 | 24 | 896 | 0 | 5 | 6 | |
| 23 | 0 | 0 | 89 | 4 | 0 | |
| 28 | 1 | 7 | 3 | 160 | 0 | |
| 166 | 4 | 16 | 0 | 2 | 258 | |
Development data results by relationship class
| Class | Precision (%) | Recall (%) | |
|---|---|---|---|
| 54.16 | 52.28 | 53.20 | |
| 63.37 | 82.20 | 71.57 | |
| 52.04 | 76.72 | 62.02 | |
| 63.75 | 80.40 | 71.11 | |
| 41.35 | 57.85 | 48.22 |
Figure 3.F scores for development and test sets from epochs 12 to 50, with thresholding.
Figure 4.F scores for development and test sets from epochs 12 to 50, with thresholding.