| Literature DB >> 30066652 |
Haodi Li1,2, Ming Yang3, Qingcai Chen4,5, Buzhou Tang6,7, Xiaolong Wang1,2, Jun Yan8.
Abstract
BACKGROUND: Extracting relationships between chemicals and diseases from unstructured literature have attracted plenty of attention since the relationships are very useful for a large number of biomedical applications such as drug repositioning and pharmacovigilance. A number of machine learning methods have been proposed for chemical-induced disease (CID) extraction due to some publicly available annotated corpora. Most of them suffer from time-consuming feature engineering except deep learning methods. In this paper, we propose a novel document-level deep learning method, called recurrent piecewise convolutional neural networks (RPCNN), for CID extraction.Entities:
Keywords: Chemical-induced disease; Convolutional neural network; Deep learning; Relation extraction
Mesh:
Year: 2018 PMID: 30066652 PMCID: PMC6069297 DOI: 10.1186/s12911-018-0629-3
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
An example of candidate generation (Literature with chemical and disease mentions and their identifiers)
| Position | Mention | Label | Identifier (MeSH) | |||
| start | end | |||||
| 0 | 28 | Cardiovascular complications | Disease | D002318 | ||
| 45 | 56 | terbutaline | Chemical | D013726 | ||
| 71 | 84 | preterm labor | Disease | D007752 | ||
| 93 | 121 | cardiovascular complications | Disease | D002318 | ||
| 169 | 180 | terbutaline | Chemical | D013726 | ||
| 185 | 198 | preterm labor | Disease | D007752 | ||
| Identifier (MeSH) | Chemical mention | Disease mention | ||||
| position | mention | Position | mention | |||
| start | end | start | end | |||
| <D013726, D002318> | 45 | 56 | terbutaline | 0 | 28 | Cardiovascular complications |
| 45 | 56 | terbutaline | 93 | 121 | Cardiovascular complications | |
| 169 | 180 | terbutaline | 0 | 28 | Cardiovascular complications | |
| 169 | 180 | terbutaline | 93 | 121 | Cardiovascular complications | |
| position | position | mention | ||||
| start | end | Start | End | |||
| <D013726, D007752> | 45 | 56 | 71 | 84 | preterm labor | |
| 45 | 56 | 185 | 198 | preterm labor | ||
| 169 | 180 | 71 | 84 | preterm labor | ||
| 169 | 180 | 185 | 198 | preterm labor | ||
Cardiovascular complications associated with terbutaline treatment for preterm labor
Abstract: Severe cardiovascular complications occurred in eight of 160 patients treated with terbutaline for preterm labor. Associated corticosteroid therapy and twin gestations appear to be predisposing factors. Potential mechanisms of the pathophysiology are briefly discussed
Fig. 1Architecture of recurrent piecewise convolutional neural networks (RPCNN) for multi-instance learning
Example of chemical position and disease position
performance of our cnn-based and rpcnn-based systems for chemical-induced disease extraction
| Methods | Without domain knowledge (%) | With domain knowledge (%) | ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| CNN | 50.47 | 55.61 | 52.92 | 63.70 | 74.40 | 68.64 |
| CNN + piecewise | 54.48 | 53.91 | 54.20 | 63.83 | 75.16 | 69.03 |
| CNN + attention | 48.40 | 58.54 | 52.99 | 62.28 | 76.58 | 68.69 |
| CNN + attention+piecewise | 57.80 | 54.20 | 55.94 | 59.97 | 81.49 | 69.09 |
| RPCNN | 55.17 | 63.63 | 59.10 | 65.24 | 77.21 | 70.77 |
Comparison between our systems and other state-of-the-art systems
| Methods | Without domain knowledge (%) | With domain knowledge (%) | ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| Xu et al. [ | 59.60 | 44.00 | 50.73 | 65.80 | 68.57 | 67.16 |
| Zhou et al. (LSTM) [ | 54.91 | 51.41 | 53.10 | / | / | / |
| Zhou et al. (CNN) [ | 41.13 | 55.25 | 47.16 | / | / | / |
| Gu et al. (CNN) [ | 59.70 | 55.00 | 57.20 | / | / | / |
| Patrick et al. (BRAN) [ | 55.60 | 70.80 | 62.10 | / | / | / |
| Our CNN | 57.80 | 54.20 | 55.94 | 59.97 | 81.49 | 69.09 |
| Our RPCNN | 55.17 | 63.63 | 59.10 | 65.24 | 77.21 | 70.77 |