| Literature DB >> 23133662 |
Yijia Zhang1, Hongfei Lin, Zhihao Yang, Jian Wang, Yanpeng Li.
Abstract
When one drug influences the level or activity of another drug this is known as a drug-drug interaction (DDI). Knowledge of such interactions is crucial for patient safety. However, the volume and content of published biomedical literature on drug interactions is expanding rapidly, making it increasingly difficult for DDIs database curators to detect and collate DDIs information manually. In this paper, we propose a single kernel-based approach to extract DDIs from biomedical literature. This novel kernel-based approach can effectively make full use of syntactic structural information of the dependency graph. In particular, our approach can efficiently represent both single subgraph topological information and the relation of two subgraphs in the dependency graph. Experimental evaluations showed that our single kernel-based approach can achieve state-of-the-art performance on the publicly available DDI corpus without exploiting multiple kernels or additional domain resources.Entities:
Mesh:
Year: 2012 PMID: 23133662 PMCID: PMC3486804 DOI: 10.1371/journal.pone.0048901
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1A sentence on the DDI corpus which underwent parsing with the Charniak and Lease parser.
Figure 2Graph representation of a sentence.
Figure 3Illustration of pruning the dependency subgraph and linear subgraph.
Figure 4An example of how to compute hierarchical hash labels.
Figure 5Illustration of subgraph pairs features mapped from , ,…
.
Statistics of the DDI corpora.
| Training sets | Test sets | Total | |
| Documents | 435 | 144 | 579 |
| Candidate DDI pairs | 23827 | 7026 | 30853 |
| Positive DDI pairs | 2402 | 756 | 3158 |
| Negative DDI pairs | 21425 | 6270 | 27695 |
Effectiveness of parameterβ.
| β |
|
|
| σF | AUC | σAUC |
| β = 0.2 | 55.5 | 64.6 | 59.7 | 2.1 | 88.3 | 1.8 |
| β = 0.4 | 56.4 | 66.5 | 61.0 | 2.8 | 89.5 | 1.7 |
| β = 0.6 | 56.8 |
|
| 2.4 | 90.1 | 2.2 |
| β = 0.8 | 57.1 | 67.4 | 61.7 | 2.7 | 90.6 | 1.5 |
| β = 1.0 | 56.7 | 65.6 | 60.8 | 1.8 |
| 1.4 |
| β = 1.5 | 57.3 | 63.3 | 60.2 | 2.0 | 90.7 | 1.9 |
| β = 2.0 |
| 62.5 | 59.9 | 1.7 | 90.2 | 1.2 |
F: F-score; P: precision; R: recall. σF andσAUC are the standard deviation of the F-score and AUC in cross validation, respectively.
Performance of our approach in comparison with other approaches.
| Approach |
|
|
|
|
|
|
|
|
|
|
| WBI | 543 | 354 | 212 | 5917 | 60.5 | 71.9 |
| 91.9 |
| - |
| Our approach | 508 | 297 | 248 | 5973 |
| 67.2 | 65.1 |
| 60.8 |
|
| LIMSI-FBK | 532 | 376 | 223 | 5895 | 58.6 | 70.5 | 64.0 | 91.5 | 59.5 | - |
| FBK-HLT | 529 | 377 | 226 | 5894 | 58.4 | 70.1 | 63.7 | 91.4 | 59.2 | - |
| UTurku | 520 | 376 | 235 | 5895 | 58.0 | 68.9 | 63.0 | 91.3 | 58.4 | - |
| BNBNLEL | 420 | 266 | 335 | 6005 | 61.2 | 55.6 | 58.3 | 91.5 | 53.6 | - |
| Naive Bayes approach | 603 | 786 | 153 | 5484 | 43.4 |
| 56.2 | 86.6 | 52.3 | 84.1 |
| SVM approach | 382 | 346 | 374 | 5924 | 52.5 | 50.6 | 51.5 | 89.8 | 45.8 | 87.2 |
F: F-score; P: precision; R: recall; Acc: accuracy; MCC: Matthews correlation coefficient.