| Literature DB >> 32351611 |
Li Wang1,2, Wenjie Pan1, QingHua Wang1, Heming Bai2, Wei Liu2, Lei Jiang3, Yuanpeng Zhang1,2.
Abstract
Drug-drug interactions (DDIs) are one of the indispensable factors leading to adverse event reactions. Considering the unique structure of AERS (Food and Drug Administration Adverse Event Reporting System (FDA AERS)) reports, we changed the scope of the window value in the original skip-gram algorithm, then propose a language concept representation model and extract features of drug name and reaction information from large-scale AERS reports. The validation of our scheme was tested and verified by comparing with vectors originated from the cooccurrence matrix in tenfold cross-validation. In the verification of description enrichment of the DrugBank DDI database, accuracy was calculated for measurement. The average area under the receiver operating characteristic curve of logistic regression classifiers based on the proposed language model is 6% higher than that of the cooccurrence matrix. At the same time, the average accuracy in five severe adverse event classes is 88%. These results indicate that our language model can be useful for extracting drug and reaction features from large-scale AERS reports.Entities:
Mesh:
Year: 2020 PMID: 32351611 PMCID: PMC7174925 DOI: 10.1155/2020/1747413
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Figure 1The scheme of DDI extraction based on the MSG algorithm.
Figure 2Dynamic scope of the window of the modified skip-gram model.
Figure 3Drug/reaction report cooccurrence matrix based on tfidf.
Figure 4Examples of DDIs in DrugBank and report in DrugBank_Toxicity.
Parameters of the modified skip-gram model.
| Dimensionality of word embeddings | Starting alpha | Min count for drugs or reactions | Gradient calculation | |
|---|---|---|---|---|
| Parameters | 100 | 0.025 | 10 | Hierarchical softmax |
Positive reference samples of five event classes.
| Event class | DrugBank_DDI | DrugBank_Toxicity | SIDER |
|---|---|---|---|
| Renal Impairment (REI) | 117 | 47 | 270 |
| Hepatotoxic (HTT) | 11 | 29 | 265 |
| Abnormal Blood Pressure (ABP) | 757 | 132 | 275 |
| Cardiotoxicity (CDT) | 544 | 51 | 448 |
| Neurotoxic (NET) | 221 | 158 | 298 |
Figure 5ROC of ten logistic regression models based on MSG.
Figure 6ROC of ten logistic regression models based on CM-TF-IDF.
Figure 7AUROC of twenty logistic regression models.
Enrichment of drug pair
| No. | Reaction | Cosine |
|---|---|---|
| 1 | Mediastinal haematoma | 0.739511629 |
| 2 | Pulmonary toxicity | 0.731175786 |
|
|
|
|
|
|
|
|
|
|
|
|
| 6 | Tumour embolism | 0.711951369 |
|
|
|
|
|
|
|
|
| 9 | Metastasis | 0.71071688 |
| 10 | Rhabdomyosarcoma | 0.709203308 |
| 11 | Ewing's sarcoma | 0.706786142 |
| 12 | Aorto-oesophageal fistula | 0.706059463 |
| 13 | Stress ulcer | 0.703392027 |
| 14 | Pneumonia pseudomonal | 0.699428808 |
| 15 | Renal cortical necrosis | 0.699053548 |
| 16 | Emphysematous pyelonephritis | 0.698257906 |
|
|
|
|
| 18 | Hospice care | 0.695568984 |
| 19 | Malignant glioma | 0.695259318 |
| 20 | Disease progression | 0.694726752 |
Details of drug pair DDI enrichment in DrugBank.
| Event class | Number of DDIs | Number of valid enrichments | Accuracy |
|---|---|---|---|
| Renal Impairment (REI) | 117 | 99 | 0.846154 |
| Hepatotoxic (HTT) | 11 | 9 | 0.818182 |
| Abnormal Blood Pressure (ABP) | 757 | 660 | 0.871863 |
| Cardiotoxicity (CDT) | 544 | 494 | 0.908088 |
| Neurotoxic (NET) | 221 | 194 | 0.877828 |
| Total | 1650 | 1456 | 0.882424 |