| Literature DB >> 34946636 |
Myung-Gyun Kang1,2, Nam Sook Kang2.
Abstract
Drug-induced liver injury (DILI) is a major concern for drug developers, regulators, and clinicians. However, there is no adequate model system to assess drug-associated DILI risk in humans. In the big data era, computational models are expected to play a revolutionary role in this field. This study aimed to develop a deep neural network (DNN)-based model using extended connectivity fingerprints of diameter 4 (ECFP4) to predict DILI risk. Each data set for the predictive model was retrieved and curated from DILIrank, LiverTox, and other literature. The best model was constructed through ten iterations of stratified 10-fold cross-validation, and the applicability domain was defined based on integer ECFP4 bits of the training set which represented substructures. For the robustness test, we employed the concept of the endurance level. The best model showed an accuracy of 0.731, a sensitivity of 0.714, and a specificity of 0.750 on the validation data set in the complete applicability domain. The model was further evaluated with four external data sets and attained an accuracy of 0.867 on 15 drugs with DILI cases reported since 2019. Overall, the results suggested that the ECFP4-based DNN model represents a new tool to identify DILI risk for the evaluation of drug safety.Entities:
Keywords: DILI; DNN; ECFP4; applicability domain; deep neural network; drug-induced liver injury; endurance level; in silico model; machine learning; substructure space
Mesh:
Year: 2021 PMID: 34946636 PMCID: PMC8707960 DOI: 10.3390/molecules26247548
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Data sets used to generate the drug-induced liver injury (DILI)-prediction model in this study.
| Data Sets | DILI-Negatives | DILI-Positives | Total | |
|---|---|---|---|---|
| Training | DILIrank data set | 245 | 439 | 684 |
| LiverTox data set | 234 | 22 | 256 | |
| SUM | 479 | 461 | 940 | |
| Validation | Greene data set | 64 | 92 | 156 |
| Xu data set | 10 | 13 | 23 | |
| SUM | 74 | 105 | 179 |
Figure 1Venn diagram of unique integer ECFP4 fingerprint bits in the training data set and validation data set.
Stratified 10-fold cross-validation results with ECFP4 over ten iterations.
| Iteration | ACC | Best Loss |
|---|---|---|
| 1 | 0.873 ± 0.0957 | 0.2271 |
| 2 | 0.864 ± 0.1074 | 0.1713 |
| 3 | 0.945 ± 0.0834 | 0.1264 |
| 4 | 0.918 ± 0.0725 | 0.1066 |
| 5 | 0.939 ± 0.0778 | 0.1276 |
|
|
|
|
| 7 | 0.835 ± 0.1557 | 0.1858 |
| 8 | 0.927 ± 0.0788 | 0.1102 |
| 9 | 0.889 ± 0.0943 | 0.1258 |
| 10 | 0.814 ± 0.1928 | 0.2245 |
ACC: accuracy; DILI+: drug-induced liver injury (DILI)-positive; DILI−: DILI-negative. Bold represents the best model selected due to the lowest loss values.
Comparison of the deep neural network models by molecular fingerprints.
|
|
|
| ||||||
|
|
|
|
|
|
|
|
| |
| 0% | 0.731 | 0.714 | 0.750 | 0.741 | 0.750 | 0.500 | 1.000 | 0.667 |
| 5% | 0.667 | 0.778 | 0.524 | 0.724 | 0.706 | 0.667 | 0.750 | 0.706 |
| 10% | 0.648 | 0.744 | 0.500 | 0.719 | 0.615 | 0.667 | 0.556 | 0.651 |
| 15% | 0.642 | 0.742 | 0.488 | 0.706 | 0.608 | 0.607 | 0.609 | 0.630 |
| 20% | 0.632 | 0.763 | 0.434 | 0.706 | 0.571 | 0.571 | 0.571 | 0.593 |
| 30% | 0.607 | 0.758 | 0.397 | 0.686 | 0.540 | 0.507 | 0.591 | 0.574 |
|
|
|
| ||||||
|
|
|
|
|
|
|
|
| |
| 0% | 0.548 | 0.556 | 0.538 | 0.609 | 0.650 | 0.600 | 0.700 | 0.556 |
| 5% | 0.553 | 0.635 | 0.452 | 0.630 | 0.618 | 0.632 | 0.600 | 0.611 |
| 10% | 0.552 | 0.663 | 0.390 | 0.640 | 0.614 | 0.677 | 0.538 | 0.613 |
| 15% | 0.531 | 0.638 | 0.379 | 0.622 | 0.586 | 0.640 | 0.513 | 0.626 |
| 20% | 0.515 | 0.616 | 0.375 | 0.601 | 0.575 | 0.652 | 0.471 | 0.629 |
| 30% | 0.517 | 0.606 | 0.392 | 0.600 | 0.551 | 0.629 | 0.449 | 0.604 |
ACC: accuracy, SE: sensitivity, SP: specificity, F1: F1 score.
Figure 2Overall workflow for model creation and evaluation. Models were created based on chemical structure-based fingerprints using various algorithms and subsets from validation data sets selected by the applicability domain, which was defined by the endurance level.
Figure 3Histogram of maximum similarities for each DILI class. The graph indicates that the DILI-negatives in the validation data set had a more biased distribution of the maximum similarity, ranging from 0.2 to 0.3, than DILI-positives in the same data set. This indicates that the DILI-negatives structurally varied more than the DILI-positives.
Evaluation results of the naive Bayesian, SVM, and RF models by endurance levels.
| Endurance | Naive Bayesian | SVM | RF | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ACC | SE | SP | F1 | ACC | SE | SP | F1 | ACC | SE | SP | F1 | |
| 0% | 0.538 | 0.643 | 0.417 | 0.600 | 0.577 | 0.571 | 0.583 | 0.593 | 0.577 | 0.500 | 0.667 | 0.560 |
| 5% | 0.604 | 0.778 | 0.381 | 0.689 | 0.583 | 0.630 | 0.524 | 0.630 | 0.604 | 0.667 | 0.524 | 0.655 |
| 10% | 0.606 | 0.767 | 0.357 | 0.702 | 0.592 | 0.651 | 0.500 | 0.659 | 0592 | 0.651 | 0.500 | 0.659 |
| 15% | 0.606 | 0.803 | 0.302 | 0.711 | 0.615 | 0.667 | 0.535 | 0.677 | 0.578 | 0.652 | 0.465 | 0.652 |
| 20% | 0.602 | 0.825 | 0.264 | 0.714 | 0.602 | 0.688 | 0.472 | 0.675 | 0.564 | 0.675 | 0.396 | 0.651 |
| 30% | 0.589 | 0.810 | 0.279 | 0.697 | 0.601 | 0.684 | 0.485 | 0.667 | 0.571 | 0.674 | 0.426 | 0.646 |
ACC: accuracy; SE: sensitivity; SP: specificity; SVM: support vector machine; RF: random forest.
Performance comparison of the DNN-based model with external data sets.
| References | Level * | Training | ACC | SE | SP | AUC |
|---|---|---|---|---|---|---|
| Liew et al. (entire data set) [ | 0% | 114 | 0.789 | 0.838 | 0.717 | 0.853 |
| 100% | 187 | 0.642 | 0.724 | 0.537 | 0.742 | |
| valBLACK | 0% | 38 | 0.974 | 0.955 | 1.000 | 0.955 |
| (22+/16−) | (0.809) | (0.957) | (0.667) | (0.924) | ||
| 100% | 47 | 0.830 | 0.957 | 0.708 | 0.937 | |
| valPAIR | 0% | 14 | 0.500 | 0.857 | 0.143 | 0.551 |
| (7+/7−) | (0.550) | (0.800) | (0.300) | (0.450) | ||
| 100% | 20 | 0.450 | 0.700 | 0.200 | 0.525 | |
| valRANDOM | 0% | 62 | 0.742 | 0.769 | 0.696 | 0.836 |
| (39+/23−) | (0.750) | (0.819) | (0.646) | (0.595) | ||
| 100% | 120 | 0.600 | 0.653 | 0.521 | 0.687 | |
| Zhang et al. [ | 0% | 80 | 0.950 | 1.000 | 0.926 | 0.957 |
| (53+/27−) | (0.750) | (0.932) | (0.379) | (0.667) | ||
| 100% | 85 | 0.941 | 0.982 | 0.857 | 0.952 | |
| Ai et al. [ | 0% | 84 | 0.881 | 0.905 | 0.810 | 0.920 |
| (63+/21−) | (0.843) | (0.869) | (0.754) | (0.904) | ||
| 100% | 121 | 0.893 | 0.904 | 0.852 | 0.911 | |
| Kotsampasakou et al. [ | 0% | 151 | 0.636 | 0.595 | 0.687 | 0.672 |
| (84+/67−) | (0.600) | (0.670) | (0.520) | (0.642) | ||
| 100% | 973 | 0.585 | 0.635 | 0.526 | 0.605 |
* Endurance level. ACC: accuracy; SE: sensitivity; SP: specificity; AUC: area under the receiver–operating characteristic curve. The data in parentheses are validation results from each reference.
Evaluation results of the DNN model using 15 drugs with case reports.
| Drugs | CID | Endurance | Prediction | Prediction |
|---|---|---|---|---|
| Flucloxacillin [ | 21,319 | 6.7% | DILI-positive | 0.999 |
| Aliskiren [ | 5,493,444 | 7.0% | DILI-positive | 0.999 |
| Rilpivirine [ | 6,451,164 | 5.0% | DILI-positive | 0.994 |
| Escitalopram [ | 146,570 | 5.0% | DILI-positive | 0.989 |
| Nilotinib [ | 644,241 | 7.6% | DILI-positive | 0.982 |
| Olmesartan [ | 158,781 | 6.3% | DILI-positive | 0.974 |
| Mesterolone [ | 15,020 | 4.1% | DILI-positive | 0.971 |
| Levothyroxine [ | 5819 | 3.9% | DILI-positive | 0.965 |
| Zanubrutinib [ | 135,565,884 | 6.4% | DILI-positive | 0.922 |
| Phenprobamate [ | 4770 | 2.8% | DILI-positive | 0.896 |
| Apixaban [ | 10,182,969 | 5.5% | DILI-positive | 0.804 |
| Fasiglifam [ | 24,857,286 | 7.0% | DILI-positive | 0.660 |
| Pirfenidone [ | 40,632 | 2.6% | DILI-positive | 0.608 |
| Ligandrol [ | 44,137,686 | 4.6% | DILI-negative | 0.378 |
| Ulipristal acetate [ | 130,904 | 6.5% | DILI-negative | 0.036 |
DILI, drug-induced liver injury.
Figure 4The deep neural network architecture of the DILI prediction model.