| Literature DB >> 35581608 |
Zhao Chen1, Mengzhu Zhao1, Liangzhen You2, Rui Zheng1, Yin Jiang1, Xiaoyu Zhang1, Ruijin Qiu1, Yang Sun1, Haie Pan1, Tianmai He1, Xuxu Wei1, Zhineng Chen3, Chen Zhao4, Hongcai Shang5.
Abstract
BACKGROUNDS: Traditional Chinese medicine and Western medicine combination (TCM-WMC) increased the complexity of compounds ingested.Entities:
Keywords: AI; DILI; Deep learning; Machine learning; Safety assessment; TCM-WMC
Year: 2022 PMID: 35581608 PMCID: PMC9112584 DOI: 10.1186/s13020-022-00617-4
Source DB: PubMed Journal: Chin Med ISSN: 1749-8546 Impact factor: 4.546
Fig. 1Workflow for the study of screening hepatotoxic compounds in TCM-WMC based AI methods. DILI Dataset was collected from public databases and published literatures. PaDEL was used to calculate molecular descriptors/fingerprints of DILI dataset compounds. Nine machine learning models of Stochastic gradient descent (SGD), k-nearest neighbor (kNN), Support vector machine (SVM), Naive bayes (NB), Decision tree (DT), Random forest (RF), Artificial neural network (ANN), Adaboost, Logistic regression (LR) and one deep learning model (DBN) were adopted to develop DILI AI models
Fig. 2DILI dataset screening process
AI models for drug-included liver injury
| Dataset | ML method | AUC | CA | F1 | Precision | Recall |
|---|---|---|---|---|---|---|
| Training set | SGD | 0.647 | 0.709 | 0.715 | 0.723 | 0.709 |
| kNN | 0.654 | 0.722 | 0.698 | 0.690 | 0.722 | |
| SVM | 0.785 | 0.795 | 0.791 | 0.788 | 0.795 | |
| DT | 0.710 | 0.756 | 0.760 | 0.764 | 0.756 | |
| RF | 0.814 | 0.838 | 0.827 | 0.832 | 0.838 | |
| Adaboost | 0.785 | 0.792 | 0.788 | 0.785 | 0.792 | |
| ANN | 0.621 | 0.737 | 0.671 | 0.685 | 0.737 | |
| LR | 0.746 | 0.776 | 0.757 | 0.757 | 0.761 | |
| NB | 0.632 | 0.686 | 0.694 | 0.711 | 0.686 | |
| Test set | SGD | 0.627 | 0.682 | 0.694 | 0.712 | 0.682 |
| kNN | 0.574 | 0.745 | 0.636 | 0.555 | 0.745 | |
| SVM | 0.669 | 0.747 | 0.712 | 0.710 | 0.747 | |
| DT | 0.544 | 0.680 | 0.679 | 0.678 | 0.680 | |
| RF | 0.739 | 0.767 | 0.731 | 0.739 | 0.767 | |
| Adaboost | 0.614 | 0.708 | 0.707 | 0.707 | 0.708 | |
| ANN | 0.647 | 0.694 | 0.696 | 0.697 | 0.694 | |
| LR | 0.656 | 0.733 | 0.694 | 0.688 | 0.733 | |
| NB | 0.598 | 0.675 | 0.648 | 0.705 | 0.675 |
LR Logistic regression, RF Random forest, SVM Support vector machine, kNN k-nearest neighbor, DT Decision tree, NB Naive bayes, ANN Artificial neural network, SGD Stochastic gradient descent
Fig. 3ROC curve of training set of DILI screening model based on AI model. LR: Logistic regression, RF: Random forest, SVM: Support vector machine, kNN: k-nearest neighbor, DT: Decision tree, NB: Naive bayes, AdaBoost, ANN: Artificial neural network, SGD: Stochastic gradient descent
Fig. 4ROC curve of test set of DILI screening model based on AI model. LR: Logistic regression, RF: Random forest, SVM: Support vector machine, kNN: k-nearest neighbor, DT: Decision tree, NB: Naive bayes, AdaBoost, ANN: Artificial neural network, SGD: Stochastic gradient descent