| Literature DB >> 30804783 |
Hao Wang1,2, Ruifeng Liu1,2, Patric Schyman1,2, Anders Wallqvist2.
Abstract
Improving the accuracy of toxicity prediction models for liver injuries is a key element in evaluating the safety of drugs and chemicals. Mechanism-based information derived from expression (transcriptomic) data, in combination with machine-learning methods, promises to improve the accuracy and robustness of current toxicity prediction models. Deep neural networks (DNNs) have the advantage of automatically assembling the relevant features from a large number of input features. This makes them especially suitable for modeling transcriptomic data, which typically contain thousands of features. Here, we gaged gene- and pathway-level feature selection schemes using single- and multi-task DNN approaches in predicting chemically induced liver injuries (biliary hyperplasia, fibrosis, and necrosis) from whole-genome DNA microarray data. The single-task DNN models showed high predictive accuracy and endpoint specificity, with Matthews correlation coefficients for the three endpoints on 10-fold cross validation ranging from 0.56 to 0.89, with an average of 0.74 in the best feature sets. The DNN models outperformed Random Forest models in cross validation and showed better performance than Support Vector Machine models when tested in the external validation datasets. In the cross validation studies, the effect of the feature selection scheme was negligible among the studied feature sets. Further evaluation of the models on their ability to predict the injury phenotype per se for non-chemically induced injuries revealed the robust performance of the DNN models across these additional external testing datasets. Thus, the DNN models learned features specific to the injury phenotype contained in the gene expression data.Entities:
Keywords: artificial neural network; biliary hyperplasia; classification model; liver fibrosis; liver necrosis; machine leaning; toxicity prediction
Year: 2019 PMID: 30804783 PMCID: PMC6370634 DOI: 10.3389/fphar.2019.00042
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.810
Summary of training and testing data used in this study.
| Source | Number of samples | Reference | |||||
|---|---|---|---|---|---|---|---|
| Biliary Hyperplasia | Liver Fibrosis | Liver Necrosis | |||||
| + | - | + | - | + | - | ||
| Open TG-GATEs | 91 | 2,233 | 37 | 2,287 | 275 | 2,049 | |
| DrugMatrix DB | 38 | 661 | 27 | 672 | 179 | 520 | |
| Gene Expression Omnibus | 20 | 52 | 19 | 53 | 30 | 42 | |
| N/A | N/A | N/A | N/A | 2 | 15 | ||
| 0 | 269 | 0 | 269 | 0 | 269 | ||
| N/A | N/A | 3 | 32 | N/A | N/A | ||
Summary of feature sets used in this study.
| Name | Content | Reference |
|---|---|---|
| 1,331 genes | ||
| 199 genes | ||
| 978 genes | ||
| 1,312 genes belonging to 89 gene co-expression modules for chemically induced liver injury | ||
| 154 genes common to | ||
| 200 genes arbitrarily selected from the genome | ||
| 600 genes arbitrarily selected from the genome | ||
| 1,200 genes arbitrarily selected from the genome | ||
| 1,329 pathways in C2 collection | ||
| 50 pathways in Hallmark collection | ||
| 89 gene co-expression modules for chemically induced liver injuries | ||
| 1,220 | ||
FIGURE 1Diagram of single-task DNN and loss function during training. (A) Diagram of single-task DNN model. (B–D) Change of loss function during training single-task DNN model for Biliary hyperplasia, Fibrosis, and Necrosis, respectively.
FIGURE 2Performance of RF, SVM, single-task DNN, and multi-task DNN models based on cross validation studies. (A–C) results for biliary hyperplasia, fibrosis and necrosis, respectively.
Matthews correlation coefficients (MCCs) of single-task DNN models on cross validation data.
| Feature set | Size | Biliary hyperplasia | Fibrosis | Necrosis | |||
|---|---|---|---|---|---|---|---|
| 154 | 0.85 | 0.09 | 0.77 | 0.15 | 0.53 | 0.07 | |
| 199 | 0.82 | 0.11 | 0.73 | 0.12 | 0.51 | 0.06 | |
| 200 | 0.83 | 0.09 | 0.66 | 0.12 | 0.51 | 0.07 | |
| 600 | 0.87 | 0.08 | 0.75 | 0.18 | 0.55 | 0.08 | |
| 978 | 0.87 | 0.06 | 0.72 | 0.18 | 0.10 | ||
| 1,200 | 0.85 | 0.05 | 0.77 | 0.13 | 0.55 | 0.09 | |
| 1,312 | 0.07 | 0.13 | 0.55 | 0.09 | |||
| 1,331 | 0.85 | 0.08 | 0.77 | 0.17 | 0.54 | 0.09 | |
| 50 | 0.75 | 0.13 | 0.67 | 0.14 | 0.49 | 0.09 | |
| 89 | 0.81 | 0.09 | 0.65 | 0.17 | 0.48 | 0.10 | |
| 1,220 | 0.83 | 0.09 | 0.67 | 0.21 | 0.53 | 0.08 | |
| 1,329 | 0.82 | 0.08 | 0.68 | 0.16 | 0.51 | 0.07 | |
MCCs of SVM models on cross validation data.
| Feature set | Size | Biliary hyperplasia | Fibrosis | Necrosis | |||
|---|---|---|---|---|---|---|---|
| 154 | 0.87 | 0.09 | 0.66 | 0.14 | 0.48 | 0.08 | |
| 199 | 0.80 | 0.10 | 0.64 | 0.18 | 0.48 | 0.07 | |
| 200 | 0.79 | 0.07 | 0.70 | 0.12 | 0.55 | 0.09 | |
| 600 | 0.88 | 0.07 | 0.73 | 0.19 | 0.52 | 0.09 | |
| 978 | 0.85 | 0.10 | 0.71 | 0.27 | 0.54 | 0.09 | |
| 1,200 | 0.86 | 0.07 | 0.13 | 0.07 | |||
| 1,312 | 0.07 | 0.75 | 0.18 | 0.51 | 0.07 | ||
| 1,331 | 0.84 | 0.06 | 0.73 | 0.15 | 0.49 | 0.08 | |
| 50 | 0.79 | 0.13 | 0.71 | 0.17 | 0.44 | 0.10 | |
| 89 | 0.83 | 0.10 | 0.71 | 0.18 | 0.46 | 0.05 | |
| 1,220 | 0.87 | 0.09 | 0.73 | 0.17 | 0.47 | 0.06 | |
| 1,329 | 0.84 | 0.06 | 0.75 | 0.15 | 0.46 | 0.06 | |
Performance of single-task DNN and SVM models using Ippolito et al. data.
| Biliary hyperplasia | Fibrosis | Necrosis | ||||
|---|---|---|---|---|---|---|
| Best feature set | MCC | Best feature set | MCC | Best feature set | MCC | |
| Single-task DNN | 0.76 | 0.90 | 0.49 | |||
| SVM | 0.67 | 0.79 | 0.36 | |||
Metrics for three gene-level feature sets on combined cross validation data.
| Biliary hyperplasia | Fibrosis | Necrosis | MCC | TP | TN | FN | FP | Sen | Spc | PPV | NPV | BAc | F1 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 0 | 0 | 0.79 | 31 | 2,767 | 10 | 6 | 0.76 | 1.00 | 0.84 | 1.00 | 0.88 | 0.79 |
| 0 | 1 | 0 | 0.00 | 0 | 2,806 | 5 | 3 | 0.00 | 1.00 | 0.00 | 1.00 | 0.50 | 0.00 |
| 0 | 0 | 1 | 0.50 | 154 | 2,390 | 204 | 66 | 0.43 | 0.97 | 0.70 | 0.92 | 0.70 | 0.53 |
| 1 | 1 | 0 | 0.40 | 5 | 2,793 | 12 | 4 | 0.29 | 1.00 | 0.56 | 1.00 | 0.65 | 0.38 |
| 1 | 0 | 1 | 0.61 | 21 | 2,767 | 12 | 14 | 0.64 | 0.99 | 0.60 | 1.00 | 0.82 | 0.62 |
| 0 | 1 | 1 | 0.68 | 5 | 2,804 | 4 | 1 | 0.56 | 1.00 | 0.83 | 1.00 | 0.78 | 0.67 |
| 1 | 1 | 1 | 0.63 | 13 | 2,785 | 3 | 13 | 0.81 | 1.00 | 0.50 | 1.00 | 0.90 | 0.62 |
| 1 | 0 | 0 | 0.90 | 34 | 2,772 | 7 | 1 | 0.83 | 1.00 | 0.97 | 1.00 | 0.91 | 0.89 |
| 0 | 1 | 0 | 0.00 | 0 | 2,805 | 5 | 4 | 0.00 | 1.00 | 0.00 | 1.00 | 0.50 | 0.00 |
| 0 | 0 | 1 | 0.50 | 156 | 2,385 | 202 | 71 | 0.44 | 0.97 | 0.69 | 0.92 | 0.70 | 0.53 |
| 1 | 1 | 0 | 0.39 | 6 | 2,789 | 11 | 8 | 0.35 | 1.00 | 0.43 | 1.00 | 0.68 | 0.39 |
| 1 | 0 | 1 | 0.59 | 20 | 2,767 | 13 | 14 | 0.61 | 0.99 | 0.59 | 1.00 | 0.80 | 0.60 |
| 0 | 1 | 1 | 0.67 | 6 | 2,802 | 3 | 3 | 0.67 | 1.00 | 0.67 | 1.00 | 0.83 | 0.67 |
| 1 | 1 | 1 | 0.57 | 10 | 2,789 | 6 | 9 | 0.63 | 1.00 | 0.53 | 1.00 | 0.81 | 0.57 |
FIGURE 3(A) MCCs for predicting presence of one endpoint and absence of two endpoints for all feature sets. (B) MCCs for predicting presence of two endpoints and absence of one endpoint for all feature sets.
FIGURE 4Performance of the single-task DNN model on four external testing sets. Contingency tables of prediction for data from (A) Stallings et al. and (B) Brown et al. (C) and (D) Correlation between measures of experimentally observed injury and model-predicted injury for data from (C) Sutherland et al., and (D) Slopianka et al. Each data point represents an exposure condition (a combination of treatment duration and dose). The experimental level of injury is given by the mean injury score based on histopathology evaluation, while the predicted level of injury is given by the number of positive samples.
Performance of single-task DNN in predicting Eun et al. data.
| Biliary hyperplasia | Fibrosis | Necrosis | |
|---|---|---|---|
| Total | 269 | 269 | 269 |
| Correct | 265 | 269 | 269 |
| Missed | 4 | 0 | 0 |
| Accuracy (%) | 98.5 | 100.0 | 100.0 |
Performance of Y-shuffled DNN models.
| Biliary hyperplasia | Fibrosis | Necrosis | ||||
|---|---|---|---|---|---|---|
| Cross validation | Ippolito et al. | Cross validation | Ippolito et al. | Cross validation | Ippolito et al. | |
| F1 | 0.03 | 0.01 | 0.02 | 0.00 | 0.14 | 0.16 |
| Standard deviation | 0.03 | 0.03 | 0.04 | 0.00 | 0.03 | 0.13 |
Mean F1 scores of arbitrary gene sets in predicting external testing sets.
| Feature set | Brown et al. | Stallings et al. |
|---|---|---|
| Fibrosis | Necrosis | |
| 0.07 | 0.00 | |
| 0.05 | 0.00 | |
| 0.29 | 0.07 | |