| Literature DB >> 35754715 |
Junko Yamane1, Takumi Wada1, Hironori Otsuki2, Koji Inomata2, Mutsumi Suzuki2, Tomoka Hisaki3, Shuichi Sekine3, Hirokazu Kouzuki3, Kenta Kobayashi1, Hideko Sone4, Jun K Yamashita1, Mitsujiro Osawa1, Megumu K Saito1, Wataru Fujibuchi1.
Abstract
An alternative model that reliably predicts human-specific toxicity is necessary because the translatability of effects on animal models for human disease is limited to context. Previously, we developed a method that accurately predicts developmental toxicity based on the gene networks of undifferentiated human embryonic stem (ES) cells. Here, we advanced this method to predict adult toxicities of 24 chemicals in six categories (neurotoxins, cardiotoxins, hepatotoxins, two types of nephrotoxins, and non-genotoxic carcinogens) and achieved high predictability (AUC = 0.90-1.00) in all categories. Moreover, we screened for an induced pluripotent stem (iPS) cell line to predict the toxicities based on the gene networks of iPS cells using transfer learning of the gene networks of ES cells, and predicted toxicities in four categories (neurotoxins, hepatotoxins, glomerular nephrotoxins, and non-genotoxic carcinogens) with high performance (AUC = 0.82-0.99). This method holds promise for tailor-made safety evaluations using personalized iPS cells.Entities:
Keywords: Bioinformatics; Biological sciences; Cell biology; Computational toxicology; Toxicology
Year: 2022 PMID: 35754715 PMCID: PMC9218511 DOI: 10.1016/j.isci.2022.104538
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Figure 1Construction of a gene expression database for 24 chemicals
(A) Schema of the chemical assay. hESC, human ES cells; hiPSC, human iPS cells. (B) IC50 for 24 chemicals. Data are represented as estimated value with 95% confidence intervals. (C) Transcription factor genes differentially expressed following exposure to 24 chemicals at ½ dose. (D) PCA of 24 chemicals in six toxicity categories at two time points.
List of 24 chemicals
| Chemical name | CAS RN | M.W. | NT | HT | CT | GT | TT | NGC |
|---|---|---|---|---|---|---|---|---|
| 110-13-4 | 114.14 | NA | NA | NA | NA | NA | ||
| 79-06-1 | 71.08 | NA | NA | NA | ||||
| 1951-25-3 | 645.3 | NA | NA | |||||
| 549-18-8 | 313.9 | NA | NA | NA | ||||
| 134523-00-5 | 558.6 | NA | NA | NA | NA | |||
| 319460-85-0 | 386.5 | NA | LiverTox: Likelihood score: E | NA | NA | |||
| 65002-17-7 | 223.3 | NA | NA | NA | NA | NA | ||
| 25013-16-5 | 180.24 | NA | NA | NA | NA | NA | ||
| 132-22-9 | 274.79 | NA | DILIrank: 0 (No-DILI-Concern) | NA | NA | NA | NA | |
| 50-53-3 | 318.9 | DILIrank: 2 (Less-DILI-Concern) | NA | NA | NA | |||
| 15663-27-1 | 300.05 | NA | NA | |||||
| 4449-51-8 | 411.6 | NA | NA | NA | NA | NA | NA | |
| 59865-13-3 | 1202.6 | NA | ||||||
| 20830-75-5 | 780.9 | DILIrank: 0 (No-DILI-Concern) | NA | NA | ||||
| 23214-92-8 | 543.5 | NA | ||||||
| 1403-66-3 | 477.6 | LiverTox: Likelihood score: E | NA | NA | ||||
| 15687-27-1 | 206.28 | NA | NA | |||||
| 84625-61-6 | 705.6 | NA | NA | NA | ||||
| 434-13-9 | 376.6 | NA | NA | NA | NA | |||
| 135-23-9 | 297.8 | NA | NA | NA | NA | |||
| 557795-19-4 | 398.5 | NA | NA | NA | ||||
| 62-55-5 | 75.14 | NA | NA | NA | NA | |||
| 99-66-1 | 144.21 | NA | NA | |||||
| 152-11-4 | 491.1 | NA | NA |
Bold characters indicate positive toxicity.
NT, Neurotoxin; HT, Hepatotoxin; CT, Cardiotoxin; GT, Glomerular toxin (Nephrotoxin); TT, Tubular toxin (Nephrotoxin); NGC, Non-genotoxic carcinogen.
Chemical names are given by referring to PubChem (as of April 20, 2021).
Figure 2Prediction of six toxicity categories using KhES-3 cells
(A) Gene network representation of GGMs from KhES-3 cells. (B) ROC curves for the prediction of chemicals in two toxicity categories (C) Pathway analysis for hepatotoxins at 24 h and high-dose (1/1, ½ doses) samples.
Summary of prediction performance for KhES-3
| NT (13) | HT (15) | CT (13) | GT (6) | TT (7) | NGC (9) | |||
|---|---|---|---|---|---|---|---|---|
| Random | 24 h | Mean Accuracy (%) | 85.8 | 83.7 | 83.3 | 87.5 | 87.0 | 87.0 |
| SD | 8.13 | 6.92 | 6.81 | 6.21 | 6.65 | 5.38 | ||
| Mean AUC | 0.86 | 0.82 | 0.85 | 0.88 | 0.86 | 0.87 | ||
| SD | 0.09 | 0.11 | 0.09 | 0.07 | 0.11 | 0.06 | ||
| 48 h | Mean Accuracy (%) | 85.8 | 83.7 | 83.3 | 87.5 | 87.0 | 87.0 | |
| SD | 8.13 | 6.92 | 6.81 | 6.21 | 6.65 | 5.38 | ||
| Mean AUC | 0.86 | 0.82 | 0.85 | 0.88 | 0.86 | 0.87 | ||
| SD | 0.09 | 0.11 | 0.09 | 0.07 | 0.11 | 0.06 | ||
| Molecular descriptors | Accuracy (%) | 83.3 | 70.8 | 83.3 | 83.3 | 83.3 | ||
| AUC | 0.83 | 0.76 | 0.90 | 0.82 | 0.86 | 0.60 | ||
| RNA-seq | 24 h | Accuracy (%) | 75.0 | 75.0 | 79.2 | 79.2 | 75.0 | 66.7 |
| AUC | 0.73 | 0.73 | 0.83 | 0.83 | 0.74 | 0.76 | ||
| 48 h | Accuracy (%) | 62.5 | 75.0 | 75.0 | 83.3 | 79.2 | 66.7 | |
| AUC | 0.71 | 0.73 | 0.78 | 0.66 | 0.61 | 0.79 | ||
| GGM network coefficient | 24 h | Accuracy (%) | 83.3 | 87.5 | 79.2 | |||
| AUC | 0.73 | 0.83 | 0.76 | |||||
| 48 h | Accuracy (%) | 87.5 | 83.3 | 83.3 | ||||
| AUC | 0.79 | 0.9 | 0.80 | |||||
p < 0.05 and ∗p < 0.01 in one-sample t-test (one-sided).
NT, Neurotoxin; HT, Hepatotoxin; CT, Cardiotoxin; GT, Glomerular toxin (Nephrotoxin); TT, Tubular toxin (Nephrotoxin); NGC, Non-genotoxic carcinogen.
SD, Sample SD; AUC, Area Under the ROC Curve; GGM, Graphical Gaussian Model.
Prediction for HPS4138 cells
| NT (13) | HT (15) | CT (13) | GT (6) | TT (7) | NGC (9) | |||
|---|---|---|---|---|---|---|---|---|
| Random | 24 h | Mean Accuracy (%) | 81.6 | 82.5 | 80.8 | 89.5 | 85.0 | 81.2 |
| SD | 3.49 | 3.82 | 4.89 | 2.95 | 4.49 | 5.65 | ||
| Mean AUC | 0.81 | 0.77 | 0.78 | 0.79 | 0.80 | 0.74 | ||
| SD | 0.06 | 0.08 | 0.07 | 0.08 | 0.10 | 0.11 | ||
| 48 h | Mean Accuracy (%) | 81.6 | 82.0 | 80.4 | 88.7 | 86.2 | 81.2 | |
| SD | 5.27 | 4.82 | 5.21 | 2.03 | 4.43 | 4.04 | ||
| Mean AUC | 0.77 | 0.81 | 0.8 | 0.84 | 0.77 | 0.77 | ||
| SD | 0.09 | 0.05 | 0.07 | 0.10 | 0.13 | 0.08 | ||
| GGM network coefficient | 24 h | Accuracy (%) | 79.2 | 87.5 | 79.2 | |||
| AUC | 0.81 | 0.71 | 0.82 | |||||
| 48 h | Accuracy (%) | 83.3 | 79.2 | 83.3 | 83.3 | |||
| AUC | 0.75 | 0.81 | 0.77 |
p < 0.05 and ∗p < 0.01 in one-sample t-test (one-sided).
NT, Neurotoxin; HT, Hepatotoxin; CT, Cardiotoxin; GT, Glomerular toxin (Nephrotoxin); TT, Tubular toxin (Nephrotoxin); NGC, Non-genotoxic carcinogen.
SD, Sample SD; AUC, Area Under the ROC Curve; GGM, Graphical Gaussian Model.
Figure 3Summary of toxicity category prediction for 24 chemicals using HPS4138 cells
Red dots indicate predicted SVM values of iPS cell data, and filled and open markers indicate true and false predictions, respectively. Black dots and bars indicate the means ± SEM of SVM values for random data. In the tables, the label columns contain prior knowledge regarding whether the chemical shows toxicity (P: positive) or not (N: negative). The SVM column contains the SVM values of the iPS cell data. The expected accuracy (Exp.Acc.) columns contain the prediction accuracy using random data. The graphs below the tables show the probability distribution of the prediction accuracy using random data. Black lines indicate the probability density estimated using the t distribution (degrees of freedom = 9). Black shaded areas represent the upper 5%. Red dashed lines indicate the prediction accuracy using iPS cell data.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Vitronectin (VTN-N) | Thermo Fisher Scientific | Cat # A14700 |
| CultureSure® Y-27632 | FUJIFILM WAKO | Cat # 036-24023 |
| Dimethyl sulfoxide, for molecular biology | Sigma-Aldrich | Cat # D8418 |
| 2,5-Hexanedione | Sigma-Aldrich | Cat # 00770-50ml |
| Acrylamide(monomer) | NACALAI TESQUE | Cat # 06114-24 |
| Amiodarone Hydrochloride | TCI (Tokyo Chemical Industry) | Cat # A2530 |
| Amitriptyline hydrochloride | Sigma-Aldrich | Cat # A8404 |
| Atorvastatin Calcium Salt Trihydrate | TCI (Tokyo Chemical Industry) | Cat # A2476 |
| Axitinib | Selleck Chemicals | Cat # S1005 |
| Bucillamine | Toronto Research Chemicals | Cat # B689375 |
| Butylated hydroxyanisole | Sigma-Aldrich | Cat # B1253-5G |
| (+)-Chlorpheniramine Maleate | FUJIFILM WAKO | Cat # 030-13271 |
| Chlorpromazine Hydrochloride | FUJIFILM WAKO | Cat # 033-10581 |
| cis-Diammineplatinum(II) dichloride crystalline | Sigma-Aldrich | Cat # P4394-250MG |
| Cyclopamine hydrate | SIGMA | Cat # C4116-1MG |
| Ciclosporin A | FUJIFILM WAKO | Cat # 031-24931 |
| Digoxin | Sigma-Aldrich | Cat # D6003-1G |
| Doxorubicin (hydrochloride) | Cayman Chemical | Cat # 15007 |
| Gentamicin sulfate | Sigma-Aldrich | Cat # G3632-1G |
| Ibuprofen | FUJIFILM WAKO | Cat # 098-02641 |
| Itraconazole | Sigma-Aldrich | Cat # I6657-100MG |
| Lithocholic acid | Sigma-Aldrich | Cat # L6250-10G |
| Methapyrilene HCl | Sigma-Aldrich | Cat # 442641 |
| Sunitinib malate | Tocris Bioscience | Cat # 510-86011 |
| Thioacetamide | Sigma-Aldrich | Cat # 172502-25G |
| Valproic acid | SIGMA | Cat # PHR1061-1G |
| Verapamil Hydrochloride | Sigma-Aldrich | Cat # V4629-1G |
| RNasin® Plus RNase Inhibitor, 2500u | Promega | Cat # N2611 |
| Dimethoate Standard | FUJIFILM WAKO | Cat # 041-15843 |
| Arsenic (III) oxide | Sigma-Aldrich | Cat # 311383-125G |
| Quinidine hydrochloride monohydrate technical grade | Sigma-Aldrich | Cat # Q0750-5G |
| CellTiter-Glo® Luminescent Cell Viability Assay | Promega | Cat # G7571 |
| RNeasy Mini Kit | QIAGEN | Cat # 74106 |
| RNase-Free DNase Set (50) | QIAGEN | Cat # 79254 |
| TruSeq Stranded mRNA Library Prep (48 Samples) | Illumina | Cat # 2020594 |
| TruSeq RNA Single Indexes Set A (12 Indexes, 48 Samples) | Illumina | Cat # 20020492 |
| TruSeq RNA Single Indexes Set B (12 Indexes, 48 Samples) | illumina | Cat # 20020493 |
| HiSeq 3000/4000 SR Cluster Kit | illumina | Cat # GD-410-1001 |
| HiSeq 3000/4000 SBS Kit (50 Cycles) | illumina | Cat # FC-410-1001 |
| PrimeScript™ RT Reagent Kit (Perfect Real Time) | TAKARA BIO | Cat # RR037B |
| KAPA SYBR Fast qPCR Kit (Universal qPCR kit) | KAPA BIOSYSTEMS | Cat # KK4602 |
| RNA-seq | This study | GEO: |
| KhES-3 cell line (XY) | ( | NA |
| HPS4138 cell line (XY) | RIKEN BRC | NA |
| HPS4234 cell line (XY) | RIKEN BRC | NA |
| HPS4046 cell line (XY) | RIKEN BRC | NA |
| Primers for qRT-PCR | This study | |
| alvaDesc | Affinity Science | |
| CORINA Classic | mn-am.com | |
| The source code of R analysis script and the data supporting the findings of this study | This study | |