| Literature DB >> 34036128 |
Eunnara Cho1,2, Andrew Williams1, Carole L Yauk1,3.
Abstract
Transcriptomic biomarkers facilitate mode of action analysis of toxicants by detecting specific patterns of gene expression perturbations. We identified an 81-gene transcriptomic biomarker of histone deacetylase inhibitors (HDACi) using whole transcriptome data sets of TK6 human lymphoblastoid cells generated by Templated Oligo-Sequencing (TempO-Seq) after 4 h of exposure to 20 reference compounds (10 HDACi and 10 non-HDACi) [1]. The biomarker, named TGx-HDACi, was derived using the nearest shrunken centroid (NSC) method and can distinguish HDACi from non-HDACi compounds based on the expression pattern across the 81 genes. The classification capability of TGx-HDACi was evaluated by NSC probability analysis of 11 external validation compounds (4 HDACi and 7 non-HDACi) with a probability cut-off of 90%. Thus far, TGx-HDACi has demonstrated 100% accuracy in classifying the reference and validation compounds as HDACi or non-HDACi. Of the 81 TGx-HDACi genes, 19 genes are part of the S1500+ gene panel containing 2753 genes, developed for toxicological assessments [2]. Herein, we assessed the classification performance of the biomarker with this reduced gene set to determine if TGx-HDACi can be applied to analyze S1500+ gene expression profiles. The 20 reference compounds and 11 validation compounds were correctly classified as HDACi or non-HDACi by the NSC probability analysis, principal component analysis, and hierarchical clustering based on the expression of the 19 genes, demonstrating 100% accuracy. CrownEntities:
Keywords: Epigenetics; Genomics; Histone deacetylase inhibition; Predictive toxicology; TempO-Seq; Toxicogenomics; Transcriptomic biomarker
Year: 2021 PMID: 34036128 PMCID: PMC8138725 DOI: 10.1016/j.dib.2021.107097
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
List of reference and validation compounds with chemical class and abbreviations.
| Class | Chemical | Abbreviation | |
|---|---|---|---|
| Reference Compound | HDACi | Apicidin | Api |
| Mocetinostat | Mo | ||
| Oxamflatin | Oxa | ||
| Panobinostat | Pan | ||
| Scriptaid | Scr | ||
| Sodium butyrate | SodButy | ||
| Suberohydroxamic acid | SBHA | ||
| Tacedinaline | Tac | ||
| Trichostatin A | TSA | ||
| Vorinostat | Vor | ||
| Non-HDACi | 2-Deoxyglucose | 2-DG | |
| 5-Fluorouracil | 5-FU | ||
| Antimycin A | Ant | ||
| Bleomycin | Ble | ||
| Cadmium chloride | CdCl2 | ||
| Cisplatin | Cisp | ||
| Camptothecin | CPT | ||
| Hydroxyurea | HU | ||
| Tunicamycin | Tun | ||
| Vinblastine | Vin | ||
| External Validation Compound | HDACi | Entinostat | Ent |
| HC Toxin | HCT | ||
| Pracinostat | Pra | ||
| Valproic acid | VPA | ||
| Non-HDACi | Arabinofuranosyl cytidine | AraC | |
| Docetaxel | Doc | ||
| Methotrexate | MTX | ||
| Thapsigargin | Thap | ||
| 3-Deazaneplanocin A | DZNep | ||
| Garcinol | Gar | ||
| GSK-J4 | GSK-J4 | ||
The centroids and standard deviations of 19 genes from the TGx-HDACi biomarker.
| Gene | HDACi Class Centroid | Non-HDACi Class Centroid | Standard Deviation |
|---|---|---|---|
| 3.564 | 0.135 | 1.171 | |
| 2.287 | −2.029 | 2.607 | |
| 2.279 | 0.344 | 1.122 | |
| 1.769 | −0.031 | 0.860 | |
| 1.506 | 0.044 | 0.896 | |
| 1.274 | −0.220 | 0.920 | |
| 1.268 | −0.645 | 0.898 | |
| 1.257 | −0.414 | 0.906 | |
| 1.243 | −0.358 | 1.017 | |
| 1.237 | −0.465 | 1.031 | |
| −1.317 | 0.353 | 1.007 | |
| −1.497 | 0.226 | 0.985 | |
| −1.542 | 0.043 | 0.954 | |
| −1.615 | 0.346 | 1.227 | |
| −1.709 | 0.078 | 0.978 | |
| −1.758 | −0.086 | 0.930 | |
| −1.794 | 0.191 | 1.073 | |
| −1.867 | −0.003 | 1.102 | |
| −2.712 | −0.279 | 1.273 | |
Fig. 1Principal component analysis of the 19-gene expression profiles in TK6 cells exposed to the 20 TGx-HDACi reference compounds and 11 external validation compounds for 4 h. The red dots represent the 10 HDACi reference compounds and the blue dots represent the 10 non-HDACi reference compounds. Each dot represents the average of three replicates. The vertical red line at principal component 1 (PC1) separates HDACi from non-HDACi for classification purposes. PC1 describes the contrast between the HDACi and the non-HDACi classes. The four HDACi external validation compounds are in black and the seven non-HDACi validation compounds are in green; all validation compounds clustered within their respective classes (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.).
Fig. 2Hierarchical clustering of the 19-gene expression profiles in TK6 cells exposed to the 20 TGx-HDACi reference compounds and 11 external validation compounds for 4 h. Red and blue indicate HDACi reference compounds and non-HDACi reference compounds, respectively. The four HDACi external validation compounds are in black and the seven non-HDACi validation compounds are in green. The two chemical classes formed two separate branches and all 11 validation compounds branched within their respective classes (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.).
Fig. 3Heatmaps representing the expression level of the 19 genes of TGx-HDACi in TK6 cells exposed to the 20 reference compounds and 11 external validation compounds for 4 h. The chemicals are listed across the x-axis. The 19 genes are on the y-axis. Red indicates up-regulation and green indicates down-regulation in gene expression. The bars above the heatmaps display the classification calls made by the biomarker in the NSC probability analysis (probability cut-off for HDACi class membership > 90%) and the class to which the chemical belongs. Red indicates HDACi and blue indicates non-HDACi (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.).
| Subject | Omics: Toxicogenomics (TGx) |
| Specific subject area | TGx is the study of genomic responses to toxicity. Transcriptomic biomarkers are patterns of gene expression changes that predict specific toxicities. |
| Type of data | Transcriptomic data |
| How data were acquired | Whole transcriptome Templated Oligo-Sequencing (TempO-Seq) (BioSpyder, Carlsbad, CA, USA) libraries were sequenced on an Illumina NextSeq 500 sequencer using 75-cycle flow cells. |
| Data format | All raw data (i.e., FASTQ files) and normalized transcriptomic data (TXT file containing transcript read counts normalized to counts per million) have been deposited to the National Center for Biotechnology Information (NCBI)’s Gene Expression Omnibus (GEO) database, under the accession number GSE164478. The overall centroids of the 81 TGx-HDACi genes derived from the TempO-Seq whole transcriptome profiles of the 20 reference compounds are presented in Supplementary Table 1. The overall centroids of 19 genes used in the analysis herein are presented within this article in Table 1. Analyses of the data are presented as figures. |
| Parameters for data collection | TK6 human lymphoblastoid cells were exposed to 31 compounds (HDACi and non-HDACi) and the vehicle solvents for 4 hr. The concentration of each compound was selected based on cell viability (>60%) at 24 hr and fold changes in three select genes measured in a preliminary qPCR experiment. There were three replicates and solvent matched controls for each chemical. |
| Description of data collection | Using total RNA extracted from TK6 cells, whole transcriptome TempO-Seq libraries were constructed. The libraries were sequenced on an Illumina NextSeq 500 sequencer. The resulting BCL files were converted to FASTQ files, which were processed to align the reads and generate counts for each gene. The read counts of all samples were normalized to counts per million (CPM) and the CPM values of each chemical treatment was then normalized to its solvent control. The nearest shrunken centroid method was applied to the resulting gene expression profiles to derive an 81-gene biomarker of HDACi. |
| Data accessibility | Data are partly hosted with the article and the complete TempO-Seq dataset is available in a public repository. |
| Related research article | E. Cho, A. Rowan-Carroll, A. Williams, J.C. Corton, H.H. Li, A. Fornace Jr., C. Hobb, C.L. Yauk, Development and Validation of the TGx-HDACi Transcriptomic Biomarker to Detect Histone Deacetylase Inhibitors in Human TK6 Cells, Arch Toxicol. (2021) |