| Literature DB >> 34208365 |
Jeremiah Hadwen1,2, Sarah Schock1,2, Faraz Farooq1,2, Alex MacKenzie1,2, Julio Plaza-Diaz2,3,4.
Abstract
The development of DNA microarray and RNA-sequencing technology has led to an explosion in the generation of transcriptomic differential expression data under a wide range of biologic systems including those recapitulating the monogenic muscular dystrophies. Data generation has increased exponentially due in large part to new platforms, improved cost-effectiveness, and processing speed. However, reproducibility and thus reliability of data remain a central issue, particularly when resource constraints limit experiments to single replicates. This was observed firsthand in a recent rare disease drug repurposing project involving RNA-seq-based transcriptomic profiling of primary cerebrocortical cultures incubated with clinic-ready blood-brain penetrant drugs. Given the low validation rates obtained for single differential expression genes, alternative approaches to identify with greater confidence genes that were truly differentially expressed in our dataset were explored. Here we outline a method for differential expression data analysis in the context of drug repurposing for rare diseases that incorporates the statistical rigour of the multigene analysis to bring greater predictive power in assessing individual gene modulation. Ingenuity Pathway Analysis upstream regulator analysis was applied to the differentially expressed genes from the Care4Rare Neuron Drug Screen transcriptomic database to identify three distinct signaling networks each perturbed by a different drug and involving a central upstream modulating protein: levothyroxine (DIO3), hydroxyurea (FOXM1), dexamethasone (PPARD). Differential expression of upstream regulator network related genes was next assessed in in vitro and in vivo systems by qPCR, revealing 5× and 10× increases in validation rates, respectively, when compared with our previous experience with individual genes in the dataset not associated with a network. The Ingenuity Pathway Analysis based gene prioritization may increase the predictive value of drug-gene interactions, especially in the context of assessing single-gene modulation in single-replicate experiments.Entities:
Keywords: differential expression analysis; drug repurposing; rare disease; transcriptomics
Mesh:
Substances:
Year: 2021 PMID: 34208365 PMCID: PMC8231191 DOI: 10.3390/ijms22126295
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Summary of six upstream regulators identified in the Neuron Screen data.
| Drug Name | Upstream Regulator | Expression Fold Change | Predicted Activation State | Activation Z-Score | Number of Genes | |
|---|---|---|---|---|---|---|
| Levothyroxine | DIO3 | 4.385 | Inhibited | −2.975 | 1.79 × 10−7 | 8 |
| Hydroxyurea | FOXM1 | −3.579 | Inhibited | −3.245 | 1.63 × 10−10 | 8 |
| Dexamethasone | PPARD | 2.522 | Activated | 3.126 | 4.58 × 10−2 | 10 |
| Dexamethasone | STAT4 | NA | Activated | 2.933 | 7.36 × 10−4 | 12 |
| Vigabatrin | MKNK1 | 1.187 | Inhibited | −3 | 1.40 × 10−4 | 5 |
| Pregabalin | PGR | 1.013 | Activated | 3.376 | 7.77 × 10−9 | 13 |
The expression fold change indicated is derived from the original study reporting the RNA-seq results from the neuronal screen. The level of mRNA (i.e., RNA-seq read numbers) for a given gene in the presence of a drug is compared against the average mRNA level of that gene for all conditions while the upstream regulator is the central molecule in the network. The activation z-score serves as a statistical measure as well as a directional tool (negative/downregulated, positive/upregulated) while the p-value of overlap serves as an additional measure of statistical certainty pertaining to the interrelated nature of the molecules forming each network. NA, data not available as gene expression was not included in the Neuron Screen data [25].
Figure 1Robust URA network induced by levothyroxine treatment of mouse cerebrocortical cultures. Dashed orange lines signify activation (loss of inhibition) and red symbols signify upregulated genes with the Neuron Screen Z-score and adjusted p-value (p-adj) (<0.05) appearing directly below each symbol. The upstream regulator DIO3 is blue, indicating downregulation based on the C4R drug screen data.
Figure 2Validation of downregulated gene expression prioritized by IPA analysis. (A) The URA network shows downregulation (green symbols) of the putative upstream regulator (FOXM1) and the downstream molecules (with associated Z-score and p-value given under each gene, for p values, E-n symbolizes E-n)) (Care4Rare Neuron Screen). Blue arrows indicate inhibition of the physiological activating relationship between FOXM1 and related genes. Only genes for which p-adj < 0.05 are included. (B–E) U87 glioblastoma cells were treated with 250 μM hydroxyurea for 0, 4, and 8 h. qRT-PCR (n = 3) was employed to determine target gene expression with geometric normalization against GAPDH and HPRT1. (B) Fork-head box M1 (FOXM1). (C) Polo-like kinase 1 (PLK1). (D) Cyclin B1 (CCNB1). (E) Centromere protein E (CENPE). Statistical significance was measured by one-way ANOVA (nonparametric) with Tukey post hoc analysis (** p < 0.01, *** p < 0.001, ns = nonsignificant).
Figure 3Validation efficiency for upregulated gene expression prioritized by IPA analysis. (A) The PPARD upstream regulator identified by URA analysis is upregulated in dexamethasone-treated mouse cerebrocortical cultures (Care4Rare Neuron Screen). Orange arrows signify an activating relationship with downstream genes and red symbols signify upregulated genes with the Neuron Screen Z-score and p-adj appearing directly below each symbol. Only genes for which p-adj < 0.05 are included. (B–E) Adult male C57BL6 mice were treated p.o. with vehicle or DEX (1 mg/kg) for 5 days, and qRT-PCR was employed to determine gene expression of 7 of the 10 targets of PPARD (n = 3). The four genes with the most robust upregulation are shown. (B) Bcl-2-like protein 1 (Bcldl1). (C) Integrin-linked kinase (Ilk). (D) Major facilitator superfamily domain containing 2A (Mfsd2a). (E) Pyruvate dehydrogenase kinase 4 (Pdk4). For all hits, statistical significance was measured by Student’s paired two-tailed t-test (** p < 0.01).