| Literature DB >> 24586062 |
Sten Ilmjärv1, Christian Ansgar Hundahl, Riin Reimets, Margus Niitsoo, Raivo Kolde, Jaak Vilo, Eero Vasar, Hendrik Luuk.
Abstract
Regardless of the advent of high-throughput sequencing, microarrays remain central in current biomedical research. Conventional microarray analysis pipelines apply data reduction before the estimation of differential expression, which is likely to render the estimates susceptible to noise from signal summarization and reduce statistical power. We present a probe-level framework, which capitalizes on the high number of concurrent measurements to provide more robust differential expression estimates. The framework naturally extends to various experimental designs and target categories (e.g. transcripts, genes, genomic regions) as well as small sample sizes. Benchmarking in relation to popular microarray and RNA-sequencing data-analysis pipelines indicated high and stable performance on the Microarray Quality Control dataset and in a cell-culture model of hypoxia. Experimental-data-exhibiting long-range epigenetic silencing of gene expression was used to demonstrate the efficacy of detecting differential expression of genomic regions, a level of analysis not embraced by conventional workflows. Finally, we designed and conducted an experiment to identify hypothermia-responsive genes in terms of monotonic time-response. As a novel insight, hypothermia-dependent up-regulation of multiple genes of two major antioxidant pathways was identified and verified by quantitative real-time PCR.Entities:
Mesh:
Year: 2014 PMID: 24586062 PMCID: PMC4005682 DOI: 10.1093/nar/gku158
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.(A) Power analysis of Fisher’s exact test. For each combination of input parameters, the average of 1000 simulations is plotted. (B) Distribution of gene-specific probe counts. (C) Performance analysis based on DE estimates from simulated gene-expression data. For each combination of analysis parameters the average of 100 simulations is plotted.
Microarray and Taqman® datasets representing MAQC reference samples in this article
| Platform | Accession | Reference |
|---|---|---|
| Human Genome U133 Plus 2.0 | GEO:GSE9819 | ( |
| Human Gene 1.0 ST | GEO:GSE9819 | ( |
| Human Exon 1.0 ST | GEO:GSE13069 | ( |
| Taqman® | GEO:GSE5350 | ( |
Number of spots per sample for RNA-seq datasets representing MAQC reference samples in this article
| Dataset | HBR | UHR |
|---|---|---|
| GEO:GSE12946 | 17246957 | 8069964 |
| GEO:GSE24283 | 53238798 | 59461348 |
Figure 2.Performance analysis of DE analysis pipelines on the MAQC data. Performance of each pipeline is presented as a radial plot, which includes results from three microarrays (A) or two distinct RNA-seq datasets (B) based on six complementary performance indicators. Abbreviations: AUC, MCC, TPR, FPR, TNR and FNR.
Figure 3.Performance analysis of DE-analysis pipelines on data from mouse embryonic fibroblasts exposed to 1% O2 for 24 h. The plot depicts the combined detection rate of GO categories ‘cellular response to hypoxia’ (GO:0071456) and ‘glycolysis’ (GO:0006096) as indicators of hypoxia response. The data is plotted as mean ± standard error of all possible comparisons between subsets of size N of the hypoxic and normoxic groups (original N = 4).
Enrichment of mouse orthologs of HIF-1 targets among significantly up-regulated genes in mouse embryonic fibroblasts exposed to 1% O2 for 24 h
| Normalization | DE | ||||||
|---|---|---|---|---|---|---|---|
| Mean | SEM | Mean | SEM | Mean | SEM | ||
| Relative ranking | DEMI | 3.0 | NA | 4.3 | 2.4 | 6.0 | 5.4 |
| DFW | Limma | 1.4 | NA | 3.5 | 2.2 | 0.617 | 0.081 |
| DFW | RankProd | 1.4 | NA | 2.2 | 1.5 | 8.2 | 4.8 |
| FARMS | Limma | 9.5 | NA | 2.5 | 1.9 | 0.457 | 0.082 |
| FARMS | RankProd | 6.8 | NA | 3.1 | 2.1 | 0.037 | 0.011 |
| PLIER | Limma | 9.4 | NA | 0.9 | 0.1 | 1 | 0 |
| PLIER | RankProd | 3.5 | NA | 1 | 0 | 1 | 0 |
| RMA | Limma | 6.0 | NA | 1.7 | 1.5 | 0.481 | 0.083 |
| RMA | RankProd | 1.4 | NA | 4.3 | 2.1 | 0.001 | 4.6 |
Differential gene expression was estimated by nine pipelines including various normalization and DE-analysis methods. The data is presented as mean and standard error of hypergeometric P-values from all possible comparisons between subsets of size N of the hypoxic and normoxic groups (original N = 4). N, sample size; SEM, standard error of mean; NA, not available.
Enrichment of regions exhibiting differential epigenetic modification among differentially expressed genomic regions between LNCaP and PrEC cell lines
| Higher modification level in LNCaP | Lower modification level in LNCaP | Modification | |
|---|---|---|---|
| Higher expression level in LNCaP | 0.998 | 0.266 | H3K27me3 |
| 7.990 | 1 | H3K9ac | |
| Lower expression level in LNCaP | 0.102 | 1 | H3K27me3 |
| 1 | 5.400 | H3K9ac |
Significant P-values (hypergeometric probability distribution) indicate that down-regulation of H3K9ac is associated with silenced genomic regions while up-regulation of H3K9ac is prevalent among up-regulated regions in LNCaP cells when compared to PrEC. No significant association was found between DE and H3K27me3.
Figure 4.Large-scale analysis of temporal dynamics of transcription in mouse embryonic fibroblasts exposed to mild hypothermia. (A and B) Temporal profiles of probe expression levels of selected genes during mild hypothermia (A) and normothermia (B). The solid blue line indicates a linear fit to the data points and the gray shadowing represents standard error of the fit.
Genes responding to increasing durations of hypothermia with significantly monotonic increase in expression and relating to the antioxidant system
| Gene ID | Symbol | FDR | System | |
|---|---|---|---|---|
| ENSMUSG00000003849 | Nqo1 | 9.04 | 4.72 | Quinone detoxification |
| ENSMUSG00000032802 | Srxn1 | 1.23 | 2.14 | Glutathione |
| ENSMUSG00000027610 | Gss | 1.08 | 4.74 | Glutathione |
| ENSMUSG00000020250 | Txnrd1 | 4.29 | 9.83 | Thioredoxin |
| ENSMUSG00000032350 | Gclc | 4.29 | 9.83 | Glutathione |
| ENSMUSG00000000811 | Txnrd3 | 9.85 | 1.19 | Thioredoxin |
Figure 5.Temporal expression profiles of selected genes during mild hypothermia (32°C) and normothermia (37°C) as reported by quantitative real-time PCR. Expression level of Ywhaz was used as reference. The mean of three replicates and standard error has been plotted.