| Literature DB >> 22723856 |
Stephan Artmann1, Klaus Jung, Annalen Bleckmann, Tim Beissbarth.
Abstract
Expression levels of mRNAs are among other factors regulated by microRNAs. A particular microRNA can bind specifically to several target mRNAs and lead to their degradation. Expression levels of both, mRNAs and microRNAs, can be obtained by microarray experiments. In order to increase the power of detecting microRNAs that are differentially expressed between two different groups of samples, we incorporate expression levels of their related target gene sets. Group effects are determined individually for each microRNA, and by enrichment tests and global tests for target gene sets. The resulting lists of p-values from individual and set-wise testing are combined by means of meta analysis. We propose a new approach to connect microRNA-wise and gene set-wise information by means of p-value combination as often used in meta-analysis. In this context, we evaluate the usefulness of different approaches of gene set tests. In a simulation study we reveal that our combination approach is more powerful than microRNA-wise testing alone. Furthermore, we show that combining microRNA-wise results with 'competitive' gene set tests maintains a pre-specified false discovery rate. In contrast, a combination with 'self-contained' gene set tests can harm the false discovery rate, particularly when gene sets are not disjunct.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22723856 PMCID: PMC3378551 DOI: 10.1371/journal.pone.0038365
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Flow chart of combining expression levels of miRNAs and their related target mRNAs.
The links between microRNAs and their target mRNAs are taken from public databases. MicroRNAs and target sets are first analysed separately and obtained p-values are combined as final result.
Simulated for microRNA-selection based on target set testing.
| Gene set test | Covariance matrix | Non-overlap. target sets | Overlap. target sets |
| GlobalTest | Autoregressive |
|
|
| Block |
| 0.035– | |
| Unstructured |
|
| |
| GlobalAncova | Autoregressive | 0.000–0.000– | 0.04–0.046– |
| Block | 0.002–0.005– | 0.028–0.031–0.032 | |
| Unstructured | 0.000–0.000–0.003 | 0.000–0.036–0.039 | |
| RepeatedHighDim | Autoregressive | 0.019– |
|
| Block | 0.000–0.002–0.011 | 0.008–0.040–0.049 | |
| Unstructured | 0.030–0.034–0.037 | 0.031– | |
| ROAST | Autoregressive |
|
|
| Block |
|
| |
| Unstructured | 0.000–0.002–0.041 | 0.043–0.046–0.049 | |
| KS | Autoregressive | 0.002–0.005–0.036 | 0.029–0.031–0.036 |
| Block |
|
| |
| Unstructured | 0.029–0.031–0.033 | 0.030– | |
| Wilcoxon | Autoregressive | 0.001–0.002–0.009 | 0.010–0.030–0.039 |
| Block | 0.026–0.029–0.035 | 0.038– | |
| Unstructured |
|
| |
| Fisher | Autoregressive |
|
|
| Block | 0.000–0.001– | 0.042–0.047–0.050 | |
| Unstructured | 0.002–0.004–0.049 | 0.029–0.030–0.033 | |
| Romer | Autoregressive | 0.000–0.000–0.003 | 0.003–0.038–0.040 |
| Block | 0.023–0.025–0.031 | 0.033– | |
| Unstructured | 0.000–0.002–0.007 | 0.005–0.027–0.033 |
Simulated with respect to the type of gene set test and covariance matrix using the approach of selecting microRNAs by testing their target gene sets. Results are presented for the simulation setting of overlapping and disjunct target sets. Presented numbers are the minimum, median and maximum simulated across the range of the log fold change (between 0 and 6). Rates larger than the pre-specified level of 0.05 are printed in bold.
Simulated for microRNA-selection based on combined target set and microRNA-wise testing.
| Gene set test | Covariance matrix | Non-overlap. target sets | Overlap. target sets |
| GlobalTest | Autoregressive |
|
|
| Block |
|
| |
| Unstructured |
|
| |
| GlobalAncova | Autoregressive | 0.029–0.033– | 0.044–0.046– |
| Block | 0.043–0.045– | 0.034–0.036–0.045 | |
| Unstructured | 0.005–0.006–0.006 | 0.007–0.037–0.042 | |
| RepeatedHighDim | Autoregressive | 0.044–0.047– |
|
| Block | 0.031–0.034–0.047 | 0.029–0.037–0.044 | |
| Unstructured | 0.033–0.038–0.044 | 0.027– | |
| ROAST | Autoregressive | 0.045–0.048– | 0.035– |
| Block |
|
| |
| Unstructured | 0.033–0.036– | 0.049– | |
| KS | Autoregressive |
| 0.036–0.044–0.045 |
| Block |
|
| |
| Unstructured | 0.037–0.040–0.044 | 0.025–0.036–0.046 | |
| Wilcoxon | Autoregressive | 0.039–0.044– | 0.037–0.039–0.046 |
| Block | 0.026–0.029–0.036 | 0.030– | |
| Unstructured | 0.042–0.050– |
| |
| Fisher | Autoregressive |
|
|
| Block | 0.031–0.034– | 0.044–0.049– | |
| Unstructured | 0.045–0.050– | 0.039–0.043– | |
| Romer | Autoregressive | 0.007–0.008–0.012 | 0.012–0.044–0.047 |
| Block | 0.032–0.034–0.043 | 0.028–0.039– | |
| Unstructured | 0.038–0.041–0.046 | 0.033–0.036– |
Simulated with respect to the type of gene set test and covariance matrix using the approach of combined target set and microRNA-wise testing. Results are presented for the simulation setting of overlapping and disjunct target sets. Presented numbers are the minimum, median and maximum simulated across the range of the log fold change (between 0 and 6). Rates larger than the pre-specified level of 0.05 are printed in bold.
Figure 2Effect of overlapping target gene sets on the simulated .
Effects are presented for component-wise testing (dotted line), target set-wise testing (dashed line) and the combination approach (solid line). While the competitive approaches such as the ‘Wilcoxon’-based gene set test (top) still maintained the pre-specified -level of 5 when target gene sets overlapped, the increased dramatically when employing the self-contained approaches such as the ‘globaltest’ procedure (bottom).
Relation between simulated power rate curves.
| Gene set test | Covariance matrix | Non-overlap. target sets | Overlap. target sets |
| Globaltest | Autoregr. | miRNA < set ≈ combined | miRNA < set ≈ combined |
| Block & unstr. | miRNA < set < combined | miRNA < combined < set | |
| GlobalAncova | Autoregr. | miRNA < set < combined | miRNA < set ≈ combined |
| Block & unstr. | miRNA < set < combined | miRNA < combined < set | |
| Rep’HighDim | Autoregr. | miRNA < set ≈ combined | miRNA < set ≈ combined |
| Block & unstr. | miRNA < set ≈ combined | miRNA < set < combined | |
| ROAST | Autoregr. | *miRNA < combined < set | miRNA < combined < set |
| Block | miRNA ≈ set < combined | miRNA < combined < set | |
| Unstr. | miRNA ≈ set < combined | miRNA < set ≈ combined | |
| KS | Autoregr. | miRNA < set < combined | miRNA < set < combined |
| Block & unstr. | miRNA < set ≤ combined | miRNA < set < combined | |
| Wilcoxon | Autoregr. | miRNA < set < combined | miRNA < set < combined |
| Block & unstr. | miRNA < set ≤ combined | miRNA < set < combined | |
| Fisher | Autoregr. | miRNA < set < combined | miRNA < set < combined |
| Block & unstr. | miRNA < set ≤ combined | miRNA < set < combined | |
| Romer | Autoregr. | miRNA < combined < set | miRNA < combined < set |
| Block & unstr. | set < miRNA < combined | set < miRNA < combined |
Relation between simulated power rate curves of microRNA-wise, target set-wise and combined testing. Although, power curves sometimes intersected, this table gives the general tendency of the relations between the three approaches. *Compare Figure 3 left.
Figure 3Simulated average power rates with respect to the log fold change .
Left: component-wise testing (dotted line), target testing with ‘ROAST’ (dashed line) and the combination approach (solid line), each in the case of an autoregressive covariance structure and non-overlapping target sets. Right: combination approach based on ‘globaltest’ (dotted line), ‘Wilcoxon’ (dashed line) and ‘Romer’ (solid line), each in the case of an unstructured covariance matrix and overlapping target sets.
Rating orders of the in simulations of combined testing.
| Gene sets | Covariance matrix | GT | GA | RHD | KS | W | F | Ro | R |
| Non-Overlap. | Autoregressive | 4 | 5 | 3 | 1 | 1 | 8 | 6 | 7 |
| Block | 7 | 8 | 4 | 1 | 1 | 5 | 6 | 3 | |
| Unstructured | 7 | 8 | 4 | 1 | 1 | 5 | 6 | 3 | |
| Overlapping | Autoregressive | 1 | 1 | 1 | 1 | 1 | 6 | 7 | 8 |
| Block | 6 | 6 | 4 | 1 | 1 | 3 | 8 | 5 | |
| Unstructured | 7 | 6 | 4 | 1 | 1 | 3 | 8 | 4 |
Rating orders are based on the following gene set tests: GlobalTest (GT), GlobalAncova (GA), RepeatedHighDim (RHD), Kolm. Smirnov (KS), Wilcoxon (W), Fisher (F), ROAST (Ro) and Romer (R). The rank 1 denotes the largest power rate while the rank 8 denotes the worst power rate (comparable -curves were given the same position in the ranking). Note that W and KS always have the largest power rates.
Original results and re-analysis of microRNAs in rat brains.
| Original analysis | Re-analysis | |||||
| miRNA | miRNA | Targ.up | Targ. down | Glob. tests | Enrich. tests | Rot. tests |
|
| up | n.s. | sig. | sig. | sig. | sig. |
|
| up | n.s. | sig. | sig. | sig. | sig. |
|
| up | n.s. | sig. | sig. | sig. | sig. |
|
| up | n.s. | sig. | sig. | sig. | sig. |
|
| up | n.s | n.s | sig. | sig. | sig. |
|
| down | sig. | n.s. | sig. | sig. | sig. |
|
| down | sig. | n.s. | sig. | sig. | sig. |
|
| down | sig. | n.s. | sig. | sig. | sig. |
|
| down | sig. | n.s. | sig. | sig. | sig. |
|
| down | sig. | n.s. | sig. | sig. | sig. |
|
| down | sig. | sig. | sig. | sig. | sig. |
|
| down | n.s. | n.s. | sig. | sig. | sig. |
|
| down | n.s. | n.s. | sig. | sig. | sig. |
|
| down | n.s. | n.s. | sig. | sig. | sig. |
|
| down | n.s. | n.s. | sig. | sig. | sig. |
|
| down | n.s. | n.s. | sig. | sig. | sig. |
|
| down | n.s. | n.s. | sig. | sig. | sig. |
|
| down | n.s. | n.s. | sig. | sig. | sig. |
|
| up | sig. | n.s. | sig. | sig. | n.s. |
|
| up | n.s. | n.s. | sig. | sig. | n.s. |
|
| up | n.s. | sig. | sig. | sig. | n.s. |
|
| down | n.s. | n.s. | sig. | sig. | n.s. |
|
| down | sig. | sig. | n.s. | ||
|
| down | sig. | n.s. | sig. | sig. | sig. (R) |
|
| down | n.s. | n.s. | sig. | sig. | sig. (R) |
|
| down | n.s. | n.s. | sig. | sig. | sig. (r) |
|
| down | n.s. | sig. | sig. | sig. | n.s. |
MicroRNAs originally detected in expression data from rat brains by Nielsen et al. [28]. Columns from left to right are the name of the microRNAs, the originally reported results (columns 2-4) and the results of our re-analysis (columns 5-7). Nielsen et al. performed their gene set tests separately for subsets of up- and down-regulated mRNAs yielding two results per gene set. Our re-analysis was performed by use of the combination approach based on either global tests, enrichment tests or rotation tests (‘ROAST’ (R), ‘Romer’ (r)). Significance (sig.) was declared when the FDR-adjusted -value was and not significant (n. s.) otherwise.
GO-Terms for Data Examples.
|
| GO-Term |
|
| |
| 7.42e-20 | sequence-specific DNA binding transcription factor activity |
| 1.71e-17 | regulation of transcription, DNA-dependent |
| 6.90e-17 | positive regulation of transcription from RNA polymerase II promoter |
| 8.57e-13 | sequence-specific DNA binding |
| 2.92e-12 | transcription, DNA-dependent |
| 2.32e-10 | negative regulation of transcription from RNA polymerase II promoter |
| 3.00e-10 | zinc ion binding |
| 1.64e-09 | axon guidance |
| 1.64e-09 | DNA binding |
| 6.79e-09 | SMAD binding |
| 1.02e-08 | growth factor binding |
| 7.56e-08 | negative regulation of transcription, DNA-dependent |
| 2.92e-07 | RNA polymerase II core promoter proximal |
| region sequence-specific DNA binding transcription factor | |
| activity involved in positive regulation of transcription | |
| 2.63e-06 | positive regulation of transcription, DNA-dependent |
| 4.72e-06 | anterior/posterior pattern specification |
| 5.36e-06 | in utero embryonic development |
| 7.85e-06 | regulation of actin cytoskeleton organization |
| 1.05e-05 | ephrin receptor binding |
| 1.37e-05 | gastrulation with mouth forming second |
| 1.42e-05 | regulation of translation |
| 1.43e-05 | regulation of cell-matrix adhesion |
| 1.85e-05 | palate development |
| 2.51e-05 | transmembrane receptor protein serine/threonine kinase activity |
| 2.99e-05 | axonogenesis |
|
| |
| 3.23e-15 | S phase |
| 4.21e-14 | regulation of gene silencing |
| 1.36e-13 | nucleosome |
| 8.53e-12 | nucleoplasm |
| 5.86e-11 | nucleosome assembly |
| 4.70e-10 | negative regulation of megakaryocyte differentiation |
| 3.24e-09 | chromosome |
| 1.53e-05 | transcription initiation, DNA-dependent |
| 0.0002 | telomere maintenance |
| 0.0002 | chromatin organization |
| 0.0009 | phosphatidylinositol-mediated signaling |
| 0.0025 | CenH3-containing nucleosome assembly at centromere |
| 0.0119 | gene expression |
| 0.0279 | ribonucleoprotein complex |
| 0.0296 | blood coagulation |
| 0.0420 | protein refolding |
| 0.0487 | signalosome |
The top-scoring Gene Ontology (GO) terms with lowest p-values (according to one-sided Fisher’s exact test) of miRNAs’ target sets from Neurogenesis (above) and HIV (below) data example.