| Literature DB >> 22044602 |
Jie Yang1, George Casella, Lauren M McIntyre.
Abstract
BACKGROUND: Many analyses of gene expression data involve hypothesis tests of an interaction term between two fixed effects, typically tested using a residual variance. In expression studies, the issue of variance heteroscedasticity has received much attention, and previous work has focused on either between-gene or within-gene heteroscedasticity. However, in a single experiment, heteroscedasticity may exist both within and between genes. Here we develop flexible shrinkage error estimators considering both between-gene and within-gene heteroscedasticity and use them to construct F-like test statistics for testing interactions, with cutoff values obtained by permutation. These permutation tests are complicated, and several permutation tests are investigated here.Entities:
Mesh:
Year: 2011 PMID: 22044602 PMCID: PMC3221690 DOI: 10.1186/1471-2105-12-427
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Results from raw data permutation
| Restricted? | Data set | |||||||
|---|---|---|---|---|---|---|---|---|
| YES | null-ce | 5.05(0.07) | 5.06(0.08) | 5.12(0.17) | 5.09(0.10) | 5.09(0.10) | 5.05(0.08) | 5.11(0.10) |
| null-gh | 5.02(0.07) | 5.13(0.16) | 5.26(0.20) | 5.03(0.07) | 5.07(0.12) | 5.03(0.07) | 5.11(0.16) | |
| null-wgh | 4.97(0.07) | 4.96(0.09) | 4.93(0.18) | 4.99(0.08) | 4.99(0.12) | 4.96(0.09) | 5.01(0.16) | |
| null-bgh | 5.02(0.07) | 4.99(0.17) | 5.03(0.21) | 5.02(0.07) | 5.02(0.15) | 5.01(0.09) | 5.03(0.18) | |
| NO | null-ce | 5.10(0.07) | 5.06(0.08) | 5.06(0.08) | 7.4(0.12) | 5.15(0.09) | 5.12(0.08) | 5.08(0.08) |
| null-gh | 5.08(0.07) | 5.12(0.16) | 5.12(0.12) | 7.4(0.09) | 5.10(0.11) | 5.07(0.10) | 5.12(0.09) | |
| null-wgh | 12.31(0.10) | 7.56(0.10) | 4.61(0.10) | 17.37(0.14) | 5.32(0.11) | 5.07(0.09) | 5.87(0.11) | |
| null-bgh | 12.31(0.11) | 6.63(0.17) | 5.55(0.19) | 15.68(0.12) | 6.30(0.12) | 6.10(0.11) | 6.30(0.11) | |
CWER obtained from 1,000 permutations with the nominal significance level setting at 0.05, with standard errors in parentheses. Nine hundred simulation runs were performed to get empirical average CWER of all types of F-like test statistics.
Figure 1The comparison of power curves of all . The x-axis is the average power from analyzing 900 simulated data sets using F1 with tabled p-values. The y-axis is the estimated powers using empirical gene-specific null distributions from 1,000 residual permutations. The upper four plots show the results with restricted residual permutation, while the lower four plots show the results from unrestricted residual permutation. The solid line indicates the empirical average CWER of a statistic is at the prespecified level, and the dashed line shows an inflated empirical average CWER."ce," all genes have common error; "gh," only between-gene heteroscedasticity exists; "wgh," only within-gene heteroscedasticity exists; "bgh," both between-gene and within-gene heteroscedasticity exist.
Figure 2The comparison of power curves of from unrestricted residual permutation versus other . Only results from permutation combinations that can control prespecified CWER are used in this figure. The x-axis is the average power after analyzing 900 simulated data sets using Fand 1,000 unrestricted residual permutations. The y-axis is the estimated power from other F-like test statistics and empirical gene-specific null distributions based on the appropriate permutation. The solid black line corresponds to Fwith unrestricted permutation, and this test always controls prespecified CWER."ce," all genes have common error; "gh," only between-gene heteroscedasticity exists; "wgh," only within-gene heteroscedasticity exists; "bgh," both between-gene and within gene heteroscedasticity exist; "res," restricted permutation; "unres," unrestricted permuation.
Results from residual permutation
| Restricted? | Data set | |||||||
|---|---|---|---|---|---|---|---|---|
| YES | null-ce | 4.59(0.07) | 4.15(0.08) | 3.63(0.14) | 4.09(0.09) | 4.55(0.07) | 3.23(0.06) | 4.44(0.08) |
| null-gh | 4.57(0.07) | 4.1(0.13) | 3.95(0.16) | 4.49(0.07) | 4.6(0.07) | 4.61(0.07) | 4.38(0.07) | |
| null-wgh | 6.74(0.08) | 4.33(0.08) | 3.51(0.14) | 6.49(0.09) | 4.38(0.07) | 4.2(0.07) | 2.78(0.09) | |
| null-bgh | 6.74(0.08) | 4.35(0.16) | 4.07(0.19) | 6.58(0.08) | 4.36(0.07) | 4.16(0.07) | 3.64(0.07) | |
| NO | null-ce | 5.1(0.07) | 4.99(0.08) | 4.5(0.08) | 4.59(0.08) | 4.99(0.07) | 4.1(0.07) | 4.68(0.07) |
| null-gh | 5.1(0.07) | 4.83(0.1) | 4.59(0.11) | 5.08(0.07) | 4.99(0.07) | 5.01(0.07) | 4.95(0.07) | |
| null-wgh | 10.75(0.09) | 8.46(0.09) | 7.6(0.09) | 12.37(0.11) | 5.03(0.08) | 6.43(0.08) | 4.93(0.08) | |
| null-bgh | 10.75(0.1) | 8.38(0.17) | 8.07(0.19) | 10.79(0.1) | 5.02(0.08) | 6.38(0.08) | 6.73(0.08) | |
CWER obtained from 1,000 permutations with the nominal significance level setting at 0.05, with standard errors in parentheses. Nine hundred simulation runs were performed to get empirical average CWER of all types of F-like test statistics.
Probe sets with significant line*probe terms found by F-like test statistics and appropriate residual permutation procedures and Smyth's moderated F-test statistic
| Test statistic | Restricted permutation? | Number of probe sets found | True false discovery rate | Power |
|---|---|---|---|---|
| Yes | 124 | 22.6% | 12.4% | |
| Yes | 187 | 29.4% | 17.1% | |
| No | 453 | 29.5% | 41.1% | |
| No | 455 | 28.8% | 41.7% | |
| No | 474 | 28.9% | 43.4% | |
| Yes | 136 | 24.3% | 13.3% | |
| Yes | 122 | 22.1% | 12.2% | |
| Yes | 116 | 21.5% | 11.7% | |
| N/A | 535 | 34.1% | 75.5% | |
| N/A | 813 | 34.4% | 68.8% |
The CWER was set to 0.05. Gene-specific cutoff values were obtained from 1,000 permutations. "moderated F-1" and "moderated F-2" represent results from using moderated F statistic without any multiple testing adjustment and setting FDR to 5%.