| Literature DB >> 28637422 |
J F Mudge1, C J Martyniuk2, J E Houlahan3.
Abstract
BACKGROUND: Transcriptomic approaches (microarray and RNA-seq) have been a tremendous advance for molecular science in all disciplines, but they have made interpretation of hypothesis testing more difficult because of the large number of comparisons that are done within an experiment. The result has been a proliferation of techniques aimed at solving the multiple comparisons problem, techniques that have focused primarily on minimizing Type I error with little or no concern about concomitant increases in Type II errors. We have previously proposed a novel approach for setting statistical thresholds with applications for high throughput omics-data, optimal α, which minimizes the probability of making either error (i.e. Type I or II) and eliminates the need for post-hoc adjustments.Entities:
Keywords: High throughput analysis; Microarrays; Multiple comparisons; Optimal α; Post-hoc corrections; RNA-seq; Type I and II error rates
Mesh:
Substances:
Year: 2017 PMID: 28637422 PMCID: PMC5480162 DOI: 10.1186/s12859-017-1728-3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Distribution of the number of biological replicates per treatment group over 203 fish microarray papers published between 2002 and 2011
Type I and II error rates: Median, 1st and 3rd quartiles, minimum and maximum α, β, average of α and β, and implied costs of Type I/II errors, evaluated for the standard α = 0.05 and for the optimal α approach, at 3 critical effect sizes (1, 2, and 4 SD), for 203 fish microarray papers with tests that have at least 3 replicates, published between 2002 and 2011 (assuming two-tailed, two-sample t-tests)
| Critical effect size | Decision threshold | Statistical parameter | Minimum | 1st quartile | Median | 3rd quartile | Maximum |
|---|---|---|---|---|---|---|---|
| 1 standard deviation | standard α | α | 0.05 | 0.05 | 0.05 | 0.05 | 0.05 |
| β | 0.088 | 0.71 | 0.78 | 0.84 | 0.84 | ||
| (α + β)/2 | 0.069 | 0.38 | 0.41 | 0.45 | 0.45 | ||
| Implied Type I/II error cost ratio | 1.5 | 2.6 | 3.1 | 3.4 | 3.9 | ||
| optimal α | α | 0.064 | 0.26 | 0.29 | 0.32 | 0.32 | |
| β | 0.070 | 0.34 | 0.38 | 0.42 | 0.42 | ||
| (α + β)/2 | 0.067 | 0.30 | 0.33 | 0.37 | 0.37 | ||
| Implied Type I/II error cost ratio | 1 | 1 | 1 | 1 | 1 | ||
| 2 standard deviations | standard α | α | 0.05 | 0.05 | 0.05 | 0.05 | 0.05 |
| β | 0.0000015 | 0.21 | 0.34 | 0.54 | 0.54 | ||
| (α + β)/2 | 0.025 | 0.13 | 0.20 | 0.29 | 0.29 | ||
| Implied Type I/II error cost ratio | 0.00011 | 3.4 | 4.5 | 5.1 | 5.1 | ||
| optimal α | α | 0.0011 | 0.11 | 0.15 | 0.21 | 0.21 | |
| β | 0.00094 | 0.10 | 0.13 | 0.18 | 0.18 | ||
| (α + β)/2 | 0.0010 | 0.10 | 0.14 | 0.19 | 0.19 | ||
| Implied Type I/II error cost ratio | 1 | 1 | 1 | 1 | 1 | ||
| 4 standard deviations | standard α | α | 0.05 | 0.05 | 0.05 | 0.05 | 0.05 |
| β | 0 | 0.00023 | 0.0038 | 0.052 | 0.052 | ||
| (α + β)/2 | 0.025 | 0.025 | 0.027 | 0.051 | 0.051 | ||
| Implied Type I/II error cost ratio | 0.00011 | 0.014 | 0.19 | 1.9 | 1.9 | ||
| optimal α | α | 0.000000031 | 0.013 | 0.028 | 0.065 | 0.065 | |
| β | 0.000000017 | 0.0065 | 0.014 | 0.031 | 0.031 | ||
| (α + β)/2 | 0.000000024 | 0.0096 | 0.021 | 0.048 | 0.048 | ||
| Implied Type I/II error cost ratio | 1 | 1 | 1 | 1 | 1 |
Replicate estimates: Number of replicates per treatment needed to achieve maximum acceptable averages of α and β of 0.00001, 0.0001, 0.001, 0.01, 0.05, 0.1, and 0.125, at critical effects sizes of 1, 2, and 4 SD, for an independent two-tailed, two sample t-test
| Maximum acceptable average of α and β | Number of samples required | ||
|---|---|---|---|
| CES = 1SD | CES = 2SD | CES = 4SD | |
| 0.00001 | 156 | 43 | 15 |
| 0.0001 | 120 | 33 | 12 |
| 0.001 | 85 | 24 | 9 |
| 0.01 | 50 | 14 | 5 |
| 0.05 | 27 | 8 | 3 |
| 0.1 | 18 | 6 | 3 |
| 0.125 | 16 | 5 | 3 |
A and B. Required number of replicates: A) Number of times a study would have to be repeated with the same conclusion to achieve an α of 0.00001, 0.0001, 0.001, 0.01, 0.05, 0.1, and 0.2, at critical effects sizes of 1, 2, and 4 SD, for an independent two-tailed, two sample t-test. (B) Number of times a study would have to be repeated with the same conclusion to achieve a β of 0.00001, 0.0001, 0.001, 0.01, 0.05, 0.1, and 0.2, at critical effects sizes of 1, 2, and 4 SD, for an independent two-tailed, two sample t-test
| A. | ||||||||
| Critical effect size | Within-study replication | Replication of the experiment needed to achieve | ||||||
| α = 0.00001 | α = 0.0001 | α = 0.001 | α = 0.01 | α = 0.05 | α = 0.1 | α = 0.2 | ||
| 1 SD | 4 | 10 | 8 | 6 | 4 | 3 | 2 | 2 |
| 6 | 9 | 7 | 5 | 4 | 3 | 2 | 2 | |
| 8 | 8 | 6 | 5 | 3 | 3 | 2 | 2 | |
| 10 | 7 | 6 | 4 | 3 | 2 | 2 | 1 | |
| 2 SD | 4 | 7 | 5 | 4 | 3 | 2 | 2 | 1 |
| 6 | 5 | 4 | 3 | 2 | 2 | 1 | 1 | |
| 8 | 4 | 4 | 3 | 2 | 1 | 1 | 1 | |
| 10 | 4 | 3 | 2 | 2 | 1 | 1 | 1 | |
| 4 SD | 4 | 4 | 3 | 2 | 2 | 1 | 1 | 1 |
| 6 | 3 | 2 | 2 | 1 | 1 | 1 | 1 | |
| 8 | 2 | 2 | 2 | 1 | 1 | 1 | 1 | |
| 10 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | |
| B. | ||||||||
| Critical effect size | Within-study replication | Replication of the experiment needed to achieve | ||||||
| β = 0.00001 | β = 0.0001 | β = 0.001 | β = 0.01 | β = 0.05 | β = 0.1 | β = 0.2 | ||
| 1 SD | 4 | 12 | 10 | 8 | 5 | 4 | 3 | 2 |
| 6 | 10 | 8 | 6 | 4 | 3 | 2 | 2 | |
| 8 | 9 | 7 | 6 | 4 | 3 | 2 | 2 | |
| 10 | 8 | 6 | 5 | 3 | 2 | 2 | 2 | |
| 2 SD | 4 | 6 | 5 | 4 | 3 | 2 | 2 | 1 |
| 6 | 5 | 4 | 3 | 2 | 2 | 1 | 1 | |
| 8 | 4 | 3 | 3 | 2 | 1 | 1 | 1 | |
| 10 | 4 | 3 | 2 | 2 | 1 | 1 | 1 | |
| 4 SD | 4 | 3 | 3 | 2 | 2 | 1 | 1 | 1 |
| 6 | 2 | 2 | 2 | 1 | 1 | 1 | 1 | |
| 8 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | |
| 10 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | |
A-C. A comparison of the mean number of significant results among four different procedures for evaluating significance of multiple comparisons: Type I errors, and Type II errors for 100 iterations of 15,000 simulated differential gene expression test using (1) α = 0.05 for all tests, (2) a Bonferroni correction to adjust the family-wise error rate (FWER) to 0.05, (3) the Benjamini-Hochberg procedure to adjust the false-discovery rate (FDR) to 0.05, and (4) optimal α
| Critical effect size (CES) | Average of 100 iterations of 15,000 tests | α = 0.05 | Bonferroni FWER = 0.05 | Benjamini-Hochberg FDR = 0.05 | Optimal α |
|---|---|---|---|---|---|
| A. | |||||
| CES = 1SD | # of significant results | 2046 | 0 | 1 | 6776 |
| # of Type I errors | 376 | 0 | 0 | 2143 | |
| # of Type II errors ≥ CES | 5829 | 7500 | 7499 | 2867 | |
| # of Type I and II errors | 6205 | 7500 | 7499 | 5010 | |
| % error reduction by using optimal α | 19.3% | 33% | 33% | - | |
| CES = 2SD | # of significant results | 5298 | 3 | 1709 | 7659 |
| # of Type I errors | 379 | 0 | 43 | 1130 | |
| # of Type II errors ≥ CES | 2581 | 7497 | 5834 | 970 | |
| # of Type I and II errors | 2960 | 7497 | 5876 | 2100 | |
| % error reduction by using optimal α | 29% | 72% | 64% | - | |
| CES = 4SD | # of significant results | 7848 | 61 | 7560 | 7608 |
| # of Type I errors | 378 | 0 | 190 | 212 | |
| # of Type II errors ≥ CES | 30 | 7439 | 130 | 105 | |
| # of Type I and II errors | 408 | 7439 | 320 | 317 | |
| % error reduction by using optimal α | 22% | 96% | 1% | - | |
| B. | |||||
| CES = 1SD | # of significant results | 1400 | 0 | 0 | 1456 |
| # of Type I errors | 562 | 0 | 0 | 590 | |
| # of Type II errors ≥ CES | 2912 | 3750 | 3750 | 2883 | |
| # of Type I and II errors | 3474 | 3750 | 3750 | 3473 | |
| % error reduction by using optimal α | 0.02% | 7% | 7% | - | |
| CES = 2SD | # of significant results | 3032 | 1 | 119 | 3537 |
| # of Type I errors | 562 | 0 | 5 | 791 | |
| # of Type II errors ≥ CES | 1280 | 3749 | 3636 | 1004 | |
| # of Type I and II errors | 1842 | 3749 | 3641 | 1795 | |
| % error reduction by using optimal α | 3% | 52% | 51% | - | |
| CES = 4SD | # of significant results | 4295 | 31 | 3665 | 3826 |
| # of Type I errors | 560 | 0 | 136 | 200 | |
| # of Type II errors ≥ CES | 15 | 3719 | 221 | 124 | |
| # of Type I and II errors | 575 | 3719 | 358 | 324 | |
| % error reduction by using optimal α | 44% | 91% | 9% | - | |
| C. | |||||
| CES = 1SD | # of significant results | 1012 | 0 | 0 | 10 |
| # of Type I errors | 680 | 0 | 0 | 5 | |
| # of Type II errors ≥ CES | 1167 | 1500 | 1500 | 1495 | |
| # of Type I and II errors | 1847 | 1500 | 1500 | 1500 | |
| % error reduction by using optimal α | 19% | 0% | 0% | - | |
| CES = 2SD | # of significant results | 1662 | 1 | 3 | 1083 |
| # of Type I errors | 677 | 0 | 0 | 334 | |
| # of Type II errors ≥ CES | 515 | 1499 | 1497 | 752 | |
| # of Type I and II errors | 1192 | 1499 | 1498 | 1086 | |
| % error reduction by using optimal α | 9% | 28% | 27% | - | |
| CES = 4SD | # of significant results | 2169 | 12 | 1261 | 1539 |
| # of Type I errors | 675 | 0 | 56 | 143 | |
| # of Type II errors ≥ CES | 6 | 1488 | 295 | 105 | |
| # of Type I and II errors | 681 | 1488 | 350 | 248 | |
| % error reduction by using optimal α | 64% | 83% | 29% | - | |
Type II error rates and optimal α levels were evaluated using three different critical effect sizes (CES), representing effects as large as 1, 2, and 4 standard deviations (SD) of the data. The 15,000 simulated tests had 4 replicates in the experimental and control groups, and were constructed such that (A) H prior probability = 0.50, H prior probability = 0.50; (B) H prior probability = 0.25, H prior probability = 0.75; (C) H prior probability = 0.10, H prior probability = 0.90