| Literature DB >> 24376639 |
Chia-Ling Kuo1, Dmitri Zaykin2.
Abstract
In experiments with many statistical tests there is need to balance type I and type II error rates while taking multiplicity into account. In the traditional approach, the nominal [Formula: see text]-level such as 0.05 is adjusted by the number of tests, [Formula: see text], i.e., as 0.05/[Formula: see text]. Assuming that some proportion of tests represent "true signals", that is, originate from a scenario where the null hypothesis is false, power depends on the number of true signals and the respective distribution of effect sizes. One way to define power is for it to be the probability of making at least one correct rejection at the assumed [Formula: see text]-level. We advocate an alternative way of establishing how "well-powered" a study is. In our approach, useful for studies with multiple tests, the ranking probability [Formula: see text] is controlled, defined as the probability of making at least [Formula: see text] correct rejections while rejecting hypotheses with [Formula: see text] smallest P-values. The two approaches are statistically related. Probability that the smallest P-value is a true signal (i.e., [Formula: see text]) is equal to the power at the level [Formula: see text], to an very good excellent approximation. Ranking probabilities are also related to the false discovery rate and to the Bayesian posterior probability of the null hypothesis. We study properties of our approach when the effect size distribution is replaced for convenience by a single "typical" value taken to be the mean of the underlying distribution. We conclude that its performance is often satisfactory under this simplification; however, substantial imprecision is to be expected when [Formula: see text] is very large and [Formula: see text] is small. Precision is largely restored when three values with the respective abundances are used instead of a single typical effect size value.Entities:
Mesh:
Year: 2013 PMID: 24376639 PMCID: PMC3869737 DOI: 10.1371/journal.pone.0083079
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Simulated plot of −log10 (P-value).
red circles– true signals; crosses– false signals.
Number of true signals in a set of top P-values.
|
|
|
|
|
| 50 | 5.5 | 7200 | 34.1 |
| 600 | 15.4 | 7750 | 34.8 |
| 1150 | 19.3 | 8300 | 35.4 |
| 1700 | 22.1 | 8850 | 36.0 |
| 2250 | 24.1 | 9400 | 36.6 |
| 2800 | 25.9 | 9950 | 37.1 |
| 3350 | 27.3 | 10500 | 37.7 |
| 3900 | 28.6 | 11050 | 38.2 |
| 4450 | 29.7 | 11600 | 38.6 |
| 5000 | 30.8 | 12150 | 39.1 |
| 5550 | 31.7 | 12700 | 39.5 |
| 6100 | 32.6 | 13250 | 40.0 |
| 6650 | 33.3 | 13800 | 40.4 |
E(T): expected number of true signals among top u P-values.
The total number of tests: ; the total number of true signals: 75.
Figure 2Number of true signals in a set of top P-values.
The number of true signals (T) expected to be encountered among a specific number of the smallest P-values.
1-rFDR(u) by ranking probabilities for u = 100 top P-values using the distribution mean, or distribution of noncentralities for true effects where noncentralities of M = 100 true effects are distributed as Gamma(1,b).
|
|
| true | rankdist | rankbin | rank |
| 1e4 | 5 | 0.298 | 0.300 | 0.309 | 0.320 |
| 1e4 | 10 | 0.478 | 0.477 | 0.486 | 0.610 |
| 1e4 | 14 | 0.562 | 0.564 | 0.574 | 0.761 |
| 1e4 | 17 | 0.609 | 0.611 | 0.621 | 0.837 |
| 1e4 | 25 | 0.696 | 0.696 | 0.705 | 0.943 |
| 1e4 | 30 | 0.733 | 0.731 | 0.740 | 0.971 |
| 1e4 | 40 | 0.780 | 0.781 | 0.795 | 0.994 |
| 1e4 | 50 | 0.814 | 0.815 | 0.840 | 0.999 |
| 1e4 | 75 | 0.865 | 0.864 | 0.919 | 1.000 |
| 1e4 | 100 | 0.892 | 0.892 | 0.961 | 1.000 |
| 1e5 | 5 | 0.172 | 0.171 | 0.177 | 0.136 |
| 1e5 | 10 | 0.345 | 0.346 | 0.356 | 0.394 |
| 1e5 | 14 | 0.441 | 0.444 | 0.452 | 0.584 |
| 1e5 | 17 | 0.500 | 0.500 | 0.509 | 0.696 |
| 1e5 | 25 | 0.603 | 0.604 | 0.614 | 0.878 |
| 1e5 | 30 | 0.652 | 0.650 | 0.655 | 0.934 |
| 1e5 | 40 | 0.714 | 0.714 | 0.712 | 0.982 |
| 1e5 | 50 | 0.759 | 0.757 | 0.759 | 0.996 |
| 1e5 | 75 | 0.822 | 0.823 | 0.857 | 1.000 |
| 1e5 | 100 | 0.860 | 0.859 | 0.922 | 1.000 |
| 1e6 | 5 | 0.094 | 0.094 | 0.094 | 0.048 |
| 1e6 | 10 | 0.248 | 0.247 | 0.259 | 0.215 |
| 1e6 | 14 | 0.348 | 0.345 | 0.353 | 0.394 |
| 1e6 | 17 | 0.405 | 0.405 | 0.411 | 0.522 |
| 1e6 | 25 | 0.524 | 0.522 | 0.534 | 0.777 |
| 1e6 | 30 | 0.576 | 0.574 | 0.586 | 0.869 |
| 1e6 | 40 | 0.649 | 0.649 | 0.649 | 0.959 |
| 1e6 | 50 | 0.702 | 0.701 | 0.690 | 0.989 |
| 1e6 | 75 | 0.783 | 0.781 | 0.787 | 1.000 |
| 1e6 | 100 | 0.827 | 0.826 | 0.869 | 1.000 |
K: number of tests.
b: scale parameter of Gamma to model noncentralities for true effects (shape parameter is fixed at 1).
true: simulation-based 1-rFDR(100).
rank: 1-rFDR(100) by ranking probabilities using mean.
rankbin: 1-rFDR(100) by ranking probabilities using the 3-bins approximation.
rankdist: 1-rFDR(100) by ranking probabilities using the noncentrality distribution.
1-rFDR(u) by ranking probabilities for u = 200 top P-values using the distribution mean, or distribution of noncentralities for true effects where noncentralities of M = 100 true effects are distributed as Gamma(1,b).
|
|
| true | rankdist | rankbin | rank |
| 1e4 | 5 | 0.186 | 0.186 | 0.191 | 0.215 |
| 1e4 | 10 | 0.278 | 0.278 | 0.284 | 0.374 |
| 1e4 | 14 | 0.320 | 0.320 | 0.327 | 0.443 |
| 1e4 | 17 | 0.344 | 0.343 | 0.350 | 0.471 |
| 1e4 | 25 | 0.383 | 0.382 | 0.391 | 0.496 |
| 1e4 | 30 | 0.397 | 0.398 | 0.410 | 0.499 |
| 1e4 | 40 | 0.419 | 0.420 | 0.439 | 0.500 |
| 1e4 | 50 | 0.435 | 0.434 | 0.461 | 0.500 |
| 1e4 | 75 | 0.454 | 0.454 | 0.489 | 0.500 |
| 1e4 | 100 | 0.464 | 0.465 | 0.497 | 0.500 |
| 1e5 | 5 | 0.105 | 0.105 | 0.109 | 0.094 |
| 1e5 | 10 | 0.199 | 0.199 | 0.204 | 0.248 |
| 1e5 | 14 | 0.248 | 0.249 | 0.254 | 0.350 |
| 1e5 | 17 | 0.276 | 0.277 | 0.283 | 0.406 |
| 1e5 | 25 | 0.330 | 0.329 | 0.333 | 0.479 |
| 1e5 | 30 | 0.350 | 0.350 | 0.354 | 0.493 |
| 1e5 | 40 | 0.382 | 0.381 | 0.385 | 0.499 |
| 1e5 | 50 | 0.401 | 0.401 | 0.413 | 0.500 |
| 1e5 | 75 | 0.430 | 0.430 | 0.463 | 0.500 |
| 1e5 | 100 | 0.447 | 0.446 | 0.487 | 0.500 |
| 1e6 | 5 | 0.057 | 0.058 | 0.058 | 0.034 |
| 1e6 | 10 | 0.140 | 0.141 | 0.147 | 0.138 |
| 1e6 | 14 | 0.193 | 0.192 | 0.196 | 0.241 |
| 1e6 | 17 | 0.223 | 0.223 | 0.227 | 0.311 |
| 1e6 | 25 | 0.281 | 0.282 | 0.288 | 0.436 |
| 1e6 | 30 | 0.306 | 0.307 | 0.312 | 0.473 |
| 1e6 | 40 | 0.345 | 0.344 | 0.343 | 0.496 |
| 1e6 | 50 | 0.370 | 0.370 | 0.367 | 0.500 |
| 1e6 | 75 | 0.407 | 0.407 | 0.424 | 0.500 |
| 1e6 | 100 | 0.428 | 0.428 | 0.465 | 0.500 |
K: number of tests.
b: scale parameter of Gamma to model noncentralities for true effects (shape parameter is fixed at 1).
true: simulation-based 1-rFDR(200).
rank: 1-rFDR(200) by ranking probabilities using mean.
rankbin: 1-rFDR(200) by ranking probabilities using the 3-bins approximation.
rankdist: 1-rFDR(200) by ranking probabilities using the noncentrality distribution.
1-rFDR(u) by ranking probabilities for u = 10 top P-values using the distribution mean, or distribution of noncentralities for true effects where noncentralities of M = 100 true effects are distributed as Gamma(a,b).
|
|
|
| true | rankdist | rankbin | rank |
| 1e4 | 0.7 | 3 | 0.447 | 0.433 | 0.406 | 0.259 |
| 1e4 | 0.9 | 3 | 0.564 | 0.553 | 0.531 | 0.365 |
| 1e4 | 0.7 | 5 | 0.760 | 0.754 | 0.749 | 0.512 |
| 1e4 | 0.9 | 5 | 0.855 | 0.880 | 0.881 | 0.683 |
| 1e4 | 0.7 | 10 | 0.988 | 0.998 | 0.999 | 0.952 |
| 1e4 | 0.9 | 10 | 0.999 | 1.000 | 1.000 | 0.999 |
| 1e5 | 0.7 | 3 | 0.203 | 0.195 | 0.157 | 0.066 |
| 1e5 | 0.9 | 3 | 0.285 | 0.273 | 0.234 | 0.108 |
| 1e5 | 0.7 | 5 | 0.472 | 0.473 | 0.434 | 0.183 |
| 1e5 | 0.9 | 5 | 0.628 | 0.620 | 0.595 | 0.303 |
| 1e5 | 0.7 | 10 | 0.935 | 0.950 | 0.968 | 0.655 |
| 1e5 | 0.9 | 10 | 0.982 | 0.995 | 0.998 | 0.877 |
| 1e6 | 0.7 | 3 | 0.086 | 0.081 | 0.051 | 0.014 |
| 1e6 | 0.9 | 3 | 0.121 | 0.120 | 0.085 | 0.026 |
| 1e6 | 0.7 | 5 | 0.265 | 0.265 | 0.203 | 0.050 |
| 1e6 | 0.9 | 5 | 0.388 | 0.375 | 0.320 | 0.097 |
| 1e6 | 0.7 | 10 | 0.801 | 0.801 | 0.816 | 0.308 |
| 1e6 | 0.9 | 10 | 0.922 | 0.937 | 0.954 | 0.546 |
K: number of tests.
a: shape parameter of Gamma to model noncentralities for true effects.
b: scale parameter of Gamma to model noncentralities for true effects
true: simulation-based 1-rFDR(10).
rank: 1-rFDR(10) by ranking probabilities using mean.
rankbin: 1-rFDR(10) by ranking probabilities using the 3-bins approximation.
rankdist: 1-rFDR(10) by ranking probabilities using the noncentrality distribution.
Figure 3Expected number of true discoveries among the first 200 top results.
•: 5000 cases and 5000 controls. ×: 10000 cases and 10000 controls. +: 15000 cases and 15000 controls.