| Literature DB >> 24350232 |
Qing Pan1.
Abstract
We review and compare multiple hypothesis testing procedures used in clinical trials and those in genomic studies. Clinical trials often employ global tests, which draw an overall conclusion for all the hypotheses, such as SUM test, Two-Step test, Approximate Likelihood Ratio test (ALRT), Intersection-Union Test (IUT), and MAX test. The SUM and Two-Step tests are most powerful under homogeneous treatment effects, while the ALRT and MAX test are robust in cases with non-homogeneous treatment effects. Furthermore, the ALRT is robust to unequal sample sizes in testing different hypotheses. In genomic studies, stepwise procedures are used to draw marker-specific conclusions and control family wise error rate (FWER) or false discovery rate (FDR). FDR refers to the percent of false positives among all significant results and is preferred over FWER in screening high-dimensional genomic markers due to its interpretability. In cases where correlations between test statistics cannot be ignored, Westfall-Young resampling method generates the joint distribution of P-values under the null and maintains their correlation structure. Finally, the GWAS data from a clinical trial searching for SNPs associated with nephropathy among Type 1 diabetic patients are used to illustrate various procedures.Entities:
Keywords: false discovery rate; family wise error rate; global test; multiple hypotheses testing; resampling method; stepwise procedure
Year: 2013 PMID: 24350232 PMCID: PMC3859974 DOI: 10.3389/fpubh.2013.00063
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
FWER versus number of tests and the size of individual tests.
| α | ||
|---|---|---|
| 0.01 | 2 | 0.020 |
| 0.01 | 5 | 0.049 |
| 0.01 | 10 | 0.096 |
| 0.01 | 100 | 0.634 |
| 0.01 | 1000 | 1.000 |
| 0.05 | 2 | 0.098 |
| 0.05 | 5 | 0.226 |
| 0.05 | 10 | 0.401 |
| 0.05 | 100 | 0.994 |
| 0.05 | 1000 | 1.000 |
Comparison. of five global test statistics.
| Test | Test statistic |
|---|---|
| SUM | |
| Two-step: step one | |
| Two-step: step two | |
| ALRT | |
| IUT | |
| MAX |
L0 and L represent the maximum log likelihood under H and H respectively. And is the coefficient estimates from the pooled data.
Figure 1Comparison of rejection regions of five global tests.
Simulation results: size and power (%) with different true value positions.
| True values | ρ | Testing procedure | |||||
|---|---|---|---|---|---|---|---|
| β1 | β2 | SUM | Two-step | IUT | ALRT | MAX | |
| 0 | 0 | 0.4 | 4.9 | 4.7 | 0.7 | 4.4 | 4.5 |
| 0.3 | 0.3 | 0.4 | 21 | 20 | 5 | 18 | 18 |
| 0.6 | 0.6 | 0.4 | 52 | 50 | 23 | 48 | 46 |
| 0.6 | 0.3 | 0.4 | 35 | 33 | 11 | 35 | 36 |
| 0.6 | 0 | 0.4 | 21 | 17 | 3 | 32 | 33 |
| 0.6 | −0.3 | 0.4 | 11 | 7 | 1 | 33 | 29 |
The Type I error 4.7% for Two-Step refers to cases rejected in Step Two. The Type I error for the three tests in IUT are 4.3%, 5.1%, and 4.4%, respectively. The intersection of the three rejection regions gives the overall Type I error rate.
Simulation results: power (%) under different correlation between outcomes.
| True values | ρ | Testing procedure | |||||
|---|---|---|---|---|---|---|---|
| β1 | β2 | SUM | Two-step | IUT | ALRT | MAX | |
| 0.6 | 0.6 | 0 | 63 | 61 | 18 | 58 | 52 |
| 0.6 | 0.6 | 0.4 | 52 | 50 | 23 | 48 | 46 |
| 0.6 | 0.6 | 0.8 | 42 | 40 | 27 | 37 | 41 |
| 0.6 | 0.3 | 0 | 44 | 42 | 7 | 40 | 38 |
| 0.6 | 0.3 | 0.4 | 35 | 33 | 11 | 35 | 36 |
| 0.6 | 0.3 | 0.8 | 26 | 24 | 12 | 28 | 30 |
ρ Represents the correlation coefficient between the outcomes, not the correlation between the estimated regression coefficient.
Simulation results: power (%) with different sample sizes.
| True values | Sample size | ρ | Testing procedure | ||||||
|---|---|---|---|---|---|---|---|---|---|
| β1 | β2 | SUM | Two-step | IUT | ALRT | MAX | |||
| 0.6 | 0.6 | 100 | 100 | 0.4 | 52 | 50 | 23 | 48 | 46 |
| 0.6 | 0.6 | 50 | 150 | 0.4 | 41 | 37 | 15 | 49 | 30 |
| 0.6 | 0.6 | 25 | 175 | 0.4 | 30 | 26 | 5 | 51 | 16 |
| 0.6 | 0.3 | 100 | 100 | 0.4 | 35 | 33 | 11 | 35 | 36 |
| 0.6 | 0.3 | 50 | 150 | 0.4 | 27 | 26 | 7 | 23 | 21 |
| 0.6 | 0.3 | 25 | 175 | 0.4 | 18 | 18 | 3 | 23 | 9 |
| 0.6 | 0.6 | 100 | 100 | 0 | 63 | 61 | 18 | 58 | 52 |
| 0.6 | 0.6 | 50 | 150 | 0 | 55 | 50 | 14 | 55 | 37 |
| 0.6 | 0.6 | 25 | 175 | 0 | 36 | 34 | 9 | 54 | 17 |
| 0.6 | 0.3 | 100 | 100 | 0 | 44 | 42 | 7 | 40 | 38 |
| 0.6 | 0.3 | 150 | 50 | 0 | 37 | 33 | 7 | 47 | 24 |
| 0.6 | 0.3 | 175 | 25 | 0 | 24 | 22 | 5 | 48 | 10 |
Real data analysis: association between log(GFR) and 14 SNPs.
| SNP | Minor | Raw | Bonferroni | Sidak | Hochberg | FDR |
|---|---|---|---|---|---|---|
| rs307806 | A | 0.01071 | 0.1499 | 0.1399 | 0.1499 | 0.1499 |
| rs2279622 | T | 0.03383 | 0.4736 | 0.3823 | 0.4398 | 0.1939 |
| rs4693614 | G | 0.06319 | 0.8846 | 0.5990 | 0.6924 | 0.1939 |
| rs11715496 | A | 0.06702 | 0.9382 | 0.6214 | 0.6924 | 0.1939 |
| rs8042694 | G | 0.06924 | 0.9693 | 0.6338 | 0.6924 | 0.1939 |
| rs2259458 | T | 0.22639 | 1.0000 | 0.9725 | 0.9791 | 0.4245 |
| rs3824935 | T | 0.23555 | 1.0000 | 0.9767 | 0.9791 | 0.4245 |
| rs2027440 | C | 0.24256 | 1.0000 | 0.9795 | 0.9791 | 0.4245 |
| rs2276768 | T | 0.30994 | 1.0000 | 0.9944 | 0.9791 | 0.4821 |
| rs10497435 | C | 0.44626 | 1.0000 | 0.9997 | 0.9791 | 0.6248 |
| rs3814995 | T | 0.52058 | 1.0000 | 1.0000 | 0.9791 | 0.6626 |
| rs2705897 | T | 0.61445 | 1.0000 | 1.0000 | 0.9791 | 0.7169 |
| rs7844961 | T | 0.73593 | 1.0000 | 1.0000 | 0.9791 | 0.7925 |
| rs4900312 | A | 0.97914 | 1.0000 | 1.0000 | 0.9791 | 0.9791 |