| Literature DB >> 27034909 |
William D Johnson1, Jacob E Romer1.
Abstract
Testing the equality of percentiles (quantiles) between populations is an effective method for robust, nonparametric comparison, especially when the distributions are asymmetric or irregularly shaped. Unlike global nonparametric tests for homogeneity such as the Kolmogorv-Smirnov test, testing the equality of a set of percentiles (i.e., a percentile profile) yields an estimate of the location and extent of the differences between the populations along the entire domain. The Wald test using bootstrap estimates of variance of the order statistics provides a unified method for hypothesis testing of functions of the population percentiles. Simulation studies are conducted to show performance of the method under various scenarios and to give suggestions on its use. Several examples are given to illustrate some useful applications to real data.Entities:
Keywords: Bootstrap; Hypothesis Testing; Nonparametric Methods; Percentile Profiles; Wald Test
Year: 2016 PMID: 27034909 PMCID: PMC4811631 DOI: 10.4236/ojs.2016.61003
Source DB: PubMed Journal: Open J Stat ISSN: 2161-718X
Figure 1Kernel density estimates of log(CRP) of normal, prediabetes, and diabetes groups for males and females.
Error relative to empirical expected value of bootstrap variance-covariance estimates from N(0,1).
|
| |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 0.05 | 0.25 | 0.5 | (0.05, 0.25) | (0.05, 0.5) | (0.05, 0.75) | (0.05, 0.95) | (0.25, 0.5) | (0.25, 0.75) | |
| 20 | 0.172 | 0.217 | 0.210 | −0.110 | −0.110 | −0.109 | −0.427 | −0.032 | −0.053 |
| 40 | 0.310 | 0.168 | 0.143 | −0.044 | −0.047 | −0.035 | −0.124 | −0.026 | −0.018 |
| 60 | 0.281 | 0.137 | 0.127 | −0.031 | −0.017 | −0.020 | −0.038 | −0.019 | −0.019 |
| 80 | 0.242 | 0.130 | 0.108 | −0.023 | −0.022 | −0.027 | −0.016 | −0.015 | −0.011 |
| 100 | 0.226 | 0.108 | 0.104 | −0.023 | −0.009 | −0.007 | −0.026 | −0.014 | −0.004 |
| 150 | 0.193 | 0.098 | 0.081 | −0.016 | −0.011 | −0.001 | −0.001 | −0.010 | 0.000 |
| 200 | 0.157 | 0.085 | 0.079 | −0.015 | −0.008 | −0.013 | −0.016 | −0.004 | −0.005 |
| 250 | 0.151 | 0.081 | 0.078 | −0.001 | 0.005 | 0.008 | −0.002 | 0.002 | 0.004 |
Error relative to empirical expected value of bootstrap variance-covariance estimates from gamma (2,1).
|
| |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 0.05 | 0.25 | 0.5 | 0.95 | (0.05, 0.5) | (0.05, 0.95) | (0.25, 0.5) | (0.25, 0.75) | (0.25, 0.95) | |
| 20 | 0.210 | 0.231 | 0.243 | −0.583 | −0.079 | −0.456 | −0.013 | −0.026 | −0.441 |
| 40 | 0.227 | 0.170 | 0.171 | 0.242 | −0.036 | −0.086 | −0.015 | −0.012 | −0.081 |
| 60 | 0.202 | 0.144 | 0.144 | 0.476 | −0.030 | −0.040 | −0.012 | −0.021 | −0.020 |
| 80 | 0.187 | 0.123 | 0.125 | 0.391 | −0.025 | −0.014 | −0.009 | −0.010 | −0.012 |
| 100 | 0.185 | 0.119 | 0.098 | 0.310 | −0.016 | −0.028 | −0.013 | 0.000 | −0.005 |
| 150 | 0.147 | 0.100 | 0.097 | 0.229 | −0.018 | −0.012 | −0.004 | −0.002 | −0.010 |
| 200 | 0.143 | 0.085 | 0.081 | 0.208 | −0.010 | −0.020 | −0.002 | −0.001 | −0.004 |
| 250 | 0.137 | 0.078 | 0.067 | 0.185 | −0.006 | 0.038 | −0.006 | −0.001 | 0.007 |
Empirical Type I error (10,000 replicates) of Wald Test from N(0,1).
| Equally-spaced Percentiles | Unequally-spaced Percentiles | ||||||||
|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||
| 1 | 2 | 3 | 4 | 7 | 9 | ||||
| 20 | 0.0484 | 0.0453 | 0.0332 | 0.0286 | 0.0182 | 0.0142 | 0.1919 | 0.1175 | 0.0786 |
| 40 | 0.0488 | 0.0455 | 0.0378 | 0.0324 | 0.0194 | 0.015 | 0.0652 | 0.0471 | 0.0334 |
| 60 | 0.0445 | 0.0432 | 0.0402 | 0.0349 | 0.0236 | 0.017 | 0.0456 | 0.0425 | 0.0313 |
| 80 | 0.0475 | 0.0461 | 0.0473 | 0.0403 | 0.0284 | 0.0204 | 0.0448 | 0.0407 | 0.031 |
| 100 | 0.0486 | 0.0499 | 0.0464 | 0.0439 | 0.029 | 0.0216 | 0.0483 | 0.0437 | 0.0325 |
| 150 | 0.0512 | 0.0477 | 0.0479 | 0.0461 | 0.0331 | 0.0261 | 0.048 | 0.0454 | 0.0394 |
| 200 | 0.0523 | 0.0513 | 0.049 | 0.0452 | 0.0371 | 0.0314 | 0.0475 | 0.0453 | 0.0338 |
| 250 | 0.0499 | 0.0492 | 0.0497 | 0.0447 | 0.0377 | 0.0322 | 0.0525 | 0.051 | 0.0415 |
Empirical Type I error (10,000 replicates) of Wald Test with from gamma (shape = 2, scale = 1).
| Equally-spaced Percentiles | Unequally-spaced Percentiles | ||||||||
|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||
| 1 | 2 | 3 | 4 | 7 | 9 | ||||
| 20 | 0.0441 | 0.0405 | 0.034 | 0.0305 | 0.0187 | 0.0181 | 0.2111 | 0.1292 | 0.0844 |
| 40 | 0.0495 | 0.0457 | 0.0387 | 0.0321 | 0.0203 | 0.0147 | 0.0718 | 0.0512 | 0.0343 |
| 60 | 0.0473 | 0.0393 | 0.0381 | 0.0348 | 0.0233 | 0.0184 | 0.0483 | 0.0408 | 0.0288 |
| 80 | 0.051 | 0.0458 | 0.042 | 0.0358 | 0.0265 | 0.0173 | 0.0441 | 0.0401 | 0.0307 |
| 100 | 0.0494 | 0.0477 | 0.0435 | 0.0371 | 0.0279 | 0.02 | 0.0475 | 0.0388 | 0.0293 |
| 150 | 0.0482 | 0.0496 | 0.0443 | 0.044 | 0.0307 | 0.0248 | 0.0483 | 0.0435 | 0.0337 |
| 200 | 0.0507 | 0.0486 | 0.0453 | 0.0445 | 0.0352 | 0.0283 | 0.0501 | 0.048 | 0.0382 |
| 250 | 0.0517 | 0.0506 | 0.0471 | 0.0462 | 0.0377 | 0.0314 | 0.0508 | 0.0487 | 0.0382 |
Simulation results compared to Kolmogorov-Smirnov test at 80% power with equally-spaced percentiles.
| Equally-spaced Percentiles | ||||||||
|---|---|---|---|---|---|---|---|---|
|
| ||||||||
| Distributions | Power | 1 | 2 | 3 | 4 | 7 | 9 | |
| Gam (2, 1) & Gam (2.4, 1) | 241 | 0.7999 | 0.7112 | 0.7248 | 0.7159 | 0.6959 | 0.6466 | 0.6203 |
| Gam (2, 1) & Gam (2.2, 1.2) | 138 | 0.8024 | 0.718 | 0.7416 | 0.7293 | 0.712 | 0.6675 | 0.6335 |
| Gam (2, 1) & Gam (2.3, 1.3) | 70 | 0.7997 | 0.7425 | 0.7717 | 0.7665 | 0.7547 | 0.7069 | 0.6629 |
| Gam (2, 1) & Gam (2.4, 1.4) | 41 | 0.7936 | 0.7213 | 0.7568 | 0.7546 | 0.7357 | 0.6743 | 0.6148 |
| Gam (4, 1) & Gam (4.2, 1.2) | 104 | 0.8033 | 0.7249 | 0.7428 | 0.7351 | 0.7231 | 0.6701 | 0.6339 |
| Gam (4, 1) & Gam (4.3, 1.3) | 50 | 0.7945 | 0.7117 | 0.7442 | 0.7326 | 0.7178 | 0.653 | 0.6099 |
| Gam (2, 1) & N (2.2, 1) | 101 | 0.7971 | 0.7016 | 0.7763 | 0.8026 | 0.8091 | 0.7999 | 0.7671 |
| Gam (2, 1) & N (2.3, 1.2) | 74 | 0.8051 | 0.6943 | 0.7754 | 0.7897 | 0.796 | 0.7588 | 0.7264 |
| Gam (2, 1) & N (2.3, 1.2) | 102 | 0.7973 | 0.7697 | 0.7512 | 0.738 | 0.7124 | 0.6834 | 0.6454 |
| Average Power | 0.7992 | 0.7217 | 0.7539 | 0.7516 | 0.7396 | 0.6956 | 0.6571 | |
Simulation results compared to Kolmogorov-Smirnov test at 80% power with unequally-spaced percentiles.
| Distributions | Power | Unequal Percentiles
| |||
|---|---|---|---|---|---|
| Gam (2, 1) & Gam (2.4, 1) | 241 | 0.7999 | 0.4766 | 0.6911 | 0.6422 |
| Gam (2, 1) & Gam (2.2, 1.2) | 138 | 0.8024 | 0.464 | 0.7015 | 0.6682 |
| Gam (2, 1) & Gam (2.3, 1.3) | 70 | 0.7997 | 0.4929 | 0.745 | 0.6916 |
| Gam (2, 1) & Gam (2.4, 1.4) | 41 | 0.7936 | 0.5265 | 0.7404 | 0.6814 |
| Gam (4, 1) & Gam (4.2, 1.2) | 104 | 0.8033 | 0.4776 | 0.7133 | 0.6653 |
| Gam (4, 1) & Gam (4.3, 1.3) | 50 | 0.7945 | 0.4712 | 0.7174 | 0.6605 |
| Gam (2, 1) & N (2.2, 1) | 101 | 0.7971 | 0.3522 | 0.862 | 0.8335 |
| Gam (2, 1) & N (2.3, 1.2) | 74 | 0.8051 | 0.2832 | 0.8141 | 0.7796 |
| Gam (2, 1) & N (2.3, 1.2) | 102 | 0.7973 | 0.0809 | 0.774 | 0.7064 |
| Average Power | 0.7992 | 0.4028 | 0.7510 | 0.7032 | |
Estimates of C-reactive protein differences in percentiles between diabetes and normal groups for males and females.
| Percentile | Males | Females | ||||
|---|---|---|---|---|---|---|
|
| ||||||
| Std. Error | 95% Confidence Interval | Std. Error | 95% Confidence Interval | |||
| 5th | 0.03 | 0.0098 | 0.0035, 0.0565 | 0.02 | 0.0102 | −0.0075, 0.0475 |
| 10th | 0.03 | 0.0065 | 0.0124, 0.0476 | 0.03 | 0.0110 | 0.0004, 0.0596 |
| 25th | 0.05 | 0.0132 | 0.0145, 0.0855 | 0.04 | 0.0213 | −0.0175, 0.0975 |
| 50th | 0.13 | 0.0310 | 0.0466, 0.2134 | 0.11 | 0.0438 | −0.0080, 0.2280 |
| 75th | 0.27 | 0.0803 | 0.0535, 0.4865 | 0.26 | 0.1066 | −0.0274, 0.5474 |
| 90th | 0.46 | 0.1311 | 0.1067, 0.8133 | 0.36 | 0.1732 | −0.1069, 0.8269 |
| 95th | 0.07 | 0.2713 | −0.6613, 0.8013 | 0.29 | 0.5538 | −1.2028, 1.7828 |