| Literature DB >> 30298671 |
Martin Happ1, Arne C Bathke1,2, Edgar Brunner1,3.
Abstract
There are many different proposed procedures for sample size planning for the Wilcoxon-Mann-Whitney test at given type-I and type-II error rates α and β, respectively. Most methods assume very specific models or types of data to simplify calculations (eg, ordered categorical or metric data, location shift alternatives, etc). We present a unified approach that covers metric data with and without ties, count data, ordered categorical data, and even dichotomous data. For that, we calculate the unknown theoretical quantities such as the variances under the null and relevant alternative hypothesis by considering the following "synthetic data" approach. We evaluate data whose empirical distribution functions match the theoretical distribution functions involved in the computations of the unknown theoretical quantities. Then, well-known relations for the ranks of the data are used for the calculations. In addition to computing the necessary sample size N for a fixed allocation proportion t = n1 /N, where n1 is the sample size in the first group and N = n1 + n2 is the total sample size, we provide an interval for the optimal allocation rate t, which minimizes the total sample size N. It turns out that, for certain distributions, a balanced design is optimal. We give a characterization of such distributions. Furthermore, we show that the optimal choice of t depends on the ratio of the two variances, which determine the variance of the Wilcoxon-Mann-Whitney statistic under the alternative. This is different from an optimal sample size allocation in case of the normal distribution model.Entities:
Keywords: Wilcoxon-Mann-Whitney test; nonparametric relative effect; nonparametric statistics; optimal design; rank-based inference; sample size planning
Mesh:
Substances:
Year: 2018 PMID: 30298671 PMCID: PMC6491996 DOI: 10.1002/sim.7983
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Number of seizures for 28 subjects from the advance information X 1, ∼F 1(x), k = 1,…,28, and for the relevant effect F 2(x) = F 1(x/q), where q = 0.5 denotes the percentage of the relevant reduction of seizures to be detected. This means X 2, = [q·X 1, ]∼F 2(x), where [u] denotes the largest integer ≤u
| Number of counts | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Advance Information | |||||||||||||||
|
| 3, | 3, | 5, | 4, | 21, | 7, | 2, | 12, | 5, | 0, | 22, | 4, | 2, | 12 | |
| 9, | 5, | 3, | 29, | 5, | 7, | 4, | 4, | 5, | 8, | 25, | 1, | 2, | 12 | ||
| Relevant Alternative | |||||||||||||||
|
| 1, | 1, | 2, | 2, | 10, | 3, | 1, | 6, | 2, | 0, | 11, | 2, | 1, | 6 | |
| 4, | 2, | 1, | 14, | 2, | 3, | 2, | 2, | 2, | 4, | 12, | 0, | 1, | 6 | ||
Power simulation for the number of seizures
| Method | Sample Sizes | Total Sample Size | Power |
|---|---|---|---|
| Balanced | 24/24 | 48 | 0.802 |
| Unbalanced | 23/24 | 47 | 0.7956 |
| Noether | 26/26 | 52 | 0.8417 |
Number of rats with defect score 0, 1, 2, and 3
| Defect Score | ||||
|---|---|---|---|---|
| 0 | 1 | 2 | 3 | |
| Substance 1 | 64 | 12 | 4 | 0 |
| Substance 2 | 48 | 25 | 6 | 1 |
Power simulation for the nasal mucosa data
| Method | Sample Sizes | Total Sample Size | Power |
|---|---|---|---|
| Balanced | 85/85 | 170 | 0.8027 |
| Unbalanced | 83/87 | 170 | 0.7999 |
| Noether | 134/134 | 268 | 0.9417 |
| Tang | 86/86 | 172 | 0.8045 |
Relative kidney weights [‰] for 16 male Wistar rats
| Relative Kidney Weight [‰] | ||||||||
|---|---|---|---|---|---|---|---|---|
| Placebo | 6.62 | 6.65 | 5.78 | 5.63 | 6.05 | 6.48 | 5.50 | 5.37 |
| Treatment | 6.92 | 6.95 | 6.08 | 5.93 | 6.35 | 6.78 | 5.80 | 5.67 |
Power simulation for the relative kidney weights
| Method | Sample Sizes | Total Sample Size | Power |
|---|---|---|---|
| Balanced | 30/30 | 60 | 0.7976 |
| Unbalanced | 31/30 | 61 | 0.8123 |
| Noether | 32/32 | 64 | 0.8320 |
Relative frequencies for the albumin data from the work of Lachin6
| Normal | Micro | Macro | |
|---|---|---|---|
| Control | 0.85 | 0.10 | 0.05 |
| Experimental | 0.90 | 0.075 | 0.025 |
Power simulation for the albumin in urine data
| Method | Sample Sizes | Total Sample Size | Power |
|---|---|---|---|
| Balanced | 877/877 | 1754 | 0.9054 |
| Unbalanced | 909/842 | 1751 | 0.9033 |
| Lachin | 879/879 | 1758 | 0.9029 |
| Noether | 2667/2667 | 5334 | ≈1 |
Figure 1The graphic shows the values of the optimal allocation rate t 0 for different values of type‐I error rates α where the goal is to detect a relevant effect with at least 80% power. For the reference group, we used Beta(5,5) distributions, and for the treatment group, we assumed Beta(3,i), where i = 1,2,3. The red line represents i = 3 (relative effect p≈0.5); for the green curve, we have used i = 2 ( p≈0.65), and for the red line, i = 1 ( p≈0.84) [Colour figure can be viewed at wileyonlinelibrary.com]
Figure 2The graphic shows the values of the optimal allocation rate t 0 for different values of the power for α = 0.05. For the reference group, we used Beta(5,5) distributions, and for the treatment group, we assumed Beta(3,i), where i = 1,2,3. The red line represents i = 3 (relative effect p≈0.5); for the green curve, we have used i = 2 ( p≈0.65), and for the red line, i = 1 ( p≈0.84) [Colour figure can be viewed at wileyonlinelibrary.com]