| Literature DB >> 32790026 |
James M S Wason1,2, David S Robertson2.
Abstract
Multi-arm trials are an efficient way of simultaneously testing several experimental treatments against a shared control group. As well as reducing the sample size required compared to running each trial separately, they have important administrative and logistical advantages. There has been debate over whether multi-arm trials should correct for the fact that multiple null hypotheses are tested within the same experiment. Previous opinions have ranged from no correction is required, to a stringent correction (controlling the probability of making at least one type I error) being needed, with regulators arguing the latter for confirmatory settings. In this article, we propose that controlling the false-discovery rate (FDR) is a suitable compromise, with an appealing interpretation in multi-arm clinical trials. We investigate the properties of the different correction methods in terms of the positive and negative predictive value (respectively how confident we are that a recommended treatment is effective and that a non-recommended treatment is ineffective). The number of arms and proportion of treatments that are truly effective is varied. Controlling the FDR provides good properties. It retains the high positive predictive value of FWER correction in situations where a low proportion of treatments is effective. It also has a good negative predictive value in situations where a high proportion of treatments is effective. In a multi-arm trial testing distinct treatment arms, we recommend that sponsors and trialists consider use of the FDR.Entities:
Keywords: error rate; false discovery rate; family-wise error rate; multi-arm trials
Mesh:
Year: 2020 PMID: 32790026 PMCID: PMC7612170 DOI: 10.1002/pst.2059
Source DB: PubMed Journal: Pharm Stat ISSN: 1539-1604 Impact factor: 1.894
Summary of error rate and predictive values in terms of number of treatments within each category (recommended/non-recommended vs truly effective/truly ineffective)
| Recommended treatments | Non-recommended treatments | |
|---|---|---|
| Non-effective treatments | A | B |
| Effective treatments | C | D |
| Family-wise error rate | P(A > 0) | |
| FDR | E(A/[A + C])[ | |
| PPV | C/(A + C) | |
| NPV | B/(B + D) |
Abbreviations: FDR, false-discovery rate; NPV, negative predictive value; PPV, positive predictive value.
The FDR is defined as 0 when the denominator A + C is 0.
Figure 1Positive and negative predictive values estimated from the simulation study with three experimental arms as the proportion of treatments which are truly effective changes.
A, Positive predictive value. B, Negative predictive value
Figure 2Positive and negative predictive values estimated from the simulation study with five experimental arms as the proportion of treatments which are truly effective changes.
A, Positive predictive value. B, Negative predictive value
Statistical properties of different correction techniques for a selected number of values for the probability of each treatment being effective
| PPV | NPV | FWER | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
| Uncorrected | FWER control | FDR control (Corr) | Uncorrected | FWER control | FDR control (Corr) | Uncorrected | FWER control | FDR control (Corr) |
| 3 | .05 | 0.687 | 0.834 | 0.833 | 0.995 | 0.991 | 0.990 | 0.060 | 0.025 | 0.025 |
| 3 | .20 | 0.912 | 0.956 | 0.949 | 0.976 | 0.961 | 0.962 | 0.052 | 0.024 | 0.031 |
| 3 | .50 | 0.977 | 0.986 | 0.981 | 0.918 | 0.886 | 0.897 | 0.034 | 0.022 | 0.029 |
| 3 | .80 | 0.994 | 0.995 | 0.994 | 0.747 | 0.705 | 0.728 | 0.015 | 0.013 | 0.015 |
| 3 | .95 | 0.999 | 0.999 | 0.999 | 0.392 | 0.361 | 0.380 | 0.004 | 0.004 | 0.004 |
| 5 | .05 | 0.709 | 0.884 | 0.883 | 0.995 | 0.988 | 0.988 | 0.089 | 0.026 | 0.027 |
| 5 | .20 | 0.921 | 0.973 | 0.961 | 0.976 | 0.951 | 0.955 | 0.076 | 0.024 | 0.038 |
| 5 | .50 | 0.977 | 0.990 | 0.982 | 0.922 | 0.864 | 0.896 | 0.052 | 0.023 | 0.044 |
| 5 | .80 | 0.994 | 0.996 | 0.994 | 0.768 | 0.695 | 0.753 | 0.024 | 0.017 | 0.024 |
| 5 | .95 | 0.999 | 0.999 | 0.999 | 0.423 | 0.371 | 0.416 | 0.006 | 0.005 | 0.006 |
| 10 | .05 | 0.747 | 0.935 | 0.932 | 0.995 | 0.985 | 0.984 | 0.140 | 0.025 | 0.027 |
| 10 | .20 | 0.927 | 0.984 | 0.967 | 0.977 | 0.937 | 0.947 | 0.126 | 0.026 | 0.055 |
| 10 | .50 | 0.978 | 0.994 | 0.981 | 0.922 | 0.823 | 0.890 | 0.089 | 0.025 | 0.076 |
| 10 | .80 | 0.994 | 0.997 | 0.994 | 0.788 | 0.659 | 0.776 | 0.043 | 0.022 | 0.043 |
| 10 | .95 | 0.999 | 0.999 | 0.999 | 0.474 | 0.378 | 0.470 | 0.012 | 0.010 | 0.012 |
Abbreviations: FDR, false-discovery rate; FWER, family-wise error rate; K, number of experimental arms; NPV, negative predictive value; P, probability of each experimental arm being effective; PPV, positive predictive value.