| Literature DB >> 31526178 |
Robin Ristl1, Ludwig Hothorn2, Christian Ritz3, Martin Posch1.
Abstract
Motivated by small-sample studies in ophthalmology and dermatology, we study the problem of simultaneous inference for multiple endpoints in the presence of repeated observations. We propose a framework in which a generalized estimating equation model is fit for each endpoint marginally, taking into account dependencies within the same subject. The asymptotic joint normality of the stacked vector of marginal estimating equations is used to derive Wald-type simultaneous confidence intervals and hypothesis tests for multiple linear contrasts of regression coefficients of the multiple marginal models. The small sample performance of this approach is improved by a bias adjustment to the estimate of the joint covariance matrix of the regression coefficients from multiple models. As a further small sample improvement a multivariate t-distribution with appropriate degrees of freedom is specified as reference distribution. In addition, a generalized score test based on the stacked estimating equations is derived. Simulation results show strong control of the family-wise type I error rate for these methods even with small sample sizes and increased power compared to a Bonferroni-Holm multiplicity adjustment. Thus, the proposed methods are suitable to efficiently use the information from repeated observations of multiple endpoints in small-sample studies.Entities:
Keywords: Generalized estimating equations; dependent observations; multiple endpoints; multiple testing; small samples
Mesh:
Year: 2019 PMID: 31526178 PMCID: PMC7270726 DOI: 10.1177/0962280219873005
Source DB: PubMed Journal: Stat Methods Med Res ISSN: 0962-2802 Impact factor: 3.021
Unadjusted and adjusted p-values for maximum-type Wald tests in the retina disease example.
| Hypothesis | Unadjusted p | Holm | mmmGEE |
|---|---|---|---|
|
| <0.0001 | <0.0001 | <0.0001 |
|
| <0.0001 | <0.0001 | <0.0001 |
|
| 0.0001 | 0.0002 | 0.0002 |
|
| 0.0773 | 0.0828 | 0.0819 |
|
| 0.0414 | 0.0828 | 0.0819 |
|
| 0.0237 | 0.0710 | 0.0363 |
Adjusted p-values are calculated using the Bonferroni-Holm method (Holm) and the closed testing procedure applied to contrasts across multiple marginal generalized estimating equation (GEE) models (mmmGEE).
Type I error rate in the simulations with M = 3 endpoints and M = 12 endpoints with intermediate correlations when testing using the methods described in Section 3 or a Bonferroni test.
| Statistic | Type | Approximation | Bias adj. | Type I error rate (%) | |||
|---|---|---|---|---|---|---|---|
| Wald | Quadratic | Chi-squared | no | 9.8 | 6.8 | 41.0 | 15.1 |
| yes | 6.2 | 5.5 | 28.1 | 11.6 | |||
| F | no | 7.6 | 6.1 | 30.0 | 12.1 | ||
| yes | 4.7 | 4.9 | 19.1 | 9.0 | |||
| Scaled F | no | 6.4 | 5.6 | 8.6 | 6.7 | ||
| yes | 3.8 | 4.5 | 4.3 | 4.7 | |||
| Maximum | MVN | no | 8.1 | 6.1 | 10.4 | 6.8 | |
| yes | 5.2 | 5.1 | 6.2 | 5.5 | |||
| MVT | no | 6.5 | 5.6 | 7.6 | 6.0 | ||
| yes | 4.0 | 4.6 | 4.3 | 4.7 | |||
| Normal-Bonferroni | no | 7.3 | 5.5 | 8.4 | 5.3 | ||
| yes | 4.7 | 4.5 | 4.9 | 4.2 | |||
| t-Bonferroni | no | 5.8 | 5.0 | 5.6 | 4.5 | ||
| yes | 3.5 | 4.0 | 3.0 | 3.5 | |||
| Score | Quadratic | Chi-squared | no | 4.6 | 4.9 | 2.2 | 4.0 |
| Maximum | MVN | no | 5.0 | 5.0 | 4.6 | 4.9 | |
The tests are based on quadratic form or maximum-type Wald or score statistics. The reference distributions are chi-squared, F, scaled F, MVN or MVT. All Wald tests were performed with and without bias adjustment (column 'Bias adj.'). The considered sample sizes were K = 40 and K = 100. The results are based on 105 simulation runs. MVN: multivariate normal; MVT: multivariate t.
Power to reject with selected testing approaches that control the type I error rate (see Table 2 for details).
| Statistic | Type | Approx. | Bias adj. | Power (%) | |||||
|---|---|---|---|---|---|---|---|---|---|
| K = 40 | K = 100 | ||||||||
| a | b | c | a | b | c | ||||
| Wald | Quadratic | Scaled F | yes | 67.5 | 83.7 | 63.1 | 70.5 | 86.8 | 70.5 |
| Wald | Maximum | MVT | yes | 77.7 | 71.2 | 53.7 | 78.8 | 72.9 | 57.8 |
| Wald | Maximum | t-Bonferroni | yes | 75.5 | 68.6 | 50.8 | 76.9 | 70.9 | 55.5 |
| Score | Quadratic | Chi-squared | no | 72.1 | 86.7 | 74.5 | 71.9 | 87.7 | 74.2 |
| Score | Maximum | MVN | no | 80.7 | 74.3 | 64.7 | 79.7 | 73.9 | 61.6 |
The power was calculated for sample sizes K = 40 and K = 100 in three different scenarios. In scenario (a), there was an effect in all three endpoints (), in scenario (b) there was an effect in endpoints 2 and 3 (), and in scenario (c) there was an effect in endpoint 3 only (). See text for the exact values of non-zero coefficients. The results are based on 105 simulation runs. MVN: multivariate normal; MVT: multivariate t.
Power of closed testing procedures to reject , at least one elementary hypothesis (any H), or all elementary hypotheses.
| K | Statistic | Type | Approx. | Bias adj. |
|
|
| any | all |
|---|---|---|---|---|---|---|---|---|---|
| 40 | Wald | Quadratic | scaled F | yes | 53.1 | 54.0 | 54.3 | 67.1 | 41.4 |
| 40 | Wald | Maximum | MVT | yes | 59.0 | 59.4 | 60.6 | 77.7 | 42.2 |
| 40 | Wald | Maximum | t-Bonferroni | yes | 57.4 | 57.9 | 58.9 | 75.5 | 41.5 |
| 40 | Score | Quadratic | Chi-squared | no | 57.7 | 53.8 | 64.0 | 71.7 | 46.0 |
| 40 | Score | Maximum | MVN | no | 63.0 | 58.1 | 69.2 | 80.7 | 46.6 |
| 100 | Wald | Quadratic | Scaled F | yes | 54.3 | 56.3 | 57.8 | 70.1 | 43.1 |
| 100 | Wald | Maximum | MVT | yes | 58.8 | 60.8 | 62.8 | 78.8 | 43.2 |
| 100 | Wald | Maximum | t-Bonferroni | yes | 57.5 | 59.4 | 61.4 | 76.9 | 42.7 |
| 100 | Score | Quadratic | Chi-squared | no | 56.3 | 56.1 | 61.5 | 71.5 | 45.0 |
| 100 | Score | Maximum | MVN | no | 60.6 | 60.2 | 66.1 | 79.7 | 45.1 |
Results are shown for the simulation scenario (a) with an effect in all endpoints. The results are based on 105 simulation runs. MVN: multivariate normal; MVT: multivariate t.
Simultaneous coverage probability of nominal 95% simultaneous confidence intervals for for scenario (a) with sample sizes K = 40 and K = 100.
| Approximation | Bias adj. | ||
|---|---|---|---|
| MVN | no | 91.9 | 93.7 |
| MVN | yes | 94.7 | 94.8 |
| MVT | no | 93.4 | 94.3 |
| MVT | yes | 95.8 | 95.2 |
The intervals were calculated based on an approximating MVN or MVT distribution, with and without bias adjustment for the covariance matrix estimate. The results are based on 105 simulation runs. MVN: multivariate normal; MVT: multivariate t.
Figure 1.Power to reject the global null hypothesis under an alternative with an effect in all M endpoints (solid lines) and type I error rate under H0 (dotted lines) for scenarios with K = 40 subjects and increasing number of endpoints. The correlation between marginal Wald or score statistics is approximately 0.25, 0.5 and 0.75 in the scenarios with low correlation, intermediate correlation and high correlation. The nominal level of 0.05 is indicated by a horizontal line. The studied tests are listed in the legend. The information in parentheses shows that the bias adjustment for the covariance matrix was applied to all tests using Wald statistics; furthermore, the reference distributions with abbreviations as in Table 2 are indicated. The results are based on 2 × 104 simulation runs.
|
|
|
|