| Literature DB >> 22712852 |
Justine Rochon1, Matthias Gondan, Meinhard Kieser.
Abstract
BACKGROUND: Student's two-sample t test is generally used for comparing the means of two independent samples, for example, two treatment arms. Under the null hypothesis, the t test assumes that the two samples arise from the same normally distributed population with unknown variance. Adequate control of the Type I error requires that the normality assumption holds, which is often examined by means of a preliminary Shapiro-Wilk test. The following two-stage procedure is widely accepted: If the preliminary test for normality is not significant, the t test is used; if the preliminary test rejects the null hypothesis of normality, a nonparametric test is applied in the main analysis.Entities:
Mesh:
Year: 2012 PMID: 22712852 PMCID: PMC3444333 DOI: 10.1186/1471-2288-12-81
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Left: Estimated Type I error probability of the two-sample test at = .05 after both samples had passed the Shapiro-Wilk test for normality (Strategy I with = .100, .050, .010, .005), and without pretest.—Right: Estimated Type I error of the test for samples that failed testing for normality
| | | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | | | | | | ||
| | .100 | .091 | .150 | .170 | .190 | .210 | .050 | .048 | .051 | .050 | .051 |
| | .050 | .079 | .127 | .153 | .168 | .188 | .052 | .052 | .050 | .045 | .053 |
| | .010 | .061 | .099 | .112 | .140 | .154 | .055 | .051 | .049 | .049 | .047 |
| | .005 | .060 | .085 | .108 | .127 | .144 | .060 | .047 | .052 | .049 | .048 |
| | Without pretest | .045 | .047 | .047 | .048 | .047 | .053 | .050 | .048 | .050 | .051 |
| | | | | | | | | | | ||
| | .100 | .043 | .044 | .039 | .039 | .036 | .075 | .055 | .052 | .051 | .049 |
| | .050 | .043 | .037 | .040 | .040 | .037 | .093 | .059 | .058 | .051 | .051 |
| | .010 | .049 | .050 | .046 | .045 | .041 | .168 | .111 | .074 | .060 | .057 |
| | .005 | .052 | .050 | .048 | .044 | .043 | .233 | .133 | .087 | .069 | .059 |
| | Without pretest | .058 | .047 | .052 | .047 | .050 | .050 | .050 | .052 | .048 | .049 |
| | | | | | | | | | | ||
| | .100 | .049 | .053 | .050 | .049 | .050 | .069 | .058 | .055 | .061 | .056 |
| | .050 | .049 | .050 | .050 | .053 | .046 | .069 | .063 | .062 | .064 | .059 |
| | .010 | .050 | .050 | .047 | .048 | .051 | .090 | .081 | .073 | .072 | .074 |
| | .005 | .047 | .047 | .050 | .054 | .050 | .093 | .085 | .084 | .081 | .073 |
| Without pretest | .051 | .053 | .049 | .053 | .050 | .054 | .047 | .047 | .049 | .049 | |
Figure 1 Estimated Type I error probability of the two-sample test at = .05 after both samples had passed the Shapiro-Wilk test for normality at = .100, .050, .010, .005 (conditional), and without pretest (unconditional). Samples of equal size from the (A) exponential, (B) uniform, and (C) normal distribution.
Left: Estimated Type I error probability of the two-sample test at = .05 for samples that passed testing for normality of the residuals (Strategy II with = .100, .050, .010, .005), and without pretest.—Right: Estimated Type I error for the test for samples that failed testing for normality
| | | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | | | | | | ||
| | .100 | .122 | .398 | .709 | N/A | N/A | .037 | .049 | .047 | .047 | .050 |
| | .050 | .096 | .317 | .611 | .839 | N/A | .034 | .046 | .048 | .050 | .050 |
| | .010 | .072 | .196 | .421 | .669 | .859 | .034 | .046 | .045 | .051 | .050 |
| | .005 | .064 | .162 | .347 | .583 | .792 | .036 | .040 | .047 | .044 | .048 |
| | Without pretest | .044 | .048 | .047 | .049 | .051 | .041 | .054 | .046 | .044 | .050 |
| | | | | | | | | | | ||
| | .100 | .065 | .108 | .196 | .333 | .529 | .027 | .024 | .029 | .041 | .045 |
| | .050 | .052 | .081 | .133 | .233 | .377 | .025 | .018 | .022 | .035 | .044 |
| | .010 | .051 | .057 | .076 | .116 | .184 | .036 | .011 | .013 | .022 | .031 |
| | .005 | .047 | .050 | .066 | .090 | .138 | .046 | .012 | .012 | .016 | .029 |
| | Without pretest | .051 | .048 | .049 | .050 | .050 | .043 | .053 | .047 | .048 | .050 |
| | | | | | | | | | | ||
| | .100 | .049 | .053 | .048 | .053 | .048 | .071 | .074 | .063 | .062 | .061 |
| | .050 | .051 | .052 | .051 | .052 | .053 | .085 | .079 | .071 | .064 | .067 |
| | .010 | .049 | .046 | .048 | .051 | .049 | .120 | .107 | .087 | .073 | .073 |
| | .005 | .045 | .051 | .050 | .049 | .051 | .153 | .107 | .090 | .083 | .079 |
| Without pretest | .052 | .051 | .046 | .051 | .048 | .044 | .045 | .051 | .044 | .050 | |
Note: N/A, not available because nearly all samples were detected to deviate significantly from the normal distribution.
Figure 2 Estimated Type I error probability of the two-sample test at = .05 after the residuals had passed the Shapiro-Wilk test for normality at = .100, .050, .010, .005 (conditional), and without pretest (unconditional). Samples of equal size from the (A) exponential, (B) uniform, and (C) normal distribution.
Estimated Type I error probability of the two-stage procedure (Student’s test or Mann-Whitney’s test depending on preliminary Shapiro-Wilk test for normality) for different sample sizes and
| | | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | | | | | | ||
| | .100 | .050 | .050 | .048 | .049 | .048 | .053 | .050 | .048 | .049 | .048 |
| | .050 | .053 | .050 | .048 | .049 | .050 | .055 | .052 | .048 | .049 | .050 |
| | .010 | .054 | .054 | .048 | .049 | .050 | .054 | .054 | .048 | .049 | .050 |
| | .005 | .050 | .056 | .050 | .048 | .049 | .050 | .055 | .049 | .048 | .049 |
| | | | | | | | | | | ||
| | .100 | .049 | .050 | .047 | .049 | .049 | .052 | .051 | .048 | .049 | .049 |
| | .050 | .051 | .050 | .050 | .049 | .048 | .053 | .051 | .051 | .050 | .048 |
| | .010 | .051 | .050 | .051 | .050 | .051 | .051 | .051 | .052 | .051 | .051 |
| | .005 | .052 | .049 | .049 | .051 | .050 | .052 | .050 | .050 | .052 | .050 |
| | | | | | | | | | | ||
| | .100 | .050 | .052 | .052 | .051 | .052 | .051 | .052 | .053 | .051 | .051 |
| | .050 | .051 | .051 | .051 | .051 | .051 | .051 | .051 | .051 | .051 | .050 |
| | .010 | .049 | .051 | .051 | .051 | .051 | .050 | .051 | .051 | .051 | .051 |
| .005 | .051 | .050 | .049 | .050 | .050 | .051 | .050 | .049 | .050 | .050 | |
Note: Type I error of the unconditional application of Student’s t test and Mann-Whitney’s U test is shown in Table 1 and Table 2.
Estimated power of the two-stage procedure for different sample sizes and
| | | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | | | |||||||
| | .224 | .443 | .612 | .743 | .835 | | | | | | |
| | .248 | .446 | .609 | .743 | .835 | .259 | .449 | .609 | .743 | .835 | |
| | .254 | .451 | .610 | .744 | .835 | .261 | .454 | .610 | .744 | .835 | |
| | .270 | .467 | .615 | .744 | .835 | .271 | .466 | .615 | .744 | .835 | |
| | .264 | .482 | .615 | .743 | .837 | .265 | .475 | .614 | .743 | .837 | |
| | .240 | .518 | .721 | .847 | .919 | | | | | | |
| | | | | | |||||||
| | .256 | .512 | .686 | .813 | .892 | | | | | | |
| | .287 | .537 | .703 | .817 | .894 | .290 | .529 | .697 | .816 | .895 | |
| | .292 | .550 | .714 | .821 | .894 | .294 | .542 | .702 | .817 | .894 | |
| | .295 | .558 | .740 | .848 | .908 | .295 | .559 | .729 | .835 | .900 | |
| | .294 | .561 | .749 | .855 | .915 | .295 | .563 | .742 | .842 | .906 | |
| | .292 | .561 | .748 | .867 | .930 | | | | | | |
| | | | | | |||||||
| | .215 | .434 | .600 | .731 | .824 | | | | | | |
| | .244 | .455 | .626 | .750 | .840 | .249 | .459 | .631 | .754 | .843 | |
| | .245 | .456 | .625 | .753 | .842 | .248 | .459 | .628 | .755 | .844 | |
| | .244 | .455 | .629 | .756 | .842 | .245 | .455 | .630 | .756 | .842 | |
| | .245 | .458 | .627 | .751 | .845 | .246 | .458 | .628 | .752 | .845 | |
| .247 | .456 | .627 | .754 | .844 | |||||||