| Literature DB >> 28039681 |
Dereje W Gudicha1, Verena D Schmittmann1, Jeroen K Vermunt2.
Abstract
This paper discusses power and sample-size computation for likelihood ratio and Wald testing of the significance of covariate effects in latent class models. For both tests, asymptotic distributions can be used; that is, the test statistic can be assumed to follow a central Chi-square under the null hypothesis and a non-central Chi-square under the alternative hypothesis. Power or sample-size computation using these asymptotic distributions requires specification of the non-centrality parameter, which in practice is rarely known. We show how to calculate this non-centrality parameter using a large simulated data set from the model under the alternative hypothesis. A simulation study is conducted evaluating the adequacy of the proposed power analysis methods, determining the key study design factor affecting the power level, and comparing the performance of the likelihood ratio and Wald test. The proposed power analysis methods turn out to perform very well for a broad range of conditions. Moreover, apart from effect size and sample size, an important factor affecting the power is the class separation, implying that when class separation is low, rather large sample sizes are needed to achieve a reasonable power level.Entities:
Keywords: Asymptotic distributions; Large simulated data set; Latent class; Likelihood ratio; Non-centrality parameter; Power analysis; Wald test
Mesh:
Year: 2017 PMID: 28039681 PMCID: PMC5628195 DOI: 10.3758/s13428-016-0825-y
Source DB: PubMed Journal: Behav Res Methods ISSN: 1554-351X
The computed entropy R-square for different design cells
| Equal class proportions | Unequal class proportions | ||||||
|---|---|---|---|---|---|---|---|
| Class-indicator | Class-indicator | ||||||
| associations | associations | ||||||
| Weak | Medium | Strong | Weak | Medium | Strong | ||
|
|
| .574 | .855 | .981 | .534 | .838 | .978 |
|
|
| .732 | .935 | .997 | .704 | .944 | .998 |
|
|
| .354 | .650 | .900 | .314 | .618 | .878 |
|
|
| .502 | .805 | .969 | .462 | .782 | .963 |
C= the number of classes; P= number of indicator variables. The entropy R-square values reported in this table pertain to the model with small effect sizes for the covariate effects, and these entropy R-square values slightly increase for the case when we have larger effect sizes
The power of the Wald and the likelihood ratio test to reject the null hypothesis that covariate has no effect on class membership in the two-class latent class model; the case of equal class proportions
|
|
|
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Effect | Class-indicator | Class-indicator | Class-indicator | |||||||
| size | associations | associations | associations | |||||||
| Weak | Medium | Strong | Weak | Medium | Strong | Weak | Medium | Strong | ||
| Six indicator variables | ||||||||||
| Small | Wald | .125 | .164 | .181 | .242 | .338 | .379 | .429 | .587 | .645 |
| LR | .126 | .166 | .180 | .245 | .343 | .377 | .434 | .594 | .645 | |
| Medium | Wald | .269 | .363 | .408 | .546 | .721 | .779 | .835 | .945 | .971 |
| LR | .260 | .369 | .411 | .548 | .729 | .784 | .836 | .953 | .973 | |
| Large | Wald | .702 | .868 | .913 | .976 | .998 | 1 | 1 | 1 | 1 |
| LR | .743 | .885 | .923 | .985 | .998 | 1 | 1 | 1 | 1 | |
| Ten indicator variables | ||||||||||
| Small | Wald | .147 | .177 | .184 | .297 | .369 | .385 | .523 | .633 | .655 |
| LR | .151 | .176 | .181 | .307 | .367 | .380 | .539 | .63 | .647 | |
| Medium | Wald | .319 | .397 | .412 | .653 | .766 | .786 | .914 | .967 | .974 |
| LR | .315 | .402 | .422 | .647 | .773 | .796 | .91 | .969 | .976 | |
| Large | Wald | .812 | .903 | .917 | .994 | .999 | .999 | 1 | 1 | 1 |
| LR | .837 | .918 | .9309 | .996 | .999 | .999 | 1 | 1 | 1 | |
The power values reported in this table are obtained by assuming theoretical Chi-square distributions for both the Wald and the likelihood ratio test statistics, for which the non-centrality parameter of the non-central Chi-square is approximated using a large simulated data set
The power of the Wald and the likelihood ratio test to reject the null hypothesis that the covariate has no effect on class membership in the three-class latent class model; the case of equal class proportions
|
|
|
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Effect size | Class-indicator associations | Class-indicator associations | Class-indicator associations | |||||||
| Weak | Medium | Strong | Weak | Medium | Strong | Weak | Medium | Strong | ||
| Six indicator variables | ||||||||||
| Small | Wald | .081 | .106 | .125 | .131 | .200 | .252 | .222 | .365 | .464 |
| LR | .080 | .108 | .126 | .130 | .206 | .255 | .221 | .377 | .471 | |
| Medium | Wald | .135 | .214 | .272 | .281 | .478 | .599 | .517 | .789 | .894 |
| LR | .140 | .215 | .272 | .295 | .48 | .600 | .540 | .792 | .894 | |
| Large | Wald | .365 | .642 | .779 | .752 | .967 | .994 | .968 | 1 | 1 |
| LR | .436 | .686 | .810 | .837 | .978 | .996 | .989 | 1 | 1 | |
| Ten indicator variables | ||||||||||
| Small | Wald | .089 | .118 | .130 | .155 | .233 | .265 | .272 | .430 | .49 |
| LR | .092 | .119 | .133 | .163 | .236 | .274 | .289 | .436 | .504 | |
| Medium | Wald | .163 | .252 | .287 | .353 | .559 | .628 | .632 | .864 | .913 |
| LR | .178 | .263 | .290 | .391 | .583 | .632 | .686 | .882 | .915 | |
| Large | Wald | .471 | .738 | .807 | .871 | .989 | .996 | .994 | 1 | 1 |
| LR | .571 | .772 | .823 | .938 | .993 | .997 | .999 | 1 | 1 | |
The power values reported in this table are obtained by assuming theoretical Chi-square distributions for both the Wald and the likelihood ratio test statistics, for which the non-centrality parameter of the non-central Chi-square is approximated using a large simulated data set
The power of the Wald and the likelihood ratio test to reject the null hypothesis that the covariate has no effect on class membership; the case of unequal class proportions, and six indicator variables
|
|
|
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Effect size | Class-indicator associations | Class-indicator associations | Class-indicator associations | |||||||
| Weak | Medium | Strong | Weak | Medium | Strong | Weak | Medium | Strong | ||
| Two-class model | ||||||||||
| Small | Wald | .102 | .133 | .148 | .183 | .263 | .299 | .319 | .465 | .525 |
| LR | .103 | .136 | .153 | .185 | .268 | .312 | .322 | .475 | .547 | |
| Medium | Wald | .195 | .283 | .322 | .411 | .590 | .658 | .688 | .872 | .918 |
| LR | .197 | .282 | .331 | .414 | .590 | .674 | .693 | .871 | .926 | |
| Large | Wald | .549 | .761 | .826 | .909 | .988 | .996 | .995 | 1 | 1 |
| LR | .590 | .783 | .844 | .933 | .991 | .997 | .998 | 1 | 1 | |
| Three-class model | ||||||||||
| Small | Wald | .077 | .100 | .120 | .120 | .185 | .238 | .198 | .334 | .439 |
| LR | .076 | .101 | .121 | .119 | .188 | .242 | .197 | 0.34 | .447 | |
| Medium | Wald | .125 | .197 | .257 | .253 | .439 | .570 | .467 | .746 | .873 |
| LR | .127 | .208 | .267 | .257 | .465 | .593 | .474 | .775 | .889 | |
| Large | Wald | .337 | .600 | .751 | .712 | .951 | .990 | .945 | .999 | 1 |
| LR | .387 | .641 | .785 | .782 | .966 | .994 | .977 | 1 | 1 | |
The power values reported in this table are obtained by assuming theoretical chi-square distributions for both the Wald and the likelihood ratio test statistics, for which the non-centrality parameter of the non-central Chi-square is approximated using a large simulated data set
Sample-size requirements for Wald statistic in testing the covariate effect on class membership given specified power levels, class-indicator associations, number of indicator variables, number of classes, class proportions, and effect sizes
|
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|---|
| Effect size | Class-indicator | Class-indicator | Class-indicator | ||||||
| associations | associations | associations | |||||||
| Weak | Medium | Strong | Weak | Medium | Strong | Weak | Medium | Strong | |
| Two-class model with equal class proportions and six indicator variables | |||||||||
| Small | 2473 | 1652 | 1434 | 3312 | 2210 | 1925 | 4097 | 2734 | 2380 |
| Medium | 911 | 606 | 527 | 1210 | 811 | 705 | 1509 | 1003 | 872 |
| Large | 253 | 165 | 143 | 338 | 221 | 191 | 418 | 273 | 236 |
| Two-class model with equal class proportions and ten indicator variables | |||||||||
| Small | 1929 | 1485 | 1412 | 2582 | 1988 | 1891 | 3193 | 2458 | 2338 |
| Medium | 709 | 544 | 518 | 949 | 729 | 693 | 1173 | 901 | 857 |
| Large | 194 | 148 | 140 | 260 | 198 | 188 | 321 | 245 | 232 |
| Two-class model with unequal class proportions and six indicator variables | |||||||||
| Small | 3544 | 2241 | 1916 | 4745 | 3000 | 2566 | 5868 | 3710 | 3173 |
| Medium | 1306 | 811 | 700 | 1749 | 1098 | 937 | 2163 | 1357 | 1159 |
| Large | 362 | 221 | 187 | 484 | 295 | 250 | 599 | 365 | 310 |
| Three-class model with equal class proportions and six indicator variables | |||||||||
| Small | 4922 | 2785 | 2120 | 6464 | 3657 | 2786 | 7888 | 4463 | 3400 |
| Medium | 1869 | 1025 | 777 | 2454 | 1347 | 1020 | 2995 | 1644 | 1245 |
| Large | 558 | 283 | 210 | 733 | 372 | 276 | 895 | 454 | 337 |
Theoretical versus empirical (H 1-simulated) power values of the likelihood ratio test of the covariate effect on class membership in design conditions of interest
|
|
| |||||
|---|---|---|---|---|---|---|
| Class-indicator | Class-indicator | |||||
| associations | associations | |||||
| Weak | Medium | Strong | Weak | Medium | Strong | |
| Two-class model with six indicator variables | ||||||
| Wald theoretical | .125 | .164 | .181 | .429 | .587 | .645 |
| Wald empirical | .131 | .156 | .176 | .429 | .584 | .648 |
| LR theoretical | .126 | .166 | .180 | .434 | .594 | .645 |
| LR empirical | .138 | .177 | .182 | .432 | .58 | .648 |
| Two-class model with ten indicator variables | ||||||
| Wald theoretical | .147 | .177 | .184 | .523 | .633 | .655 |
| Wald empirical | .138 | .175 | .196 | .513 | .632 | .652 |
| LR theoretical | .151 | .176 | .181 | .539 | .63 | .647 |
| LR empirical | .150 | .179 | .189 | .537 | .638 | .665 |
| Three-class model with six indicator variables | ||||||
| Wald theoretical | .081 | .106 | .125 | .222 | .365 | .464 |
| Wald empirical | .187 | .134 | .123 | .223 | .368 | .454 |
| LR theoretical | .08 | .108 | .126 | .221 | .377 | .471 |
| LR empirical | .238 | .146 | .134 | .267 | .374 | .456 |
| Three-class model with ten indicator variables | ||||||
| Wald theoretical | .089 | .118 | .130 | .272 | .430 | .490 |
| Wald empirical | .169 | .118 | .127 | .283 | .426 | .508 |
| LR theoretical | .092 | .119 | .133 | .289 | .436 | .504 |
| LR empirical | .161 | .133 | .134 | .286 | .443 | .493 |
The power values reported in this table are for the study design conditions with small effect size and equal class proportions
Type I error rates for the Wald and LR tests
| Sample | Test | Class-indicator associations | ||
|---|---|---|---|---|
| Size | Statistic | Weak | Medium | Strong |
| 200 | Wald | .106 | .077 | .063 |
| LR | .204 | .079 | .062 | |
| 500 | Wald | .094 | .072 | .063 |
| LR | .118 | .064 | .056 | |
| 1000 | Wald | .08 | .069 | .061 |
| LR | .088 | .068 | .052 | |
The type I error rates reported in this table pertain to the three-class model with six indicator variables and equal class size