| Literature DB >> 30034027 |
Huub Hoofs1,2, Rens van de Schoot3,4, Nicole W H Jansen1, IJmert Kant1.
Abstract
Bayesian confirmatory factor analysis (CFA) offers an alternative to frequentist CFA based on, for example, maximum likelihood estimation for the assessment of reliability and validity of educational and psychological measures. For increasing sample sizes, however, the applicability of current fit statistics evaluating model fit within Bayesian CFA is limited. We propose, therefore, a Bayesian variant of the root mean square error of approximation (RMSEA), the BRMSEA. A simulation study was performed with variations in model misspecification, factor loading magnitude, number of indicators, number of factors, and sample size. This showed that the 90% posterior probability interval of the BRMSEA is valid for evaluating model fit in large samples (N≥ 1,000), using cutoff values for the lower (<.05) and upper limit (<.08) as guideline. An empirical illustration further shows the advantage of the BRMSEA in large sample Bayesian CFA models. In conclusion, it can be stated that the BRMSEA is well suited to evaluate model fit in large sample Bayesian CFA models by taking sample size and model complexity into account.Entities:
Keywords: Bayesian procedures; factor analysis; model fit; simulation; validity
Year: 2017 PMID: 30034027 PMCID: PMC6041765 DOI: 10.1177/0013164417709314
Source DB: PubMed Journal: Educ Psychol Meas ISSN: 0013-1644 Impact factor: 2.821
Figure 1.The different specifications for the population factor models used to generate the population covariance matrices for each condition.
Note. Factor loadings (λ = .3, .5, or .7; λ = .07, .35) and number of indicators (6 or 12) varied between the conditions. Residuals (θ) were computed on the basis of the factor loadings (1 −λ2). Intercepts and factor means are not displayed as they were estimated to be zero in all models. Models A through C were only used in the first section and Models F1 and F2 only in the second section. Model A was the reference model in the first section and Model D was the reference model in the second section.
Population Parameters (Root Mean Square Error of Approximation) of Each Condition for the Two Different Reference Models, on the Basis of the Number of Indicators, Magnitude of Factor Loadings (Rows), and Specification (Columns).
| Indicators = 6 | Indicators = 12 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| A | B | C | D | E | A | B | C | D | E | F1 | F2 | |
| Reference model: One-factor (Specification A) | ||||||||||||
| .5 | .000 | .034 | .070 | .089 | .106 | .000 | .017 | .042 | .070 | .091 | — | — |
| .7 | .000 | .052 | .103 | .204 | .234 | .000 | .025 | .063 | .149 | .188 | — | — |
| Reference model: Two-factor (Specification D) | ||||||||||||
| .7 | — | — | — | — | — | — | — | — | .000 | .141 | .013 | .061 |
Note. Specification A is a common one-factor model. Specification B is identical to Specification A except for the inclusion of a small error covariance (.1) between the first and second indicators. Specification C is a one-factor model with small error covariances (.1) between each subsequent pair of indicators. Specification D is a two-factor model with a covariance of .5 between the two factors. Specification E is a three-factor model with a covariance of .25 between the factors. Model F1 was similar to Model D except the inclusion of a small (.07) cross-loadings between the sixth indicator and the second factor and the seventh indicator and the first factor. Model F2 was similar to Model F1 except that the two cross-loadings were moderate (.35). For Specifications D through F2 the number of indicators is equally distributed across the factors. Residuals were computed by subtracting the squared factor loading from 1. Intercepts and factor means were estimated to be zero in all models.
Figure 2.Mean values of the 500 replications for the 90% posterior probability interval (PPI) of the Bayesian root mean square error of approximation (BRMSEA) and the posterior predictive p value (ppp) for the Bayesian confirmatory factor analysis (CFA) models, both with informative and diffuse priors, and for the 90% confidence interval (CI) RMSEA and p value for the frequentist CFA models of the first section, with the one-factor model as reference (ref) model, for each sample size (as ordinal variable) and specification condition in which the magnitude of the factor loadings was .5. Cutoff values for the BRMSEA and RMSEA (.05 for the lower limit and .08 for the upper limit) and for the posterior predictive p value and p value (.05) are indicated with the dashed lines. Values within these cutoff points have (blue) circles, those outside (red) squares.
Figure 4.Mean values of the 500 replications for the 90% posterior probability interval (PPI) of the Bayesian root mean square error of approximation (BRMSEA) and the posterior predictive p value (ppp) for the Bayesian structural equation modeling (BSEM) models, both with informative, diffuse, and wrong priors, and for the 90% confidence interval (CI) RMSEA and p value for the frequentist CFA models of the second section, with the two-factor model as reference (ref) model, for each sample size (as ordinal variable) and specification condition. Cutoff values for the BRMSEA and RMSEA (.05 for the lower limit and .08 for the upper limit) and for the posterior predictive p value and p value (.05) are indicated with the dashed lines. Values within these cutoff points have (blue) circles, those outside (red) squares.
Proportion of Rejected Models With Six Indicators of the First Section, With the One-Factor Model as Reference Model, Using a Cutoff Point for the 90% Confidence Interval and 90% Posterior Probability Intervals of the Root Mean Square Error of Approximation (RMSEA) and BRMSEA for the Upper Limit of .08 and for the Lower Limit of .05 and of .05 for the Posterior Predictive p Value and p Value for the Bayesian Confirmatory Factor Analysis (CFA), With Diffuse Priors, and Frequentist (CFA).
| Factor loadings =
.5 | Factor loadings =
.7 | ||||||||
|---|---|---|---|---|---|---|---|---|---|
|
| Model | BRMSEA |
| RMSEA |
| BRMSEA |
| RMSEA |
|
| 50 | A (ref) | .92 | .01 | .90 | .09 | .90 | .01 | .90 | .10 |
| B | .93 | .01 | .91 | .11 | .92 | .01 | .91 | .13 | |
| C | .96 | .01 | .95 | .16 | .98 | .07 | .98 | .33 | |
| D | .98 | .02 | .96 | .20 | 1 | .51 | 1 | .84 | |
| E | .95 | .02 | .96 | .12 | 1 | .61 | 1 | .91 | |
| 100 | A (ref) | .72 | .01 | .80 | .07 | .67 | .01 | .80 | .07 |
| B | .78 | .01 | .83 | .11 | .79 | .03 | .87 | .17 | |
| C | .91 | .05 | .94 | .23 | .97 | .20 | .99 | .52 | |
| D | .96 | .11 | .97 | .39 | 1 | .95 | 1 | 1.00 | |
| E | .94 | .1 | .98 | .37 | 1 | .99 | 1 | 1 | |
| 250 | A (ref) | .08 | .00 | .36 | .06 | .07 | .00 | .34 | .07 |
| B | .20 | .02 | .57 | .18 | .38 | .08 | .75 | .35 | |
| C | .66 | .23 | .90 | .62 | .96 | .77 | 1.00 | .95 | |
| D | .84 | .51 | .97 | .81 | 1 | 1 | 1 | 1 | |
| E | .93 | .68 | 1.00 | .93 | 1 | 1 | 1 | 1 | |
| 500 | A (ref) | 0 | .01 | .03 | .08 | 0 | .01 | .50 | .08 |
| B | .01 | .08 | .18 | .31 | .13 | .31 | 1 | .65 | |
| C | .46 | .69 | .81 | .90 | .98 | 1.00 | 1 | 1 | |
| D | .81 | .93 | .97 | .99 | 1 | 1 | 1 | 1 | |
| E | .97 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| 1,000 | A (ref) | 0 | .00 | 0 | .07 | 0 | .00 | 0 | .06 |
| B | 0 | .22 | .00 | .58 | .02 | .74 | .20 | .93 | |
| C | .33 | .99 | .74 | 1.00 | 1.00 | 1 | 1 | 1 | |
| D | .89 | 1 | .99 | 1 | 1 | 1 | 1 | 1 | |
| E | 1.00 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| 5,000 | A (ref) | 0 | 0 | 0 | .06 | 0 | 0 | 0 | .06 |
| B | 0 | 1.00 | 0 | 1 | .20 | 1 | .10 | 1 | |
| C | 1 | 1 | 1.00 | 1 | 1 | 1 | 1 | 1 | |
| D | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| E | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| 10,000 | A (ref) | 0 | .00 | 0 | .04 | 0 | 0 | 0 | .05 |
| B | 0 | 1 | 0 | 1 | .33 | 1 | .16 | 1 | |
| C | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| D | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| E | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
Note. RMSEA = root mean square error of approximation; BRMSEA = Bayesian root mean square error of approximation; ppp = posterior predictive p value; p = p value; ref = reference model.
Proportion of Rejected Models With 12 Indicators of the First Section, With the One-Factor Model as Reference Model, Using a Cutoff Point for the 90% Confidence Interval and 90% Posterior Probability Intervals of the Root Mean Square Error of Approximation (RMSEA) and BRMSEA for the Upper Limit of .08 and for the Lower Limit of .05 and of .05 for the Posterior Predictive p Value and p Value for the Bayesian Confirmatory Factor Analysis (CFA), With Diffuse Priors, and Frequentist CFA.
| Factor loadings =
.5 | Factor loadings =
.7 | ||||||||
|---|---|---|---|---|---|---|---|---|---|
|
| Model | BRMSEA |
| RMSEA |
| BRMSEA |
| RMSEA |
|
| 50 | A (ref) | .00 | .04 | .82 | .21 | 0 | .03 | .82 | .22 |
| B | .00 | .04 | .84 | .22 | .00 | .05 | .85 | .23 | |
| C | .02 | .09 | .90 | .31 | .03 | .17 | .96 | .45 | |
| D | .04 | .19 | .96 | .51 | .74 | .94 | 1 | .99 | |
| E | .09 | .28 | .98 | .64 | .99 | 1.00 | 1 | 1 | |
| 100 | A (ref) | 0 | .02 | .25 | .10 | 0 | .02 | .26 | .10 |
| B | 0 | .02 | .30 | .13 | 0 | .04 | .35 | .17 | |
| C | 0 | .10 | .56 | .31 | .01 | .33 | .86 | .65 | |
| D | .02 | .42 | .87 | .71 | .97 | 1 | 1 | 1 | |
| E | .15 | .76 | .98 | .93 | 1 | 1 | 1 | 1 | |
| 250 | A (ref) | 0 | .01 | 0 | .08 | 0 | .01 | 0 | .08 |
| B | 0 | .03 | 0 | .13 | 0 | .07 | .00 | .23 | |
| C | 0 | .38 | .02 | .66 | 0 | .93 | .51 | .98 | |
| D | .01 | .97 | .73 | 1.00 | 1 | 1 | 1 | 1 | |
| E | .37 | 1 | 1.00 | 1 | 1 | 1 | 1 | 1 | |
| 500 | A (ref) | 0 | .01 | 0 | .06 | 0 | .00 | 0 | .05 |
| B | 0 | .07 | 0 | .18 | 0 | .23 | .00 | .42 | |
| C | 0 | .92 | .01 | .97 | .05 | 1 | .61 | 1 | |
| D | .27 | 1 | .88 | 1 | 1 | 1 | 1 | 1 | |
| E | .98 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| 1,000 | A (ref) | 0 | .02 | 0 | .06 | 0 | .02 | 0 | .06 |
| B | 0 | .20 | 0 | .41 | 0 | .63 | 0 | .84 | |
| C | 0 | 1 | .00 | 1 | .47 | 1 | .92 | 1 | |
| D | .88 | 1 | .99 | 1 | 1 | 1 | 1 | 1 | |
| E | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| 5,000 | A (ref) | 0 | .01 | 0 | .05 | 0 | .01 | 0 | .05 |
| B | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | |
| C | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | |
| D | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| E | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| 10,000 | A (ref) | 0 | .00 | 0 | .05 | 0 | .00 | 0 | .05 |
| B | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | |
| C | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | |
| D | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| E | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
Note. RMSEA = root mean square error of approximation; BRMSEA = Bayesian root mean square error of approximation; ppp = posterior predictive p value; p = p value; ref = reference model.
Proportion of Rejected Models of the Second Section, With the Two-Factor Model as Reference Model, Using a Cutoff Point for the 90% Confidence Interval and 90% Posterior Probability Intervals of the Root Mean Square Error of Approximation (RMSEA) and BRMSEA for the Upper Limit of .08 and for the Lower Limit of .05 and of .05 for the Posterior Predictive p Value and p Value for the Bayesian Confirmatory Factor Analysis (CFA), With Diffuse, Informative, and Wrong Priors, and Frequentist CFA.
| Bayesian CFA
(priors) | Frequentist CFA | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Diffuse | Conservative | Wrong | — | ||||||
|
| Model | BRMSEA |
| BRMSEA |
| BRMSEA |
| RMSEA |
|
| 50 | D (ref) | .00 | .03 | 0 | .01 | 0 | .11 | .82 | .21 |
| E | .67 | .90 | .45 | .90 | .93 | 1 | 1 | .98 | |
| F1 | .00 | .03 | 0 | .01 | 0 | .11 | .84 | .21 | |
| F2 | .03 | .12 | .00 | .10 | .02 | .40 | .94 | .45 | |
| 100 | D (ref) | 0 | .01 | 0 | .01 | 0 | .14 | .26 | .09 |
| E | .96 | 1 | .94 | 1 | 1 | 1 | 1 | 1 | |
| F1 | 0 | .01 | 0 | .01 | 0 | .15 | .30 | .11 | |
| F2 | .00 | .26 | .00 | .24 | .02 | .80 | .83 | .62 | |
| 250 | D (ref) | 0 | .02 | 0 | .01 | 0 | .38 | 0 | .09 |
| E | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| F1 | 0 | .03 | 0 | .01 | 0 | .46 | 0 | .12 | |
| F2 | 0 | .91 | 0 | .90 | .01 | 1 | .51 | .97 | |
| 500 | D (ref) | 0 | .01 | 0 | .01 | 0 | .50 | .00 | .05 |
| E | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| F1 | 0 | .03 | 0 | .03 | 0 | .64 | 0 | .10 | |
| F2 | .02 | 1 | .02 | 1 | .13 | 1 | .53 | 1 | |
| 1,000 | D (ref) | 0 | .02 | 0 | .02 | 0 | .30 | 0 | .06 |
| E | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| F1 | 0 | .08 | 0 | .08 | 0 | .61 | 0 | .23 | |
| F2 | .32 | 1 | .29 | 1 | .48 | 1 | .79 | 1 | |
| 5,000 | D (ref) | 0 | .02 | 0 | .02 | 0 | .04 | 0 | .04 |
| E | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| F1 | 0 | .87 | 0 | .88 | 0 | .95 | 0 | .96 | |
| F2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| 10,000 | D (ref) | 0 | .01 | 0 | .00 | 0 | .01 | 0 | .05 |
| E | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| F1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | |
| F2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
Note. RMSEA = root mean square error of approximation; BRMSEA = Bayesian root mean square error of approximation; ppp = posterior predictive p value; p = p value; ref = reference model.
Figure 3.Mean values of the 500 replications for the 90% posterior probability interval (PPI) of the Bayesian root mean square error of approximation (BRMSEA) and the posterior predictive p value (ppp) for the Bayesian confirmatory factor analysis (CFA) models, both with informative and diffuse priors, and for the 90% confidence interval (CI) RMSEA and p value for the frequentist CFA models of the first section, with the one-factor model as reference (ref) model, for each sample size (as ordinal variable) and specification condition in which the magnitude of the factor loadings was .7. Cutoff values for the BRMSEA and RMSEA (.05 for the lower limit and .08 for the upper limit) and for the posterior predictive p value and p value (.05) are indicated with the dashed lines. Values within these cutoff points have (blue) circles, those outside (red) squares.
Summary of Model Acceptance, for Each Model Specification and Magnitude of the Factor Loadings, Indicating If Models Were Commonly Accepted (+), rejected (−) or a Mixed Pattern Emerged (0) for the Conditions With a Large Sample Size (N≥ 1,000) With a Cutoff Point for the Lower Limit of .05 and for the Upper Limit of .08 for the 90% Posterior Probability Intervals of the BRMSEA and 90% Confidence Interval of the RMSEA and of .05 for the Posterior Predictive p Value and p Value.
| Bayesian confirmatory factor
analysis (CFA) | Frequentist confirmatory
factor analysis (CFA) | |||||||
|---|---|---|---|---|---|---|---|---|
| BRMSEA |
| RMSEA | ||||||
| .5 | .7 | .5 | .7 | .5 | .7 | .5 | .7 | |
| Reference model: One-factor (Specification A) | ||||||||
| A (ref) | + | + | + | + | + | + | + | + |
| B | + | + | 0 | − | + | 0/+[ | 0 | − |
| C | −/+[ | − | − | − | −/+[ | − | − | − |
| D | − | − | − | − | − | − | − | − |
| E | − | − | − | − | − | − | − | − |
| Reference model: Two-factor (Specification D) | ||||||||
| D (ref) | + | +/0b | + | + | ||||
| E | − | − | − | − | ||||
| F1 | + | 0 | + | 0 | ||||
| F2 | − | − | − | − | ||||
| Conclusion ( | Using the | With increasing sample sizes
the | Using the | With increasing sample sizes
the | ||||
Note. RMSEA = root mean square error of approximation; BRMSEA = Bayesian root mean square error of approximation; ppp = posterior predictive p value; ref = reference model. If no superscripts are given, no noteworthy differences were found for the different prior variations or the number of indicators and a joined summary was given.
Model acceptance differed for the number of indicators: The result for the six indicators was provided first, followed by the result for the 12 indicators. bModel acceptance differed for the different prior variations: The result for the diffuse and informative prior variations was provided first, followed by the result for the wrong prior variation.
Results of the Empirical Illustration for the Different Sample Sizes With the 90% Confidence Interval of the Root Mean Square Error of Approximation (RMSEA) and p Values for the Frequentist Confirmatory Factor Analysis (CFA) Models and 90% Posterior Probability Intervals of the Bayesian RMSEA (BRMSEA) and Posterior Predictive p Value for the Bayesian CFA Models With Diffuse and Informative Priors.
| Bayesian CFA (diffuse
priors) | Bayesian CFA (informative
priors) | Frequentist CFA | ||||
|---|---|---|---|---|---|---|
|
| BRMSEA90 |
| BRMSEA90 |
| RMSEA90 | |
| 50 | [.000, .206] |
| [.000, .189] |
| [.053, .254] | .02 |
| 100 | [.000, .114] |
| [.000, .109] |
| [.000, .136] |
|
| 250 |
|
|
|
|
|
|
| 500 |
|
|
|
|
|
|
| 1,000 |
|
|
|
|
| .02 |
| 5,000 |
| .00 |
| .00 |
| < .01 |
| 10,000 |
| .00 |
| .00 |
| < .01 |
Note. RMSEA = root mean square error of approximation; BRMSEA = Bayesian root mean square error of approximation; ppp = posterior predictive p value. Boldfaced BRMSEA and RMSEA intervals have a lower limit below .05 and an upper limit below .08; Boldfaced p values and posterior predictive p values are greater than .05.