| Literature DB >> 25400603 |
Christian Geiser1, G Leonard Burns2, Mateu Servera3.
Abstract
Models of confirmatory factor analysis (CFA) are frequently applied to examine the convergent validity of scores obtained from multiple raters or methods in so-called multitrait-multimethod (MTMM) investigations. We show that interesting incremental information about method effects can be gained from including mean structures and tests of MI across methods in MTMM models. We present a modeling framework for testing MI in the first step of a CFA-MTMM analysis. We also discuss the relevance of MI in the context of four more complex CFA-MTMM models with method factors. We focus on three recently developed multiple-indicator CFA-MTMM models for structurally different methods [the correlated traits-correlated (methods - 1), latent difference, and latent means models; Geiser et al., 2014a; Pohl and Steyer, 2010; Pohl et al., 2008] and one model for interchangeable methods (Eid et al., 2008). We demonstrate that some of these models require or imply MI by definition for a proper interpretation of trait or method factors, whereas others do not, and explain why MI may or may not be required in each model. We show that in the model for interchangeable methods, testing for MI is critical for determining whether methods can truly be seen as interchangeable. We illustrate the theoretical issues in an empirical application to an MTMM study of attention deficit and hyperactivity disorder (ADHD) with mother, father, and teacher ratings as methods.Entities:
Keywords: mean and covariance structures; mean differences across raters; measurement equivalence; measurement invariance; multitrait-multimethod (MTMM) analysis; random vs. fixed methods; rater agreement
Year: 2014 PMID: 25400603 PMCID: PMC4214357 DOI: 10.3389/fpsyg.2014.01216
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1CFA measurement model for multiple-indicator MTMM data. Each latent factor T represents the error-free (true) scores of a specific TMU. The picture shows an example in which three indicators Y (i = 1, 2, 3) are used to measure one construct or trait (j = 1) by two methods (k = 1, 2).
Figure 2Extended CFA measurement model for multiple-indicator MTMM data. In contrast to Figure 1, the extended model contains I – 1 indicator-specific factors IS to reflect shared indicator-specific effects across raters. The latent factors T1 are now specific to the reference indicator Y1 and therefore carry an additional index for the reference indicator.
Figure 3Three different ways to examine method effects with structurally different methods. (A) Latent regression [CT-C(M – 1)] approach. (B) Latent difference approach. (C) Latent means approach. All three approaches imply the same covariance and mean structure at the latent level, but differ in terms of which level of MI they require (see discussion in the text). The measurement part of the models is the same as in Figures 1, 2 and therefore not shown in this figure.
Figure 4CFA-MTMM models for interchangeable methods. (A) Version with a general trait factor for homogeneous indicators. (B) Version with indicator-specific traits for heterogeneous indicators. In the pictures, we show the recommended specification, in which loadings and intercepts are set equal across methods for identical indicators. In model version B, all trait factor loadings are fixed to 1 and all intercepts are fixed to zero.
Goodness of fit statistics for different models fit to the HI and IN multirater data set.
| Figure | 269.91 | 24 | <0.001 | 0.12 | 0.963 | 0.944 | 0.02 | 9773 | |||
| Figure | 17.70 | 17 | 0.41 | 0.01 | 1.000 | 1.000 | 0.01 | — | — | — | 9535 |
| Figure | 21.40 | 21 | 0.44 | 0.01 | 1.000 | 1.000 | 0.02 | 3.7 | 4 | 0.45 | 9531 |
| Figure | 29.85 | 25 | 0.23 | 0.02 | 0.999 | 0.999 | 0.02 | 8.45 | 4 | 0.08 | 9531 |
| Figure | 52.60 | 31 | 0.009 | 0.03 | 0.997 | 0.996 | 0.02 | 22.75 | 6 | <0.001 | 9542 |
| Figure | 104.27 | 27 | <0.001 | 0.06 | 0.988 | 0.984 | 0.08 | 74.42 | 2 | <0.001 | 9602 |
| Figure | |||||||||||
| Figure | 196.64 | 24 | <0.001 | 0.10 | 0.975 | 0.962 | 0.02 | 9386 | |||
| Figure | 16.09 | 17 | 0.52 | 0.00 | 1.000 | 1.000 | 0.01 | — | — | — | 9219 |
| Figure | 28.18 | 21 | 0.14 | 0.02 | 0.999 | 0.998 | 0.02 | 12.09 | 4 | 0.02 | 9223 |
| Figure | 17.66 | 19 | 0.55 | 0.00 | 1.000 | 1.000 | 0.01 | 1.57 | 2 | 0.46 | 9217 |
| Figure | 193.63 | 25 | <0.001 | 0.10 | 0.976 | 0.965 | 0.04 | 165.45 | 4 | <0.001 | 9381 |
| Figure | 29.20 | 23 | 0.17 | 0.02 | 0.999 | 0.999 | 0.02 | 11.54 | 4 | 0.02 | 9220 |
| Figure | 230.73 | 31 | <0.001 | 0.10 | 0.971 | 0.966 | 0.04 | 9406 | |||
| Figure | |||||||||||
| Figure | 34.81 | 27 | 0.14 | 0.02 | 0.999 | 0.998 | 0.02 | 4.42 | 1 | 0.04 | 9218 |
Note: RMSEA, root mean square error of approximation; CFI, comparative fit index; TLI, Tucker-Lewis index; SRMR, standardized root mean square residual; AIC, Akaike's information criterion. All chi-square difference tests refer to the previous model in the preceding row unless otherwise indicated. Bold-face indicates best-fitting models for which detailed results are presented.
No chi-square difference test reported, because model nesting involves boundary constraints in this case.
Relative to the strong invariance model with unequal means.
Relative to the configural invariance model.
Relative to the model of weak invariance for mother and father reports only.
Relative to the model of strong invariance for mother and father reports only.
Parameter estimates of the measurement models fit to the HI and IN multirater data set.
| λ1 | 1.00 | — | 0.96 | 1.00 | — | 0.96 |
| λ2 | 0.93 | 0.01 | 0.90 | 0.13 | 0.01 | 0.91 |
| λ3 | 0.92 | 0.01 | 0.90 | 0.95 | 0.01 | 0.88 |
| γ2 | 1.00 | — | 0.37 | 1.00 | — | 0.33 |
| γ2 | 0.88 | 0.21 | 0.34 | 0.96 | 0.08 | 0.31 |
| γ2 | 0.17 | 0.06 | 0.07 | 0.31 | 0.07 | 0.09 |
| γ3 | 1.00 | — | 0.24 | 1.00 | — | 0.35 |
| γ3 | 1.57 | 0.56 | 0.40 | 0.94 | 0.08 | 0.32 |
| γ3 | 0.13 | 0.08 | 0.03 | 0.06 | 0.08 | 0.02 |
| α1 | 0.00 | — | 0.00 | — | ||
| α2 | −0.09 | 0.02 | 0.06 | 0.02 | ||
| α3 | 0.08 | 0.02 | 0.16 | 0.02 | ||
| α1 | 0.00 | — | 0.00 | — | ||
| α2 | −0.09 | 0.02 | 0.06 | 0.02 | ||
| α3 | 0.08 | 0.02 | 0.16 | 0.02 | ||
| α1 | 0.00 | — | 0.00 | — | ||
| α2 | −0.09 | 0.02 | 0.01 | 0.02 | ||
| α3 | 0.08 | 0.02 | −0.15 | 0.02 | ||
| 0.09 | 0.01 | 0.08 | 0.07 | 0.01 | 0.09 | |
| 0.07 | 0.04 | 0.06 | 0.08 | 0.01 | 0.07 | |
| 0.15 | 0.02 | 0.13 | 0.09 | 0.01 | 0.11 | |
| 0.10 | 0.01 | 0.09 | 0.07 | 0.01 | 0.08 | |
| 0.09 | 0.03 | 0.08 | 0.08 | 0.01 | 0.07 | |
| 0.02 | 0.06 | 0.02 | 0.09 | 0.01 | 0.10 | |
| 0.07 | 0.01 | 0.06 | 0.06 | 0.01 | 0.05 | |
| 0.11 | 0.01 | 0.11 | 0.10 | 0.01 | 0.06 | |
| 0.13 | 0.01 | 0.12 | 0.15 | 0.01 | 0.13 | |
Note: For hyperactivity/impulsivity, a model of strong invariance for all raters and equal means across mother and father ratings was chosen. For inattention, a model of strict invariance for mother and father ratings was chosen. λ.
Parameter fixed for identification.
Standardized loadings differed between raters for the same variable, because error variances and latent factor variances were allowed to differ in the final models. The standardized loadings are therefore given separately for each rater type in the following order: (1) mothers, (2) fathers, (3) teachers.
Standardized residual variances indicate 1 – R.
Dashes indicate fixed parameters for which no standard errors are computed.
Estimated latent covariances, correlations, means, and variances in the final models.
| 1. | — | 0.85 (0.06) | 0.45 (0.05) | — | — |
| 2. | 0.81 (0.02) | — | 0.43 (0.05) | — | — |
| 3. | 0.42 (0.04) | 0.42 (0.04) | — | — | — |
| 4. | — | — | — | — | 0.03 (0.01) |
| 5. | — | — | — | 0.29 (0.07) | — |
| Means | 1.10 | 1.10 | 0.71 (0.04) | — | — |
| Variances | 1.11 (0.07) | 0.99 (0.06) | 1.05 (0.06) | 0.16 (0.04) | 0.07 (0.03) |
| 6. | — | 0.60 (0.04) | 0.41 (0.04) | — | — |
| 7. | 0.78 (0.02) | — | 0.44 (0.05) | — | — |
| 8. | 0.45 (0.04) | 0.44 (0.04) | — | — | — |
| 9. | — | — | — | — | 0.05 (0.01) |
| 10. | — | — | — | 0.44 (0.07) | — |
| Means | 0.97 (0.04) | 1.03 (0.04) | 0.88 (0.04) | — | — |
| Variances | 0.74 (0.05) | 0.80 (0.05) | 1.17 (0.07) | 0.12 (0.02) | 0.10 (0.02) |
Note: Covariances are shown above the diagonal, correlations below the diagonal.
Covariances, correlations, or means that are set to zero by definition of the model. Standard errors are given in parentheses.
Latent means for hyperactivity/inattention were set equal across mother and father reports.
Goodness of fit statistics for different versions of the CFA-MTMM model for interchangeable methods fit to the HI and IN multirater data set.
| All three raters; equal loadings and intercepts | 378.01 | 31 | <0.001 | 0.13 | 0.948 | 0.939 | 0.16 | 9867 | |||
| Mothers and fathers only; equal loadings and intercepts | 7.34 | 8 | 0.50 | 0.00 | 1.000 | 1.000 | 0.01 | 6102 | |||
| Mothers and fathers only; equal loadings, intercepts, and residual variances | |||||||||||
| Mothers and fathers only; equal loadings, intercepts, residual variances, and method factor variances | 16.81 | 12 | 0.16 | 0.03 | 0.999 | 0.999 | 0.03 | 4.11 | 1 | 0.04 | 6104 |
| All three raters; equal loadings and intercepts | 381.16 | 31 | <0.001 | 0.13 | 0.949 | 0.941 | 0.09 | 9556 | |||
| Mothers and fathers only; equal loadings and intercepts | 9.42 | 8 | 0.31 | 0.02 | 1.000 | 0.999 | 0.01 | 5648 | |||
| Mothers and fathers only; equal loadings, intercepts, and residual variances | 10.10 | 11 | 0.52 | 0.00 | 1.00 | 1.00 | 0.01 | 0.68 | 3 | 0.88 | 5643 |
| Mothers and fathers only; equal loadings, intercepts, residual variances, and method factor variances | |||||||||||
Note: In order to save space, we only present results for the model version with indicator-specific traits (Figure 4B), given that the model version with a single trait (Figure 4A) did not fit well for any rater combination. RMSEA, root mean square error of approximation; CFI, comparative fit index; TLI, Tucker-Lewis index; SRMR, standardized root mean square residual; AIC, Akaike's information criterion. For both constructs, the initial model included all three rater types. Subsequent models included only mother and father ratings (dropping teacher ratings from the analysis). Bold-face indicates best-fitting models.
Parameter estimates in the CFA-MTMM models for interchangeable methods fit to mother and father ratings of HI and IN.
| λ1 | 1.00 | — | 0.84; 0.89 | 1.00 | — | 0.84 |
| λ2 | 1.00 | — | 0.88; 0.91 | 1.00 | — | 0.87 |
| λ3 | 1.00 | — | 0.84; 0.88 | 1.00 | — | 0.85 |
| γ1 | 1.00 | — | 0.37 | 1.00 | — | 0.47 |
| γ2 | 0.88 | 0.05 | 0.34 | 1.05 | 0.05 | 0.41 |
| γ3 | 0.99 | 0.05 | 0.07 | 0.98 | 0.05 | 0.43 |
| 0.09 | 0.01 | 0.08; 0.08[ | 0.07 | 0.01 | 0.08 | |
| 0.09 | 0.01 | 0.07; 0.08[ | 0.09 | 0.01 | 0.08 | |
| 0.09 | 0.01 | 0.08; 0.09[ | 0.09 | 0.01 | 0.10 | |
| 1.09 | 0.04 | 0.98 | 0.03 | |||
| 0.95 | 0.04 | 1.17 | 0.04 | |||
| 1.08 | 0.04 | 1.09 | 0.04 | |||
| 0.86 | 0.06 | 0.58 | 0.04 | |||
| 0.92 | 0.06 | 0.88 | 0.06 | |||
| 0.78 | 0.06 | 0.66 | 0.05 | |||
| 0.26 | 0.04 | 0.18 | 0.02 | |||
| 0.14 | 0.03 | 0.18 | 0.02 | |||
For hyperactivity/impulsivity, a model of strong invariance for all raters and equal means across mother and father ratings was chosen. For inattention, a model of strict invariance for mother and father ratings was chosen. λ.
Parameter fixed for identification.
Standardized loadings and standardized error variances for HI differed between raters for the same variable, because the method factor variances were allowed to differ in the final model. The standardized loadings and error variances are therefore given separately for each rater type in the following order: (1) mothers, (2) fathers.
Standardized residual variances indicate 1 – R.
Method factor variances were set equal across mother and father reports in this model.
Dashes indicate fixed parameters for which no standard errors are computed.