| Literature DB >> 28588523 |
Victoria Savalei1, Mijke Rhemtulla2.
Abstract
Structural equation models (SEMs) can be estimated using a variety of methods. For complete normally distributed data, two asymptotically efficient estimation methods exist: maximum likelihood (ML) and generalized least squares (GLS). With incomplete normally distributed data, an extension of ML called "full information" ML (FIML), is often the estimation method of choice. An extension of GLS to incomplete normally distributed data has never been developed or studied. In this article we define the "full information" GLS estimator for incomplete normally distributed data (FIGLS). We also identify and study an important application of the new GLS approach. In many modeling contexts, the variables in the SEM are linear composites (e.g., sums or averages) of the raw items. For instance, SEMs often use parcels (sums of raw items) as indicators of latent factors. If data are missing at the item level, but the model is at the composite level, FIML is not possible. In this situation, FIGLS may be the only asymptotically efficient estimator available. Results of a simulation study comparing the new FIGLS estimator to the best available analytic alternative, two-stage ML, with item-level missing data are presented.Entities:
Keywords: generalized least squares estimation; item-level missing data; missing data; parcels; structural equation modeling
Year: 2017 PMID: 28588523 PMCID: PMC5439014 DOI: 10.3389/fpsyg.2017.00767
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
True parameter values for the composite model.
| Unstandardized factor loadings (for each factor) | 0.72, 0.84, 0.96 | 1.26, 1.47, 1.68 |
| Standardized factor loadings (for each factor) | 0.36, 0.42, 0.48 | 0.52, 0.60, 0.69 |
| Residual variances of indicators (for each factor) | 3.42, 3.23, 3.02 | 4.33, 3.76, 3.10 |
| Factor regression coefficients (F1->F2, F2->F3) | 0.6, 0.6 | 0.6, 0.6 |
| Factor residual variances (F2, F3) | 0.64, 0.64 | 0.64, 0.64 |
Figure 1Model 1 used to generate complete data. In model 2, first-order factor loadings were {0.6, 0.7, 0.8} instead. Variances of all observed and latent variables are 1.
Figure 2Composite model, shown with true parameter values for Model 1. Standardized factor loadings for Model 1 are {0.363, 0.423, 0.484} for each factor. The corresponding true parameter values for Model 2 are given in Table 1. Standardized factor loading values for Model 2 are {0.52, 0.60, 0.69}. These values were derived algebraically from the corresponding values for the components; the derivations were verified empirically by fitting the analysis model to the population covariance matrices of the composites. The analysis model was fit with (residual) factor variances fixed to their true values, and all loadings, latent regression coefficients, and indicator residual variances freely estimated.
Number of useable replications across study conditions.
| 200 | MCAR | 5 | 942 | 943 | 1,000 | 1,000 |
| 15 | 921 | 916 | 1,000 | 1,000 | ||
| 30 | 873 | 874 | 1,000 | 1,000 | ||
| MAR.lin | 5 | 949 | 947 | 1,000 | 1,000 | |
| 15 | 926 | 927 | 1,000 | 1,000 | ||
| 30 | 851 | 858 | 1,000 | 1,000 | ||
| MAR.nl | 5 | 939 | 948 | 1,000 | 1,000 | |
| 15 | 920 | 931 | 1,000 | 1,000 | ||
| 30 | 851 | 861 | 1,000 | 1,000 | ||
| 400 | MCAR | 5 | 998 | 999 | 1,000 | 1,000 |
| 15 | 998 | 999 | 1,000 | 1,000 | ||
| 30 | 992 | 988 | 1,000 | 1,000 | ||
| MAR.lin | 5 | 999 | 1,000 | 1,000 | 1,000 | |
| 15 | 997 | 997 | 1,000 | 1,000 | ||
| 30 | 989 | 990 | 1,000 | 1,000 | ||
| MAR.nl | 5 | 998 | 998 | 1,000 | 1,000 | |
| 15 | 994 | 996 | 1,000 | 1,000 | ||
| 30 | 992 | 991 | 1,000 | 1,000 | ||
| 600 | MCAR | 5 | 1,000 | 1,000 | 1,000 | 1,000 |
| 15 | 1,000 | 1,000 | 1,000 | 1,000 | ||
| 30 | 999 | 999 | 1,000 | 1,000 | ||
| MAR.lin | 5 | 1,000 | 1,000 | 1,000 | 1,000 | |
| 15 | 1,000 | 1,000 | 1,000 | 1,000 | ||
| 30 | 998 | 998 | 1,000 | 1,000 | ||
| MAR.nl | 5 | 1,000 | 1,000 | 1,000 | 1,000 | |
| 15 | 1,000 | 1,000 | 1,000 | 1,000 | ||
| 30 | 999 | 999 | 1,000 | 1,000 | ||
“MAR.lin” and “MAR.nl” stand for MAR-linear and MAR-nonlinear missing mechanisms, respectively.
Average bias in factor loadings and latent regression coefficients across all study conditions.
| 200 | MCAR | 5 | −0.006 | −0.004 | −0.037 | −0.015 | 0.093 | 0.071 | 0.035 | 0.017 |
| 15 | −0.003 | 0.001 | −0.038 | −0.016 | 0.114 | 0.085 | 0.038 | 0.019 | ||
| 30 | 0.004 | 0.009 | −0.041 | −0.020 | 0.132 | 0.104 | 0.046 | 0.025 | ||
| MAR.lin | 5 | −0.010 | −0.007 | −0.037 | −0.015 | 0.105 | 0.082 | 0.034 | 0.017 | |
| 15 | −0.005 | −0.003 | −0.038 | −0.016 | 0.111 | 0.092 | 0.035 | 0.017 | ||
| 30 | 0.009 | 0.006 | −0.037 | −0.015 | 0.099 | 0.112 | 0.037 | 0.018 | ||
| MAR.nl | 5 | −0.008 | −0.009 | −0.037 | −0.015 | 0.096 | 0.090 | 0.035 | 0.017 | |
| 15 | −0.003 | −0.001 | −0.037 | −0.015 | 0.111 | 0.084 | 0.037 | 0.018 | ||
| 30 | 0.005 | 0.010 | −0.039 | −0.013 | 0.118 | 0.092 | 0.040 | 0.019 | ||
| 400 | MCAR | 5 | −0.008 | −0.009 | −0.019 | −0.008 | 0.040 | 0.037 | 0.015 | 0.006 |
| 15 | −0.011 | −0.012 | −0.019 | −0.007 | 0.050 | 0.042 | 0.015 | 0.006 | ||
| 30 | −0.010 | −0.010 | −0.017 | −0.006 | 0.065 | 0.048 | 0.015 | 0.006 | ||
| MAR.lin | 5 | −0.009 | −0.009 | −0.019 | −0.008 | 0.037 | 0.032 | 0.015 | 0.006 | |
| 15 | −0.009 | −0.009 | −0.018 | −0.007 | 0.046 | 0.039 | 0.015 | 0.006 | ||
| 30 | −0.005 | −0.006 | −0.019 | −0.007 | 0.055 | 0.049 | 0.016 | 0.006 | ||
| MAR.nl | 5 | −0.008 | −0.008 | −0.019 | −0.008 | 0.037 | 0.030 | 0.015 | 0.006 | |
| 15 | −0.008 | −0.008 | −0.019 | −0.007 | 0.038 | 0.033 | 0.015 | 0.006 | ||
| 30 | −0.012 | −0.009 | −0.020 | −0.008 | 0.062 | 0.051 | 0.017 | 0.007 | ||
| 600 | MCAR | 5 | −0.007 | −0.006 | −0.010 | −0.003 | 0.030 | 0.023 | 0.009 | 0.003 |
| 15 | −0.007 | −0.006 | −0.010 | −0.003 | 0.032 | 0.025 | 0.009 | 0.003 | ||
| 30 | −0.008 | −0.007 | −0.010 | −0.003 | 0.047 | 0.036 | 0.010 | 0.004 | ||
| MAR.lin | 5 | −0.007 | −0.006 | −0.010 | −0.002 | 0.030 | 0.023 | 0.009 | 0.003 | |
| 15 | −0.007 | −0.007 | −0.009 | −0.002 | 0.032 | 0.025 | 0.008 | 0.002 | ||
| 30 | −0.008 | −0.007 | −0.009 | −0.002 | 0.041 | 0.036 | 0.009 | 0.003 | ||
| MAR.nl | 5 | −0.008 | −0.007 | −0.011 | −0.003 | 0.030 | 0.023 | 0.009 | 0.003 | |
| 15 | −0.006 | −0.006 | −0.010 | −0.002 | 0.033 | 0.026 | 0.009 | 0.003 | ||
| 30 | −0.008 | −0.006 | −0.010 | −0.002 | 0.044 | 0.036 | 0.009 | 0.002 | ||
Shaded cells contain bias values that are >0.05 in absolute value.
Average efficiency estimates (empirical standard errors) for factor loadings and latent regression coefficients across all study conditions.
| 200 | MCAR | 5 | 0.295 | 0.228 | 0.499 | 0.167 | ||||
| 15 | 0.315 | 0.315 | 0.237 | 0.615 | 0.181 | |||||
| 30 | 0.383 | 0.255 | 0.651 | 0.217 | ||||||
| MAR.lin | 5 | 0.287 | 0.229 | 0.550 | 0.165 | |||||
| 15 | 0.307 | 0.237 | 0.588 | 0.171 | ||||||
| 30 | 0.365 | 0.257 | 0.659 | 0.183 | ||||||
| MAR.nl | 5 | 0.288 | 0.288 | 0.229 | 0.531 | 0.163 | ||||
| 15 | 0.319 | 0.240 | 0.614 | 0.177 | ||||||
| 30 | 0.382 | 0.262 | 0.643 | 0.196 | ||||||
| 400 | MCAR | 5 | 0.198 | 0.158 | 0.158 | 0.318 | 0.103 | |||
| 15 | 0.201 | 0.163 | 0.163 | 0.273 | 0.108 | |||||
| 30 | 0.230 | 0.173 | 0.173 | 0.394 | 0.110 | |||||
| MAR.lin | 5 | 0.189 | 0.158 | 0.158 | 0.260 | 0.104 | ||||
| 15 | 0.199 | 0.199 | 0.163 | 0.163 | 0.337 | 0.108 | ||||
| 30 | 0.228 | 0.175 | 0.175 | 0.368 | 0.113 | |||||
| MAR.nl | 5 | 0.189 | 0.189 | 0.159 | 0.159 | 0.256 | 0.104 | |||
| 15 | 0.203 | 0.165 | 0.165 | 0.270 | 0.107 | |||||
| 30 | 0.227 | 0.178 | 0.178 | 0.372 | 0.115 | |||||
| 600 | MCAR | 5 | 0.152 | 0.152 | 0.130 | 0.130 | 0.193 | 0.085 | ||
| 15 | 0.159 | 0.133 | 0.133 | 0.202 | 0.086 | |||||
| 30 | 0.178 | 0.178 | 0.143 | 0.143 | 0.302 | 0.094 | ||||
| MAR.lin | 5 | 0.152 | 0.152 | 0.130 | 0.130 | 0.191 | 0.085 | |||
| 15 | 0.160 | 0.134 | 0.134 | 0.199 | 0.088 | |||||
| 30 | 0.179 | 0.144 | 0.262 | 0.093 | ||||||
| MAR.nl | 5 | 0.152 | 0.152 | 0.130 | 0.130 | 0.193 | 0.085 | |||
| 15 | 0.161 | 0.161 | 0.135 | 0.135 | 0.206 | 0.088 | ||||
| 30 | 0.183 | 0.146 | 0.146 | 0.283 | 0.095 | |||||
The winning method is bolded in each condition.
Average root mean square error estimates for factor loadings and latent regression coefficients across all study conditions.
| 200 | MCAR | 5 | 0.011 | 0.037 | 0.093 | 0.035 | ||||
| 15 | 0.010 | 0.010 | 0.038 | 0.114 | 0.038 | |||||
| 30 | 0.016 | 0.041 | 0.132 | 0.046 | ||||||
| MAR.lin | 5 | 0.013 | 0.037 | 0.105 | 0.034 | |||||
| 15 | 0.011 | 0.038 | 0.111 | 0.035 | ||||||
| 30 | 0.012 | 0.037 | 0.112 | 0.037 | ||||||
| MAR.nl | 5 | 0.012 | 0.012 | 0.037 | 0.096 | 0.035 | ||||
| 15 | 0.011 | 0.037 | 0.111 | 0.037 | ||||||
| 30 | 0.020 | 0.039 | 0.118 | 0.040 | ||||||
| 400 | MCAR | 5 | 0.011 | 0.011 | 0.019 | 0.040 | 0.015 | |||
| 15 | 0.014 | 0.019 | 0.050 | 0.015 | ||||||
| 30 | 0.015 | 0.017 | 0.065 | 0.015 | ||||||
| MAR.lin | 5 | 0.011 | 0.019 | 0.037 | 0.015 | |||||
| 15 | 0.010 | 0.010 | 0.018 | 0.046 | 0.015 | |||||
| 30 | 0.009 | 0.009 | 0.019 | 0.055 | 0.016 | |||||
| MAR.nl | 5 | 0.010 | 0.018 | 0.037 | 0.015 | |||||
| 15 | 0.010 | 0.010 | 0.019 | 0.038 | 0.015 | |||||
| 30 | 0.013 | 0.020 | 0.062 | 0.017 | ||||||
| 600 | MCAR | 5 | 0.009 | 0.010 | 0.030 | 0.009 | ||||
| 15 | 0.009 | 0.010 | 0.032 | 0.009 | ||||||
| 30 | 0.010 | 0.010 | 0.047 | 0.010 | ||||||
| MAR.lin | 5 | 0.008 | 0.010 | 0.030 | 0.009 | |||||
| 15 | 0.009 | 0.009 | 0.032 | 0.008 | ||||||
| 30 | 0.010 | 0.009 | 0.041 | 0.009 | ||||||
| MAR.nl | 5 | 0.009 | 0.011 | 0.030 | 0.009 | |||||
| 15 | 0.008 | 0.010 | 0.033 | 0.009 | ||||||
| 30 | 0.009 | 0.010 | 0.044 | 0.009 | ||||||
The winning method is bolded in each condition.
Average coverage for factor loadings and latent regression coefficients across all study conditions.
| 200 | MCAR | 5 | 95.9 | 94.4 | 94.6 | 94.0 | 94.4 | 95.8 | ||
| 15 | 96.1 | 94.3 | 94.6 | 94.0 | 93.2 | 97.0 | 95.2 | |||
| 30 | 95.8 | 93.8 | 94.2 | 93.6 | 96.9 | 95.4 | ||||
| MAR.lin | 5 | 95.7 | 94.4 | 94.7 | 94.2 | 94.5 | 96.3 | |||
| 15 | 95.6 | 94.0 | 94.3 | 93.9 | 93.0 | 95.6 | ||||
| 30 | 95.3 | 93.2 | 94.3 | 93.4 | 93.1 | 96.8 | 95.3 | |||
| MAR.nl | 5 | 95.8 | 94.3 | 94.6 | 94.0 | 93.9 | 96.2 | |||
| 15 | 95.8 | 94.4 | 94.2 | 93.5 | 94.0 | 97.0 | 95.3 | |||
| 30 | 95.9 | 93.7 | 94.1 | 93.5 | 93.4 | 96.4 | 94.8 | |||
| 400 | MCAR | 5 | 95.3 | 94.3 | 94.8 | 94.7 | 94.7 | 96.6 | 95.9 | |
| 15 | 94.9 | 93.8 | 94.5 | 94.6 | 94.5 | 96.4 | 95.7 | |||
| 30 | 95.0 | 93.8 | 94.8 | 94.7 | 94.9 | 96.4 | 95.8 | |||
| MAR.lin | 5 | 95.0 | 94.1 | 94.6 | 94.6 | 95.4 | 93.8 | 96.6 | 95.8 | |
| 15 | 95.2 | 94.2 | 94.7 | 94.6 | 94.9 | 93.1 | 96.5 | 95.4 | ||
| 30 | 95.2 | 94.2 | 94.4 | 94.2 | 94.0 | 96.1 | 95.2 | |||
| MAR.nl | 5 | 95.2 | 94.3 | 94.5 | 94.4 | 95.2 | 93.5 | 96.2 | 95.6 | |
| 15 | 95.3 | 94.1 | 94.7 | 94.7 | 94.6 | 96.7 | 95.6 | |||
| 30 | 95.3 | 94.2 | 94.4 | 94.1 | 94.8 | 96.2 | 95.4 | |||
| 600 | MCAR | 5 | 94.9 | 94.4 | 94.7 | 94.5 | 94.4 | 93.1 | 95.5 | 95.0 |
| 15 | 95.0 | 94.5 | 94.9 | 94.7 | 94.2 | 95.0 | 94.3 | |||
| 30 | 94.6 | 94.1 | 94.5 | 93.9 | 93.7 | 94.7 | 93.8 | |||
| MAR.lin | 5 | 94.7 | 94.2 | 94.7 | 94.4 | 94.5 | 93.4 | 95.3 | 94.5 | |
| 15 | 94.9 | 94.4 | 94.7 | 94.4 | 94.2 | 93.5 | 95.2 | 94.3 | ||
| 30 | 94.8 | 94.3 | 94.5 | 94.3 | 94.0 | 95.3 | 94.6 | |||
| MAR.nl | 5 | 94.6 | 94.2 | 94.8 | 94.5 | 94.3 | 93.1 | 95.2 | 94.8 | |
| 15 | 94.7 | 94.2 | 94.6 | 94.4 | 93.8 | 93.1 | 95.4 | 94.5 | ||
| 30 | 94.8 | 94.0 | 94.5 | 94.0 | 94.0 | 94.2 | 93.8 | |||
Coverage rates <93% are bolded. Coverage rates >97% are italicized.
Type I error rates across all study conditions.
| 200 | MCAR | 5 | 5.7 | 5.7 | ||
| 15 | 4.4 | 4.8 | 5.0 | |||
| 30 | 5.1 | 5.2 | ||||
| MAR.lin | 5 | 4.6 | 5.0 | |||
| 15 | ||||||
| 30 | 5.4 | |||||
| MAR.nl | 5 | 4.7 | 4.6 | |||
| 15 | 4.7 | 4.7 | ||||
| 30 | 4.2 | 4.3 | 5.2 | 5.3 | ||
| 400 | MCAR | 5 | 4.8 | 4.8 | ||
| 15 | 4.8 | 5.0 | 4.9 | 4.9 | ||
| 30 | 4.3 | 4.3 | ||||
| MAR.lin | 5 | 4.1 | 4.0 | |||
| 15 | 4.9 | 5.0 | 4.3 | 4.3 | ||
| 30 | ||||||
| MAR.nl | 5 | 4.7 | 4.6 | |||
| 15 | 5.4 | 5.4 | 4.8 | 4.9 | ||
| 30 | 5.5 | 5.5 | 5.0 | 5.0 | ||
| 600 | MCAR | 5 | 5.5 | 5.5 | 4.2 | 4.1 |
| 15 | 5.6 | 5.6 | 4.6 | 4.6 | ||
| 30 | 5.0 | 5.0 | 4.9 | 4.9 | ||
| MAR.lin | 5 | 5.9 | 6.0 | |||
| 15 | 5.4 | 5.4 | 4.4 | 4.3 | ||
| 30 | 5.1 | 5.2 | 4.7 | 4.7 | ||
| MAR.nl | 5 | 5.4 | 5.5 | 4.3 | 4.3 | |
| 15 | 4.8 | 4.8 | ||||
| 30 | 6.0 | 4.9 | 4.9 | |||
Rejection rates below 4% and above 6% are bolded.