| Literature DB >> 32451743 |
Haoran Zhang1, Yunxiao Chen2, Xiaoou Li3.
Abstract
We revisit a singular value decomposition (SVD) algorithm given in Chen et al. (Psychometrika 84:124-146, 2019b) for exploratory item factor analysis (IFA). This algorithm estimates a multidimensional IFA model by SVD and was used to obtain a starting point for joint maximum likelihood estimation in Chen et al. (2019b). Thanks to the analytic and computational properties of SVD, this algorithm guarantees a unique solution and has computational advantage over other exploratory IFA methods. Its computational advantage becomes significant when the numbers of respondents, items, and factors are all large. This algorithm can be viewed as a generalization of principal component analysis to binary data. In this note, we provide the statistical underpinning of the algorithm. In particular, we show its statistical consistency under the same double asymptotic setting as in Chen et al. (2019b). We also demonstrate how this algorithm provides a scree plot for investigating the number of factors and provide its asymptotic theory. Further extensions of the algorithm are discussed. Finally, simulation studies suggest that the algorithm has good finite sample performance.Entities:
Keywords: IFA; double asymptotics; exploratory item factor analysis; generalized PCA for binary data; singular value decomposition
Mesh:
Year: 2020 PMID: 32451743 PMCID: PMC7385012 DOI: 10.1007/s11336-020-09704-7
Source DB: PubMed Journal: Psychometrika ISSN: 0033-3123 Impact factor: 2.500
Fig. 1A scree plot for choosing the number of factors. The y-axis shows the standardized singular values , where s are obtained from Step 7 of Algorithm 1. The data are simulated from an IFA model with , , and . The input dimension is set to be 10 in Algorithm 1. A singular value gap can be found between the 5th and 6th singular values
Fig. 2Simulation results when and the true factors are independent. Panel a shows the number of items J in x-axis versus the loss (2) in y-axis, and Panel b shows the number of items J in x-axis versus the computation time (in seconds) in y-axis. For each metric and each method, we show the median, 25% quantile, and 75% quantile based on the 100 independent replications
Fig. 3Simulation results when and the true factors are correlated. The two panels show the same metrics as in Fig. 2
Fig. 4Simulation results when and the true factors are independent. The two panels show the same metrics as in Fig. 2
Fig. 5Simulation results when and the true factors are correlated. The two panels show the same metrics as in Fig. 2