| Literature DB >> 26845032 |
Rudolf Debelak1,2, Ulrich S Tran3.
Abstract
The analysis of polychoric correlations via principal component analysis and exploratory factor analysis are well-known approaches to determine the dimensionality of ordered categorical items. However, the application of these approaches has been considered as critical due to the possible indefiniteness of the polychoric correlation matrix. A possible solution to this problem is the application of smoothing algorithms. This study compared the effects of three smoothing algorithms, based on the Frobenius norm, the adaption of the eigenvalues and eigenvectors, and on minimum-trace factor analysis, on the accuracy of various variations of parallel analysis by the means of a simulation study. We simulated different datasets which varied with respect to the size of the respondent sample, the size of the item set, the underlying factor model, the skewness of the response distributions and the number of response categories in each item. We found that a parallel analysis and principal component analysis of smoothed polychoric and Pearson correlations led to the most accurate results in detecting the number of major factors in simulated datasets when compared to the other methods we investigated. Of the methods used for smoothing polychoric correlation matrices, we recommend the algorithm based on minimum trace factor analysis.Entities:
Mesh:
Year: 2016 PMID: 26845032 PMCID: PMC4742070 DOI: 10.1371/journal.pone.0148143
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Rate (in percent) of correctly detected number of major factors under different variations of PA when minor factors were not present under a symmetrical response distribution.
| Method | Major factors | Var. per factor | Number of respondents | Number of categories | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 3 | 5 | 15 | 200 | 500 | 1000 | 2 | 3 | 4 | |
| PCA95KB | 100 | 99.95 | 99.95 | 100 | 99.92 | 100 | 100 | 99.92 | 100 | 100 |
| PCAmKB | 99.89 | 99.92 | 99.86 | 99.95 | 99.71 | 100 | 100 | 99.71 | 100 | 100 |
| PAFA95KB | 97.33 | 99.86 | 97.61 | 99.59 | 98.25 | 98.71 | 98.84 | 98.3 | 98.21 | 99.29 |
| PAFAmKB | 81.17 | 98.81 | 83.92 | 96.06 | 88.88 | 90.63 | 90.46 | 86.46 | 90.59 | 92.92 |
| PCA95BY | 100 | 99.95 | 99.95 | 100 | 99.92 | 100 | 100 | 99.92 | 100 | 100 |
| PCAmBY | 99.89 | 99.92 | 99.86 | 99.95 | 99.71 | 100 | 100 | 99.71 | 100 | 100 |
| PAFA95BY | 97.33 | 99.86 | 97.61 | 99.59 | 98.25 | 98.71 | 98.84 | 98.3 | 98.21 | 99.29 |
| PAFAmBY | 81.17 | 98.84 | 83.92 | 96.09 | 88.92 | 90.63 | 90.46 | 86.5 | 90.59 | 92.92 |
| PCA95Pear | 100 | 99.97 | 99.97 | 100 | 99.96 | 100 | 100 | 99.96 | 100 | 100 |
| PCAmPear | 99.86 | 99.89 | 99.86 | 99.89 | 99.63 | 100 | 100 | 99.67 | 99.96 | 100 |
| PAFA95Pear | 88.73 | 99.81 | 89.17 | 99.36 | 86.5 | 98.21 | 98.09 | 93.04 | 94.46 | 95.3 |
| PAFAmPear | 74.45 | 98.53 | 79.31 | 93.67 | 85.13 | 86.92 | 87.42 | 80.09 | 88.13 | 91.25 |
| % Ind. Matrices | 0 | 8.12 | 0 | 8.12 | 11.33 | 0.84 | 0 | 9.17 | 3 | 0 |
Note. PCA = Principal Component Analysis, PAFA = Principal Axes Factor Analysis; 95 = 95% eigenvalue, m = mean eigenvalue; KB = polychoric correlation with Knol-Berger smoothing algorithm, BY = polychoric correlation Bentler-Yuan smoothing algorithm, Pear = Pearson; Var. per factor = Variables per major factor; % Ind. Matrices = Percentage of indefinite correlation matrices
Rate (in percent) of correctly detected number of major factors under different variations of PA when major and minor factors were present under a symmetrical response distribution.
| Method | Major factors | Var. per factor | Number of respondents | Number of categories | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 3 | 5 | 15 | 200 | 500 | 1000 | 2 | 3 | 4 | |
| PCA95KB | 99.97 | 99.97 | 99.95 | 100 | 99.92 | 100 | 100 | 99.96 | 100 | 99.96 |
| PCAmKB | 99.95 | 99.89 | 99.84 | 100 | 99.75 | 100 | 100 | 99.79 | 100 | 99.96 |
| PAFA95KB | 96.67 | 99.86 | 97.2 | 99.33 | 97.63 | 99.08 | 98.09 | 97.67 | 98.55 | 98.58 |
| PAFAmKB | 74.45 | 98.89 | 78.75 | 94.59 | 85.92 | 88.5 | 85.59 | 85.09 | 86.75 | 88.17 |
| PCA95BY | 99.97 | 99.97 | 99.95 | 100 | 99.92 | 100 | 100 | 99.96 | 100 | 99.96 |
| PCAmBY | 99.95 | 99.89 | 99.84 | 100 | 99.75 | 100 | 100 | 99.79 | 100 | 99.96 |
| PAFA95BY | 96.67 | 99.86 | 97.2 | 99.33 | 97.63 | 99.08 | 98.09 | 97.67 | 98.55 | 98.58 |
| PAFAmBY | 74.45 | 98.89 | 78.75 | 94.59 | 85.92 | 88.5 | 85.59 | 85.09 | 86.75 | 88.17 |
| PCA95Pear | 99.97 | 100 | 99.97 | 100 | 99.96 | 100 | 100 | 100 | 100 | 99.96 |
| PCAmPear | 99.95 | 99.89 | 99.84 | 100 | 99.75 | 100 | 100 | 99.79 | 100 | 99.96 |
| PAFA95Pear | 88.31 | 99.86 | 89.39 | 98.78 | 88.55 | 97.63 | 96.08 | 91.09 | 95.25 | 95.92 |
| PAFAmPear | 66.5 | 98.47 | 74.81 | 90.17 | 82.71 | 85.13 | 79.63 | 77.96 | 84.17 | 85.33 |
| % Ind. Matrices | 0 | 8.84 | 0.06 | 8.78 | 11.58 | 1.67 | 0 | 10.08 | 3.17 | 0 |
Note. PCA = Principal Component Analysis, PAFA = Principal Axes Factor Analysis; 95 = 95% eigenvalue, m = mean eigenvalue; KB = polychoric correlation with Knol-Berger smoothing algorithm, BY = polychoric correlation Bentler-Yuan smoothing algorithm, Pear = Pearson; Var. per factor = Variables per major factor; % Ind. Matrices = Percentage of indefinite correlation matrices
Rate (in percent) of correctly detected number of major factors under different variations of PA when only major factors were present under a skewed response distribution.
| Method | Major factors | Var. per factor | Number of respondents | Number of categories | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 3 | 5 | 15 | 200 | 500 | 1000 | 2 | 3 | 4 | |
| PCA95KB | 99.72 | 96.45 | 98.56 | 97.61 | 95.04 | 99.21 | 100 | 95.21 | 99.04 | 100 |
| PCAmKB | 99.03 | 93 | 96.78 | 95.25 | 90.38 | 97.79 | 99.88 | 91.29 | 96.75 | 100 |
| PAFA95KB | 85.11 | 91 | 88.67 | 87.45 | 84 | 88.25 | 91.92 | 77.8 | 89.34 | 97.04 |
| PAFAmKB | 54.75 | 83.42 | 65.97 | 72.2 | 62.67 | 69.46 | 75.13 | 53.83 | 68.67 | 84.75 |
| PCA95BY | 99.72 | 98.14 | 98.58 | 99.28 | 97.25 | 99.54 | 100 | 97.34 | 99.46 | 100 |
| PCAmBY | 99.03 | 95.72 | 96.86 | 97.89 | 93.88 | 98.38 | 99.88 | 94.5 | 97.63 | 100 |
| PAFA95BY | 85.11 | 93.7 | 88.7 | 90.11 | 87.13 | 89.17 | 91.92 | 81.13 | 90.04 | 97.04 |
| PAFAmBY | 54.81 | 86.78 | 66.03 | 75.56 | 67.08 | 70.17 | 75.13 | 57.88 | 69.67 | 84.84 |
| PCA95Pear | 99.92 | 99.25 | 99.53 | 99.64 | 98.84 | 99.92 | 100 | 98.75 | 100 | 100 |
| PCAmPear | 99.28 | 97.92 | 98.47 | 98.72 | 96.09 | 99.71 | 100 | 96.46 | 99.38 | 99.96 |
| PAFA95Pear | 76.5 | 98.19 | 81 | 93.7 | 74.71 | 92.46 | 94.88 | 81.29 | 87.83 | 92.92 |
| PAFAmPear | 51.39 | 93.2 | 68.31 | 76.28 | 67.09 | 73.92 | 75.88 | 60.09 | 73.17 | 83.63 |
| % Ind. Matrices | 1.11 | 23.14 | 3.06 | 21.2 | 27.04 | 8.58 | 0.75 | 23.67 | 8.42 | 4.29 |
Note. PCA = Principal Component Analysis, PAFA = Principal Axes Factor Analysis; 95 = 95% eigenvalue, m = mean eigenvalue; KB = polychoric correlation with Knol-Berger smoothing algorithm, BY = polychoric correlation Bentler-Yuan smoothing algorithm, Pear = Pearson; Var. per factor = Variables per major factor; % Ind. Matrices = Percentage of indefinite correlation matrices
Rate (in percent) of correctly detected number of major factors under different variations of PA when major and minor factors were present under a skewed response distribution.
| Method | Major factors | Var. per factor | Number of respondents | Number of categories | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 3 | 5 | 15 | 200 | 500 | 1000 | 2 | 3 | 4 | |
| PCA95KB | 99.64 | 95.06 | 98.22 | 96.48 | 93.25 | 98.79 | 100 | 95.67 | 98.71 | 97.67 |
| PCAmKB | 98.75 | 89.34 | 95.78 | 92.31 | 85.46 | 96.71 | 99.96 | 90.46 | 96.75 | 94.92 |
| PAFA95KB | 76.89 | 85.19 | 83.95 | 78.14 | 75.42 | 81.63 | 86.08 | 78.59 | 85.54 | 79 |
| PAFAmKB | 37.86 | 72.56 | 56.95 | 53.47 | 48.3 | 56.75 | 60.59 | 50.29 | 61.92 | 53.42 |
| PCA95BY | 99.67 | 97.36 | 98.28 | 98.75 | 96.25 | 99.29 | 100 | 97.67 | 99.38 | 98.5 |
| PCAmBY | 98.75 | 93.14 | 95.78 | 96.11 | 90.13 | 97.75 | 99.96 | 94 | 97.79 | 96.04 |
| PAFA95BY | 76.98 | 88.95 | 84 | 81.92 | 80.13 | 82.67 | 86.08 | 81.92 | 86.71 | 80.25 |
| PAFAmBY | 37.95 | 76.09 | 57 | 57.03 | 53.17 | 57.29 | 60.59 | 54 | 62.92 | 54.13 |
| PCA95Pear | 99.86 | 99.14 | 99.5 | 99.5 | 98.54 | 99.96 | 100 | 98.88 | 99.88 | 99.75 |
| PCAmPear | 99.14 | 97.42 | 98.28 | 98.28 | 95 | 99.84 | 100 | 96.5 | 99.42 | 98.92 |
| PAFA95Pear | 72.03 | 97.5 | 81.14 | 88.39 | 76.79 | 89.5 | 88 | 80.09 | 86.84 | 87.38 |
| PAFAmPear | 37.14 | 91.28 | 63.5 | 64.92 | 60.54 | 67 | 65.09 | 58.54 | 67.04 | 67.04 |
| % Ind. Matrices | 0.92 | 26.06 | 2.39 | 24.58 | 29.96 | 8.92 | 1.59 | 23.21 | 8.75 | 8.5 |
Note. PCA = Principal Component Analysis, PAFA = Principal Axes Factor Analysis; 95 = 95% eigenvalue, m = mean eigenvalue; KB = polychoric correlation with Knol-Berger smoothing algorithm, BY = polychoric correlation Bentler-Yuan smoothing algorithm, Pear = Pearson; Var. per factor = Variables per major factor; % Ind. Matrices = Percentage of indefinite correlation matrices