| Literature DB >> 31162418 |
Abstract
The factor structure of mental abilities has most often been depicted using a higher-order model. Under this model, general mental ability (g) is placed at the top of a pyramid, with "loading" arrows going from it to the other factors of intelligence, which in turn go to subtest scores. In contrast, under the bifactor model (also known as the nested factors/direct hierarchical model), each subtest score has its own direct loading on g; the non-g factors (e.g., the broad abilities) do not mediate the relationships of the subtest scores with g. Here we summarized past research that compared the fit of higher-order and bifactor models using confirmatory factor analysis (CFA). We also analyzed additional archival datasets to compare the fit of the two models. Using a total database consisting of 31 test batteries, 58 datasets, and 1,712,509 test takers, we found stronger support for a bifactor model of g than for the traditional higher-order model. Across 166 comparisons, the bifactor model had median increases of 0.076 for the Comparative Fit Index (CFI), 0.083 for the Tucker-Lewis Index (TLI), and 0.078 for the Normed Fit Index (NFI) and decreases of 0.028 for the root mean square error of approximation (RMSEA) and 1343 for the Akaike Information Criterion (AIC). Consequently, researchers should consider using bifactor models when conducting CFAs. The bifactor model also makes the unique contributions of g and the broad abilities to subtest scores more salient to test users.Entities:
Keywords: bifactor; factor analysis; higher-order; intelligence; mental-abilities
Year: 2017 PMID: 31162418 PMCID: PMC6526460 DOI: 10.3390/jintelligence5030027
Source DB: PubMed Journal: J Intell ISSN: 2079-3200
Figure 1(a) This figure shows the higher-order model for the Thurstone and Thurstone [8] test battery. For the sake of clarity, only primary loadings are depicted. The model that only included primary loadings had an RMSEA of 0.063, an AIC of 6696, and a χ2 of 6438 (df = 1,701, p < 0.001). Due to a Heywood case, the error variance for Test 57 was fixed to the value implied by its reliability; (b) This figure presents the corresponding bifactor model for the Thurstone and Thurstone [8] test battery. This model, which only had primary loadings, had an RMSEA of 0.059, an AIC of 6093, and a χ2 of 5737 (df = 1652, p < 0.001). A Δχ2 of 701 (df = 49, p < 0.001) indicated that it had statistically significant incremental fit over the higher-order model. Due to a Heywood case, the error variance for Test 57 was fixed to the value implied by its reliability.
Figure 2(a) This figure shows the bifactor model for the DAT-ASVAB test battery, with the corresponding higher-order model shown in the inset; (b) This figure shows the bifactor model for the AFQOT battery, with the corresponding higher-order model shown in the inset. For the sake of clarity, only primary loadings are depicted. The bifactor model that only included primary loadings had better fit statistics (Comparative Fit Index (CFI) = 0.924; Tucker-Lewis Index (TLI) = 0.897; Normed Fit Index (NFI) = 0.921; Akaike Information Criterion (AIC) = 2333) than the higher-order model (CFI = 0.900; TLI = 0.879; NFI = 0.897; AIC = 3000), and the Δχ2 of 690 (df = 11, p < 0.001) was significant.
Summary of Fit Statistics from Past Bifactor Research.
| Citation | Battery/Notes | Higher-Order | Comparison | Bifactor | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CFI | TLI | NFI | RMSEA | AIC | CFI | TLI | NFI | RMSEA | AIC |
| ||||||||||
| Beaujean et al. [ | WISC-IV | 0.981 | 0.976 | 0.960 | 0.039 | 226.74 | 150.74 | 82 | <0.001 | 127.60 | 73 | <0.001 | ||||||||
| Benson et al. [ | WISC-IV | 0.956 | 0.950 | 0.925 | 0.042 | 1108.74 | 934.74 | 409 | <0.001 | 812.60 | 385 | <0.001 | ||||||||
| Gignac & Watkins [ | WAIS-IV a | |||||||||||||||||||
| Age 16-19 | 0.945 | 0.933 | 0.918 | 0.068 | 314.75 | 246.75 | 86 | <0.001 | 147.28 | 75 | <0.001 | |||||||||
| Age 20-34 | 0.959 | 0.950 | 0.944 | 0.064 | 366.51 | 298.51 | 86 | <0.001 | 197.21 | 75 | <0.001 | |||||||||
| Age 35-54 | 0.943 | 0.930 | 0.920 | 0.075 | 347.28 | 279.28 | 86 | <0.001 | 16.43 | 75 | <0.001 | |||||||||
| Age 55-69 | 0.948 | 0.937 | 0.927 | 0.074 | 341.93 | 273.93 | 86 | <0.001 | 194.95 | 75 | <0.001 | |||||||||
| Gignac [ | WAIS-R b | 0.970 | 0.959 | 0.967 | 0.068 | 443.97 | 391.97 | 40 | <0.001 | 162.28 | 33 | <0.001 | ||||||||
| Gignac [ | WAIS-III c | 0.968 | 0.959 | 0.965 | 0.064 | 723.38 | 663.38 | 61 | <0.001 | 448.25 | 51 | <0.001 | ||||||||
| Gignac [ | MAB d | 0.955 | 0.941 | 0.953 | 0.077 | 664.9 | 622.9 | 34 | <0.001 | 385.34 | 25 | <0.001 | ||||||||
| Gignac [ | Colom S1 e,f | 0.893 | 0.815 | 101.8 | 50 | <0.001 | 0.854 | 0.074 | 159.15 | 87.15 | 42 | <0.001 | ||||||||
| Colom S2 e | 0.913 | 0.893 | 0.783 | 0.049 | 196.48 | 126.48 | 85 | 0.002 | 97.84 | 75 | .039 | |||||||||
| Colom S3 e | 0.878 | 0.850 | 0.767 | 0.064 | 221.1 | 151.1 | 85 | <0.001 | 129.26 | 75 | <0.001 | |||||||||
| G1984 g | 0.958 | 0.951 | 0.943 | 0.051 | 673.15 | 575.15 | 161 | <0.001 | 382.37 | 144 | <0.001 | |||||||||
| HS1939/G2001 h | 0.901 | 0.890 | 0.827 | 0.060 | 617.27 | 511.27 | 247 | <0.001 | 434.29 | 228 | <0.001 | |||||||||
| Golay & Lecerf [ | French WAIS-III i | 0.965 | 0.956 | 0.957 | 0.059 | 359.5 | 301.5 | 62 | <0.001 | 123 | 53 | <0.001 | ||||||||
| Niileksela et al. [ | WAIS-IV | 0.964 | 0.942 | 0.067 | 193.62 | 179.62 | 71 | <0.001 | 0.966 | 168.86 | 66 | <0.001 | ||||||||
| Von Stumm et al. [ | Lab Study j | 0.934 | 0.905 | 0.897 | 0.079 | 103.19 | 63.19 | 25 | <0.001 | 33.99 | 21 | 0.036 | ||||||||
Notes: Tests for which the bifactor model had significant incremental fit are in blue, and those for which the higher-order model had better fit are in red. For each fit index, we highlight the better fitting model using bold font. CFI: Comparative Fit Index; TLI: Tucker-Lewis Index; NFI: Normed Fit Index; RMSEA: Root Mean Square Error of Approximation; AIC: Akaike Information Criterion. Values are reprinted from the original studies. When the original studies did not provide a fit statistic, the formulas provided in Brown [27] and Mueller [28] were used. For example, Beaujean et al. [21] did not provide TLI and NFI in their original paper; however, CFI, χ2, and df were provided for the hypothesized model. We estimated the values of TLI and NFI using a three-step process. First, we computed df using the number of observed variables (i.e., subtest scores). Next, we located the formula for CFI given in Mueller [28] and solved this formula for the value of χnull2. We assumed that χnull2 – df > χhypothesized2 – df > 0 when solving. Next we computed TLI and NFI using the formulas provided by Mueller [28]. We computed all AICs using the formula used by AMOS and LISREL (i.e., χ2+2 (number of free parameters)), which Brown [27] describes on p. 175. a This is the fourth revision of the Wechsler Adult Intelligence Scale (WAIS; Wechsler [29]), a popular clinical intelligence test. b This is the second revision of the WAIS (Wechsler [30]). c This is the third revision of the WAIS (Wechsler [31]). d The Multidimensional Aptitude Battery (MAB; Jackson [32,33]) is a group-administered mental abilities test. The MAB is a general-purpose test battery that is used in employment, research, and clinical settings. e Gignac [14] conducted these analyses on correlation matrices from three samples in studies by Colom et al. [34]. f Gignac [14] reported a df of 50 for both the null and higher-order models; we assumed a value of 66 for the df of the null model based on the number of tests/measured variables (note df = k(k – 1)/2) and the fact that the higher-order model requires the estimation of 16 more parameters than the null model does. g Gignac [14] also analyzed a correlation matrix from Gustafson [35]. h Gignac [14] also analyzed the correlation matrix from Holzinger and Swineford’s [4] study; this matrix is published in Gustafson [36]. i This is the French-language version of the fourth revision of the WAIS. j Von Stumm et al. [26] administered nine tests: the Ravens Advanced Progressive Matrices (Raven [37]), Yela’s [38] rotation of solid figures test, three of Thurstone’s [39] Primary Mental Abilities tests, and four tests from the Differential Aptitude Tests [40,41].
χ2 Tests Comparing Observed Counts of Fit Statistic Results Across all 166 Analyses to those Expected from Morgan et al.’s [43] Monte Carlo Simulation.
| True Model = Bifactor; | True Model = Higher-Order; | ||||||
|---|---|---|---|---|---|---|---|
| Factor Structure | CFI | TLI | RMSEA | CFI | TLI | RMSEA | |
| 3:1 & 2:1 | 200 | 156 | 151 | 148 | 120 | 111 | 111 |
| 3:1 & 2:1 | 800 | 166 | 166 | 166 | 133 | 118 | 108 |
| 3:1 | 200 | 151 | 143 | 141 | 120 | 110 | 106 |
| 3:1 | 800 | 166 | 166 | 166 | 138 | 121 | 110 |
| 3:1 & 2:1 | 200 | 165 | 156 | 156 | 165 | 156 | 156 |
| 3:1 & 2:1 | 800 | 165 | 156 | 156 | 165 | 156 | 156 |
| 3:1 | 200 | 165 | 156 | 156 | 165 | 156 | 156 |
| 3:1 | 800 | 165 | 156 | 156 | 165 | 156 | 156 |
| 3:1 & 2:1 | 200 | 0.5 | 0.2 | 0.4 | 16.9 | 18.2 | 18.2 |
| 3:1 & 2:1 | 800 | 0.0 | 0.6 | 0.6 | 7.7 | 12.2 | 21.3 |
| 3:1 | 200 | 1.3 | 1.2 | 1.6 | 16.9 | 19.2 | 23.6 |
| 3:1 | 800 | 0.0 | 0.6 | 0.6 | 5.3 | 10.1 | 19.2 |
| 3:1 & 2:1 | 200 | 0.471 | 0.684 | 0.511 | <0.001 | <0.001 | <0.001 |
| 3:1 & 2:1 | 800 | 0.938 | 0.438 | 0.438 | 0.006 | <0.001 | <0.001 |
| 3:1 | 200 | 0.255 | 0.277 | 0.207 | <0.001 | <0.001 | <0.001 |
| 3:1 | 800 | 0.938 | 0.438 | 0.438 | 0.022 | 0.001 | <0.001 |
This set of columns pertains to Morgan et al.’s [43] simulations that used a bifactor model to analyze data that were generated to be truly bifactor. This set of columns pertains to Morgan et al.’s [43] simulations that used a bifactor model to analyze data that were generated to be truly higher-order. Indicators per factor in Morgan et al.’s [43] study included two conditions; (1) a condition in which two factors had three indicators each and two other factors had two indicators each and (2) a condition in which four factors had three indicators each. We included the results for both conditions in our analyses. Morgan et al.’s [43] study included two sample sizes (n = 200 and 800); we included the results for both in our analyses. The expected results were computed by multiplying the percentages reported in Morgan et al.’s [43] study by the 166 comparisons used here. The observed results were the number of times each fit statistic identified the bifactor model as having superior fit. Instances in which the fit statistics were equal were treated as support for the higher-order model. = 1.
Figure 3These figures shows the historical evolution of the theoretical models of the factor structure of mental abilities. The evolution begins with modeling of the general factor of mental abilities (g) in (a), followed by contentions that mental abilities were represented by multiple factors (b), and then a recognition that the multiple factors are correlated (c), followed by a realization that both g and broad factors existed (d). This is followed by the emergence of the bifactor model (e).