| Literature DB >> 33794710 |
Silvia Grieder1, Anette Bünger1, Salome D Odermatt1, Florine Schweizer1, Alexander Grob1.
Abstract
Research on comparability of general intelligence composites (GICs) is scarce and has focused exclusively on comparing GICs from different test batteries, revealing limited individual-level comparability. We add to these findings, investigating the group- and individual-level comparability of different GICs within test batteries (i.e., internal score comparability), thereby minimizing transient error and ruling out between-battery variance completely. We (a) determined the magnitude of intraindividual IQ differences, (b) investigated their impact on external validity, (c) explored possible predictors for these differences, and (d) examined ways to deal with incomparability. Results are based on the standardization samples of three intelligence test batteries, spanning from early childhood to late adulthood. Despite high group-level comparability, individual-level comparability was often unsatisfactory, especially toward the tails of the IQ distribution. This limited comparability has consequences for external validity, as GICs were differentially related to and often less predictive for school grades for individuals with high IQ differences. Of several predictors, only IQ level and age were systematically related to comparability. Consequently, findings challenge the use of overall internal consistencies for confidence intervals and suggest using confidence intervals based on test-retest reliabilities or age- and IQ-specific internal consistencies for clinical interpretation. Implications for test construction and application are discussed.Entities:
Keywords: IQ; general intelligence; individual level; reliability; screening; validity
Mesh:
Year: 2021 PMID: 33794710 PMCID: PMC9301611 DOI: 10.1177/10731911211005171
Source DB: PubMed Journal: Assessment ISSN: 1073-1911
Number, Position, and Content of Subtests, and Reliabilities and Widths of 95% CIs for each GIC.
| GIC | # of Subt. | Pos. in Test Seq. | Content Overlap (%) | CHC Broad Abilities Tapped | Internal consistency | rtt
| Width of 95% CI | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Overall
| Age-specific
| Age- and IQ-specific
| 95% CI | 95% CIage | 95% CIageIQ | 95% CIrtt | ||||||
| IDS-2EBIQ(14) | 14 | 1-14 | 100 | Gf, Gc, Gsm, Gv, Glr, Gs | .98 | .95-.97 | .67-.98 | .85 | 8 | 10-13 | 8-28 | 21 |
| IDS-2FSIQ(7) | 7 | 1-7 | 44 | Gf, Gc, Gsm, Gv, Glr, Gs | .97 | .92-.95 | .54-.97 | .89 | 10 | 12-16 | 10-30 | 19 |
| IDS-2ABIQ(2) | 2 | 6, 7 | 38 | Gf, Gc | .95 | .83-.90 | .53-.92 | .86 | 13 | 17-23 | 15-30 | 21 |
| SB5FSIQ(10) | 10 | 1-10 | 50 | Gf, Gc, Gsm, Gv, Gq | .99 | .93-.98 | .45-.96 | .94 | 6 | 8-16 | 10-30 | 14 |
| SB5ABIQ(2) | 2 | 1, 2 | Gf, Gc | .97 | .76-.93 | .30-.93 | .86 | 10 | 15-26 | 15-30 | 21 | |
| RIASFSIQ(4) | 4 | 1-4 | 100 | Gf, Gc | .95 | .93-.97 | .55-.96 | .88 | 13 | 10-16 | 11-30 | 19 |
| RIASABIQ(2) | 2 | 1, 2 | Gf, Gc | .93 | .90-.94 | .51-.96 | .87 | 15 | 13-18 | 12-30 | 20 | |
Note. Content overlap was calculated by dividing the number of subtests tapping the same broad abilities for both GICs by the total number of subtests over both GICs and multiplying this decimal by 100. Each content overlap percentage concerns the respective GIC and the one in the row below (for the IDS-2ABIQ(2): the IDS-2EBIQ(14)). CHC broad ability assignments are based on information in the test manuals and descriptions in McGrew (2009, Table 1). Mean test–retest intervals were 24 days (IDS-2), 22 days (SB5), and 19 days (RIAS). Age- and IQ-specific internal consistencies: IQ groups: <85, 85-115, >115; age groups: IDS-2: 5-6, 7-8, 9-12, 13-15, 16-20 years, SB5: <7, 7-8, 9-12, 13-15, 16-–20, 21-29, 30-59, ≥60 years, RIAS: 3-4, 5-6, 7-8, 9-12, 13-15, 16-20, 21-59, ≥60 years. GIC = general intelligence composite; CI = confidence interval; # of Subt. = number of subtests; Pos. in Test Seq. = position in test sequence; CHC = Cattell–Horn–Carroll; rtt = test–retest reliability; 95% CI = 95% CI with overall internal consistencies; 95% CIage = 95% CI with age-specific internal consistencies; 95% CIageIQ = 95% CI with age- and IQ-specific internal consistencies; 95% CIrtt = 95% CI with test–retest reliability; EBIQ = Extended Battery IQ; FSIQ = Full-Scale IQ; ABIQ = Abbreviated Battery IQ; IDS-2 = Intelligence and Development Scales–2; SB5 = Stanford–Binet Intelligence Scales–Fifth edition, German adaptation; RIAS = Reynolds Intellectual Assessment Scales, German adaptation; Gf = fluid reasoning; Gc = comprehension knowledge; Gsm = short-term memory; Gv = visual processing; Glr = long-term memory and retrieval; Gs = cognitive processing speed; Gq = quantitative knowledge.
Derived from manuals. Based on Cronbach’s alphas (IDS-2 and RIAS) or split-half reliabilities (SB5).
Computed according to the manuals with a formula provided by Lienert and Raatz (1998, p. 330) based on Cronbach’s alphas (IDS-2 and RIAS) or split-half reliabilities (SB5).
Descriptive Statistics, Paired-Samples t Tests and Pearson Correlations, and Intraindividual Absolute Differences in IQs.
| GIC |
|
| Range | Skewness | Kurtosis |
| Cohen’s |
|
|
|
| Rangediff |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| IDS-2EBIQ(14) | 100.04 | 14.70 | 55-145 | −0.49 | 0.65 | −0.61 | −0.01 | .95 | .98 | 3.68 | 3 | 0-20 |
| IDS-2FSIQ(7) | 100.11 | 14.79 | 55-142 | −0.44 | 0.48 | −0.12 | −0.00 | .82 | .86 | 7.00 | 6 | 0-39 |
| IDS-2ABIQ(2) | 100.08 | 15.11 | 55-144 | −0.30 | 0.10 | −0.12 | −0.00 | .77 | .80 | 7.94 | 7 | 0-37 |
| SB5FSIQ(10) | 99.96 | 14.80 | 55-145 | −0.02 | 0.22 | −0.18 | −0.00 | .76 | .77 | 8.12 | 7 | 0-38 |
| SB5ABIQ(2) | 99.92 | 14.95 | 55-145 | −0.06 | 0.00 | |||||||
| RIASFSIQ(4) | 99.53 | 14.77 | 45-158 | −0.49 | 0.91 | 3.73 | 0.03 | .93 | .99 | 4.37 | 4 | 0-20 |
| RIASABIQ(2) | 99.98 | 14.49 | 40-160 | −0.79 | 1.63 |
Note. IDS-2: n = 1,622; SB5: n = 1,829; RIAS: n = 2,109. The last six columns refer to the comparison between the respective GIC and the one in the row below it (for the IDS-2ABIQ(2): with the IDS-2EBIQ(14)). Cohen’s d was calculated using the formula from Dunlap et al. (1996) for paired samples. GIC = general intelligence composite; EBIQ = Extended Battery IQ; FSIQ = Full-Scale IQ; ABIQ = Abbreviated Battery IQ; IDS-2 = Intelligence and Development Scales–2; SB5 = Stanford–Binet Intelligence Scales–Fifth Edition, German adaptation; RIAS = Reynolds Intellectual Assessment Scales, German adaptation; r = Pearson correlation corrected for unreliability of both GICs; M/Md/Rangediff = mean/median/range of intraindividual absolute IQ difference.
p < .001.
Figure 1.Comparison of 95% confidence intervals for standardized beta coefficients for GICs predicting school grades in German and mathematics and grade point average (GPA), split into subsamples with participants with intraindividual absolute differences in IQs below (IDS-2: n = 203, SB5: n = 122, RIAS: n = 29) and above (IDS-2: n = 211, SB5: n = 127, RIAS: n = 35) the median.
Note. A difference in betas was deemed significant if confidence intervals overlapped to a maximum of 50% (indicated in red). Significant betas are in black, nonsignificant betas in gray. Data for the IDS-2 and SB5 are cross-sectional; data for the RIAS are longitudinal. IDS-2 = Intelligence and Development Scales–2; SB5 = Stanford–Binet Intelligence Scales–Fifth edition, German adaptation; RIAS = Reynolds Intellectual Assessment Scales, German adaptation; EBIQ = Extended Battery IQ; FSIQ = Full-Scale IQ; ABIQ = Abbreviated Battery IQ.
Gamma Generalized Linear Models With Possible Predictors of Absolute Differences in IQs.
| Predictor | IDS-2 | SB5 | RIAS |
|---|---|---|---|
| EBIQ vs. ABIQ | FSIQ vs. ABIQ | FSIQ vs. ABIQ | |
| Age | −0.00 | −0.15 | −0.08 |
| Sex | 0.00 | 0.03 | 0.03 |
| AD(H)D | 0.12 | 0.14 | |
|
| |||
| Bilingual | −0.14 | 0.00 | −0.01 |
| Other language | −0.04 | 0.07 | −0.01 |
| Education | 0.06 | 0.05 | 0.05 |
| IQ level | |||
| Below-average IQ | 0.01 | 0.07 | 0.27 |
| Above-average IQ | 0.06 | 0.26 | 0.17 |
| Participation | 0.00 | ||
| Cooperation | 0.05 | ||
| Understanding | 0.05 | ||
| Age * Below-average IQ | −0.59 | 0.12 | −0.07 |
| Age * Above-average IQ | −0.10 | 0.31 | −0.16 |
| Likelihood | 24.04 | 27.45 | 29.52 |
Note. IDS-2: n = 1,566, SB5: n = 1,775, RIAS: n = 1,979. Displayed are regression coefficients standardized by dividing by two standard deviations (Gelman, 2008). Sex: 0 = male, 1 = female; AD(H)D: 0 = no, 1 = yes; Bilingual: 0 = German, 1 = bilingual; Other language: 0 = German, 1 = other native language; Education (of participants or their mothers): 0 = no university degree; 1 = university degree; Below average IQ: 0 = 85 ≤ IQ ≤ 115, 1 = IQ < 85; Above average IQ: 0 = 85 ≤ IQ ≤ 115, 1 = IQ >115; Cooperation (in the test situation) and understanding (of instructions): 0 = yes, 1 = partly/no. AD(H)D = attention deficit/hyperactivity disorder or attention deficit disorder; Participation = participation in the test situation; IDS-2 = Intelligence and Development Scales–2; SB5 = Stanford–Binet Intelligence Scales–Fifth edition, German adaptation; RIAS = Reynolds Intellectual Assessment Scales, German adaptation; EBIQ = Extended Battery IQ; FSIQ = Full-Scale IQ; ABIQ = Abbreviated Battery IQ.
p <.05. **p <.01. ***p <.001.
Figure 2.Percentage of participants with comparable IQs (i.e., overlapping intervals) determined by the following six criteria: NomIQ, 95% CI, 95% CIage, NomCI, 95% CIrtt, and 95% CIageIQ.
Note. NomIQ = IQ in nominal categories (e.g., “average” for IQ 85-115), 95% CI = 95% CI with overall internal consistencies, 95% CIage = 95% CI with age-specific internal consistencies, NomCI = 95% CIs with overall internal consistencies in nominal categories, 95% CIrtt = 95% CI with test–retest reliabilities, and 95% CIageIQ = 95% CI with age- and IQ-specific internal consistencies. The percentage of participants with comparable IQs both overall (black dots) and across IQ and age groups (color palette) are displayed. Numbers are displayed for the IQ and age group with the lowest percentage of participants with comparable IQs for each comparison. Ages given in years. IDS-2 = Intelligence and Development Scales–2; SB5 = Stanford–Binet Intelligence Scales–Fifth edition, German adaptation; RIAS = Reynolds Intellectual Assessment Scales, German adaptation; EBIQ = Extended Battery IQ; FSIQ = Full-Scale IQ; ABIQ = Abbreviated Battery IQ; CI = confidence interval.