Literature DB >> 29892253

Short Forms of Wechsler Scales Assessing the Intellectually Gifted Children Using Simulation Data.

Alexandre Aubry1, Béatrice Bourdin1.   

Abstract

Intellectual giftedness is usually defined in terms of having a very high Intellectual Quotient (IQ). The intellectual capacity is assessed by a standardized test such as the Wechsler Intelligence Scale for Children (WISC). However, the identification of intellectually gifted children (IGC) often remains time-consuming. A short-form WISC can be used as a screening instrument. The practitioners and researchers in this field can then make a more in-depth evaluation of the IGC's cognitive and socioemotional characteristics if needed. The aim of our study is thus to determine the best short tests, in terms of their psychometric qualities, for the identification of IGC. The current study is composed of three-step analyses. Firstly, we created nine IQs short forms (IQSF) with 2-subtests, and nine IQSF with 4-subtests from the WISC-IV (Wechsler, 2005). Secondly, we estimated psychometric parameters (i.e., reliability and validity) from empirical and simulated dataset with WISC-IV. The difference in the estimation of psychometric qualities of each IQSF from the simulated data is very close to those derived from empirical data. We thus selected the three best IQSF based on these psychometrics parameters estimated from simulated datasets. For each selected short form of the WISC-IV, we estimated the screening quality in our sample of IGC. Thirdly, we created IQSF with 2- and 4-subtests from the WISC-V (Wechsler, 2016) with simulated dataset. We then highlighted the three best short forms of WISC-V based on the estimated psychometric parameters. The results are interpreted in terms of validity, reliability and screening quality of IGC. In spite of the important changes in the WISC-V, our findings show that the 2-subtest form, Similitaries + Matrix Reasoning, and 4-subtest form, Similitaries + Vocabulary + Matrix Reasoning + Block Design, are the most efficient to identify the IGC at the two recent versions of Wechsler scales. Finally, we discuss the advantages and drawbacks of a brief assessment of intellectual aptitudes for the identification of the IGC.

Entities:  

Keywords:  brief assessment; gifted children; intelligence; screening tools; short-form

Year:  2018        PMID: 29892253      PMCID: PMC5985308          DOI: 10.3389/fpsyg.2018.00830

Source DB:  PubMed          Journal:  Front Psychol        ISSN: 1664-1078


Introduction

In the sphere of education, the rapid and reliable evaluation of a child's global intellectual capacity is important for an efficient identification of intellectually gifted children (IGC). Indeed, such evaluation contributes in proposing specific educational programs (e.g., accelerated or enrichment programs). In this context, a short evaluation can initially be used to assess a child's intellectual giftedness. It can then serve to determine whether a more in-depth evaluation of the child's cognitive and socioemotional characteristics is needed. The importance and usefulness of cognitive ability assessments for the identification of IGC have long been recognized by the scientific community as a means of facilitating the integration of IGC in specialized school programs (Simpson et al., 2002; Pierson et al., 2012). Furthermore, special education such as enrichment education has proved to impact positively on their cognitive skills (Shi et al., 2013). Consequently, an early identification of intellectual giftedness appears to be a predictor of not only their psychological well-being (Neihart, 1999; Litster and Roberts, 2011; Kroesbergen et al., 2016), but also their professional success as adults (Rinn and Bishop, 2015). Intellectual giftedness is usually defined in terms of a high Intellectual Quotient (IQ) resulting from a standardized and validated intelligence test (Winner, 2000; see Lovett and Lewandowski, 2006). The Wechsler Intelligence Scale for Children (WISC-IV; Wechsler, 2005; WISC-V; Wechsler, 2016) is one of the most common tools used by both researchers and clinicians for the evaluation of intellectual capacity (Evers et al., 2012). It is currently used to identify intellectually gifted children in school (McClain and Pfeiffer, 2012). It allows a person's global intellectual potential to be estimated on the basis of a Full Scale Intellectual Quotient (FSIQ). The most commonly considered threshold value of the FSIQ is a score greater than or equal to 130, i.e., 2 standard deviations above the average (Carman, 2013). However, this cutoff is an ideal target value. Very high cutoffs also require a very high level of reliability in the measurement, in order to ensure satisfactory identification performance. Indeed, the measurement error has an influence on the identification of IGC (McIntosh et al., 2005). It perturbs the accuracy of decisions related to their identification (McBee et al., 2013). According to McIntosh et al. (2005), a threshold FSIQ value of 125, i.e., the 95th percentile, appears to be a reasonable choice for the identification of IGC. In the current study, we retained this cutoff of 125 in Wechsler's scale to identify IGC. The recent versions of Wechsler scales are explicitly developed on the basis of the theoretical model of Carroll-Horn-Cattel (CHC; Keith et al., 2006; Lecerf et al., 2010a; Golay et al., 2013; Weiss et al., 2013). The literature is consensual with regard to the number of global cognitive abilities evaluated by the WISC-IV (Reverte et al., 2015) or WISC-V (Reynolds and Keith, 2017). Subtests are posited to estimate global cognitive aptitudes: Similarities, Vocabulary, and Comprehension are considered to evaluate the Comprehension-Knowledge (Gc) ability; Pictures Concept, Figure Weights (only WISC-V) and Reasoning Matrix are considered to estimate Fluid reasoning (Gf) ability; Block Design and Visual Puzzle (only WISC-V) are posited to evaluate mainly Visual processing (Gv) ability; Digit Span, Picture Span (only WISC-V) and Letter-Number Sequencing are considered to evaluate Short-memory ability (Gsm), and Coding and Symbol target processing Speed (Gs) ability. In practice, IGC have higher performance for Gc, Gf and Gv than for Gsm and Gs (Volker et al., 2006; Rowe et al., 2010). It thus appears that IGC performed more efficiently in terms of high-level abilities than in terms of low-level abilities. The consequence of this is the presence of strong scatter scores in Wechsler scale with IGC. According to Flanagan and Kaufman (2009), an important and uncommon variability can affect the interpretation of the FSIQ. Indeed, the aim of the FSIQ is to summarize all cognitive aptitudes assessed by the scale. The FSIQ score is designed to estimate Spearman's g factor 1904. If the tests are too heterogeneous, the common variance between tests decreases and the mean value of the performance does not provide a satisfactory representation of the overall cognitive aptitude. This abnormal variability between scores in Wechsler scale could have an impact on the estimation of a child's overall intellectual functioning (Lecerf et al., 2015). To mitigate this problem, Weiss et al. (2008) proposed a new indicator: the Global Aptitude Index (GAI). This is estimated from subtests used mainly to evaluate Gc, Gf, and Gv abilities. These are the most highly g-saturated core subtests (Keith et al., 2006; Lecerf et al., 2010b). The GAI would thus provide a better estimation of overall cognitive functioning when low g-loaded cognitive aptitudes are lower than the high g-loaded cognitive aptitudes (Watkins et al., 2002; Sattler and Ryan, 2009; Lecerf et al., 2016). Considering the discrepancies in performance for the low- and high-level cognitive abilities in IGC, the use of GAI appears to be more judicious than the FSIQ in the context of the identification of IGC (Newman et al., 2008). The GAI also has a satisfactory reliability with regard to both short- and long-term stability (Kieng et al., 2013; Watkins and Smith, 2013). It is often used to include the gifted children in special education such as enrichment or accelerated programs (Saklofske et al., 2005; Pierson et al., 2012). The GAI is often considered as an abbreviated form of the Wechsler scale for the identification of IGC. However, the short form of a test should reduce the examination time by at least 50% (Levy, 1968). The GAI reduces the administrative time by approximately 23% (Ryan et al., 2007). As a consequence, the identification of IGC often remains time-consuming, even with the GAI. In order to reduce the examination time of a cognitive ability test, the most commonly used solution described in the literature is to reduce the number of subtests retained to estimate the global cognitive functioning (Silverstein, 1990). This time should not be gained at the cost of predictive accuracy (Doppelt, 1956). The use of short forms comprising more than 4 subtests only weakly increases the reliability of the measurement with respect to the cost in terms of examination time (Karnes and Brown, 1981). The usefulness of brief intellectual assessment is controversial, because it prevents to analyse the cognitive profile which can be essential in the learning disabilities context (Fiorello et al., 2001; Hale et al., 2007). Nevertheless, the brief intellectual assessment may be useful to estimate the intellectual potential. Indeed, several short forms of the WISC-IV have been validated in various languages (e.g., Crawford et al., 2010; in English; Dasi et al., 2014, in Spanish). These short forms have been used to obtain a rapid and reliable evaluation of intellectual ability in children with an intellectual disability (McKenzie et al., 2013; Murray et al., 2016), children having a high-functioning autism spectrum (Thomeer et al., 2012), children affected by epilepsy (Hrabok et al., 2012) or children affected by traumatic brain injury (Donders et al., 2013). Recent studies have used shortened versions of the Wechsler scale with 2 or 4 subtests, for the detection of IGC (Shaw et al., 2006; Alloway and Elsworth, 2012; van Viersen et al., 2014, 2015). Most of these short forms have been constructed on the basis of subtests evaluating high-level cognitive abilities such as verbal comprehension and perceptive reasoning. Although there are short forms of the previous versions of the Wechsler Intelligence Scale for Children (WISC-R and WISC-III) as well as other sets of cognitive ability evaluations for the identification of IGC (Killan and Hughes, 1978; Dirks et al., 1980; Karnes and Brown, 1981; Ortiz and Gonzalez, 1989; Mark et al., 1998; for a review Simpson et al., 2002; Reiter, 2004; Pierson et al., 2012), to our knowledge, no short form of the WISC-IV or WISC-V has been tested with respect to its psychometric qualities in this atypically developing population. With respect to the use of shortened tests of cognitive ability as a decisional aid for the identification of IGC, Prewett (1995) suggests that shortened tests should generate scores that are comparable with those obtained with a battery of global assessment tests. It is thus necessary to compare the mean values of the short scale with that of the full scale. If a discrepancy between the IQ short form (IQSF) and the FSIQ is within 2 standard errors of measurement (SEM), the IQSF is considered as stable (Kieng et al., 2013; Meyers et al., 2013). In the literature, some authors use 2 or 4 subtests to estimate IQSF score from Wechsler scale to identify IGC (Shaw et al., 2006; Alloway and Elsworth, 2012; van Viersen et al., 2014, 2015). To our knowledge, the psychometric qualities of these short forms have never been tested for the identification of IGC. In the present study, we make nine IQSF from all possible combinations of 2 or 4 subtests in Verbal Comprehension Index (VCI) and Performance Reasoning Index (PRI). We compare reliability, validity, and screening qualities with each other. We excluded the subtests involving the working memory, because these subtests are also known to be affected negatively by specific learning disabilities (Maehler and Schuchardt, 2009; Cornoldi et al., 2014; Toffalini et al., 2017a). While IGC are known to elicit high performances in working memory tasks (Calero et al., 2007; Hoard et al., 2008; Ruthsatz and Urbach, 2012), these subtests can underestimate the overall cognitive performance in the context of specific learning disability. All short forms based on subtests from VCI and PRI make it possible to obtain an estimation of the FSIQ and GAI scores in less than 30 min. The aim of our study is thus to determine the best short scales, in terms of their psychometric qualities, for the identification of IGC.

Methods

Participants

The data was collected from the WISC-IV produced by 117 French IGC (mean age: 10.39, SD: 1.03, 74% boys) and 52 French intellectually typical children (mean age: 10.57, SD: 2.66, 79% boys). Clinician psychologists gave us the data completely anonymous and in the respect of French deontological code. To preserve the anonymity of the children, we chose not to contact the parents and their children. The groups did not significantly differ in terms of age, t(57.902) = −0.485, p = 0.629, d = −0.109, or gender, = 0.550, p = 0.458. The children were contacted through teachers, school psychologists or licensed psychologists. In order to allow for measurement errors, the inclusion criterion for intellectually gifted group was set to FSIQ or GAI score greater than or equal to 125 in accordance with the current recommendation (McIntosh et al., 2005; Assouline et al., 2010; Brasseur and Grégoire, 2010).

Procedure

All of the statistical analyses were made using version 3.4.2 of R software (R Core Team, 2017). We estimated the interclass correlation of each IQSF and the full-scale using the irr library (Gamer et al., 2012) and the hierarchical omega (ω) of each IQSF using the MBESS library (Kelley, 2018). We also computed the 95% confidence intervals for the correlation comparisons (Zou, 2007) using the cocor library (Diedenhofen and Musch, 2015). The data relevant to the preparation of short forms was extracted from fully administered WISC-IV protocols. The FSIQ scores were computed from 10 core subtests from the WISC-IV. The GAI scores were obtained from the VCI and PRI (Lecerf et al., 2010a). The current study is composed of three steps in our statistical analyses. Firstly, the scores from different IQSF were computed using the linear equation method (Tellegen and Briggs, 1967). They were divided by the sum of the standard scores given by the VCI (Vocabulary [Vo], Similarities [Si] and Comprehension [Co]) and PRI (Block Design [Bd], Picture Concepts [Pc], and Matrix Reasoning [Mr]). Secondly, we realized a simulation of 1,000,000 data with the Choleski decomposition from correlation matrix of WISC-IV (Table 4.1; Wechsler, 2005) using the mvtnorm library (Genz et al., 2018) (see Giofrè et al., 2017 for a similar approach). We realized a comparison between psychometrics parameters (i.e., reliability and validity) estimated from empirical and simulated datasets. Then, we selected the three best IQSF based on these psychometrics parameters estimated from simulated dataset. For each selected short form WISC-IV, we computed the indicators of a receiver operating characteristic (ROC) such as the sensitivity, the sensibility, the false positive rate (FPR), the false negative rate (FNR), and the Area Under Curve (AUC) from our empirical sample using the pROC library (Robin et al., 2011). The AUC is used as a general measure to estimate the performance of the IGC classification (Fawcett, 2006). Thirdly, we realized another simulated dataset from correlation matrix of WISC-V (Table 4.1; Wechsler, 2016). From this simulated dataset, we created the different short forms of WISC-V. We then highlighted the three best short forms of WISC-V based on the estimated psychometrics parameters from simulated dataset. The data and R script used for the simulations and the statistical analyses are available on the Open Science Framework (OSF) at http://osf.io/dax8p.

Statistical analyses

Nine IQSF derived from 2-subtest forms, and 9 IQSF derived from 4-subtest forms were computed from 3 PRI and 3 VCI subtests. For each IQSF, the threshold of intellectual giftedness identification was superior to 125, i.e., the 95th percentile. It is essential to evaluate reliability and validity when selecting the best short form (Cyr and Brooker, 1984). For each form that we prepared, reliability and validity indices were thus computed.

Index of reliability

Alpha

The reliability of each form was determined by a composite reliability coefficient (rcc) according to Equation (1) from Tellegen and Briggs (1967), based on a table of internal consistency and inter-correlations of the applied subtests derived from the WISC-IV manual (Table 4.1 and Table 5.1 from Wechsler, 2005) and the WISC-V manual (Table 4.1 and Table 5.1 from Wechsler, 2016). This index allows the standard error of each short form measurement to be determined. The composite reliability coefficient is frequently used to estimate the reliability of the abriged scale (Ryan and Ward, 1999; Girard et al., 2010, 2015; Donders et al., 2013; Denney et al., 2015). where r is the reliability coefficient of the jth subtest in IQSF, n is the number of subtests in IQSF, r is the correlation coefficient between the jth subtest and the kth subtest used for IQSF.

Hierarchical omega

In contrast to the coefficient alpha reliability, the coefficient omega (ω) takes into account the unequal factor loadings (Watkins, 2017). In particular, the hierarchical omega (ω) has also the advantage to be unaffected by the fitting factorial analysis model (Kelley and Pornprasertmanit, 2016). So, ω can be a better index of the reliability of a composite score from Wechsler scale than alpha coefficient (Gignac and Watkins, 2013). This index estimates the variation portion which is involved by the general factor. High value of ω indicates that a general factor explains a large part of variation in the composite score. ω coefficient is considered as reliable if it exceeds 0.50 at minimum, but ω superior or equal to 0.75 is considered better (Reise et al., 2013).

Interclass correlation coefficient

In addition, we used the Interclass Correlation Coefficient (ICC; model A.1 in McGraw and Wong, 1996) to examine the reliability of both the IQSF and the full-scale scores.

Index of validity

Three validity indices were estimated: the convergent validity determined by the corrected correlation between the short form and the full form of the scale (r′), the degree of discrepancy computed as the difference between the IQSF score and the full form score (FSIQ or GAI), and the accuracy of the estimation of the full form scores (Cacc).

Convergent validity

The convergent validity of each form was determined by computing its correlation (Pearson's r) with the FSIQ and GAI scores. These correlations were then corrected (r′; Equation 2) by taking the redundancy of the variance error into account, using the modified version (Girard and Christensen, 2008) of the Levy formula (Levy, 1967). The forms were indeed prepared from 2 or 4 tests taken from the entirely administered scale. The correlation between the forms and the full forms (FSIQ or GAI) is artificially increased by the measurement error shared between the two forms: where r′ is the corrected correlation coefficient; rsf is the uncorrected correlation coefficient between the score of the short form (QISF) and the full scale (FSIQ or GAI), r is the composite reliability coefficient of QISF, r is the correlation coefficient between the jth subtest and the kth subtest used for IQSF, r is the correlation coefficient between subtest l and subtest m, used for the FSIQ or the GAI, p is the number of subtests used for the IQSF, SDSF is the standard deviation of the QISF.

Degree of discrepancy

The paired Student t-tests were computed in order to determine whether the average of the QISF scores was significantly different at the average of the FSIQ or GAI scores. This index allows us to define whether the measurement provided by the scale is significantly different at that of the full scale. The extent of this effect (Cohen's d for correlated samples comparison; Lakens, 2013) was estimated, in order to determine the magnitude of the difference in mean computed value.

Accuracy

An indicator of the accuracy (Cacc) of the estimation of FSIQ or GAI from the QISF score was prepared, in order to identify the percentage of individuals in our sample having a QISF score greater than or equal to the threshold value of 125. The indicator Cacc also takes into account the measurement stability between the QISF score and the FSIQ and GAI scores. Usually, this stability is determined by the difference between two scores in the range between −2 and +2 standard errors of measurement (Meyers et al., 2013). Cacc is a coefficient lying in the range between 0 and 1. It can be interpreted as the accuracy of IGC identification. The closer Cacc is to 1, the more accurate the QISF score-based identification. In order to simplify the interpretation of multiple psychometric reliability and validity scores, we construct a composite indicator (Rc) based on the approach proposed by Cyr and Brooker (1984), and adapted by Girard and Christensen (2008). Rc corresponds to the unweighted average of 4 computed reliability and validity indicators, i.e., the composite reliability coefficient (r), the hierarchical omega (ω), the interclass correlation coefficient (ICC), the corrected correlation (r′) between the IQSF score and the FSIQ and GAI scores, the identification accuracy (Cacc). We thus reproduced and adapted the psychometric agreement indicator of Girard and Christensen (2008), with our forms and the full form of the Wechsler scale. This makes it possible to interpret the strength of agreement, which ranges from 0 (absence) to 1 (perfect), between the IQSF score and the FSIQ and GAI scores (Girard et al., 2015).

Results

Description of the sample

All of the participants obtained a GAI score equal to, or greater than 125. The descriptive data from our sample, as well as the mean and standard deviation of the FSIQ, the GAI, and the substests from WISC-IV, are shown in Table 1.
Table 1

Descriptive statistics of the sample.

Intellectually gifted children (n = 117)Typical children (n = 52)dSa
VariablesM (SD)Skew.Kurt.M (SD)Skew.Kurt.
Age (in months)124.65 (12.34)0.31−0.99126.87 (31.88)0.43−0.89−0.109
Gender (%)
   Male73.5078.85
   Female26.5021.15
WISC-IV
   Block design13.46 (2.62)0.43−0.279.85 (2.67)0.460.461.372**
   Similarities17.17 (1.99)−0.65−0.9612.48 (2.32)0.31−0.192.239**
   Digit span12.97 (2.94)0.19−0.1910.08 (2.68)0.11−0.441.013**
   Picture concepts14.30 (2.21)0.820.209.44 (2.34)0.00−1.062.159**
   Vocabulary16.44 (2.02)−0.35−0.8312.25 (2.22)0.11−0.712.014**
   LNS13.13 (2.28)0.060.1810.15 (2.73)−0.471.011.228**
   Matrix reasoning13.72 (2.38)−0.250.2010.75 (1.79)−0.17−0.771.338**
   Comprehension15.63 (2.44)−0.29−0.7111.58 (2.84)−0.632.061.579**
   Coding11.26 (2.96)−0.06−0.059.06 (3.08)0.450.150.734**
   Symbol11.72 (3.00)0.070.149.69 (3.46)−0.520.000.643**
   FSIQ134.92 (9.43)0.03−0.82104.37 (8.35)−0.33−0.683.353**
   GAI137.09 (7.66)0.20−0.29107.58 (5.84)−0.88−0.154.127**

N = 169; LNS, Letter-Number Sequencing; VCI, Verbal Comprehension Index; PRI, Performance Reasoning Index; WMI, Working Memory Index; PSI, Processing Speed Index; FSIQ, Full Scale Intellectual Quotient; GAI, General Aptitude Index;

p < 0.01 with Bonferroni correction.

Cohen's d for two independent samples comparison (Equation 1 from Lakens, .

Descriptive statistics of the sample. N = 169; LNS, Letter-Number Sequencing; VCI, Verbal Comprehension Index; PRI, Performance Reasoning Index; WMI, Working Memory Index; PSI, Processing Speed Index; FSIQ, Full Scale Intellectual Quotient; GAI, General Aptitude Index; p < 0.01 with Bonferroni correction. Cohen's d for two independent samples comparison (Equation 1 from Lakens, . The deviations and equations for each IQSF are presented in Table 2. All of the scores from each 2-subtest and 4-subtest IQSF have skewness and kurtosis coefficient lying between −1 and 1. All of the 2- and 4-subtests forms have an average value greater than 125 in intellectually gifted children and an average value of 107 in typical children. The raw data is available in the Supplementary Material (Data Sheet 1).
Table 2

Mean (SD) of WISC-IV Short-Form Scores and formulas.

Intellectually gifted children (n = 117)Typical children (n = 52)dSaStatistical formulasa
Short FormM (SD)SkewKurtM (SD)SkewKurt
2 Subtests
   SiMr132.77 (9.48)−0.04−0.56109.72 (8.16)−0.320.152.533**3.01 × (Si+Mr) + 39.81
   SiPc135.03 (8.83)−0.34−0.06105.87 (7.95)−0.01−0.773.402**3.05 × (Si+Pc) + 38.92
   SiBd132.12 (9.45)0.16−0.02107.03 (10.16)−0.24−0.212.594**3.02 × (Si+Bd) + 39.59
   VoMr130.47 (9.56)−0.280.06109.00 (7.39)−0.580.302.400**3.00 × (Vo+Mr) + 40.02
   VoPc132.69 (8.89)−0.23−0.40105.15 (9.05)−0.150.163.081**3.04 × (Vo+Pc) + 39.14
   VoBd130.48 (9.93)−0.040.31106.45 (9.27)0.13−0.362.469**3.08 × (Vo+Bd) + 38.45
   CoMr129.22 (10.27)0.20−0.22107.27 (8.81)−0.620.532.229**3.13 × (Co+Mr) + 37.50
   CoPc130.92 (10.93)−0.18−0.04103.17 (8.71)−0.54−0.272.693**3.11 × (Co+Pc) + 37.74
   CoBd129.23 (11.97)0.29−0.47104.57 (9.01)−0.11−0.252.212**3.21 × (Co+Bd) + 35.72
4 Subtests
   SiVoMrPc136.05 (7.83)−0.16−0.10108.21 (7.21)−0.23−0.823.643**1.67 × (Si+Vo+Mr+Pc) + 33.33
   SiVoMrBd134.54 (8.08)0.09−0.35108.85 (7.87)−0.46−0.533.206**1.66 × (Si+Vo+Mr+Bd) + 33.55
   SiVoPcBd136.40 (7.40)0.17−0.06106.84 (7.74)−0.34−0.883.939**1.70 × (Si+Vo+Pc+Bd) + 31.88
   SiCoMrPc135.92 (8.42)0.12−0.10107.33 (6.36)−0.52−0.503.641**1.73 × (Si+Co+Mr+Pc) + 30.99
   SiCoMrBd134.56 (8.67)0.22−0.45108.05 (7.15)−0.48−0.693.220**1.73 × (Si+Co+Mr+Bd) + 30.83
   SiCoPcBd136.31 (8.55)−0.02−0.29105.91 (6.22)−0.75−0.083.841**1.77 × (SI+Co+Pc+Bd) + 29.38
   VoCoMrPc134.50 (8.73)0.09−0.32106.90 (6.57)−0.620.233.394**1.72 × (Vo+Co+Mr+Pc) + 31.32
   VoCoMrBd133.38 (9.10)0.40−0.42107.67 (6.44)−0.58−0.663.068**1.73 × (Vo+Co+Mr+Bd) + 30.66
   VoCoPcBd135.11 (9.01)0.12−0.39105.51 (6.26)−0.720.723.581**1.77 × (Vo+Co+Pc+Bd) + 29.20

n = 169; Si, Similitaries; Vo, Vocabulary; Co, Comprehension; Mr, Matrix Reasoning; Pc, Pictures Concept; Bd, Block Design;

p < 0.01 with Bonferroni correction.

Cohen's d for two independent sample (Equation 1 from Lakens, .

Equation (4) from Tellegen and Briggs (.

Mean (SD) of WISC-IV Short-Form Scores and formulas. n = 169; Si, Similitaries; Vo, Vocabulary; Co, Comprehension; Mr, Matrix Reasoning; Pc, Pictures Concept; Bd, Block Design; p < 0.01 with Bonferroni correction. Cohen's d for two independent sample (Equation 1 from Lakens, . Equation (4) from Tellegen and Briggs (.

Short form WISC-IV estimation with simulated dataset

We realized a comparison between the reliability and validity indicators from real and simulated data in WISC-IV. The composite indicator Rc calculated with our sample and the simulated dataset are highly correlated [r(36) = 0.988]. In addition, there are few differences between each indicator of reliability and validity estimated with our sample and these with the simulated dataset have few differences (Table 3). The near perfect duplication of psychometric quality indicators shows that their estimation with simulated data is very close to those derived from real data. We thus selected the short forms from the simulated dataset.
Table 3

Comparison the principal indexes of reliability and validity estimated with our sample and the simulated dataset.

FSIQGAIRc
rCaccICCrCaccICC
Difference−0.013−0.0170.003−0.0100.0150.014−0.001
r0.906**0.831**0.976**0.981**0.909**0.974**0.988**

N = 36.

;p < 0.01.

Comparison the principal indexes of reliability and validity estimated with our sample and the simulated dataset. N = 36. ;p < 0.01. The 2-subtest and 4-subtest forms are ranked by decreasing value of the composite score Rc, determined to use the reliability index and each validity index (see Table 4). All of the IQSF scores produced by the 2 subtests are significantly correlated with the FSIQ, [0.680; 0.786]; p < 0.01. All of the IQSF scores computed from 4 subtests are also correlated with the FSIQ, r′(1e6) ∈ [0.823; 0.851]; p < 0.01. However, the forms with 4 subtests are significantly more strongly correlated with the FSIQ, respectively r(9e6) = 0.791; r(9e6) = 0.885; 95% CI [−0.094, −0.094], and the GAI, respectively r(9e6) = 0.856; r(9e6) = 0.957; 95% CI [−0.102, −0.102], than with the forms with 2 subtests
Table 4

Reliability and Validity of WISC-IV Short Forms from simulated data.

Short formrccωHFSIQGAIRc
r′ICCCaccr′ICCCacc
2 Subtests
   SiMr0.8770.5500.7860.8200.7970.8330.8820.9090.807
   VoMr0.8710.5610.7830.8180.8040.8260.8770.9070.806
   VoBd0.8450.4840.7710.8130.8280.8010.8620.9070.789
   SiBd0.8580.5400.7590.7970.7910.7910.8480.8720.782
   SiPc0.8250.5070.7410.7880.8130.7950.8640.9190.781
   VoPc0.8190.5180.7370.7860.8180.7880.8600.9180.781
   CoMr0.8360.4370.7410.7840.7950.7900.8530.8970.767
   CoBd0.8060.3470.7290.7790.8200.7650.8370.8980.748
   CoPc0.7870.4490.6800.7360.7860.7370.8190.8820.734
4 Subtests
   SiVoMrBd0.9170.7330.8510.8910.8560.8920.9520.9870.885
   SiVoPcBd0.8980.7230.8400.8890.8740.8870.9590.9960.883
   SiVoMrPc0.9080.7330.8390.8840.8510.8930.9590.9940.883
   VoCoMrBd0.9000.6840.8450.8930.8790.8890.9580.9950.881
   SiCoMrBd0.9030.6920.8440.8900.8710.8900.9570.9940.880
   VoCoPcBd0.8800.6720.8300.8860.8880.8790.9610.9980.874
   SiCoMrPc0.8940.6940.8270.8780.8550.8850.9590.9960.874
   SiCoPcBd0.8830.6770.8290.8830.8810.8800.9590.9970.873
   VoCoMrPc0.8930.7000.8230.8740.8520.8790.9540.9930.871

N = 1,000,000. Si, Similitaries; Vo, Vocabulary; Co, Comprehension; Mr, Matrix Reasoning; Pc, Pictures Concept; Bd, Block Design. Boldfaced values highlight the top three short-forms according to each psychometric measure. All corrected correlation coefficient (r′) are significant at p < 0.01.

Reliability and Validity of WISC-IV Short Forms from simulated data. N = 1,000,000. Si, Similitaries; Vo, Vocabulary; Co, Comprehension; Mr, Matrix Reasoning; Pc, Pictures Concept; Bd, Block Design. Boldfaced values highlight the top three short-forms according to each psychometric measure. All corrected correlation coefficient (r′) are significant at p < 0.01. The short form with the Similarities + Matrix Reasoning (SiMr) subtests has the highest agreement score of all 2 subtest forms, Rc = 0.807. It is most strongly correlated with the FSIQ, r′(1e6) = 0.786; p < 0.01, and GAI score, r′(1e6) = 0.833; p < 0.01. In 80% of cases, the IQSF score for the SiMr form correctly identifies the IGC in our sample, and their IQSF lie within 2 standard errors of measurement of the FSIQ. In 91% of cases, the IQSF score for the SiMr form correctly identifies the IGC in our sample, and their IQSF lie within 2 standard errors of measurement of the GAI. Among the 4-subtest forms, the IQSF scores of Similarities + Vocabulary + Matrix Reasoning + Block Design [SiVoMrBd] form has a higher accession score, Rc = 0.885. It is strongly correlated with the FSIQ, r′(1e6) = 0.851; p < 0.01, and GAI score, r′(1e6) = 0.892; p < 0.01. In 86% of cases, the IQSF score for the SiVoMrBd form correctly identifies the IGC in our sample, and their IQSF lie within 2 standard errors of measurement of the FSIQ. In 99% of cases, the IQSF score for the SiVoMrBd form correctly identifies the IGC in our sample, and their IQSF lie within 2 standard errors of measurement of the GAI.

Identification efficiency of short-form WISC-IV from empirical data

After selecting the three best short forms with 2- and 4-subtests from simulated dataset, we realized the AUC, sensitivity and sensibility of the three best IQSF from empirical dataset. In Table 5, the short forms with 2 subtests and 4 subtests are ranked in descending order of the AUC indicating the predictive performance in identifying the IGC. All short form scales have a high AUCs suggesting a highly predictive performance with 2- and 4-subtests models. The difference among all models is low (ΔAUCs < 0.05).
Table 5

Performance analysis of three best 2- and 4-subtests WISC-IV short-forms.

ShortFormSkewKurtFSIQGAIIQSF125
tdrmatdrmaAUCSensitivitySpecificityFPRFNR
2 Subtests
 SiMr−0.268−0.5850,2150.010−3.903**−0.1560.9700.7440.9810.0190.256
 VoMr−0.230−0.609−2,253*−0.103−7.102**−0.2780.9620.7180.9810.0190.282
 VoBd−0.289−0.524−3,216**−0.151−8.305**−0.3250.9580.6750.9620.0380.325
4 Subtests
 SiVoPcBd−0.335−0.6853.053*0.109−2.367−0.0450.9990.9571.0000.0000.043
 SiVoMrPc−0.180−0.6843.240**0.120−1.486−0.0350.9960.9061.0000.0000.094
 SiVoMrBd−0.466−0.5401.9290.068−3.726**−0.0900.9940.8801.0000.0000.120

N = 169. Si, Similitaries; Vo, Vocabulary; Co, Comprehension; Mr, Matrix Reasoning; Bd, Block Design; AUC, Area Under Curve; FPR, False Positive Rate; FNR; False Negative Rate; Boldfaced values highlight the top of short-forms according to each psychometric measure.

p < 0.05;

p < 0.01.

Cohen's d for correlated samples comparison (Equation 9 from Lakens, .

Performance analysis of three best 2- and 4-subtests WISC-IV short-forms. N = 169. Si, Similitaries; Vo, Vocabulary; Co, Comprehension; Mr, Matrix Reasoning; Bd, Block Design; AUC, Area Under Curve; FPR, False Positive Rate; FNR; False Negative Rate; Boldfaced values highlight the top of short-forms according to each psychometric measure. p < 0.05; p < 0.01. Cohen's d for correlated samples comparison (Equation 9 from Lakens, . Among the forms with 2 subtests, the form with the best performance is still Similitaries + Matrix Reasoning [SiMr]. They allow more than 74% of IGC in our sample to be correctly identified. The SiMr form has the probability of around 1.1% typical children being incorrectly identified as gifted. It has only significant difference with GAI, t(168) = −3.903, p < 0.01, d = −0.156, but it has the smallest effect size among the three best 2-subtests short forms. Among the forms comprising 4 subtests, the IQSF with the best performance consisted of the Similarities + Vocabulary + Picture Concept + Block Design [SiVoPcBd] subtests. It correctly identifies more than 96% of IGC in our sample. No typical children were identified as being gifted, but 4.3% IGC from our sample have not been correctly identified. It has only a significant difference with FSIQ, t(168) = −3.053, p < 0.05, d = −0.109.

Short-form WISC-V estimation from simulated dataset

The 2-subtest and 4-subtest forms are ranked by decreasing value of the composite score Rc (see Table 6). All of the IQSF scores produced by the 2 subtests are significantly correlated with the FSIQ, r′(1e6) ∈ [0.741; 0.833]; p < 0.01. All of the IQSF scores computed from 4 subtests are also correlated with the FSIQ, r′(1e6) ∈ [0.921; 0.940]; p < 0.01.
Table 6

Reliability and Validity of WISC-V Short Forms from simulated data.

Short FormStatistical formulasarccωHFSIQGAI
r′ICCCaccr′ICCCaccRc
2 Subtests
   SiMr2.91 + (Si+Mr) + 41.880.8850.6480.8330.8740.8660.8550.9070.9220.849
   SiBd2.95 + (Si+Bd) + 41.070.8750.6110.8260.8700.8700.8400.8960.9130.838
   SiFw2.93 + (Si+Fw) + 41.480.9110.6300.8290.8610.8040.8610.9010.8800.835
   VoMr2.99 + (Vo+Mr) + 40.240.8820.5710.8260.8660.8560.8490.9010.9140.833
   VoBd3.03 + (Vo+Bd) + 39.370.8710.5290.8190.8630.8630.8340.890.9040.822
   VoFw2.97 + (Vo+Fw) + 40.660.9120.5910.8100.8410.7660.8430.8820.8360.810
   CoMr2.99 + (Co+Mr) + 40.240.8680.5710.7410.7860.7440.7350.7930.7530.749
   CoBd3.10 + (Co+Bd) + 37.980.8500.4610.7500.7990.7850.7340.7980.7840.745
   CoFw3.05 + (Co+Fw) + 38.920.8920.5070.7470.7840.7070.7510.7980.7240.739
4 Subtests
   SiVoMrBd1.61 + (Si+Vo+Mr+Bd) + 35.520.9280.7730.8980.9430.9480.9170.9760.9980.923
   SiVoFwBd1.63 + (Si+Vo+Fw+Bd) + 34.970.9360.7670.9000.9410.9340.9250.9780.9990.922
   SiVoMrFw1.60 + (Si+Vo+Mr+Fw) + 35.980.9400.7820.8960.9350.9110.9250.9750.9970.920
   SiCoFwBd1.67 + (Si+Co+Fw+Bd) + 33.180.9270.7410.8770.9220.9060.8870.9440.9590.895
   SiCoMrBd1.64 + (Si+Co+Mr+Bd) + 34.350.9210.7620.8670.9170.9020.8710.9350.9440.890
   SiCoMrFw1.63 + (Si+Co+Mr+Fw) + 34.700.9330.7670.8670.9090.8690.8810.9360.9340.887
   VoCoFwBd1.68 + (Vo+Co+Fw+Bd) + 32.660.9270.7230.8670.9110.8840.8770.9340.9370.883
   VoCoMrBd1.66 + (Vo+Co+Mr+Bd) + 33.550.9210.7400.8600.9100.8900.8650.9290.9300.881
   VoCoMrFw1.64 + (Vo+Co+Mr+Fw) + 34.200.9330.7530.8570.8990.8460.8710.9260.9100.874

N = 1,000,000. Si, Similitaries; Vo, Vocabulary; Co, Comprehension; Mr, Matrix Reasoning; Fw, Figure Weights; Bd, Block Design. Boldfaced values highlight the top three short-forms according to each psychometric measure. All corrected correlation coefficient (r′) are significant at p < 0.01.

Equation (4) from Tellegen and Briggs (.

Reliability and Validity of WISC-V Short Forms from simulated data. N = 1,000,000. Si, Similitaries; Vo, Vocabulary; Co, Comprehension; Mr, Matrix Reasoning; Fw, Figure Weights; Bd, Block Design. Boldfaced values highlight the top three short-forms according to each psychometric measure. All corrected correlation coefficient (r′) are significant at p < 0.01. Equation (4) from Tellegen and Briggs (. Like at the WISC-IV, the short form with the Similarities + Matrix Reasoning [SiMr] subtests has the highest agreement score of all 2 subtest forms, Rc = 0.849. It is strongly correlated with the FSIQ, r′(1e6) = 0.833; p < 0.01, and GAI score, r′(1e6) = 0.855; p < 0.01. In 87% of cases, the IQSF score for the SiMr form correctly identifies the IGC in our sample, and their IQSF lie within 2 standard errors of measurement of the FSIQ. In 92% of cases, the IQSF score for the SiMr form correctly identifies the IGC in our sample, and their IQSF lie within 2 standard errors of measurement of the GAI. Like at the WISC-IV, the Similarities + Vocabulary + Matrix Reasoning + Block Design [SiVoMrBd] form has a high agreement score, Rc = 0.923. It is strongly correlated with the FSIQ, r′(1e6) = 0.898; p < 0.01, and GAI score, r′(1e6) = 0.917; p < 0.01. In 95% of cases, the IQSF score for the SiVoMrBd form correctly identifies the IGC in our sample, and their IQSF lie within 2 standard errors of measurement of the FSIQ. In 99.8% of cases, the IQSF score for the SiVoMrBd form correctly identifies the IGC in our sample, and their IQSF lie within 2 standard errors of measurement of the GAI.

Discussion

Our aim was to make a short form of the recent versions of Wechsler scales, allowing an intellectual capacity assessment in less than 30 min. In the literature, short forms with 2 or 4 subtests are often used to estimate intellectual capacity. We thus tested all possible short form combinations, comprising either 2 or 4 subtests used for the evaluation of high-level cognitive abilities. The results of the present study indicate that the estimation of reliability and validity indicators with simulated data are very close to them estimated with real data. Ours findings also show the short forms of the WISC-IV can have high performance to identify children on having intellectual giftedness. The 4-subtest forms appear to produce better psychometric results than the 2-subtest forms. In addition, the 4-subtest short forms appear to provide a good compromise between test duration and psychometric qualities, which are more satisfactory than those obtained with 2-subtest forms. The ω coefficient also showed that the general factor explained more variance in the 4-subtest than 2-subtest forms. The 4-subtest forms seemed to be a satisfactory trade-off between an accurate estimation of overall cognitive aptitude and administration time (Gignac, 2015). In spite of a number changes, our results show that the 4-subtest form Similarities + Vocabulary + Matrix Reasoning + Block Design [SiVoMrBd], in the WISC-IV and WISC-V, is globally efficient in the identification of IGC. It appears to evaluate the three most discriminating cognitive abilities in the identification of IGC, i.e., Knowledge-Comprehension, Fluid reasoning, and Visual processing (Volker et al., 2006). Moreover, the composite score using the Similarities and Matrix Reasoning (SiMr) subtests, in the two recent Wechsler scale, appears to provide one of the best 2-subtest forms for the identification of IGC. These two types of short form thus appear to provide acceptable means of identification, for the selection of candidates for complementary evaluations (Prewett, 1995). In the context of learning disabilities, an abbreviated intellectual assessment using Similarities, Vocabulary, Matrix Reasoning, and Block Design subtests can allow clinician psychologists to be less biased an overall cognitive aptitude estimation. Indeed, these subtests seem to be less affected by the specific learning disabilities (e.g., Toffalini et al., 2017a,b). The time gain allows us to add another cognitive assessments such as the complex span tasks to assess working memory capacity (e.g., Gonthier et al., 2017). The Gc, Gf, and Gv abilities evaluated by the subtests of the GAI score appear to provide the best discrimination in the identification of IGC. This is based on the idea that these cognitive abilities appear to be the characteristics of high intellectual potential (Margulies and Floyd, 2009). Thus, psychologists should choose measurements that are well adapted to characteristics that are related to intellectual giftedness (Pierson et al., 2012). Our results reveal the importance of relying on a theoretical model of cognitive ability, with the aim of identifying IGC as a CHC model. In addition, this theoretical model can be very helpful for the schooling of children in general (Aubry and Bourdin, 2016) and IGC in particular (Warne, 2016). Any decision in this respect should not be made solely on the basis of the composite score from the short form. It is also important to recognize the reality of measurement errors, which can prevent the correct identification of IGC (Pierson et al., 2012). It is thus recommended to implement a multidimensional evaluation of IGC's characteristics (McClain and Pfeiffer, 2012). We have shown that some short forms have satisfactory psychometric qualities. However, they have to be accompanied by other assessments such as a teacher rating scale, such as the Gifted Rating Scales–School Form (GRS; Pfeiffer and Jarosewich, 2003) in order to improve the quality of IGC identification (McBee et al., 2013, 2016). Short measurements appear to be reasonable tools for the prediction of global scores of complete batteries of tests, for the identification of children with intellectual giftedness (Newton et al., 2008).

Limitations of our study

Our simulation of data is very close to our empirical data. However, their relation is not perfect. So, it may have some difference with empirical data for WISC-V short forms. In terms of future perspectives, it would be interesting to implement a detailed analysis of the specificity and sensitivity of each short form of WISC-V identified as being reliable and valid for the identification of IGC. Among the limitations of our study, the relatively small number of typical children in our sample means that our results must be considered with caution. We tried to estimate the AUC, sensitivity and specificity of all short forms scale.

Conclusion

The identification of IGC based on the use of standardized tests such as the Wechsler scale is often time-consuming. This drawback can prevent other evaluations from being made, which could be essential for the education of these children with specific needs. The development of a short form of the recent version of Wechsler scale is thus useful for the fast and efficient identification of IGC. This short form would then allow the need for a more in-depth evaluation of various cognitive and socioemotional characteristics to be determined. Our study evaluated nine 2-subtest forms and nine 4-subtest forms, based on the linear method of Tellegen and Briggs (1967). In order to evaluate these different short forms, we computed with simulated datasets several psychometric indicators that were reorganized into groups with an agreement indicator for FSIQ and GAI scores. We also computed the AUC, sensibility and specificity, on the basis of the 117 IGC and 52 typical children in our sample. Our results show that the 4-subtest short form at the WISC-IV and WISC-V, Similarities + Vocabulary + Matrix Reasoning + Block Design [SiVoMrBd], appears to be one of the most reliable forms for the identification of IGC. Among the 2-subtest forms at the WISC-IV and WISC-V, the Similarities + Matrix Reasoning [SiMr] version appears to ensure an optimal compromise between reliability and accuracy, for the estimation of FSIQ and GAI scores. In the case of our sample, this outcome led us to question the usefulness of relying on a theory of cognitive aptitudes such as that of the CHC model, in order to determine the specific cognitive characteristics of IGC. We are of the opinion that the elaboration of a short form should take these specific cognitive characteristics into account, in order to obtain sufficiently accurate identification of IGC.

Ethics statement

The participant's data were retrieved from clinical psychologists as part of their day-to-day clinical practice and a further examination ran with aim of developing a short-form of the Wechsler' scale.

Author contributions

AA and BB designed the study. AA performed data collection and analyzed the data. AA and BB wrote the manuscript.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  35 in total

1.  Estimating the full scale score on the Wechsler adult intelligence scale from scores on four subtests.

Authors:  J E DOPPELT
Journal:  J Consult Psychol       Date:  1956-02

2.  Toward using confidence intervals to compare correlations.

Authors:  Guang Yong Zou
Journal:  Psychol Methods       Date:  2007-12

3.  Short form of Spanish version of the WISC-IV for intelligence assessment in elementary school children.

Authors:  Carmen Dasi; Maria J Soler; Vicente Bellver; Juan C Ruiz
Journal:  Psychol Rep       Date:  2014-12

4.  Further insights on the French WISC-IV factor structure through Bayesian structural equation modeling.

Authors:  Philippe Golay; Isabelle Reverte; Jérôme Rossier; Nicolas Favez; Thierry Lecerf
Journal:  Psychol Assess       Date:  2012-11-12

5.  Old wine in new skins: grouping Wechsler subtests into new scales.

Authors:  A Tellegen; P F Briggs
Journal:  J Consult Psychol       Date:  1967-10

6.  Assessing the accuracy of the WISC-IV seven-subtest short form and the child and adolescent intellectual disability screening questionnaire in identifying intellectual disability in children.

Authors:  Karen McKenzie; Aja L Murray; Kara R Murray; George C Murray
Journal:  Child Neuropsychol       Date:  2013-06-07       Impact factor: 2.500

7.  Wechsler Intelligence Scale for Children-fourth edition (WISC-IV) short-form validity: a comparison study in pediatric epilepsy.

Authors:  Marianne Hrabok; Brian L Brooks; Taryn B Fay-McClymont; Elisabeth M S Sherman
Journal:  Child Neuropsychol       Date:  2012-12-10       Impact factor: 2.500

8.  cocor: a comprehensive solution for the statistical comparison of correlations.

Authors:  Birk Diedenhofen; Jochen Musch
Journal:  PLoS One       Date:  2015-04-02       Impact factor: 3.240

9.  Sustained attention in intellectually gifted children assessed using a continuous performance test.

Authors:  Jiannong Shi; Ting Tao; Wei Chen; Li Cheng; Long Wang; Xingli Zhang
Journal:  PLoS One       Date:  2013-02-25       Impact factor: 3.240

Review 10.  Calculating and reporting effect sizes to facilitate cumulative science: a practical primer for t-tests and ANOVAs.

Authors:  Daniël Lakens
Journal:  Front Psychol       Date:  2013-11-26
View more
  4 in total

1.  Modular cognitive behavioral therapy for autism-related symptoms in children: A randomized controlled trial.

Authors:  Jeffrey J Wood; Karen Sze Wood; An Chuen Cho; Kashia A Rosenau; Maria Cornejo Guevara; Chardée Galán; Alicia Bazzano; Ari S Zeldin; Gerhard Hellemann
Journal:  J Consult Clin Psychol       Date:  2021-02

2.  A Validated WISC-V Short-Form to Estimate Intellectual Functioning in Very Preterm Children at Early School Age.

Authors:  Andone Sistiaga; Joana Garmendia; Jone Aliri; Itxaso Marti; Garazi Labayru
Journal:  Front Psychol       Date:  2021-12-17

3.  Eye-tracking training improves the learning and memory of children with learning difficulty.

Authors:  Agnes S Chan; Tsz-Lok Lee; Sophia L Sze; Natalie S Yang; Yvonne M Y Han
Journal:  Sci Rep       Date:  2022-08-17       Impact factor: 4.996

4.  Balanced Growth project: a protocol of a single-centre observational study on the involvement of the vestibular system in a child's motor and cognitive development.

Authors:  Ruth Van Hecke; Frederik J A Deconinck; Jan R Wiersema; Chloe Clauws; Maya Danneels; Ingeborg Dhooge; Laura Leyssens; Hilde Van Waelvelde; Leen Maes
Journal:  BMJ Open       Date:  2021-06-11       Impact factor: 2.692

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.