| Literature DB >> 27602170 |
David Hessl1, Stephanie M Sansone1, Elizabeth Berry-Kravis2, Karen Riley3, Keith F Widaman4, Leonard Abbeduto1, Andrea Schneider5, Jeanine Coleman3, Dena Oaklander6, Kelly C Rhodes6, Richard C Gershon7.
Abstract
BACKGROUND: Recent advances in understanding molecular and synaptic mechanisms of intellectual disabilities (ID) in fragile X syndrome (FXS) and Down syndrome (DS) through animal models have led to targeted controlled trials with pharmacological agents designed to normalize these underlying mechanisms and improve clinical outcomes. However, several human clinical trials have failed to demonstrate efficacy of these targeted treatments to improve surrogate behavioral endpoints. Because the ultimate index of disease modification in these disorders is amelioration of ID, the validation of cognitive measures for tracking treatment response is essential. Here, we present preliminary research to validate the National Institutes of Health Toolbox Cognitive Battery (NIH-TCB) for ID.Entities:
Keywords: Assessment; Cognition; Down syndrome; FMR1 gene; Fragile X syndrome; Outcome measures
Year: 2016 PMID: 27602170 PMCID: PMC5012003 DOI: 10.1186/s11689-016-9167-4
Source DB: PubMed Journal: J Neurodev Disord ISSN: 1866-1947 Impact factor: 4.025
Performance and parent-report observational (PRO) measures used for convergent validity by NIH-TCB construct in study 3
| Construct | Toolbox task | Validation measures | Type |
|---|---|---|---|
| Cognitive flexibility | Dimensional charge card sort (DCCS) | KiTAP flexibility: errors and median Rxn time | Performance |
| BRIEF-preschool flexibility scale | PRO | ||
| Inhibitory control and visual attention | Flanker | KiTAP go/no-go: errors, median, and SD Rxn time | Performance |
| ABC hyperactivity subscale raw score | PRO | ||
| BRIEF-preschool and school-age inhibit scale | PRO | ||
| KiTAP distractibility: errors, median, and SD Rxn time | Performance | ||
| SWAN attention subscale | PRO | ||
| Receptive vocabulary | Picture vocabulary | PPVT-4 raw score | Performance |
| Letter ID and word reading | Oral reading | WJ-4 letter/word ID raw score | Performance |
| Episodic memory | Picture sequence memory | Leiter-R forward memory raw score | Performance |
| Leiter-R spatial memory picture score | Performance | ||
| Processing speed | Pattern comparison | KiTAP go/no-go: errors, median, and SD Rxn time | Performance |
| Working memory | List sorting | SB-5 verbal working memory | Performance |
| BRIEF-preschool and school-age working memory scale | PRO |
Fig. 1Story board/picture schedule used in study 3 to increase motivation and understanding of assessment visit schedule. The participant prize or a representation of the prize for compliance and effort is placed in the center, and with the examiner’s assistance, the participant checks off each completed task with a dry erase pen. NIH-TCB tasks from top (clockwise) depicted are flanker, dimensional change card sort, picture sequence memory, picture vocabulary, oral reading, pattern comparison, and list sorting
Study 2 feasibility, test-retest reliability, and examination of practice effects
| Feasibility (% of | Number (test-retest) | Visit 1, mean (SD) | Visit 2, mean (SD) |
|
| Cohen’s | ICC | |
|---|---|---|---|---|---|---|---|---|
| DCCS | 84 | 17 | 3.32 (2.35) | 3.89 (2.27) | −1.51 | .15 | .13 | .88 |
| Flanker | 82 | 17 | 4.44 (1.98) | 4.33 (2.33) | 0.23 | .82 | .23 | .75 |
| List sorting | 66 | 14 | 9.36 (4.31) | 8.00 (4.47) | −1.56 | .14 | .31 | .84 |
| Oral reading | 90 | 17 | −3.94 (3.18) | −3.88 (3.31) | 0.71 | .49 | .03 | .99 |
| Pattern comparison | 90 | 16 | 29.53 (13.47) | 35.35 (12.86) | 2.08 | .01 | .44 | .90 |
| Picture sequence memory | 78 | 17 | −1.39 (0.72) | −1.12 (1.05) | 1.45 | .18 | .46 | .76 |
| Picture vocabulary | 92 | 17 | −2.50 (2.13) | −3.05 (1.94) | −1.27 | .22 | .27 | .77 |
Test-retest reliability measured by intraclass correlation (ICC). Practice effects measured by paired samples t tests. Effect size of difference measured by Cohen’s d
Study 2 ecological validity
| Chronological age | Mental agea | FSIQb | Adaptive behavior composite | |
|---|---|---|---|---|
| Dimensional change card sort | .23 | .40** | .53** | .27 |
| Flanker | −.02 | .59*** | .77*** | .59** |
| Picture vocabulary | .32* | .22 | .38 | .26 |
| Oral reading | .04 | .61*** | .71*** | .73*** |
| Picture sequence memory | .03 | .14 | .44* | .44* |
| Pattern comparison | .19 | .24 | .50** | .10 |
| List sorting | .28 | .54** | .61** | .50* |
| NIH-TCB cognitive composite | – | – | .75*** | .61** |
Data shown are Pearson’s correlations. Note: NIH-TCB age-adjusted z deviation scores were used to correlate with age-adjusted standardized measures (IQ, adaptive behavior), whereas computed or theta scores (unadjusted) were used for correlations with mental and chronological age
*p < .05; **p < .01; ***p < .001
aIn study 2, mental age was either the actual mental age generated from individual IQ tests when test results were available or it was estimated by the examiner
bIn study 2, IQ scores (Leiter-R or Stanford-Binet 5) were available from prior recent records in a subset of the sample (n = 25)
Study 3 descriptive information by diagnostic group
| Total | DS + ID | FXS + ID | IID | |||||
|
| 45 | 19 | 10 | 16 | ||||
| Gender (% male) | 71.1 | 47.4 | 90.0 | 87.5 | ||||
| Primary caregiver education (% with at least a 4-year college degree) | 65.9 | 68.4 | 50.0 | 73.3 | ||||
|
| SD |
| SD |
| SD |
| SD | |
| Chronological age | 15.80 | 5.69 | 15.01 | 6.32 | 16.81 | 5.73 | 16.13 | 5.04 |
| Mental age equivalent (Stanford-Binet 5) | 5.24 | 1.90 | 4.71 | 1.42 | 4.05 | 0.59 | 6.53 | 2.19 |
| Full-scale deviation IQ score (Stanford-Binet 5; [ | 53.04 | 17.14 | 49.76 | 13.51 | 40.29 | 12.92 | 64.90 | 16.44 |
| VABS-2 adaptive behavior composite | 59.20 | 16.73 | 62.58 | 11.21 | 49.80 | 16.82 | 61.20 | 20.82 |
Study 3 feasibility, convergent validity, and discriminant validity
| Feasibility (% valid) | Convergent validity | Discriminant validity | ||||
|---|---|---|---|---|---|---|
| ID combined | Age 3–15, general populationa | ID combined (study 3) | Age 3–6, general populationa | ID combined (study 3) | Age 3–6, general populationa | |
| DCCS | 72.2 % | 79.8 % | −.51* | .69*** | .65*** | .79*** |
| KiTAP flexibility errors | WPPSI-III block design | PPVT-4 | ||||
| Flanker | 77.8 % | 83.7 % | −.61*** | .60*** | .61*** | .67*** |
| KiTAP distractibility | WPPSI-III block design | PPVT-4 | ||||
| Picture vocabularyb | 100.0 % | 83.7 % | .92*** | .90*** | .53*** | .53*** |
| PPVT-4 | Leiter-R forward + Spatial memory | BVMT-R + RAVLT | ||||
| Oral readingb | 94.4 % | 98.1 % | 91*** | .96*** | .56*** | .53*** |
| WJ-4 letter/word ID | WRAT-IV | Leiter-R forward + Spatial memory | BVMT-R + RAVLT | |||
| Picture sequence memory | 88.9 % | 98.1 % | .57*** | .50*** | .64*** | .58*** |
| Leiter-R forward + Spatial memory | NEPSY-II sentence repetition | PPVT-4 | ||||
| Pattern comparison | 66.7 % | 94.4 % | −.40 | .43*** | .49** | .44*** |
| KiTAP go/no-go median RT | WPPSI-III processing speed | PPVT-4 | ||||
| List sorting | 52.8 % | 95.7 % | .76*** | .57*** | .76*** | .63*** |
| SB-5 verbal WM | NEPSY-II sentence | PPVT-4 | ||||
Validity data shown are Pearson’s correlations
*p < .05; **p < .01; ***p < .001
aZelazo and Bauer [59]
bEstimates reported for the general population are aggregated across the entire child and adolescent sample ranging from 3 to 15 years old
Proportion of participants (%) by mental age group able to complete tests yielding scores judged by examiners and by data review to be valid (combined studies 2 and 3)
| Mental age group | Number | DCCS | Flanker | Picture vocabulary | Oral reading | Picture sequence memory | Picture comparison | List sorting |
|---|---|---|---|---|---|---|---|---|
| 3 | 26 | 65.4 | 76.9 | 88.5 | 88.5 | 61.5 | 69.2 | 34.6 |
| 4 | 20 | 85.0 | 80.0 | 90.0 | 90.0 | 85.0 | 90.0 | 50.0 |
| 5–6 | 25 | 84.0 | 88.0 | 92.0 | 92.0 | 96.0 | 84.0 | 76.0 |
| 7–8 | 12 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| 9+ | 5 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
Note that in study 2, mental age was estimated based on chart review and examiner estimation. In study 3, mental age was measured by Stanford-Binet 5 IQ testing
Study 3 test-retest reliability and examination of practice effects
| Number | Visit 1, mean (SD) | Visit 2, mean (SD) |
|
| Cohen’s | Test-retest reliability, (21–49 days) | Test-retest reliability, general population, 3–15 years, (7–21 days)b | |
|---|---|---|---|---|---|---|---|---|
| ICC | ICC | |||||||
| Dimensional change card sort | 27 | 3.52 (3.46) | 4.29 (3.25) | −1.31 | .20 | .25 | .74 | .92 |
| Flanker | 32 | 4.59 (3.12) | 4.38 (3.19) | .83 | .41 | .15 | .94 | .92 |
| List sorting | 26 | 7.50 (4.11) | 8.30 (4.47) | -.85 | .40 | .36 | .93 | .86 |
| Oral reading | 34 | 901.21 (622.51) | 916.91 (604.02) | -.29 | .78 | .05 | .93 | .97 |
| Pattern comparison | 20 | 28.65 (11.49) | 30.08 (13.17) | −1.62 | .12 | .17 | .86 | .84 |
| Picture sequence memory | 30 | 394.84 (99.23) | 404.15 (106.48) | -.38 | .70 | .07 | .28a | .76 |
| Picture vocabulary | 36 | 914.08 (310.58) | 885.14 (294.72) | 1.16 | .25 | .19 | .94 | .81 |
Test-retest reliability measured by intraclass correlation (ICC). Practice effects measured by paired samples t tests. Effect size of difference measured by Cohen’s d
aUnlike study 2, alternate forms A and B were used at each visit (random order)
bZelazo and Bauer [59]
Study 3 ecological validity
| Chronological age | Mental age equivalent | FSIQ (deviation) | Adaptive behavior composite | |
|---|---|---|---|---|
| Dimensional change card sort | .31 | .72*** | .66*** | .33 |
| Flanker | .27 | .61*** | .70*** | .36* |
| Picture vocabulary | .48** | .67*** | .70*** | .52** |
| Oral reading | .39* | .62*** | .71*** | .42** |
| Picture sequence memory | .34* | .55** | .57*** | .16 |
| Pattern comparison | .17 | .45* | .46** | .20 |
| List sorting | .57** | .49* | .52* | -.03 |
| NIH-TCB cognitive composite | – | – | .89*** | .42* |
Data shown are Pearson’s correlations
NIH-TCB age-adjusted z deviation scores were used to correlate with age-adjusted standardized measures (IQ, adaptive behavior), whereas computed or theta scores (unadjusted) were used for correlations with mental and chronological age
*p < .05; **p < .01; ***p < .001
Fig. 2Scatterplot showing the association between the NIH-TCB cognitive composite and Stanford-Binet full-scale IQ (z deviation method). Dotted lines represent the 95 % confidence interval around the regression line. Note that the regression line and correlation (−12.76 + 1.17×; R 2 = .79) show that the composite is a strong predictor of IQ in these samples of individuals with ID
Fig. 3Z scores (±1 SEM) of each NIH-TCB subtest by group. Z scores (age-adjusted) reflect the number of standard deviations from the average (0 for all subtests) in the normative sample from the general population. For example, the FXS + ID sample had a mean performance on flanker that is greater than 7 standard deviations below average, adjusted for age. Note that for picture sequence memory, only data from form A is shown