| Literature DB >> 33269446 |
Giacomo Bignardi1, Edwin S Dalmaijer2, Alexander Anwyl-Irvine2, Duncan E Astle2.
Abstract
Collecting experimental cognitive data with young children usually requires undertaking one-on-one assessments, which can be both expensive and time-consuming. In addition, there is increasing acknowledgement of the importance of collecting larger samples for improving statistical power Button et al. (Nature Reviews Neuroscience 14(5), 365-376, 2013), and reproducing exploratory findings Open Science Collaboration (Science, 349(6251), aac4716-aac4716 2015). One way both of these goals can be achieved more easily, even with a small team of researchers, is to utilize group testing. In this paper, we evaluate the results from a novel tablet application developed for the Resilience in Education and Development (RED) Study. The RED-app includes 12 cognitive tasks designed for groups of children aged 7 to 13 to independently complete during a 1-h school lesson. The quality of the data collected was high despite the lack of one-on-one engagement with participants. Most outcomes from the tablet showed moderate or high reliability, estimated using internal consistency metrics. Tablet-measured cognitive abilities also explained more than 50% of variance in teacher-rated academic achievement. Overall, the results suggest that tablet-based, group cognitive assessments of children are an efficient, reliable, and valid method of collecting the large datasets that modern psychology requires. We have open-sourced the scripts and materials used to make the application, so that they can be adapted and used by others.Entities:
Keywords: Childhood; Cognition; Group testing; Reliability; Tablet; Validity
Mesh:
Year: 2020 PMID: 33269446 PMCID: PMC7710155 DOI: 10.3758/s13428-020-01503-3
Source DB: PubMed Journal: Behav Res Methods ISSN: 1554-351X
Fig. 1.Screenshots from the novel tablet assessments
Descriptive statistics for tablet cognitive assessments
| Task name | Predictive validity | Reliability | Time taken | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| LB | UB | ω | LB | UB | 10% | 50% | 90% | ||||
| Visual Search Speed | 526 | – .38 | – .46 | – .30 | – .35 | .74 | .68 | .78 | 1.59 | 2.18 | 3.30 |
| Verbal Short-Term Memory | 532 | .43 | .35 | .51 | .40 | 2.28 | 4.36 | 6.82 | |||
| Spatial Short-Term Memory | 532 | .45 | .37 | .52 | .41 | 1.55 | 3.05 | 5.30 | |||
| G/N-G - D' | 362 | .18 | .06 | .30 | .19 | .53 | .43 | .62 | 1.70 | 4.32 | 4.68 |
| G/N-G - Omission Errors | 362 | – .41 | – .50 | – .30 | – .36 | .71 | .65 | .77 | 1.70 | 4.32 | 4.68 |
| G/N-G - Commission Errors | 362 | – .13 | – .24 | – .01 | – .07 | .60 | .51 | .67 | 1.70 | 4.32 | 4.68 |
| Matrix Reasoning | 515 | .48 | .40 | .55 | .46 | .76 | .73 | .78 | 2.55 | 4.17 | 6.38 |
| Reading Fluency | 535 | .63 | .57 | .69 | .66 | .87 | .85 | .89 | 3.15 | 3.20 | 3.32 |
| Rhyme Judgement | 488 | .27 | .18 | .36 | .26 | .68 | .59 | .76 | 1.11 | 3.52 | 4.25 |
| Phonological Discrimination | 275 | .26 | .12 | .39 | .23 | .72 | .67 | .78 | 4.73 | 5.32 | 6.54 |
| Arithmetic Fluency | 535 | .55 | .48 | .61 | .52 | .89 | .87 | .91 | 3.48 | 3.53 | 3.65 |
| Non-Symbolic Num. Discrim. | 515 | .45 | .37 | .53 | .42 | .85 | .83 | .86 | 3.32 | 4.00 | 4.72 |
| Line Estimation | 193 | .53 | .42 | .63 | .53 | .92 | .90 | .94 | 2.20 | 3.22 | 4.44 |
| Liquid Equalization | 348 | .42 | .32 | .51 | .41 | .88 | .85 | .90 | 3.02 | 3.53 | 4.21 |
Note: this table only includes data from the school cohort. Predictive validity was assessed by the linear Pearson correlation with teacher-rated academic ability, along with the lower and upper bounds of a 95% confidence interval (LB, UB). We also report the standardized regression coefficient (b*) for each task predicting academic ability whilst accounting for age and normalized neighborhood deprivation, in a multivariable regression performed separately for each task. The 10%, 50% (median) and 90% percentiles of time taken in minutes to complete each task are reported in the final three columns.
Assessment of measurement invariance across the group-tested (N = 92, school) and individually tested (N = 535, laboratory) cohorts
| Models | χ2 | Comparative Fit Index (CFI) | Root mean square error of approximation | AIC | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CFI | Δ | LB | UB | RMSEA | Δ | LB | UB | |||||
| 1. Configural | 63.4 | 28 | .971 | .0635 | 11012 | |||||||
| 2. Loadings | 73.8 | 34 | .107 | .967 | – 0.004 | – .024 | .001 | .0611 | – 0.002 | – .010 | .010 | 11010 |
| 3. Intercepts | 98.3 | 40 | <.001 | .952 | – 0.015 | – .038 | – .005 | .0682 | 0.007 | – .004 | .018 | 11023 |
| 4. Residuals | 107 | 47 | .273 | .951 | – 0.001 | – .023 | .001 | .0638 | – 0.004 | – .008 | .005 | 11017 |
| 5. Means | 113 | 48 | .015 | .947 | – 0.004 | – .014 | .001 | .0657 | 0.002 | – .001 | .006 | 11021 |
Note: The lower (LB) and upper bounds (UB) of the 95% confidence interval for changes (Δ) in CFI & RSMEA are reported, calculated using non-parametric bootstrap resampling (3000 repeats). Random sampling was performed within each group separately for each iteration.
Fig. 2.Test reliability function, modeling measurement error as a function of latent ability, rather than a constant
Fig. 3.Pearson correlation matrix of primary outcomes, including all RED participants from both school and laboratory cohorts
Fig. 4.Parallel analysis for all tasks, using data from school cohort only. Note. The bottom purple line represents successive factor analysis eigenvalues, and the top orange line for principal components. The dashed lines represent eigenvalues taken from resampled datasets.