| Literature DB >> 35465492 |
Sietske Walda1, Fred Hasselman1, Anna Bosman1.
Abstract
Research based on traditional linear techniques has yet not been able to clearly identify the role of cognitive skills in reading problems, presumably because the process of reading and the factors that are associated with reading reside within a system of multiple interacting and moderating factors that cannot be captured within traditional statistical models. If cognitive skills are indeed indicative of reading problems, the relatively new nonlinear techniques of machine learning should make better predictions. The aim of the present study was to investigate whether cognitive factors play any role in reading skill, questioning (1) the extent to what cognitive skills are indicative of present reading level, and (2) the extent to what cognitive skills are indicative of future reading progress. In three studies with varying groups of participants (average school-aged and poor readers), the results of four supervised machine learning techniques were compared to the traditional General Linear Models technique. Results of all models appeared to be comparable, producing poor to acceptable results, which are however inadequate for making a thorough prediction of reading development. Assumably, cognitive skills are not predictive of reading problems, although they do correlate with one another. This insight has consequences for scientific theories of reading development, as well as for the prevention and remediation of reading difficulties.Entities:
Keywords: cognitive skills; dyslexia; machine learning; reading development; word decoding and reading outcomes
Year: 2022 PMID: 35465492 PMCID: PMC9025592 DOI: 10.3389/fpsyg.2022.869352
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Results of meta-analyses on the relation between reading level and cognitive skills, and reading progress and cognitive skills.
| Article | Measure | Study aim | Sample characteristics | Cognitive skill |
|
| |
|
| Reading level and progress | Concurrent and longitudinal (after > 1 year) correlation with word recognition, composite reading or reading comprehension | Predominantly mixed samples; few learning disabled/disabled reader; aged Kindergarten 2, Grade 3-12, and other. | General academic achievement | 14 / 14 | 0.85 / 0.74 | n.s. |
|
| Reading level | Concurrent correlation with word reading (real and pseudowords) | Average and poor readers; | Spelling | 6 | 0.78 | 0.00 |
|
| Reading level | Concurrent correlation with text decoding | Average school-aged; | Phoneme awareness | 7 | 0.56 | 67.64 |
|
| Reading level | Comparison of children with and without dyslexia (reading level-matched [RL]and chronological age-matched [CA]) | Children with dyslexia and reading level controls (<13 years) | Phonological awareness | 19 / 19 | –0.21 /–0.49 | 87.31 / 83.52 |
|
| Reading level | Comparison of children with and without dyslexia | Dyslexic readers; age-matched controls; | Rapid automatized naming: | |||
|
| Reading level | Comparison of children with and without dyslexia | Children with and without dyslexia; (age 5–18) | Inhibition | |||
|
| Reading progress | Correlation with future (after 1, 2, or 3 years of instruction) word reading, composite reading score or rarely reading comprehension | Unselected samples, few high risk samples; | Concepts of print | 7 | 0.46 | 0.00 |
|
| Reading progress | Treatment effectiveness of early literacy interventions | Students at risk for reading disabilities; | Rapid naming | 7 | 0.47 | 0.00 |
|
| Reading progress | Comparing reading level of responders and low-responders during interventions in reading: | Children at risk for reading disabilities; | General IQ | 2/9 | 0.36 / 0.11 | 0.00 / 91.56 |
n.s. = not specified.
FIGURE 1Schematic overview of conducted studies and aims, research questions, and model building techniques that were involved. RQ, research question.
Confusion matrix for positive class and negative class allocation by the models.
| Group membership according to model | Group membership according to word decoding test | Performance statistic (usefulness) | |
|
| |||
| Lowest decoding | Not lowest decoding | ||
| Lowest decoding | TP | FP | PP |
| Not lowest decoding | FN | TN | NP |
| Performance statistic (identification) | SE | SP | Accuracy |
TP, True positives; FP, False positives; FN, False negatives; TN, True negatives; PP, Positive predictive value; NP, Negative predictive value; SE, Sensitivity; SP, Specificity.
Descriptive statistics for input variables and output variable of the models (n = 2009).
|
|
|
| |
|
| |||
| Nonword repetition | 11–40 | 33.72 | 4.77 |
| Naming speed – digits | 43–166 | 96.25 | 17.09 |
| Naming speed – letters | 1–157 | 91.79 | 19.13 |
| Naming speed – pictures | 5–129 | 61.82 | 11.93 |
| Phoneme segmentation | 0–20 | 18.55 | 3.07 |
| Phoneme manipulation | 0–20 | 18.55 | 2.52 |
|
| |||
| Word decoding efficiency | 0.5–120.75 | 57.47 | 20.88 |
Confidence intervals of summary statistics for the predictive ability of the models built with five machine learning techniques.
| Technique | PP 95% CI | NP 95% CI | SE 95% CI | SP 95% CI | Acc 95% CI | AUC95% CI | |
| Neural network | [0.32, 0.34] | [0.94, 0.95] | [0.61, 0.65] | [0.83, 0.84] | [0.81, 0.82] | [0.29, 0.34] | [0.77, 0.79] |
| K-nn | [0.27, 0.30] | [0.94, 0.95] | [0.58, 0.62] | [0.83, 0.83] | [0.80, 0.81] | [0.27, 0.30] | [0.73, 0.75] |
| Random Forests | [0.32, 0.35] | [0.93, 0.94] | [0.57, 0.61] | [0.83, 0.84] | [0.80, 0.81] | [0.30, 0.33] | [0.78, 0.79] |
| Xg-boost | [0.34, 0.37] | [0.92, 0.93] | [0.55, 0.59] | [0.84, 0.84] | [0.80, 0.81] | [0.31, 0.34] | [0.77, 0.79] |
| GLM | [0.25, 0.28] | [0.96, 0.96] | [0.64, 0.68] | [0.82, 0.83] | [0.81, 0.81] | [0.27, 0.30] | [0.77, 0.79] |
PP, positive predictive value; NP, negative predictive value; SE, sensitivity; SP, specificity; Acc, accuracy; κ, Kappa; AUC, area under the ROC; CI, confidence interval.
FIGURE 2ROC-curves for the predictive ability of the models in all three studies built with five machine learning techniques.
Descriptive statistics for input variables and output variable of the models.
| Variable | Study 2a | Study 2b | ||||
|
|
| |||||
|
|
|
|
|
|
| |
|
| ||||||
| Grapheme-phoneme identification | ||||||
| Speed | 1.28–5.60 | 2.54 | 0.58 | 1.28–4.99 | 2.58 | 0.57 |
| Accuracy | 35.56–100 | 87.26 | 8.66 | 53.33–100.00 | 86.61 | 8.19 |
|
| ||||||
| Speed | 0.83–19.44 | 1.85 | 1.04 | 0.00–3.39 | 1.80 | 0.52 |
| Accuracy | 17.78–96.67 | 81.39 | 10.02 | 0.98–95.56 | 80.40 | 11.74 |
|
| ||||||
| Digits | 3.54–42.58 | 10.39 | 3.03 | 6.51–42.58 | 10.57 | 3.45 |
| Letters | 3.78–20.63 | 11.25 | 2.53 | 7.14–19.28 | 11.32 | 2.37 |
| Pictures | 3.71–31.22 | 15.66 | 3.54 | 9.72–31.22 | 15.60 | 3.41 |
| Vocabulary | 57–144 | 110.11 | 9.96 | 85–144 | 110.84 | 9.26 |
| Nonverbal reasoning | 5–119 | 19.90 | 6.78 | 5–119 | 20.46 | 9.22 |
|
| ||||||
| Forward | 6–36 | 24.15 | 3.83 | 11–32 | 24.25 | 3.92 |
| Backward | 3–39 | 11.05 | 3.74 | 5–30 | 11.17 | 3.65 |
| Block recall | 2–37 | 25.58 | 4.45 | 2–37 | 25.76 | 4.41 |
|
| ||||||
| Reproduction | 16–61 | 39.86 | 8.47 | 16–61 | 40.21 | 8.26 |
| Recall | 1–15 | 8.51 | 2.71 | 1–15 | 8.47 | 2.82 |
|
| ||||||
| Word decoding efficiency | 0–6 | 1.45 | 1.29 | 0–5 | 1.50 | 1.49 |
Confidence intervals of summary statistics for the predictive ability of the models built with five machine learning techniques in study 2a.
| Technique | PP95% CI | NP95% CI | SE95% CI | SP95% CI | Acc95% CI | AUC95% CI | |
| Neural network | [0.33, 0.45] | [0.59, 0.71] | [0.52, 0.58] | [0.51, 0.54] | [0.51, 0.53] | [0.02, 0.06] | [0.55, 0.57] |
| K-nn | [0.50, 0.54] | [0.53, 0.57] | [0.52, 0.55] | [0.52, 0.55] | [0.52, 0.55] | [0.04, 0.09] | [0.56, 0.58] |
| Random forests | [0.33, 0.45] | [0.59, 0.71] | [0.52, 0.58] | [0.51, 0.54] | [0.51, 0.53] | [0.02, 0.06] | [0.60, 0.63] |
| Xg-boost | [0.52, 0.56] | [0.57, 0.61] | [0.55, 0.59] | [0.55, 0.57] | [0.55, 0.58] | [0.10, 0.15] | [0.57, 0.60] |
| GLM | [0.48, 0.53] | [0.58, 0.62] | [0.54, 0.58] | [0.53, 0.56] | [0.54, 0.57] | [0.07, 0.13] | [0.56, 0.59] |
PP, positive predictive value; NP, negative predictive value; SE, sensitivity; SP, specificity; Acc, accuracy; κ, Kappa; AUC, area under the ROC; CI, confidence interval.
Confidence intervals of summary statistics for the predictive ability of the models built with five machine learning techniques in study 2b.
| Technique | PP95% CI | NP95% CI | SE95% CI | SP95% CI | Acc95% CI | AUC95% CI | |
| Neural network | [0.70, 0.79] | [0.20, 0.27] | [0.51, 0.53] | [0.42, 0.53] | [0.49, 0.53] | [–0.05, 0.02] | [0.54, 0.58] |
| K-nn | [0.59, 0.66] | [0.34, 0.40] | [0.51, 0.54] | [0.44, 0.50] | [0.48, 0.53] | [–0.05, 0.04] | [0.56, 0.60] |
| Random forests | [0.53, 0.60] | [0.37, 0.45] | [0.50, 0.54] | [0.42, 0.48] | [0.47, 0.51] | [–0.07, 0.02] | [0.56, 0.60] |
| Xg-boost | [0.48, 0.55] | [0.38, 0.44] | [0.47, 0.52] | [0.40, 0.46] | [0.44, 0.49] | [–0.12, –0.03] | [0.58, 0.62] |
| GLM | [0.58, 0.64] | [0.34, 0.41] | [0.50, 0.54] | [0.42, 0.49] | [0.47, 0.52] | [–0.07, 0.03] | [0.59, 0.63] |
PP, positive predictive value; NP, negative predictive value; SE, sensitivity; SP, specificity; Acc, accuracy; κ, Kappa; AUC, area under the ROC; CI, confidence interval.
Descriptive statistics for raw scores on input variables and output variable of the model.
| Variable | Study 3a | Study 3b | ||||
|
|
| |||||
|
|
|
|
|
|
| |
|
| ||||||
| Rhyme | 0–28 | 17.67 | 5.11 | 0–28 | 17.75 | 4.96 |
| Rhyme prime | 2–26 | 12.54 | 5.05 | 2–26 | 12.54 | 5.04 |
| Auditory synthesis | 0–24 | 13.42 | 4.98 | 0–24 | 13.43 | 4.98 |
| Phoneme deletion | 0–29 | 12.47 | 7.98 | 0–29 | 12.41 | 7.98 |
| Letter naming | 0–20 | 7.71 | 6.52 | 0–20 | 7.73 | 6.49 |
|
| ||||||
| Word decoding efficiency | 4–68 | 16.16 | 10.39 | 4–74 | 27.32 | 14.16 |
Confidence intervals of summary statistics for the predictive ability of the models built with five machine learning techniques in study 3a.
| Technique | PP95% CI | NP95% CI | SE95% CI | SP95% CI | Acc95% CI | AUC95% CI | |
| Neural network | [0.10, 0.15] | [0.88, 0.93] | [0.20, 0.30] | [0.80, 0.80] | [0.79, 0.80] | [0.00, 0.01] | [0.64, 0.66] |
| K-nn | [0.13, 0.17] | [0.91, 0.96] | [0.41, 0.53] | [0.81, 0.82] | [0.78, 0.80] | [0.10, 0.15] | [0.60, 0.63] |
| Random Forests | [0.20, 0.25] | [0.88, 0.93] | [0.39, 0.46] | [0.82, 0.83] | [0.77, 0.79] | [0.14, 0.20] | [0.63, 0.67] |
| Xg-boost | [0.17, 0.22] | [0.84, 0.90] | [0.28, 0.35] | [0.81, 0.82] | [0.74, 0.76] | [0.07, 0.13] | [0.63, 0.66] |
| GLM | [0.08, 0.11] | [0.94, 0.98] | [0.42, 0.57] | [0.81, 0.81] | [0.79, 0.80] | [0.05, 0.10] | [0.67, 0.71] |
PP, positive predictive value; NP, negative predictive value; SE, sensitivity; SP, specificity; Acc, accuracy; κ, Kappa; AUC, area under the ROC; CI, confidence interval.
Confidence intervals of summary statistics for the predictive ability of the models built with five machine learning techniques in study 3b.
| Technique | PP95% CI | NP95% CI | SE95% CI | SP95% CI | Acc95% CI | AUC95% CI | |
| Neural network | [0.36, 0.43] | [0.72, 0.80] | [0.40, 0.49] | [0.75, 0.76] | [0.75, 0.76] | [0.00, 0.01] | [0.65, 0.67] |
| K-nn | [0.20, 0.24] | [0.91, 0.93] | [0.48, 0.58] | [0.78, 0.79] | [0.74, 0.76] | [0.14, 0.19] | [0.63, 0.66] |
| Random Forests | [0.19, 0.23] | [0.91, 0.93] | [0.46, 0.56] | [0.78, 0.79] | [0.74, 0.76] | [0.14, 0.19] | [0.61, 0.64] |
| Xg-boost | [0.23, 0.28] | [0.84, 0.87] | [0.35, 0.41] | [0.78, 0.79] | [0.70, 0.72] | [0.10, 0.15] | [0.61, 0.64] |
| GLM | [0.07, 0.10] | [0.93, 0.95] | [0.28, 0.40] | [0.76, 0.76] | [0.73, 0.74] | [0.01, 0.05] | [0.67, 0.70] |
PP, positive predictive value; NP, negative predictive value; SE, sensitivity; SP, specificity; Acc, accuracy; κ, Kappa; AUC, area under the ROC; CI, confidence interval.