| Literature DB >> 33264301 |
Luz Rello1, Ricardo Baeza-Yates2, Abdullah Ali3, Jeffrey P Bigham4, Miquel Serra5.
Abstract
Dyslexia is a specific learning disorder related to school failure. Detection is both crucial and challenging, especially in languages with transparent orthographies, such as Spanish. To make detecting dyslexia easier, we designed an online gamified test and a predictive machine learning model. In a study with more than 3,600 participants, our model correctly detected over 80% of the participants with dyslexia. To check the robustness of the method we tested our method using a new data set with over 1,300 participants with age customized tests in a different environment -a tablet instead of a desktop computer- reaching a recall of over 78% for the class with dyslexia for children 12 years old or older. Our work shows that dyslexia can be screened using a machine learning approach. An online screening tool in Spanish based on our methods has already been used by more than 200,000 people.Entities:
Year: 2020 PMID: 33264301 PMCID: PMC7710040 DOI: 10.1371/journal.pone.0241687
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
and or
Cognitive indicators used in the creation of test exercises.
| Language Skills | Working Memory |
|---|---|
| Alphabetic Awareness | Visual (alphabetical) |
| Phonological Awareness | Auditory (phonology) |
| Syllabic Awareness | Sequential (auditory) |
| Lexical Awareness | Sequential (visual) |
| Morphological Awareness | |
| Syntactic Awareness | Activation and Attention |
| Semantic Awareness | Sustained Attention |
| Orthographic Awareness | Simultaneous Attention |
| Visual Discrimination and Categorization | |
| Auditory Discrimination and Categorization | |
Fig 1Examples of four test questions: Find ‘d’ among ‘b’, ‘p’, and ‘q (top left); build a correct word (‘nadie’, ‘nobody’) by substituting one letter (top right); re-order the letter to write a correct (‘siete’, ‘seven’) (bottom left); and find the word ‘boda’ (‘wedding’) (bottom right).
The instructions of the game were given via prerecorded voice prompts.
Characteristics of the data sets (age range).
| Data set | Participants | Ave. Age | Dyslexia | Female | Male |
|---|---|---|---|---|---|
| A1 (7-17) | 3,644 | 10.90 | 10.8% | 49.2% | 50.8% |
| A2 (9-17) | 2,733 | 11.45 | 12.0% | 49.4% | 50.6% |
| A3 (7-11) | 2,539 | 9.12 | 10.1% | 49.3% | 50.7% |
| A4 (9-11) | 1,628 | 9.97 | 11.9% | 49.6% | 50.4% |
| A5 (12-17) | 1,105 | 13.62 | 12.2% | 49.0% | 51.0% |
| A6 (7-8) | 911 | 7.60 | 6.9% | 48.7% | 51.3% |
,
Confusion matrix for the main predictive model.
| Predicted | ||
|---|---|---|
| Class | Dyslexia | Control |
| Dyslexia | 316 | 76 |
| Control | 684 | 2568 |
Results for the different data sets.
| Data set (age range) | Size | Accuracy (%) | Recall (Dys., %) | Precision (Dys., %) | ROC | Threshold |
|---|---|---|---|---|---|---|
| A1 (7-17) | 3644 | 79.8 | 80.6 | 79.3 | 0.873 | 0.240 |
| A2 (9-17) | 2734 | 80.1 | 79.9 | 80.1 | 0.878 | 0.260 |
| A3 (7-11) | 2540 | 80.8 | 80.9 | 80.7 | 0.868 | 0.250 |
| A4 (9-11) | 1629 | 81.6 | 82.0 | 81.4 | 0.878 | 0.275 |
| A5 (12-17) | 1106 | 77.0 | 77.0 | 77.0 | 0.851 | 0.245 |
| A6 (7-8) | 912 | 69.2 | 69.8 | 69.0 | 0.782 | 0.150 |
| Female | 1793 | 78.3 | 76.8 | 79.2 | 0.855 | 0.240 |
| Male | 1851 | 76.8 | 76.7 | 76.8 | 0.856 | 0.240 |
Fig 2Accuracy, ROC, and predictive power for the different data sets.
Model precision and recall per class for a threshold of 0.24.
| Class | Dyslexia (%) | No Dyslexia (%) |
|---|---|---|
| Precision | 79.7 | 79.1 |
| Recall | 80.4 | 78.4 |
Fig 3Precision and recall curve for the dyslexia class, varying the model threshold.
Fig 4ROC in function of two Random Forest parameters for the main model.
Relative question importance based on feature analysis.
| Question | % | Question | % | Question | % |
|---|---|---|---|---|---|
| Q1 | 100.0 | Q10 | 75.8 | Demog. | 67.2 |
| Q3 | 100.0 | Q13 | 75.8 | Q23 | 64.3 |
| Q2 | 98.0 | Q17 | 75.8 | Q26 | 64.3 |
| Q4 | 79.2 | Q21 | 75.4 | Q24 | 61.5 |
| Q5 | 89.3 | Q16 | 73.4 | Q27 | 61.1 |
| Q6 | 87.3 | Q19 | 71.7 | Q30 | 60.7 |
| Q7 | 85.7 | Q18 | 71.3 | Q25 | 60.2 |
| Q8 | 85.7 | Q12 | 70.1 | Q31 | 60.2 |
| Q9 | 84.8 | Q15 | 69.3 | Q32 | 60.2 |
| Q14 | 79.9 | Q22 | 69.3 | Q29 | 59.8 |
| Q11 | 79.5 | Q20 | 68.4 | Q28 | 59.4 |
Relative importance by feature type aggregation.
| Type | % | Type | % |
|---|---|---|---|
| Hits | 100.0 | Accuracy | 97.3 |
| Score | 99.6 | Miss rate | 90.1 |
| Misses | 99.3 | Demography | 16.5 |
| Clicks | 97.4 |
Results for the tablet test.
| Data set (age range) | Accur. (%) | Recall (Dys, %) | Precis. (Dys, %) | Recall (Non, %) | Precis. (Non, %) | ROC | Thresh. |
|---|---|---|---|---|---|---|---|
| N1 (12-17) | 76.8 | 78.1 | 76.2 | 77.7 | 75.5 | 0.806 | 0.155 |
| N2 (9-11) | 74.4 | 75.8 | 73.8 | 75.0 | 73.1 | 0.818 | 0.230 |
| N3 (7-8) | 61.2 | 72.2 | 59.2 | 64.6 | 50.3 | 0.663 | 0.255 |