| Literature DB >> 24744745 |
Serje Robidoux1, Stephen C Pritchard1.
Abstract
DRC (Coltheart et al., 2001) and CDP++ (Perry et al., 2010) are two of the most successful models of reading aloud. These models differ primarily in how their sublexical systems convert letter strings into phonological codes. DRC adopts a set of grapheme-to-phoneme conversion rules (GPCs) while CDP++ uses a simple trained network that has been exposed to a combination of rules and the spellings and pronunciations of known words. Thus far the debate between fixed rules and learned associations has largely emphasized reaction time experiments, error rates in dyslexias, and item-level variance from large-scale databases. Recently, Pritchard et al. (2012) examined the models' non-word reading in a new way. They compared responses produced by the models to those produced by 45 skilled readers. Their item-by-item analysis is informative, but leaves open some questions that can be addressed with a different technique. Using hierarchical clustering techniques, we first examined the subject data to identify if there are classes of subjects that are similar to each other in their overall response profiles. We found that there are indeed two groups of subject that differ in their pronunciations for certain consonant clusters. We also tested the possibility that CDP++ is modeling one set of subjects well, while DRC is modeling a different set of subjects. We found that CDP++ does not fit any human reader's response pattern very well, while DRC fits the human readers as well as or better than any other reader.Entities:
Keywords: computational modeling; hierarchical clustering; non-word reading; reading aloud
Year: 2014 PMID: 24744745 PMCID: PMC3978355 DOI: 10.3389/fpsyg.2014.00267
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1Clustering of artificial data and dendrogram. Light gray ellipses indicate clusters, while the dark, dotted ellipses indicate the corresponding clusters in the dendrogram.
Figure 2Results of applying different clustering methods to subject data from Pritchard et al. (. These dendrograms do not include model responses. Methods depicted are (A) medoid, (B) single linkage, (C) complete linkage and (D) Ward's method.
Pronunciations that distinguished between subject clusters 1 and 2.
| 1 | 79.7 | 14.0 | 6.3 | ||
| 2 | 46.1 | 52.8 | 1.1 | ||
| 3 | 44.4 | 55.6 | 0.0 | ||
| 1 | 83.9 | 3.4 | 9.9 | 2.5 | 0.3 |
| 2 | 75.4 | 10.1 | 5.5 | 7.0 | 2.0 |
| 3 | 69.9 | 6.0 | 7.2 | 8.4 | 8.4 |
| 1 | 91.4 | 5.7 | 2.9 | 0.0 | |
| 2 | 61.4 | 34.1 | 2.3 | 2.3 | |
| 3 | 50.0 | 44.4 | 0.0 | 5.6 | |
| 1 | 99.0 | 1.0 | |||
| 2 | 88.8 | 11.2 | |||
| 3 | 97.2 | 2.8 | |||
| ∫ | |||||
| 1 | 80.9 | 4.3 | 12.2 | 1.7 | 0.9 |
| 2 | 67.1 | 15.1 | 5.5 | 8.2 | 4.1 |
| 3 | 64.3 | 25.0 | 3.6 | 0.0 | 7.1 |
| 1 | 80.8 | 6.4 | 10.5 | 2.3 | |
| 2 | 72.4 | 14.1 | 10.6 | 2.9 | |
| 3 | 64.3 | 18.6 | 13.2 | 3.9 | |
| ∫ | |||||
| 1 | 66.1 | 27.3 | 6.1 | 0.5 | |
| 2 | 35.8 | 59.1 | 4.3 | 0.8 | |
| 3 | 41.1 | 43.0 | 8.4 | 7.5 | |
| 1 | 57.2 | 22.2 | 19.8 | 0.8 | |
| 2 | 55.3 | 30.4 | 13.0 | 1.2 | |
| 3 | 27.7 | 63.1 | 6.2 | 3.1 | |
| 1 | 79.3 | 18.1 | 1.6 | 1.1 | |
| 2 | 54.7 | 35.9 | 8.5 | 0.9 | |
| 3 | 27.1 | 50.0 | 18.8 | 4.2 | |
| 1 | 84.5 | 5.6 | 8.5 | 1.4 | |
| 2 | 56.8 | 9.1 | 31.8 | 2.3 | |
| 3 | 33.3 | 16.7 | 33.3 | 16.7 | |
Pronunciation of other ambiguous consonant clusters that might be thought to distinguish clusters 1 and 2, but do not.
| 1 | 91.4 | 6.5 | 2.2 | ||
| 2 | 92.4 | 6.4 | 1.2 | ||
| 3 | 90.3 | 9.7 | 0.0 | ||
| 1 | 74.5 | 21.3 | 4.3 | 0.0 | |
| 2 | 74.6 | 16.9 | 6.8 | 1.7 | |
| 3 | 75.0 | 20.8 | 4.2 | 0.0 | |
| 1 | 73.7 | 23.9 | 0.5 | 1.9 | |
| 2 | 70.2 | 21.4 | 3.2 | 5.2 | |
| 3 | 16.5 | 40.8 | 4.9 | 37.9 | |
| ŋ | |||||
| 1 | 89.4 | 5.6 | 0.7 | 2.0 | 2.2 |
| 2 | 91.4 | 2.5 | 0.0 | 4.0 | 2.1 |
| 3 | 76.9 | 6.7 | 1.5 | 6.0 | 9.0 |
| θ | |||||
| 1 | 97.6 | 1.5 | 0.5 | 0.4 | |
| 2 | 96.8 | 3.1 | 0.0 | 0.2 | |
| 3 | 93.7 | 4.2 | 0.4 | 1.7 | |
| 1 | 97.6 | 0.0 | 2.2 | 0.3 | |
| 2 | 97.0 | 0.4 | 1.7 | 0.9 | |
| 3 | 91.6 | 0.0 | 1.1 | 7.4 | |
Figure 3Clustering results for the Pritchard et al. (.