| Literature DB >> 30524342 |
Jutta Trautwein1, Sascha Schroeder1.
Abstract
In this study, we examine the development of orthographic networks in the mental lexicon using graph theory. According to this view, words are represented by nodes in a network and connected as a function of their orthographic similarity. With a sampling approach based on a language corpus for German school children, we were able to simulate lexical development for children from Grade 1-8. By sampling different lexicon sizes from the corpus, we were able to analyze the content of the orthographic lexicon at different time points and examined network characteristics using graph theory. Results show that, similar to semantic and phonological networks, orthographic networks possess small-word characteristics defined by short average path lengths between nodes and strong local clustering. Moreover, the interconnectivity of the network decreases with growth. Implications for the study of the effect of network measures on language processing are discussed.Entities:
Keywords: graph theory; mental lexicon; networks; orthographic neighborhood; reading development
Year: 2018 PMID: 30524342 PMCID: PMC6256182 DOI: 10.3389/fpsyg.2018.02252
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Lexicon sizes and network measures in different age groups.
| Network measures | Lexical hermits | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Grade | Lemmas | Types | < | Proportion | |||||
| 1 | 5,925 | 31,570 | 16,027 (84) | 6.78 (0.06) | 9.84 (0.29) | 46.04 (5.51) | 0.49 (0.02) | 15,543 (84) | 49% |
| 2 | 6,097 | 32,606 | 16,580 (86) | 6.79 (0.06) | 9.85 (0.30) | 45.44 (5.37) | 0.49 (0.02) | 16,026 86) | 49% |
| 3 | 11,182 | 46,757 | 24,155 (87) | 6.90 (0.05) | 9.99 (0.28) | 49.22 (6.45) | 0.47 (0.01) | 22,602 (87) | 48% |
| 4 | 14,819 | 58,238 | 30,368 (94) | 6.93 (0.04) | 10.25 (0.25) | 51.64 (6.24) | 0.45 (0.01) | 27,870 (94) | 48% |
| 5 | 18,812 | 71,344 | 37,479 (115) | 6.95 (0.04) | 10.52 (0.19) | 51.30 (4.92) | 0.43 (0.01) | 33,865 (115) | 48% |
| 6 | 25,694 | 93,293 | 49,465 (118) | 6.96 (0.02) | 10.56 (0.13) | 47.48 (3.35) | 0.41 (0.00) | 43,828 (118) | 47% |
| 8 | 38,029 | 130,675 | 70,123 (109) | 6.98 (0.02) | 10.51 (0.08) | 45.18 (3.86) | 0.38 (0.00) | 60,552 (109) | 46% |
FIGURE 1Schematic Illustration of the sampling procedure for one virtual participant.
FIGURE 2Means and standard deviations for the network measures in the different grades with overall effects depicted as lines. ∗p < 0.05, ∗∗p <0.01.
FIGURE 3The network and according clustering coefficient for the word “schreiben” – “to write.” Note that for reasons of comprehensibility only neighbors of a maximal Levenshtein distance of 2 are depicted.
FIGURE 4Log-log plot of the degree distribution for one exemplary virtual participant at each time points. Lines represent the fit of a linear function to the data.