| Literature DB >> 30839818 |
Cynthia S Q Siew1,2.
Abstract
Network science has been applied to study the structure of the mental lexicon, the part of long-term memory where all the words a person knows are stored. Here the tools of network science are used to study the organization of orthographic word-forms in the mental lexicon and how that might influence visual word recognition. An orthographic similarity network of the English language was constructed such that each node represented an English word, and undirected, unweighted edges were placed between words that differed by an edit distance of 1, a commonly used operationalization of orthographic similarity in psycholinguistics. The largest connected component of the orthographic language network had a small-world structure and a long-tailed degree distribution. Additional analyses were conducted using behavioral data obtained from a psycholinguistic database to determine if network science measures obtained from the orthographic language network could be used to predict how quickly and accurately people process written words. The present findings show that the structure of the mental lexicon influences lexical access in visual word recognition.Entities:
Keywords: English lexicon project; Language network; Lexical decision; Network science; Orthography; Speeded naming; Visual word recognition
Year: 2018 PMID: 30839818 PMCID: PMC6214296 DOI: 10.1007/s41109-018-0068-1
Source DB: PubMed Journal: Appl Netw Sci ISSN: 2364-8228
Fig. 1The orthographic structure of the ego network of the word ‘cat’. An undirected and unweighted edge was placed between two words that differed by a Levenshtein edit distance of 1 (i.e., whether the first word could be transformed into the second via the substitution, addition, or deletion of one letter)
Fig. 2The overall orthographic network structure of the English language. Note that the largest connected component (nodes in blue) represented a somewhat limited proportion of the entire network, and the large numbers of smaller connected components and isolates (non-blue nodes)
Summary of network measures derived from the largest connected component of the orthographic network, and the means and standard deviations of network measures of various baseline networks for comparison
| Network measures | ||||||
|---|---|---|---|---|---|---|
| Network | Nodes | Edges | Average degree | Average clustering coefficient | Average shortest path length | Diameter |
| LCC | 11,365 | 32,759 | 5.766 | 0.273 | 8.78 | 31 |
| Random configuration networks ( | 11,365 (0) | 32,759 (0) | 5.766 (0) | 0.000495 (0.000106) | 5.53 (0.00412) | 10.51 (0.522) |
| Bootstrapped LCCs of edit distance 1 ( | 8863.78 (116.35) | 30,560.78 (562.08) | 5.76 (0.0877) | 0.206 (0.00433) | 9.61 (0.270) | 37.1 (3.87) |
| Random word networks ( | 2791.6 (29.11) | 6207.4 (274.98) | 0.335 (0.0137) | 0.267 (0.00819) | 6.45 (0.0864) | 18.8 (0.837) |
Network measures computed for the largest connected component (LCC) of each network
Correlations between the three network measures included in the regression: degree, clustering coefficient, and closeness centrality
| Degree | Clustering Coefficient | |
|---|---|---|
| Clustering Coefficient | 0.14*** | |
| Closeness Centrality | 0.68*** | 0.07*** |
N = 11,365. All correlations were statistically significant, p < .001***
Summary of regression results for speeded naming and lexical decision
| (i) Speeded naming | RT | ACC | ||
| Predictors | ||||
| | ||||
| Number of letters | ||||
| Number of phonemes | ||||
| Number of syllables | ||||
| Log frequency | ||||
| | ||||
| Degree | ||||
| Clustering coefficient | ||||
| Closeness centrality | ||||
| Δ | Δ | |||
| (ii) Lexical decision | ||||
| | ||||
| Number of letters | ||||
| Number of phonemes | ||||
| Number of syllables | ||||
| Log frequency | ||||
| | ||||
| Degree | ||||
| Clustering coefficient | ||||
| Closeness centrality | ||||
| Δ | Δ | |||
+indicates p < .10, * indicates p < .05, ** indicates p < .01, *** indicates p < .001