| Literature DB >> 34591284 |
Paul Hoffman1, Matthew A Lambon Ralph2, Timothy T Rogers3.
Abstract
Semantic diversity refers to the degree of semantic variability in the contexts in which a particular word is used. We have previously proposed a method for measuring semantic diversity based on latent semantic analysis (LSA). In a recent paper, Cevoli et al. (2020) attempted to replicate our method and obtained different semantic diversity values. They suggested that this discrepancy occurred because they scaled their LSA vectors by their singular values, while we did not. Using their new results, they argued that semantic diversity is not related to ambiguity in word meaning, as we originally proposed. In this reply, we demonstrate that the use of unscaled vectors provides better fits to human semantic judgements than scaled ones. Thus we argue that our original semantic diversity measure should be preferred over the Cevoli et al. version. We replicate Cevoli et al.'s analysis using the original semantic diversity measure and find (a) our original measure is a better predictor of word recognition latencies than the Cevoli et al. equivalent and (b) that, unlike Cevoli et al.'s measure, our semantic diversity is reliably associated with a measure of polysemy based on dictionary definitions. We conclude that the Hoffman et al. semantic diversity measure is better-suited to capturing the contextual variability among words and that words appearing in a more diverse set of contexts have more variable semantic representations. However, we found that homonyms did not have higher semantic diversity values than non-homonyms, suggesting that the measure does not capture this special case of ambiguity.Entities:
Keywords: Lexical ambiguity; Polysemy; Semantic diversity
Mesh:
Year: 2021 PMID: 34591284 PMCID: PMC9374602 DOI: 10.3758/s13428-021-01693-4
Source DB: PubMed Journal: Behav Res Methods ISSN: 1554-351X
Spearman correlations between human semantic relatedness judgements and vector cosines
| MEN | WordSim-353 | SimLex-999 | |
|---|---|---|---|
| BNC_unscaled | 0.72 | 0.64 | 0.27 |
| BNC_scaled | 0.65 | 0.57 | 0.19 |
| wiki_unscaled | 0.69 | 0.67 | 0.26 |
| wiki_scaled | 0.65 | 0.60 | 0.21 |
| word2vec | 0.78 | 0.69 | 0.44 |
Fig. 1Comparisons of scaled and unscaled LSA vectors. a Singular values for the first 30 dimensions when singular value decomposition is applied to BNC data. b Correlations between MEN semantic relatedness ratings and cosine similarity when different dimensions were omitted from the LSA vectors. Omitting dimensions had no effect on unscaled vectors, but omitting either of the first two dimensions considerably improved the fit to human data for scaled vectors. c Mean cosine similarity between neighbouring contexts, as a function of their distance in the corpus. Both sets of vectors show reduced similarity as distance increases, but this effect is significantly stronger for the unscaled vectors. The dashed lines indicate the mean similarity between two contexts chosen at random from the corpus (calculated by permuting the order of contexts in the corpus and calculating the mean similarity between adjacent contexts in this permuted corpus)
Fig. 2Effects of H13_SemD on word recognition latencies at various levels of word frequency. Word frequency values indicate log counts in the BNC
Model fit statistics for models predicting word recognition latencies with and without semantic diversity
| Dataset: | BLP: Lexical decision | ELP: Lexical decision | ELP: Word naming | |||
|---|---|---|---|---|---|---|
| Model | AIC | BIC | AIC | BIC | AIC | BIC |
| Baseline | – 116838 | – 116690 | – 12888 | – 12735 | – 170609 | – 170469 |
| Baseline + C20_SemD and its interactions | – 116903 | – 116713 | – 12980 | – 12784 | – 170671 | – 170488 |
| Baseline + H13_SemD and its interactions | ||||||
The lowest AIC/BIC values in each case (indicating best model fit) are underlined
Linear regression models predicting semantic diversity in stimuli used in Rodd et al.’s Experiment 1
| Dependent variable: | C20_SemD | H13_SemD | ||
|---|---|---|---|---|
| Predictor | ||||
| Number of meanings | 0.086 | 1.06 | 0.079 | 1.21 |
| Number of senses | 0.081 | 0.95 | 0.244 | 3.51*** |
| Orthographic neighbours (Coltheart’s N) | 0.031 | 0.34 | 0.058 | 0.78 |
| Frequency | 0.262 | 3.34** | 0.280 | 4.39*** |
| Length | 0.113 | 1.22 | – 0.025 | 0.33 |
| Concreteness | – 0.248 | 3.41*** | – 0.405 | 6.93*** |
C20_SemD model: df = 163, R = 0.21. H13_SemD model: df = 164, R = 0.48. * = p < 0.05; ** = p < 0.01; *** = p < 0.001
Fig. 3Effects of lexical ambiguity on semantic diversity in stimuli used in Rodd et al.’s Experiment 1
Fig. 4Effects of lexical ambiguity on semantic diversity in stimuli used in Rodd et al.’s Experiment 2
Results of ANCOVA predicting semantic diversity in stimuli used in Rodd et al.’s Experiment 2
| Dependent variable: | C20_SemD | H13_SemD | ||
|---|---|---|---|---|
| Effect | ||||
| Number of meanings | 0.000 | 0.54 | 0.052 | 0.82 |
| Number of senses | 0.385 | 0.99 | 7.293 | 0.008** |
| Meanings * Senses | 0.018 | 0.18 | 0.334 | 0.56 |
| Frequency | 1.835 | 0.10 | 6.144 | 0.015* |
| Length | 2.813 | 0.89 | 0.012 | 0.91 |
C20_SemD model: df = 1,122. H13_SemD model: df = 1,122. * = p < 0.05; ** = p < 0.01; *** = p < 0.001
Linear regression models predicting semantic diversity in polysemy stimuli used by Armstrong & Plaut
| Dependent variable: | C20_SemD | H13_SemD | ||
|---|---|---|---|---|
| Predictor | ||||
| Number of senses | – 0.044 | 0.65 | 0.179 | 2.62** |
| Frequency | 0.522 | 6.38*** | 0.138 | 1.64 |
| Orthographic neighbours (OLD20) | – 0.012 | 0.13 | – 0.115 | 1.17 |
| Number of syllables | 0.041 | 0.52 | 0.006 | 0.07 |
| Length | – 0.069 | 0.68 | 0.133 | 1.32 |
| Familiarity (residual) | – 0.425 | 4.86*** | 0.030 | 0.33 |
| Concreteness | – 0.130 | 1.84 | – 0.423 | 5.85*** |
C20_SemD model: df = 179, R = 0.24. H13_SemD model: df = 178, R = 0.22. * = p < 0.05; ** = p < 0.01; *** = p < 0.001
Fig. 5Effects of lexical ambiguity on semantic diversity in stimuli used by Armstrong and Plaut Left panel shows mean semantic diversity values for words classified as Homonyms (H), Unambiguous (U) and Polysemous (P)
Linear regression models predicting semantic diversity in homonymy stimuli used by Armstrong & Plaut
| Dependent variable: | C20_SemD | H13_SemD | ||
|---|---|---|---|---|
| Predictor | ||||
| Inverse dominance | – 0.060 | 0.88 | – 0.113 | 1.60 |
| Frequency | 0.490 | 6.11*** | 0.149 | 1.76 |
| Orthographic neighbours (OLD20) | – 0.005 | 0.05 | – 0.019 | 0.19 |
| Number of syllables | – 0.010 | 0.12 | 0.023 | 0.27 |
| Length | – 0.090 | 0.88 | – 0.044 | 0.41 |
| Familiarity (residual) | – 0.257 | 2.89** | 0.080 | 0.86 |
| Concreteness | – 0.170 | 2.30* | – 0.385 | 4.96*** |
C20_SemD model: df = 173, R = 0.25. H13_SemD model: df = 173, R = 0.19. * = p < 0.05; ** = p < 0.01; *** = p < 0.001