| Literature DB >> 23205008 |
Gabriel Recchia1, Michael N Jones.
Abstract
We contrasted the predictive power of three measures of semantic richness-number of features (NFs), contextual dispersion (CD), and a novel measure of number of semantic neighbors (NSN)-for a large set of concrete and abstract concepts on lexical decision and naming tasks. NSN (but not NF) facilitated processing for abstract concepts, while NF (but not NSN) facilitated processing for the most concrete concepts, consistent with claims that linguistic information is more relevant for abstract concepts in early processing. Additionally, converging evidence from two datasets suggests that when NSN and CD are controlled for, the features that most facilitate processing are those associated with a concept's physical characteristics and real-world contexts. These results suggest that rich linguistic contexts (many semantic neighbors) facilitate early activation of abstract concepts, whereas concrete concepts benefit more from rich physical contexts (many associated objects and locations).Entities:
Keywords: abstract concepts; concreteness; feature norms; lexical decision; semantic richness
Year: 2012 PMID: 23205008 PMCID: PMC3506984 DOI: 10.3389/fnhum.2012.00315
Source DB: PubMed Journal: Front Hum Neurosci ISSN: 1662-5161 Impact factor: 3.169
Descriptive statistics for stimulus characteristics (predictors and dependent variables).
| Rated concreteness (MRC norms) | 5.07 (1.37) | 3.21 (0.49) | 6.24 (0.11) |
| Log frequency (Brysbaert and New, | 2.75 (0.76) | 3.15 (0.82) | 2.86 (0.63) |
| Number of morphemes | 1.27 (0.53) | 1.48 (0.68) | 1.09 (0.31) |
| Number of syllables | 1.78 (0.84) | 2.03 (0.98) | 1.60 (0.76) |
| Number of letters (length) | 5.85 (1.97) | 6.35 (2.09) | 5.29 (1.80) |
| OLD20 (Yarkoni et al., | 2.12 (0.84) | 2.18 (0.68) | 1.97 (0.79) |
| PLD20 (Yarkoni et al., | 1.96 (0.94) | 2.05 (0.84) | 1.76 (0.88) |
| Number of semantic neighbors | 150.61 (97.77) | 199.59 (91.00) | 167.08 (96.40) |
| Number of features (Analysis 1) | 11.08 (3.83) | 8.56 (3.78) | 13.00 (3.24) |
| Number of features (McRae et al., | 14.13 (3.64) | – | 15.40 (3.59) |
| Number of words (Analysis 1) | 32.10 (7.47) | 27.36 (7.26) | 35.61 (6.33) |
| Log ctx. dispersion (Brysbaert and New, | 2.51 (0.68) | 2.91 (0.67) | 2.59 (0.57) |
| Log number of senses (Miller, | 1.32 (0.82) | 1.57 (0.79) | 1.26 (0.75) |
| Lexical decision task RT (Balota et al., | 653.16 (84.53) | 644.35 (78.55) | 630.71 (74.49) |
| Lexical decision task RT, standardized | −0.48 (0.28) | −0.52 (0.27) | −0.56 (0.26) |
| Pronunciation task RT (Balota et al., | 635.37 (61.17) | 634.23 (58.83) | 624.97 (49.73) |
| Pronunciation task RT, standardized | −0.42 (0.26) | −0.42 (0.26) | −0.47 (0.22) |
Note: OLD20, Orthographic Levenshtein Distance 20, a measure of orthographic neighborhood density; PLD20, Phonological Levenshtein Distance 20, a measure of phonological neighborhood density. The number of features measure in the McRae et al. (2005) dataset limited to those concepts that were used as stimuli in our dataset and which were also members of the McRae norms (first column, N = 281; third column, N = 93). Average rated concreteness for the entire stimulus set was computed over only the 446 words for which MRC concreteness ratings were available.
Intercorrelations among predictor and dependent variables in the regression analyses for all stimuli.
| NSN | – | −0.199 | 0.189 | 0.055 | 0.729 | 0.686 | −0.035 | −0.191 | −0.201 | −0.311 | −0.307 | 0.496 | −0.519 | −0.551 | −0.366 | −0.373 |
| NFRJ | – | 0.238 | 0.475 | −0.169 | −0.133 | −0.163 | −0.031 | −0.057 | 0.068 | 0.051 | −0.307 | 0.009 | 0.011 | −0.004 | 0.013 | |
| NFM | – | 0.098 | 0.219 | 0.232 | 0.057 | −0.093 | −0.089 | −0.081 | −0.089 | 0.037 | −0.235 | −0.243 | −0.102 | −0.119 | ||
| NW | – | 0.095 | 0.093 | −0.150 | −0.139 | −0.131 | −0.075 | −0.083 | 0.005 | −0.113 | −0.129 | −0.130 | −0.124 | |||
| CD | – | 0.985 | −0.157 | −0.314 | −0.354 | −0.435 | −0.422 | 0.582 | −0.634 | −0.685 | −0.507 | −0.514 | ||||
| Freq | – | −0.195 | −0.333 | −0.379 | −0.442 | −0.430 | 0.562 | −0.623 | −0.673 | −0.496 | −0.504 | |||||
| Nm | – | 0.541 | 0.621 | 0.465 | 0.502 | −0.230 | 0.291 | 0.323 | 0.305 | 0.291 | ||||||
| Ns | – | 0.820 | 0.756 | 0.804 | −0.386 | 0.506 | 0.527 | 0.466 | 0.451 | |||||||
| Len | – | 0.871 | 0.852 | −0.375 | 0.559 | 0.598 | 0.551 | 0.558 | ||||||||
| OLD | – | 0.916 | −0.457 | 0.570 | 0.607 | 0.510 | 0.519 | |||||||||
| PLD | – | −0.442 | 0.547 | 0.587 | 0.499 | 0.511 | ||||||||||
| Sens | – | – | – | – | – | |||||||||||
| LDTraw | – | 0.949 | 0.647 | 0.642 | ||||||||||||
| LDTZ | – | 0.684 | 0.682 | |||||||||||||
| NTraw | – | 0.948 | ||||||||||||||
| NTZ | – |
Note: NSN, number of semantic neighbors; NFRJ, number of features, Analysis 1; NFM, number of features, McRae et al. (2005); NW, number of words; CD, log contextual dispersion; WF, log word frequency; Nm, number of morphemes; Ns, number of syllables; Len, number of letters; OLD, Orthographic Levenshtein distance 20; PLD, Phonological Levenshtein distance 20; Sens, log number of senses; LDTraw, lexical decision time; LDTZ, lexical decision time (z-scored); NTraw, naming time; NTZ, naming time (z-scored).
NFM has a large number of missing values due to the fact that only 281 words in the McRae et al. (2005) norms also appear in the present dataset.
p < 0.01;
p < 0.05.
Standardized regression coefficients predicting lexical decision and naming latencies, using number-of-features measure derived from data collected in Analysis 1.
| Adjusted | 0.00 | 0.17 |
| Log frequency | −0.505 | −0.352 |
| Number of morphemes | −0.036 | −0.065 |
| Number of syllables | 0.071 | 0.059 |
| Number of letters (length) | 0.286 | 0.407 |
| OLD20 | 0.105 | −0.075 |
| PLD20 | −0.009 | 0.060 |
| Adjusted | 0.59 | 0.55 |
| Change in | 0.59 | 0.38 |
| Number of features | −0.114 | −0.036 |
| Number of words | 0.027 | −0.021 |
| Number of semantic neighbors | −0.135 | −0.027 |
| Log contextual dispersion | −0.848 | −1.038 |
| Log number of senses | −0.051 | 0.034 |
| Adjusted | 0.63 | 0.58 |
| Change in | 0.04 | 0.03 |
Note: LDT, lexical decision time (z-scored); NT, naming time (z-scored); OLD20, Orthographic Levenshtein Distance 20, a measure of orthographic neighborhood density; PLD20, Phonological Levenshtein Distance 20, a measure of phonological neighborhood density. Only semantic richness variables are shown in Step 2 for ease of exposition.
p < 0.10;
p < 0.001.
Standardized regression coefficients predicting lexical decision and naming latencies.
| Adjusted | 0.00 | 0.14 | 0.00 | 0.25 |
| Log frequency | −0.413 | 0.331 | −0.514 | −0.291 |
| Number of morphemes | 0.094 | −0.013 | −0.080 | −0.136 |
| Number of syllables | −0.004 | −0.195 | −0.098 | 0.025 |
| Number of letters (length) | 0.444 | 0.575 | 0.223 | 0.234 |
| OLD20 | 0.053 | −0.231 | 0.066 | −0.186 |
| PLD20 | −0.119 | 0.309 | 0.238 | 0.313 |
| Adjusted | 0.54 | 0.54 | 0.62 | 0.53 |
| Change in | 0.54 | 0.40 | 0.62 | 0.28 |
| Number of features | −0.089 | 0.072 | −0.168 | −0.115 |
| Number of words | 0.024 | 0.008 | 0.083 | 0.006 |
| Number of semantic neighbors | −0.147 | −0.072 | −0.121 | −0.060 |
| Log contextual dispersion | −0.890 | −1.094 | −0.944 | −1.069 |
| Log number of senses | −0.043 | 0.045 | −0.024 | 0.045 |
| Adjusted | 0.59 | 0.58 | 0.67 | 0.57 |
| Change in | 0.04 | 0.04 | 0.05 | 0.04 |
Note: LDT, lexical decision time (z-scored); NT, naming time (z-scored); OLD20, Orthographic Levenshtein Distance 20, a measure of orthographic neighborhood density; PLD20, Phonological Levenshtein Distance 20, a measure of phonological neighborhood density. Only semantic richness variables are shown in Step 2 for ease of exposition.
p < 0.10;
p < 0.05;
p < 0.01;
p < 0.001.
Descriptive statistics for type counts of different feature categories.
| Num. communicative acts (com) | 0.28 (1.00) | 0.66 (1.31) | 0.07 (0.48) |
| Num. materials (has_material) | 0.49 (0.92) | 0.01 (0.12) | 0.77 (1.24) |
| Num. components (has_part) | 1.17 (1.60) | 0.04 (0.23) | 1.67 (1.66) |
| Num. larger continuous wholes (is_material_of) | 0.21 (0.76) | 0.00 (0.00) | 0.56 (1.23) |
| Num. larger discrete wholes (is_part_of) | 0.08 (0.45) | 0.01 (0.08) | 0.16 (0.67) |
| Num. visual properties (vis) | 0.50 (0.88) | 0.09 (0.33) | 0.73 (1.06) |
| Num. non-visual perceptual properties (perc) | 1.35 (1.55) | 0.05 (0.21) | 2.07 (1.54) |
| Num. cognitive states/operations/affects (cog) | 1.00 (1.98) | 2.61 (3.05) | 0.27 (0.51) |
| Num. contingencies (conting) | 0.07 (0.29) | 0.16 (0.42) | 0.05 (0.24) |
| Num. evaluations (eval) | 0.42 (0.84) | 0.61 (1.16) | 0.34 (0.66) |
| Num. negations (neg) | 0.31 (0.52) | 0.48 (0.63) | 0.21 (0.44) |
| Num. social artifacts/actions (soc) | 0.58 (1.35) | 1.35 (2.01) | 0.26 (0.59) |
| Num. events (ev) | 0.23 (0.57) | 0.19 (0.46) | 0.27 (0.71) |
| Num. locations (loc) | 0.85 (1.15) | 0.29 (0.80) | 1.20 (1.17) |
| Num. manners (man) | 0.05 (0.23) | 0.05 (0.21) | 0.07 (0.26) |
| Num. participants (par) | 0.64 (0.96) | 0.78 (1.05) | 0.62 (0.95) |
| Num. associated entities (ae) | 1.48 (1.51) | 0.65 (1.09) | 1.71 (1.45) |
| Num. times (time) | 0.31 (0.81) | 0.39 (1.00) | 0.21 (0.43) |
| Num. super/subordinates (tax) | 1.06 (1.20) | 0.14 (0.40) | 1.75 (1.25) |
| Num. entity properties | 3.80 (3.30) | 0.20 (0.51) | 5.97 (2.74) |
| Num. introspective properties | 1.79 (2.42) | 3.86 (3.40) | 0.86 (0.94) |
| Num. taxonomic properties | 1.06 (1.20) | 0.14 (0.40) | 1.75 (1.25) |
| Num. concrete situation properties | 2.33 (1.89) | 0.95 (1.40) | 2.91 (1.65) |
| Num. other situation properties | 1.23 (1.47) | 1.41 (1.59) | 1.18 (1.30) |
Standardized regression coefficients predicting lexical decision latencies, using feature counts and codes from data collected in Analysis 1.
| Log frequency | −0.505 |
| Number of morphemes | −0.036 |
| Number of syllables | 0.071 |
| Number of letters (length) | 0.286 |
| OLD20 | 0.105 |
| PLD20 | −0.009 |
| Adjusted | 0.59 |
| Number of words | 0.056 |
| Number of semantic neighbors | −0.130 |
| Log contextual dispersion | −0.940 |
| Log number of senses | −0.045 |
| Num. entity properties | −0.074 |
| Num. introspective properties | −0.037 |
| Num. taxonomic properties | −0.051 |
| Num. concrete situation properties | −0.098 |
| Num. other situation properties | −0.060 |
| Adjusted | 0.63 |
| Change in | 0.04 |
Note: OLD20, Orthographic Levenshtein Distance 20, a measure of orthographic neighborhood density; PLD20, Phonological Levenshtein Distance 20, a measure of phonological neighborhood density. Only semantic richness variables are shown in Step 2 for ease of exposition.
p < 0.10;
p < 0.05;
p < 0.01;
p < 0.001.
Standardized regression coefficients predicting lexical decision latencies, for all stimuli used in Analyses 1–2 that occur in the McRae et al. (.
| Log frequency | −0.538 |
| Number of morphemes | −0.081 |
| Number of syllables | 0.083 |
| Number of letters (length) | 0.299 |
| OLD20 | −0.006 |
| PLD20 | 0.051 |
| Adjusted | 0.62 |
| Number of words | −0.007 |
| Number of semantic neighbors | −0.021 |
| Log contextual dispersion | −0.450 |
| Log number of senses | −0.075 |
| Num. entity properties | −0.083 |
| Num. introspective properties | −0.002 |
| Num. taxonomic properties | −0.005 |
| Num. concrete situation properties | −0.101 |
| Num. other situation properties | 0.017 |
| Adjusted | 0.64 |
| Change in | 0.02 |
Note: OLD20, Orthographic Levenshtein Distance 20, a measure of orthographic neighborhood density; PLD20, Phonological Levenshtein Distance 20, a measure of phonological neighborhood density. Only semantic richness variables are shown in Step 2 for ease of exposition.
p < 0.10;
p < 0.05;
p < 0.01;
p < 0.001.