| Literature DB >> 24421777 |
Chris F Westbury1, Cyrus Shaoul2, Geoff Hollis1, Lisa Smithson1, Benny B Briesemeister3, Markus J Hofmann3, Arthur M Jacobs3.
Abstract
Many studies have shown that behavioral measures are affected by manipulating the imageability of words. Though imageability is usually measured by human judgment, little is known about what factors underlie those judgments. We demonstrate that imageability judgments can be largely or entirely accounted for by two computable measures that have previously been associated with imageability, the size and density of a word's context and the emotional associations of the word. We outline an algorithmic method for predicting imageability judgments using co-occurrence distances in a large corpus. Our computed judgments account for 58% of the variance in a set of nearly two thousand imageability judgments, for words that span the entire range of imageability. The two factors account for 43% of the variance in lexical decision reaction times (LDRTs) that is attributable to imageability in a large database of 3697 LDRTs spanning the range of imageability. We document variances in the distribution of our measures across the range of imageability that suggest that they will account for more variance at the extremes, from which most imageability-manipulating stimulus sets are drawn. The two predictors account for 100% of the variance that is attributable to imageability in newly-collected LDRTs using a previously-published stimulus set of 100 items. We argue that our model of imageability is neurobiologically plausible by showing it is consistent with brain imaging data. The evidence we present suggests that behavioral effects in the lexical decision task that are usually attributed to the abstract/concrete distinction between words can be wholly explained by objective characteristics of the word that are not directly related to the semantic distinction. We provide computed imageability estimates for over 29,000 words.Entities:
Keywords: co-occurrence statistics; context effects; emotions; imageability; lexical access
Year: 2013 PMID: 24421777 PMCID: PMC3872786 DOI: 10.3389/fpsyg.2013.00991
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
HiDEx parameter set for computing co-occurrence measures.
| Corpus size | 12,714,502,395 words |
| Context size | 10000 words |
| Window length behind | 5 words |
| Window length ahead | 5 words |
| Weighting scheme | Inverse Ramp |
| Normalization method | PPMI |
| Similarity metric | Cosine |
For more details, see Shaoul and Westbury, 2010a,b,c, 2011.
Regression model for predicting human imageability judgments from two quantitative measures of context derived from co-occurrence similarity measures, on the test data set consisting of 1848 items.
| Intercept | 2.26 | 0.81 | 2.80 | 0.005 |
| ARC | 2.59 | 0.82 | 3.13 | 0.002 |
| INV-NCOUNT | −0.64 | 0.06 | −9.93 | <2e-16 |
Multiple R( = 83.29, p < 2.2e-16.
The sources and names of all “basic emotion” terms considered.
| Ekman et al., | Anger, disgust, fear, happiness, sadness, surprise |
| Ekman, | Amusement, anger, contempt, contentment, disgust, embarrassment, excitement, fear, guilt, happiness, interest, pleasure, pride, relief, sadness, satisfaction, shame, surprise |
| Kassam et al., | Anger, disgust, envy, fear, happiness, lust, sadness, shame |
| Osgood et al., | Active, bad, good, passive, strong, weak |
| Panksepp, | Care, fear, lust, panic, play, rage, seeking |
| Plutchik, | Anger, anticipation, disgust, fear, joy, sadness, surprise |
| Reizenzein, | Aversion, desire, disappointment, fear, happiness, hope, relief, surprise, unhappiness |
| Robinson et al., | Approach, arouse, away, danger, evaluate, from, safe, to, toward, withdraw |
| Stevenson et al., | Anger, disgust, fear, happiness, sadness |
| Tomkins, | Anger, contempt, distress, enjoyment, fear, interest, shame, surprise |
| Tomkins, | Anguish, disgust, excitement, humiliation, joy, rage, startle, terror |
| Wundt, | Depression, excitement, pleasant, relaxation, tension, unpleasant |
For more details, see the Appendix.
Regression model for predicting human imageability judgments using co-occurrence distance from emotion terms, on the test data set consisting of 1848 items.
| Intercept | 4.34 | 0.18 | 23.56 | <2e-16 |
| Admirable | −11.46 | 1.67 | −6.87 | 8.96E-12 |
| Arouse | −14.84 | 1.54 | −9.61 | <2e-16 |
| Envious | −16.50 | 2.82 | −5.85 | 5.87E-09 |
| From | 7.45 | 0.99 | 7.55 | 7.08E-14 |
| Good | −11.31 | 0.93 | −12.11 | <2e-16 |
| Horny | 19.42 | 2.41 | 8.05 | 1.44E-15 |
| Pleasure | 13.17 | 1.10 | 11.93 | <2e-16 |
| Proud | 11.16 | 1.56 | 7.17 | 1.12E-12 |
Multiple R( = 102.4, p < 2.2e-16.
Correlation of regression model estimates with imageability ratings, for ten models composed of eight words selected by backwards regression from 78 random words, and one model composed of eight words selected by backwards regression from 78 emotion terms, on both a development set of ratings and a separate validity set.
| 1 | 0.57 | 0.60 |
| 2 | 0.58 | 0.57 |
| 3 | 0.63 | 0.71 |
| 4 | 0.58 | 0.57 |
| 5 | 0.57 | 0.59 |
| 6 | 0.59 | 0.61 |
| 7 | 0.58 | 0.54 |
| 8 | 0.58 | 0.60 |
| 9 | 0.58 | 0.57 |
| 10 | 0.57 | 0.58 |
| EMOTION | 0.55 | 0.55 |
Figure 1Correlation between beta weights of 80 random word predictors from ten regression models of imageability, graphed against the sign-weighted summed co-occurrence similarity of each random word to eight emotion terms. The overlain bar graph shows the average [SE] sign-weighted summed co-occurrence similarity for random word predictors that had negative or positive beta weights [t(78) = 10.0, p < 9e-16].
Full regression model for predicting human imageability judgments, on the test data set consisting of 1848 items.
| Intercept | 3.25 | 0.72 | 4.51 | 7.00E-06 |
| Admirable | −14.72 | 1.51 | −9.75 | <2.0e-16 |
| Arouse | −12.83 | 1.42 | −9.03 | <2.0e-16 |
| Envious | −9.93 | 2.78 | −3.57 | 3.64E-04 |
| Good | −9.00 | 0.82 | −11.02 | <2.0e-16 |
| Horny | 17.92 | 2.33 | 7.70 | 2.18E-14 |
| Pleasure | 11.49 | 1.08 | 10.65 | <2.0e-16 |
| Proud | 7.47 | 1.55 | 4.84 | 1.44E-06 |
| Arc | 1.91 | 0.71 | 2.69 | 7.18E-03 |
| Inv-ncount | −0.32 | 0.06 | −5.27 | 1.55e-07 |
| Length | 0.07 | 0.03 | 2.54 | 0.011123 |
| Lnfrequency | 0.12 | 0.02 | 7.10 | 1.78E-12 |
Multiple R( = 92.7, p < 2.2e-16.
Figure 2Distribution of high imageability words (top) and low imageability words (bottom), by CONTEXT and EMOTIONALITY category. “High” and “Low” categories along each dimension were defined using a ±1 SD cut-off.
Figure 3Average English Lexicon Project (Balota et al., “High” and “Low” categories were defined using a ±1 SD cut-off. Bars are SE.
| fishing | 3.56 | engender | −4.63 |
| adventures | 3.40 | lot | −4.26 |
| trees | 3.40 | ethic | −4.17 |
| river | 3.39 | inflame | −4.15 |
| mediterranean | 3.34 | better | −4.15 |
| boats | 3.34 | incite | −3.99 |
| wooden | 3.33 | inspire | −3.99 |
| mountain | 3.31 | abominate | −3.99 |
| boat | 3.30 | imply | −3.95 |
| pink | 3.25 | sincere | −3.87 |
| beach | 3.21 | provoke | −3.84 |
| champagne | 3.20 | tenacity | −3.74 |
| decorated | 3.19 | rile | −3.74 |
| kitchen | 3.17 | honesty | −3.71 |
| springs | 3.16 | poise | −3.68 |
| restaurant | 3.15 | honest | −3.63 |
| girl | 3.08 | courage | −3.62 |
| chocolate | 3.08 | lack | −3.60 |
| lake | 3.07 | lacking | −3.58 |
| miniature | 3.06 | arouses | −3.58 |
| dog | 3.06 | toughness | −3.57 |
| tree | 3.04 | certainly | −3.53 |
| water | 3.04 | sense | −3.52 |
| restaurants | 3.03 | patience | −3.52 |
| yellow | 3.02 | tamp | −3.50 |
| spotted | 3.01 | intentioned | −3.46 |
| dancing | 3.01 | instil | −3.44 |
| rock | 2.99 | innate | −3.37 |
| purple | 2.98 | twinge | −3.31 |
| adventure | 2.98 | kind | −3.29 |
| gardens | 2.97 | alienate | −3.29 |
| sea | 2.97 | reflects | −3.28 |
| park | 2.96 | impressed | −3.27 |
| fish | 2.95 | teamwork | −3.26 |
| ocean | 2.94 | allay | −3.26 |
| trailer | 2.94 | galvanize | −3.26 |
| flowers | 2.93 | sincerity | −3.26 |
| daughter | 2.92 | elicit | −3.24 |
| monkey | 2.91 | wetly | −3.24 |
| band | 2.90 | solid | −3.23 |
| courtyard | 2.90 | lacks | −3.22 |
| turtle | 2.90 | understandable | −3.22 |
| candy | 2.89 | arousing | −3.20 |
| nearby | 2.88 | decisiveness | −3.19 |
| room | 2.88 | instincts | −3.16 |
| plastic | 2.88 | ineducable | −3.15 |
| queen | 2.87 | smirked | −3.15 |
| boulevard | 2.87 | pragmatism | −3.15 |
| golden | 2.87 | fostering | −3.14 |
| floating | 2.87 | aroused | −3.14 |