| Literature DB >> 27732599 |
Fritz Günther1, Marco Marelli2.
Abstract
Noun compounds, consisting of two nouns (the head and the modifier) that are combined into a single concept, differ in terms of their plausibility: school bus is a more plausible compound than saddle olive. The present study investigates which factors influence the plausibility of attested and novel noun compounds. Distributional Semantic Models (DSMs) are used to obtain formal (vector) representations of word meanings, and compositional methods in DSMs are employed to obtain such representations for noun compounds. From these representations, different plausibility measures are computed. Three of those measures contribute in predicting the plausibility of noun compounds: The relatedness between the meaning of the head noun and the compound (Head Proximity), the relatedness between the meaning of modifier noun and the compound (Modifier Proximity), and the similarity between the head noun and the modifier noun (Constituent Similarity). We find non-linear interactions between Head Proximity and Modifier Proximity, as well as between Modifier Proximity and Constituent Similarity. Furthermore, Constituent Similarity interacts non-linearly with the familiarity with the compound. These results suggest that a compound is perceived as more plausible if it can be categorized as an instance of the category denoted by the head noun, if the contribution of the modifier to the compound meaning is clear but not redundant, and if the constituents are sufficiently similar in cases where this contribution is not clear. Furthermore, compounds are perceived to be more plausible if they are more familiar, but mostly for cases where the relation between the constituents is less clear.Entities:
Mesh:
Year: 2016 PMID: 27732599 PMCID: PMC5061382 DOI: 10.1371/journal.pone.0163200
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Parameter values for parameters added to the model.
te() indicates non-linear (tensor) interactions.
| Coefficient | Estimate | SE | ||
| Intercept | 1.580 | 0.126 | 12.514 | < .001 |
| Modifier Length | 0.100 | 0.023 | 4.284 | < .001 |
| Reversed-ordered Pair Frequency | -0.106 | 0.149 | -7.075 | < .001 |
| PMI | 0.167 | 0.043 | 3.840 | < .001 |
| Coefficient | Estimated | Residual | ||
| Head Proximity x Modifier Proximity | 16.442 | 18.256 | 9.544 | < .001 |
| Modifier Proximity x Constituent Similarity | 1.689 | 8.000 | 2.845 | < .001 |
| Constituent Similarity x Pair Frequency | 6.439 | 7.843 | 46.074 | < .001 |
Fig 1Heat maps for the non-linear interaction effects including plausibility measures.
The colours indicate parameter values (i.e., predicted deviation from the mean), the points show the data points from which the model was estimated. Upper left: Interaction between Head and Modifier Proximity. Upper right: Interaction between Modifier Proximity and Constituent Similarity. Lower left: Interaction between frequency of bigrams and Constituent Similarity. Lower right: Legend.
Example items for compounds with low vs. high Head Proximity Values.
| Low Head Proximity (< .1) | High Head Proximity (> .6) |
|---|---|
| diamond tennis, milk mouse, | orange juice, golf shirt, |
Example items for compounds with different Modifier Proximity values, all with medium-high Head Proximity values (between .3 and .5).
| Low Mod. Proximity (< .2) | Medium Mod. Proximity (.4 − .6) | High Mod. Proximity (> .6) |
|---|---|---|
| road bed | soup chicken | sun summer |
Example items for compounds with different low vs. high Modifier Proximity values, crossed with low vs. high Constituent Similarity values.
| Low Mod. Proximity (< .4) | High Mod. Proximity (> .4) | |
|---|---|---|
| Low Const. Sim. (< .4) | building car, ship cow, | phone car, salad island, |
| High Const. Sim. (> .4) | pie salmon, dish oven | nut milk, soup pot, |
| meat pig, child infant, |
Example items for compounds with different low vs. high Constituent Similarity values, crossed with low vs. high frequency values.
| Low Const. Similarity (< .4) | High Const. Similarity (> .4) | |
|---|---|---|
| Low Frequency | guy field, engine cat, | door cabin, nut banana, |
| High Frequency | tea bag, police dog, | chicken soup, sea water, |