| Literature DB >> 31139119 |
Ferdy Hubers1,2, Catia Cucchiarini1,2, Helmer Strik1,2,3, Ton Dijkstra1,3.
Abstract
The processing of idiomatic expressions is a topical issue in empirical research. Various factors have been found to influence idiom processing, such as idiom familiarity and idiom transparency. Information on these variables is usually obtained through norming studies. Studies investigating the effect of various properties on idiom processing have led to ambiguous results. This may be due to the variability of operationalizations of the idiom properties across norming studies, which in turn may affect the reliability of the subjective judgements. However, not all studies that collected normative data on idiomatic expressions investigated their reliability, and studies that did address the reliability of subjective ratings used various measures and produced mixed results. In this study, we investigated the reliability of subjective judgements, the relation between subjective and objective idiom frequency, and the impact of these dimensions on the participants' idiom knowledge by collecting normative data of five subjective idiom properties (Frequency of Exposure, Meaning Familiarity, Frequency of Usage, Transparency, and Imageability) from 390 native speakers and objective corpus frequency for 374 Dutch idiomatic expressions. For reliability, we compared measures calculated in previous studies, with the D-coefficient, a metric taken from Generalizability Theory. High reliability was found for all subjective dimensions. One reliability metric, Krippendorff's alpha, generally produced lower values, while similar values were obtained for three other measures (Cronbach's alpha, Intraclass Correlation Coefficient, and the D-coefficient). Advantages of the D-coefficient are that it can be applied to unbalanced research designs, and to estimate the minimum number of raters required to obtain reliable ratings. Slightly higher coefficients were observed for so-called experience-based dimensions (Frequency of Exposure, Meaning Familiarity, and Frequency of Usage) than for content-based dimensions (Transparency and Imageability). In addition, fewer raters were required to obtain reliable ratings for the experience-based dimensions. Subjective and objective frequency appeared to be poorly correlated, while all subjective idiom properties and objective frequency turned out to affect idiom knowledge. Meaning Familiarity, Subjective and Objective Frequency of Exposure, Frequency of Usage, and Transparency positively contributed to idiom knowledge, while a negative effect was found for Imageability. We discuss these relationships in more detail, and give methodological recommendations with respect to the procedures and the measure to calculate reliability.Entities:
Keywords: Dutch; idiom knowledge; idiomatic expressions; reliability; subjective judgements
Year: 2019 PMID: 31139119 PMCID: PMC6527779 DOI: 10.3389/fpsyg.2019.01075
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Mean and SD for ratings on idiom properties and for performance on knowledge question.
| Idiom property | Mean ( |
|---|---|
| Subjective Frequency (scale 1–5) | 3.41 (1.39) |
| Familiarity (scale 1–5) | 3.08 (1.35) |
| Usage (scale 1–5) | 2.17 (1.30) |
| Transparency (scale 1–5) | 3.08 (1.28) |
| Imageability (scale 1–5) | 3.36 (1.33) |
| Objective idiom knowledge (in %) | 85.48 (35.22) |
Correlation matrix based on individual ratings of Dutch idiomatic expressions.
| Subjective | Objective idiom | |||||
|---|---|---|---|---|---|---|
| Frequency | Familiarity | Usage | Transparency | Imageability | knowledge | |
| Familiarity | 0.79∗ | |||||
| Usage | 0.66∗ | 0.68∗ | ||||
| Transparency | 0.28∗ | 0.32∗ | 0.29∗ | |||
| Imageability | 0.35∗ | 0.38∗ | 0.27∗ | 0.34∗ | ||
| Objective idiom knowledge | 0.30∗ | 0.33∗ | 0.24∗ | 0.35∗ | 0.13∗ | |
| Objective Frequency | 0.20∗ | 0.19∗ | 0.19∗ | -0.02 | -0.02 | 0.08∗ |
Mean reliability coefficients, SDs, and range for each idiom property averaged over lists.
| D-coefficient | ICC(2,k) | Cronbach’s alpha | Krippendorff’s alpha | |||||
|---|---|---|---|---|---|---|---|---|
| Mean ( | Min–max | Mean ( | Min–max | Mean ( | Min-max | Mean ( | Min–max | |
| Subjective Frequency | 0.943 (0.023) | 0.894–0.975 | 0.943 (0.023) | 0.894–0.975 | 0.957 (0.015) | 0.924–0.979 | 0.402 (0.090) | 0.281–0.553 |
| Familiarity | 0.943 (0.024) | 0.909–0.973 | 0.943 (0.024) | 0.909–0.973 | 0.958 (0.016) | 0.928–0.979 | 0.403 (0.096) | 0.283–0.558 |
| Usage | 0.932 (0.033) | 0.865–0.966 | 0.932 (0.033) | 0.865–0.966 | 0.976 (0.020) | 0.914–0.976 | 0.365 (0.089) | 0.201–0.470 |
| Transparency | 0.866 (0.043) | 0.771–0.912 | 0.866 (0.043) | 0.771–0.912 | 0.905 (0.032) | 0.834–0.947 | 0.201 (0.060) | 0.109–0.305 |
| Imageability | 0.877 (0.056) | 0.738–0.934 | 0.877 (0.056) | 0.738–0.934 | 0.906 (0.038) | 0.820–0.948 | 0.228 (0.076) | 0.092–0.389 |
D-coefficient for each idiom property based on the full dataset.
| Idiom property | D-coefficient |
|---|---|
| Subjective Frequency | 0.947 |
| Familiarity | 0.946 |
| Usage | 0.937 |
| Transparency | 0.872 |
| Imageability | 0.888 |
FIGURE 1Dependability coefficient for a given number of raters.
Regression model with answer correct as the dependent variable.
| Fixed effects | Estimate | Std. Error | ||
|---|---|---|---|---|
| (Intercept) | 2.97484 | 0.19790 | 15.032 | ∗∗∗ |
| Familiarity | 0.45390 | 0.06469 | 7.016 | ∗∗∗ |
| Transparency | 0.88302 | 0.05114 | 17.268 | ∗∗∗ |
| Imageability | -0.22449 | 0.04964 | -4.522 | ∗∗∗ |
| Subjective Frequency | 0.13631 | 0.05165 | 2.639 | ∗∗ |
| Usage | 0.15305 | 0.06329 | 2.418 | ∗ |
| Objective Frequency | 0.16227 | 0.06099 | 2.660 | ∗∗ |
| Familiarity × Transparency | -0.07417 | 0.02945 | -2.518 | ∗ |
| Familiarity × Imageability | -0.06873 | 0.02749 | -2.500 | ∗ |
FIGURE 2The effect of Transparency (A) and Imageability (B) on idiom knowledge for idiomatic expressions with different Familiarity ratings. The variables Transparency, Imageability, and Familiarity are rated on a scale from 1 (completely not transparent, imageable, and familiar respectively) to 5 (very transparent, imageable, and familiar respectively).