| Literature DB >> 36267060 |
Abstract
How are abstract concepts grounded in perceptual experiences for shaping human conceptual knowledge? Recent studies on abstract concepts emphasizing the role of language have argued that abstract concepts are grounded indirectly in perceptual experiences and language (or words) functions as a bridge between abstract concepts and perceptual experiences. However, this "indirect grounding" view remains largely speculative and has hardly been supported directly by empirical evidence. In this paper, therefore, we test the indirect grounding view by means of multimodal distributional semantics, in which the meaning of a word (i.e., a concept) is represented as the combination of textual and visual vectors. The newly devised multimodal distributional semantic model incorporates the indirect grounding view by computing the visual vector of an abstract word through the visual vectors of concrete words semantically related to that abstract word. An evaluation experiment is conducted in which conceptual representation is predicted from multimodal vectors using a multilayer feed-forward neural network. The analysis of prediction performance demonstrates that the indirect grounding model achieves significantly better performance in predicting human conceptual representation of abstract words than other models that mimic competing views on abstract concepts, especially than the direct grounding model in which the visual vectors of abstract words are computed directly from the images of abstract concepts. This result lends some plausibility to the indirect grounding view as a cognitive mechanism of grounding abstract concepts.Entities:
Keywords: abstract concepts; conceptual representation; embodied cognition; indirect grounding; multimodal distributional semantic model; symbol grounding problem
Year: 2022 PMID: 36267060 PMCID: PMC9577286 DOI: 10.3389/fpsyg.2022.906181
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Summary of conceptual representation theories.
|
|
|
|
|---|---|---|
| Situated simulation view (e.g., Barsalou, | • Concepts are grounded in perceptual experiences via mental simulation. | No |
| Dual coding theory citepPaivio71,Paivio86 | • Concrete concepts are both linguistic and grounded in perceptual experiences. | Yes |
| Hybrid view (e.g., Barsalou et al., | • Concepts are both linguistic and grounded in perceptual experiences. | No |
| Indirect grounding view (e.g., Howell et al., | • Concepts are both linguistic and grounded in perceptual experiences. | Yes |
Figure 1Two methods to compute the visual vector in the multimodal distributional semantic model for indirect grounding. (A) Direct grounding. (B) Indirect grounding.
Figure 2The neural network used for predicting Binder et al.'s (2016) conceptual representation of a word w from textual and/or visual vectors in the evaluation experiment. The visual vector used as input is either (for the indirect grounding and indirect visual models) or (for the hybrid, dual-coding, and visual models).
Predictions of evaluation performance by conceptual representation theories.
|
| ||
|---|---|---|
|
| ||
|
|
|
|
| Situated simulation view (e.g., Barsalou, | Visual | Visual |
| Dual coding theory citepPaivio71,Paivio86 | Dual coding, textual | Dual coding, hybrid, indirect grounding |
| Hybrid view (e.g., Barsalou et al., | Hybrid | Dual coding, hybrid, indirect grounding |
| Indirect grounding view (e.g., Howell et al., | Indirect grounding | Dual coding, hybrid, indirect grounding |
Example of words included in Binder et al.'s (2016) dataset, which are selected mainly from abstract words.
|
|
|
|
|---|---|---|
| Noun | Abstract construct | analogy, irony, truth, verb, worth |
| Cognitive entity | belief, hope, knowledge, sympathy, wit | |
| Emotion | gratitude, joy, love, shame, woe | |
| Social event | advice, deceit, matinee, snub, tribute | |
| Time period | day, era, evening, semester, summer | |
| Verb | Locative action | approach, deliver, go, leave, walk |
| Social action | arrest, celebrate, help, play, write | |
| Adjective | Visual property | black, dark, new, red, shiny |
| Emotional property | angry, dangerous, happy, lonely, peaceful |
Sixty-five attributes used in Binder et al.'s (2016) conceptual representation.
|
|
|
|---|---|
| Vision | Vision, bright, dark, color, pattern, large, small, motion, biomotion, fast, slow, shape, complexity, face, body |
| Somatic | Touch, temperature, texture, weight, pain |
| Audition | Audition, loud, low, high, sound, music, speech |
| Gustation | Taste |
| Olfaction | Smell |
| Motor | Head, upper-limb, lower-limb, practice |
| Spatial | Landmark, path, scene, near, toward, away, number |
| Temporal | Time, duration, long, short |
| Causal | Caused, consequential |
| Social | Social, human, communication, self |
| cognition | Cognition |
| Emotion | Benefit, harm, pleasant, unpleasant, happy, sad, angry, disgusted, fearful, surprised |
| Drive | Drive, needs |
| Attention | Attention, arousal |
Mean correlations for the indirect grounding model and other models.
| θ | θ | |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| ||||||
| Indirect grounding ( |
| 0.742 |
|
| 0.761 |
|
| Hybrid ( | 0.764 |
| 0.748 | 0.724 |
| 0.748 |
| Dual coding ( | 0.762 | 0.734 | 0.740 | 0.729 | 0.756 | 0.745 |
|
| ||||||
| Visual ( | 0.536 | 0.475 | 0.488 | 0.480 | 0.494 | 0.488 |
| Textual ( | 0.755 | 0.740 | 0.744 | 0.716 | 0.762 | 0.744 |
| Indirect visual ( | 0.626 | 0.490 | 0.520 | 0.529 | 0.513 | 0.519 |
Boldfaced numbers indicate the highest correlations (i.e., the best performance) among the models.
Figure 3Boxplots of word correlations for the indirect grounding model and other models.
Figure 4Mean correlations over abstract words for the indirect grounding, hybrid, and dual coding models as a function of the concreteness threshold θ. Pairwise differences significant at p < 0.05 are indicated by color bars below the graph.
Figure 5Mean difference of correlation diff(r) between the indirect grounding model and two competing models per each of the equally divided intervals of concreteness. The heatmap depicts the mean correlation of hybrid, dual coding, and indirect grounding models. Numbers in parentheses denote the number of words n within an interval. Red and blue graphs, respectively, denote the degree of improvement against the hybrid model and the dual coding model. (A) θ = 3.0. (B) θ = 4.0.
Figure 6Mean difference of correlation diff(r) between the indirect grounding model and two baseline models averaged per semantic category for abstract words (θ = 4.0). The heatmap depicts the mean correlation of hybrid, textual, and indirect grounding models. Numbers in parentheses denote the number of abstract words contained in semantic categories. Red and blue graphs, respectively, denote the degree of improvement against the hybrid model and the textual model.
Figure 7Correlation coefficient of the indirect grounding model DSM for some individual abstract concepts as a function of concreteness threshold θ. The plot at θ = 1.0 denotes the correlation of the hybrid (i.e., direct grounding) model DSM (because DSM with θ = 1.0 is identical to DSM). The blue horizontal line is drawn at the correlation coefficient of the textual model DSM. The dashed vertical line denotes the word concreteness, and thus the line chart in the red shaded area represents the correlation obtained using indirect visual vectors.