| Literature DB >> 28408890 |
Yi-Chuan Chen1, Charles Spence1.
Abstract
There has been longstanding interest from both experimental psychologists and cognitive neuroscientists in the potential modulatory role of various top-down factors on multisensory integration/perception in humans. One such top-down influence, often referred to in the literature as the 'unity assumption,' is thought to occur in those situations in which an observer considers that various of the unisensory stimuli that they have been presented with belong to one and the same object or event (Welch and Warren, 1980). Here, we review the possible factors that may lead to the emergence of the unity assumption. We then critically evaluate the evidence concerning the consequences of the unity assumption from studies of the spatial and temporal ventriloquism effects, from the McGurk effect, and from the Colavita visual dominance paradigm. The research that has been published to date using these tasks provides support for the claim that the unity assumption influences multisensory perception under at least a subset of experimental conditions. We then consider whether the notion has been superseded in recent years by the introduction of priors in Bayesian causal inference models of human multisensory perception. We suggest that the prior of common cause (that is, the prior concerning whether multisensory signals originate from the same source or not) offers the most useful way to quantify the unity assumption as a continuous cognitive variable.Entities:
Keywords: coupling priors; crossmodal correspondences; semantic congruency; the unity assumption; the unity effect
Year: 2017 PMID: 28408890 PMCID: PMC5374162 DOI: 10.3389/fpsyg.2017.00445
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Summary of the unity effect demonstrated in studies of the spatial ventriloquism effect.
| Study | Origins of the unityassumption | Stimuli | Experimental paradigm | Effect? |
|---|---|---|---|---|
| Semantic congruency, redundant information (temporal structure) | Kettle and whistle vs. light and bell | Spatial ventriloquism | Yes | |
| Instruction, redundant information (temporal synchrony), and semantic congruency | Human face and voice vs. tape mark and voice (Experiment 1) | Spatial ventriloquism (Experiment 1) | Yes, but only when the stimuli were synchronous and semantically congruent | |
| Human face/spot and voice/click (Experiment 4) | Spatial discrimination (Experiment 4) | |||
| Redundant information (spatial and temporal coincidence) | Light and white noise | Spatial ventriloquism | Yes | |
| Crossmodal correspondence (size and pitch) | Visual disk and pure tone | Spatial discrimination | Yes | |
| Semantic congruency | Human speech | Spatial ventriloquism | Yes | |
| Context | Visual white-noise disk and auditory white-noise burst | Auditory spatial realignment | Yes | |
| Semantic congruency | Human speech or playing bongos (full video vs. synchronized light) | Spatial ventriloquism aftereffect | No | |
| Semantic congruency, instruction | Playing bongos (full video vs. synchronized light) | Spatial ventriloquism aftereffect | No | |
| Semantic congruency | Human speech | Spatial ventriloquism | No | |
Summary of the unity effect demonstrated in studies of the temporal ventriloquism effect.
| Study | Origins of the unityassumption | Stimuli | Experimental paradigm | Effect? |
|---|---|---|---|---|
| Semantic congruency | Human speech | TOJ | Yes | |
| Semantic congruency | Human speech, monkey calls | TOJ | Yes, but only for human speech | |
| Crossmodal correspondence (size and pitch) | Visual disk and pure tone | Temporal ventriloquism | Yes | |
| Crossmodal correspondence (size and pitch; shape and pitch) | Visual disk and pure tone, Visual pattern and pure tone | TOJ | Yes | |
| Semantic congruency | Playing instruments and object actions (hammer smash ices and ball bouncing) | TOJ | No | |
| Crossmodal correspondence (size and pitch) | Visual disk and pure tone | Temporal ventriloquism | Replicated the condition in | |
Summary of the unity effect demonstrated in studies of the McGurk effect.
| Study | Origins of the unityassumption | Stimuli | Perception | Effect? |
|---|---|---|---|---|
| Temporal synchrony (±200 ms) | V: /da/; A: /ba/ | /va/ or /ga/ | Yes: /va/ decreased and /ga/ increased when V leading | |
| Temporal synchrony (±360 ms) | V: /aga/ or /igi/; A: /aba/ | /ada/ or /idi/ | Yes: -60 to 240 ms for V/aga/ | |
| Stimulus congruency (vowel) | No effect for V/igi/ | |||
| Temporal synchrony (±467 ms) | V: /ka/ or /ga/; A: /pa/ or /ba/ | /ta/ or /da/ | Yes: -30 to 170 ms | |
| Temporal synchrony (-640 to 720 ms) | V: /ba/; A:/da/ | /bda/ | Yes: -320 to 480 ms | |
| Spatial disparity (±90°) | V: /igi/, /IgI/ or /ægæ/ | /idi/, /IdI/ or /ædæ/ | No | |
| A: /igi/, /IgI/ or /ægæ/ | ||||
| Temporal synchrony (±360 ms) | V: /ava/; A: /aba/ | /ava/ or /aba/ | Yes: -60 to 180 ms | |
| Spatial disparity (±90°) | No effect for spatial disparity | |||
| Congruency (phonetic) | V: lips movements; A: spoken words | Errors in lip-reading | Yes: fewer errors in lip-reading in the higher discrepancy condition | |
| Congruency (gender) | V /ga/ or /gi/; A:/ba/ or /bi/ | /da/, /ða/ or /di/, /ði/ | No | |
| Familiarity (face and voice from familiar or unfamiliar person) | V /ga/ or /gi/; A:/ba/ or /bi/ | /da/, /ða/ or /di/, /ði/ | Yes: the McGurk effect was larger for familiar face and voice | |
| Context (coherence of audiovisual syllables) | V: /ga/; A:/ba/ | /da/ | Yes: larger McGurk effect in the coherent context | |
| Context (coherence of audiovisual syllables) | V: /ga/; A:/ba/ | /da/ | Yes: smaller McGurk effect when perceiving one incoherent syllable, but recovered after perceiving more coherent syllables | |
Summary of the unity effect demonstrated in studies of the Colavita visual dominance effect.
| Study | Origins of the unityassumption | Stimuli | Experimental paradigm | Effect? |
|---|---|---|---|---|
| Temporal synchrony (±600 ms) | V: LED; A: pure tone (4000 Hz) | Speeded detection | Yes: -35 to 150 ms | |
| Spatial disparity (±13° or ±26°) | V: LED; A: white noise | Speeded detection | Yes: larger Colavita effect in the same location condition | |
| Spatial disparity (±12.5°) | V: LED; T: tactile vibrations | Speeded detection | Yes: larger Colavita effect in the same location condition | |
| Semantic congruency | V: dog or cat picture; A: barking or meowing sound | Speeded detection | No | |
| Crossmodal correspondence (size and pitch) | Visual disk and pure tone | Speeded detection | No | |