| Literature DB >> 31183122 |
Stephen Politzer-Ahles1, Lei Pan1.
Abstract
The McGurk effect is an illusion whereby speech sounds are often mis-categorized when the auditory cues in the stimulus conflict with the visual cues from the speaker's face. A recent study claims that 'skilled musicians are not subject to' this effect. It is not clear, however, if this is intended to mean that skilled musicians do not experience the McGurk effect at all, or if they just experience it to a lesser magnitude than non-musicians. The study also does not statistically demonstrate either of these conclusions, as it does report a numerical (albeit non-significant) McGurk effect for musicians and does not report a significant difference between musicians' and non-musicians' McGurk effect sizes. This article reports a pre-registered, higher-power replication of that study (using twice the sample size and changing from a between- to a within-participants manipulation). Contrary to the original study's conclusion, we find that musicians do show a large and statistically significant McGurk effect and that their effect is no smaller than that of non-musicians.Entities:
Keywords: McGurk effect; audiovisual integration; musicians; replication; speech perception
Year: 2019 PMID: 31183122 PMCID: PMC6502376 DOI: 10.1098/rsos.181868
Source DB: PubMed Journal: R Soc Open Sci ISSN: 2054-5703 Impact factor: 2.963
Figure 1.Results for each participant, with conditions coded according to Proverbio and colleagues' [2] scheme. (a) Shows the results for non-musicians, and (b) results for musicians. In each panel, accuracy for congruent audiovisual stimuli is shown on the left, followed by accuracy for auditory-only stimuli in the middle and accuracy for incongruent audiovisual stimuli on the right. The y-axis shows accuracy ranging from 0% correct at the bottom to 100% correct at the top. In each panel, each participant's accuracy on the three conditions is represented by a series of three dots connected by lines. Superimposed over the dots in each condition is a thick red line, one per condition, showing average accuracy across participants. For each group, accuracy in the congruent audiovisual condition is clustered near the maximum, accuracy in the audio-only condition is similar or slightly lower for most participants and accuracy in the audiovisual incongruent condition is slightly or substantially lower than accuracy in the audio-only condition for every participant.
Figure 2.Results for each participant, with conditions coded in our targeted analysis (based only on whether the place of articulation is incongruent between audio and visual information). The figure is laid out in the same way as figure 1, and the pattern of results is the same, except that the accuracy for congruent audiovisual trials is generally higher and the accuracy for incongruent audiovisual trials generally lower.
Figure 3.Schematic comparison of the results of this study with the results of Proverbio et al. [2]. In each plot, a rough guess of the effect and confidence interval from Proverbio et al. [2] is shown; exact effects cannot be plotted because the exact values of the effects in the paper are not clear (see text). Likewise, the effect and confidence interval of this study are plotted. The left-hand plot shows the McGurk effect for musicians (represented as accuracy in the audio-only condition minus accuracy in the audiovisual incongruent condition, such that higher values represent larger McGurk effects; the horizontal line represents zero McGurk effect). The effect observed by Proverbio and colleagues [2] was small and positive, but not significant, and probably had a large confidence interval (meaning it is not inconsistent with a large McGurk effect or with no McGurk effect). The effect observed in this study was large and positive, with a small confidence interval. It is not known whether the present effect is or is not within the confidence interval of Proverbio et al. [2] (see text), but we drew the schematic this way to represent that our result is not conceptually inconsistent with theirs. The right-hand plot shows the difference between non-musicians' and musicians’ McGurk effects, such that a higher value represents bigger McGurk effects for non-musicians. The difference observed by Proverbio and colleagues [2] was small and positive, but not significant, and probably had a large confidence interval. The difference in this study was small and in the opposite direction, with a small confidence interval. Again, it is not known whether our effect is within the confidence interval of Proverbio et al. [2].
Summary of differences between Proverbio et al. [2] and this study. ‘Minor differences’ are those which we have no reason to believe would substantially affect the results, but which we nonetheless report here in an attempt to be comprehensive, and with no intention to claim that one approach or the other is superior. Here we have not listed extremely minor procedural differences (such as the type of hardware used or the use of PowerPoint versus DMDX for stimulus presentation) which we have no a priori reason to believe would affect the results at all.
| Proverbio | this study | |
|---|---|---|
| major differences | ||
| pre-registered? | No | Yes |
| sample size | 30 musicians and 30 non-musicians | 62 musicians and 62 non-musicians |
| design | between participants (for audio-only versus audiovisual comparison) | within participants (for audio-only versus audiovisual comparison) |
| key comparisons | some conclusions based on comparing incongruent condition to audio-only, some on comparing incongruent to congruent | all comparisons based on comparing incongruent to audio-only |
| stimuli | little background noise | substantial background noise |
| condition coding | stimuli with same place of articulation can be coded as incongruent | stimuli with same place of articulation coded as congruent (in the second analysis only) |
| analysis | ANOVA on arcsin-transformed data | generalized linear mixed-effects models |
| data availability | summary data in article | all data and code in online repository |
| McGurk effect for musicians? | unclear (but reported as no effect) | large effect |
| musicians’ McGurk effect smaller than non-musicians'? | unclear (effect numerically smaller but not significant) | no (actually significantly |
| minor differences | ||
| non-musicians' inclusion criteria | ‘lack of musical studies and specific interest in music as a hobby’ | no music training within the past 10 years |
| language | Italian | Mandarin and Cantonese |
| inter-trial interval | 5 s | 4 s |
| participant distance from monitor | 80 cm | not controlled |
| individual or group administration | not reported | participants came in groups |
| response coding | manual | automated |