| Literature DB >> 32477059 |
Nina Heins1,2, Jennifer Pomp1,2, Daniel S Kluger2,3, Ima Trempler1,2, Karen Zentgraf4, Markus Raab5,6, Ricarda I Schubotz1,2.
Abstract
Most human actions produce concomitant sounds. Action sounds can be either part of the action goal (GAS, goal-related action sounds), as for instance in tap dancing, or a mere by-product of the action (BAS, by-product action sounds), as for instance in hurdling. It is currently unclear whether these two types of action sounds-incidental or intentional-differ in their neural representation and whether the impact on the performance evaluation of an action diverges between the two. We here examined whether during the observation of tap dancing compared to hurdling, auditory information is a more important factor for positive action quality ratings. Moreover, we tested whether observation of tap dancing vs. hurdling led to stronger attenuation in primary auditory cortex, and a stronger mismatch signal when sounds do not match our expectations. We recorded individual point-light videos of newly trained participants performing tap dancing and hurdling. In the subsequent functional magnetic resonance imaging (fMRI) session, participants were presented with the videos that displayed their own actions, including corresponding action sounds, and were asked to rate the quality of their performance. Videos were either in their original form or scrambled regarding the visual modality, the auditory modality, or both. As hypothesized, behavioral results showed significantly lower rating scores in the GAS condition compared to the BAS condition when the auditory modality was scrambled. Functional MRI contrasts between BAS and GAS actions revealed higher activation of primary auditory cortex in the BAS condition, speaking in favor of stronger attenuation in GAS, as well as stronger activation of posterior superior temporal gyri and the supplementary motor area in GAS. Results suggest that the processing of self-generated action sounds depends on whether we have the intention to produce a sound with our action or not, and action sounds may be more prone to be used as sensory feedback when they are part of the explicit action goal. Our findings contribute to a better understanding of the function of action sounds for learning and controlling sound-producing actions.Entities:
Keywords: action-effect association; auditory action effects; human action sounds; prediction; sensory attenuation
Year: 2020 PMID: 32477059 PMCID: PMC7237737 DOI: 10.3389/fnins.2020.00483
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 4.677
Figure 1Camera positions and set-up during the point light recordings. (A) Camera positions during hurdling from a top view perspective. Green dots represent the cameras, red lines the hurdles, and the yellow arrow the hurdling track. (B) Camera positions during tap dancing from a top view perspective. Green dots represent the cameras, the yellow square the area in which the tap dancer performed the sequence. (C) Set-up during the recording of hurdling. Three hurdle transitions had to be performed during the recording. The two last hurdles are visible in the figure above. The yellow arrow indicates the hurdling track.
Figure 2Position of the point-light markers. (A) Twelve point-light markers were used and positioned at the shoulders, elbows, wrists, hips, knees, ankles, and the tips of the toes. Exemplary videos can be found in Videos S1–S8. (B) Three snapshots of the hurdling action over the course of one video. (C) Three snapshots of the tap-dancing action over the course of one video.
Figure 3Procedure of the study. Participants were filmed on several occasions during their 9-week training in hurdling and tap dancing. Two behavioral sessions were conducted before the MRI session.
Figure 4Trial composition and action quality rating scores. (A) A trial consisted of a video (3–6 s in length), followed by the video rating question (“How well did you perform the action?” in German), and a fixation cross (3.5–4.5 s in length). The total duration of one trial was approximately 10 s. (B) Mean rating scores for the evaluation of the quality of the action performance presented in the observed videos, obtained during the MRI sessions. Rating scores could range from 1 to 6 (1 representing a low, 6 a high rating of quality). Error bars show standard errors. BAS conditions are represented in yellow, GAS conditions in blue. Vertical stripes represent the scores for picture-scrambled conditions, whereas horizontal stripes represent sound-scrambled conditions. Columns with both vertical and horizontal stripes represent the conditions with both picture and sound scrambled. **p = 0.005, ***p = 0.001.
Figure 5Whole-brain activation of the main effects of action. (A) FDR-corrected t-maps (p < 0.05) for the BAS_normal>GAS_normal contrast. (B) FDR-corrected t-maps (p < 0.05) for the GAS_normal>BAS_normal contrast.
Whole-brain activation of the main effects of action.
| Middle occipital gyrus | 18 | −100 | 20 | 6.45 | 718 |
| Heschl's Gyrus | 54 | −13 | 5 | 4.70 | 39 |
| −51 | −16 | 8 | 4.19 | 18 | |
| SMA | −6 | −7 | 65 | 5.51 | 42 |
| pSTG | 48 | −34 | 5 | 4.92 | 30 |
Regions of activation, MNI (Montreal Neurological Institute) coordinates (x, y, z), t-values for the local maxima (FDR-corrected, p <0.05), activation extent in voxels (clusters larger than k = 20).
Region of interest (ROI) results.
| Heschl's Gyrus | 54 | −10 | 5 | 7.40 | 17 |
| −51 | −16 | 8 | 5.99 | 5 | |
| SMA | −3 | −4 | 68 | 10.66 | 397 |
| pSTG | 54 | −31 | 5 | 11.53 | 173 |
| −54 | −31 | 23 | 6.72 | 131 | |
| −54 | 5 | −4 | 6.01 | 23 | |
| Cerebellum | −27 | −58 | −22 | 8.4 | 36 |
| 24 | −61 | −19 | 7.95 | 62 | |
| 27 | −64 | −52 | 7.76 | 91 | |
| −24 | −64 | −52 | 7.01 | 37 | |
MNI (Montreal Neurological Institute) coordinates (x, y, z), t-values for the local maxima (FDR-corrected, p <0.001), activation extent in voxels. ROI, region of interest; pSTG, posterior superior temporal gyrus; SMA, supplementary motor area.