Nele Hellbernd, Daniela Sammler.
Abstract
Our ability to understand others' communicative intentions in speech is key to successful social interaction. Indeed, misunderstanding an 'excuse me' as apology, while meant as criticism, may have important consequences. Recent behavioural studies have provided evidence that prosody, that is, vocal tone, is an important indicator for speakers' intentions. Using a novel audio-morphing paradigm, the present functional magnetic resonance imaging study examined the neurocognitive mechanisms that allow listeners to 'read' speakers' intents from vocal prosodic patterns. Participants categorized prosodic expressions that gradually varied in their acoustics between criticism, doubt, and suggestion. Categorizing typical exemplars of the three intentions induced activations along the ventral auditory stream, complemented by amygdala and mentalizing system. These findings likely depict the stepwise conversion of external perceptual information into abstract prosodic categories and internal social semantic concepts, including the speaker's mental state. Ambiguous tokens, in turn, involved cingulo-opercular areas known to assist decision-making in case of conflicting cues. Auditory and decision-making processes were flexibly coupled with the amygdala, depending on prosodic typicality, indicating enhanced categorization efficiency of overtly relevant, meaningful prosodic signals. Altogether, the results point to a model in which auditory prosodic categorization and socio-inferential conceptualization cooperate to translate perceived vocal tone into a coherent representation of the speaker's intent.
Year: 2018 PMID: 29771359 PMCID: PMC6022564 DOI: 10.1093/scan/nsy034
Source DB: PubMed Journal: Soc Cogn Affect Neurosci ISSN: 1749-5016 Impact factor: 3.436
Fig. 1. (A) Experimental stimulus creation through audio-morphing. Continua between doubt and suggestion, criticism and doubt, and criticism and suggestion were created with STRAIGHT (Kawahara, 2006). Acoustic features were mixed in consecutive 20% steps. Red dots indicate CLEAR stimuli with only ±10% physical distance from original sounds. Blue dots show AMBIGUOUS stimuli. Spectrograms in the bottom panel exemplify the acoustic transition in the doubt–suggestion continuum. (B & C) Behavioural results. Prosodies with CLEAR intentions (red) were classified more consistently (B) and faster (C) than AMBIGUOUS prosodies (blue).
Acoustic properties and affect ratings of the morph steps in the seven-step prosody continua
| Cont. | Acoustic features | Affect ratings | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Number of voiced frames | Mean f0 (Hz) | Mean HNR (dB) | Mean Intensity (dB) | Offset–onset f0 (Hz) | Spectral centre of gravity (Hz) | Valence ratings | Arousal ratings | |||
| C | 439.0 ± 29.1 | 279.1 ± 29.6 | 69.8 ± 17.2 | 13.1 ± 4.3 | 66.5 ± 3.1 | 133.7 ± 58.5 | 778.3 ± 385.1 | 577.2 ± 146.4 | 3.3 ± 1.8 | 6.8 ± 1.5 |
| | 443.5 ± 30.4 | 266.4 ± 25.5 | 59.0 ± 19.5 | 14.4 ± 4.4 | 67.1 ± 2.4 | 102.6 ± 49.8 | 728.4 ± 366.9 | 558.8 ± 141.1 | 3.7 ± 1.5 | 5.6 ± 1.9 |
| | 448.0 ± 25.9 | 255.4 ± 21.5 | 51.3 ± 20.5 | 15.2 ± 4.4 | 67.7 ± 1.9 | 112.7 ± 54.0 | 683.8 ± 337.9 | 561.8 ± 165.8 | 4.3 ± 1.3 | 4.8 ± 1.7 |
| | 456.0 ± 18.3 | 245.6 ± 17.1 | 46.0 ± 18.7 | 15.4 ± 3.9 | 68.1 ± 1.8 | 95.0 ± 37.1 | 631.0 ± 286.4 | 565.0 ± 163.1 | 4.4 ± 1.4 | 4.1 ± 1.7 |
| | 461.8 ± 18.1 | 236.2 ± 11.9 | 44.0 ± 13.1 | 15.2 ± 4.0 | 68.2 ± 1.8 | 95.6 ± 20.4 | 594.6 ± 247.6 | 572.5 ± 176.1 | 4.3 ± 1.2 | 3.7 ± 1.6 |
| | 467.8 ± 17.9 | 228.1 ± 9.0 | 43.9 ± 5.9 | 14.8 ± 3.8 | 68.4 ± 1.7 | 74.1 ± 21.3 | 563.7 ± 213.6 | 583.1 ± 196.7 | 4.1 ± 1.3 | 3.4 ± 1.5 |
| D | 474.3 ± 22.8 | 220.8 ± 8.8 | 45.0 ± 4.7 | 14.2 ± 3.4 | 68.5 ± 1.4 | 74.1 ± 16.3 | 556.0 ± 198.6 | 640.8 ± 286.7 | 4.0 ± 1.4 | 3.1 ± 1.4 |
| Avg | 455.8 ± 23.2 | 247.4 ± 17.6 | 51.3 ± 14.2 | 14.6 ± 4.0 | 67.8 ± 2.0 | 98.2 ± 36.8 | 648.0 ± 29.9 | 579.9 ± 182.3 | 4.0 ± 1.4 | 4.5 ± 1.6 |
| C | 360.0 ± 21.9 | 279.2 ± 30.7 | 67.5 ± 18.2 | 12.7 ± 4.2 | 66.5 ± 2.7 | 122.9 ± 68.7 | 783.4 ± 388.7 | 591.9 ± 168.1 | 3.4 ± 1.6 | 6.7 ± 1.6 |
| | 360.0 ± 18.2 | 270.4 ± 26.8 | 61.0 ± 19.3 | 13.7 ± 4.2 | 66.8 ± 2.6 | 135.5 ± 58.0 | 726.4 ± 369.6 | 546.5 ± 130.1 | 4.0 ± 1.8 | 6.0 ± 1.7 |
| | 362.5 ± 16.9 | 263.6 ± 23.7 | 58.9 ± 20.4 | 14.8 ± 4.3 | 67.0 ± 2.5 | 154.7 ± 48.1 | 687.7 ± 357.8 | 502.0 ± 96.3 | 4.5 ± 1.2 | 4.9 ± 1.8 |
| | 364.3 ± 18.2 | 257.7 ± 20.7 | 60.5 ± 20.5 | 15.7 ± 3.9 | 67.2 ± 2.4 | 175.0 ± 45.1 | 665.4 ± 353.0 | 472.5 ± 66.0 | 5.3 ± 1.4 | 4.7 ± 1.7 |
| | 364.3 ± 18.4 | 252.2 ± 17.3 | 65.2 ± 20.0 | 16.6 ± 3.8 | 67.4 ± 2.3 | 198.5 ± 35.1 | 647.8 ± 341.4 | 452.7 ± 33.8 | 5.8 ± 1.3 | 4.2 ± 1.9 |
| | 366.0 ± 18.6 | 248.3 ± 14.4 | 72.9 ± 20.5 | 17.0 ± 3.9 | 67.5 ± 2.2 | 221.0 ± 34.9 | 641.7 ± 337.9 | 439.2 ± 13.7 | 6.4 ± 1.0 | 4.4 ± 2.0 |
| S | 365.3 ± 16.6 | 245.8 ± 10.9 | 82.3 ± 23.0 | 16.1 ± 3.8 | 67.6 ± 2.1 | 236.8 ± 52.6 | 638.4 ± 331.4 | 438.4 ± 30.2 | 6.7 ± 1.1 | 4.2 ± 1.7 |
| Avg | 363.2 ± 18.4 | 259.6 ± 20.6 | 66.9 ± 20.3 | 15.2 ± 4.0 | 67.1 ± 2.0 | 177.8 ± 48.9 | 684.4 ± 354.3 | 491.9 ± 76.9 | 5.1 ± 1.4 | 5.0 ± 1.8 |
| D | 397.8 ± 38.9 | 221.7 ± 9.3 | 43.1 ± 1.1 | 13.2 ± 3.2 | 68.5 ± 1.6 | 83.1 ± 9.6 | 570.7 ± 210.4 | 629.2 ± 239.9 | 4.3 ± 1.4 | 3.2 ± 1.3 |
| | 393.8 ± 39.8 | 225.1 ± 8.6 | 45.8 ± 5.2 | 14.7 ± 3.7 | 68.6 ± 1.7 | 100.8 ± 12.2 | 555.3 ± 224.1 | 562.3 ± 174.8 | 4.7 ± 1.2 | 3.1 ± 1.4 |
| | 398.8 ± 32.8 | 229.7 ± 9.2 | 51.4 ± 7.9 | 15.8 ± 3.9 | 68.5 ± 1.7 | 126.3 ± 43.6 | 555.8 ± 246.1 | 514.6 ± 122.5 | 4.9 ± 1.1 | 3.2 ± 1.5 |
| | 397.8 ± 35.1 | 233.9 ± 9.4 | 57.0 ± 11.0 | 16.7 ± 4.0 | 68.4 ± 1.8 | 154.1 ± 26.7 | 570.8 ± 271.6 | 480.4 ± 80.4 | 5.4 ± 0.9 | 3.3 ± 1.7 |
| | 395.5 ± 35.9 | 238.1 ± 10.4 | 63.6 ± 14.1 | 17.4 ± 3.7 | 68.2 ± 1.9 | 184.4 ± 10.2 | 594.9 ± 297.5 | 456.2 ± 38.1 | 5.8 ± 1.2 | 3.6 ± 1.6 |
| | 394.0 ± 36.8 | 243.1 ± 11.8 | 71.5 ± 17.6 | 17.7 ± 3.7 | 67.9 ± 2.2 | 218.3 ± 42.2 | 622.6 ± 320.6 | 442.1 ± 8.0 | 6.3 ± 1.4 | 4.0 ± 1.7 |
| S | 395.0 ± 35.9 | 249.7 ± 12.1 | 82.0 ± 22.7 | 16.3 ± 4.0 | 67.4 ± 2.8 | 252.1 ± 59.9 | 660.0 ± 349.0 | 439.6 ± 38.4 | 6.6 ± 1.2 | 4.5 ± 1.8 |
| Avg | 396.1 ± 36.5 | 234.5 ± 10.1 | 59.2 ± 11.4 | 16.0 ± 3.8 | 68.2 ± 2.0 | 159.9 ± 30.6 | 590.0 ± 272.9 | 503.5 ± 100.3 | 5.4 ± 1.2 | 3.6 ± 1.6 |
Note. Values depict mean ± SD. SD = standard deviation, f0 = fundamental frequency, HNR = harmonics-to-noise ratio. All values were extracted using PRAAT 5.3.01 (http://www.praat.org).
Valence ratings: 1 = negative, 9 = positive;
Arousal ratings: 1 = calm; 9 = aroused. Cont.: continuum; C: criticism; D: doubt; S: suggestion; Avg: average.
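The note does not say how the "Avg" rows were derived. A minimal sketch, assuming each "Avg" entry is simply the arithmetic mean of the seven morph-step means in its continuum, can be used to check a column against the reported value. The numbers below are copied from the second value column of the criticism–doubt continuum; the variable names are illustrative only.

```python
from statistics import mean

# Second value column of the criticism-doubt continuum, copied from the
# table above (one mean per morph step, from C to D).
# Assumption: "Avg" is the plain mean across the seven steps; the table
# note does not state this explicitly.
step_means = [279.1, 266.4, 255.4, 245.6, 236.2, 228.1, 220.8]
reported_avg = 247.4  # "Avg" row, same column

# Round to one decimal place, matching the precision used in the table.
computed_avg = round(mean(step_means), 1)
print(computed_avg, computed_avg == reported_avg)
```

The same check can be repeated for any other column by swapping in its seven step values and the corresponding "Avg" entry.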
Statistical comparisons of acoustic stimulus features and affect ratings between clear and ambiguous stimuli
| Cont. | Acoustic features | Affect ratings | ||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Voiced frames | Mean f0 | SD f0 | Mean HNR | Mean Intensity | Offset- onset f0 | Spectral CoG | Valence ratings | Arousal ratings | ||||||||||||
| C – D | 0.10 | 0.92 | 0.30 | 0.77 | 1.17 | 0.25 | −0.80 | 0.43 | −0.55 | 0.59 | −0.32 | 0.75 | 0.19 | 0.85 | 0.37 | 0.71 | −5.14 | 0.00 | 3.78 | 0.00 |
| C – S | −0.14 | 0.89 | 0.37 | 0.71 | 1.29 | 0.21 | −0.58 | 0.57 | −0.13 | 0.90 | 0.14 | 0.89 | 0.25 | 0.81 | 0.80 | 0.43 | −0.71 | 0.49 | 10.69 | 0.00 |
| D – S | −0.18 | 0.86 | 0.21 | 0.83 | 0.53 | 0.60 | −0.82 | 0.42 | −0.41 | 0.68 | 0.00 | 0.71 | 0.30 | 0.77 | 0.75 | 0.46 | 1.98 | 0.07 | 2.42 | 0.03 |
Note. Cont.: continuum; C – D: criticism–doubt; C – S: criticism–suggestion; D – S: doubt–suggestion; HNR: harmonics-to-noise ratio; CoG: center of gravity.
Independent-samples t-tests.
Paired-samples t-tests.
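For readers unfamiliar with the two test types named in the footnotes, the following is a minimal sketch of how the respective t statistics are computed, using only the Python standard library. The function names and the toy input values are hypothetical and are not taken from the study; the independent-samples version assumes equal variances (pooled estimate), which the source does not specify.

```python
from math import sqrt
from statistics import mean, stdev, variance

def independent_t(a, b):
    """Student's t for two independent samples (pooled variance; an assumption)."""
    na, nb = len(a), len(b)
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / (na + nb - 2)
    return (mean(a) - mean(b)) / sqrt(pooled * (1 / na + 1 / nb))

def paired_t(a, b):
    """t for paired samples: mean of the differences over its standard error."""
    diffs = [x - y for x, y in zip(a, b)]
    return mean(diffs) / (stdev(diffs) / sqrt(len(diffs)))

# Toy values for illustration only (not data from the study).
t_ind = independent_t([1.0, 2.0, 3.0], [2.0, 3.0, 4.0])
t_rel = paired_t([2.0, 4.0, 7.0], [1.0, 2.0, 4.0])
print(t_ind, t_rel)
```

An independent-samples test fits comparisons between different sets of stimuli, whereas a paired test fits cases where the same raters or items contribute to both conditions.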
Fig. 2. Activation maps and PPI analysis. (A) CLEAR > AMBIGUOUS prosodies activated auditory prosodic as well as sociocognitive brain regions. (B) AMBIGUOUS > CLEAR prosodies activated cingulo-opercular areas in both hemispheres. Threshold: voxel P < 0.0001, cluster P < 0.05 FWE-corrected. (C) Amygdala was functionally connected with auditory prosodic as well as cingulo-opercular regions, more strongly during clear than ambiguous prosodies. Results are displayed for the right amygdala seed only (for the similar results of the left amygdala seed, see Table 4). Threshold: voxel P < 0.001, cluster P < 0.05 FWE-corrected. SMG: supramarginal gyrus; pTPJ: posterior temporoparietal junction; AG: angular gyrus; PP: planum polare; STG/STS: superior temporal gyrus/sulcus; MTG: middle temporal gyrus; HG: Heschl’s gyrus; PT: planum temporale; PCC: posterior cingulate cortex; mPFC: medial prefrontal cortex; pre-SMA: pre-supplementary motor area; IFG: inferior frontal gyrus; FWE: family-wise error.
Functional activations for the contrasts Clear > Ambiguous and Ambiguous > Clear
| Brain region | Hem. | BA | k | x | y | z | |
|---|---|---|---|---|---|---|---|
| Hippocampus | R | – | | 33 | −16 | −17 | 4.68 |
| Heschl’s gyrus | R | 41/42 | | 48 | −7 | 1 | 5.17 |
| Planum temporale | R | 22 | | 57 | −22 | 7 | 4.40 |
| Posterior superior temporal gyrus | R | 22 | | 66 | −34 | 16 | 4.19 |
| Anterior superior temporal sulcus | R | 22/21 | | 48 | −4 | −17 | 4.52 |
| Anterior middle temporal gyrus | R | 21 | | 57 | −7 | −20 | 4.47 |
| Central operculum | R | 48 | | 54 | −13 | 13 | 4.58 |
| Putamen | L | – | | −27 | −1 | −2 | 4.87 |
| Hippocampus | L | – | | −27 | −19 | −17 | 3.88 |
| Pallidum | L | – | | −21 | −4 | 7 | 4.15 |
| Planum polare | L | 22 | | −42 | −13 | −5 | 4.61 |
| Central operculum | L | 48 | | −51 | −4 | 7 | 3.79 |
| Angular gyrus | L | 40 | | −54 | −58 | 34 | 4.87 |
| Supramarginal gyrus | L | 40 | | −60 | −40 | 40 | 4.52 |
| Paracingulate gyrus | L | 11 | | −3 | 38 | −8 | 3.75 |
| Medial prefrontal cortex | R | 11 | | 6 | 41 | −17 | 4.02 |
| Paracingulate gyrus | R | 32 | | 3 | 26 | 40 | 5.47 |
| Presupplementary motor area (a) | L | 6/32 | | −3 | 11 | 55 | 5.26 |
| Anterior insula | R | | | 36 | 23 | 1 | 3.98 |
Note. BA: Brodmann area; L: left hemisphere; R: right hemisphere; k: cluster extent (number of voxels); p. op.: pars opercularis; p. tri.: pars triangularis. Coordinates indicate cluster peaks in MNI-space. Main peaks are in bold. P-voxel < 0.0001, P-cluster < 0.05 FWE-corrected.
(a) Peaks that are significant at P-voxel < 0.05 FWE-corrected.
Results of the PPI analysis
| Brain region | Hem. | BA | k | x | y | z | |
|---|---|---|---|---|---|---|---|
| Anterior insula | L | – | | −42 | 14 | −5 | 3.92 |
| Inferior frontal gyrus (p. op.) | L | 44 | | −54 | 14 | 7 | 3.86 |
| Superior temporal gyrus | L | 22 | | −57 | −4 | −8 | 3.27 |
| Inferior frontal gyrus (p. tri.) | R | 45 | | 45 | 29 | 7 | 3.95 |
| Inferior frontal gyrus (p. op.) | R | 44 | | 57 | 20 | 13 | 3.54 |
| Central opercular cortex | R | 48 | | 48 | 5 | 7 | 3.45 |
| Anterior insula / frontal orbital cortex | R | −/47 | | 42 | 20 | −8 | 4.76 |
| Precentral gyrus | R | 6 | | 60 | 5 | 22 | 3.53 |
| Planum polare | R | 22 | | 51 | −1 | −5 | 4.29 |
| Superior temporal gyrus / sulcus | R | 22/21 | | 45 | −25 | −2 | 3.93 |
| Heschl’s gyrus | R | 41/42 | | 54 | −13 | 7 | 3.65 |
| Heschl’s gyrus | L | 41/42 | | −54 | −10 | 7 | 3.67 |
| Central operculum | L | 48 | | −39 | −19 | 16 | 4.13 |
| Superior temporal gyrus | L | 22 | | −57 | −22 | 1 | 4.07 |
| Putamen | L | – | | −33 | −10 | 4 | 3.31 |
| Inferior frontal gyrus (p. op.) | L | 44 | | −60 | 14 | 10 | 4.38 |
| Inferior frontal gyrus (p. tri.) | L | 45 | | −51 | 38 | 4 | 3.48 |
| Central sulcus | L | 4/6 | | −30 | −28 | 67 | 3.80 |
| Postcentral gyrus | L | 3 | | −39 | −34 | 61 | 3.59 |
| Intraparietal sulcus | L | 7/40 | | −27 | −46 | 40 | 3.56 |
| Paracingulate gyrus | R | 32 | | 3 | 8 | 46 | 3.49 |
Note. BA: Brodmann area; L: left hemisphere; R: right hemisphere; k: cluster extent (number of voxels). Coordinates indicate cluster peaks in MNI-space. Main peaks are in bold. P-voxel < 0.001, P-cluster < 0.05 FWE-corrected.