| Literature DB >> 32132562 |
Marianna Boros1,2, Anna Gábor3,4, Dóra Szabó4, Anett Bozsik3,5, Márta Gácsi4,6, Ferenc Szalay7, Tamás Faragó4, Attila Andics8,9.
Abstract
In the human speech signal, cues of speech sounds and voice identities are conflated, but they are processed separately in the human brain. The processing of speech sounds and voice identities is typically performed by non-primary auditory regions in humans and non-human primates. Additionally, these processes exhibit functional asymmetry in humans, indicating the involvement of distinct mechanisms. Behavioural studies indicate analogue side biases in dogs, but neural evidence for this functional dissociation is missing. In two experiments, using an fMRI adaptation paradigm, we presented awake dogs with natural human speech that either varied in segmental (change in speech sound) or suprasegmental (change in voice identity) content. In auditory regions, we found a repetition enhancement effect for voice identity processing in a secondary auditory region - the caudal ectosylvian gyrus. The same region did not show repetition effects for speech sounds, nor did the primary auditory cortex exhibit sensitivity to changes either in the segmental or in the suprasegmental content. Furthermore, we did not find evidence for functional asymmetry neither in the processing of speech sounds or voice identities. Our results in dogs corroborate former human and non-human primate evidence on the role of secondary auditory regions in the processing of suprasegmental cues, suggesting similar neural sensitivity to the identity of the vocalizer across the mammalian order.Entities:
Mesh:
Year: 2020 PMID: 32132562 PMCID: PMC7055288 DOI: 10.1038/s41598-020-60395-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Stimulus presentation protocol. (a) The sparse sampling design. Rep – repetition block, Acq – MR volume acquisition, Ch – change block. (b) Examples of stimuli in a repetition block and in a change block in the Speech sound processing experiment spoken by one out of six (3 males and three females) speakers. (c) Examples of stimuli in a repetition block and in a change block in the Voice identity processing experiment.
Speech responsive auditory regions from Model 1 (all acoustic stimuli versus silence contrast).
| Hemisphere | Region | Z Score | cluster size | cluster-level pFWE-corr | Coordinates (based on | ||
|---|---|---|---|---|---|---|---|
| L | Mid ectosylvian gyrus | 4.42 | 196 | <0.001 | −22 | −16 | 20 |
| L | Caudal ectosylvian gyrus | 3.64 | −26 | −24 | 6 | ||
| R | Mid ectosylvian gyrus | 4.38 | 150 | 0.001 | 24 | −18 | 18 |
| R | Caudal ectosylvian gyrus | 3.85 | 24 | −16 | 2 | ||
Thresholds: p < 0.001, uncorrected for multiple comparisons at the voxel level, and an FWE-corrected threshold of p < 0.05 at the cluster level. The table lists all local maxima at least 16 mm apart, for each suprathreshold cluster.
Figure 2Processing of speech sounds and voice identities in dog brains. (a) Whole brain results of Model 1. Group level activities in the bilateral auditory cortices in response to all acoustic stimuli (speech sound processing and voice identity processing experiments combined) rendered on a template dog brain. Thresholds: p < 0.001, uncorrected for multiple comparisons at the voxel level, and an FWE-corrected threshold of p < 0.05 at the cluster level. mESG – mid ectosylvian gyrus (main peak left: −22, −16, 20; right: 24, −18, 18); cESG – caudal ectosylvian gyrus (main peak left: −26, −24, 6; right: 24, −16, 2). (b) The results of the ROI analysis of parameter estimates extracted from the bilateral mESG and cESG in the speech sound processing experiment. (c) The results of the ROI analysis of parameter estimates extracted from the bilateral mESG and cESG in the voice identity processing experiment. Error bars represent the SEM. *p < 0.05.