Stefanie Schelinski, Kamila Borowiak, Katharina von Kriegstein.
Abstract
The ability to recognise the identity of others is a key requirement for successful communication. Brain regions that respond selectively to voices exist in humans from early infancy on. Currently, it is unclear whether dysfunction of these voice-sensitive regions can explain voice identity recognition impairments. Here, we used two independent functional magnetic resonance imaging studies to investigate voice processing in a population that has been reported to have no voice-sensitive regions: autism spectrum disorder (ASD). Our results refute the earlier report that individuals with ASD have no responses in voice-sensitive regions: passive listening to vocal, compared to non-vocal, sounds elicited typical responses in voice-sensitive regions in the high-functioning ASD group and controls. In contrast, the ASD group had a dysfunction in voice-sensitive regions during voice identity but not speech recognition in the right posterior superior temporal sulcus/gyrus (STS/STG), a region implicated in processing complex spectrotemporal voice features and unfamiliar voices. The right anterior STS/STG correlated with voice identity recognition performance in controls but not in the ASD group. The findings suggest that right STS/STG dysfunction is critical for explaining voice recognition impairments in high-functioning ASD and show that ASD is not characterised by a general lack of voice-sensitive responses.
Keywords: auditory; autism spectrum disorder; person identity recognition; superior temporal sulcus; voice recognition
Year: 2016 PMID: 27369067 PMCID: PMC5091681 DOI: 10.1093/scan/nsw089
Source DB: PubMed Journal: Soc Cogn Affect Neurosci ISSN: 1749-5016 Impact factor: 3.436
Table 1. Descriptive statistics for the ASD (n = 16) and the control group (n = 16) and group comparisons. Each participant in the control group was matched with respect to chronological age, gender, intelligence quotient (IQ), and handedness to the profile of one ASD participant.
| | ASD M | ASD SD | Controls M | Controls SD | P |
|---|---|---|---|---|---|
| Gender | 13 males, 3 females | | 13 males, 3 females | | |
| Handedness (a) | 14 right, 2 left | | 14 right, 2 left | | |
| Age | 33.75 | 10.12 | 33.69 | 9.58 | 0.986 |
| Age range | 20–51 | | 18–52 | | |
| WAIS (b) | | | | | |
| Full-scale IQ | 110.31 | 13.79 | 111.50 | 10.97 | 0.789 |
| Verbal IQ | 110.75 | 12.35 | 108.75 | 12.59 | 0.653 |
| Performance IQ | 107.38 | 17.55 | 112.69 | 9.59 | 0.296 |
| Working memory | 108.63 | 2.22 | 108.00 | 3.76 | 0.887 |
| Concentration (c) | 104.19 | 8.61 | 106.06 | 3.41 | 0.645 |
| AQ (d) | 39.81 | 6.61 | 14.13 | 4.77 | <0.001* |
| AQ range | 26–48 | | 5–23 | | |
(a) Handedness was assessed using the Edinburgh handedness questionnaire (Oldfield, 1971).
(b) WAIS = Wechsler Adult Intelligence Scale (Wechsler, 1997; German adapted version: von Aster; M = 100; SD = 10).
(c) Concentration = d2 Test of Attention (Brickenkamp, 2002; M = 100; SD = 10).
(d) AQ = Autism Spectrum Quotient (Baron-Cohen).
*Significant group difference (P < 0.05). M = mean; SD = standard deviation.
Fig. 1. Experimental design of the two fMRI experiments. (A) In the vocal sound experiment, participants listened to blocks of vocal sounds (V), non-vocal sounds (NV), and silence (white boxes). One brain volume was acquired after each block. (B) In the voice identity recognition experiment, there were two conditions. In one condition, participants had to recognise who was speaking (voice identity task). In the other condition, participants had to recognise what was said (speech task). Stimuli consisted of blocks of 13 auditory sentences. At the beginning of each block, a keyword (‘Speaker’, ‘Speech’) on the screen instructed the participants to perform the voice identity or the speech task. Scans were acquired continuously. (C) Example trials of the voice identity recognition experiment: participants decided for each sentence whether it was spoken by the target speaker (voice identity task) or whether it matched the content of the target sentence (speech task). Stimuli in the voice identity and speech task blocks were the same.
Fig. 2. Behavioural results from the two fMRI experiments (see also Table 2 and Supplementary Table S3). (A) Performance accuracy in the voice identity recognition experiment: the ASD group performed significantly worse than the control group in the voice identity task. There were no significant differences between the ASD and the control group in the speech task. (B) Total number of recalled sounds after the vocal sound experiment. The ASD group recalled significantly fewer non-vocal and vocal sounds than the control group. The vocal sound condition contained both speech and non-speech sounds. The ASD group recalled a comparable number of speech sounds but fewer non-speech sounds than the control group. Error bars represent ± 1 SE; *P < 0.05; **P < 0.005; n.s. = not significant.
Table 2. Summary of average scores for all experiments. Scores are summarised as group means (M) with standard deviations (SD) and p-values from independent t-tests. All analyses include data from 16 ASD participants and their 16 pairwise matched control participants.
| | ASD M | ASD SD | Controls M | Controls SD | P |
|---|---|---|---|---|---|
| Voice tests | | | | | |
| Voice identity recognition experiment (recognition accuracy %) | | | | | |
| Voice identity task | 76.36 | 11.61 | 87.36 | 7.15 | 0.003* |
| Speech task | 89.41 | 9.28 | 91.75 | 9.06 | 0.475 |
| Vocal sound experiment (number of recalled sounds) | | | | | |
| Total | 11.88 | 6.25 | 18.13 | 7.20 | 0.014* |
| Vocal sounds | 5.38 | 2.16 | 8.25 | 4.12 | 0.019* |
| Vocal sounds: speech | 2.13 | 1.71 | 2.88 | 1.82 | 0.239 |
| Vocal sounds: non-speech | 3.25 | 1.73 | 5.38 | 3.10 | 0.023* |
| Non-vocal sounds | 6.50 | 4.93 | 9.88 | 4.27 | 0.047* |
| Non-vocal sounds: nature | 1.19 | 1.64 | 1.44 | 1.75 | 0.680 |
| Non-vocal sounds: animals | 2.31 | 2.70 | 3.31 | 2.21 | 0.261 |
| Non-vocal sounds: modern environment | 2.31 | 1.30 | 4.31 | 2.12 | 0.003* |
| Non-vocal sounds: musical instruments | 0.69 | 0.95 | 0.81 | 0.98 | 0.716 |
*Significant group differences (P < 0.05).
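The group comparisons in the summary table of average scores can be approximately re-derived from the reported means, standard deviations, and group sizes alone. The sketch below is an illustration, not the authors' analysis code: it assumes a pooled-variance (Student) independent-samples t-test with n = 16 per group, since the source states only "independent t-tests", and the function names (`t_pdf`, `two_sided_p`, `pooled_t_test`) are hypothetical helpers introduced here. The two-sided p-value is obtained by numerically integrating the t density with Simpson's rule, so only the Python standard library is needed.

```python
import math


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)


def two_sided_p(t, df, steps=20000):
    """Two-sided p-value, integrating the density on [0, |t|] with Simpson's rule."""
    t = abs(t)
    if t == 0:
        return 1.0
    h = t / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(t, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3  # P(0 < T < |t|)
    return 1 - 2 * area


def pooled_t_test(m1, sd1, n1, m2, sd2, n2):
    """Student's t-test from summary statistics (assumes equal variances)."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    t = (m1 - m2) / se
    return t, df, two_sided_p(t, df)


# Voice identity task accuracy (%): ASD vs. controls, values taken from the table.
t, df, p = pooled_t_test(76.36, 11.61, 16, 87.36, 7.15, 16)
print(f"voice identity: t({df}) = {t:.2f}, p = {p:.3f}")

# Speech task accuracy (%): ASD vs. controls.
t, df, p = pooled_t_test(89.41, 9.28, 16, 91.75, 9.06, 16)
print(f"speech: t({df}) = {t:.2f}, p = {p:.3f}")
```

Because the published means and standard deviations are rounded to two decimals, the recomputed p-values should be close to, but need not exactly equal, the tabulated ones. With equal group sizes, a Welch test would give the same t statistic and differ only in its degrees of freedom.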
Fig. 3. Vocal sound experiment. (A) Contrast vocal sounds > silence baseline. The control group as well as the ASD group showed BOLD responses along the STS/STG when listening to vocal sounds. The figure displays results for the right STS/STG; for the left STS/STG, see Supplementary Table S2. (B) Contrast vocal > non-vocal sounds. Both groups showed enhanced BOLD responses in the right STS/STG when listening to vocal, compared to non-vocal, sounds. The results are displayed at the threshold of P = 0.05 FWE corrected for the right STS/STG. They are overlaid onto a group-specific average of normalised T1-weighted structural images. Colour bars represent t-values.
Fig. 4. Voice identity recognition experiment. (A) Contrast voice identity task > silence baseline. The control group as well as the ASD group showed BOLD responses along the right STS/STG when the task was to recognise voice identity. (B) Contrast voice identity task > speech task. The control group showed greater BOLD responses when recognising voice identity than when recognising speech. In the right posterior STS/STG, these responses were higher in the control group than in the ASD group. (C) In the control group, but not in the ASD group, responses in the right STS/STG to voice identity recognition correlated positively with performance in voice identity recognition. This correlation was stronger in the anterior STS/STG in the control group than in the ASD group. Results are presented for the right STS/STG and overlaid onto a group-specific average of normalised T1-weighted structural images. The results are significant at P = 0.05 FWE corrected for the ROI. For display purposes only, a threshold of P = 0.01 uncorrected was used. Colour bars represent t-values.
Table 3. Coordinates for significant BOLD responses in the voice recognition experiment (P < 0.05 FWE-corrected at peak level for the region of interest).
| Right STS/STG | x | y | z | Peak statistic | Cluster size | x | y | z | Peak statistic | Cluster size |
|---|---|---|---|---|---|---|---|---|---|---|
| Voice identity task | | | | | | | | | | |
| | Controls | | | | | ASD | | | | |
| Anterior | 54 | 11 | −11 | 3.74 | 49 | 57 | 8 | −5 | 3.11 | 81 |
| Posterior | 54 | −19 | 1 | 5.68 | 342 | 66 | −25 | 10 | 6.42 | 342 |
| | Controls > ASD | | | | | ASD > Controls | | | | |
| Anterior/posterior | – | | | | | – | | | | |
| Voice identity task > speech task | | | | | | | | | | |
| | Controls | | | | | ASD | | | | |
| Anterior | – | | | | | – | | | | |
| Posterior | 54 | −22 | −2 | 3.35 | 332 | – | | | | |
| | Controls > ASD | | | | | ASD > Controls | | | | |
| Anterior | – | | | | | – | | | | |
| Posterior | 51 | −19 | −2 | 3.63 | 332 | – | | | | |
| Correlation voice identity with task performance | | | | | | | | | | |
| | Controls | | | | | ASD | | | | |
| Anterior | 54 | 11 | −17 | 3.31 | 49 | – | | | | |
| Posterior | 48 | −34 | 7 | 3.63 | 332 | – | | | | |
| | Controls > ASD | | | | | ASD > Controls | | | | |
| Anterior | 54 | 11 | −14 | 3.51 | 49 | – | | | | |
| Posterior | – | | | | | – | | | | |
Coordinates represent local activation maxima in MNI space (in mm). Cluster size represents the number of voxels within a cluster.