| Literature DB >> 29228111 |
Claudia Roswandowitz1,2, Claudia Kappes1, Hellmuth Obrig1,3, Katharina von Kriegstein1,4,5.
Abstract
Recognizing the identity of others by their voice is an important skill for social interactions. To date, it remains controversial which parts of the brain are critical structures for this skill. Based on neuroimaging findings, standard models of person-identity recognition suggest that the right temporal lobe is the hub for voice-identity recognition. Neuropsychological case studies, however, reported selective deficits of voice-identity recognition in patients predominantly with right inferior parietal lobe lesions. Here, our aim was to work towards resolving the discrepancy between neuroimaging studies and neuropsychological case studies to find out which brain structures are critical for voice-identity recognition in humans. We performed a voxel-based lesion-behaviour mapping study in a cohort of patients (n = 58) with unilateral focal brain lesions. The study included a comprehensive behavioural test battery on voice-identity recognition of newly learned (voice-name, voice-face association learning) and familiar voices (famous voice recognition) as well as visual (face-identity recognition) and acoustic control tests (vocal-pitch and vocal-timbre discrimination). The study also comprised clinically established tests (neuropsychological assessment, audiometry) and high-resolution structural brain images. The three key findings were: (i) a strong association between voice-identity recognition performance and right posterior/mid temporal and right inferior parietal lobe lesions; (ii) a selective association between right posterior/mid temporal lobe lesions and voice-identity recognition performance when face-identity recognition performance was factored out; and (iii) an association of right inferior parietal lobe lesions with tasks requiring the association between voices and faces but not voices and names. The results imply that the right posterior/mid temporal lobe is an obligatory structure for voice-identity recognition, while the inferior parietal lobe is only a facultative component of voice-identity recognition in situations where additional face-identity processing is required.Entities:
Keywords: inferior parietal lobe; posterior/mid temporal lobe; voice-identity recognition; voxel-based lesion-behaviour mapping
Mesh:
Year: 2018 PMID: 29228111 PMCID: PMC5837691 DOI: 10.1093/brain/awx313
Source DB: PubMed Journal: Brain ISSN: 0006-8950 Impact factor: 13.501
Figure 1Schematic model on voice-identity processing and overview of potential neuroanatomical representations of voice-identity recognition. (A) Voice-identity processing is conceived as a multistage process, which comprises (i) perceptual voice-identity analysis; (ii) voice-identity recognition (feeling of familiarity for a voice); and (iii) an extended system that attributes meaning to a voice by, for instance, associating semantic information to it (Ellis ; Neuner and Schweinberger, 2000; Belin ; Blank ; Perrodin ). (B) Divergent findings on the neural representation of voice-identity recognition. Most lesion studies suggest that the right inferior parietal lobe is critical for voice-identity recognition while neuroimaging studies agree that voice-identity recognition relies on the temporal lobe.
Demographic details, neuropsychological measures, and behavioural measures
| Mean | Median | SD | Range | ||
|---|---|---|---|---|---|
| Age | 58 | 47.95 | 50.00 | 11.59 | 46.00 |
| Time since onset (month) | 57 | 46.02 | 30.00 | 51.51 | 271.00 |
| Education (years) | 58 | 10.50 | 10.00 | 1.23 | 4.00 |
| Hearing level (dB) | 58 | 18.11 | 16.25 | 8.45 | 35.84 |
| Digit span (PR) | 58 | 32.15 | 29.00 | 22.16 | 88.00 |
| Spatial span (PR) | 58 | 39.35 | 30.00 | 24.95 | 90.50 |
| TAP (alertness) (PR) | 54 | 32.77 | 30.00 | 23.64 | 85.00 |
| German Vocabulary Test (PR) | 54 | 54.51 | 54.00 | 20.61 | 83.80 |
| Face-name learning | 53 | ||||
| Cued + sum (PR) | 61.34 | 66.50 | 34.36 | 100.00 | |
| Memory (%) | 87.56 | 100.00 | 22.04 | 117.00 | |
| Newly-learned voice recognition | |||||
| Newly-learned voice tests (%) | 43 | 59.97 | 58.34 | 10.30 | 41.22 |
| Voice-name test (%) | 58 | 56.14 | 53.34 | 12.91 | 58.33 |
| Voice-face test (%) | 43 | 64.08 | 62.00 | 11.66 | 48.00 |
| Familiar voice recognition | |||||
| Famous voice test (z) | 57 | 0.00 | 0.07 | 0.70 | 3.03 |
| Familiarity decision (d′) | 57 | 1.11 | 1.23 | 0.84 | 5.32 |
| Semantic association (%) | 57 | 69.60 | 72.00 | 18.81 | 75.00 |
| Acoustical control tests | |||||
| Vocal-pitch test (cent) | 58 | 125.17 | 125.20 | 66.05 | 231.68 |
| Vocal-timbre test (SER) | 55 | 9.73 | 10.80 | 4.62 | 16.15 |
| Visual control tests | |||||
| CFMT (%) | 57 | 66.79 | 69.44 | 15.52 | 58.34 |
| Face-name test (%) | 43 | 60.54 | 62.67 | 19.15 | 76.00 |
The table displays mean scores, median scores, standard deviations (SD), range of scores on demographical, neuropsychological measures, and on each behavioural test respectively for all patients who completed the test (n). Hearing levels were averaged over both ears. The newly-learned voice test score is a composite score of the voice-name and voice-face test. The famous voice test score is a composite score of the familiarity decision and semantic association subscores. PR = percentage rank; SER = spatial envelope ratio; TAP = Test of Attentional Performance.
Figure 2Lesion overlay and region of interest map. (A) Lesion overlay map of all patients over the whole brain. Left hemisphere is left on coronal slice. The heatmap displays the number of patients with lesions in that respective area. Coordinates refer to MNI space. (B) Region of interest map. Anatomical region of interest map covered the bilateral temporal lobe (cyan) and the right inferior parietal lobe (turquoise). (C) Lesion overlay map of all patients within the region of interest map. Areas covered by the heatmaps are lesioned in at least three patients.
Figure 3Lesions in the temporal and inferior parietal lobe associated with difficulties in voice-identity recognition. (A) VLSM results of the composite score of the newly-learned and familiar voice score (n = 42). (B) VLSM results for the same score as shown in A, but controlled for performance in face-identity recognition (composite score of CFMT and face-name test, n = 40). (C) VLSM results of the voice-face test (n = 43). (D) VLSM results of the voice-name test (n = 58). VLSM results that are controlled for hearing level and lesion size are overlaid as dark red surfaces on each image. The analyses were restricted to the bilateral temporal lobe and right inferior parietal lobe. For visualization, voxels are shown at P < 0.01 uncorrected.
Overview of lesion-cluster coordinates associated with voice-identity recognition (P < 0.05 cluster-size corrected, 1000 permutations)
| Region | Max T | Effect size | Voxels, | Max T | Effect size | Voxels, | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Planum temporale | 57 | −34 | 16 | 3.96 | 0.005 | 1.22 | 12 460 | |||||||
| Posterior STG | 56 | −37 | 13 | 3.51 | 0.004 | 1.08 | 15 656 | |||||||
| Planum temporale | 59 | −30 | 12 | 3.38 | 0.038 | 1.07 | 5442 | 59 | −30 | 12 | 3.26 | 0.038 | 1.03 | 5442 |
| Parietal operculum cortex | 59 | −33 | 32 | 3.36 | 0.003 | 1.03 | 6921 | 57 | −34 | 28 | 3.30 | 0.013 | 1.00 | 8499 |
| Posterior MTG | 62 | −31 | 0 | 3.03 | 0.030 | 0.80 | 3532 | 60 | −35 | −1 | 3.73 | 0.023 | 0.98 | 8176 |
Centre of mass coordinates are reported in MNI space (in mm). Anatomical labels according to the Harvard-Oxford Cortical and Subcortical Structural Atlas (Desikan ) provided by the FSL Anatomy Toolbox (Smith ). Voxels, n = number of voxels; x, y, z = MNI coordinates.
Figure 4Lesions associated with identity recognition of newly-learned and familiar voices. (A) VLSM results of the newly-learned voice score (composite score of voice-name and voice-face test, n = 43 patients). VLSM results controlled for hearing level and lesion size are overlaid as dark red surfaces. (B) VLSM results of the familiar voice score (composite score of the familiarity decision and semantic association score of the famous voice test, n = 57 patients). For visualization, voxels are shown at P < 0.01 uncorrected. The analyses were restricted to the bilateral temporal lobe and the right inferior parietal lobe.
Overview of lesion-cluster coordinates associated with newly-learned and familiar voice-identity recognition (P < 0.05 cluster-size corrected, 1000 permutations)
| Region | Max T | Effect size, | Voxels, | Max T | Effect size, | Voxels, | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Posterior SMG | 58 | −37 | 32 | 3.25 | 0.002 | 1.00 | 1971 | 57 | −39 | 36 | 3.02 | 0.031 | 0.92 | 2133 | ||||||||
| Posterior STG | 62 | −27 | 3 | 2.69 | 0.003 | 0.82 | 1116 | |||||||||||||||
| Posterior MTG | 48 | −37 | −3 | 2.62 | 0.018 | 0.80 | 90 | 58 | −31 | −5 | 3.24 | 0.019 | 0.99 | 4087 | ||||||||
| Planum temporale | 59 | −17 | 8 | 2.77 | 0.023 | 0.85 | 59 | |||||||||||||||
| Heschl’s gyrus (Te 1.1, Te 1.0) | 44 | −23 | 10 | 2.50 | 0.036 | 0.76 | 17 | |||||||||||||||
| Posterior MTG | −55 | −33 | −8 | 2.99 | 0.04 | 0.79 | 4447 | – | – | – | – | – | – | – | ||||||||
Centre of mass coordinates are reported in MNI space (in mm). Anatomical labels according to the Harvard-Oxford Cortical and Subcortical Structural Atlas (Desikan ) provided by the FSL Anatomy Toolbox (Smith ). Voxels, n = number of voxels; x, y, z = MNI coordinates.
Figure 5Overview of lesions associated with decreased voice-identity recognition performance within the temporal (cyan map) and inferior parietal lobe (turquoise map). Specific voice-identity recognition (dark blue surface) depicts VLSM results of the composite score of newly-learned and familiar voice score when controlled for face-identity recognition. Voice-face association (ochre surface) depicts the VLSM results of the voice-face test.