| Literature DB >> 22844469 |
Marianne Latinus1, Pascal Belin.
Abstract
Humans can identify individuals from their voice, suggesting the existence of a perceptual representation of voice identity. We used perceptual aftereffects--shifts in perceived stimulus quality after brief exposure to a repeated adaptor stimulus--to further investigate the representation of voice identity in two experiments. Healthy adult listeners were familiarized with several voices until they reached a recognition criterion. They were then tested on identification tasks that used vowel stimuli generated by morphing between the different identities, presented either in isolation (baseline) or following short exposure to different types of voice adaptors (adaptation). Experiment 1 showed that adaptation to a given voice induced categorization shifts away from that adaptor's identity even when the adaptors consisted of vowels different from the probe stimuli. Moreover, original voices and caricatures resulted in comparable aftereffects, ruling out an explanation of identity aftereffects in terms of adaptation to low-level features. In Experiment 2, we show that adaptors with a disrupted configuration, i.e., altered fundamental frequency or formant frequencies, failed to produce perceptual aftereffects showing the importance of the preserved configuration of these acoustical cues in the representation of voices. These two experiments indicate a high-level, dynamic representation of voice identity based on the combination of several lower-level acoustical features into a specific voice configuration.Entities:
Mesh:
Year: 2012 PMID: 22844469 PMCID: PMC3402520 DOI: 10.1371/journal.pone.0041384
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Experiment 1.
Results and Acoustical measures (A). Effects of adaptation on voice identification. Shaded areas indicate the bootstrapped 95% confidence intervals (CI95%) of the fit to performance after adaptation to voice A and B. Bottom panel: Spectrogram of one vowel continuum (/a/) from voice A to voice B. x-axis: time (0–670 ms), y-axis: frequency (0–6000 Hz). (B) Fundamental frequency. (C) First formant frequency of the different vowels. Black circles indicate stimuli for the A–B continuum for the vowel/u/. Purple triangle: voice AB adaptors. Pink: voice A (star) or caricature of voice A (triangle) adaptors. Blue: voice B (star) or caricature (triangle) adaptors. (D, E). Statistical analysis of Experiment 1. CI95% (shaded area) of the differences in correct recognition performance (line) for each pair of conditions. x-axis: morph steps of the probe stimulus between voices A and B. Morph steps for which the y = 0 line is not contained in the shaded area represent significant performance differences between adaptation conditions. (D) Adaptation to voice A, voice B versus baseline condition. Note that adaptation to voice A and voice B leads to a perceptual change mostly in the identity-ambiguous region (30 to 70%) of the continuum. (E) Adaptation to voice A versus adaptation to caricature A and adaptation to voice B versus adaptation to caricature B. Note that in the identity-ambiguous region of the continuum no differences are seen between adaptation to original voices and caricatures.
Figure 2Adaptation to formant and fundamental frequencies.
(A) Aftereffects induced with the fundamental frequency adaptor (f0A/formantAB). Right panel: statistical analysis, differences between each adaptation condition and baseline (i.e., no adaptation). Shaded area: 95% confidence interval. (B) Aftereffects induced with the formant adaptor (f0AB/formantA). Right panel: statistical analysis, differences between each adaptation condition and baseline (i.e., no adaptation). Shaded area: 95% confidence interval.