Giuseppe Di Dona, Michele Scaltritti, Simone Sulpizio.
Abstract
The present study investigated whether listeners can form abstract voice representations while ignoring constantly changing phonological information and whether they can use the resulting information to facilitate voice change detection. Further, the study aimed to determine whether the use of abstraction is restricted to the speech domain or can also be deployed in non-speech contexts. We ran an electroencephalogram (EEG) experiment including one passive and one active oddball task, each featuring a speech and a rotated speech condition. In the speech condition, participants heard constantly changing vowels uttered by a male speaker (standard stimuli), which were infrequently replaced by vowels uttered by a female speaker with higher pitch (deviant stimuli). In the rotated speech condition, participants heard rotated vowels, in which the natural formant structure of speech was disrupted. In the passive task, the mismatch negativity was elicited after the presentation of the deviant voice in both conditions, indicating that listeners could successfully group together different stimuli into a formant-invariant voice representation. In the active task, participants showed shorter reaction times (RTs), higher accuracy and a larger P3b in the speech condition than in the rotated speech condition. Results showed that whereas at a pre-attentive level the cognitive system can track pitch regularities while presumably ignoring constantly changing formant information both in speech and in rotated speech, at an attentive level the use of such information is facilitated for speech. This facilitation was also reflected in a stronger synchronisation in the theta band (4-7 Hz), potentially pointing towards differences in encoding/retrieval processes.
Keywords: MMN; P3b; Theta; speech perception; voice representation
Year: 2022 PMID: 35673798 PMCID: PMC9545905 DOI: 10.1111/ejn.15730
Source DB: PubMed Journal: Eur J Neurosci ISSN: 0953-816X Impact factor: 3.698
Pitch (F0), first and second formant (F1, F2) values of the experimental stimuli for each talker and each condition
| Talker's sex | Vowel | Speech F0 | Speech F1 | Speech F2 | Rotated speech F0 | Rotated speech F1 | Rotated speech F2 |
|---|---|---|---|---|---|---|---|
| Male | a | 121 Hz | 816 Hz | 1252 Hz | 121 Hz | 768 Hz | 1623 Hz |
| Male | e | 121 Hz | 384 Hz | 2141 Hz | 121 Hz | 653 Hz | 1360 Hz |
| Male | i | 121 Hz | 360 Hz | 2039 Hz | 121 Hz | 795 Hz | 1402 Hz |
| Male | ɔ | 121 Hz | 561 Hz | 862 Hz | 121 Hz | 772 Hz | 1007 Hz |
| Male | ɛ | 121 Hz | 571 Hz | 1782 Hz | 121 Hz | 1049 Hz | 1717 Hz |
| Female | a | 184 Hz | 981 Hz | 1469 Hz | 184 Hz | 1269 Hz | 2081 Hz |
| Female | e | 184 Hz | 368 Hz | 1698 Hz | 184 Hz | 803 Hz | 1332 Hz |
| Female | i | 184 Hz | 329 Hz | 1209 Hz | 184 Hz | 780 Hz | 1113 Hz |
| Female | ɔ | 184 Hz | 733 Hz | 1169 Hz | 184 Hz | 964 Hz | 1976 Hz |
| Female | ɛ | 184 Hz | 695 Hz | 1599 Hz | 184 Hz | 934 Hz | 1675 Hz |
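In the table above, F0 is constant within each talker (121 Hz for the male voice, 184 Hz for the female voice) while F1 and F2 vary from vowel to vowel. As a rough illustration of how large the pitch step between the two voices is, the F0 ratio can be expressed in semitones. The conversion to semitones and the helper name `semitone_interval` are illustrative assumptions and are not part of the reported analyses.

```python
import math


def semitone_interval(f_low_hz: float, f_high_hz: float) -> float:
    """Musical interval between two frequencies, in equal-tempered semitones.

    Hypothetical helper for illustration; the source reports F0 only in Hz.
    """
    return 12.0 * math.log2(f_high_hz / f_low_hz)


# F0 values taken from the stimulus table (identical in both conditions).
MALE_F0_HZ = 121.0
FEMALE_F0_HZ = 184.0

f0_step = semitone_interval(MALE_F0_HZ, FEMALE_F0_HZ)
print(f"Female F0 is {f0_step:.2f} semitones above male F0")
```

Under this conversion the female voice sits a little over seven semitones above the male voice, in both the speech and the rotated speech condition.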
FIGURE 1 Behavioural results of the active oddball task. (a) Proportion of correct responses broken down by condition (first column) and by probability (second column). (b) Reaction times of correct responses to deviant events only. Error bars represent the SE, and grey points represent individual observations. For illustrative purposes, only the relevant portion of the y‐axis is shown in both plots (dashed lines indicate the discontinuity of the axis).
FIGURE 2 Event‐related potential (ERP) results. (a) Passive oddball task. The first column displays the ERPs for control (dotted lines), deviant (dashed lines) and differential waveforms (continuous lines) at a representative channel (Fz) for the speech (blue lines) and the rotated speech condition (red lines). The grey rectangles indicate the time window used in the analyses (mismatch negativity [MMN], first row; late discriminative negativity [LDN], second row). In the subsequent columns, topographies show the spatial distribution of the MMN (first row) and LDN (second row) in the time windows where significant differences emerged. The last column represents the voltage difference between conditions, calculated by subtracting the differential waveforms in the rotated speech condition from the ones calculated in the speech condition. Electrodes that were included in the clusters for more than 50% of the samples within the cluster time windows (reported below the topographies) are represented by black asterisk marks superimposed on the maps. (b) Active oddball task. The first column represents the ERPs for standard (dotted lines), deviant (dashed lines) and differential waveforms (continuous lines) at a representative channel (CPz) for the speech (blue lines) and the rotated speech condition (red lines). In the subsequent columns, topographies show the spatial distribution of the differential P300 waveforms, calculated by subtracting the standard ERP from the deviant ERP in the time windows where significant differences emerged for each condition. The last column represents the voltage difference between conditions, calculated by subtracting the differential waveforms in the rotated speech condition from the ones calculated in the speech condition. Electrodes are marked as in (a).
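The subtractions described in the Figure 2 caption can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' pipeline: ERPs are assumed to be stored as arrays of shape (channels, samples), the cluster is assumed to be available as a boolean mask restricted to its time window, and all function names are hypothetical.

```python
import numpy as np


def differential_waveform(deviant_erp, reference_erp):
    """Deviant ERP minus reference ERP (control or standard), per channel and sample."""
    return np.asarray(deviant_erp, dtype=float) - np.asarray(reference_erp, dtype=float)


def condition_difference(diff_speech, diff_rotated):
    """Speech differential waveform minus rotated speech differential waveform."""
    return np.asarray(diff_speech, dtype=float) - np.asarray(diff_rotated, dtype=float)


def channels_to_mark(cluster_mask, threshold=0.5):
    """Channels that belong to the cluster in more than `threshold` of its samples.

    cluster_mask: boolean array (channels, samples), already restricted to the
    cluster time window (an assumption about how the mask is stored).
    """
    fraction_in_cluster = np.asarray(cluster_mask, dtype=bool).mean(axis=1)
    return fraction_in_cluster > threshold


# Toy data: 2 channels x 2 samples (arbitrary values, not real recordings).
deviant = np.array([[1.0, 2.0], [3.0, 4.0]])
standard = np.array([[0.5, 0.5], [1.0, 1.0]])
diff = differential_waveform(deviant, standard)

# Toy cluster mask: 3 channels x 3 samples.
mask = np.array([[True, True, False],
                 [True, False, False],
                 [True, True, True]])
marked = channels_to_mark(mask)
```

In the toy mask, the first and third channels are in the cluster for more than half of the samples and would receive an asterisk; the second channel would not.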
FIGURE 3 Time‐frequency results for the passive (first row) and the active (second row) oddball tasks. The time‐frequency power spectra show the power modulations (% change) characterising the differential event‐related spectral perturbations (ERSPs) for each condition (first and second columns) as well as the difference between them, corresponding to the interaction effect (third column). Spectra were obtained by averaging activity for the electrodes F5, F3, F1, Fz, F2, F4, F6, FC5, FC3, FC1, FCz, FC2, FC4, FC6, C5, C3, C1, Cz, C2, C4, C6, CP5, CP3, CP1, CPz, CP2, CP4, CP6, P5, P3, P1, Pz, P2, P4, P6, PO5, PO3, PO1, POz, PO2, PO4, PO6. In the power spectra plots, black squares represent the temporal distribution of the significant clusters within theta (4–7 Hz) and beta (13–30 Hz) bands. The mean number of channels included in each cluster represented in the power spectra was calculated across all time samples, and only the time bins including at least half of the mean number of channels are enclosed in black squares. Topographies in the lower and higher row show the spatial distribution of theta and beta event‐related desynchronisations (ERDs)/event‐related synchronisations (ERSs) characterising the differential ERSPs for each condition (first and second columns) as well as the difference between them, corresponding to the interaction effect (third column). Electrodes that were included in the clusters for more than 50% of the samples within the cluster time windows (reported below each topography) are represented by black asterisk marks superimposed on the maps. Black squares on topographies represent the channels that were included in the averaged spectral plots.
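Two quantities in the Figure 3 caption lend themselves to a short sketch: power expressed as % change, and the rule that decides which time bins are enclosed in black squares. The caption does not state the baseline used for the % change, so the baseline argument below is an assumption, as is reading "across all time samples" as the samples covered by the supplied cluster mask. Function names are hypothetical.

```python
import numpy as np


def percent_change(power, baseline_power):
    """Power modulation as % change relative to a baseline (baseline choice is assumed)."""
    power = np.asarray(power, dtype=float)
    return 100.0 * (power - baseline_power) / baseline_power


def time_bins_to_outline(cluster_mask):
    """Time bins containing at least half of the mean number of cluster channels.

    cluster_mask: boolean array (channels, time_bins) marking cluster membership.
    """
    channels_per_bin = np.asarray(cluster_mask, dtype=bool).sum(axis=0)
    mean_channels = channels_per_bin.mean()
    return channels_per_bin >= mean_channels / 2.0


# Toy example: power 50% above and 50% below a baseline of 1.0.
modulation = percent_change([1.5, 0.5], 1.0)

# Toy cluster mask: 4 channels x 4 time bins, with 0, 1, 4 and 3 channels per bin.
mask = np.array([[False, True, True, True],
                 [False, False, True, True],
                 [False, False, True, True],
                 [False, False, True, False]])
outlined = time_bins_to_outline(mask)
```

In the toy mask the mean is two channels per bin, so every bin with at least one cluster channel is outlined and the empty first bin is not.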