| Literature DB >> 24349399 |
Kevin R Sitek1, Daniel H Mathalon2, Brian J Roach1, John F Houde3, Caroline A Niziolek3, Judith M Ford2.
Abstract
As we talk, we unconsciously adjust our speech to ensure it sounds the way we intend it to sound. However, because speech production involves complex motor planning and execution, no two utterances of the same sound will be exactly the same. Here, we show that auditory cortex is sensitive to natural variations in self-produced speech from utterance to utterance. We recorded event-related potentials (ERPs) from ninety-nine subjects while they uttered "ah" and while they listened to those speech sounds played back. Subjects' utterances were sorted based on their formant deviations from the previous utterance. Typically, the N1 ERP component is suppressed during talking compared to listening. By comparing ERPs to the least and most variable utterances, we found that N1 was less suppressed to utterances that differed greatly from their preceding neighbors. In contrast, an utterance's difference from the median formant values did not affect N1. Trial-to-trial pitch (f0) deviation and pitch difference from the median similarly did not affect N1. We discuss mechanisms that may underlie the change in N1 suppression resulting from trial-to-trial formant change. Deviant utterances require additional auditory cortical processing, suggesting that speaking-induced suppression mechanisms are optimally tuned for a specific production.Entities:
Mesh:
Year: 2013 PMID: 24349399 PMCID: PMC3862760 DOI: 10.1371/journal.pone.0082925
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
ANOVA results for the N1 ERP, with trials binned by trial-to-trial (lag 1) formant Euclidean distance.
| Measure | df | F | sig. |
| Anterior-Posterior (AP) | 1.249 | 49.775 | <0.001 |
| Laterality (Lat) | 2.627 | 19.481 | <0.001 |
| Condition (Cond) | 1 | 12.359 | 0.001 |
| Cond * Lat | 2.812 | 4.789 | 0.004 |
| Cond * Consistency | 1.971 | 3.091 | 0.048 |
| Near versus Far contrast | 1 | 6.23 | 0.014 |
| Cond * AP | 1.262 | 3.025 | 0.075 |
| Cond * Lat * Consistency * AP | 10.995 | 1.088 | 0.367 |
| Cond * Consistency * AP | 2.523 | 0.967 | 0.398 |
| Cond * Lat * Consistency | 4.517 | 0.888 | 0.481 |
| Consistency | 1.952 | 0.61 | 0.54 |
AP (anterior-posterior) and Lat (laterality) reflect electrode location and include frontal, frontal-central, and central AP electrode bands from five lateral bands including and around the midline. We found that Cond (Condition: Talk and Playback) interacted with Consistency (an utterance's formant similarity to the previous utterance: Near, Mid, and Far) to have a significant effect on N1 size. Summary of repeated measures analysis of variance for the N1 event-related potential component, using Greenhouse-Geisser correction for sphericity when appropriate.
Figure 1Relationship between trial-to-trial formant change and N1 suppression.
Examples of trial sorting for one subject, a forward model, and the average event-related potentials (ERPs) of similar and dissimilar trials across all subjects. (a) We estimated each “ah” utterance's first two formant frequencies to find the Euclidean distance between each “ah” and its preceding neighbor. We grouped these trials into thirds: Near, Mid, and Far (referring to their formant similarity to the preceding utterance) and trimmed-mean averaged each grouping together. (b) Grand average ERPs from 99 subjects, recorded from the midline frontal site (Fz), for Talk and Playback conditions for Near and Far trials. Data points with significant Talk Near versus Far differences boxed and include the N1 ERP component (paired t-test, two-tailed, p<0.05). (c) Proposed framework for our findings. Premotor cortex sends an efference copy of a planned action to auditory cortex, where a corollary discharge is formed that represents the expected sensory consequences of the planned speech act. The actual percept is then compared to the predicted percept. Mismatch between predicted and actual percepts may be responsible for reduced suppression during Talk (from Mathalon et al. 2008)[35]. (d) Average N1 amplitude across all subjects and fifteen frontal-central electrodes with standard error bars. Talk versus Playback N1 effect at all trial groupings represents N1 suppression (p = .001). We found an overall trial Consistency effect on N1 suppression (p = .048) as well as a greater Near versus Far effect on SIS (p = .014), showing decreased N1 suppression when an utterance varies highly from its previous neighbor.
Figure 2Cohen's d effect sizes across lags.
Cohen's d effect sizes of Talk minus Playback N1 event-related potential (ERP) suppression for Near versus Far trials when compared in formant space to the most recent trial (Lag 1), the second-most recent trial (Lag 2), etc., through Lag 5. Although only Lag 1 Near vs. Far formant change had a significant effect on N1 suppression, there is a linear relationship between lag and Near vs. Far N1 suppression effect size (p = .023).