| Literature DB >> 32802406 |
Matthew K Leonard1, Bharath Chandrasekaran2, Fernando Llanos2, Jacie R McHaney2, William L Schuerman1, Han G Yi1.
Abstract
Adults struggle to learn non-native speech contrasts even after years of exposure. While laboratory-based training approaches yield learning, the optimal training conditions for maximizing speech learning in adulthood are currently unknown. Vagus nerve stimulation has been shown to prime adult sensory-perceptual systems towards plasticity in animal models. Precise temporal pairing with auditory stimuli can enhance auditory cortical representations with a high degree of specificity. Here, we examined whether sub-perceptual threshold transcutaneous vagus nerve stimulation (tVNS), paired with non-native speech sounds, enhances speech category learning in adults. Twenty-four native English-speakers were trained to identify non-native Mandarin tone categories. Across two groups, tVNS was paired with the tone categories that were easier- or harder-to-learn. A control group received no stimulation but followed an identical thresholding procedure as the intervention groups. We found that tVNS robustly enhanced speech category learning and retention of correct stimulus-response associations, but only when stimulation was paired with the easier-to-learn categories. This effect emerged rapidly, generalized to new exemplars, and was qualitatively different from the normal individual variability observed in hundreds of learners who have performed in the same task without stimulation. Electroencephalography recorded before and after training indicated no evidence of tVNS-induced changes in the sensory representation of auditory stimuli. These results suggest that paired-tVNS induces a temporally precise neuromodulatory signal that selectively enhances the perception and memory consolidation of perceptually salient categories.Entities:
Keywords: Human behaviour; Peripheral nervous system
Year: 2020 PMID: 32802406 PMCID: PMC7410845 DOI: 10.1038/s41539-020-0070-0
Source DB: PubMed Journal: NPJ Sci Learn ISSN: 2056-7936
Fig. 1Methods.
a Pitch contours (M and SD) of the four Mandarin Chinese tones across syllables and female speakers included in the study. b To estimate the categories that would be easier (Tone 1 and Tone 3) and harder (Tone 2 and Tone 4) to learn, we examined an Aggregate dataset of 678 Mandarin tone learners collected across eight published training studies. Left. Individual and mean percent correct responses across learners and tone categories. Right. Mean percent correct responses (99% CI) across learners and categories for easier- and harder-to-learn categories. c Left. Categorization trial structure and categories paired with stimulation in each participant group. Right. tVNS-stimulus alignment in one example trial. d Before the training task, we conducted a perceptual identification task to rule out group differences in perceptual identification skills. Left. Participants were asked to categorize as “rising” or “level” a perceptual continuum of Mandarin tones ranging from high-level (Tone 1) to low-rising (Tone 2) pitch. Right. The slope of the perceptual identification curve was used as a metric of perceptual acuity. e Left. To assess the effects of tVNS on the sensory encoding of stimulus pitch, we collected frequency-following responses (FFRs) to Mandarin tones before and after the training task. Right. To assess neural pitch encoding quality, we correlated neural (FFR) and stimulus pitch.
Fig. 2Behavioral results.
a Left. Percent accuracy improvement (M and SEM) over Block 1 across subjects and categories for each participant group; the Generalization block (Block 7) is denoted as “GEN”. Middle-Right. Percent accuracy improvement (M and SEM) over Block 1 for easier-to-learn (middle) and harder-to-learn (right) categories. The asterisks denote statistical differences for group-by-block interactions (Control group, Block 1 = reference levels) in the following mixed-effects model: response outcome ~ group*block + (1 | subject) + (1 | tone category). b Percentage of false positives for Tone 1 and Tone 3 (M and SEM) by group and block. c Percentage of correct responses across subjects and categories for each participant group (M and SEM) and the Aggregate learning dataset, consisting of 678 comparable learners receiving no stimulation (M and 99% CI to compensate for the large sample size). d Percent of correct trials that were retained from the previous block.
Fig. 3Neural results.
a Top. Autoregression procedure used to remove tVNS pulse artifacts from the EEG signal. Center. Baseline and sub-threshold vagal evoked potentials (M and SEM) for participants receiving stimulation. The three significant evoked potentials are denoted as N1, P1, and N2. Bottom: inverted-U relationship between tVNS intensity and peak magnitude in the tVNS pulse-evoked response. The bars denote individual pulse-evoked magnitudes averaged for each intensity range (low, intermediate, and high intensities). b Stimulus-response correlation coefficients (FFR quality) for each participant, group, and tone before (x-axis) and after (y-axis) the training session. The panel shows a high degree of individual variability in FFR quality within each group (scatter plots) and no group differences in FFR quality before and after the training session (box plots). c Stimulus and neural (FFR) pitch by group (M and SEM) before and after the tVNS session.
No effects of tVNS on sensory encoding quality.
| Tone 1 (Easier to learn) | ||||||
|---|---|---|---|---|---|---|
| tVNS-easy | tVNS-hard | |||||
| Group | 0.15 | 1.38 | 0.17 | 0.02 | 0.19 | 0.85 |
| Group × session | 0.01 | 0.14 | 0.88 | 0.11 | 0.82 | 0.41 |