| Literature DB >> 32552477 |
Tali Rotman1, Limor Lavie1, Karen Banai1.
Abstract
Challenging listening situations (e.g., when speech is rapid or noisy) result in substantial individual differences in speech perception. We propose that rapid auditory perceptual learning is one of the factors contributing to those individual differences. To explore this proposal, we assessed rapid perceptual learning of time-compressed speech in young adults with normal hearing and in older adults with age-related hearing loss. We also assessed the contribution of this learning as well as that of hearing and cognition (vocabulary, working memory, and selective attention) to the recognition of natural-fast speech (NFS; both groups) and speech in noise (younger adults). In young adults, rapid learning and vocabulary were significant predictors of NFS and speech in noise recognition. In older adults, hearing thresholds, vocabulary, and rapid learning were significant predictors of NFS recognition. In both groups, models that included learning fitted the speech data better than models that did not include learning. Therefore, under adverse conditions, rapid learning may be one of the skills listeners could employ to support speech recognition.Entities:
Keywords: age-related hearing loss; auditory learning; fast speech; speech recognition
Mesh:
Year: 2020 PMID: 32552477 PMCID: PMC7303778 DOI: 10.1177/2331216520930541
Source DB: PubMed Journal: Trends Hear ISSN: 2331-2165 Impact factor: 3.293
Figure 1.Mean Audiograms. Mean thresholds and standard deviations are shown; Older adults (OA) audiogram in full lines, young adults (YA) in dashed lines.
Speech Recognition (Proportions Correct) in Older and Younger Adults.
| Older adults | Younger adults | |
|---|---|---|
| NFS—Talker 1 | ||
| | 0.36 (0.18) [0.31, 0.41] | 0.93 (0.05) [0.92, 0.94] |
| | 0.34 (0.21–0.48) | 0.93 (0.91–0.97) |
| NFS—Talker 2 | ||
| | – | 0.84 (0.07) [0.82, 0.86] |
| | – | 0.85 (0.80–0.90) |
| SIN | ||
| | – | 0.62 (0.14) [0.59, 0.66] |
| | – | 0.67 (0.54–0.72) |
| TCS baseline | ||
| | 0.37 (0.22) [0.30, 0.44] | 0.08 (0.09) [0.06, 0.11] |
| | 0.36 (0.18–0.52) | 0.08 (0–0.1) |
Note. NFS = natural-fast speech; CI = confidence interval; IQR = interquartile range; SIN = speech in noise; TCS = time-compressed speech.
Estimates of Hearing and Cognition.
| Older adults | Younger adults | |
|---|---|---|
| Hearing (PTA4, dB) | ||
| | 50 (8) | 5 (3) |
| | 49 (44–56) | 5 (3–7) |
| Vocabulary (scaled score) | ||
| | 9 (2) | 12 (2) |
| | 9 (7–11) | 12 (11–13) |
| Digit span (scaled score) | ||
| | 9 (2) | 10 (2) |
| | 9 (7–11) | 10 (8–12) |
| Block design (scaled score) | ||
| | 9 (2) | 12 (2) |
| | 9 (7–12) | 12 (10–14) |
| Flanker cost | ||
| | 0.99 (0.15) | 1.00 (0.03) |
| | 1.01 (1.008–1.016) | 1.01 (1.002–1.014) |
Note. PTA = pure-tone average; IQR = interquartile range.
Correlations Between Speech Recognition in Younger and Older Adults and Cognition and Learning.
| NFS 1 | NFS 2 | SIN | ||
|---|---|---|---|---|
| OA | YA | YA | YA | |
| Hearing | −0.50 | −0.03 | −0.04 | 0.1 |
| TCS baseline | 0.82 | 0.09 | 0.26 | 0.32 |
| Vocabulary | 0.47 | 0.30 | 0.40 | 0.35 |
| Working memory | 0.44 | 0.11 | 0.15 | 0.27 |
| Block design | 0.11 | 0.14 | −0.09 | 0.27 |
| Attention | −0.02 | 0.13 | 0.02 | 0.17 |
| Slope | 0.27 | 0.47 | 0.54 | 0.49 |
Note. Pearson correlations are shown. NFS 1 = natural-fast speech, Talker 1; NFS 2 = natural-fast speech, Talker 2; SIN = speech in noise; OA = older adults; YA = young adults; Hearing = average PTA; TCS baseline = average of first two sentences of time-compressed speech. Vocabulary, working memory, and block design—scaled scores from the corresponding tests; Attention = cost in the flanker task; Slope = rapid perceptual learning slope. Note that all correlations between rapid learning slopes and the cognitive variables were low (r < .19).
Figure 2.Rapid Learning of Time-Compressed Speech in Younger (Top Row) and Older (Bottom Row) Adults. The leftmost panel in each row shows performance (averaged over blocks of five sentences, not including baseline sentences; thin gray lines mark individual participants). Slopes of the learning curves (see text for details; individual data are shown with gray symbols) are shown in the middle panel of each row. Boxes mark the interquartile range (25th–75th percentile). The thick line within each box marks the median (blue for young adults and green for older adults). + signs mark outlying data points. The rightmost panel on each line shows performance on the final set of five sentences versus performance on the first set. Dashed diagonal lines mark y = x; thus, all symbols above this diagonal indicate learning. The dashed thicker lines (blue and green) show the linear fit between final and initial performance. Note that statistical analysis was based on the models described later, and fits are shown for demonstration only.
Figure 3.Speech Recognition Versus Rapid Learning. Proportions correct are plotted against the rapid learning slopes for younger (top row) and older (bottom row) adults. Left to right: natural-fast speech (NFS) produced by Talker 1, NFS produced by Talker 2, and speech-in-noise (SIN). Dashed lines show linear fits. Note that for the purpose of visual demonstration only, values of the learning slopes were adjusted to partial out the contribution of baseline recognition of time-compressed speech to the observed correlation (for details, see Manheim et al., 2018). Therefore, values on the x axis do not match the learning slopes shown in Figure 2.
Estimates of Natural-Fast Speech Recognition (Talker 1) Based on the “Full” Model.
| Younger
adults | Older
adults | |||
|---|---|---|---|---|
| Predictor | β (SE) |
| β ( |
|
| Hearing | 0.0148 (0.0995) | 0.15 | −0.1568 (0.0520) | −3.01** |
| Vocabulary | 0.1993 (0.0931) | 2.14* | 0.1910 (0.0585) | 3.50** |
| Working memory | 0.0386 (0.0937) | 0.41 | 0.0725 (0.0497) | 1.46 |
| Attention | 0.0584 (0.1076) | 0.54 | 0.0747 (0.0469) | 1.59 |
| TCS baseline | −0.15 (0.1100) | −1.41 | 0.6498 (0.0578) | 11.23*** |
| Rapid learning | 0.5170 (0.1183) | 4.37*** | 0.2118 (0.0469) | 4.52*** |
Note. TCS = time-compressed speech.
*p < .05. **p < .01. ***p < .001.
Estimates of Natural-Fast Speech (Talker 2) and Speech in Noise in Younger Adults Based on the “Full” Model.
| NFS (Talker
2) | SIN | |||
|---|---|---|---|---|
| Predictor | β ( |
| β ( |
|
| Hearing | 0.0159 (0.0778) | 0.21 | 0.1475 (0.0921) | 1.60 |
| Vocabulary | 0.2273 (0.0735) | 3.09** | 0.2589 (0.0877) | 2.95** |
| Working memory | 0.0667 (0.0735) | 0.91 | 0.2388 (0.0880) | 2.71** |
| Attention | −0.0242 (0.0805) | −0.30 | 0.1104 (0.0940) | 1.21 |
| TCS baseline | 0.0534 (0.0858) | 0.62 | 0.1248 (0.0982) | 1.27 |
| Rapid learning | 0.3696 (0.0880) | 4.20*** | 0.4080 (0.0984) | 4.15*** |
Note. NFS = natural-fast speech; SIN = speech in noise; TCS = time-compressed speech.**p<0.01, ***p<0.001.