| Literature DB >> 32233834 |
Lionel Fontan1, Tom Cretin-Maitenaz2,3, Christian Füllgrabe4.
Abstract
Entities:
Keywords: age-related hearing loss; automatic speech recognition; cognition; speech intelligibility; suprathreshold auditory processing
Year: 2020 PMID: 32233834 PMCID: PMC7119229 DOI: 10.1177/2331216520914769
Source DB: PubMed Journal: Trends Hear ISSN: 2331-2165 Impact factor: 3.293
Figure 1.Results of Pure-Tone Air-Conduction Audiometry for the Test (i.e., Right) Ears of the 24 OHI Participants. The thin lines represent the individual audiograms. The thick line and associated gray-shaded area represent the mean audiogram ±1 SD.
OHI = older hearing-impaired; SD = standard deviation.
Individual Characteristics for the 24 Older Hearing-Impaired (OHI) Participants in Terms of Gender (F = Female, M = Male), Age (Years), MMSE Score (Out of 30), and Pure-Tone Average for Audiometric Frequencies of 0.5, 1, 2, and 4 kHz (PTA; in dB HL) for the Test (i.e., Right) Ear.
| Participant | Gender | Age | MMSE | PTA |
|---|---|---|---|---|
| OHI1 | F | 73 | 29 | 19.5 |
| OHI2 | F | 69 | 27 | 21 |
| OHI3 | H | 68 | 28 | 24 |
| OHI4 | H | 71 | 29 | 27.5 |
| OHI5 | H | 67 | 29 | 28.5 |
| OHI6 | H | 73 | 29 | 32 |
| OHI7 | F | 75 | 29 | 36 |
| OHI8 | F | 60 | 30 | 36.5 |
| OHI9 | F | 87 | 30 | 37 |
| OHI10 | H | 62 | 29 | 40.5 |
| OHI11 | H | 61 | 30 | 41.5 |
| OHI12 | H | 61 | 29 | 45.5 |
| OHI13 | H | 66 | 28 | 45.5 |
| OHI14 | H | 85 | 27 | 45.5 |
| OHI15 | H | 70 | 29 | 46 |
| OHI16 | F | 89 | 27 | 48 |
| OHI17 | F | 67 | 29 | 50 |
| OHI18 | H | 73 | 28 | 51 |
| OHI19 | H | 77 | 27 | 52 |
| OHI20 | F | 77 | 28 | 53 |
| OHI21 | F | 78 | 27 | 54 |
| OHI22 | H | 67 | 30 | 54.5 |
| OHI23 | H | 67 | 29 | 57 |
| OHI24 | H | 68 | 29 | 61.5 |
| OHIMean | (9F/15M) | 71.3 | 28.6 | 42.0 |
Note. Participants are ranked in increasing order of PTA. MMSE = Mini-Mental State Examination; PTA = pure-tone average.
Figure 2.Speech Intelligibility for Logatoms, Words, and Sentences. Circles represent individual scores for the 24 OHI participants. Overlapping data points at the extremes are displaced horizontally for better visibility. The thick black line indicates median performance, and the gray area represents the associated interquartile range.
Figure 3.Human Intelligibility Scores Plotted as a Function of PTA and Machine Intelligibility Scores. Upper row shows scatterplots relating human intelligibility scores to the PTA computed over frequencies 0.5, 1, 2, and 4 kHz for each participant, and lower row shows scatterplots relating human intelligibility scores to individual scores predicted by the ASR system with a small lexicon and using the ARHL simulation implementing elevation of hearing thresholds, loss of frequency selectivity, and loudness recruitment. Each panel shows the results for a given speech material: logatoms (left panel), words (middle panel), and sentences (right panel). The different colors indicate different ranges of PTAs (see symbol legend).
PTA = pure-tone average; SRT = speech reception threshold.
Median Machine Intelligibility (in %) and Correlation Between Human and Machine Intelligibility Scores (Spearman’s Rho, One-Tailed), Using the Small, Medium, and Large Lexicon (Rows) for Each of the Three Speech Materials (Columns).
Logatoms | Words | Sentences | ||||
|---|---|---|---|---|---|---|
| Machine intelligibility | Correlation | Machine intelligibility | Correlation | Machine intelligibility | Correlation | |
| Small | 30.6 | .81*** (19.7) | 36.7 | .77*** (24.0) | 32.2 | .71*** (31.6) |
| Medium | 27.1 | .83*** (18.9) | 25.0 | .83*** (23.7) | 20.5 | .73*** (31.4) |
| Large | 4.2 | .83*** (22.8) | 12.5 | .76*** (25.1) | 15.8 | .70*** (32.0) |
Note. The root-mean-square error obtained using a linear regression is reported between brackets.
***p < .001; one-tailed test.
Spearman Correlation Coefficients Between Human Intelligibility Scores and Machine Intelligibility Scores (Obtained With the Small Lexicon) for Each of the Three Speech Materials, as a Function of ARHL-Simulation Condition (HT elevation, HT elevation + FS loss, HT elevation + LR, and HT elevation + FS loss + LR).
Human intelligibility | |||
|---|---|---|---|
| ARHL simulation | Logatoms | Words | Sentences |
| HT | .82*** (19.0) | .86*** (21.7) | .75*** (29.6) |
| HT + FS | .75*** (19.6) | .80*** (24.1) | .77*** (31.3) |
| HT + LR | .89*** (18.1) | .84*** (21.8) | .80*** (29.4) |
| HT + FS + LR | .81*** (19.7) | .79*** (24.0) | .71*** (31.6) |
Note. The root-mean-square error obtained using a linear regression is reported between brackets. ARHL = age-related hearing loss. HT = elevation of hearing thresholds; FS = loss of frequency selectivity; LR = loudness recruitment.
***p < .001; one-tailed test.
Results of Multiple Linear Regressions for the Prediction of Human Logatom-, Word-, and Sentence-Intelligibility Scores Using Machine Intelligibility Scores and MMSE Scores as Predictor Variables.
| Speech material |
| RMSE | Predictor | β coef | ||
|---|---|---|---|---|---|---|
| Logatoms | .78 | 17.2 | Machine intelligibility | .828 | .739 | <.001 |
| MMSE score | .196 | .038 | .074 | |||
| Words | .65 | 21.7 | Machine intelligibility | .750 | .584 | <.001 |
| MMSE score | .261 | .068 | .056 | |||
| Sentences | .52 | 26.6 | Machine intelligibility | .602 | .387 | .001 |
| MMSE score | .367 | .134 | .024 |
Note. Machine intelligibility scores were always entered first into the regression analysis. For each of the three speech materials, the explained variance (R2) and root-mean-square error (RMSE) for the entire model are given, as well as the contribution of each predictor in terms of its standardized coefficient (β coef), associated change in the amount of explained variance (R2 change), and significance (p value). MMSE = Mini-Mental State Examination.