| Literature DB >> 35448023 |
Daniel Kim1, Sarah Diehl1, Michael de Riesthal1, Kris Tjaden2, Stephen M Wilson1, Daniel O Claassen3, Antje S Mefferd1.
Abstract
Although researchers have recognized the need to better account for the heterogeneous perceptual speech characteristics among talkers with the same disease, guidance on how to best establish such dysarthria subgroups is currently lacking. Therefore, we compared subgroup decisions of two data-driven approaches based on a cohort of talkers with Huntington's disease (HD): (1) a statistical clustering approach (STATCLUSTER) based on perceptual speech characteristic profiles and (2) an auditory free classification approach (FREECLASS) based on listeners' similarity judgments. We determined the amount of overlap across the two subgrouping decisions and the perceptual speech characteristics driving the subgrouping decisions of each approach. The same speech samples produced by 48 talkers with HD were used for both grouping approaches. The STATCLUSTER approach had been conducted previously. The FREECLASS approach was conducted in the present study. Both approaches yielded four dysarthria subgroups, which overlapped between 50% to 78%. In both grouping approaches, overall bizarreness and speech rate characteristics accounted for the grouping decisions. In addition, voice abnormalities contributed to the grouping decisions in the FREECLASS approach. These findings suggest that apart from overall bizarreness ratings, indexing dysarthria severity, speech rate and voice characteristics may be important features to establish dysarthria subgroups in HD.Entities:
Keywords: auditory free classification; dysarthria; perceptual speech assessment; perceptual speech characteristics
Year: 2022 PMID: 35448023 PMCID: PMC9025673 DOI: 10.3390/brainsci12040492
Source DB: PubMed Journal: Brain Sci ISSN: 2076-3425
Figure 1Auditory free classification approach (FREECLASS) paradigm. A listener can play and listen to the speech samples by clicking each box. A listener can then drag and drop the boxes into the grid. At least one side of one box must touch at least one side of another box on the grid for the boxes to be considered members of the same subgroup. * Additional instruction to allow listeners to make notations and comments was provided because the task could potentially pose a high demand for working memory.
Figure 2A flowchart depicting the summary of study design procedures. Black boxes indicate data and processes that have been previously collected and completed in Diehl et al., 2019 [10]. Blue boxes indicate methods conducted to determine the overlap between the two group approaches. Finally, green boxes indicate study procedures to determine contributing speech characteristics for the findings of the FREECLASS approach. Please note that the same perceptual speech profiles based on the Mayo Clinic dysarthria rating scale were used in both approaches. (A) Additive similarity tree cluster analysis; (B) chi-square test for associations and descriptive statistics; (C) multidimensional scaling (MDS) analysis; (D) correlation analyses; (E) Kruskal–Wallis H tests.
Figure 3A dendrogram depicting the results of an additive similarity tree cluster analysis. The red lines indicate the boundaries made in defining the identified subgroups of the study. Participant FX, circled in red, did not belong to any of the identified four groups and was excluded from the analyses.
Number of talkers overlapping between the identified four subgroups from the FREECLASS approach and the subgroups identified previously using computer-generated cluster analysis based on a dysarthria rating scale (STATCLUSTER) [10].
| Identified Subgroups | Subgroups | # of Overlapped |
|---|---|---|
| (# of Talkers) | (Diehl et al., 2019 [ | Talkers (%) |
| Subgroup 1 | Subgroup 1 (n = 9) | 2 (20) |
| (10 talkers; M = 3, F = 7) | Subgroup 2 (n = 9) | |
| Subgroup 3 (n = 16) | 2 (20) | |
| Subgroup 4 (n = 14) | 1 (10) | |
| Subgroup 2 | Subgroup 2 (n = 9) | 3 (30) |
| (10 talkers; M = 6, F = 4) | Subgroup 3 (n = 16) | |
| Subgroup 3 | Subgroup 1 (n = 9) | 3 (23.1) |
| (13 talkers; M = 1, F = 12) | Subgroup 2 (n = 9) | 1 (7.7) |
| Subgroup 3 (n = 16) | ||
| Subgroup 4 (n = 14) | 2 (15.4) | |
| Subgroup 4 | Subgroup 1 (n = 9) | 3 (21.4) |
| (14 talkers; M = 5, F = 9) | Subgroup 4 (n = 14) |
M, Males; F, Females.
Figure 4A correlation plot describing the results of the chi-square test for associations. The blue circles indicate the degree of positive associations between the subgroups of the FREECLASS approach and the subgroups of the STATCLUSTER approach. The red circles indicate the degree of negative associations.
Figure 5Perceptual similarity data plotted in the common spaces based on the dimensions revealed from MDS analysis. (A) A perceptual space defined by Dimension 1 (D1) and Dimension 2 (D2); (B) a perceptual space defined by D1 and Dimension 3 (D3); (C) a perceptual space defined by D2 and D3.
Significant findings of the correlation analyses between MDS dimensions and 32 speech characteristics with the addition of sex.
| Dimension 1 | r | Dimension 2 | r | Dimension 3 | r |
|---|---|---|---|---|---|
| Bizarreness a b (GI) | 0.835 | Hoarse voice a b (VQ) | 0.422 | Imprecise consonants a b (A) | 0.547 |
| Rate a b (P) | −0.710 | Loudness overall a b (L) | −0.421 | Strained-strangled | 0.539 |
| Excess and equal stress a b (P) | 0.616 | Variable rate a b (P) | −0.414 | Intelligibility a b (GI) | 0.497 |
| Intervals prolonged a b (P) | 0.609 | Breathy voice (VQ) | 0.398 | Irregular articulatory breakdown a b (A) | 0.487 |
| Inappropriate silences a b (P) | 0.597 | Sex | −0.356 | Vowels distorted a b (A) | 0.486 |
| Imprecise consonants a b (A) | 0.583 | Reduced stress (P) | 0.333 | Reduced stress a (P) | 0.412 |
| Phonemes prolonged a b (A) | 0.572 | Harsh voice b (VQ) | 0.302 | Voice stoppages (P) | 0.382 |
| Intelligibility a b (GI) | 0.550 | Voice tremor (PC) | 0.369 | ||
| Vowels distorted a b (A) | 0.536 | Monoloudness b (L) | −0.349 | ||
| Irregular articulatory breakdown a b (A) | 0.521 | Rate b (P) | 0.348 | ||
| Monopitch a b (PC) | 0.460 | Excess and equal stress b (P) | −0.334 | ||
| Excess loudness | 0.414 | ||||
| Alternating loudness a (L) | 0.410 | ||||
| Monoloudness b (L) | 0.394 | ||||
| Loudness overall b (L) | −0.389 | ||||
| Harsh voice b (VQ) | 0.327 | ||||
| Loudness decay (L) | 0.308 |
Note: All speech characteristics with significant correlations (p < 0.05) ranked in order from the highest to lowest correlations under each dimension. A, articulation; GI, general impression; L, loudness; P, prosody; PC, pitch characteristics; R, respiration; VQ, voice quality; a Characteristics that have correlation coefficients 0.4 or greater; b Characteristics that have significant subgroup effects (p < 0.05) from either the Kruskal–Wallis test or Fisher’s exact test.
The results of Kruskal–Wallis H test on variables that are significantly correlated with the revealed three dimensions from MDS analysis.
| Variable | Kruskal–Wallis Test H | df | Sig. |
|---|---|---|---|
| Alternating loudness | 7.02 | 3 | 0.071 |
| Bizarreness | 27.58 | 3 | <0.001 * |
| Breathy voice | 3.27 | 3 | 0.352 |
| Excess and equal stress | 17.83 | 3 | <0.001 * |
| Excess loudness variation | 16.89 | 3 | 0.001 * |
| Harsh voice | 8.64 | 3 | 0.034 * |
| Hoarse voice | 8.23 | 3 | 0.042 * |
| Imprecise consonants | 17.1 | 3 | 0.001 * |
| Inappropriate silences | 19.47 | 3 | <0.001 * |
| Intelligibility | 21.11 | 3 | <0.001 * |
| Intervals prolonged | 14.46 | 3 | 0.002 * |
| Irregular articulatory breakdown | 16.23 | 3 | 0.001 * |
| Loudness decay | 2.59 | 3 | 0.458 |
| Loudness overall | 8.59 | 3 | 0.035 * |
| Monoloudness | 14.53 | 3 | 0.002 * |
| Monopitch | 10.72 | 3 | 0.013 * |
| Phonemes prolonged | 15.28 | 3 | 0.002 * |
| Rate | 21.39 | 3 | <0.001 * |
| Reduced stress | 7.56 | 3 | 0.056 |
| Sex | NA | NA | NA |
| Strained-strangled voice | 11.62 | 3 | 0.009 * |
| Variable rate | 9.12 | 3 | 0.028 * |
| Voice stoppages | 4.32 | 3 | 0.229 |
| Voice tremor | 7.29 | 3 | 0.063 |
| Vowels distorted | 15.25 | 3 | 0.002 * |
* p < 0.05; NA, not adequate for the specific analysis.
Multiple comparisons for each variable that showed significant Kruskal–Wallis H test results and moderate correlations with the dimensions from MDS analysis.
| Variable |
|
| Variable |
|
|
|---|---|---|---|---|---|
| Bizarreness | Excess and equal stress | ||||
| (D1) | (D1) | ||||
| 4-3 | 18.5 | 0.003 * | 4-3 | 17.4 | 0.006 * |
| 4-2 | 19.9 | 0.003 * | 4-2 | 20.9 | 0.001 * |
| 4-1 | 27.8 | <0.001 * | 4-1 | 16.1 | 0.027 * |
| Excess loudness | Inappropriate silences | ||||
| variation | (D1) | ||||
| (D1) | 4-3 | 19.7 | 0.001 * | ||
| 4-1 | 18.0 | 0.001 * | 4-2 | 20.7 | 0.002 * |
| 2-1 | 16.5 | 0.007 * | 4-1 | 16.2 | 0.025 * |
| Intervals prolonged | Rate | ||||
| (D1) | (D1) | ||||
| 4-2 | 19.7 | 0.003 * | 4-2 | −25.9 | <0.001 * |
| 4-1 | 15.7 | 0.031 * | 4-1 | −14.5 | 0.064 |
| 3-2 | −14.9 | 0.059 | |||
| Monopitch | Phonemes prolonged | ||||
| (D1) | (D1) | ||||
| 4-2 | 18.2 | 0.008 * | 4-2 | 15.1 | 0.045 * |
| 4-1 | 20.8 | 0.001 * | |||
| Intelligibility | Imprecise Consonants | ||||
| (D1 and D3) | (D1 and D3) | ||||
| 4-3 | 14.1 | 0.040 * | 4-1 | 23.2 | <0.001 * |
| 4-1 | 25.4 | <0.001 * | |||
| Irregular articulatory breakdown | Vowels distorted | ||||
| (D1 and D3) | (D1 and D3) | ||||
| 4-3 | 15.2 | 0.024 * | 4-3 | 13.3 | 0.068 |
| 4-1 | 20.7 | 0.002 * | 4-1 | 21.3 | 0.001 * |
| Hoarse Voice | Variable rate | ||||
| (D2) | (D2) | ||||
| 3-1 | 11.4 | 0.084 | 4-3 | 14.2 | 0.04 * |
| Loudness overall | Strained-strangled voice | ||||
| (D2) | (D3) | ||||
| 4-1 | −13.7 | 0.052 | 4-1 | 16.7 | 0.017 * |
| 2-1 | 7.3 | 0.041 * |
* p < 0.05 (significance values have been adjusted by the Bonferroni correction for multiple tests).
Figure 6Subgroup means (95% confidence interval) for the ratings of perceptual measures. ** p < 0.05; * p ≤ 0.084.
Ranked correlations between the ratings of overall bizarreness and the ratings of all the perceptual speech characteristics that were moderately correlated with the identified dimensions.
| Variable | Pearson Correlation | Sig. | Correlated Dimensions |
|---|---|---|---|
| Intelligibility | 0.789 | <0.001 * | 1 and 3 |
| Imprecise consonants | 0.764 | <0.001 * | 1 and 3 |
| Irregular articulatory breakdown | 0.730 | <0.001 * | 1 and 3 |
| Vowels distorted | 0.664 | <0.001 * | 1 and 3 |
| Inappropriate silences | 0.619 | <0.001 * | 1 |
| Alternating loudness | 0.520 | <0.001 * | 1 |
| Intervals prolonged | 0.515 | <0.001 * | 1 |
| Excess and equal stress | 0.494 | <0.001 * | 1 and 3 |
| Monopitch | 0.487 | 0.001 * | 1 |
| Rate | −0.474 | 0.001 * | 1 and 3 |
| Reduced stress | 0.449 | 0.002 * | 2 and 3 |
| Excess loudness variation | 0.437 | 0.002 * | 1 |
| Harsh voice | 0.423 | 0.003 * | 1 and 2 |
| Phonemes prolonged | 0.416 | 0.004 * | 1 |
| Strained-strangled voice | 0.415 | 0.004 * | 3 |
| Loudness overall | −0.412 | 0.004 * | 1 and 2 |
| Breathy voice | 0.391 | 0.007 * | 2 |
| Monoloudness | 0.385 | 0.007 * | 1 and 3 |
| Voice stoppages | 0.380 | 0.008 * | 3 |
| Variable rate | 0.368 | <0.001 * | 2 |
| Hoarse voice | 0.284 | 0.053 | 2 |
| Voice tremor | 0.225 | 0.128 | 3 |
* p < 0.05; all speech characteristics that had significant correlations with the identified dimensions are ranked in order from the highest to lowest correlations with the ratings of overall bizarreness.