| Literature DB >> 25409322 |
Jan Rusz1, Carsten Saft2, Uwe Schlegel3, Rainer Hoffman2, Sabine Skodda3.
Abstract
PURPOSE: Although dysphonia has been shown to be a common sign of Huntington disease (HD), the extent of phonatory dysfunction in gene positive premanifest HD individuals remains unknown. The aim of the current study was to explore the possible occurrence of phonatory abnormalities in prodromal HD.Entities:
Mesh:
Year: 2014 PMID: 25409322 PMCID: PMC4237453 DOI: 10.1371/journal.pone.0113412
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Clinical characteristics of PreHD subjects.
| n = 28 (14 men) | Mean (SD) | Range |
| Age (years) | 37.1 (9.3) | 20–55 |
| UHDRS motor score | 2.2 (2.4) | 0–8 |
| Cognitive score | 337 (44) | 242–411 |
| Tapping | 189 (23) | 142–229 |
| Pegboard | 4492 (805) | 3469–7519 |
| Disease burden score | 251 (82) | 116–413 |
| Years to onset (years) | 16.7 (8.2) | 5–36 |
UHDRS = Unified Huntington's Disease Rating Scale.
Number of taps within a time period of 32 seconds, average value of the dominant and non-dominant hand.
Time period of peg insertion in 100 ms, average value of the dominant and non-dominant hand.
Overview of applied phonatory measurements.
| Abbreviation | Description | |
|
| ||
| MPT (s) | Maximum phonation time | Aerodynamic efficiency of the vocal tract measured as the maximum duration of the |
| prolonged vowel. This measure includes all voice breaks occurring during the entire | ||
| vowel phonation. | ||
| FOVB (s) | First occurrence of voice | Maximum duration of the prolonged vowel until the first occurrence of the first voice break, |
| break | present after at least 250 ms of modal phonation. | |
|
| ||
| NVB (−) | Number of voice breaks | Overall count of voice breaks. A voice break is defined as the distance between |
| consecutive pulses longer than 1.25 divided by the bottom of the pitch range. The | ||
| segment was defined as a voice break only if it occurred after at least 250 ms of modal | ||
| phonation and 1 s preceding the termination of phonation. Voice breaks may be | ||
| associated with both low frequency drop and vocal arrest. | ||
| DPB (%) | Degree of pitch breaks | The fraction of pitch frames marked as unvoiced. A frame was considered unvoiced if |
| it had voicing strength below the voicing threshold of 0.45 (autocorrelation function). | ||
| Silent periods were not considered in analyses. | ||
| DVA (%) | Degree of vocal arrests | The fraction of silent periods in the analysed voice signal. |
|
| ||
| F0 SD (st) | Standard deviation of | Variation in frequency of vocal fold vibration. The F0 sequence was converted to |
| fundamental frequency (F0) | a semitone scale to avoid differences in gender. | |
| RPDE (−) | Recurrence period density | Ability of the vocal folds to sustain simple vibration. RPDE quantifies the deviations |
| entropy | from periodicity, representing the uncertainty in the measurement of the pitch period. | |
|
| ||
| Jitter (%) | Frequency perturbation | Extent of variation of the voice range. Jitter is defined as the variability of the |
| fundamental frequency of speech from one cycle to the next. | ||
| Shimmer (%) | Amplitude perturbation | Extent of variation of expiratory flow. Shimmer is defined as the |
| sequence of maximum extent of the signal amplitude within each vocal cycle. | ||
|
| ||
| HNR (dB) | Harmonics-to-noise ratio | The amount of noise in the speech signal, mainly due to incomplete vocal fold closure. |
| HNR is defined as the amplitude of noise relative to tonal components in speech. | ||
| DFA (−) | Detrended fluctuation | The extent of turbulent noise in the speech signal. DFA measures the stochastic |
| analysis | self-similarity of the noise caused by turbulent airflow through the vocal folds. | |
|
| ||
| FTRI (%) | Frequency tremor intensity | Average ratio of the frequency magnitude of the most intense low-frequency modulating |
| index | components to the total frequency magnitude of the analysed voice signal. | |
| ATRI (%) | Amplitude tremor intensity | Average ratio of the amplitude of the most intense low-frequency amplitude modulating |
| index | components to the total amplitude of the analyzed voice signal. | |
|
| ||
| MFCC (−) | Mel-frequency cepstral | Vocal tract transfer function reflecting potential problems with subtle motion of the |
| coefficients | articulators (jaw, tongue, lips). The MFCC parameter here was defined as the mean of | |
| the standard deviations of the 1st-12th MFCCs. It was designed to represent overall stability | ||
| of individual vocal tract elements, as the individual MFCCs overlap the partitions of the | ||
| frequency domain. | ||
| ΔMFCC (−) | Delta MFCCs | The ΔMFCC parameter represents a similar function as MFCC and was defined as the |
| mean of the standard deviations of the 1st-12th delta MFCCs multiplied by 10. | ||
Results of voice analyses in PreHD individuals and appropriate controls.
| Parameter | Group | Effect size | |||
| PreHD | Controls | PreHD | |||
| Mean (SD) | Range | Mean (SD) | Range | vs. controls | |
|
| |||||
| MPT (s) | 16.9 (7.6) | 6.7–43.2 | 14.1 (4.3) | 9.9–25.4 | 0.44 |
| FOVB (s) | 12.4 (6.8) | 0.7–30.8 | 13.9 (4.4) | 9.9–25.4 | −0.26 |
|
| |||||
| NVB (−) | 1.11 (2.04) | 0–7 | 0.04 (0.19) | 0–1 | 0.74 |
| DPB (%) | 0.63 (1.14) | 0–4.50 | 0.09 (0.05) | 0–0.24 | 0.77 |
| DVA (%) | 0.09 (0.27) | 0–1.14 | 0 (0) | 0–0 | 0.46 |
|
| |||||
| F0 SD (st) | 0.30 (0.16) | 0.11–0.79 | 0.30 (0.13) | 0.16–0.67 | −0.02 |
| RPDE (−) | 0.28 (0.07) | 0.14–0.41 | 0.26 (0.05) | 0.18–0.38 | 0.48 |
|
| |||||
| Jitter (%) | 0.80 (0.55) | 0.16–2.25 | 0.61 (0.29) | 0.28–1.51 | 0.41 |
| Shimmer (%) | 3.69 (1.25) | 1.03–8.05 | 3.18 (1.42) | 1.47–8.39 | 0.38 |
|
| |||||
| HNR (dB) | 19.0 (4.1) | 12.4–30.5 | 22.3 (4.0) | 15.8–29.8 | −0.81 |
| DFA (−) | 0.64 (0.02) | 0.59–0.66 | 0.62 (0.02) | 0.58–0.66 | 1.15 |
|
| |||||
| FTRI (%) | 0.37 (0.17) | 0.07–0.73 | 0.42 (0.24) | 0.07–1.05 | −0.22 |
| ATRI (%) | 4.60 (3.39) | 1.09–16.59 | 4.74 (2.27) | 0.96–10.13 | −0.05 |
|
| |||||
| MFCC (−) | 0.43 (0.07) | 0.32–0.61 | 0.41 (0.08) | 0.31–0.65 | 0.26 |
| ΔMFCC (−) | 0.45 (0.06) | 0.36–0.58 | 0.44 (0.07) | 0.31–0.57 | 0.12 |
*p<0.01;
**p<0.001.
Cohen's d: Effect size 0.8 is considered large, 0.5 is considered medium, and 0.2 is considered small.
MPT = maximum phonation time; FOVB = first occurrence of voice break; NVB = number of voice breaks; DPB = degree of pitch breaks; DVA = degree of vocal arrests; F0 SD = variability of fundamental frequency; RPDE = recurrence period density entropy; HNR = harmonics-to-noise ratio; DFA = detrended fluctuation analysis; FTRI = frequency tremor intensity index; ATRI = amplitude tremor intensity index; MFCC = Mel-frequency cepstral coefficients; ΔMFCC = delta Mel-frequency cepstral coefficients.
Figure 1Most salient signs associated with phonatory dysfunction in PreHD individuals: (A) drop in the fundamental frequency to half of its original value over a short period of time; (B) short vocal arrest produced with no vibration of vocal folds; (C) probability density for ratio between harmonics and noise components in voice, with detail of three pitch periods for healthy and disordered voice.
Figure 2Results of voice analyses: (A) percentage of affected participants according to the specific phonatory dimension; (B) number of affected phonatory dimensions across participants.