| Literature DB >> 23762447 |
Jan Rusz1, Jiří Klempíř, Eva Baborová, Tereza Tykalová, Veronika Majerová, Roman Cmejla, Evžen Růžička, Jan Roth.
Abstract
PURPOSE: Although speech motor changes are reported as a common sign of Huntington's disease (HD), the most prominent signs of voice dysfunction remain unknown. The aim of the current study was to explore specific changes in phonatory function in subjects with HD.Entities:
Mesh:
Year: 2013 PMID: 23762447 PMCID: PMC3677914 DOI: 10.1371/journal.pone.0065881
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Overview of the phonatory measurements applied to sustained vowel phonations.
| Abbreviation | Description | |
|
| ||
| MPT (s) | Maximum phonation time | The aerodynamic efficiency of the vocal tract measured as the maximum duration of |
| the prolonged vowel. This measure includes all voice breaks occurring during the entire | ||
| vowel phonation. See Ramig et al. | ||
| MPTVB (s) | MPT until voice break | Maximum duration of the prolonged vowel until the first occurrence of voice break, |
| present after at least 250 ms of modal phonation. | ||
|
| ||
| NVB (−) | Number of voice breaks | Overall count of voice breaks. A voice break is defined as the distance between |
| consecutive pulses longer than 1.25 divided by the bottom of the pitch range. The | ||
| segment was defined as a voice break only if it occurred after at least 250 ms of modal | ||
| phonation and 250 ms preceding the termination of phonation. Voice breaks can be | ||
| associated with both low frequency drop and vocal arrest. | ||
| See Boersma and Weenink | ||
| DUV (%) | Degree of voicelessness | The fraction of pitch frames marked as unvoiced. A frame was considered unvoiced if |
| it had voicing strength below the voicing threshold of 0.45 (autocorrelation function). | ||
| See Boersma and Weenink | ||
|
| ||
| F0 SD (st) | Standard deviation of | The variation in frequency of vocal fold vibration. The F0 sequence was converted to |
| fundamental frequency (F0) | a semitone scale to avoid differences in gender. See Rusz et al. | |
| RPDE (−) | Recurrence period density | The ability of the vocal folds to sustain simple vibration. RPDE quantifies the deviations |
| entropy | from periodicity, representing the uncertainty in the measurement of the pitch period. | |
| See Little et al. | ||
| PPE (−) | Pitch period entropy | The inefficiency of voice frequency control. PPE uses the log-transformed linear |
| prediction residuals of the pitch sequence in order to smooth vibrato. See Little et al. | ||
| for further description. | ||
|
| ||
| Jitter (%) | Frequency perturbation | The extent of variation of the voice range. Jitter is defined as the variability of the |
| fundamental frequency of speech from one cycle to the next. | ||
| See Boersma and Weenink | ||
| Shimmer (%) | Amplitude perturbation | The extent of variation of expiratory flow. Shimmer is defined as the |
| sequence of maximum extent of the signal amplitude within each vocal cycle. | ||
| See Boersma and Weenink | ||
|
| ||
| HNR (dB) | Harmonics-to-noise ratio | The amount of noise in the speech signal, mainly due to incomplete vocal fold |
| closure. HNR is defined as the amplitude of noise relative to tonal components in | ||
| speech. See Boersma and Weenink | ||
| DFA (−) | Detrended fluctuation | The extent of turbulent noise in the speech signal. DFA measures the stochastic |
| analysis | self-similarity of the noise caused by turbulent airflow through the vocal folds. | |
| See Little et al. | ||
|
| ||
| MFCC (−) | Mel-frequency cepstral | Subtle changes in the motion of the articulators (jaw, tongue, lips). The MFCC |
| coefficient | represents the vocal tract transfer function reflecting potential problems in the | |
| articulators. The MFCC parameter here was defined as the mean of the standard | ||
| deviations of the 1st–12th MFCCs. It was designed to represent overall stability of | ||
| individual vocal tract elements, as the individual MFCCs overlap the partitions of the | ||
| frequency domain. The 1st–12th MFCCs were extracted using the implementation of | ||
| Brooke's | ||
| information on MFCCs. | ||
Results of voice analyses in HD and HC subjects.
| Parameter | Group | Effect size | ||||
| HD | HC | (Cohen's | ||||
| Mean ± SD | Range | Mean ± SD | Range | HD vs. HC | ||
|
| ||||||
| MPT (s) | 8.35±6.38 | 0.37–24.32 | 22.15±6.43 | 10.30–37.98 | −2.16 | |
| MPTVB (s) | 5.48±4.87 | 0.37–22.63 | 21.46±6.79 | 10.30–37.98 | −2.70 | |
|
| ||||||
| NVB (−) | 4.78±10.90 | 0–50.8 | 0.42±0.99 | 0–3.75 | 0.56 | |
| DUV (%) | 6.28±7.52 | 0–31.42 | 0.22±0.76 | 0–4.36 | 1.14 | |
|
| ||||||
| F0 SD (st) | 1.44±1.12 | 0.31–4.14 | 0.31±0.12 | 0.15–0.65 | 1.43 | |
| RPDE (−) | 0.39±0.10 | 0.19–0.56 | 0.24±0.04 | 0.16–0.34 | 1.89 | |
| PPE (−) | 0.40±0.20 | 0.11–1.08 | 0.21±0.12 | 0.06–0.57 | 1.10 | |
|
| ||||||
| Jitter (%) | 1.22±0.99 | 0.27–4.61 | 0.62±0.34 | 0.17–1.73 | 0.80 | |
| Shimmer (%) | 6.36±3.70 | 2.26–17.61 | 3.99±1.80 | 1.75–9.61 | 0.81 | |
|
| ||||||
| HNR (dB) | 18.29±5.27 | 5.68–25.27 | 22.46±2.97 | 16.63–28.53 | 0.98 | |
| DFA (−) | 0.63±0.02 | 0.60–0.70 | 0.62±0.01 | 0.59–0.64 | 0.97 | |
|
| ||||||
| MFCC (−) | 0.59±0.13 | 0.35–0.90 | 0.38±0.04 | 0.31–0.48 | 2.16 | |
p<0.05;
p<0.01;
p<0.001.
MPT = maximum phonation time, MPTVB = maximum phonation time until first break, NVB = number of voice breaks, DUV = degree of voicelessness, F0 SD = variability of fundamental frequency, RPDE = recurrence period density entropy, PPE = pitch period entropy, HNR = harmonics-to-noise ratio, DFA = detrended fluctuation analysis, MFCC = mel-frequency cepstral coefficient.
Figure 1Selected pairs of the phonatory measures with classification boundary separating HD and HC subjects.
Figure 2Scheme depicting the most salient features of dysphonia in HD and their relationship to motor symptoms.