| Literature DB >> 35937895 |
Mingxing Zhu1,2, Xin Wang2,3,4, Hanjie Deng5, Yuchao He2,4, Haoshi Zhang2,3,4, Zhenzhen Liu6, Shixiong Chen2,4, Mingjiang Wang1, Guanglin Li2,4.
Abstract
Pitch, as a sensation of the sound frequency, is a crucial attribute toward constructing a natural voice for communication. Producing intelligible sounds with normal pitches depend on substantive interdependencies among facial and neck muscles. Clarifying the interrelations between the pitches and the corresponding muscular activities would be helpful for evaluating the pitch-related phonating functions, which would play a significant role both in training pronunciation and in assessing dysphonia. In this study, the speech signals and the high-density surface electromyography (HD sEMG) signals were synchronously acquired when phonating [a:], [i:], and [ә:] vowels with increasing pitches, respectively. The HD sEMG energy maps were constructed based on the root mean square values to visualize spatiotemporal characteristics of facial and neck muscle activities. Normalized median frequency (nMF) and root-mean square (nRMS) were correspondingly extracted from the speech and sEMG recordings to quantitatively investigate the correlations between sound frequencies and myoelectric characteristics. The results showed that the frame-wise energy maps built from sEMG recordings presented that the muscle contraction strength increased monotonously across pitch-rising, with left-right symmetrical distribution for the face/neck. Furthermore, the nRMS increased at a similar rate to the nMF when there were rising pitches, and the two parameters had a significant correlation across different vowel tasks [(a:) (0.88 ± 0.04), (i:) (0.89 ± 0.04), and (ә:) (0.87 ± 0.05)]. These findings suggested the possibility of utilizing muscle contraction patterns as a reference for evaluating pitch-related phonation functions. The proposed method could open a new window for developing a clinical approach for assessing the muscular functions of dysphonia.Entities:
Keywords: high-density; phonation function; pitches; speech communication; surface electromyogram
Year: 2022 PMID: 35937895 PMCID: PMC9354519 DOI: 10.3389/fnins.2022.941594
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 5.152
Figure 1(A) Experimental setup to record the speech and HD sEMG signals simultaneously; (B) Placement of the high-density surface electrodes on the facial and neck muscles.
Figure 2The amplitude and spectrogram of the speech signals when phonating the vowel [a:] with continuously increasing pitch. (A) The amplitude of the speech signal as a function of time; (B) The time-frequency spectrogram of the speech signal.
Figure 3Amplitude spectra of the speech signal with an increasing pitch from a series of analysis windows (F1–F40).
Figure 4The temporal waveforms of the HD sEMG signals recorded from all the 120 surface electrodes on the facial and neck muscles during the phonation with rising pitch.
Figure 5The HD sEMG energy maps of the facial and neck muscles by using 40 analysis windows with a length of 250 ms during phonating vowel [a:] with pitch increase: (A) A typical frame of HD sEMG energy map visualizing the facial and neck muscle activities; (B) HD energy maps of the facial muscles; (C) HD energy maps of the neck muscles.
Figure 6(A) Comparison of the nMF of the speech signals and the nRMS of the sEMG recordings from one channel when phonating vowel [a:] with increasing pitch; (B) Correlation coefficients between the nRMS values of 120 channels of sEMG recordings and the nMF of the speech signal from three repeated tests.
Figure 7The averaged HD sEMG energy maps of the facial and neck muscles during the phonating vowel [i:] with a pitch increase.
Figure 8The averaged HD sEMG energy maps of the facial and neck muscles during the phonating vowel [ә:] on a rising pitch scale.
Figure 9(A) Comparison of the nMF of the speech signals and the nRMS of the sEMG recordings from channel E8 when phonating three different vowels [(a:), (i:), and (ә:)] with increasing pitch; (B) Correlation coefficients between the nRMS values of 120 channels of sEMG recordings and the nMF of the speech signal across three different vowels [(a:), (i:), and (ә:)].
The average correlation coefficients between the nRMS of sEMG recordings from all channels and the nMF of the speech signals across all recruited subjects (p < 0.05).
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| [a:] | M | 0.83 | 0.94 | 0.85 | 0.94 | 0.83 | 0.86 | 0.90 | 0.87 | 0.87 | 0.83 | 0.89 | 0.95 | 0.90 | 0.89 |
| SD | 0.02 | 0.02 | 0.04 | 0.02 | 0.06 | 0.02 | 0.03 | 0.1 | 0.04 | 0.03 | 0.03 | 0.01 | 0.03 | 0.01 | |
| 0.88 ± 0.04 | |||||||||||||||
| [i:] | M | 0.88 | 0.90 | 0.91 | 0.96 | 0.86 | 0.84 | 0.82 | 0.92 | 0.83 | 0.90 | 0.91 | 0.92 | 0.88 | 0.90 |
| SD | 0.05 | 0.02 | 0.02 | 0.02 | 0.05 | 0.03 | 0.07 | 0.02 | 0.03 | 0.03 | 0.06 | 0.03 | 0.03 | 0.05 | |
| 0.89 ± 0.04 | |||||||||||||||
| [ә:] | M | 0.90 | 0.92 | 0.82 | 0.95 | 0.90 | 0.87 | 0.78 | 0.87 | 0.90 | 0.89 | 0.79 | 0.95 | 0.81 | 0.88 |
| SD | 0.02 | 0.01 | 0.05 | 0.01 | 0.01 | 0.01 | 0.02 | 0.03 | 0.04 | 0.08 | 0.03 | 0.01 | 0.08 | 0.04 | |
| 0.87 ± 0.05 | |||||||||||||||