| Literature DB >> 35664509 |
Simon Roessig1, Bodo Winter2, Doris Mücke1.
Abstract
Focus is known to be expressed by a wide range of phonetic cues but only a few studies have explicitly compared different phonetic variables within the same experiment. Therefore, we presented results from an analysis of 19 phonetic variables conducted on a data set of the German language that comprises the opposition of unaccented (background) vs. accented (in focus), as well as different focus types with the nuclear accent on the same syllable (broad, narrow, and contrastive focus). The phonetic variables are measures of the acoustic and articulographic signals of a target syllable. Overall, our results provide the highest number of reliable effects and largest effect sizes for accentuation (unaccented vs. accented), while the differentiation of focus types with accented target syllables (broad, narrow, and contrastive focus) are more subtle. The most important phonetic variables across all conditions are measures of the fundamental frequency. The articulatory variables and their corresponding acoustic formants reveal lower tongue positions for both vowels /o, a/, and larger lip openings for the vowel /a/ under increased prosodic prominence with the strongest effects for accentuation. While duration exhibits consistent mid-ranked results for both accentuation and the differentiation of focus types, measures related to intensity are particularly important for accentuation. Furthermore, voice quality and spectral tilt are affected by accentuation but also in the differentiation of focus types. Our results confirm that focus is realized via multiple phonetic cues. Additionally, the present analysis allows a comparison of the relative importance of different measures to better understand the phonetic space of focus marking.Entities:
Keywords: focus; information structure; intonation; prosody; speech production
Year: 2022 PMID: 35664509 PMCID: PMC9160369 DOI: 10.3389/frai.2022.842546
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
Figure 1Examples for all four conditions.
Descriptive means of all variables.
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| |
| F0 mean | 0.14 | 2.47 | 2.90 | 3.15 | 0.79 | 3.90 | 4.38 | 4.56 |
| Tonal onglide | N/A | −0.79 | 0.99 | 2.41 | N/A | 0.44 | 1.82 | 3.30 |
| Peak alignment | N/A | 6.84 | 82.98 | 124.73 | N/A | 14.04 | 70.79 | 103.13 |
| Target height | N/A | 2.85 | 3.75 | 4.37 | N/A | 4.39 | 5.20 | 5.78 |
| Synchrony | N/A | −1.59 | −0.51 | 0.72 | N/A | −0.72 | 0.45 | 1.56 |
| Vowel duration (ms) | 154.46 | 170.02 | 174.75 | 176.88 | 123.84 | 136.13 | 141.31 | 140.74 |
| RMS amplitude | 85.47 | 120.70 | 119.73 | 122.67 | 95.00 | 142.94 | 146.49 | 147.60 |
| Periodic energy | 124.59 | 187.10 | 199.53 | 196.53 | 111.79 | 169.19 | 183.26 | 178.38 |
| H1*-A3* (dB) | 16.31 | 15.83 | 15.85 | 14.71 | 10.25 | 7.89 | 7.61 | 6.42 |
| H1*-H2* (dB) | 6.51 | 8.59 | 9.83 | 10.76 | 6.77 | 7.68 | 8.29 | 8.59 |
| HNR | 26.83 | 28.76 | 28.86 | 28.52 | 26.84 | 28.62 | 28.72 | 27.95 |
| Formant 1 | −53.91 | 6.84 | 20.75 | 26.91 | −14.35 | 4.71 | 4.12 | 5.84 |
| Formant 2 | −2.54 | −3.20 | −0.10 | 4.37 | 38.39 | −2.68 | −17.95 | −22.03 |
| Lip aperture | 2.06 | 3.44 | 3.51 | 4.06 | −3.60 | −3.31 | −3.16 | −3.02 |
| Vertical tongue position | −2.22 | −2.76 | −3.06 | −3.26 | 3.29 | 2.86 | 2.70 | 2.45 |
| Horizontal tongue position | 0.98 | 1.14 | 1.11 | 0.94 | −0.62 | −0.98 | −1.22 | −1.35 |
| Tongue peak velocity | 150.88 | 152.99 | 155.84 | 159.44 | 128.26 | 134.81 | 136.40 | 140.93 |
| Tongue time to peak velocity | 107.03 | 111.89 | 112.28 | 112.12 | 110.16 | 114.79 | 117.11 | 118.01 |
| Tongue gesture duration | 216.38 | 227.32 | 224.33 | 227.38 | 210.56 | 223.32 | 231.16 | 230.54 |
Tabular overview of modeling results for unaccented vs. accented, vowel /a/. LCI and HCI refer to the low and high boundaries of the 90% Credible Interval.
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|
| F0 mean | 0.90 | 0.18 | 0.60 | 1.19 | 1.00 | 68.94 | 21.67 | 1.50 |
| Periodic energy mass | 0.84 | 0.14 | 0.60 | 1.06 | 1.00 | 65.15 | 18.10 | 1.69 |
| Formant 1 | 0.66 | 0.11 | 0.48 | 0.83 | 1.00 | 19.86 | 10.93 | 2.00 |
| RMS amplitude | 0.53 | 0.08 | 0.40 | 0.66 | 1.00 | 92.16 | 7.16 | 0.59 |
| Lip aperture | 0.52 | 0.08 | 0.39 | 0.64 | 1.00 | 56.43 | 6.73 | 1.36 |
| Vowel duration | 0.50 | 0.09 | 0.36 | 0.65 | 1.00 | 71.55 | 6.38 | 0.66 |
| HNR | 0.34 | 0.11 | 0.15 | 0.53 | 1.00 | 50.93 | 2.96 | 0.51 |
| Vertical tongue position | −0.28 | 0.09 | −0.43 | −0.13 | 0.00 | 31.82 | 1.95 | 0.53 |
| H1*-H2* | 0.28 | 0.10 | 0.12 | 0.45 | 1.00 | 32.08 | 2.01 | 0.49 |
| Tongue gesture duration | 0.24 | 0.10 | 0.08 | 0.40 | 0.99 | 45.66 | 1.43 | 0.37 |
| Tongue time to peak velocity | 0.13 | 0.12 | −0.06 | 0.32 | 0.88 | 58.23 | 0.45 | 0.34 |
| Horizontal tongue position | 0.07 | 0.09 | −0.08 | 0.22 | 0.79 | 37.73 | 0.17 | 0.14 |
| Tongue peak velocity | 0.06 | 0.07 | −0.05 | 0.17 | 0.83 | 73.05 | 0.11 | 0.08 |
| H1*-A3* | −0.04 | 0.09 | −0.19 | 0.10 | 0.30 | 35.91 | 0.11 | 0.09 |
| Formant 2 | 0.00 | 0.12 | −0.19 | 0.19 | 0.50 | 41.76 | 0.15 | 0.01 |
Figure 2Modeling results for unaccented vs. accented. Top: vowel /a/, bottom: vowel /o/. The whiskers indicate the 90% Credible Interval.
Tabular overview of modeling results for unaccented vs. accented, vowel /o/. LCI and HCI refer to the low and high boundaries of the 90% Credible Interval.
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|
| F0 mean | 1.10 | 0.16 | 0.82 | 1.36 | 1.00 | 86.44 | 32.91 | 1.74 |
| Periodic energy mass | 0.84 | 0.14 | 0.61 | 1.06 | 1.00 | 68.08 | 18.34 | 1.77 |
| RMS amplitude | 0.62 | 0.09 | 0.48 | 0.77 | 1.00 | 91.04 | 9.91 | 0.70 |
| Vowel duration | 0.40 | 0.09 | 0.25 | 0.56 | 1.00 | 69.61 | 4.11 | 0.57 |
| Formant 1 | 0.38 | 0.17 | 0.09 | 0.66 | 0.98 | 24.66 | 3.73 | 0.85 |
| HNR | 0.31 | 0.09 | 0.17 | 0.46 | 1.00 | 57.19 | 2.48 | 0.42 |
| Tongue gesture duration | 0.28 | 0.09 | 0.13 | 0.42 | 1.00 | 56.37 | 1.91 | 0.37 |
| H1*-A3* | −0.26 | 0.11 | −0.43 | −0.08 | 0.01 | 30.85 | 1.66 | 0.55 |
| Vertical tongue position | −0.22 | 0.10 | −0.37 | −0.06 | 0.01 | 26.48 | 1.21 | 0.40 |
| H1*-H2* | 0.18 | 0.09 | 0.03 | 0.33 | 0.97 | 30.13 | 0.79 | 0.31 |
| Lip aperture | 0.17 | 0.08 | 0.03 | 0.31 | 0.98 | 34.60 | 0.72 | 0.31 |
| Tongue peak velocity | 0.15 | 0.06 | 0.04 | 0.25 | 0.99 | 69.15 | 0.54 | 0.19 |
| Formant 2 | −0.15 | 0.13 | −0.35 | 0.06 | 0.12 | 15.35 | 0.57 | 0.52 |
| Tongue time to peak velocity | 0.12 | 0.07 | 0.00 | 0.24 | 0.95 | 52.73 | 0.35 | 0.22 |
| Horizontal tongue position | −0.12 | 0.08 | −0.26 | 0.01 | 0.06 | 40.05 | 0.39 | 0.31 |
Conditional and marginal R2 of the focus types models (expressed in %).
|
|
| ||||
|---|---|---|---|---|---|
|
|
|
|
| ||
| Peak alignment | 43.68 | 11.50 | Tonal onglide | 63.07 | 11.38 |
| Tonal onglide | 54.63 | 10.41 | Peak alignment | 48.37 | 9.38 |
| Synchrony | 48.04 | 6.73 | Synchrony | 49.62 | 8.53 |
| Target height | 71.91 | 3.40 | Target height | 83.01 | 2.81 |
| H1*-H2* | 36.99 | 1.94 | Vertical tongue position | 32.86 | 1.01 |
| Vertical tongue position | 30.50 | 1.36 | F0 mean | 83.71 | 0.95 |
| Formant 1 | 8.85 | 1.19 | Periodic energy mass | 63.87 | 0.90 |
| Lip aperture | 57.06 | 1.13 | Tongue gesture duration | 59.58 | 0.69 |
| F0 mean | 72.78 | 1.06 | H1*-H2* | 40.13 | 0.65 |
| Vowel duration | 71.59 | 0.83 | Vowel duration | 70.89 | 0.62 |
| Periodic energy mass | 60.50 | 0.77 | H1*-A3* | 34.09 | 0.62 |
| H1*-A3* | 44.98 | 0.41 | Horizontal tongue position | 39.65 | 0.48 |
| Tongue peak velocity | 73.35 | 0.33 | HNR | 61.61 | 0.43 |
| Horizontal tongue position | 39.42 | 0.28 | Lip aperture | 36.49 | 0.39 |
| HNR | 51.74 | 0.22 | Formant 2 | 12.90 | 0.29 |
| Tongue gesture duration | 49.08 | 0.19 | Tongue peak velocity | 67.49 | 0.26 |
| Formant 2 | 40.94 | 0.18 | Formant 1 | 4.09 | 0.25 |
| Tongue time to peak velocity | 59.32 | 0.12 | Tongue time to peak velocity | 58.33 | 0.19 |
| RMS amplitude | 91.09 | 0.06 | RMS amplitude | 88.62 | 0.17 |
Tabular overview of modeling results for broad vs. contrastive focus, vowel /a/. LCI and HCI refer to the low and high boundaries of the 90% Credible Interval.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| Peak alignment | 0.81 | 0.13 | 0.59 | 1.01 | 1.00 | 1.41 |
| Tonal onglide | 0.77 | 0.15 | 0.53 | 1.01 | 1.00 | 1.28 |
| Synchrony | 0.63 | 0.11 | 0.44 | 0.81 | 1.00 | 1.07 |
| Target height | 0.44 | 0.09 | 0.30 | 0.58 | 1.00 | 0.56 |
| H1*-H2* | 0.33 | 0.09 | 0.19 | 0.47 | 1.00 | 0.55 |
| Vertical tongue position | −0.27 | 0.09 | −0.41 | −0.13 | 0.00 | 0.47 |
| F0 mean | 0.24 | 0.08 | 0.11 | 0.37 | 1.00 | 0.30 |
| Formant 1 | 0.24 | 0.10 | 0.08 | 0.40 | 0.99 | 0.87 |
| Lip aperture | 0.24 | 0.07 | 0.13 | 0.34 | 1.00 | 0.56 |
| Vowel duration | 0.21 | 0.05 | 0.13 | 0.30 | 1.00 | 0.26 |
| Periodic energy mass | 0.15 | 0.06 | 0.04 | 0.25 | 0.99 | 0.25 |
| Tongue peak velocity | 0.13 | 0.06 | 0.04 | 0.22 | 0.99 | 0.15 |
| H1*-A3* | −0.11 | 0.08 | −0.24 | 0.02 | 0.07 | 0.18 |
| Horizontal tongue position | −0.09 | 0.08 | −0.22 | 0.05 | 0.14 | 0.16 |
| Formant 2 | 0.06 | 0.08 | −0.07 | 0.18 | 0.77 | 0.23 |
| RMS amplitude | 0.05 | 0.03 | 0.00 | 0.10 | 0.94 | 0.05 |
| HNR | −0.03 | 0.09 | −0.19 | 0.12 | 0.35 | 0.06 |
| Tongue gesture duration | 0.01 | 0.08 | −0.12 | 0.13 | 0.55 | 0.00 |
| Tongue time to peak velocity | 0.01 | 0.07 | −0.11 | 0.13 | 0.53 | 0.01 |
Figure 3Modeling results for broad vs. contrastive focus. Top: vowel /a/, bottom: vowel /o/. The whiskers indicate the 90% Credible Interval.
Tabular overview of modeling results for broad vs. contrastive focus, vowel /o/. LCI and HCI refer to the low and high boundaries of the 90% Credible Interval.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| Tonal onglide | 0.81 | 0.15 | 0.56 | 1.04 | 1.00 | 1.25 |
| Peak alignment | 0.73 | 0.16 | 0.47 | 0.98 | 1.00 | 1.26 |
| Synchrony | 0.71 | 0.12 | 0.51 | 0.89 | 1.00 | 1.18 |
| Target height | 0.40 | 0.08 | 0.26 | 0.54 | 1.00 | 0.49 |
| Vertical tongue position | −0.23 | 0.08 | −0.37 | −0.10 | 0.00 | 0.44 |
| F0 mean | 0.22 | 0.07 | 0.10 | 0.34 | 1.00 | 0.27 |
| H1*-H2* | 0.17 | 0.08 | 0.05 | 0.30 | 0.99 | 0.28 |
| Tongue gesture duration | 0.16 | 0.08 | 0.03 | 0.28 | 0.98 | 0.21 |
| H1*-A3* | −0.16 | 0.08 | −0.30 | −0.03 | 0.03 | 0.37 |
| Vowel duration | 0.15 | 0.06 | 0.05 | 0.25 | 0.99 | 0.22 |
| Periodic energy mass | 0.15 | 0.07 | 0.05 | 0.26 | 0.99 | 0.33 |
| Horizontal tongue position | −0.14 | 0.08 | −0.27 | −0.02 | 0.03 | 0.33 |
| Lip aperture | 0.13 | 0.08 | −0.01 | 0.26 | 0.94 | 0.27 |
| HNR | −0.12 | 0.09 | −0.26 | 0.02 | 0.09 | 0.13 |
| Tongue peak velocity | 0.10 | 0.06 | 0.01 | 0.20 | 0.96 | 0.12 |
| RMS amplitude | 0.08 | 0.06 | −0.01 | 0.17 | 0.94 | 0.09 |
| Tongue time to peak velocity | 0.07 | 0.07 | −0.04 | 0.19 | 0.85 | 0.19 |
| Formant 2 | −0.07 | 0.09 | −0.22 | 0.08 | 0.21 | 0.29 |
| Formant 1 | 0.03 | 0.10 | −0.13 | 0.19 | 0.61 | 0.14 |
Tabular overview of modeling results for broad vs. narrow focus, vowel /a/. LCI and HCI refer to the low and high boundaries of the 90% Credible Interval.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| Peak alignment | 0.52 | 0.09 | 0.38 | 0.67 | 1.00 | 0.88 |
| Tonal onglide | 0.42 | 0.11 | 0.24 | 0.59 | 1.00 | 0.73 |
| Synchrony | 0.28 | 0.10 | 0.12 | 0.43 | 1.00 | 0.50 |
| Target height | 0.26 | 0.08 | 0.13 | 0.38 | 1.00 | 0.34 |
| Periodic energy mass | 0.20 | 0.06 | 0.09 | 0.30 | 1.00 | 0.31 |
| H1*-H2* | 0.19 | 0.08 | 0.06 | 0.31 | 0.99 | 0.32 |
| Formant 1 | 0.17 | 0.10 | 0.01 | 0.33 | 0.96 | 0.60 |
| Vertical tongue position | −0.15 | 0.09 | −0.30 | −0.01 | 0.04 | 0.30 |
| F0 mean | 0.15 | 0.07 | 0.04 | 0.26 | 0.99 | 0.19 |
| Vowel duration | 0.14 | 0.05 | 0.05 | 0.22 | 0.99 | 0.17 |
| Tongue gesture duration | −0.05 | 0.08 | −0.17 | 0.07 | 0.23 | 0.09 |
| Lip aperture | 0.04 | 0.06 | −0.07 | 0.15 | 0.74 | 0.09 |
| Tongue peak velocity | 0.04 | 0.05 | −0.05 | 0.13 | 0.78 | 0.06 |
| HNR | 0.04 | 0.07 | −0.08 | 0.16 | 0.71 | 0.04 |
| Formant 2 | 0.03 | 0.08 | −0.09 | 0.15 | 0.66 | 0.10 |
| Tongue time to peak velocity | 0.02 | 0.07 | −0.09 | 0.13 | 0.64 | 0.02 |
| RMS amplitude | 0.02 | 0.03 | −0.04 | 0.07 | 0.69 | 0.02 |
| Horizontal tongue position | −0.01 | 0.08 | −0.13 | 0.12 | 0.47 | 0.02 |
| H1*-A3* | 0.01 | 0.07 | −0.11 | 0.13 | 0.58 | 0.00 |
Figure 4Modeling results for broad vs narrow focus. Top: vowel /a/, bottom: vowel /o/. The whiskers indicate the 90% Credible Interval.
Tabular overview of modeling results for broad vs. narrow focus, vowel /o/. LCI and HCI refer to the low and high boundaries of the 90% Credible Interval.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| Peak alignment | 0.46 | 0.11 | 0.28 | 0.64 | 1.00 | 0.72 |
| Tonal onglide | 0.37 | 0.12 | 0.17 | 0.56 | 1.00 | 0.69 |
| Synchrony | 0.35 | 0.09 | 0.21 | 0.50 | 1.00 | 0.61 |
| Target height | 0.24 | 0.07 | 0.11 | 0.36 | 1.00 | 0.30 |
| Periodic energy mass | 0.22 | 0.06 | 0.11 | 0.32 | 1.00 | 0.44 |
| Tongue gesture duration | 0.17 | 0.07 | 0.05 | 0.28 | 0.99 | 0.21 |
| F0 mean | 0.17 | 0.06 | 0.07 | 0.27 | 1.00 | 0.20 |
| Vowel duration | 0.16 | 0.06 | 0.06 | 0.27 | 0.99 | 0.23 |
| H1*-H2* | 0.11 | 0.09 | −0.03 | 0.25 | 0.90 | 0.18 |
| Vertical tongue position | −0.10 | 0.08 | −0.23 | 0.03 | 0.10 | 0.20 |
| Horizontal tongue position | −0.10 | 0.08 | −0.23 | 0.03 | 0.10 | 0.20 |
| RMS amplitude | 0.07 | 0.04 | 0.01 | 0.14 | 0.96 | 0.08 |
| Formant 2 | −0.06 | 0.10 | −0.22 | 0.10 | 0.26 | 0.18 |
| Lip aperture | 0.05 | 0.08 | −0.08 | 0.18 | 0.74 | 0.15 |
| Tongue time to peak velocity | 0.05 | 0.07 | −0.07 | 0.17 | 0.76 | 0.15 |
| Tongue peak velocity | 0.02 | 0.06 | −0.08 | 0.12 | 0.65 | 0.03 |
| Formant 1 | −0.02 | 0.10 | −0.18 | 0.14 | 0.43 | 0.01 |
| H1*-A3* | −0.02 | 0.08 | −0.15 | 0.11 | 0.39 | 0.06 |
| HNR | 0.02 | 0.06 | −0.09 | 0.12 | 0.62 | 0.05 |
Tabular overview of modeling results for narrow versus contrastive focus, vowel /a/. LCI and HCI refer to the low and high boundaries of the 90% Credible Interval.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| Tonal onglide | 0.35 | 0.11 | 0.18 | 0.53 | 1.00 | 0.53 |
| Synchrony | 0.35 | 0.10 | 0.19 | 0.51 | 1.00 | 0.53 |
| Peak alignment | 0.28 | 0.11 | 0.10 | 0.46 | 0.99 | 0.57 |
| Lip aperture | 0.19 | 0.07 | 0.08 | 0.31 | 1.00 | 0.49 |
| Target height | 0.19 | 0.07 | 0.07 | 0.30 | 0.99 | 0.21 |
| H1*-H2* | 0.14 | 0.09 | 0.00 | 0.29 | 0.95 | 0.24 |
| Vertical tongue position | −0.12 | 0.10 | −0.28 | 0.05 | 0.12 | 0.21 |
| H1*-A3* | −0.12 | 0.08 | −0.26 | 0.01 | 0.07 | 0.18 |
| Tongue peak velocity | 0.09 | 0.06 | −0.01 | 0.18 | 0.93 | 0.10 |
| F0 mean | 0.09 | 0.07 | −0.02 | 0.20 | 0.91 | 0.10 |
| Horizontal tongue position | −0.08 | 0.09 | −0.23 | 0.06 | 0.17 | 0.14 |
| Vowel duration | 0.08 | 0.06 | −0.02 | 0.17 | 0.91 | 0.09 |
| Formant 1 | 0.07 | 0.10 | −0.10 | 0.24 | 0.77 | 0.23 |
| HNR | −0.07 | 0.10 | −0.23 | 0.08 | 0.21 | 0.10 |
| Tongue gesture duration | 0.06 | 0.08 | −0.07 | 0.20 | 0.78 | 0.09 |
| Periodic energy mass | −0.05 | 0.07 | −0.16 | 0.07 | 0.24 | 0.07 |
| Formant 2 | 0.03 | 0.08 | −0.11 | 0.16 | 0.63 | 0.17 |
| RMS amplitude | 0.03 | 0.04 | −0.02 | 0.09 | 0.83 | 0.03 |
| Tongue time to peak velocity | −0.02 | 0.08 | −0.14 | 0.11 | 0.41 | 0.02 |
Figure 5Modeling results for narrow vs. contrastive focus. Top: vowel /a/, bottom: vowel /o/. The whiskers indicate the 90% Credible Interval.
Tabular overview of modeling results for narrow vs. contrastive focus, vowel /o/. LCI and HCI refer to the low and high boundaries of the 90% Credible Interval.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| Tonal onglide | 0.44 | 0.11 | 0.26 | 0.62 | 1.00 | 0.59 |
| Synchrony | 0.35 | 0.11 | 0.17 | 0.53 | 1.00 | 0.54 |
| Peak alignment | 0.27 | 0.12 | 0.06 | 0.47 | 0.98 | 0.58 |
| Target height | 0.17 | 0.06 | 0.07 | 0.27 | 1.00 | 0.18 |
| H1*-A3* | −0.14 | 0.09 | −0.28 | 0.01 | 0.06 | 0.28 |
| HNR | −0.14 | 0.09 | −0.28 | 0.01 | 0.06 | 0.18 |
| Vertical tongue position | −0.13 | 0.09 | −0.27 | 0.01 | 0.07 | 0.23 |
| Tongue peak velocity | 0.08 | 0.07 | −0.03 | 0.19 | 0.89 | 0.09 |
| Lip aperture | 0.08 | 0.09 | −0.07 | 0.22 | 0.81 | 0.12 |
| H1*-H2* | 0.07 | 0.09 | −0.08 | 0.22 | 0.78 | 0.10 |
| Periodic energy mass | −0.06 | 0.07 | −0.18 | 0.05 | 0.17 | 0.11 |
| Horizontal tongue position | −0.05 | 0.09 | −0.18 | 0.10 | 0.29 | 0.11 |
| F0 mean | 0.05 | 0.07 | −0.06 | 0.16 | 0.80 | 0.06 |
| Formant 1 | 0.04 | 0.11 | −0.13 | 0.22 | 0.67 | 0.16 |
| Tongue time to peak velocity | 0.02 | 0.08 | −0.10 | 0.15 | 0.63 | 0.04 |
| Tongue gesture duration | −0.01 | 0.08 | −0.15 | 0.12 | 0.44 | 0.02 |
| Vowel duration | −0.01 | 0.07 | −0.12 | 0.09 | 0.41 | 0.01 |
| Formant 2 | −0.01 | 0.10 | −0.18 | 0.16 | 0.46 | 0.05 |
| RMS amplitude | 0.01 | 0.05 | −0.07 | 0.09 | 0.61 | 0.01 |