| Literature DB >> 28397801 |
Yoshitaka Nakajima1, Kazuo Ueda1, Shota Fujimaru2, Hirotoshi Motomura2, Yuki Ohsaka3.
Abstract
Acoustic analyses of eight different languages/dialects had revealed a language universal: Three spectral factors consistently appeared in analyses of power fluctuations of spoken sentences divided by critical-band filters into narrow frequency bands. Examining linguistic implications of these factors seems important to understand how speech sounds carry linguistic information. Here we show the three general categories of the English phonemes, i.e., vowels, sonorant consonants, and obstruents, to be discriminable in the Cartesian space constructed by these factors: A factor related to frequency components above 3,300 Hz was associated only with obstruents (e.g., /k/ or /z/), and another factor related to frequency components around 1,100 Hz only with vowels (e.g., /a/ or /i/) and sonorant consonants (e.g., /w/, /r/, or /m/). The latter factor highly correlated with the hypothetical concept of sonority or aperture in phonology. These factors turned out to connect the linguistic and acoustic aspects of speech sounds systematically.Entities:
Mesh:
Year: 2017 PMID: 28397801 PMCID: PMC5387398 DOI: 10.1038/srep46049
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Distribution of uttered phonemes in the three-dimensional factor space.
The three phonological categories (vowels, sonorant consonants, and obstruents) are differentiated. The panel (d) shows how the three-dimensional configuration looks if viewed from above-right in the panel (a); the horizontal axis is derived from the combination of the mid-low factor and the high factor, calculating , where x signifies the coordinate of the mid-low factor, and y that of the high factor.
Figure 2Configuration of British English phonemes obtained by averaging all uttered samples of each phoneme in the three-dimensional factor space.
Each point indicated by an International Phonetic Alphabet symbol represents 27–2172 samples. The curve shows a fitting by eye of a sonority/aperture scale as in the linguistics literature171415. The direction of this view is the same as in Fig. 1d.
Previously proposed sonority (aperture) scales.
| Sonority (aperture) scale | Proposed category | Corresponding English phonemes |
|---|---|---|
| 6 | a | /æ, αː,Λ |
| 5 | e, o, ö | /e, ɔ, ɔː/ |
| 4 | i, u, ü | /I, iː, ℧, uː, j, w/ |
| 3 | Liquids | /l, r/ |
| 2 | Nasals | /m, n, ŋ/ |
| 1 | Fricatives | /θ, ð, f, v, s, z, ʃ, ʒ/ |
| 0 | Occlusives | /p, t, k, b, d, g/ |
| 10 | a | /æ, αː, Λ/ |
| 9 | e, o | /e, ɔ, ɔː/ |
| 8 | i, u | /I, iː,℧,uː/ |
| 7 | r | /r/ |
| 6 | l | /l/ |
| 5 | m, n | /m, n/ |
| 4 | s | /s/ |
| 3 | v, z, ð | /v, z, ð/ |
| 2 | f, θ | /f, θ/ |
| 1 | b, d, g | /b, d, g/ |
| 0.5 | p, t, k | /p, t, k/ |
| 5 | Low vowels | /æ, αː Λ, e, ɔ, ɔː/ |
| 4 | High vowels and glides | /I, iː, ℧, uː, j, w/ |
| 3 | Liquids | /l, r/ |
| 2 | Nasals | /m, n, ŋ/ |
| 1 | Fricatives | /θ, ð, f, v, s, z, ʃ, ʒ, h/ |
| 0 | Plosives | /p, t, k, b, d, g/ |
| 5 | Vowels | /æ, αː, Λ, e, ɔ, %ː, ə, əː, I, iː, ℧, uː, aI, a℧, eə, eI, ɔI, ə℧, Iə, ℧ə/ |
| 4 | Glides | /j, w/ |
| 3 | Liquids | /l, r/ |
| 2 | Nasals | /m, n, ŋ/ |
| 1 | Fricatives and affricates | /θ, ð, f, v, s, z, ʃ, ʒ, h, tʃ, dʒ/ |
| 0 | Plosives | /p, t, k, b, d, g/ |
Spearman’s rank-order correlation coefficients between the sonority/aperture proposed in the linguistics literature171415 and the factor scores obtained in the present analysis, averaged over the same phonemes.
| Sonority/aperture scale | Factors | ||
|---|---|---|---|
| de Saussure (1916/1959) | 0.3415 | 0.8251* | −0.3597* |
| Selkirk (1984) | 0.3025 | 0.8708* | −0.2840 |
| Harris (1994) | 0.3691* | 0.8218* | −0.3863* |
| Spencer (1996) | 0.5380* | 0.8347* | −0.4549* |
Asterisks represent statistically significant correlation (p < 0.05).