Abstract
Inspired by information-theoretic analyses of L1 speech and language, this study proposes that L1 and L2 speech exhibit distinct information encoding and transmission profiles in the temporal domain. Both the number and average duration of acoustic syllables (i.e., intensity peaks in the temporal envelope) were automatically measured from L1 and L2 recordings of standard texts in English, French, and Spanish. Across languages, L2 acoustic syllables were greater in number (more acoustic syllables/text) and longer in duration (fewer acoustic syllables/second). While substantial syllable reduction (fewer acoustic than orthographic syllables) was evident in both L1 and L2 speech, L2 speech generally exhibited less syllable reduction, resulting in low information density (more syllables with less information/syllable). Low L2 information density compounded low L2 speech rate, yielding a very low L2 information transmission rate (i.e., less information/second). Overall, this cross-language comparison establishes low information transmission rate as a language-general, distinguishing feature of L2 speech.
Keywords: communication efficiency; information density; second-language speech production; speech rate
Year: 2021 PMID: 35340908 PMCID: PMC8942379 DOI: 10.1017/s1366728921000717
Source DB: PubMed Journal: Biling (Camb Engl) ISSN: 1366-7289
Overview of recordings. See text for detailed explanations.
| | English L1 | English L2 | French L1 | French L2 | Spanish L1 | Spanish L2 |
|---|---|---|---|---|---|---|
| Source | ALLSSTAR | ALLSSTAR | RPD | RPD | RPD | RPD |
| Text | DHR Sentences and NWS Passage | DHR Sentences and NWS Passage | NWS Passage | NWS Passage | NWS Passage | NWS Passage |
| Talkers (N) | 26 | 98 | 14 | 47 | 19 | 23 |
| Average age (years) | 20 | 25 | 31 | 27 | 29 | 31 |
| Female (N) | 14 | 36 | 8 | 37 | 13 | 10 |
| Male (N) | 12 | 62 | 6 | 10 | 6 | 13 |
| Average age of L2 onset in years (range) | -- | 9.4 (0–28) | -- | 9.9 (3–29) | -- | 18 (0–32) |
| L2 proficiency: Beginner (total 5) | -- | 1 | -- | 4 | -- | 0 |
| L2 proficiency: Intermediate (total 67) | -- | 43 | -- | 17 | -- | 7 |
| L2 proficiency: Advanced/Near-L1 (total 68) | -- | 29 | -- | 24 | -- | 15 |
| L2 proficiency: Not available (total 28) | -- | 25 | -- | 2 | -- | 1 |
ALLSSTAR: Northwestern University ALLSSTAR Corpus (total of 248 recordings). RPD: University of Toronto Romance Phonetics Database (total of 103 recordings).
Only two talkers reported an age of L2 acquisition of 0 years, one with Cantonese as the L1 (ALLSSTAR Corpus, L2 English) and one with Tagalog as the L1 (RPD Corpus, L2 Spanish).
Overview of variables. See text for detailed explanations.
| Grouping and identifying variables (see text for further explanation) | |||
|---|---|---|---|
| Variable name | Description | ||
| 1. corpus | ALL (ALLSSTAR) or RPD (Romance Phonetics Database) | ||
| 2. talkerCode | Unique identifier for each talker (alphanumeric for RPD) | ||
| 3. talkerNum | Unique numerical identifier for each talker | ||
| 4. m_f | Male or female | ||
| 5. l1 | Talker’s L1 | ||
| 6. textLang | English, French, or Spanish | ||
| 7. text | NWS or DHR | ||
| 8. group | L1 or L2 | ||
| 9. age | Age in years at the time of recording (self-reported) | ||
| 10. targetProficiency | Beginner, Intermediate, Advanced/Near-L1, L1 | ||
| 11. aoaL2 | Age of L2 acquisition (self-reported) | ||
| Phonetic variables (see text for further explanation) | |||
| Variable name | Unit | Description | |
| 1. nsyll | number | number of acoustic syllables | |
| 2. npause | number | number of silent pauses | |
| 3. dur | seconds | utterance duration | |
| 4. phonationTime | seconds | phonation time (sum of ‘sounding’ segment durations) | |
| 5. speechRate | syll/sec | nsyll per dur (includes pauses) | |
| 6. articulationRate (AR) | syll/sec | nsyll per phonationTime (excludes pauses) | |
| 7. avgSylDur | sec/syll | average syllable duration (1/AR) | |
| Information transmission variables (see text for further explanation) | |||
| Variable name | Formula | Description | |
| 1. avgInfoPerSyl (ID) | 1/nsyll | syllable information density (ID), proportion of total text information encoded per acoustic syllable | |
| 2. avgInfoPerSec (IR) | ID / avgSylDur | syllable information rate (IR), proportion of total text information conveyed per second | |
| 3. syllRed (LOSS) | (nsyll-ortho) / ortho | syllable reduction (LOSS), number of acoustic syllables relative to number of orthographic syllables | |
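
The definitions in the table above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' analysis code: the function and field names are hypothetical, and the 27.0 s phonation time in the example is an invented value chosen only to give a plausible articulation rate. IR is computed here as ID divided by avgSylDur, which gives units of information per second and reproduces the magnitudes reported for IR.

```python
from dataclasses import dataclass


@dataclass
class TransmissionMeasures:
    """Per-recording measures (names are illustrative, not from the paper's code)."""
    articulation_rate: float  # AR, acoustic syllables per second of phonation
    avg_syl_dur: float        # seconds per acoustic syllable (1 / AR)
    info_density: float       # ID, proportion of text information per syllable
    info_rate: float          # IR, proportion of text information per second
    syll_reduction: float     # LOSS, (acoustic - orthographic) / orthographic


def transmission_measures(nsyll: int, phonation_time: float, ortho: int) -> TransmissionMeasures:
    """Derive AR, ID, IR, and LOSS from raw counts and phonation time in seconds."""
    if nsyll <= 0 or ortho <= 0 or phonation_time <= 0:
        raise ValueError("nsyll, ortho, and phonation_time must be positive")
    articulation_rate = nsyll / phonation_time
    avg_syl_dur = phonation_time / nsyll
    info_density = 1 / nsyll
    info_rate = info_density / avg_syl_dur
    syll_reduction = (nsyll - ortho) / ortho
    return TransmissionMeasures(
        articulation_rate, avg_syl_dur, info_density, info_rate, syll_reduction
    )


# Hypothetical recording of the English NWS passage (144 orthographic syllables):
# 126 acoustic syllables produced in an assumed 27.0 s of phonation.
example = transmission_measures(nsyll=126, phonation_time=27.0, ortho=144)
print(f"AR   = {example.articulation_rate:.2f} syll/sec")
print(f"ID   = {100 * example.info_density:.2f} % per syllable")
print(f"IR   = {100 * example.info_rate:.2f} % per second")
print(f"LOSS = {100 * example.syll_reduction:.1f} %")
```

Under these definitions IR reduces algebraically to 1/phonationTime, so for a fixed text a lower IR simply means that more phonation time was needed to convey the same content.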
Number of sentences, words, and orthographic (i.e., phonological) syllables in the NWS passages and DHR sentences.
| Text | Sentences | Words | Orthographic (phonological) syllables |
|---|---|---|---|
| English NWS passage | 5 | 113 | 144 |
| English DHR sentences | 20 | 319 | 522 |
| French NWS passage | 6 | 108 | 165 |
| Spanish NWS passage | 5 | 99 | 184 |
Articulation rate (AR), acoustic syllable duration, number of acoustic syllables, information density (ID), information rate (IR), and acoustic syllable reduction (LOSS) by talker group (L1 versus L2) and by recording text (English DHR, English NWS, French NWS, and Spanish NWS). Data shown are means with standard error of the mean in parentheses. See text for additional explanation for each variable.
| Measure | English DHR Sentences, L1 | English DHR Sentences, L2 | English NWS Passage, L1 | English NWS Passage, L2 | French NWS Passage, L1 | French NWS Passage, L2 | Spanish NWS Passage, L1 | Spanish NWS Passage, L2 |
|---|---|---|---|---|---|---|---|---|
| N | 26 | 98 | 26 | 98 | 14 | 47 | 19 | 23 |
| AR (acoustic syllables/second) | 4.94 (0.05) | 4.18 (0.04) | 4.67 (0.05) | 3.97 (0.04) | 4.95 (0.09) | 4.07 (0.06) | 4.74 (0.11) | 4.43 (0.07) |
| Acoustic syllable duration (seconds) | 0.203 (0.002) | 0.242 (0.003) | 0.215 (0.002) | 0.254 (0.003) | 0.203 (0.004) | 0.248 (0.004) | 0.213 (0.005) | 0.227 (0.004) |
| Number of acoustic syllables | 459 (5.9) | 483 (3.9) | 126 (1.9) | 139 (1.3) | 150 (3.9) | 169 (3.2) | 161 (3.6) | 176 (3.3) |
| ID (%) (info/acoustic syll) | 0.22 (0.003) | 0.21 (0.002) | 0.80 (0.012) | 0.72 (0.007) | 0.67 (0.015) | 0.60 (0.011) | 0.63 (0.014) | 0.57 (0.010) |
| IR (%) (info/second) | 1.08 (0.022) | 0.87 (0.012) | 3.74 (0.060) | 2.88 (0.043) | 3.32 (0.096) | 2.46 (0.065) | 2.98 (0.122) | 2.54 (0.061) |
| Syll. reduction (%) (acoust. vs. ortho) | −12.1 (1.1) | −7.5 (0.7) | −12.7 (1.3) | −3.3 (0.9) | −9.0 (2.3) | 2.3 (1.9) | −12.3 (2.0) | −4.5 (1.8) |
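
The group means above also show how low information density compounds low articulation rate. The sketch below is a rough check on the published means only, not a reanalysis of the talker-level data: it computes the L2/L1 ratio for AR, ID, and IR in each text and compares the IR ratio with the product of the other two. Because a product of group means is not the same as a mean of per-talker products, and the table values are rounded, only approximate agreement should be expected. The dictionary layout and variable names are illustrative.

```python
# Group means copied from the table above: (L1, L2) for each measure and text.
MEANS = {
    "ENG_DHR": {"AR": (4.94, 4.18), "ID": (0.22, 0.21), "IR": (1.08, 0.87)},
    "ENG_NWS": {"AR": (4.67, 3.97), "ID": (0.80, 0.72), "IR": (3.74, 2.88)},
    "FRA_NWS": {"AR": (4.95, 4.07), "ID": (0.67, 0.60), "IR": (3.32, 2.46)},
    "SPA_NWS": {"AR": (4.74, 4.43), "ID": (0.63, 0.57), "IR": (2.98, 2.54)},
}


def l2_over_l1(pair):
    """Return the L2 mean as a fraction of the L1 mean."""
    l1, l2 = pair
    return l2 / l1


ratios = {}
for text, measures in MEANS.items():
    ar = l2_over_l1(measures["AR"])
    id_ = l2_over_l1(measures["ID"])
    ir = l2_over_l1(measures["IR"])
    ratios[text] = {"AR": ar, "ID": id_, "IR": ir, "AR_x_ID": ar * id_}
    print(f"{text}: AR {ar:.2f}, ID {id_:.2f}, AR x ID {ar * id_:.2f}, IR {ir:.2f}")
```

In every text the IR ratio is lower than either the AR ratio or the ID ratio alone, which is the compounding effect described in the abstract.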
Fig. 1. Density plots of articulation rate (AR), information density (ID), information rate (IR) and syllable reduction (LOSS) for the L1 and L2 groups within each recording text (English DHR, English NWS, French NWS, and Spanish NWS). All data are shown on z-transformed scales within their own distributions.
Fig. 2. Density plots of articulation rate (AR), information density (ID), information rate (IR) and syllable reduction (LOSS) by proficiency group (L2 Intermediate, L2 Near-L1/Advanced, and L1) aggregated across texts and languages. All data are shown on z-transformed scales within their own distributions.
Summary of comparisons between models with and without the Group-by-Text interaction term. In all cases the model with the interaction was a significantly better fit (lower AIC) than the additive model.
| Measure | Chi-squared | df | p |
|---|---|---|---|
| AR | 14.78 | 3 | <.003 (**) |
| ID | 16.65 | 3 | <.001 (**) |
| IR | 15.93 | 3 | <.002 (**) |
| LOSS | 15.72 | 3 | <.002 (**) |
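
As a worked check on the table above, the p-value bounds can be recovered from the chi-squared statistics. The sketch assumes the statistics come from likelihood-ratio tests referred to a chi-squared distribution with 3 degrees of freedom (one per added Group-by-Text term); the source does not state the test explicitly. For 3 degrees of freedom the upper tail has the closed form erfc(sqrt(x/2)) + sqrt(2x/pi) * exp(-x/2), so only the standard library is needed. The function name is illustrative.

```python
import math


def chi2_sf_df3(x: float) -> float:
    """Upper-tail probability of a chi-squared variable with 3 degrees of freedom."""
    if x < 0:
        raise ValueError("chi-squared statistic must be non-negative")
    return math.erfc(math.sqrt(x / 2)) + math.sqrt(2 * x / math.pi) * math.exp(-x / 2)


# Chi-squared statistics copied from the table above.
STATISTICS = {"AR": 14.78, "ID": 16.65, "IR": 15.93, "LOSS": 15.72}

p_values = {name: chi2_sf_df3(stat) for name, stat in STATISTICS.items()}
for name, p in p_values.items():
    print(f"{name}: chi2 = {STATISTICS[name]:.2f}, df = 3, p = {p:.4f}")
```

The resulting values fall below the bounds reported in the table (<.003, <.001, <.002, and <.002, respectively).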
Summaries of the best-fit models with the Group-by-Text interaction terms. The referent category in all models is ENG_DHR and L1 for the Text and Group factors, respectively.
| Model | Predictor | Estimate | Std. Error | z value | Pr(>\|z\|) |
|---|---|---|---|---|---|
| AR Gaussian (identity) | (Intercept) | 4.49 | 0.03 | 158.44 | <0.0001 (***) |
| | Text [ENG_NWS] | 0.24 | 0.02 | 9.55 | <0.0001 (***) |
| | Text [FRA_NWS] | −0.19 | 0.07 | −2.68 | 0.007 (**) |
| | Text [SPA_NWS] | −0.07 | 0.08 | −0.90 | 0.368 |
| | Group [L2] | 0.66 | 0.06 | 11.69 | <0.0001 (***) |
| | Text [ENG_NWS] * Group [L2] | 0.07 | 0.05 | 1.39 | 0.165 |
| | Text [FRA_NWS] * Group [L2] | −0.18 | 0.14 | −1.24 | 0.214 |
| | Text [SPA_NWS] * Group [L2] | 0.56 | 0.16 | 3.44 | <0.0001 (***) |
| | Marginal R² / Conditional R² | 0.439 / 0.898 | | | |
| ID Gamma (log) | (Intercept) | −5.30 | 0.01 | −758.5 | <0.0001 (***) |
| | Text [ENG_NWS] | −1.27 | 0.01 | −195.6 | <0.0001 (***) |
| | Text [FRA_NWS] | 0.18 | 0.02 | 10.5 | <0.0001 (***) |
| | Text [SPA_NWS] | 0.06 | 0.02 | 2.8 | <0.005 (**) |
| | Group [L2] | 0.09 | 0.01 | 6.3 | <0.0001 (***) |
| | Text [ENG_NWS] * Group [L2] | −0.05 | 0.01 | −4.1 | <0.0001 (***) |
| | Text [FRA_NWS] * Group [L2] | −0.01 | 0.03 | −0.3 | 0.75 |
| | Text [SPA_NWS] * Group [L2] | 0.03 | 0.04 | 0.7 | 0.51 |
| | Marginal R² / Conditional R² | 0.974 / 0.995 | | | |
| IR Gamma (log) | (Intercept) | −3.81 | 0.01 | −336.2 | <0.0001 (***) |
| | Text [ENG_NWS] | −1.22 | 0.01 | −168.5 | <0.0001 (***) |
| | Text [FRA_NWS] | 0.14 | 0.03 | 5.1 | <0.0001 (***) |
| | Text [SPA_NWS] | 0.04 | 0.03 | 1.2 | 0.242 |
| | Group [L2] | 0.24 | 0.02 | 10.5 | <0.0001 (***) |
| | Text [ENG_NWS] * Group [L2] | −0.05 | 0.01 | −3.2 | <0.005 (**) |
| | Text [FRA_NWS] * Group [L2] | −0.04 | 0.06 | −0.8 | 0.4522 |
| | Text [SPA_NWS] * Group [L2] | 0.16 | 0.06 | 2.4 | <0.02 (*) |
| | Marginal R² / Conditional R² | 0.935 / 0.994 | | | |
| LOSS Gaussian (identity) | (Intercept) | −0.07 | 0.01 | −10.66 | <0.0001 (***) |
| | Text [ENG_NWS] | −0.02 | 0.01 | −2.81 | <0.01 (**) |
| | Text [FRA_NWS] | −0.05 | 0.02 | −2.72 | <0.01 (**) |
| | Text [SPA_NWS] | 0.05 | 0.02 | 2.56 | <0.02 (*) |
| | Group [L2] | −0.08 | 0.01 | −5.99 | <0.0001 (***) |
| | Text [ENG_NWS] * Group [L2] | 0.05 | 0.01 | 3.87 | <0.0001 (***) |
| | Text [FRA_NWS] * Group [L2] | 0.02 | 0.03 | 0.52 | 0.6 |
| | Text [SPA_NWS] * Group [L2] | −0.03 | 0.04 | −0.86 | 0.39 |
| | Marginal R² / Conditional R² | 0.199 / 0.841 | | | |
Pairwise comparisons of the estimated means between L1 and L2 within each text.
| Measure | Text (L1 vs L2) | Estimate | SE | df | t ratio | p value |
|---|---|---|---|---|---|---|
| AR | ENG_DHR | 0.77 | 0.08 | 341 | 9.27 | <.0001 (***) |
| | ENG_NWS | 0.70 | 0.08 | 341 | 8.44 | <.0001 (***) |
| | FRA_NWS | 0.87 | 0.11 | 341 | 7.65 | <.0001 (***) |
| | SPA_NWS | 0.31 | 0.12 | 341 | 2.69 | 0.01 (*) |
| ID | ENG_DHR | 0.05 | 0.02 | 341 | 2.40 | 0.02 (*) |
| | ENG_NWS | 0.10 | 0.02 | 341 | 4.98 | <.0001 (***) |
| | FRA_NWS | 0.11 | 0.03 | 341 | 4.00 | 0.0001 (***) |
| | SPA_NWS | 0.09 | 0.03 | 341 | 3.00 | 0.01 (*) |
| IR | ENG_DHR | 0.22 | 0.03 | 341 | 6.75 | <.0001 (***) |
| | ENG_NWS | 0.27 | 0.03 | 341 | 8.18 | <.0001 (***) |
| | FRA_NWS | 0.31 | 0.05 | 341 | 6.85 | <.0001 (***) |
| | SPA_NWS | 0.15 | 0.05 | 341 | 3.33 | 0.001 (**) |
| LOSS | ENG_DHR | −0.05 | 0.02 | 341 | −2.24 | <0.03 (*) |
| | ENG_NWS | −0.09 | 0.02 | 341 | −4.69 | <.0001 (***) |
| | FRA_NWS | −0.11 | 0.03 | 341 | −4.04 | 0.0001 (***) |
| | SPA_NWS | −0.08 | 0.03 | 341 | −2.76 | <0.01 (*) |