| Literature DB >> 32243455 |
Theresa Matzinger1,2, Nikolaus Ritt1, W Tecumseh Fitch2.
Abstract
When speaking a foreign language, non-native speakers can typically be readily identified by their accents. But which aspects of the speech signal determine such accents? Speech pauses occur in all languages but may nonetheless vary in different languages with regard to their duration, number or positions in the speech stream, and therefore are one potential contributor to foreign speech production. The aim of this study was therefore to investigate whether non-native speakers pause 'with a foreign accent'. We recorded native English speakers and non-native speakers of German or Serbo-Croatian with excellent English reading out an English text at three different speech rates, and analyzed their vocal output in terms of number, duration and location of pauses. Overall, all non-native speakers were identified by native raters as having non-native accents, but native and non-native speakers made pauses that were similarly long, and had similar ratios of pause time compared to total speaking time. Furthermore, all speakers changed their pausing behavior similarly at different speech rates. The only clear difference between native and non-native speakers was that the latter made more pauses than the native speakers. Thus, overall, pause patterns contributed little to the acoustic characteristics of speakers' non-native accents, when reading aloud. Non-native pause patterns might be acquired more easily than other aspects of pronunciation because pauses are perceptually salient and producing pauses is easy. Alternatively, general cognitive processing mechanisms such as attention, planning or memory may constrain pausing behavior, allowing speakers to transfer their native pause patterns to a second language without significant deviation. We conclude that pauses make a relatively minor contribution to the acoustic characteristics of non-native accents.Entities:
Year: 2020 PMID: 32243455 PMCID: PMC7124187 DOI: 10.1371/journal.pone.0230710
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Distribution of phonetic silences and true pauses.
Dashed line = threshold chosen for the subsequent automatic detection of pauses (0.1 s).
Mean pause duration and mean proportion of short (< 0.1 s), medium (0.1 < x < 0.2 s) and long (> 0.2 s) phonetic silences and true pauses with respective low and high confidence intervals (CIs).
| Duration [s] | Low CI | High CI | Short [%x] | Low CI | High CI | Medium [%] | Low CI | High CI | Long [%] | Low CI | High CI | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Phonetic silences | 0.047 | 0.033 | 0.061 | 90.39 | 80.89 | 99.9 | 9.29 | 0.44 | 18.14 | 0.32 | -0.5 | 1.13 |
| True pauses | 0.404 | 0.301 | 0.506 | 0 | 0 | 0 | 6.86 | -0.18 | 13.91 | 93.14 | 86.09 | 100.18 |
Results of the linear mixed model exploring the effects of reading tempo and nativeness on the total reading time (log-transformed).
The table reports estimated model coefficients, standard errors (SE) and lower and upper confidence intervals (CI), χ2 values of likelihood ratio tests and respective degrees of freedom (df) and p-values (P).
| Term | Estimate | SE | lower CI | upper CI | χ2 | df | P |
|---|---|---|---|---|---|---|---|
| 4.07 | 0.03 | 4.02 | 4.13 | ||||
| 0.51 | 0.04 | 0.43 | 0.60 | 163.67 | 1 | < 0.001 | |
| 0.11 | 0.03 | 0.05 | 0.18 | 10.21 | 1 | 0.001 | |
| -0.02 | 0.05 | -0.18 | 0.08 | 0.11 | 1 | 0.740 |
a not shown because of having a very limited interpretation
b Reading tempo and nativeness were deviation coded: reading tempo was coded as a continuous predictor (fast = -0.5, casual = 0, slow = +0.5), and nativeness was coded as a two-level factor (native = -0.5, non-native = +0.5); the indicated tests were obtained from likelihood ratio tests comparing the full with a reduced model lacking reading tempo, nativeness and the interaction, respectively.
Fig 2a) total duration of the readings in seconds, b) pause-to-utterance ratio in %, c) duration of individual pauses in seconds and d) number of pauses in each condition for each native language. The violin plots show median values (horizontal black lines) with first and third quartiles (lower and upper end of boxes), minimum and maximum values limited to values no more than 1.5 IQRs distant from the respective end of the box (lower and upper end of vertical black lines) and outliers (black dots). The area around each box indicates the distribution of the data.
Results of the linear mixed model exploring the effects of reading tempo and native language on pause-to-utterance ratio (logit-transformed).
The table reports estimated model coefficients, standard errors (SE) and lower and upper confidence intervals (CI), χ2 values of likelihood ratio tests and respective degrees of freedom (df) and p-values (P).
| Term | Estimate | SE | lower CI | upper CI | χ2 | df | P |
|---|---|---|---|---|---|---|---|
| -1.79 | 0.09 | -1.96 | -1.61 | ||||
| 1.11 | 0.10 | 0.91 | 1.32 | 137.94 | 1 | < 0.001 | |
| 0.01 | 0.10 | -0.20 | 0.21 | 0.002 | 1 | 0.965 | |
| -0.05 | 0.12 | -0.29 | 0.20 | 0.14 | 1 | 0.709 |
a not shown because of having a very limited interpretation
b Reading tempo and nativeness were deviation coded: reading tempo was coded as a continuous predictor (fast = -0.5, casual = 0, slow = +0.5), and nativeness was coded as a two-level factor (native = -0.5, non-native = +0.5); the indicated tests were obtained from likelihood ratio tests comparing the full with a reduced model lacking reading tempo, nativeness and the interaction, respectively.
Results of the linear mixed model exploring the effects of reading tempo and native language on the duration of individual pauses (log-transformed).
The table reports estimated model coefficients, standard errors (SE) and lower and upper confidence intervals (CI), χ2 values of likelihood ratio tests and respective degrees of freedom (df) and p-values (P).
| Term | Estimate | SE | lower CI | upper CI | χ2 | df | P |
|---|---|---|---|---|---|---|---|
| -0.75 | 0.06 | -0.87 | -0.63 | ||||
| 0.49 | 0.07 | 0.36 | 0.62 | 87.53 | 1 | < 0.001 | |
| -0.10 | 0.07 | -0.24 | 0.05 | 1.73 | 1 | 0.188 | |
| -0.04 | 0.08 | -0.19 | 0.12 | 0.25 | 1 | 0.651 |
a not shown because of having a very limited interpretation
b Reading tempo and nativeness were deviation coded: reading tempo was coded as a continuous predictor (fast = -0.5, casual = 0, slow = +0.5), and nativeness was coded as a two-level factor (native = -0.5, non-native = +0.5); the indicated tests were obtained from likelihood ratio tests comparing the full with a reduced model lacking reading tempo, nativeness and the interaction, respectively.
Results of the logistic regression model exploring the effects of reading tempo, native language and in-text position on the occurrence frequency of pauses.
The table reports estimated model coefficients, standard errors (SE) and lower and upper confidence intervals (CI), χ2 values of likelihood ratio tests and respective degrees of freedom (df) and p-values (P).
| Term | Estimate | SE | lower CI | upper CI | χ2 | df | P |
|---|---|---|---|---|---|---|---|
| 0.60 | 0.17 | 0.27 | 0.94 | ||||
| -2.73 | 0.08 | -2.89 | -2.57 | 9759.82 | 1 | < 0.001 | |
| -6.06 | 0.09 | -6.25 | -5.88 | ||||
| 2.20 | 0.15 | 1.91 | 2.48 | 906.75 | 1 | < 0.001 | |
| 0.54 | 0.20 | 0.14 | 0.94 | 7.05 | 1 | 0.008 | |
| 0.27 | 0.17 | -0.06 | 0.60 | 2.59 | 1 | 0.11 |
a not shown because of having a very limited interpretation
b In-text position was dummy coded with the marked position being the respective reference category. Reading tempo and nativeness were deviation coded: reading tempo was coded as a continuous predictor (fast = -0.5, casual = 0, slow = +0.5), and nativeness was coded as a two-level factor (native = -0.5, non-native = +0.5); the indicated tests were obtained from likelihood ratio tests comparing the full with a reduced model lacking in-text position, reading tempo, nativeness and the interaction between reading tempo and nativeness, respectively.
c This is the overall effect of in-text position on the occurrence frequency of pauses. Considering the differences between the individual levels, the model revealed that at unmarked phrase boundaries (z = -33.49, p < 0.001) and at other word boundaries (z = -65.17, p < 0.001), participants paused significantly less than at punctuation marks.
Predicted probabilities (in %) of making a pause in fast, casual or slow reading at punctuation marks, unmarked phrase boundaries or other positions in the text for native and non-native speakers of English.
| Native | Non-Native | |||||
|---|---|---|---|---|---|---|
| 33.20 | 58.19 | 79.58 | 42.69 | 70.50 | 88.46 | |
| 3.15 | 8.34 | 20.31 | 4.65 | 13.51 | 33.39 | |
| 0.12 | 0.32 | 0.90 | 0.17 | 0.55 | 1.76 | |
aValues are derived from the estimates of our logistic regression model (see Table 5). We used the following formulae for the back-transformation from the logit-transformed model estimates: odds = exp(0.60–2.73 * unmarked phrase boundary– 6.06 * other word boundary + 2.20 * reading tempo + 0.54 * nativeness + 0.27 * reading tempo * nativeness, and y = odds/(1+odds). In-text position was dummy coded, with marked boundaries serving as the reference category. Reading tempo and nativeness were deviation coded: reading tempo was coded as a continuous predictor (fast = -0.5, casual = 0, slow = +0.5), and nativeness was coded as a two-level factor (native = -0.5, non-native = +0.5). The respective values were inserted into the formulae to calculate the predicted probabilities of pause occurrence.
Fig 3a) effect display of the significant main effect of in-text position, b) effect display of the significant main effects of reading tempo and nativeness. The effect of the interaction of reading tempo and nativeness was non-significant. The y-axis displays the probability of occurrence of a word after a pause. Error bars and shaded areas around the estimated effects represent 95% confidence intervals.