| Literature DB >> 29760059 |
Frank Seifart1,2,3, Jan Strunk2, Swintha Danielsen4, Iren Hartmann4, Brigitte Pakendorf3, Søren Wichmann5,6, Alena Witzlack-Makarevich7, Nivja H de Jong5,8, Balthasar Bickel9.
Abstract
By force of nature, every bit of spoken language is produced at a particular speed. However, this speed is not constant-speakers regularly speed up and slow down. Variation in speech rate is influenced by a complex combination of factors, including the frequency and predictability of words, their information status, and their position within an utterance. Here, we use speech rate as an index of word-planning effort and focus on the time window during which speakers prepare the production of words from the two major lexical classes, nouns and verbs. We show that, when naturalistic speech is sampled from languages all over the world, there is a robust cross-linguistic tendency for slower speech before nouns compared with verbs, both in terms of slower articulation and more pauses. We attribute this slowdown effect to the increased amount of planning that nouns require compared with verbs. Unlike verbs, nouns can typically only be used when they represent new or unexpected information; otherwise, they have to be replaced by pronouns or be omitted. These conditions on noun use appear to outweigh potential advantages stemming from differences in internal complexity between nouns and verbs. Our findings suggest that, beneath the staggering diversity of grammatical structures and cultural settings, there are robust universals of language processing that are intimately tied to how speakers manage referential information when they communicate with one another.Entities:
Keywords: language processing; language universals; nouns; speech rate; word planning
Mesh:
Year: 2018 PMID: 29760059 PMCID: PMC5984521 DOI: 10.1073/pnas.1800708115
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.Location of the nine languages and size of the corpora studied here. For detailed information, see .
Fig. 2.Bora utterance illustrating slow articulation and presence of a pause before a noun compared with fast articulation and no pause before a verb. The example translates as “After you bit my father, he died” and is taken from a Bora mythological narrative, available online at https://hdl.handle.net/1839/00-0000-0000-000C-DFBE-1. (a) Waveform of audio signal; (b) time-aligned transcription of words; (c) word-by-word translation; (d) word class N = noun vs. V = verb vs. X = other; (e) position of word within utterance from 0 = start to 1 = end; (f) z-normalized word length calculated as SDs from mean word length in the language; (g) preword context windows for the noun llihíyoúvuke “my father” and the verb ds “he died,” adjusted in size to word boundaries close to 500 ms before onset of target words (preword window for “after biting” not shown here); (h) length of preword context windows; (i) articulation rate of words (excluding pauses) within preword context windows; and (j) presence vs. absence of pauses within preword context windows. Procedures for time-aligning transcriptions and for determining position, word length, and context window size are described in .
Fig. 3.Speech rate in contexts before nouns versus verbs. The effect displays show a cross-linguistic tendency for slower articulation before nouns (A) and a higher probability of pauses before nouns (B). The effect of word class (nouns vs. verbs) is plotted according to (generalized) linear mixed-effects models, with 95% confidence intervals based on these models. Both studies are based on models that are consistent across the individual languages, controlling for word position and word length as fixed factors and including random intercepts for speaker, text, and word type. The models for articulation speed included an additional interaction between word class and position, but A shows the overall effects of word class, averaging over positions, to simplify the visual representation ( and ). Levels of statistical significance are indicated as *P < 0.05; **P < 0.01; ***P < 0.001; and n.s. (not significant) > 0.05.