| Literature DB >> 34268487 |
Axel Bohmann1, Martin Bohmann2,3, Lars Hinrichs4.
Abstract
We explore the relationship between word dissemination and frequency change for a rapidly receding feature, the relativizer whom. The success of newly emerging words has been shown to correlate with high dissemination scores. However, the reverse-a correlation of lower dissemination scores with receding features-has not been investigated. Based on two established and two newly developed measures of word dissemination-across texts, linguistic environments, registers, and topics-we show that a general correlation between dissemination and frequency does not obtain in the case of whom. Different dissemination measures diverge from each other and show internally variable developments. These can, however, be explained with reference to the specific sociolinguistic history of whom over the past 300 years. Our findings suggest that the relationship between dissemination and word success is not static, but needs to be contextualized against different stages in individual words' life-cycles. Our study demonstrates the applicability of large-scale, quantitative measures to qualitatively informed sociolinguistic research.Entities:
Keywords: dissemination; receding features; register; relativizers; sociolinguistics; whom
Year: 2021 PMID: 34268487 PMCID: PMC8276258 DOI: 10.3389/frai.2021.654154
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
FIGURE 1Frequency development of whom over 180 years of written American English.
The five dimensions of variation in COHA.
| Dimension label | Most salient features | Genre differentiation | Diachronic development |
|---|---|---|---|
| Structural elaboration | Clausal coordination, noun phrase trigrams, prepositions, main verb BE, attributive adjectives | Highest in nonfiction writing; lowest in magazines and newspapers | Consistent decrease in all genres |
| Verbal-personal communication | Pro-verb DO, private (cognitive) verbs, verbal infinitives, first person pronouns, adverbs | Highest in fiction writing, lower in all other genres | Increase in the non-fiction genres in the second half of the 20th century |
| Information density | Attributive adjectives, mean word length, type-token ratio, nouns, prepositions | Highest in nonfiction, lowest in fiction | Increase in all genres, particularly in the 20th century |
| Narration | Simple past, third person pronouns, possessives, quotation marks, public (quotative) verbs | Highest in fiction, lowest in nonfiction | Consistent increase in fiction; irregular developments in other genres |
| Abstraction & generalization | Prepositions, nominalizations, agentless passives, mean word length, infinitives | Highest in newspapers and nonfiction writing, lowest in fiction and magazines | Newspaper, nonfiction, and magazine writing grow closer to the consistently low values for fiction |
The 25 LDA topics developed for COHA.
| Topic | Top words | Topic | Top words |
|---|---|---|---|
| 1 | candidate, democrat, kennedy, nixon, reporter | 14 | george, mexico, mexican, madeline, rollo |
| 2 | team, player, film, coach, movie | 15 | prince, queen, lord, rome, duke |
| 3 | boat, captain, sail, deck, crew | 16 | chinese, china, mountain, stone, surface |
| 4 | railroad, machine, steel, contract, profit | 17 | percent, budget, investment, oil, sales |
| 5 | soul, heaven, lord, dear, sir | 18 | peter, shot, int, sam, camera |
| 6 | p.-a., dear, aunt, mary, sir | 19 | sir, captain, colonel, horse, soldier |
| 7 | kid, guy, stare, phone, nod | 20 | patient, hospital, <br>, medical, drug |
| 8 | paul, bird, planet, flower, moon | 21 | senate, teacher, governor, amendment, candidate |
| 9 | cook, milk, fruit, sugar, meat | 22 | moral, religion, science, christian, religious |
| 10 | horse, wood, dog, mountain, stare | 23 | tom, joe, ben, ruth, phil |
| 11 | poet, poem, jane, poetry, novel | 24 | novel, magazine, editor, publisher, reader |
| 12 | animal, research, science, scientist, cell | 25 | governor, indian, lincoln, county, trial |
| 13 | soviet, russian, communist, germany, russia |
FIGURE 2Social dissemination (DS) of whom over 180 years of written American English.
FIGURE 3Linguistic dissemination (DL) of whom over 180 years of written American English.
FIGURE 4Register dissemination (DR) of whom over 180 years of written American English.
FIGURE 5Coefficients for individual dimensions in models predicting frequency of whom. Points for individual years were omitted to avoid overplotting.
FIGURE 6Topic dissemination (DT) of whom over 180 years of written American English.
Three developmental stages of whom.
| Period | Status of | Sociolinguistic effects | Dissemination developments |
|---|---|---|---|
| Prior to 19th century | Regular grammatical conditioning | Stability due to categorical rules for relativizer choice | Not covered by our data; hypothesized stability of dissemination |
| 19th and early 20th century | Predominantly stylistic conditioning | Variability between | Low register dissemination indicating stylistic specificity; increase in linguistic dissemination as a consequence of weakening grammatical conditioning |
| 1950s onward | Retreat from active use | Avoidance of | Even dissemination due to overall low frequencies in all contexts |