Abstract
Societies continually evolve and speakers use new words to talk about innovative products and practices. While most lexical innovations soon fall into disuse, others spread successfully and become part of the lexicon. In this paper, I conduct a longitudinal study of the spread of 99 English neologisms on Twitter to study their degrees and pathways of diffusion. Previous work on lexical innovation has almost exclusively relied on usage frequency for investigating the spread of new words. To get a more differentiated picture of diffusion, I use frequency-based measures to study temporal aspects of diffusion and I use network analyses for a more detailed and accurate investigation of the sociolinguistic dynamics of diffusion. The results show that frequency measures manage to capture diffusion with varying success. Frequency counts can serve as an approximate indicator for overall degrees of diffusion, yet they miss important information about the temporal usage profiles of lexical innovations. The results indicate that neologisms with similar total frequency can exhibit significantly different degrees of diffusion. Analysing differences in their temporal dynamics of use with regard to their age, trends in usage intensity, and volatility contributes to a more accurate account of their diffusion. The results obtained from the social network analysis reveal substantial differences in the social pathways of diffusion. Social diffusion significantly correlates with the frequency and temporal usage profiles of neologisms. However, the network visualisations and metrics identify neologisms whose degrees of social diffusion are more limited than suggested by their overall frequency of use. These include, among others, highly volatile neologisms (e.g., poppygate) and political terms (e.g., alt-left), whose use almost exclusively goes back to single communities of closely-connected, like-minded individuals. 
I argue that the inclusion of temporal and social information is of particular importance for the study of lexical innovation since neologisms exhibit high degrees of temporal volatility and social indexicality. More generally, the present approach demonstrates the potential of social network analysis for sociolinguistic research on linguistic innovation, variation, and change.
Keywords: Twitter; diffusion; lexical innovation; lexicology; social media; social network analysis; sociolinguistics; time-series analysis
Year: 2021 PMID: 34790894 PMCID: PMC8591557 DOI: 10.3389/frai.2021.648583
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
FIGURE 2. Temporal dynamics in usage frequency for the selected neologisms.
Total usage frequency (FREQ) in the corpus. Most frequent lexemes.
| Lexeme | FREQ |
|---|---|
| tweeter | 7,367,174 |
| fleek | 3,412,807 |
| bromance | 2,662,767 |
| twitterverse | 1,486,873 |
| blockchain | 1,444,300 |
| smartwatch | 1,106,906 |
Total usage frequency (FREQ) in the corpus. Case study selection.
| Lexeme | FREQ |
|---|---|
| alt-right | 1,012,150 |
| solopreneur | 282,026 |
| hyperlocal | 209,937 |
| alt-left | 167,124 |
| upskill | 57,941 |
| poppygate | 3,807 |
Total usage frequency (FREQ) in the corpus. Least frequent lexemes.
| Lexeme | FREQ |
|---|---|
| microflat | 426 |
| dogfishing | 399 |
| begpacker | 283 |
| halfalogue | 245 |
| rapugee | 182 |
| bediquette | 164 |
FIGURE 1. Cumulative increase in usage frequency for the case study lexemes.
Coefficients of variation (VAR) for the selected neologisms.
| Lexeme | VAR |
|---|---|
| hyperlocal | 0.98 |
| upskill | 1.14 |
| solopreneur | 1.20 |
| alt-right | 1.81 |
| poppygate | 4.75 |
| alt-left | 5.31 |
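The VAR scores above are coefficients of variation, that is, the standard deviation of a frequency series divided by its mean. The following is a minimal sketch of that calculation, not the paper's implementation: the two count series are hypothetical, and the use of the population standard deviation is an assumption, since neither the time binning nor the estimator is stated in this record.

```python
import statistics


def coefficient_of_variation(counts):
    """Return the population standard deviation of counts divided by their mean."""
    mean = statistics.fmean(counts)
    if mean == 0:
        raise ValueError("coefficient of variation is undefined for a zero mean")
    return statistics.pstdev(counts) / mean


# Hypothetical per-interval usage counts (not taken from the corpus).
# Both series have the same total frequency (60) but very different profiles.
steady_counts = [10, 12, 9, 11, 10, 8]
bursty_counts = [0, 0, 58, 1, 1, 0]

steady_var = coefficient_of_variation(steady_counts)
bursty_var = coefficient_of_variation(bursty_counts)
print(f"steady: {steady_var:.2f}, bursty: {bursty_var:.2f}")
```

Because both hypothetical series sum to the same total, the example mirrors the point made in the abstract: lexemes with similar overall frequency can differ sharply in volatility.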
Coefficients of variation (VAR) for the six neologisms with the highest scores in the sample.
| Lexeme | VAR |
|---|---|
| upskirting | 9.39 |
| youthquake | 6.32 |
| alt-left | 5.31 |
| birther | 5.00 |
| poppygate | 4.75 |
| cherpumple | 4.69 |
Degree centrality scores (CENT) for the selected neologisms; the scores are based on the most recent time slice for each neologism in the corpus.
| Lexeme | CENT |
|---|---|
| upskill | 0.0021 |
| hyperlocal | 0.0085 |
| alt-right | 0.0144 |
| alt-left | 0.0238 |
| solopreneur | 0.0523 |
| poppygate | 0.0566 |
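The CENT scores are graph-level values computed per network. One common graph-level measure of this kind is Freeman's degree centralization, which is 1 for a star-shaped network dominated by a single hub and 0 for a network in which all nodes have the same degree. The sketch below illustrates that measure for an undirected graph; whether the paper uses exactly this normalisation, or a directed variant, is an assumption, and the edge lists are hypothetical.

```python
from collections import defaultdict


def degree_centralization(edges):
    """Freeman degree centralization of an undirected simple graph.

    edges is an iterable of (node, node) pairs; self-loops are ignored
    and repeated edges are counted once.
    """
    neighbours = defaultdict(set)
    for u, v in edges:
        if u == v:
            continue
        neighbours[u].add(v)
        neighbours[v].add(u)
    n = len(neighbours)
    if n < 3:
        raise ValueError("centralization needs at least three connected nodes")
    degrees = [len(adjacent) for adjacent in neighbours.values()]
    max_degree = max(degrees)
    # Sum of differences to the best-connected node, divided by the
    # largest possible sum, which is reached by a star graph.
    return sum(max_degree - d for d in degrees) / ((n - 1) * (n - 2))


# Hypothetical interaction networks (not taken from the corpus).
star = [("hub", f"user{i}") for i in range(1, 5)]
ring = [("a", "b"), ("b", "c"), ("c", "d"), ("d", "e"), ("e", "a")]

print(degree_centralization(star))
print(degree_centralization(ring))
```

Under this reading, higher scores indicate that use of a neologism is concentrated around a few highly connected accounts, while lower scores indicate a more evenly distributed network.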
FIGURE 3. Social network graphs for the last subset of the selected neologisms.
FIGURE 4. Pathways of diffusion for the selected neologisms. The graph shows DEGREE CENTRALITY scores over time, each SUBSET representing one network graph which was generated for each of the four equally-sized time slices for each neologism in the sample.
FIGURE 5. Social network of diffusion for hyperlocal over time.
Correlations of ‘degree centralization’ (CENTRALITY) with the variables total usage frequency (FREQUENCY), coefficient of variation (VOLATILITY), and observed lifespan in the corpus (AGE) for the full sample of neologisms (n = 99) using Spearman’s correlation coefficient (Spearman 1961).
| Variable | ρ | p |
|---|---|---|
| Frequency | −0.44 | <0.001 |
| Age | −0.29 | 0.004 |
| Volatility | 0.28 | <0.001 |
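Spearman's coefficient is the Pearson correlation of the ranks of two variables. The sketch below shows the calculation using only the six case-study lexemes whose FREQ and CENT values are listed in this record. It is an illustration of the method and does not reproduce the coefficients reported for the full sample (n = 99); the helper names `average_ranks` and `spearman_rho` are hypothetical.

```python
import math


def average_ranks(values):
    """Return 1-based ranks of values, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        rank = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho: the Pearson correlation of the ranks of x and y."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length of at least 2")
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    if sx == 0 or sy == 0:
        raise ValueError("rho is undefined when one variable is constant")
    return cov / (sx * sy)


# Case-study values as listed in the FREQ and CENT tables of this record.
case_studies = {
    "alt-right": (1_012_150, 0.0144),
    "solopreneur": (282_026, 0.0523),
    "hyperlocal": (209_937, 0.0085),
    "alt-left": (167_124, 0.0238),
    "upskill": (57_941, 0.0021),
    "poppygate": (3_807, 0.0566),
}
freq = [f for f, _ in case_studies.values()]
cent = [c for _, c in case_studies.values()]
rho = spearman_rho(freq, cent)
print(f"rho for the six case studies: {rho:.2f}")
```

With only six observations the resulting coefficient is not a meaningful estimate; the example is meant to show how the rank-based calculation works.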
FIGURE 6. Relationship between total USAGE FREQUENCY and degree centrality (CENTRALIZATION) for the full sample of neologisms (n = 99) and the selected cases.
Total usage frequency (FREQ) in the corpus. Examples around the median.
| Lexeme | FREQ |
|---|---|
| white fragility | 26,688 |
| monthiversary | 23,607 |
| helicopter parenting | 26,393 |
| deepfake | 20,101 |
| newsjacking | 20,930 |
| twittosphere | 20,035 |
Coefficients of variation (VAR) for the six neologisms with the lowest scores in the sample.
| Lexeme | VAR |
|---|---|
| followership | 0.71 |
| lituation | 0.72 |
| twitterverse | 0.72 |
| detweet | 0.74 |
| remoaners | 0.76 |
| twittersphere | 0.77 |
Degree centrality scores (CENT) for the six lexemes with the lowest scores in the sample; the scores are based on the most recent time slice for each neologism in the corpus.
| Lexeme | CENT |
|---|---|
| baecation | 0.0005 |
| fleek | 0.0009 |
| ghosting | 0.0013 |
| man bun | 0.0016 |
| big dick energy | 0.0018 |
| twittersphere | 0.0020 |
Degree centrality scores (CENT) for the six lexemes with the highest scores in the sample; the scores are based on the most recent time slice for each neologism in the corpus.
| Lexeme | CENT |
|---|---|
| rapugee | 0.2580 |
| levidrome | 0.2373 |
| kushnergate | 0.2309 |
| dronography | 0.1530 |
| dotard | 0.0979 |
| ecocide | 0.0922 |