| Literature DB >> 35750712 |
Nala H Lee1, Cynthia S Q Siew2, Nadine H N Ng3.
Abstract
Language endangerment is one of the most urgent issues of the twenty-first century. Languages are disappearing at unprecedented rates, with dire consequences that affect speaker communities, scientific community and humanity. There is impetus for understanding the nature of language endangerment, and we investigate where language endangerment occurs by performing network analysis on 3423 languages at various levels of risk. Macro-level analysis shows evidence of positive assortative mixing of endangerment statuses-critically endangered languages are surrounded by similarly endangered languages, indicating the prevalence of linguistic hotspots throughout the world. Meso-level analysis using community detection returned 13 communities experiencing different levels of threat. Micro-level analysis of closeness centrality shows that more geographically isolated languages tend to be more critically endangered. Even after accounting for the statistical contributions of linguistic diversity, the structural properties of the spatial network were still significantly associated with endangerment outcomes. Findings support that the notion of hotspots is useful when accounting for language endangerment but go beyond that to establish that quantifying spatial structure is crucial. Language preservation in these hotspots and understanding why endangered languages pattern the way they do in their environments becomes more vital than ever.Entities:
Mesh:
Year: 2022 PMID: 35750712 PMCID: PMC9232642 DOI: 10.1038/s41598-022-14479-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Community structure of endangered languages around the world.
An overview of macro- and meso-level network measures of spatial networks with different thresholds.
| Measure | Threshold | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | ||
| Number of nodes | 3883 | 3883 | 3883 | 3883 | 3883 | 3883 | 3883 | 3883 | 3883 | 3883 | |
| Number of edges | 75,369 | 150,737 | 226,104 | 301,473 | 376,842 | 452,210 | 527,578 | 602,946 | 678,315 | 753,683 | |
| Network density | Proportion of observed edges/number of possible edges (corresponds to threshold, | 0.01 | 0.02 | 0.03 | 0.04 | 0.05 | 0.06 | 0.07 | 0.08 | 0.09 | 0.10 |
| Average degree, | Average number of connections or edges each node has | 38.8 | 77.6 | 116.5 | 155.3 | 194.1 | 232.9 | 271.7 | 310.6 | 349.4 | 388.2 |
| Global clustering coefficient, | Measure of local clustering (closed triangles) | 0.796 | 0.821 | 0.843 | 0.83 | 0.819 | 0.815 | 0.815 | 0.819 | 0.822 | 0.822 |
| Average shortest path length, | Average length of the shortest path between all possible node pairs in the network | 9.71 | 11.04 | 14.83 | 10.46 | 12.94 | 10.75 | 9.13 | 7.97 | 7.15 | 6.36 |
| Components | Number of distinct connected components | 213 | 60 | 28 | 17 | 5 | 4 | 3 | 3 | 3 | 3 |
| lccprop | Proportion of nodes in the largest connected component of the network | 0.249 | 0.463 | 0.756 | 0.757 | 0.996 | 0.996 | 0.999 | 0.999 | 0.999 | 0.999 |
| Modularity, | Meso-level metric quantifying the robustness of community structure (subclusters) of the network | 0.79 | 0.79 | 0.79 | 0.79 | 0.77 | 0.75 | 0.73 | 0.71 | 0.71 | 0.7 |
Figure 2Bubble plot of endangerment statuses among spatially close languages.
Figure 3Community structure of endangered languages around the world.
Means and standard deviations of closeness centrality across languages of different endangerment statuses.
| Level | Count | Mean | SD | Median |
|---|---|---|---|---|
| Vulnerable | 715 | 0.083 | 0.021 | 0.079 |
| Threatened | 1070 | 0.082 | 0.020 | 0.080 |
| Endangered | 825 | 0.084 | 0.018 | 0.086 |
| Severely endangered | 417 | 0.083 | 0.016 | 0.086 |
| Critically endangered | 426 | 0.078 | 0.017 | 0.076 |
| Dormant | 202 | 0.075 | 0.016 | 0.079 |
Model performance indices for regression models containing language families and isolates derived from pre-defined regions (top) and from groups found in the community detection analysis (bottom).
| Model | AIC | BIC | R2 |
|---|---|---|---|
| Pre-defined | 12,127.35 | 12,170.75 | 0.033 |
| Data-driven | 12,081.18 | 12,124.57 | 0.045 |
Odds ratios for predictors in the model containing language families and isolates (community-based) and closeness centrality.
| Predictors | Endangerment level | ||
|---|---|---|---|
| Odds ratios | |||
| Closeness centrality | 0.94 | − 2.06 | 0.040 |
| Diversity | 1.33 | 9.63 | < 0.001 |
| Observations | 3640 | ||
| R2 Nagelkerke | 0.045 | ||