| Literature DB >> 31053716 |
Xia Hua1,2, Simon J Greenhill3,4, Marcel Cardillo5, Hilde Schneemann3,5,6, Lindell Bromham3,5.
Abstract
Language diversity is distributed unevenly over the globe. Intriguingly, patterns of language diversity resemble biodiversity patterns, leading to suggestions that similar mechanisms may underlie both linguistic and biological diversification. Here we present the first global analysis of language diversity that compares the relative importance of two key ecological mechanisms - isolation and ecological risk - after correcting for spatial autocorrelation and phylogenetic non-independence. We find significant effects of climate on language diversity, consistent with the ecological risk hypothesis that areas of high year-round productivity lead to more languages by supporting human cultural groups with smaller distributions. Climate has a much stronger effect on language diversity than landscape features, such as altitudinal range and river density, which might contribute to isolation of cultural groups. The association between biodiversity and language diversity appears to be an incidental effect of their covariation with climate, rather than a causal link between the two.Entities:
Mesh:
Year: 2019 PMID: 31053716 PMCID: PMC6499821 DOI: 10.1038/s41467-019-09842-2
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Global distribution of language diversity. Left panel: values on a logarithmic scale of number of languages are shown for 200 × 200 km cells of an equal-area grid. Right panel: correlation in language diversity between each pair of grid cells owing to spatial autocorrelation and phylogenetic relatedness. Correlation coefficient is estimated from our generalized least squares model that includes all the climatic and landscape variables as predictors (see Methods). Correlated grid cells are roughly clustered into nine geographic regions, so we color code the rows and columns by these regions. Grid cells within East Asia, Europe, and the Americas are more autocorrelated than grid cells within the other regions
Climatic effects on language diversity
| Predictor | Low ( | Medium ( | High ( |
|---|---|---|---|
| Annual mean precipitation | 0.56 (0.577) | 1.32 (0.190) | 1.74 (0.083) |
| Annual mean temperature | 0.04 (0.968) | −0.90 (0.369) | 1.15 (0.251) 0.80 |
| Precipitation seasonality | 1.54 (0.126) | 0.10 (0.920) | |
| Temperature seasonality | −1.14 (0.257) | − | − |
| Net primary productivity | 1.86 (0.064) | 1.65 (0.101) | 1.58 (0.114) |
| Mean annual growing season | 1.29 (0.199) | 1.06 (0.290) | 0.85 (0.395) |
We list the t value and the p value (in parentheses) of each predictor in a generalized least squares regression that includes all the six eco-climatic predictors (n is the number of grid cells used in the analysis at low, medium, or high resolution). Two additional parameters are the intercept and the coefficient for land coverage. Because collinearity can inflate the standard error of regression coefficient, we also conduct likelihood ratio (LR) tests to assess if adding a predictor significantly increases model fit. If so, the predictor has a significant effect on language diversity beyond its covariation with other predictors. Significant results are in bold. LR value is shown in italic, after t value (p value)
Predictions of the ecological risk hypothesis
| Response | Predictor |
|
| Does mean growing season show significant association with response beyond its covariation with predictor? |
|---|---|---|---|---|
| Low resolution ( | ||||
| Language diversity | Mean growing season |
|
| NA |
| Average population size | Mean growing season | −1.02 | 0.309 | NA |
| Min. population size | Mean growing season | −1.72 | 0.087 | NA |
| Population density | Mean growing season |
|
| NA |
| Language diversity | Latitude | − |
| Yes: |
| Language diversity | Population density | −0.59 | 0.556 | Yes: |
| Medium resolution ( | ||||
| Language diversity | Mean growing season |
|
| NA |
| Average population size | Mean growing season | −1.92 | 0.056 | NA |
| Min. population size | Mean growing season | − |
| NA |
| Population density | Mean growing season |
|
| NA |
| Language diversity | Latitude | − |
| Yes: |
| Language diversity | Population density | −0.30 | 0.763 | Yes: |
| High resolution ( | ||||
| Language diversity | Mean growing season |
|
| NA |
| Average population size | Mean growing season | −1.83 | 0.068 | NA |
| Min. population size | Mean growing season | − |
| NA |
| Population density | Mean growing season |
|
| NA |
| Language diversity | Latitude | − |
| Yes: |
| Language diversity | Population density | 0.66 | 0.512 | Yes: |
We list the t value and the p value of the predictor in each generalized least squares regression for the response variable (n is the number of grid cells used in the analysis at low, medium, or high resolution). Each model includes three parameters: intercept, coefficient of land coverage, and coefficient of the predictor. Significant results are in bold. To test if mean growing season shows significant association with the response variable beyond its covariation with the predictor, we conduct a likelihood ratio test on whether adding mean growing season as an additional predictor significantly increase the model fit
Landscape effects on language diversity and speaker population size
| Response | Predictor | Low ( | Medium ( | High ( |
|---|---|---|---|---|
| Language diversity | Average altitude | −1.21 (0.228) | 0.70 (0.483) | 0.21 (0.830) |
| Altitudinal range | 1.67 (0.096) | 1.01 (0.312) | ||
| Landscape roughness | 0.13 (0.900) | −0.00 (0.999) | 0.98 (0.328) | |
| River density | 1.42 (0.157) | |||
| Average speaker population size | Average altitude | 0.44 (0.662) | 0.74 (0.463) | 0.73 (0.463) |
| Altitudinal range | 1.21 (0.229) | −0.65 (0.515) | −1.20 (0.232) | |
| Landscape roughness | −0.82 (0.412) | 1.24 (0.215) | 1.94 (0.053) | |
| River density | −1.02 (0.310) | −1.20 (0.234) | −0.47 (0.641) | |
| Minimum speaker population size | Average altitude | 0.50 (0.616) | 0.81 (0.420) | 1.20 (0.232) |
| Altitudinal range | 0.22 (0.824) | −1.19 (0.238) | −1.74 (0.083) | |
| Landscape roughness | −1.25 (0.214) | 1.57 (0.118) | 1.04 (0.301) | |
| River density | −0.43 (0.668) | − | −1.32 (0.187) |
We list the t value and the p value (in parentheses) of each landscape variable in each generalized least squares regression (n is the number of grid cells used in the analysis at low, medium, or high resolution). Models with language diversity include all the six climatic and four landscape variables. Models with population size includes all the four landscape variables and population density. Two additional parameters are the intercept and the coefficient for land coverage. We also conduct likelihood ratio test to test if adding a landscape variable significantly increases model fit. If so, the variable has a significant effect on language diversity beyond its covariation with climatic variables and the other landscape variables. Significant results are in bold. LR value is shown in italic, after t value (p value)
Association between biodiversity and language diversity
| Biodiversity | Low ( | Medium ( | High ( |
|---|---|---|---|
| Plant diversity | 1.32 (0.188) | 0.76 (0.449) | 0.28 (0.783) |
| Amphibian diversity | 0.38 (0.705) | −0.67 (0.507) | −0.30 (0.761) |
| Mammal diversity | 1.79 (0.075) | ||
| Bird diversity |
We list the t value and the p value (in parentheses) of a biodiversity variable in a generalized least squares regression that includes the biodiversity variable and all the six climatic and four landscape variables (n is the number of grid cells used in the analysis at low, medium, or high resolution). Two additional parameters are the intercept and the coefficient for land coverage. We also conduct likelihood ratio (LR) test to test if adding the biodiversity variable significantly increases model fit. Significant results are in bold. LR value is shown in italic, after t value (p value)
Fig. 2Global distribution of mammal diversity and bird diversity. Values on logarithm scale of number of species are shown for 200 × 200 km cells of an equal-area grid. For amphibian and plant diversity see Supplementary Figure 3
Fig. 3Global distribution of residuals in language diversity. Residuals after accounting for the climatic and landscape effects on language diversity are shown for 200 × 200 km grid cells of an equal-area grid. Aggregations of grid cells with residuals ≥ 1.96 (red) are circled. These indicate four regions of higher than expected language diversity, compared with regions of similar climate and landscape (New Guinea, Eastern Himalaya, West Africa, and Mesoamerica). Areas of lower than expected language diversity with residuals ≤ −1.96 (blue) are distributed in South America, mostly in the Amazon basin. The figure only shows grid cells for which we have relevant data
Fig. 4Global distribution of the number of language families. Numbers of language families are shown for 200 × 200 km cells of an equal-area grid. Language family is defined by the World Language Mapping System taxonomy[1]. Language isolates are treated as distinct families. Number of language families within a grid cell is calculated as the number of language families that include at least one language distributed in the grid cell. The figure only shows grid cells for which we have relevant data