| Literature DB >> 29308253 |
Nicolas Perrault1, Maxwell J Farrell2, T Jonathan Davies2,3.
Abstract
Languages are being lost at rates exceeding the global loss of biodiversity. With the extinction of a language we lose irreplaceable dimensions of culture and the insight it provides on human history and the evolution of linguistic diversity. When setting conservation goals, biologists give higher priority to species likely to go extinct. Recent methods now integrate information on species evolutionary relationships to prioritize the conservation of those with a few close relatives. Advances in the construction of language trees allow us to use these methods to develop language preservation priorities that minimize loss of linguistic diversity. The evolutionarily distinct and globally endangered (EDGE) metric, used in conservation biology, accounts for a species' originality (evolutionary distinctiveness-ED) and its likelihood of extinction (global endangerment-GE). Here, we use a similar framework to inform priorities for language preservation by generating rankings for 350 Austronesian languages. Kavalan, Tanibili, Waropen and Sengseng obtained the highest EDGE scores, while Xârâcùù (Canala), Nengone and Palauan are among the most linguistically distinct, but are not currently threatened. We further provide a way of dealing with incomplete trees, a common issue for both species and language trees.Entities:
Keywords: biodiversity; conservation; evolutionarily distinct and globally endangered; language preservation; linguistic diversity; phylogeny
Year: 2017 PMID: 29308253 PMCID: PMC5750020 DOI: 10.1098/rsos.171218
Source DB: PubMed Journal: R Soc Open Sci ISSN: 2054-5703 Impact factor: 2.963
Figure 1.How the Austronesian tree was reconstructed to compute evolutionary distinctiveness more accurately, using Germanic languages as an example. A tree (a) of Germanic languages with (here invented) branch lengths can be used to compute evolutionary distinctiveness (ED), but missing languages (Dutch and Swedish) will bias this score. Language classifications into families and subfamilies by the Ethnologue (simplified for illustration) can partially compensate for this bias. It can be used to infer a tree (b) with no meaningful branch lengths. Those languages or groups of languages missing from tree (a) are imported from tree (b) to form a reconstructed tree (c). ED, as calculated from tree (c), is usually more accurate than when calculated from tree (a); see main text. This method does not allow computing ED of languages missing from tree (a). In this analysis, we used the Austronesian equivalent of tree (c). Details in the electronic supplementary material.
Definition of global endangerment (GE) scores for language endangerment. GE is a conversion of the EGIDS endangerment scale that parallels Isaac’s conversion of the IUCN Red List, in which increases of one unit in GE represent a doubling in the probability of extinction. The age of youngest users is the most important criterion for the EGIDS scale (details in electronic supplementary material, table S1).
| GE | EGIDS endangerment | youngest users | other criteria |
|---|---|---|---|
| 4 | nearly ext. | grandparents | rarely used |
| 3.5 | moribund | grandparents | — |
| 3 | shifting | parents | — |
| 2 | threatened | children | losing users |
| 1 | vigorous | children | stable user base |
| developing | children | standardized lit. | |
| educational | children | used in schools | |
| wider Comm. | children | used in mass media | |
| provincial | children | local govt. lang. | |
| national | children | national govt. lang. |
Figure 2.How robust ED is to missing languages. Every one of these 35 000 points represents the R2 between the ED scores of Gray et al.’s tree (350 languages) and the ED scores of one of its subtrees. Each subtree was obtained by randomly removing from Gray et al.’s tree a fixed proportion of languages represented on the x-axis. Even when 71.2% of tips are removed (vertical line), ED scores correlate well to that of Gray et al.’s tree, with R2=0.78 on average (horizontal line). If subtrees with 101 languages (i.e. with 71.2% of the 350 languages removed) are reconstructed to 350 languageswith Ethnologue data, as detailed in Materials and methods but not depicted here, the average R2 rises from 0.78 to 0.82. In Gray et al.’s tree of 350 languages, 71.2% of the 1215 ISO 639-3 Austronesian languages are missing. We therefore expect that the ED scores of the reconstructed tree used in this analysis are good approximations.
Figure 3.EDGE distribution of the 350 Austronesian languages shaded by theirrelative evolutionary distinctiveness (a) and endangermentlevel (b).
Languages by EDGE score (full list in the electronic supplementary material .csv file).
| language | EDGE | EDR | endangerment | GE | |
|---|---|---|---|---|---|
| 1 | Kavalan | 2.17 | 3.36 | nearly extinct | 4 |
| 2 | Tanibili | 1.86 | 2.21 | nearly extinct | 4 |
| 3 | Waropen | 1.774 | 2.50 | shifting | 3 |
| 4 | Sengseng | 1.765 | 3.13 | threatened | 2 |
| 5 | Magori | 1.75 | 1.88 | nearly extinct | 4 |
| 6 | Xârâcùù | 1.7133 | 3.66 | vigorous | 1 |
| 7 | Irarutu | 1.7130 | 2.92 | threatened | 2 |
| ⋮ | |||||
| 349 | Tuvaluan | 0.24 | 0.25 | wider comm. | |
| 350 | Indonesian | 0.15 | 0.16 | national |