| Literature DB >> 28796784 |
Luke Maurits1, Robert Forkel2, Gereon A Kaiping3, Quentin D Atkinson1,2.
Abstract
We present a new open source software tool called BEASTling, designed to simplify the preparation of Bayesian phylogenetic analyses of linguistic data using the BEAST 2 platform. BEASTling transforms comparatively short and human-readable configuration files into the XML files used by BEAST to specify analyses. By taking advantage of Creative Commons-licensed data from the Glottolog language catalog, BEASTling allows the user to conveniently filter datasets using names for recognised language families, to impose monophyly constraints so that inferred language trees are backward compatible with Glottolog classifications, or to assign geographic location data to languages for phylogeographic analyses. Support for the emerging cross-linguistic linked data format (CLDF) permits easy incorporation of data published in cross-linguistic linked databases into analyses. BEASTling is intended to make the power of Bayesian analysis more accessible to historical linguists without strong programming backgrounds, in the hopes of encouraging communication and collaboration between those developing computational models of language evolution (who are typically not linguists) and relevant domain experts.Entities:
Mesh:
Year: 2017 PMID: 28796784 PMCID: PMC5552126 DOI: 10.1371/journal.pone.0180908
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Maximum clade credibility tree for the Indo-European languages in our example analysis of Indo-European cognate data.
The coloured blocks correspond to the correctly reconstructed subfamilies Slavic, Germanic and Romance. This tree is a summary of a posterior sample, and some aspects are more or less certain than others. Tree branches are solid if they subtend clades with posterior support exceeding 0.66 (common inside established subfamilies), dashed if support is between 0.33 and 0.66 and dotted if support is below 0.33 (which occurs only for the relationship between Armenian and Greek).
Relative substitution rates of the ten slowest and fastest changing meaning slots in our example analysis of Indo-European cognate data.
| Slowest | Fastest | ||
|---|---|---|---|
| Feature | Rate | Feature | Rate |
| give | 0.11 | walk | 1.61 |
| tooth | 0.11 | heavy | 1.63 |
| sun | 0.12 | snake | 1.68 |
| full | 0.12 | big | 1.76 |
| I | 0.12 | short | 1.76 |
| star | 0.12 | woman | 1.81 |
| eye | 0.12 | many | 1.98 |
| ear | 0.12 | know | 2.02 |
| tongue | 0.12 | tail | 2.20 |
| heart | 0.14 | belly | 2.21 |
Rates are relative to the average across all features, e.g. tooth evolves almost 10 times more slowly than average, while know evolves at just over twice the average rate. Note that many of the slowest meanings are body parts.
Fig 2Distributions of relative substitution rates for different categories of meaning slot.
Most categories have a median rate well below the average of 1.0, with long tails extending to faster than average rates. All colour terms have below average rates, as do all pronouns except for the outlier that.
Relative substitution rates of the ten slowest and fastest changing features in our example analysis of Austronesian typological data.
| Feature | Rate |
|---|---|
| Slowest | |
| Order of Object and Verb | 0.08 |
| Order of Adposition and Noun Phrase | 0.12 |
| Order of Genitive and Noun | 0.23 |
| Position of Pronominal Possessive Affixes | 0.27 |
| Order of Subject and Verb | 0.34 |
| Order of Subject, Object and Verb | 0.38 |
| Preverbal Negative Morphemes | 0.39 |
| Order of Numeral and Noun | 0.46 |
| Position of Interrogative Phrases in Content Questions | 0.47 |
| Numeral Classifiers | 0.50 |
| Fastest | |
| Position of Tense-Aspect Affixes | 1.19 |
| Polar Questions | 1.23 |
| Position of Polar Question Particles | 1.31 |
| SVNegO Order | 1.32 |
| Weight Factors in Weight-Sensitive Stress Systems | 1.41 |
| Indefinite Articles | 1.47 |
| Definite Articles | 1.60 |
| Order of Degree Word and Adjective | 1.63 |
| Weight-Sensitive Stress | 1.86 |
| Fixed Stress Locations | 2.14 |
Rates are relative to the average across all features. Note that many of the slowest features relate to word order.
Fig 3Histogram and kernel density estimate of relative substitution rates across the WALS features in our example analysis of Austronesian typological data.