| Literature DB >> 19918360 |
Ger Reesink1, Ruth Singer, Michael Dunn.
Abstract
The region of the ancient Sahul continent (present day Australia and New Guinea, and surrounding islands) is home to extreme linguistic diversity. Even apart from the huge Austronesian language family, which spread into the area after the breakup of the Sahul continent in the Holocene, there are hundreds of languages from many apparently unrelated families. On each of the subcontinents, the generally accepted classification recognizes one large, widespread family and a number of unrelatable smaller families. If these language families are related to each other, it is at a depth which is inaccessible to standard linguistic methods. We have inferred the history of structural characteristics of these languages under an admixture model, using a Bayesian algorithm originally developed to discover populations on the basis of recombining genetic markers. This analysis identifies 10 ancestral language populations, some of which can be identified with clearly defined phylogenetic groups. The results also show traces of early dispersals, including hints at ancient connections between Australian languages and some Papuan groups (long hypothesized, never before demonstrated). Systematic language contact effects between members of big phylogenetic groups are also detected, which can in some cases be identified with a diffusional or substrate signal. Most interestingly, however, there remains striking evidence of a phylogenetic signal, with many languages showing negligible amounts of admixture.Entities:
Mesh:
Year: 2009 PMID: 19918360 PMCID: PMC2770058 DOI: 10.1371/journal.pbio.1000241
Source DB: PubMed Journal: PLoS Biol ISSN: 1544-9173 Impact factor: 8.029
Figure 1Geographic location and broad genealogical affiliation of the 121 languages in the sample (numbered west to east).
Figure 2Distribution of likelihood scores for 50 independent runs of STRUCTURE at each value of K from 2 to 15.
Figure 3Sahul linguistic population structure.
At K2 the basic contrast is between AN (Pale Blue) and all non-AN (Dark Blue), whether belonging to Papuan or Australian stocks, with some admixture in both groups. At K3 the Australian languages emerge as a solid cluster (Pale Green) within the non-AN group of K2. At K4 the AN languages are differentiated into a group which is basically the Oceanic subgroup (Pale Blue), and the remainder of western AN (Dark Green). The Oceanic languages of the Solomon Islands, New Caledonia, and Polynesia exhibit a considerable contribution from western AN as well. At K5 the Papuan languages are split into TNG (Dark Blue) and non-TNG (Pink), with some non-TNG of eastern Indonesia and New Britain showing admixture from AN clusters. At K6 a new cluster (Red) emerges, containing the Papuan languages of the Bird's Head and island Indonesia, as well as the non-TNG languages of the Bismarck archipelago. In addition, the AN languages Taba and Biak of eastern Indonesia exhibit a major contribution from this population. Some contribution is also seen in Oceanic languages east of the New Guinea mainland. At K7 we find a first diversification within the Australian set, mainly coinciding with the opposition between PN (Pale Green) and various non-PN (Yellow) families. Two non-PN languages, Garrwa and Kayardild, that had been previously classified as PN but more recently recognized as non-PN (Evans 2003: 12 [7]) cluster in our analysis with recognized PN languages. K8 exhibits a new contributing population among the non-TNG languages (Orange), present mainly in what can be identified as Northwest Papuan languages, different from other northern and southern non-TNG clusters, the latter of which is more clearly delineated in K9. At K9 the South Papuan cluster (Pale Purple) appears very strong in languages of the Trans-Fly area and Yélî Dnye of Rossel Island and has a weak contribution in Inanwatan towards the west and in Bilua of the Solomon Islands in the east. It leaves a group of languages that can be identified as Northeast Papuan. At K10 a new cluster (Dark Purple) is found among the AN languages. Signals of this population are not contiguous, suggesting two different strands in the Oceanic subgroup of the AN family: (1) a New Britain Oceanic also found in Äiwoo of the Reefs-Santa Cruz group (Pale Blue), and three languages of Vanuatu (Mwotlap, South Efate, and Sye); and (2) all other Oceanic languages (Dark Purple). Interestingly, at K11 the bifurcation of Oceanic languages of K10 disappears, while a new contributing population among the non-TNG languages can be identified. Since this K value is the first of a series with lower probabilities, we do not further discuss this, nor higher K values.
Figure 4The geographic patterning of STRUCTURE results for K10.
Recapitulating the K10 row from Figure 3.