Literature DB >> 23401532

Automated reconstruction of ancient languages using probabilistic models of sound change.

Alexandre Bouchard-Côté1, David Hall, Thomas L Griffiths, Dan Klein.   

Abstract

One of the oldest problems in linguistics is reconstructing the words that appeared in the protolanguages from which modern languages evolved. Identifying the forms of these ancient languages makes it possible to evaluate proposals about the nature of language change and to draw inferences about human history. Protolanguages are typically reconstructed using a painstaking manual process known as the comparative method. We present a family of probabilistic models of sound change as well as algorithms for performing inference in these models. The resulting system automatically and accurately reconstructs protolanguages from modern languages. We apply this system to 637 Austronesian languages, providing an accurate, large-scale automatic reconstruction of a set of protolanguages. Over 85% of the system's reconstructions are within one character of the manual reconstruction provided by a linguist specializing in Austronesian languages. Being able to automatically reconstruct large numbers of languages provides a useful way to quantitatively explore hypotheses about the factors determining which sounds in a language are likely to change over time. We demonstrate this by showing that the reconstructed Austronesian protolanguages provide compelling support for a hypothesis about the relationship between the function of a sound and its probability of changing that was first proposed in 1955.

Entities:  

Mesh:

Year:  2013        PMID: 23401532      PMCID: PMC3600485          DOI: 10.1073/pnas.1204678110

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  11 in total

1.  Language trees support the express-train sequence of Austronesian expansion.

Authors:  R D Gray; F M Jordan
Journal:  Nature       Date:  2000-06-29       Impact factor: 49.962

2.  Evolutionary HMMs: a Bayesian approach to multiple alignment.

Authors:  I Holmes; W J Bruno
Journal:  Bioinformatics       Date:  2001-09       Impact factor: 6.937

3.  Language-tree divergence times support the Anatolian theory of Indo-European origin.

Authors:  Russell D Gray; Quentin D Atkinson
Journal:  Nature       Date:  2003-11-27       Impact factor: 49.962

4.  A "Long Indel" model for evolutionary sequence alignment.

Authors:  I Miklós; G A Lunter; I Holmes
Journal:  Mol Biol Evol       Date:  2003-12-23       Impact factor: 16.240

5.  BAli-Phy: simultaneous Bayesian inference of alignment and phylogeny.

Authors:  Marc A Suchard; Benjamin D Redelings
Journal:  Bioinformatics       Date:  2006-05-05       Impact factor: 6.937

6.  An evolutionary model for maximum likelihood alignment of DNA sequences.

Authors:  J L Thorne; H Kishino; J Felsenstein
Journal:  J Mol Evol       Date:  1991-08       Impact factor: 2.395

7.  Language phylogenies reveal expansion pulses and pauses in Pacific settlement.

Authors:  R D Gray; A J Drummond; S J Greenhill
Journal:  Science       Date:  2009-01-23       Impact factor: 47.728

8.  Genome-wide nucleotide-level mammalian ancestor reconstruction.

Authors:  Benedict Paten; Javier Herrero; Stephen Fitzgerald; Kathryn Beal; Paul Flicek; Ian Holmes; Ewan Birney
Journal:  Genome Res       Date:  2008-10-10       Impact factor: 9.043

9.  Learning phonology with substantive bias: an experimental and computational study of velar palatalization.

Authors:  Colin Wilson
Journal:  Cogn Sci       Date:  2006-09-10

10.  The Austronesian Basic Vocabulary Database: from bioinformatics to lexomics.

Authors:  Simon J Greenhill; Robert Blust; Russell D Gray
Journal:  Evol Bioinform Online       Date:  2008-11-03       Impact factor: 1.625

View more
  9 in total

1.  The descent of words.

Authors:  Quentin D Atkinson
Journal:  Proc Natl Acad Sci U S A       Date:  2013-02-15       Impact factor: 11.205

2.  Tracing the roots of syntax with Bayesian phylogenetics.

Authors:  Luke Maurits; Thomas L Griffiths
Journal:  Proc Natl Acad Sci U S A       Date:  2014-09-05       Impact factor: 11.205

3.  Support for linguistic macrofamilies from weighted sequence alignment.

Authors:  Gerhard Jäger
Journal:  Proc Natl Acad Sci U S A       Date:  2015-09-24       Impact factor: 11.205

4.  Detecting regular sound changes in linguistics as events of concerted evolution.

Authors:  Daniel J Hruschka; Simon Branford; Eric D Smith; Jon Wilkins; Andrew Meade; Mark Pagel; Tanmoy Bhattacharya
Journal:  Curr Biol       Date:  2014-12-18       Impact factor: 10.834

5.  How Many Is Enough?-Statistical Principles for Lexicostatistics.

Authors:  Menghan Zhang; Tao Gong
Journal:  Front Psychol       Date:  2016-12-12

6.  The Potential of Automatic Word Comparison for Historical Linguistics.

Authors:  Johann-Mattis List; Simon J Greenhill; Russell D Gray
Journal:  PLoS One       Date:  2017-01-27       Impact factor: 3.240

7.  Evolution and Trade-Off Dynamics of Functional Load.

Authors:  Erich Round; Rikker Dockum; Robin J Ryder
Journal:  Entropy (Basel)       Date:  2022-04-05       Impact factor: 2.738

Review 8.  Unity and disunity in evolutionary sciences: process-based analogies open common research avenues for biology and linguistics.

Authors:  Johann-Mattis List; Jananan Sylvestre Pathmanathan; Philippe Lopez; Eric Bapteste
Journal:  Biol Direct       Date:  2016-08-20       Impact factor: 4.540

9.  Global-scale phylogenetic linguistic inference from lexical resources.

Authors:  Gerhard Jäger
Journal:  Sci Data       Date:  2018-10-09       Impact factor: 6.444

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.