Literature DB >> 21428994

Learning diphone-based segmentation.

Robert Daland1, Janet B Pierrehumbert.   

Abstract

This paper reconsiders the diphone-based word segmentation model of Cairns, Shillcock, Chater, and Levy (1997) and Hockema (2006), previously thought to be unlearnable. A statistically principled learning model is developed using Bayes' theorem and reasonable assumptions about infants' implicit knowledge. The ability to recover phrase-medial word boundaries is tested using phonetic corpora derived from spontaneous interactions with children and adults. The (unsupervised and semi-supervised) learning models are shown to exhibit several crucial properties. First, only a small amount of language exposure is required to achieve the model's ceiling performance, equivalent to between 1 day and 1 month of caregiver input. Second, the models are robust to variation, both in the free parameter and the input representation. Finally, both the learning and baseline models exhibit undersegmentation, argued to have significant ramifications for speech processing as a whole.
Copyright © 2010 Cognitive Science Society, Inc.

Entities:  

Mesh:

Year:  2010        PMID: 21428994     DOI: 10.1111/j.1551-6709.2010.01160.x

Source DB:  PubMed          Journal:  Cogn Sci        ISSN: 0364-0213


  5 in total

1.  Generalization to Novel Consonants: Place Versus Voice.

Authors:  Sara Finley
Journal:  J Psycholinguist Res       Date:  2022-06-25

2.  Interactive language learning by robots: the transition from babbling to word forms.

Authors:  Caroline Lyon; Chrystopher L Nehaniv; Joe Saunders
Journal:  PLoS One       Date:  2012-06-13       Impact factor: 3.240

3.  Disentangling sequential from hierarchical learning in Artificial Grammar Learning: Evidence from a modified Simon Task.

Authors:  Maria Vender; Diego Gabriel Krivochen; Arianna Compostella; Beth Phillips; Denis Delfitto; Douglas Saddy
Journal:  PLoS One       Date:  2020-05-14       Impact factor: 3.240

4.  Variation in the input: a case study of manner class frequencies.

Authors:  Robert Daland
Journal:  J Child Lang       Date:  2012-10-10

5.  The edge factor in early word segmentation: utterance-level prosody enables word form extraction by 6-month-olds.

Authors:  Elizabeth K Johnson; Amanda Seidl; Michael D Tyler
Journal:  PLoS One       Date:  2014-01-08       Impact factor: 3.240

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.