Literature DB >> 28943715

Methods for eliciting, annotating, and analyzing databases for child speech development.

Mary E Beckman1, Andrew R Plummer2, Benjamin Munson3, Patrick F Reidy4.   

Abstract

Methods from automatic speech recognition (ASR), such as segmentation and forced alignment, have facilitated the rapid annotation and analysis of very large adult speech databases and databases of caregiver-infant interaction, enabling advances in speech science that were unimaginable just a few decades ago. This paper centers on two main problems that must be addressed in order to have analogous resources for developing and exploiting databases of young children's speech. The first problem is to understand and appreciate the differences between adult and child speech that cause ASR models developed for adult speech to fail when applied to child speech. These differences include the fact that children's vocal tracts are smaller than those of adult males and also changing rapidly in size and shape over the course of development, leading to between-talker variability across age groups that dwarfs the between-talker differences between adult men and women. Moreover, children do not achieve fully adult-like speech motor control until they are young adults, and their vocabularies and phonological proficiency are developing as well, leading to considerably more within-talker variability as well as more between-talker variability. The second problem then is to determine what annotation schemas and analysis techniques can most usefully capture relevant aspects of this variability. Indeed, standard acoustic characterizations applied to child speech reveal that adult-centered annotation schemas fail to capture phenomena such as the emergence of covert contrasts in children's developing phonological systems, while also revealing children's nonuniform progression toward community speech norms as they acquire the phonological systems of their native languages. Both problems point to the need for more basic research into the growth and development of the articulatory system (as well as of the lexicon and phonological system) that is oriented explicitly toward the construction of age-appropriate computational models.

Entities:  

Keywords:  automatic speech recognition; big data corpora; child speech development; phonetic transcription; spectral kinematics

Year:  2017        PMID: 28943715      PMCID: PMC5608260          DOI: 10.1016/j.csl.2017.02.010

Source DB:  PubMed          Journal:  Comput Speech Lang        ISSN: 0885-2308            Impact factor:   1.899


  75 in total

1.  Development of [j] in young, midwestern, American children.

Authors:  Richard S McGowan; Susan Nittrouer; Carol J Manning
Journal:  J Acoust Soc Am       Date:  2004-02       Impact factor: 1.840

2.  Locus equations are an acoustic expression of articulator synergy.

Authors:  Khalil Iskarous; Carol A Fowler; D H Whalen
Journal:  J Acoust Soc Am       Date:  2010-10       Impact factor: 1.840

3.  Vowel-dependent variation in Cantonese /s/ from an individual-difference perspective.

Authors:  Alan C L Yu
Journal:  J Acoust Soc Am       Date:  2016-04       Impact factor: 1.840

4.  Developmental and cross-linguistic variation in the infant vowel space: the case of Canadian English and Canadian French.

Authors:  Susan Rvachew; Karen Mattock; Linda Polka; Lucie Ménard
Journal:  J Acoust Soc Am       Date:  2006-10       Impact factor: 1.840

Review 5.  Review of text-to-speech conversion for English.

Authors:  D H Klatt
Journal:  J Acoust Soc Am       Date:  1987-09       Impact factor: 1.840

6.  The acquisition of the voicing contrast in English: study of voice onset time in word-initial stop consonants.

Authors:  M A Macken; D Barton
Journal:  J Child Lang       Date:  1980-02

7.  Developmental changes in the effects of utterance length and complexity on speech movement variability.

Authors:  Neeraja Sadagopan; Anne Smith
Journal:  J Speech Lang Hear Res       Date:  2008-07-29       Impact factor: 2.297

8.  Social feedback to infants' babbling facilitates rapid phonological learning.

Authors:  Michael H Goldstein; Jennifer A Schwade
Journal:  Psychol Sci       Date:  2008-05

9.  HomeBank: An Online Repository of Daylong Child-Centered Audio Recordings.

Authors:  Mark VanDam; Anne S Warlaumont; Elika Bergelson; Alejandrina Cristia; Melanie Soderstrom; Paul De Palma; Brian MacWhinney
Journal:  Semin Speech Lang       Date:  2016-04-25       Impact factor: 1.761

10.  Phonetic imitation by young children and its developmental changes.

Authors:  Kuniko Nielsen
Journal:  J Speech Lang Hear Res       Date:  2014-12       Impact factor: 2.297

View more
  2 in total

1.  Conversation Initiation of Mothers, Fathers, and Toddlers in their Natural Home Environment.

Authors:  Mark VanDam; Lauren Thompson; Elizabeth Wilson-Fowler; Sarah Campanella; Kiley Wolfenstein; Paul De Palma
Journal:  Comput Speech Lang       Date:  2021-12-02       Impact factor: 1.899

2.  Performance of Forced-Alignment Algorithms on Children's Speech.

Authors:  Tristan J Mahr; Visar Berisha; Kan Kawabata; Julie Liss; Katherine C Hustad
Journal:  J Speech Lang Hear Res       Date:  2021-03-11       Impact factor: 2.297

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.