Literature DB >> 32431473

Leveraging Linguistic Context in Dyadic Interactions to Improve Automatic Speech Recognition for Children.

Manoj Kumar1, So Hyun Kim2, Catherine Lord3, Thomas D Lyon4, Shrikanth Narayanan1.   

Abstract

Automatic speech recognition for child speech has been long considered a more challenging problem than for adult speech. Various contributing factors have been identified such as larger acoustic speech variability including mispronunciations due to continuing biological changes in growth, developing vocabulary and linguistic skills, and scarcity of training corpora. A further challenge arises when dealing with spontaneous speech of children involved in a conversational interaction, and especially when the child may have limited or impaired communication ability. This includes health applications, one of the motivating domains of this paper, that involve goal-oriented dyadic interactions between a child and clinician/adult social partner as a part of behavioral assessment. In this work, we use linguistic context information from the interaction to adapt speech recognition models for children speech. Specifically, spoken language from the interacting adult speech provides the context for the child's speech. We propose two methods to exploit this context: lexical repetitions and semantic response generation. For the latter, we make use of sequence-to-sequence models that learn to predict the target child utterance given context adult utterances. Long-term context is incorporated in the model by propagating the cell-state across the duration of conversation. We use interpolation techniques to adapt language models at the utterance level, and analyze the effect of length and direction of context (forward and backward). Two different domains are used in our experiments to demonstrate the generalized nature of our methods - interactions between a child with ASD and an adult social partner in a play-based, naturalistic setting, and in forensic interviews between a child and a trained interviewer. In both cases, context-adapted models yield significant improvement (upto 10.71% in absolute word error rate) over the baseline and perform consistently across context windows and directions. Using statistical analysis, we investigate the effect of source-based (adult) and target-based (child) factors on adaptation methods. Our results demonstrate the applicability of our modeling approach in improving child speech recognition by employing information transfer from the adult interlocutor.

Entities:  

Keywords:  Autism Spectrum Disorder; Automatic Speech Recognition; Child Speech; Forensic Interviews

Year:  2020        PMID: 32431473      PMCID: PMC7236760          DOI: 10.1016/j.csl.2020.101101

Source DB:  PubMed          Journal:  Comput Speech Lang        ISSN: 0885-2308            Impact factor:   1.899


  16 in total

1.  Lexical and affective prosody in children with high-functioning autism.

Authors:  Ruth B Grossman; Rhyannon H Bemis; Daniela Plesa Skwerer; Helen Tager-Flusberg
Journal:  J Speech Lang Hear Res       Date:  2010-06       Impact factor: 2.297

Review 2.  A structured forensic interview protocol improves the quality and informativeness of investigative interviews with children: a review of research using the NICHD Investigative Interview Protocol.

Authors:  Michael E Lamb; Yael Orbach; Irit Hershkowitz; Phillip W Esplin; Dvora Horowitz
Journal:  Child Abuse Negl       Date:  2007-11-19

3.  Developmental acoustic study of American English diphthongs.

Authors:  Sungbok Lee; Alexandros Potamianos; Shrikanth Narayanan
Journal:  J Acoust Soc Am       Date:  2014-10       Impact factor: 1.840

4.  The autism diagnostic observation schedule-generic: a standard measure of social and communication deficits associated with the spectrum of autism.

Authors:  C Lord; S Risi; L Lambrecht; E H Cook; B L Leventhal; P C DiLavore; A Pickles; M Rutter
Journal:  J Autism Dev Disord       Date:  2000-06

5.  The Prevalence of Parent-Reported Autism Spectrum Disorder Among US Children.

Authors:  Michael D Kogan; Catherine J Vladutiu; Laura A Schieve; Reem M Ghandour; Stephen J Blumberg; Benjamin Zablotsky; James M Perrin; Paul Shattuck; Karen A Kuhlthau; Robin L Harwood; Michael C Lu
Journal:  Pediatrics       Date:  2018-12       Impact factor: 7.124

6.  Measuring Changes in Social Communication Behaviors: Preliminary Development of the Brief Observation of Social Communication Change (BOSCC).

Authors:  Rebecca Grzadzinski; Themba Carr; Costanza Colombi; Kelly McGuire; Sarah Dufek; Andrew Pickles; Catherine Lord
Journal:  J Autism Dev Disord       Date:  2016-07

Review 7.  Children's Disclosure of Sexual Abuse: A Systematic Review of Qualitative Research Exploring Barriers and Facilitators.

Authors:  Sarah Elizabeth Morrison; Caroline Bruce; Sarah Wilson
Journal:  J Child Sex Abus       Date:  2018-02-28

Review 8.  Overregularization in language acquisition.

Authors:  G F Marcus; S Pinker; M Ullman; M Hollander; T J Rosen; F Xu
Journal:  Monogr Soc Res Child Dev       Date:  1992

9.  Behavioral Signal Processing: Deriving Human Behavioral Informatics From Speech and Language: Computational techniques are presented to analyze and model expressed and perceived human behavior-variedly characterized as typical, atypical, distressed, and disordered-from speech and language cues and their applications in health, commerce, education, and beyond.

Authors:  Shrikanth Narayanan; Panayiotis G Georgiou
Journal:  Proc IEEE Inst Electr Electron Eng       Date:  2013-02-07       Impact factor: 10.961

10.  Transfer Learning from Adult to Children for Speech Recognition: Evaluation, Analysis and Recommendations.

Authors:  Prashanth Gurunath Shivakumar; Panayiotis Georgiou
Journal:  Comput Speech Lang       Date:  2020-02-18       Impact factor: 1.899

View more
  1 in total

1.  Developing sequentially trained robust Punjabi speech recognition system under matched and mismatched conditions.

Authors:  Puneet Bawa; Virender Kadyan; Abinash Tripathy; Thipendra P Singh
Journal:  Complex Intell Systems       Date:  2022-06-02
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.