Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Leveraging Linguistic Context in Dyadic Interactions to Improve Automatic Speech Recognition for Children.

Literature DB >> 32431473

Leveraging Linguistic Context in Dyadic Interactions to Improve Automatic Speech Recognition for Children.

Manoj Kumar¹, So Hyun Kim², Catherine Lord³, Thomas D Lyon⁴, Shrikanth Narayanan¹.

Abstract

Automatic speech recognition for child speech has been long considered a more challenging problem than for adult speech. Various contributing factors have been identified such as larger acoustic speech variability including mispronunciations due to continuing biological changes in growth, developing vocabulary and linguistic skills, and scarcity of training corpora. A further challenge arises when dealing with spontaneous speech of children involved in a conversational interaction, and especially when the child may have limited or impaired communication ability. This includes health applications, one of the motivating domains of this paper, that involve goal-oriented dyadic interactions between a child and clinician/adult social partner as a part of behavioral assessment. In this work, we use linguistic context information from the interaction to adapt speech recognition models for children speech. Specifically, spoken language from the interacting adult speech provides the context for the child's speech. We propose two methods to exploit this context: lexical repetitions and semantic response generation. For the latter, we make use of sequence-to-sequence models that learn to predict the target child utterance given context adult utterances. Long-term context is incorporated in the model by propagating the cell-state across the duration of conversation. We use interpolation techniques to adapt language models at the utterance level, and analyze the effect of length and direction of context (forward and backward). Two different domains are used in our experiments to demonstrate the generalized nature of our methods - interactions between a child with ASD and an adult social partner in a play-based, naturalistic setting, and in forensic interviews between a child and a trained interviewer. In both cases, context-adapted models yield significant improvement (upto 10.71% in absolute word error rate) over the baseline and perform consistently across context windows and directions. Using statistical analysis, we investigate the effect of source-based (adult) and target-based (child) factors on adaptation methods. Our results demonstrate the applicability of our modeling approach in improving child speech recognition by employing information transfer from the adult interlocutor.

Entities: Chemical Disease Gene Species

Keywords: Autism Spectrum Disorder; Automatic Speech Recognition; Child Speech; Forensic Interviews

Year: 2020 PMID： 32431473 PMCID： PMC7236760 DOI： 10.1016/j.csl.2020.101101

Source DB: PubMed Journal: Comput Speech Lang ISSN： 0885-2308 Impact factor: 1.899

16 in total

1. Lexical and affective prosody in children with high-functioning autism.

Authors: Ruth B Grossman; Rhyannon H Bemis; Daniela Plesa Skwerer; Helen Tager-Flusberg
Journal: J Speech Lang Hear Res Date: 2010-06 Impact factor: 2.297

Review 2. A structured forensic interview protocol improves the quality and informativeness of investigative interviews with children: a review of research using the NICHD Investigative Interview Protocol.

Authors: Michael E Lamb; Yael Orbach; Irit Hershkowitz; Phillip W Esplin; Dvora Horowitz
Journal: Child Abuse Negl Date: 2007-11-19

3. Developmental acoustic study of American English diphthongs.

Authors: Sungbok Lee; Alexandros Potamianos; Shrikanth Narayanan
Journal: J Acoust Soc Am Date: 2014-10 Impact factor: 1.840

4. The autism diagnostic observation schedule-generic: a standard measure of social and communication deficits associated with the spectrum of autism.

Authors: C Lord; S Risi; L Lambrecht; E H Cook; B L Leventhal; P C DiLavore; A Pickles; M Rutter
Journal: J Autism Dev Disord Date: 2000-06

5. The Prevalence of Parent-Reported Autism Spectrum Disorder Among US Children.

Authors: Michael D Kogan; Catherine J Vladutiu; Laura A Schieve; Reem M Ghandour; Stephen J Blumberg; Benjamin Zablotsky; James M Perrin; Paul Shattuck; Karen A Kuhlthau; Robin L Harwood; Michael C Lu
Journal: Pediatrics Date: 2018-12 Impact factor: 7.124

6. Measuring Changes in Social Communication Behaviors: Preliminary Development of the Brief Observation of Social Communication Change (BOSCC).

Authors: Rebecca Grzadzinski; Themba Carr; Costanza Colombi; Kelly McGuire; Sarah Dufek; Andrew Pickles; Catherine Lord
Journal: J Autism Dev Disord Date: 2016-07

Review 7. Children's Disclosure of Sexual Abuse: A Systematic Review of Qualitative Research Exploring Barriers and Facilitators.

Authors: Sarah Elizabeth Morrison; Caroline Bruce; Sarah Wilson
Journal: J Child Sex Abus Date: 2018-02-28

Review 8. Overregularization in language acquisition.

Authors: G F Marcus; S Pinker; M Ullman; M Hollander; T J Rosen; F Xu
Journal: Monogr Soc Res Child Dev Date: 1992

9. Behavioral Signal Processing: Deriving Human Behavioral Informatics From Speech and Language: Computational techniques are presented to analyze and model expressed and perceived human behavior-variedly characterized as typical, atypical, distressed, and disordered-from speech and language cues and their applications in health, commerce, education, and beyond.

Authors: Shrikanth Narayanan; Panayiotis G Georgiou
Journal: Proc IEEE Inst Electr Electron Eng Date: 2013-02-07 Impact factor: 10.961

10. Transfer Learning from Adult to Children for Speech Recognition: Evaluation, Analysis and Recommendations.

Authors: Prashanth Gurunath Shivakumar; Panayiotis Georgiou
Journal: Comput Speech Lang Date: 2020-02-18 Impact factor: 1.899

1 in total

1. Developing sequentially trained robust Punjabi speech recognition system under matched and mismatched conditions.

Authors: Puneet Bawa; Virender Kadyan; Abinash Tripathy; Thipendra P Singh
Journal: Complex Intell Systems Date: 2022-06-02

1 in total