Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Transfer Learning from Adult to Children for Speech Recognition: Evaluation, Analysis and Recommendations.

Literature DB >> 32372847

Transfer Learning from Adult to Children for Speech Recognition: Evaluation, Analysis and Recommendations.

Prashanth Gurunath Shivakumar¹, Panayiotis Georgiou¹.

Abstract

Children speech recognition is challenging mainly due to the inherent high variability in children's physical and articulatory characteristics and expressions. This variability manifests in both acoustic constructs and linguistic usage due to the rapidly changing developmental stage in children's life. Part of the challenge is due to the lack of large amounts of available children speech data for efficient modeling. This work attempts to address the key challenges using transfer learning from adult's models to children's models in a Deep Neural Network (DNN) framework for children's Automatic Speech Recognition (ASR) task evaluating on multiple children's speech corpora with a large vocabulary. The paper presents a systematic and an extensive analysis of the proposed transfer learning technique considering the key factors affecting children's speech recognition from prior literature. Evaluations are presented on (i) comparisons of earlier GMM-HMM and the newer DNN Models, (ii) effectiveness of standard adaptation techniques versus transfer learning, (iii) various adaptation configurations in tackling the variabilities present in children speech, in terms of (a) acoustic spectral variability, and (b) pronunciation variability and linguistic constraints. Our Analysis spans over (i) number of DNN model parameters (for adaptation), (ii) amount of adaptation data, (iii) ages of children, (iv) age dependent-independent adaptation. Finally, we provide Recommendations on (i) the favorable strategies over various aforementioned - analyzed parameters, and (ii) potential future research directions and relevant challenges/problems persisting in DNN based ASR for children's speech.

Entities: Chemical Disease Gene Species

Keywords: Analysis of Children’s Speech; Automatic Speech Recognition; Children Speech Recognition; Deep Learning; Deep Neural Network; Transfer Learning

Year: 2020 PMID： 32372847 PMCID： PMC7199459 DOI： 10.1016/j.csl.2020.101077

Source DB: PubMed Journal: Comput Speech Lang ISSN： 0885-2308 Impact factor: 1.899

Keyword Cloud
Cited

5 in total

1. Leveraging Linguistic Context in Dyadic Interactions to Improve Automatic Speech Recognition for Children.

Authors: Manoj Kumar; So Hyun Kim; Catherine Lord; Thomas D Lyon; Shrikanth Narayanan
Journal: Comput Speech Lang Date: 2020-04-16 Impact factor: 1.899

2. Developing sequentially trained robust Punjabi speech recognition system under matched and mismatched conditions.

Authors: Puneet Bawa; Virender Kadyan; Abinash Tripathy; Thipendra P Singh
Journal: Complex Intell Systems Date: 2022-06-02

3. Fast screening for children's developmental language disorders via comprehensive speech ability evaluation-using a novel deep learning framework.

Authors: Xing Zhang; Feng Qin; Zelin Chen; Leyan Gao; Guoxin Qiu; Shuo Lu
Journal: Ann Transl Med Date: 2020-06

4. COVID-19 Diagnosis via DenseNet and Optimization of Transfer Learning Setting.

Authors: Yu-Dong Zhang; Suresh Chandra Satapathy; Xin Zhang; Shui-Hua Wang
Journal: Cognit Comput Date: 2021-01-18 Impact factor: 5.418

5. Tracking Child Language Development With Neural Network Language Models.

Authors: Kenji Sagae
Journal: Front Psychol Date: 2021-07-08

5 in total