Literature DB >> 25324116

Speech intelligibility estimation using multi-resolution spectral features for speakers undergoing cancer treatment.

Jonathan C Kim1, Hrishikesh Rao1, Mark A Clements1.   

Abstract

Head and neck cancer can significantly hamper speech production which often reduces speech intelligibility. A method of extracting spectral features is presented. The method uses a multi-resolution sinusoidal transform scheme, which enables better representation of spectral and harmonic characteristics. Regression methods were used to predict interval-scaled intelligibility scores of utterances in the NKI-CCRT speech corpus. The inclusion of these features lowered the mean squared estimation error from 0.43 to 0.39 on a scale from 1 to 7, with a p-value less than 0.001. For binary intelligibility classification, their inclusion resulted in an improvement by 5.0 percentage points when tested on a disjoint set.

Entities:  

Mesh:

Year:  2014        PMID: 25324116     DOI: 10.1121/1.4896410

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  1 in total

1.  Musical Emotion Recognition with Spectral Feature Extraction based on a Sinusoidal Model with Model-based and Deep-learning approaches.

Authors:  Baijun Xie; Jonathan C Kim; Chung Hyuk Park
Journal:  Appl Sci (Basel)       Date:  2020-01-30       Impact factor: 2.838

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.