
Reconstruction of speech from whispers.

Robert W Morris, Mark A Clements.

Abstract

This paper investigates a method for the real-time reconstruction of normal speech from whispers. Such a system could be used by aphonic individuals as a voice prosthesis. It could also improve verbal communication in situations where normal speech is not appropriate. The normal speech is synthesized using the mixed excitation linear prediction (MELP) model. Differences between whispered and phonated speech are discussed, and methods for estimating the parameters of this model from whispered speech for real-time synthesis are proposed. These include smoothing the noisy linear prediction spectra, modifying the formants, and synthesizing the excitation signal. Trade-offs between the computational complexity, delay, and accuracy of the different methods are discussed.
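As a rough illustration of the linear prediction step the abstract refers to, the sketch below estimates linear prediction coefficients per frame with the standard autocorrelation method (Levinson-Durbin recursion). It then smooths the estimates over time by exponentially averaging the autocorrelation sequences of successive frames. The smoothing scheme, the function names, and the `alpha` parameter are assumptions made for this example; the abstract does not say how the authors smooth the spectra, so this should not be read as their method.

```python
def autocorrelation(frame, max_lag):
    """Biased autocorrelation r[0..max_lag] of one frame of samples."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LP normal equations for A(z) = 1 + sum_j a[j] z^-j.

    Returns (a, err), where a[0] == 1.0 and err is the residual
    prediction error energy.
    """
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:
            break  # silent or degenerate frame: keep the remaining taps at zero
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient for this order
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a, err


def smoothed_lpc(frames, order, alpha=0.5):
    """Hypothetical smoothing: exponentially average the autocorrelation
    sequences across frames before solving for the LP coefficients."""
    smoothed = None
    coeffs = []
    for frame in frames:
        r = autocorrelation(frame, order)
        if smoothed is None:
            smoothed = r
        else:
            smoothed = [alpha * s + (1.0 - alpha) * x
                        for s, x in zip(smoothed, r)]
        coeffs.append(levinson_durbin(smoothed, order)[0])
    return coeffs
```

Averaging in the autocorrelation domain, rather than averaging the coefficients directly, is chosen here because a weighted average of valid autocorrelation sequences is itself a valid autocorrelation sequence, so the resulting all-pole filter stays stable.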


Year: 2002    PMID: 12237047    DOI: 10.1016/s1350-4533(02)00060-7

Source DB: PubMed    Journal: Med Eng Phys    ISSN: 1350-4533    Impact factor: 2.242


  1 in total

1.  Speech reconstruction using a deep partially supervised neural network.

Authors:  Ian McLoughlin; Jingjie Li; Yan Song; Hamid R Sharifzadeh
Journal:  Healthc Technol Lett       Date:  2017-06-09
