Robert W Morris, Mark A Clements.
Abstract
This paper investigates a method for the real-time reconstruction of normal speech from whispers. This system could be used by aphonic individuals as a voice prosthesis. It could also provide improved verbal communication when normal speech is not appropriate. The normal speech is synthesized using the mixed excitation linear prediction model. Differences between whispered and phonated speech are discussed, and methods for estimating the parameters of this model from whispered speech for real-time synthesis are proposed. These methods include smoothing the noisy linear prediction spectra, modifying the formants, and synthesizing the excitation signal. Trade-offs between computational complexity, delay, and accuracy of different methods are discussed.
Year: 2002 PMID: 12237047 DOI: 10.1016/s1350-4533(02)00060-7
Source DB: PubMed Journal: Med Eng Phys ISSN: 1350-4533 Impact factor: 2.242