Literature DB >> 17407904

Noise estimation in voice signals using short-term cepstral analysis.

Peter J Murphy1, Olatunji O Akande.   

Abstract

Cepstral-based estimation is used to provide a baseline estimate of the noise level in the logarithmic spectrum for voiced speech. A theoretical description of cepstral processing of voiced speech containing aspiration noise, together with supporting empirical data, is provided in order to illustrate the nature of the noise baseline estimation process. Taking the Fourier transform of the liftered (filtered in the cepstral domain) cepstrum produces a noise baseline estimate. It is shown that Fourier transforming the low-pass liftered cepstrum is comparable to applying a moving average (MA) filter to the logarithmic spectrum and hence the baseline receives contributions from the glottal source excited vocal tract and the noise excited vocal tract. Because the estimation process resembles the action of a MA filter, the resulting noise baseline is determined by the harmonic resolution (as determined by the temporal analysis window length) and the glottal source spectral tilt. On selecting an appropriate temporal analysis window length the estimated baseline is shown to lie halfway between the glottal excited vocal tract and the noise excited vocal tract. This information is employed in a new harmonics-to-noise (HNR) estimation technique, which is shown to provide accurate HNR estimates when tested on synthetically generated voice signals.

Mesh:

Year:  2007        PMID: 17407904     DOI: 10.1121/1.2427123

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  5 in total

1.  Automatic evaluation of hypernasality based on a cleft palate speech database.

Authors:  Ling He; Jing Zhang; Qi Liu; Heng Yin; Margaret Lech; Yunzhi Huang
Journal:  J Med Syst       Date:  2015-03-28       Impact factor: 4.460

2.  A study of sounds produced by Algerian esophageal speakers.

Authors:  K Ferrat; M Guerti
Journal:  Afr Health Sci       Date:  2012-12       Impact factor: 0.927

3.  Relationships between vocal function measures derived from an acoustic microphone and a subglottal neck-surface accelerometer.

Authors:  Daryush D Mehta; Jarrad H Van Stan; Robert E Hillman
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2016-01-11

Review 4.  Voice assessment: updates on perceptual, acoustic, aerodynamic, and endoscopic imaging methods.

Authors:  Daryush D Mehta; Robert E Hillman
Journal:  Curr Opin Otolaryngol Head Neck Surg       Date:  2008-06       Impact factor: 2.064

5.  Integration of Motor Learning Principles Into Real-Time Ambulatory Voice Biofeedback and Example Implementation Via a Clinical Case Study With Vocal Fold Nodules.

Authors:  Jarrad H Van Stan; Daryush D Mehta; Robert J Petit; Dagmar Sternad; Jason Muise; James A Burns; Robert E Hillman
Journal:  Am J Speech Lang Pathol       Date:  2017-02-01       Impact factor: 2.408

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.