Literature DB >> 16875242

Pitch-based monaural segregation of reverberant speech.

Nicoleta Roman1, DeLiang Wang.   

Abstract

In everyday listening, both background noise and reverberation degrade the speech signal. Psychoacoustic evidence suggests that human speech perception under reverberant conditions relies mostly on monaural processing. While speech segregation based on periodicity has achieved considerable progress in handling additive noise, little research in monaural segregation has been devoted to reverberant scenarios. Reverberation smears the harmonic structure of speech signals, and our evaluations using a pitch-based segregation algorithm show that an increase in the room reverberation time causes degraded performance due to weakened periodicity in the target signal. We propose a two-stage monaural separation system that combines the inverse filtering of the room impulse response corresponding to target location and a pitch-based speech segregation method. As a result of the first stage, the harmonicity of a signal arriving from target direction is partially restored while signals arriving from other directions are further smeared, and this leads to improved segregation. A systematic evaluation of the system shows that the proposed system results in considerable signal-to-noise ratio gains across different conditions. Potential applications of this system include robust automatic speech recognition and hearing aid design.

Entities:  

Mesh:

Year:  2006        PMID: 16875242     DOI: 10.1121/1.2204590

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  6 in total

1.  Factors influencing glimpsing of speech in noise.

Authors:  Ning Li; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2007-08       Impact factor: 1.840

2.  Factors influencing intelligibility of ideal binary-masked speech: implications for noise reduction.

Authors:  Ning Li; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2008-03       Impact factor: 1.840

3.  Evaluation of the importance of time-frequency contributions to speech intelligibility in noise.

Authors:  Chengzhu Yu; Kamil K Wójcicki; Philipos C Loizou; John H L Hansen; Michael T Johnson
Journal:  J Acoust Soc Am       Date:  2014-05       Impact factor: 1.840

4.  Time-Frequency Masking in the Complex Domain for Speech Dereverberation and Denoising.

Authors:  Donald S Williamson; DeLiang Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2017-04-20

5.  A new sound coding strategy for suppressing noise in cochlear implants.

Authors:  Yi Hu; Philipos C Loizou
Journal:  J Acoust Soc Am       Date:  2008-07       Impact factor: 1.840

6.  Decreased ability in the segregation of dynamically changing vowel-analog streams: a factor in the age-related cocktail-party deficit?

Authors:  Pierre Divenyi
Journal:  Front Neurosci       Date:  2014-06-12       Impact factor: 4.677

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.