Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Robust estimation of speech in noisy backgrounds based on aspects of the auditory process.

Literature DB >> 7790661

Robust estimation of speech in noisy backgrounds based on aspects of the auditory process.

Abstract

A new approach to speech enhancement is proposed where constraints based on aspects of the auditory process augment an iterative enhancement framework. The basic enhancement framework is based on a previously developed dual-channel scenario using a two-step iterative Wiener filtering algorithm. Constraints across broad speech sections and over iterations are then experimentally developed on a novel auditory representation derived by transforming the speech magnitude spectrum. The spectral transformations are based on modeling aspects of the human auditory process which include critical band filtering, intensity-to-loudness conversion, and lateral inhibition. The auditory transformations and perceptual based constraints are shown to result in a new set of auditory constrained and enhanced linear prediction (ACE-LP) parameters. The ACE-LP based speech spectrum is then incorporated into the iterative Wiener filtering framework. The improvements due to auditory constraints are demonstrated in several areas. The proposed auditory representation is shown to result in improved spectral characterization in background noise. The auditory constrained iterative enhancement (ACE-II) algorithm is shown to result in improved quality over all sections of enhanced speech. Adaptation of auditory based constraints to changing spectral characteristics over broad classes of speech is another novel aspect of the proposed algorithm. The consistency of speech quality improvement for the ACE-II algorithm is illustrated over time and across all phonemes classified over a large set of phonetically balanced sentences from the TIMIT database. This study demonstrates the application of auditory based perceptual properties of a human listener to speech enhancement in noise, resulting in improved and consistent speech quality over all regions of speech.

Entities: Gene Species

Mesh：

Year: 1995 PMID： 7790661 DOI： 10.1121/1.413108

Source DB: PubMed Journal: J Acoust Soc Am ISSN： 0001-4966 Impact factor: 1.840

Keyword Cloud
Cited

3 in total

1. Speech enhancement using the modified phase-opponency model.

Authors: Om D Deshmukh; Carol Y Espy-Wilson; Laurel H Carney
Journal: J Acoust Soc Am Date: 2007-06 Impact factor: 1.840

2. Evaluation of the importance of time-frequency contributions to speech intelligibility in noise.

Authors: Chengzhu Yu; Kamil K Wójcicki; Philipos C Loizou; John H L Hansen; Michael T Johnson
Journal: J Acoust Soc Am Date: 2014-05 Impact factor: 1.840

3. Single Word Intelligibility of Individuals with Parkinson's Disease in Noise: Pre-Specified Secondary Outcome Variables from a Randomized Control Trial (RCT) Comparing Two Intensive Speech Treatments (LSVT LOUD vs. LSVT ARTIC).

Authors: Geralyn Schulz; Angela Halpern; Jennifer Spielman; Lorraine Ramig; Ira Panzer; Alan Sharpley; Katherine Freeman
Journal: Brain Sci Date: 2021-06-27

3 in total