Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Speech Segregation Using an Auditory Vocoder With Event-Synchronous Enhancements.

Literature DB >> 20191101

Speech Segregation Using an Auditory Vocoder With Event-Synchronous Enhancements.

Toshio Irino¹, Roy D Patterson, Hideki Kawahara.

Abstract

We propose a new method to segregate concurrent speech sounds using an auditory version of a channel vocoder. The auditory representation of sound, referred to as an "auditory image," preserves fine temporal information, unlike conventional window-based processing systems. This makes it possible to segregate speech sources with an event synchronous procedure. Fundamental frequency information is used to estimate the sequence of glottal pulse times for a target speaker, and to repress the glottal events of other speakers. The procedure leads to robust extraction of the target speech and effective segregation even when the signal-to-noise ratio is as low as 0 dB. Moreover, the segregation performance remains high when the speech contains jitter, or when the estimate of the fundamental frequency F0 is inaccurate. This contrasts with conventional comb-filter methods where errors in F0 estimation produce a marked reduction in performance. We compared the new method to a comb-filter method using a cross-correlation measure and perceptual recognition experiments. The results suggest that the new method has the potential to supplant comb-filter and harmonic-selection methods for speech enhancement.

Entities: Chemical Disease Gene Species

Year: 2006 PMID： 20191101 PMCID： PMC2828642 DOI： 10.1109/TASL.2006.872611

Source DB: PubMed Journal: IEEE Trans Audio Speech Lang Process ISSN： 1558-7916

12 in total

Speech Segregation Using an Auditory Vocoder With Event-Synchronous Enhancements.

1. A compressive gammachirp auditory filter for both physiological and psychophysical data.

2. Extending the domain of center frequencies for the compressive gammachirp auditory filter.

3. Derivation of auditory filter shapes from notched-noise data.

4. Robust and accurate fundamental frequency estimation based on dominant harmonic components.

5. Separation of speech from interfering sounds based on oscillatory correlation.

6. A duplex theory of pitch perception.

7. Time-domain modeling of peripheral auditory processing: a modular architecture and a software platform.

8. Modeling temporal asymmetry in the auditory system.

9. A comparison of detection and discrimination of temporal asymmetry in amplitude modulation.

10. A Dynamic Compressive Gammachirp Auditory Filterbank.

1. A Dynamic Compressive Gammachirp Auditory Filterbank.

2. Comparison of the roex and gammachirp filters as representations of the auditory filter.