Literature DB >> 2299046

Algorithms for separating the speech of interfering talkers: evaluations with voiced sentences, and normal-hearing and hearing-impaired listeners.

R J Stubbs1, Q Summerfield.   

Abstract

Two signal-processing algorithms, derived from those described by Stubbs and Summerfield [R.J. Stubbs and Q. Summerfield, J. Acoust. Soc. Am. 84, 1236-1249 (1988)], were used to separate the voiced speech of two talkers speaking simultaneously, at similar intensities, in a single channel. Both algorithms use fundamental frequency (FO) as the basis for segregation. One attenuates the interfering voice by filtering the cepstrum of the signal. The other is a hybrid algorithm that combines cepstral filtering with the technique of harmonic selection [T.W. Parsons, J. Acoust. Soc. Am. 60, 911-918 (1976)]. The algorithms were evaluated and compared in perceptual experiments involving listeners with normal hearing and listeners with cochlear hearing impairments. In experiment 1 the processing was used to separate voiced sentences spoken on a monotone. Both algorithms gave significant increases in intelligibility to both groups of listeners. The improvements were equivalent to an increase of 3-4 dB in the effective signal-to-noise ratio (SNR). In experiment 2 the processing was used to separate voiced sentences spoken with time-varying intonation. For normal-hearing listeners, cepstral filtering gave a significant increase in intelligibility, while the hybrid algorithm gave an increase that was on the margins of significance (p = 0.06). The improvements were equivalent to an increase of 2-3 dB in the effective SNR. For impaired listeners, no intelligibility improvements were demonstrated with intoned sentences. The decrease in performance for intoned material is attributed to limitations of the algorithms when FO is nonstationary.

Entities:  

Mesh:

Year:  1990        PMID: 2299046     DOI: 10.1121/1.399257

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  4 in total

1.  Effects of the rate of formant-frequency variation on the grouping of formants in speech perception.

Authors:  Robert J Summers; Peter J Bailey; Brian Roberts
Journal:  J Assoc Res Otolaryngol       Date:  2011-12-13

2.  Estimating speech spectra for copy synthesis by linear prediction and by hand.

Authors:  Robert E Remez; Kathryn R Dubowski; Morgana L Davids; Emily F Thomas; Nina U Paddu; Yael S Grossman; Marina Moskalenko
Journal:  J Acoust Soc Am       Date:  2011-10       Impact factor: 1.840

3.  Listening to speech in the presence of other sounds.

Authors:  C J Darwin
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2008-03-12       Impact factor: 6.237

4.  The role of timbre in the segregation of simultaneous voices with intersecting F0 contours.

Authors:  J F Culling; C J Darwin
Journal:  Percept Psychophys       Date:  1993-09
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.