| Literature DB >> 19062868 |
DeLiang Wang1, Ulrik Kjems, Michael S Pedersen, Jesper B Boldt, Thomas Lunner.
Abstract
For a given mixture of speech and noise, an ideal binary time-frequency mask is constructed by comparing speech energy and noise energy within local time-frequency units. It is observed that listeners achieve nearly perfect speech recognition from gated noise with binary gains prescribed by the ideal binary mask. Only 16 filter channels and a frame rate of 100 Hz are sufficient for high intelligibility. The results show that, despite a dramatic reduction of speech information, a pattern of binary gains provides an adequate basis for speech perception.Mesh:
Year: 2008 PMID: 19062868 DOI: 10.1121/1.2967865
Source DB: PubMed Journal: J Acoust Soc Am ISSN: 0001-4966 Impact factor: 1.840