Literature DB >> 6575670

Coding of the speech spectrum in three time-varying sinusoids.

R E Remez, P E Rubin, D B Pisoni.   

Abstract

Recent perceptual experiments with normal adult listeners show that phonetic information can readily be conveyed by sinewave replicas of speech signals. These tonal patterns are made of three sinusoids set equal in frequency and amplitude to the respective peaks of the first three formants of natural-speech utterances. Unlike natural and most synthetic speech, the spectrum of sinusoidal patterns contains neither harmonics nor broadband formants, and is identified as grossly unnatural in voice timbre. Despite this drastic recoding of the short-time speech spectrum, listeners perceive the phonetic content if the temporal properties of spectrum variation are preserved. These observations suggest that phonetic perception may depend on properties of coherent spectrum variation, a second-order property of the acoustic signal, rather than any particular set of acoustic elements present in speech signals.

Mesh:

Year:  1983        PMID: 6575670     DOI: 10.1111/j.1749-6632.1983.tb31663.x

Source DB:  PubMed          Journal:  Ann N Y Acad Sci        ISSN: 0077-8923            Impact factor:   5.691


  2 in total

1.  Possible mechanisms of duplex perception: "chirp" identification versus dichotic fusion.

Authors:  H C Nusbaum
Journal:  Percept Psychophys       Date:  1984-01

Review 2.  Three challenges for future research on cochlear implants.

Authors:  David B Pisoni; William G Kronenberger; Michael S Harris; Aaron C Moberly
Journal:  World J Otorhinolaryngol Head Neck Surg       Date:  2018-01-02
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.