Literature DB >> 2341679

Perceptual linear predictive (PLP) analysis of speech.

H Hermansky1.   

Abstract

A new technique for the analysis of speech, the perceptual linear predictive (PLP) technique, is presented and examined. This technique uses three concepts from the psychophysics of hearing to derive an estimate of the auditory spectrum: (1) the critical-band spectral resolution, (2) the equal-loudness curve, and (3) the intensity-loudness power law. The auditory spectrum is then approximated by an autoregressive all-pole model. A 5th-order all-pole model is effective in suppressing speaker-dependent details of the auditory spectrum. In comparison with conventional linear predictive (LP) analysis, PLP analysis is more consistent with human hearing. The effective second formant F2' and the 3.5-Bark spectral-peak integration theories of vowel perception are well accounted for. PLP analysis is computationally efficient and yields a low-dimensional representation of speech. These properties are found to be useful in speaker-independent automatic-speech recognition.

Entities:  

Mesh:

Year:  1990        PMID: 2341679     DOI: 10.1121/1.399423

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  20 in total

1.  Perception of Place of Articulation for Plosives and Fricatives in Noise.

Authors:  Abeer Alwan; Jintao Jiang; Willa Chen
Journal:  Speech Commun       Date:  2011-02-01       Impact factor: 2.017

2.  Vowel acoustic space development in children: a synthesis of acoustic and anatomic data.

Authors:  Houri K Vorperian; Ray D Kent
Journal:  J Speech Lang Hear Res       Date:  2007-12       Impact factor: 2.297

3.  Spectral integration of dynamic cues in the perception of syllable-initial stops.

Authors:  Robert Allen Fox; Ewa Jacewicz; Lawrence L Feth
Journal:  Phonetica       Date:  2008-05-28       Impact factor: 1.759

Review 4.  On the recognition of emotional vocal expressions: motivations for a holistic approach.

Authors:  Anna Esposito; Antonietta M Esposito
Journal:  Cogn Process       Date:  2012-08-08

5.  Robust Estimation of Hypernasality in Dysarthria with Acoustic Model Likelihood Features.

Authors:  Michael Saxon; Ayush Tripathi; Yishan Jiao; Julie Liss; Visar Berisha
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2020-08-07

6.  Evaluation of the importance of time-frequency contributions to speech intelligibility in noise.

Authors:  Chengzhu Yu; Kamil K Wójcicki; Philipos C Loizou; John H L Hansen; Michael T Johnson
Journal:  J Acoust Soc Am       Date:  2014-05       Impact factor: 1.840

7.  Auditory spectral integration in the perception of static vowels.

Authors:  Robert Allen Fox; Ewa Jacewicz; Chiung-Yun Chang
Journal:  J Speech Lang Hear Res       Date:  2011-08-23       Impact factor: 2.297

8.  Toward the ultimate synthesis/recognition system.

Authors:  S Furui
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

9.  Time-Frequency Masking in the Complex Domain for Speech Dereverberation and Denoising.

Authors:  Donald S Williamson; DeLiang Wang
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2017-04-20

10.  A NEW MASK-BASED OBJECTIVE MEASURE FOR PREDICTING THE INTELLIGIBILITY OF BINARY MASKED SPEECH.

Authors:  Chengzhu Yu; Kamil K Wójcicki; P C Loizou; John H L Hansen
Journal:  Proc IEEE Int Conf Acoust Speech Signal Process       Date:  2013
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.