Literature DB >> 11757947

Cross-spectral methods for processing speech.

D J Nelson1.   

Abstract

We present time-frequency methods which are well suited to the analysis of nonstationary multicomponent FM signals, such as speech. These methods are based on group delay, instantaneous frequency, and higher-order phase derivative surfaces computed from the short time Fourier transform (STFT). Unlike more conventional approaches, these methods do not assume a locally stationary approximation of the signal model. We describe the computation of the phase derivatives, the physical interpretation of these derivatives, and a re-mapping algorithm based on these phase derivatives. We show analytically, and by example, the convergence of the re-mapping to the FM representation of the signal. The methods are applied to speech to estimate signal parameters, such as the group delay of a transmission channel and speech formant frequencies. Our goal is to develop a unified method which can accurately estimate speech components in both time and frequency and to apply these methods to the estimation of instantaneous formant frequencies, effective excitation time, vocal tract group delay, and channel group delay. The proposed method has several interesting properties, the most important of which is the ability to simultaneously resolve all FM components of a multicomponent signal, as long as the STFT of the composite signal satisfies a simple separability condition. The method can provide super-resolution in both time and frequency in the sense that it can simultaneously provide time and frequency estimates of FM components, which have much better accuracy than the Heisenberg uncertainty of the STFT. Super-resolution provides the capability to accurately "re-map" each component of the STFT surface to the time and frequency of the FM signal component it represents. To attain high resolution and accuracy, the signal must be jointly estimated simultaneously in time and frequency. This is accomplished by estimating two surfaces, which are essentially the derivatives of the STFT phase with respect to time and frequency. To avoid phase ambiguities, the differentiation is performed as a cross-spectral product.

Mesh:

Year:  2001        PMID: 11757947     DOI: 10.1121/1.1402616

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  6 in total

1.  Multijoint arm stiffness during movements following stroke: implications for robot therapy.

Authors:  D Piovesan; M Casadio; F A Mussa-Ivaldi; P G Morasso
Journal:  IEEE Int Conf Rehabil Robot       Date:  2011

2.  Sparse time-frequency representations.

Authors:  Timothy J Gardner; Marcelo O Magnasco
Journal:  Proc Natl Acad Sci U S A       Date:  2006-04-06       Impact factor: 11.205

3.  Experimental measure of arm stiffness during single reaching movements with a time-frequency analysis.

Authors:  Davide Piovesan; Alberto Pierobon; Paul DiZio; James R Lackner
Journal:  J Neurophysiol       Date:  2013-08-14       Impact factor: 2.714

4.  F0-induced formant measurement errors result in biased variabilities.

Authors:  Wei-Rong Chen; D H Whalen; Christine H Shadle
Journal:  J Acoust Soc Am       Date:  2019-05       Impact factor: 1.840

5.  Measuring multi-joint stiffness during single movements: numerical validation of a novel time-frequency approach.

Authors:  Davide Piovesan; Alberto Pierobon; Paul DiZio; James R Lackner
Journal:  PLoS One       Date:  2012-03-20       Impact factor: 3.240

6.  Female resistance and harmonic convergence influence male mating success in Aedes aegypti.

Authors:  Andrew Aldersley; Lauren J Cator
Journal:  Sci Rep       Date:  2019-02-14       Impact factor: 4.379

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.