Literature DB >> 17407908

Sinusoidal modeling for nonstationary voiced speech based on a local vector transform.

Masashi Ito1, Masafumi Yano.   

Abstract

A voiced speech signal can be expressed as a sum of sinusoidal components of which instantaneous frequency and amplitude continuously vary with time. Determining these parameters from the input, the time-varying characteristics are crucial error sources for the algorithms, which assume their stationarity within a local analysis segment. To overcome this problem, a new method is proposed, local vector transform (LVT), which can determine instantaneous frequency and amplitude for nonstationary sinusoids. The method does not assume the local stationarity. The effectiveness of LVT was examined in parameter determination for synthesized and naturally uttered speech signals. The instantaneous frequency for the first harmonic component was determined with an accuracy almost equal to that of the time-corrected instantaneous frequency method and higher accuracy than that of spectral peak-picking, autocorrelation, and cepstrum. The instantaneous amplitude was also determined accurately by LVT while considerable errors were left in the other algorithms. The signal reconstructed from the determined parameters by LVT agreed well with the corresponding component of voiced speech. These results suggest that the method is effective for analyzing time-varying voiced speech signals.

Mesh:

Year:  2007        PMID: 17407908     DOI: 10.1121/1.2431581

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  1 in total

1.  Using a quadratic parameter sinusoid model to characterize the structure of EEG sleep spindles.

Authors:  Abdul J Palliyali; Mohammad N Ahmed; Beena Ahmed
Journal:  Front Hum Neurosci       Date:  2015-05-05       Impact factor: 3.169

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.