Literature DB >> 14759026

A model of acoustic interspeaker variability based on the concept of formant-cavity affiliation.

Lian Apostol1, Pascal Perrier, Gérard Bailly.   

Abstract

A method is proposed to model the interspeaker variability of formant patterns for oral vowels. It is assumed that this variability originates in the differences existing among speakers in the respective lengths of their front and back vocal-tract cavities. In order to characterize, from the spectral description of the acoustic speech signal, these vocal-tract differences between speakers, each formant is interpreted, according to the concept of formant-cavity affiliation, as a resonance of a specific vocal-tract cavity. Its frequency can thus be directly related to the corresponding cavity length, and a transformation model can be proposed from a speaker A to a speaker B on the basis of the frequency ratios of the formants corresponding to the same resonances. In order to minimize the number of sounds to be recorded for each speaker in order to carry out this speaker transformation, the frequency ratios are exactly computed only for the three extreme cardinal vowels [i, a, u] and they are approximated for the remaining vowels through an interpolation function. The method is evaluated through its capacity to transform the (F1,F2) formant patterns of eight oral vowels pronounced by five male speakers into the (F1,F2) patterns of the corresponding vowels generated by an articulatory model of the vocal tract. The resulting formant patterns are compared to those provided by normalization techniques published in the literature. The proposed method is found to be efficient, but a number of limitations are also observed and discussed. These limitations can be associated with the formant-cavity affiliation model itself or with a possible influence of speaker-specific vocal-tract geometry in the cross-sectional direction, which the model might not have taken into account.

Entities:  

Mesh:

Year:  2004        PMID: 14759026     DOI: 10.1121/1.1631946

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  2 in total

1.  Articulatory capacity of Neanderthals, a very recent and human-like fossil hominin.

Authors:  Anna Barney; Sandra Martelli; Antoine Serrurier; James Steele
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2012-01-12       Impact factor: 6.237

2.  Variability of articulator positions and formants across nine English vowels.

Authors:  D H Whalen; Wei-Rong Chen; Mark K Tiede; Hosung Nam
Journal:  J Phon       Date:  2018-02-23
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.