Literature DB >> 16158668

Analysis and synthesis of the three-dimensional movements of the head, face, and hand of a speaker using cued speech.

Guillaume Gibert1, Gérard Bailly, Denis Beautemps, Frederic Elisei, Rémi Brun.   

Abstract

In this paper we present efforts for characterizing the three dimensional (3-D) movements of the right hand and the face of a French female speaker during the audiovisual production of cued speech. The 3-D trajectories of 50 hand and 63 facial flesh points during the production of 238 utterances were analyzed. These utterances were carefully designed to cover all possible diphones of the French language. Linear and nonlinear statistical models of the articulations and the postures of the hand and the face have been developed using separate and joint corpora. Automatic recognition of hand and face postures at targets was performed to verify a posteriori that key hand movements and postures imposed by cued speech had been well realized by the subject. Recognition results were further exploited in order to study the phonetic structure of cued speech, notably the phasing relations between hand gestures and sound production. The hand and face gestural scores are studied in reference with the acoustic segmentation. A first implementation of a concatenative audiovisual text-to-cued speech synthesis system is finally described that employs this unique and extensive data on cued speech in action.

Mesh:

Year:  2005        PMID: 16158668     DOI: 10.1121/1.1944587

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  3 in total

1.  A Method for Transcribing the Manual Components of Cued Speech.

Authors:  Jean C Krause; Katherine A Pelley-Lopez; Morgan P Tessler
Journal:  Speech Commun       Date:  2011-03-01       Impact factor: 2.017

2.  Transforming an embodied conversational agent into an efficient talking head: from keyframe-based animation to multimodal concatenation synthesis.

Authors:  Guillaume Gibert; Kirk N Olsen; Yvonne Leung; Catherine J Stevens
Journal:  Comput Cogn Sci       Date:  2015-09-08

3.  Recording and analysis of head movements, interaural level and time differences in rooms and real-world listening scenarios.

Authors:  Alan W Boyd; William M Whitmer; Michael A Akeroyd
Journal:  ISRA 2013 (2013)       Date:  2014
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.