Literature DB >> 25122851

A video, text, and speech-driven realistic 3-d virtual head for human-machine interface.

Jun Yu, Zeng-Fu Wang.   

Abstract

A multiple inputs-driven realistic facial animation system based on 3-D virtual head for human-machine interface is proposed. The system can be driven independently by video, text, and speech, thus can interact with humans through diverse interfaces. The combination of parameterized model and muscular model is used to obtain a tradeoff between computational efficiency and high realism of 3-D facial animation. The online appearance model is used to track 3-D facial motion from video in the framework of particle filtering, and multiple measurements, i.e., pixel color value of input image and Gabor wavelet coefficient of illumination ratio image, are infused to reduce the influence of lighting and person dependence for the construction of online appearance model. The tri-phone model is used to reduce the computational consumption of visual co-articulation in speech synchronized viseme synthesis without sacrificing any performance. The objective and subjective experiments show that the system is suitable for human-machine interaction.

Entities:  

Mesh:

Year:  2014        PMID: 25122851     DOI: 10.1109/TCYB.2014.2341737

Source DB:  PubMed          Journal:  IEEE Trans Cybern        ISSN: 2168-2267            Impact factor:   11.448


  2 in total

1.  Exploiting Lightweight Statistical Learning for Event-Based Vision Processing.

Authors:  Cong Shi; Jiajun Li; Ying Wang; Gang Luo
Journal:  IEEE Access       Date:  2018-04-04       Impact factor: 3.367

2.  Construction of self-learning classroom history teaching mode based on human-computer interaction emotion recognition.

Authors:  Changwei Ji; Shuyan Zhao
Journal:  Front Psychol       Date:  2022-07-27
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.