Literature DB >> 17297806

Automatic acoustic synthesis of human-like laughter.

Shiva Sundaram1, Shrikanth Narayanan.   

Abstract

A technique to synthesize laughter based on time-domain behavior of real instances of human laughter is presented. In the speech synthesis community, interest in improving the expressive quality of synthetic speech has grown considerably. While the focus has been on the linguistic aspects, such as precise control of speech intonation to achieve desired expressiveness, inclusion of nonlinguistic cues could further enhance the expressive quality of synthetic speech. Laughter is one such cue used for communicating, say, a happy or amusing context. It can be generated in many varieties and qualities: from a short exhalation to a long full-blown episode. Laughter is modeled at two levels, the overall episode level and at the local call level. The first attempts to capture the overall temporal behavior in a parametric model based on the equations that govern the simple harmonic motion of a mass-spring system is presented. By changing a set of easily available parameters, the authors are able to synthesize a variety of laughter. At the call level, the authors relied on a standard linear prediction based analysis-synthesis model. Results of subjective tests to assess the acceptability and naturalness of the synthetic laughter relative to real human laughter samples are presented.

Entities:  

Mesh:

Year:  2007        PMID: 17297806     DOI: 10.1121/1.2390679

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  1 in total

1.  Detecting paralinguistic events in audio stream using context in features and probabilistic decisions.

Authors:  Rahul Gupta; Kartik Audhkhasi; Sungbok Lee; Shrikanth Narayanan
Journal:  Comput Speech Lang       Date:  2015-09-11       Impact factor: 1.899

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.