Literature DB >> 29960497

A corpus of audio-visual Lombard speech with frontal and profile views.

Najwa Alghamdi1, Steve Maddock1, Ricard Marxer1, Jon Barker1, Guy J Brown1.   

Abstract

This paper presents a bi-view (front and side) audiovisual Lombard speech corpus, which is freely available for download. It contains 5400 utterances (2700 Lombard and 2700 plain reference utterances), produced by 54 talkers, with each utterance in the dataset following the same sentence format as the audiovisual "Grid" corpus [Cooke, Barker, Cunningham, and Shao (2006). J. Acoust. Soc. Am. 120(5), 2421-2424]. Analysis of this dataset confirms previous research, showing prominent acoustic, phonetic, and articulatory speech modifications in Lombard speech. In addition, gender differences are observed in the size of Lombard effect. Specifically, female talkers exhibit a greater increase in estimated vowel duration and a greater reduction in F2 frequency.

Mesh:

Year:  2018        PMID: 29960497     DOI: 10.1121/1.5042758

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  1 in total

1.  Analysis and Calibration of Lombard Effect and Whisper for Speaker Recognition.

Authors:  Finnian Kelly; John H L Hansen
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2021-01-21
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.