Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A corpus of audio-visual Lombard speech with frontal and profile views.

Literature DB >> 29960497

A corpus of audio-visual Lombard speech with frontal and profile views.

Najwa Alghamdi¹, Steve Maddock¹, Ricard Marxer¹, Jon Barker¹, Guy J Brown¹.

Abstract

This paper presents a bi-view (front and side) audiovisual Lombard speech corpus, which is freely available for download. It contains 5400 utterances (2700 Lombard and 2700 plain reference utterances), produced by 54 talkers, with each utterance in the dataset following the same sentence format as the audiovisual "Grid" corpus [Cooke, Barker, Cunningham, and Shao (2006). J. Acoust. Soc. Am. 120(5), 2421-2424]. Analysis of this dataset confirms previous research, showing prominent acoustic, phonetic, and articulatory speech modifications in Lombard speech. In addition, gender differences are observed in the size of Lombard effect. Specifically, female talkers exhibit a greater increase in estimated vowel duration and a greater reduction in F2 frequency.

Mesh：

Year: 2018 PMID： 29960497 DOI： 10.1121/1.5042758

Source DB: PubMed Journal: J Acoust Soc Am ISSN： 0001-4966 Impact factor: 1.840

Keyword Cloud
Cited

1 in total

1. Analysis and Calibration of Lombard Effect and Whisper for Speaker Recognition.

Authors: Finnian Kelly; John H L Hansen
Journal: IEEE/ACM Trans Audio Speech Lang Process Date: 2021-01-21

1 in total