A corpus of audio-visual Lombard speech with frontal and profile views

Open Access

Author(s) - Najwa Alghamdi, Steve Maddock, Ricard Marxer, Jon Barker, Guy J. Brown

Publication year - 2018

Publication title - The Journal of the Acoustical Society of America

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.619

H-Index - 187

eISSN - 1520-8524

pISSN - 0001-4966

DOI - 10.1121/1.5042758

Subject(s) - utterance, vowel, sentence, speech recognition, computer science, acoustics, linguistics, natural language processing, physics, philosophy

This paper presents a bi-view (front and side) audiovisual Lombard speech corpus, which is freely available for download. It contains 5400 utterances (2700 Lombard and 2700 plain reference utterances), produced by 54 talkers, with each utterance in the dataset following the same sentence format as the audiovisual "Grid" corpus [Cooke, Barker, Cunningham, and Shao (2006). J. Acoust. Soc. Am. 120(5), 2421-2424]. Analysis of this dataset confirms previous research, showing prominent acoustic, phonetic, and articulatory speech modifications in Lombard speech. In addition, gender differences are observed in the size of the Lombard effect. Specifically, female talkers exhibit a greater increase in estimated vowel duration and a greater reduction in F2 frequency.
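
The sentence format and the corpus counts mentioned in the abstract can be illustrated with a minimal Python sketch. The word lists follow the six-slot grammar of the original Grid corpus (command, color, preposition, letter, digit, adverb). The function names are hypothetical, and the even split of utterances across talkers is an assumption that the abstract does not state; nothing here reproduces the authors' actual sentence selection.

```python
import random
import string

# Word classes of the Grid sentence grammar (Cooke et al., 2006).
COMMANDS = ["bin", "lay", "place", "set"]
COLORS = ["blue", "green", "red", "white"]
PREPOSITIONS = ["at", "by", "in", "with"]
LETTERS = [c for c in string.ascii_uppercase if c != "W"]  # W is excluded
DIGITS = ["zero", "one", "two", "three", "four",
          "five", "six", "seven", "eight", "nine"]
ADVERBS = ["again", "now", "please", "soon"]

SLOTS = (COMMANDS, COLORS, PREPOSITIONS, LETTERS, DIGITS, ADVERBS)


def random_grid_sentence(rng: random.Random) -> str:
    """Draw one word per slot to form a Grid-style sentence (hypothetical helper)."""
    return " ".join(rng.choice(words) for words in SLOTS)


def utterances_per_talker(total: int, talkers: int) -> int:
    """Per-talker count, ASSUMING utterances are split evenly across talkers."""
    if total % talkers != 0:
        raise ValueError("total is not evenly divisible by the number of talkers")
    return total // talkers


if __name__ == "__main__":
    rng = random.Random(0)
    print(random_grid_sentence(rng))
    # Counts from the abstract: 5400 utterances, 2700 per condition, 54 talkers.
    print(utterances_per_talker(5400, 54), "utterances per talker (assumed even split)")
    print(utterances_per_talker(2700, 54), "per condition per talker (assumed even split)")
```

Sampling each slot independently is only the simplest way to produce a well-formed sentence such as "place red at G nine now"; a real recording script might balance or constrain the word choices differently.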
