A corpus of audio-visual Lombard speech with frontal and profile views

Abstract : This paper presents a bi-view (front and side) audiovisual Lombard speech corpus, which is freely available for download. It contains 5400 utterances (2700 Lombard and 2700 plain reference utterances), produced by 54 talkers, with each utterance in the dataset following the same sentence format as the audiovisual “Grid” corpus [Cooke, Barker, Cunningham, and Shao (2006). J. Acoust. Soc. Am. 120(5), 2421–2424]. Analysis of this dataset confirms previous research, showing prominent acoustic, phonetic, and articulatory speech modifications in Lombard speech. In addition, gender differences are observed in the size of Lombard effect. Specifically, female talkers exhibit a greater increase in estimated vowel duration and a greater reduction in F2 frequency.
Type de document :
Article dans une revue
Journal of the Acoustical Society of America, Acoustical Society of America, 2018, 143 (6), pp.EL523-EL529. 〈10.1121/1.5042758〉
Liste complète des métadonnées

Littérature citée [4 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01867824
Contributeur : Ricard Marxer <>
Soumis le : mercredi 5 septembre 2018 - 11:08:30
Dernière modification le : vendredi 14 septembre 2018 - 01:17:50

Fichier

 Accès restreint
Fichier visible le : 2018-12-26

Connectez-vous pour demander l'accès au fichier

Identifiants

Collections

Citation

Najwa Alghamdi, Steve Maddock, Ricard Marxer, Jon Barker, Guy Brown. A corpus of audio-visual Lombard speech with frontal and profile views. Journal of the Acoustical Society of America, Acoustical Society of America, 2018, 143 (6), pp.EL523-EL529. 〈10.1121/1.5042758〉. 〈hal-01867824〉

Partager

Métriques

Consultations de la notice

12