z-logo
open-access-imgOpen Access
The clear speech intelligibility benefit for text-to-speech voices: Effects of speaking style and visual guise
Author(s) -
Nicholas B. Aoki,
Michelle Cohn,
Georgia Zellou
Publication year - 2022
Publication title -
jasa express letters
Language(s) - English
Resource type - Journals
ISSN - 2691-1191
DOI - 10.1121/10.0010274
Subject(s) - intelligibility (philosophy) , perception , psychology , style (visual arts) , speech perception , linguistics , speech recognition , computer science , art , philosophy , literature , epistemology , neuroscience
This study examined how speaking style and guise influence the intelligibility of text-to-speech (TTS) and naturally produced human voices. Results showed that TTS voices were less intelligible overall. Although using a clear speech style improved intelligibility for both human and TTS voices (using “newscaster” neural TTS), the clear speech effect was stronger for TTS voices. Finally, a visual device guise decreased intelligibility, regardless of voice type. The results suggest that both speaking style and visual guise affect intelligibility of human and TTS voices. Findings are discussed in terms of theories about the role of social information in speech perception.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom