z-logo
open-access-imgOpen Access
The Zero Resource Speech Challenge 2019: TTS Without T
Author(s) -
Ewan Dunbar,
Robin Algayres,
Julien Karadayi,
Mathieu Bernard,
Juan Benjumea,
Xuan-Nga Cao,
Lucie Miskic,
Charlotte Dugrain,
Lucas Ondel,
Alan W. Black,
Laurent Besacier,
Sakriani Sakti,
Emmanuel Dupoux
Publication year - 2019
Publication title -
interspeech 2022
Language(s) - English
Resource type - Conference proceedings
DOI - 10.21437/interspeech.2019-2904
Subject(s) - computer science , speech synthesis , speech recognition , zero (linguistics) , baseline (sea) , natural language processing , resource (disambiguation) , artificial intelligence , linguistics , computer network , philosophy , oceanography , geology
We present the Zero Resource Speech Challenge 2019, which proposes to build a speech synthesizer without any text or phonetic labels: hence, TTS without T (text-to-speech without text). We provide raw audio for a target voice in an unknown language (the Voice dataset), but no alignment, text or labels. Participants must discover subword units in an unsupervised way (using the Unit Discovery dataset) and align them to the voice recordings in a way that works best for the purpose of synthesizing novel utterances from novel speakers, similar to the target speakeru0027s voice. We describe the metrics used for evaluation, a baseline system consisting of unsupervised subword unit discovery plus a standard TTS system, and a topline TTS using gold phoneme transcriptions. We present an overview of the 19 submitted systems from 11 teams and discuss the main results.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom