z-logo
open-access-imgOpen Access
On Semi-Supervised LF-MMI Training of Acoustic Models with Limited Data
Author(s) -
Imran Sheikh,
Emmanuel Vincent,
Irina Illina
Publication year - 2020
Publication title -
interspeech 2022
Language(s) - English
Resource type - Conference proceedings
DOI - 10.21437/interspeech.2020-2242
Subject(s) - computer science , labeled data , training set , word error rate , detector , artificial intelligence , mutual information , semi supervised learning , speech recognition , word (group theory) , error analysis , pattern recognition (psychology) , mean squared prediction error , machine learning , mathematics , telecommunications , geometry
This work investigates semi-supervised training of acoustic models (AM) with the lattice-free maximum mutual information (LF-MMI) objective in practically relevant scenarios with a limited amount of labeled in-domain data. An error detection driven semi-supervised AM training approach is proposed, in which an error detector controls the hypothesized transcriptions or lattices used as LF-MMI training targets on additional unlabeled data. Under this approach, our first method uses a single error-tagged hypothesis whereas our second method uses a modified supervision lattice. These methods are evaluated and compared with existing semi-supervised AM training methods in three different matched or mismatched, limited data setups. Word error recovery rates of 28 to 89% are reported.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom