On Semi-Supervised LF-MMI Training of Acoustic Models with Limited Data | Zendy

Imran Sheikh | Zendy; Emmanuel Vincent | Zendy; Irina Illina | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

On Semi-Supervised LF-MMI Training of Acoustic Models with Limited Data

Author(s) -

Imran Sheikh,

Emmanuel Vincent,

Irina Illina

Publication year - 2020

Publication title -

interspeech 2022

Language(s) - English

Resource type - Conference proceedings

DOI - 10.21437/interspeech.2020-2242

Subject(s) - computer science , labeled data , training set , word error rate , detector , artificial intelligence , mutual information , semi supervised learning , speech recognition , word (group theory) , error analysis , pattern recognition (psychology) , mean squared prediction error , machine learning , mathematics , telecommunications , geometry

This work investigates semi-supervised training of acoustic models (AM) with the lattice-free maximum mutual information (LF-MMI) objective in practically relevant scenarios with a limited amount of labeled in-domain data. An error detection driven semi-supervised AM training approach is proposed, in which an error detector controls the hypothesized transcriptions or lattices used as LF-MMI training targets on additional unlabeled data. Under this approach, our first method uses a single error-tagged hypothesis whereas our second method uses a modified supervision lattice. These methods are evaluated and compared with existing semi-supervised AM training methods in three different matched or mismatched, limited data setups. Word error recovery rates of 28 to 89% are reported.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research