z-logo
open-access-imgOpen Access
Recent progresses in deep learning based acoustic models
Author(s) -
Dong Yu,
Jinyu Li
Publication year - 2017
Publication title -
ieee/caa journal of automatica sinica
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.277
H-Index - 41
eISSN - 2329-9274
pISSN - 2329-9266
DOI - 10.1109/jas.2017.7510508
Subject(s) - computing and processing , communication, networking and broadcast technologies , general topics for engineers , robotics and control systems
In this paper, we summarize recent progresses made in deep learning based acoustic models and the motivation and insights behind the surveyed techniques. We first discuss models such as recurrent neural networks U+0028 RNNs U+0029 and convolutional neural networks U+0028 CNNs U+0029 that can effectively exploit variablelength contextual information, and their various combination with other models. We then describe models that are optimized end-to-end and emphasize on feature representations learned jointly with the rest of the system, the connectionist temporal classification U+0028 CTC U+0029 criterion, and the attention-based sequenceto-sequence translation model. We further illustrate robustness issues in speech recognition systems, and discuss acoustic model adaptation, speech enhancement and separation, and robust training strategies. We also cover modeling techniques that lead to more efficient decoding and discuss possible future directions in acoustic model research.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom