Landmark-Guided Segmental Speech Decoding for Continuous Mandarin Speech Recognition
Author(s) - Chao Hao, Song Cheng
Publication year - 2016
Publication title - Journal of Information Processing Systems
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.288
H-Index - 23
eISSN - 2092-805X
pISSN - 1976-913X
DOI - 10.3745/jips.03.0052
Subject(s) - computer science, speech recognition, mandarin chinese, landmark, decoding methods, artificial intelligence, natural language processing, linguistics, telecommunications, philosophy
In this paper, we propose a framework that incorporates landmarks into a segment-based Mandarin speech recognition system. In this method, landmarks provide boundary information and phonetic class information, and this information is used to direct the decoding process. To validate the method, two kinds of landmarks that can be reliably detected are used to direct the decoding process of a segment model (SM) based Mandarin large vocabulary continuous speech recognition (LVCSR) system. Experimental results show that decoding time can be reduced by about 30% without an obvious decrease in recognition accuracy, which demonstrates the potential of the method.
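The idea of letting landmarks direct a segmental decoder can be illustrated with a minimal sketch. This is not the authors' implementation: the function name segmental_decode, the toy scoring function, and the landmark position are all hypothetical, and the sketch simply assumes that detected boundary landmarks restrict where segments may start and end, while an optional phonetic class check restricts which labels are tried for a segment. Under those assumptions, a segmental Viterbi search evaluates far fewer (start, end, label) hypotheses.

```python
from typing import Callable, List, Optional, Set, Tuple

Segment = Tuple[int, int, str]  # (start_frame, end_frame, label)


def segmental_decode(
    num_frames: int,
    labels: List[str],
    score: Callable[[int, int, str], float],
    max_len: int,
    allowed_boundaries: Optional[Set[int]] = None,
    allowed_labels: Optional[Callable[[int, int, str], bool]] = None,
) -> Tuple[List[Segment], int]:
    """Toy segmental Viterbi search.

    best[t] is the best score of any segmentation of frames [0, t).
    If allowed_boundaries is given (hypothetical boundary landmarks),
    segments may only start or end at those frames, or at 0 / num_frames.
    If allowed_labels is given (hypothetical phonetic class landmarks),
    labels it rejects are never scored.
    Returns the best segmentation and the number of segment scores computed.
    """
    neg_inf = float("-inf")
    best = [neg_inf] * (num_frames + 1)
    back: List[Optional[Tuple[int, str]]] = [None] * (num_frames + 1)
    best[0] = 0.0
    evaluations = 0

    def is_boundary(t: int) -> bool:
        if allowed_boundaries is None:
            return True
        return t in (0, num_frames) or t in allowed_boundaries

    for end in range(1, num_frames + 1):
        if not is_boundary(end):
            continue
        for start in range(max(0, end - max_len), end):
            if best[start] == neg_inf or not is_boundary(start):
                continue
            for label in labels:
                if allowed_labels is not None and not allowed_labels(start, end, label):
                    continue
                evaluations += 1
                candidate = best[start] + score(start, end, label)
                if candidate > best[end]:
                    best[end] = candidate
                    back[end] = (start, label)

    # Trace back from the final frame to recover the segmentation.
    path: List[Segment] = []
    t = num_frames
    while t > 0 and back[t] is not None:
        start, label = back[t]
        path.append((start, t, label))
        t = start
    path.reverse()
    return path, evaluations


# Toy example: 5 frames, true segmentation is "a" on [0, 3) and "b" on [3, 5).
TRUE_SEGMENTS = {(0, 3, "a"), (3, 5, "b")}


def toy_score(start: int, end: int, label: str) -> float:
    # Stand-in for a segment model score: reward true segments, penalize others.
    return 1.0 if (start, end, label) in TRUE_SEGMENTS else -1.0


full_path, full_evals = segmental_decode(5, ["a", "b"], toy_score, max_len=3)
guided_path, guided_evals = segmental_decode(
    5, ["a", "b"], toy_score, max_len=3, allowed_boundaries={3}
)
print(full_path, full_evals)
print(guided_path, guided_evals)
```

In this toy setting both searches return the same segmentation, but the landmark-guided search scores fewer segment hypotheses. The size of the saving here is an artifact of the toy data and says nothing about the roughly 30% reduction reported in the abstract.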