FPGA Implementation of a Pipelined Gaussian Calculation for HMM-Based Large Vocabulary Speech Recognition
Author(s) -
Richard Veitch,
Louis-Marie Aubert,
Roger Woods,
Scott Fischaber
Publication year - 2010
Publication title -
international journal of reconfigurable computing
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.236
H-Index - 16
eISSN - 1687-7209
pISSN - 1687-7195
DOI - 10.1155/2011/697080
Subject(s) - computer science , field programmable gate array , hidden markov model , scalability , speech recognition , gaussian , set (abstract data type) , virtex , vocabulary , parallel computing , computer hardware , programming language , database , linguistics , philosophy , physics , quantum mechanics
A scalable large vocabulary, speaker independent speech recognitionsystem is being developed using Hidden MarkovModels (HMMs) for acoustic modeling and a Weighted FiniteState Transducer (WFST) to compile sentence, word,and phoneme models. The system comprises a softwarebackend search and an FPGA-based Gaussian calculationwhich are covered here. In this paper, we present an efficientpipelined design implemented both as an embedded peripheraland as a scalable, parallel hardware accelerator. Both architectureshave been implemented on an Alpha Data XRC-5T1, reconfigurable computer housing a Virtex 5 SX95TFPGA. The core has been tested and is capable of calculatinga full set of Gaussian results from 3825 acoustic modelsin 9.03 ms which coupled with a backend search of 5000words has provided an accuracy of over 80%. Parallel implementationshave been designed with up to 32 cores andhave been successfully implemented with a clock frequencyof 133 MHz
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom