Speech recognition systems on the Cell Broadband Engine processor
Author(s) -
Yingchun Liu,
Henry E. Jones,
Shefali Vaidya,
Michael Perrone,
B. Tydlitat,
A.K. Nanda
Publication year - 2007
Publication title -
ibm journal of research and development
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.47
H-Index - 95
eISSN - 2151-8556
pISSN - 0018-8646
DOI - 10.1147/rd.515.0583
Subject(s) - computer science , pipeline (software) , speech recognition , speech processing , voice activity detection , process (computing) , computer hardware , operating system
In this paper we describe our design, implementation, and initial results of a prototype connected-phoneme—based speech recognition system on the Cell Broadband Engine™ (Cell/B.E.) processor. Automated speech recognition decodes speech samples into plaintext (other representations are possible) and must process samples at real-time rates. Fortunately, the computational tasks involved in this pipeline are highly data parallel and can receive significant hardware acceleration from vector-streaming architectures such as the Cell/B.E. Architecture. Identifying and exploiting these parallelism opportunities is challenging and critical to improving system performance. From our initial performance timings, we observed that a single Cell/B.E. processor can recognize speech from thousands of simultaneous voice channels in real time—a channel density that is orders of magnitude greater than the capacity of existing software speech recognizers based on CPUs (central processing units). This result emphasizes the potential for Cell/B.E. processor-based speech recognition and will likely lead to the development of production speech systems using Cell/B.E. processor clusters.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom