A Computationally Efficient Mel-Filter Bank VAD Algorithm for Distributed Speech Recognition Systems
Author(s) -
Damjan Vlaj,
Bojan Kotnik,
Bogomir Horvat,
Zdravko Kačič
Publication year - 2005
Publication title -
eurasip journal on advances in signal processing
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.317
H-Index - 88
eISSN - 1687-6180
pISSN - 1687-6172
DOI - 10.1155/asp.2005.487
Subject(s) - computer science , speech recognition , filter (signal processing) , algorithm , signal (programming language) , voice activity detection , transmission (telecommunications) , filter bank , noise (video) , speech processing , telecommunications , artificial intelligence , computer vision , image (mathematics) , programming language
This paper presents a novel computationally efficient voice activity detection (VAD) algorithm and emphasizes the importance of such algorithms in distributed speech recognition (DSR) systems. When using VAD algorithms in telecommunication systems, the required capacity of the speech transmission channel can be reduced if only the speech parts of the signal are transmitted. A similar objective can be adopted in DSR systems, where the nonspeech parameters are not sent over the transmission channel. A novel approach is proposed for VAD decisions based on mel-filter bank (MFB) outputs with the so-called Hangover criterion. Comparative tests are presented between the presented MFB VAD algorithm and three VAD algorithms used in the G.729, G.723.1, and DSR (advanced front-end) Standards. These tests were made on the Aurora 2 database, with different signal-to-noise (SNRs) ratios. In the speech recognition tests, the proposed MFB VAD outperformed all the three VAD algorithms used in the standards by 14.19% relative (G.723.1 VAD), by 12.84% relative (G.729 VAD), and by 4.17% relative (DSR VAD) in all SNRs
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom