A Statistical Approach for Voiced Speech Detection
Author(s) -
Mihir Narayan Mohanty,
Aurobinda Routray,
Prithviraj Kabisatpathy
Publication year - 2011
Publication title -
international journal of computer and communication technology
Language(s) - English
Resource type - Journals
eISSN - 2231-0371
pISSN - 0975-7449
DOI - 10.47893/ijcct.2011.1104
Subject(s) - estimator , computer science , voice activity detection , speech recognition , a priori and a posteriori , noise (video) , coding (social sciences) , speech processing , key (lock) , statistical hypothesis testing , likelihood ratio test , speech coding , linear predictive coding , background noise , speech enhancement , signal to noise ratio (imaging) , algorithm , statistics , artificial intelligence , mathematics , telecommunications , philosophy , epistemology , image (mathematics) , computer security
Detection of Voice in speech signal is a challenging problem in developing high-performance systems used in noisy environments. In this paper, we present an efficient algorithm for robust voiced speech detection and for the application to variable-rate speech coding. The key idea of the algorithm is considering speech energy and zero crossings rate (ZCR) information simultaneously when processing speech signals and finding the end point of the signal. Next to it a decision rule and a background noise statistics estimator, by applying a statistical model. A robust decision rule is derived from the generalized likelihood ratio test (LRT) by assuming that the noise statistics are known a priori. The algorithm is most efficient for the time-varying noise. According to our simulation results, the proposed algorithm shows significantly better performance in low signal-to-noise ratio and in noisy environments.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom