
Speech Segregation based on Pitch Track Correction and Music-Speech Classification
Author(s) -
Han-Gyu Kim,
Gil-Jin Jang,
Jeong-Sik Park,
Ji-Hwan Kim,
Yung-Hwan Oh
Publication year - 2012
Publication title -
Advances in Electrical and Computer Engineering
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.254
H-Index - 23
eISSN - 1844-7600
pISSN - 1582-7445
DOI - 10.4316/aece.2012.02003
Subject(s) - speech recognition , computer science , track (disk drive) , pitch detection algorithm , voice activity detection , speech processing , artificial intelligence , operating system
A novel approach to pitch track correction and music-speech classification is proposed to improve the performance of a speech segregation system. The proposed pitch track correction method adjusts unreliable pitch estimates using adjacent reliable pitch streaks, in contrast to the previous approach, which uses only a single pitch streak, the longest among the reliable pitch streaks in a sentence. The proposed music-speech classification method finds continuous pitch streaks in the mixture and labels each streak as music-dominant or speech-dominant, based on the observation that music pitch seldom changes over a short time period whereas speech pitch fluctuates considerably. Speech segregation results for mixtures of speech and various competing sound sources demonstrate that the proposed methods outperform the conventional method, especially for mixtures of speech and music signals.
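
The streak-labeling idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the actual decision rule, so the use of pitch standard deviation in semitones, the threshold `max_music_std_semitones`, the minimum streak length `min_len`, and the function names `find_streaks` and `label_streak` are all assumptions made here for illustration. A frame with pitch 0 is assumed to mean "no pitch detected".

```python
import math
from statistics import pstdev


def find_streaks(pitch_hz, min_len=3):
    """Return (start, end) index pairs of consecutive pitched frames.

    A frame counts as pitched when its value is > 0 (assumed convention).
    `end` is exclusive. Streaks shorter than `min_len` frames are dropped.
    """
    streaks = []
    start = None
    for i, f0 in enumerate(pitch_hz):
        if f0 > 0:
            if start is None:
                start = i
        elif start is not None:
            if i - start >= min_len:
                streaks.append((start, i))
            start = None
    # Close a streak that runs to the end of the track.
    if start is not None and len(pitch_hz) - start >= min_len:
        streaks.append((start, len(pitch_hz)))
    return streaks


def label_streak(pitch_hz, start, end, max_music_std_semitones=0.3):
    """Label one streak as 'music' or 'speech' from its pitch variation.

    Hypothetical rule: a nearly flat pitch contour is treated as
    music-dominant, a fluctuating one as speech-dominant. The threshold
    is an illustrative value, not one taken from the paper.
    """
    semitones = [12.0 * math.log2(f0) for f0 in pitch_hz[start:end]]
    spread = pstdev(semitones)
    return "music" if spread <= max_music_std_semitones else "speech"


# Toy pitch track in Hz: a flat streak followed by a fluctuating one.
track = [0, 0, 220, 220, 221, 220, 0, 0, 180, 195, 210, 190, 170, 0]
labels = [(s, e, label_streak(track, s, e)) for s, e in find_streaks(track)]
```

Measuring variation in semitones rather than in Hz keeps the threshold comparable across low and high pitch ranges; a real system would tune the threshold and streak length on data.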