Speech Segregation based on Pitch Track Correction and Music-Speech Classification | Zendy

Hanil Kim | Zendy; GilJin Jang | Zendy; Jeongsik Park | Zendy; Jinho Kim | Zendy; YoungTaek Oh | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Speech Segregation based on Pitch Track Correction and Music-Speech Classification

Author(s) -

Hanil Kim,

GilJin Jang,

Jeongsik Park,

Jinho Kim,

YoungTaek Oh

Publication year - 2012

Publication title -

advances in electrical and computer engineering

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.254

H-Index - 23

eISSN - 1844-7600

pISSN - 1582-7445

DOI - 10.4316/aece.2012.02003

Subject(s) - speech recognition , computer science , track (disk drive) , pitch detection algorithm , voice activity detection , speech processing , artificial intelligence , operating system

A novel approach for pitch track correction and music-speech classification is proposed in order to improve the performance of the speech segregation system. The proposed pitch track correction method adjusts unreliable pitch estimates from adjacent reliable pitch streaks, in contrast to the previous approach using a single pitch streak which is the longest among the reliable pitch streaks in a sentence. The proposed music and speech classification method finds continuous pitch streaks of the mixture, and labels each streak as music-dominant or speech-dominant based on the observation that music pitch seldom changes in a short-time period whereas speech pitch fluctuates a lot. The speech segregation results for mixtures of speech and various competing sound sources demonstrated that the proposed methods are superior to the conventional method, especially for mixtures of speech and music signals.close

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research