

Open Access

Purging of silence for robust speaker identification in colossal database

Author(s) - P. Rama Koteswara Rao, Sunitha Ravi, Thotakura Haritha

Publication year - 2021

Publication title - International Journal of Power Electronics and Drive Systems/International Journal of Electrical and Computer Engineering

Language(s) - English

Resource type - Journals

eISSN - 2722-2578

pISSN - 2722-256X

DOI - 10.11591/ijece.v11i4.pp3084-3092

Subject(s) - computer science, support vector machine, feature extraction, speech recognition, pattern recognition (psychology), noise (video), feature (linguistics), artificial intelligence, speaker recognition, identification (biology), philosophy, linguistics, botany, image (mathematics), biology

The aim of this work is to develop an effective speaker recognition system for large data sets in noisy environments. A typical identification system has three main phases: feature extraction, training, and testing. During feature extraction, speaker-specific information is derived from the characteristics of the voice signal. To achieve accurate recognition in noisy environments, this work proposes effective methods for silence removal. Pitch and pitch-strength parameters are extracted as distinct features from the input speech spectrum. Multilinear principal component analysis (MPCA) is used to reduce the complexity of the parameter matrix. Silence removal using zero crossing rate (ZCR) and endpoint detection algorithm (EDA) methods is applied to the source utterance during the feature extraction phase. The resulting features are used in the subsequent classification phase, where identification is performed with support vector machine (SVM) algorithms. Forward looking stochastic (FOLOS), an efficient large-scale SVM algorithm, is employed to classify speakers. The evaluation results indicate that the proposed methods improve performance on large amounts of data in noisy environments.
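
The abstract does not give the details of the silence-removal step, so the following is only a minimal sketch of one common way to combine short-time energy with the zero crossing rate. It is not the authors' implementation. The function names (`frame_signal`, `zero_crossing_rate`, `remove_silence`), the 20 ms frame length, and the thresholds `energy_ratio` and `zcr_min` are all illustrative assumptions. A frame is kept if it is loud relative to the loudest frame, or if it is moderately loud and has a high ZCR, which is typical of unvoiced speech sounds.

```python
import numpy as np


def frame_signal(x, frame_len):
    """Split x into non-overlapping frames, dropping a trailing partial frame."""
    n_frames = len(x) // frame_len
    return x[: n_frames * frame_len].reshape(n_frames, frame_len)


def zero_crossing_rate(frames):
    """Fraction of adjacent sample pairs in each frame whose signs differ."""
    signs = np.where(frames >= 0, 1, -1)  # treat exact zeros as positive
    return np.mean(signs[:, 1:] != signs[:, :-1], axis=1)


def remove_silence(x, sr, frame_ms=20, energy_ratio=0.05, zcr_min=0.3):
    """Return (signal without silent frames, boolean keep-mask per frame).

    All thresholds are hypothetical defaults, not values from the paper.
    """
    x = np.asarray(x, dtype=float)
    frame_len = int(sr * frame_ms / 1000)
    frames = frame_signal(x, frame_len)
    if frames.shape[0] == 0:
        return x[:0], np.zeros(0, dtype=bool)

    energy = np.mean(frames ** 2, axis=1)  # short-time energy per frame
    zcr = zero_crossing_rate(frames)
    peak = energy.max()

    # Loud frames are kept outright (mostly voiced speech).
    loud = energy > energy_ratio * peak
    # Quieter frames are kept only if their ZCR is high (unvoiced speech).
    unvoiced = (energy > 0.1 * energy_ratio * peak) & (zcr > zcr_min)

    keep = loud | unvoiced
    return frames[keep].reshape(-1), keep


# Usage: a synthetic utterance with silence before and after a 200 Hz tone.
sr = 8000
tone = 0.5 * np.sin(2 * np.pi * 200 * np.arange(4000) / sr)
utterance = np.concatenate([np.zeros(4000), tone, np.zeros(4000)])
speech, keep = remove_silence(utterance, sr)
print(len(utterance), len(speech))
```

Relative thresholds (a fraction of the peak frame energy) keep the sketch independent of recording level, but they would need tuning on real noisy data, where background noise can itself have a high ZCR.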
