Analysis of MFCC and Multitaper MFCC Feature Extraction Methods
Author(s) -
G. Rupali,
Sanjiv Bhatia
Publication year - 2015
Publication title -
international journal of computer applications
Language(s) - English
Resource type - Journals
ISSN - 0975-8887
DOI - 10.5120/ijca2015906883
Subject(s) - multitaper , mel frequency cepstrum , computer science , feature extraction , extraction (chemistry) , feature (linguistics) , pattern recognition (psychology) , speech recognition , artificial intelligence , linguistics , philosophy , chemistry , chromatography
In speech & audio applications, short-term signal spectrum is often represented using mel-freuency cepstral coefficient (MFCC) computed from a windowed discrete Fourier transform (DFT). Windowing reduces spectral leakage but variance of the spectrum estimate remains high. An extension to windowed DFT is called multitaper method which uses multiple time domain windows which are called as tapers with frequency domain averaging. Then detailed statistical analysis of MFCC bias & variance is done. For speaker verification the extracted feature is used to design a model using classifier (GMM), which implements likelihood ratio test to decide whether to accept or deny the registered speaker.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom