z-logo
open-access-imgOpen Access
Improved speech inversion using general regression neural network
Author(s) -
Shamima Najnin,
Bonny Banerjee
Publication year - 2015
Publication title -
the journal of the acoustical society of america
Language(s) - Uncategorized
Resource type - Journals
SCImago Journal Rank - 0.619
H-Index - 187
eISSN - 1520-8524
pISSN - 0001-4966
DOI - 10.1121/1.4929626
Subject(s) - timit , computer science , artificial neural network , speech recognition , inversion (geology) , pattern recognition (psychology) , artificial intelligence , regression , feature vector , mean squared error , mathematics , hidden markov model , statistics , paleontology , structural basin , biology
The problem of nonlinear acoustic to articulatory inversion mapping is investigated in the feature space using two models, the deep belief network (DBN) which is the state-of-the-art, and the general regression neural network (GRNN). The task is to estimate a set of articulatory features for improved speech recognition. Experiments with MOCHA-TIMIT and MNGU0 databases reveal that, for speech inversion, GRNN yields a lower root-mean-square error and a higher correlation than DBN. It is also shown that conjunction of acoustic and GRNN-estimated articulatory features yields state-of-the-art accuracy in broad class phonetic classification and phoneme recognition using less computational power.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom