Improved speech inversion using general regression neural network | Zendy

Shamima Najnin | Zendy; Bonny Banerjee | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Improved speech inversion using general regression neural network

Author(s) -

Shamima Najnin,

Bonny Banerjee

Publication year - 2015

Publication title -

the journal of the acoustical society of america

Language(s) - Uncategorized

Resource type - Journals

SCImago Journal Rank - 0.619

H-Index - 187

eISSN - 1520-8524

pISSN - 0001-4966

DOI - 10.1121/1.4929626

Subject(s) - timit , computer science , artificial neural network , speech recognition , inversion (geology) , pattern recognition (psychology) , artificial intelligence , regression , feature vector , mean squared error , mathematics , hidden markov model , statistics , paleontology , structural basin , biology

The problem of nonlinear acoustic to articulatory inversion mapping is investigated in the feature space using two models, the deep belief network (DBN) which is the state-of-the-art, and the general regression neural network (GRNN). The task is to estimate a set of articulatory features for improved speech recognition. Experiments with MOCHA-TIMIT and MNGU0 databases reveal that, for speech inversion, GRNN yields a lower root-mean-square error and a higher correlation than DBN. It is also shown that conjunction of acoustic and GRNN-estimated articulatory features yields state-of-the-art accuracy in broad class phonetic classification and phoneme recognition using less computational power.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research