z-logo
open-access-imgOpen Access
Audio-Visual Speech Recognition for People with Speech Disorders
Author(s) -
Elham S. Salama,
Reda A. El-Khoribi,
Mahmoud Shoman
Publication year - 2014
Publication title -
international journal of computer applications
Language(s) - English
Resource type - Journals
ISSN - 0975-8887
DOI - 10.5120/16770-6337
Subject(s) - computer science , speech recognition , mel frequency cepstrum , hidden markov model , discrete cosine transform , robustness (evolution) , artificial intelligence , speaker recognition , classifier (uml) , feature (linguistics) , feature vector , pattern recognition (psychology) , dysarthria , feature extraction , medicine , biochemistry , chemistry , linguistics , philosophy , radiology , image (mathematics) , gene
recognition of disorder people is a difficult task due to the lack of motor-control of the speech articulators. Multimodal speech recognition can be used to enhance the robustness of disordered speech. This paper introduces an automatic speech recognition system for people with dysarthria speech disorder based on both speech and visual components. The Mel-Frequency Cepestral Coefficients (MFCC) is used as features representing the acoustic speech signal. For the visual counterpart, the Discrete Cosine Transform (DCT) Coefficients are extracted from the speaker's mouth region. Face and mouth regions are detected using the Viola-Jones algorithm. The acoustic and visual input features are then concatenated on one feature vector. Then, the Hidden Markov Model (HMM) classifier is applied on the combined feature vector of acoustic and visual components. The system is tested on isolated English words spoken by disorder speakers from UA-Speech data. Results of the proposed system indicate that visual features are highly effective and can improve the accuracy to reach 7.91% for speaker dependent experiments and 3% for speaker independent experiments.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom