
Visual Speech Recognition using Convolutional Neural Network
Author(s) -
B Soundarya,
Ramaswamy Krishnaraj,
S. Mythili
Publication year - 2021
Publication title -
iop conference series. materials science and engineering
Language(s) - English
Resource type - Journals
eISSN - 1757-899X
pISSN - 1757-8981
DOI - 10.1088/1757-899x/1084/1/012020
Subject(s) - hidden markov model , computer science , speech recognition , convolutional neural network , reading (process) , artificial intelligence , pattern recognition (psychology) , image (mathematics) , linguistics , philosophy
Visual speech Recognition or Lip reading is used for teaching differently abled persons to communicate with others. It has been determining speech by looking at the movement of the lips. There are many practical difficulties in traditional lip-reading recognition systems like complicated image processing, difficult to teach classifiers and recognition processes will take a long time. In our paper, we proposed the use of convolutional neural networks- Hidden Markov model (CNN-HMM) in lip reading. Since CNN will assign importance to an input image, it is easier to see a difference among the images. HMM used to handle the dynamics of the image sequence. First we convert the incoming video into images and these images selected for further operation. HMM provides a highly reliable way of speech recognition.