
Efficient inception V2 based deep convolutional neural network for real‐time hand action recognition
Author(s) -
Bose S. Rubin,
Kumar V. Sathiesh
Publication year - 2020
Publication title -
iet image processing
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.401
H-Index - 45
eISSN - 1751-9667
pISSN - 1751-9659
DOI - 10.1049/iet-ipr.2019.0985
Subject(s) - convolutional neural network , computer science , artificial intelligence , computation , pattern recognition (psychology) , intersection (aeronautics) , data set , set (abstract data type) , detector , single shot , deep learning , training set , algorithm , optics , telecommunications , engineering , physics , programming language , aerospace engineering
The most effective and accurate deep convolutional neural network (faster region‐based convolutional neural network (Faster R‐CNN) Inception V2 model, single shot detector (SSD) Inception V2 model) based architectures for real‐time hand gesture recognition is proposed. The proposed models are tested on standard data sets (NUS hand posture data set‐II, Senz‐3D) and custom‐developed (MITI hand data set (MITI‐HD)) data set. The performance metrics are analysed for intersection over union (IoU) ranges between 0.5 and 0.95. IoU value of 0.5 resulted in higher precision compared to other IoU values considered (0.5:0.95, 0.75). It is observed that the Faster R‐CNN Inception V2 model resulted in higher precision (0.990 for AP all , IoU = 0.5) compared to SSD Inception V2 model (0.984 for all ) for MITI‐HD 160. The computation time of Faster R‐CNN Inception V2 is higher compared to SSD Inception V2 model and also resulted in less number of mispredictions. Increasing the size of samples (MITI‐HD 300) resulted in improvement of AP all = 0.991. Improvement in large (APlarge) and medium (APmedium) size detections are not significant when compared to small (APsmall) detections. It is concluded that the Faster R‐CNN Inception V2 model is highly suitable for real‐time hand gesture recognition system under unconstrained environments.