Toward Robust Action Retrieval in Video
Author(s) -
Samy Bakheet,
Ayoub Al-Hamadi,
Bernd Michaelis,
Usama Sayed
Publication year - 2010
Language(s) - English
Resource type - Conference proceedings
DOI - 10.5244/c.24.44
Subject(s) - computer science , artificial intelligence , histogram , action recognition , classifier (uml) , video retrieval , artificial neural network , task (project management) , fuzzy logic , histogram of oriented gradients , machine learning , pattern recognition (psychology) , sigmoid function , computer vision , image (mathematics) , management , economics , class (philosophy)
Retrieving human actions from video databases is a paramount but challenging task in computer vision. In this work, we develop such a framework for robustly recognizing human actions in video sequences. The contribution of the paper is twofold. First a reliable neural model, the Multi-level Sigmoidal Neural Network (MSNN) as a classifier for the task of action recognition is presented. Second we unfold how the temporal shape variations can be accurately captured based on both temporal self-similarities and fuzzy log-polar histograms. When the method is evaluated on the popular KTH dataset, an average recognition rate of 94.3% is obtained. Such results have the potential to compare very favorably to those of other investigators published in the literature. Further the approach is amenable for real-time applications due to its low computational requirements.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom