
Multiple subsequence combination in human action recognition
Author(s) -
Onofri Leonardo,
Soda Paolo,
Iannello Giulio
Publication year - 2014
Publication title -
iet computer vision
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.38
H-Index - 37
eISSN - 1751-9640
pISSN - 1751-9632
DOI - 10.1049/iet-cvi.2013.0015
Subject(s) - subsequence , computer science , longest common subsequence problem , artificial intelligence , longest increasing subsequence , pattern recognition (psychology) , object (grammar) , sequence (biology) , computer vision , action (physics) , action recognition , scale (ratio) , mathematics , class (philosophy) , algorithm , geography , mathematical analysis , physics , quantum mechanics , bounded function , cartography , biology , genetics
Human action recognition is an active research area with applications in several domains such as visual surveillance, video retrieval and human–computer interaction. Current approaches assign action labels to video streams considering the whole video as a single sequence but, in some cases, the large variability between frames may lead to misclassifications. The authors propose a multiple subsequence combination (MSC) method that divides the video into several consecutive subsequences. It applies part‐based and bag of visual words approaches to classify each subsequence. Then, it combines subsequence labels to assign an action label to the video. The proposed approach was tested on the KTH, UCF sports, Youtube and Robo‐Kitchen datasets, which have large differences in terms of video length, object appearance and pose, object scale, viewpoint, background, as well as number, type and complexity of actions performed. Two main results were achieved. First, the MSC approach shows better performances compared to classify the video as a whole, even when few subsequences are used. Second, the approach is robust and stable since, for each dataset, its performances are comparable to the part‐based approach at the state‐of‐the‐art.