Spatial Attention Adapted to a LSTM Architecture with Frame Selection for Human Action Recognition in Videos | Zendy

Carlos Alberto Peña Orozco | Zendy; María Elena Buemi | Zendy; Julio Jacobo Berllés | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Spatial Attention Adapted to a LSTM Architecture with Frame Selection for Human Action Recognition in Videos

Author(s) -

Carlos Alberto Peña Orozco,

María Elena Buemi,

Julio Jacobo Berllés

Publication year - 2021

Language(s) - English

Resource type - Conference proceedings

DOI - 10.52591/2021072411

Subject(s) - computer science , search engine indexing , metric (unit) , artificial intelligence , action recognition , frame (networking) , architecture , selection (genetic algorithm) , action (physics) , pattern recognition (psychology) , computer vision , machine learning , telecommunications , engineering , physics , quantum mechanics , class (philosophy) , art , operations management , visual arts

Action recognition in videos is currently a topic of interest in the area of computer vision, due to potential applications such as: multimedia indexing, surveillance in public spaces, among others. In this work we propose an attention mechanism adapted to a CNN–LSTM base architecture. To carry out the training and testing phases, we used the HMDB-51 and UCF-101 datasets. We evaluate the performance of our system using accuracy as the evaluation metric, obtaining 57.3% and 90.4% for HMDB-51 an UCF-101 respectively

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research