Open Access

Hybrid Semantic Concept Temporal Pooling for Large‐Scale Video Event Analysis

Author(s) - Liu Wu, Ma Huadong

Publication year - 2017

Publication title - Chinese Journal of Electronics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.267

H-Index - 25

eISSN - 2075-5597

pISSN - 1022-4653

DOI - 10.1049/cje.2017.09.010

Subject(s) - pooling, computer science, discriminative model, event (particle physics), pyramid (geometry), artificial intelligence, partition (number theory), scale (ratio), pattern recognition (psychology), machine learning, mathematics, physics, quantum mechanics, geometry, combinatorics

To detect and recount events in videos from limited training examples, we propose a novel two‐stage hybrid concept temporal pooling approach that is aware of potential concept drift in the video stream. We first partition each video into temporal pyramids consisting of keyframes. Semantic concepts are then detected in the keyframes, which lets us derive aggregated detection scores for each temporal pyramid using average‐pooling and ultimately for the entire video via max‐pooling. Owing to this refined hybrid pooling, our method yields semantic representations that are more discriminative with respect to the event query. We also develop an effective filtering strategy for noisy concept detectors, which makes textual description generation in recounting more robust. Experiments on the large‐scale TRECVID MEDTest dataset demonstrate that our method improves accuracy over state‐of‐the‐art methods for both event detection and recounting.
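The two pooling stages described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the choice of pyramid levels, and the equal-length segment split are all assumptions made for illustration, and the concept detector itself is replaced by a hand-written score matrix. Scores are average-pooled inside each temporal segment, then max-pooled across all segments to give one score per concept for the whole video.

```python
from typing import List, Sequence


def split_into_segments(n_frames: int, n_segments: int) -> List[range]:
    """Split keyframe indices 0..n_frames-1 into contiguous, near-equal segments.

    Hypothetical helper: the abstract does not say how the pyramid is cut.
    """
    n_segments = max(1, min(n_segments, n_frames))  # never create empty segments
    bounds = [(i * n_frames) // n_segments for i in range(n_segments + 1)]
    return [range(bounds[i], bounds[i + 1]) for i in range(n_segments)]


def hybrid_pool(
    frame_scores: Sequence[Sequence[float]],
    levels: Sequence[int] = (1, 2, 4),
) -> List[float]:
    """Average-pool concept scores per segment, then max-pool over segments.

    frame_scores[t][c] is the detection score of concept c on keyframe t.
    `levels` lists how many segments each pyramid level has (an assumption).
    """
    if not frame_scores:
        raise ValueError("need at least one keyframe")
    n_frames = len(frame_scores)
    n_concepts = len(frame_scores[0])
    video_scores = [float("-inf")] * n_concepts
    for n_segments in levels:
        for segment in split_into_segments(n_frames, n_segments):
            for c in range(n_concepts):
                # Stage 1: average-pooling inside one temporal segment.
                avg = sum(frame_scores[t][c] for t in segment) / len(segment)
                # Stage 2: max-pooling across all segments of all levels.
                video_scores[c] = max(video_scores[c], avg)
    return video_scores


# Toy video: concept 1 dominates the first half, concept 0 the second half.
toy = [[0.0, 1.0], [0.25, 0.5], [1.0, 0.25], [0.5, 0.0]]
print(hybrid_pool(toy, levels=(1, 2)))  # [0.75, 0.75]
```

In this toy input, averaging over the whole video would give 0.4375 for both concepts, whereas the hybrid scheme keeps the stronger half-video response of 0.75 for each. That is one way to read the claim that the pooling is aware of concept drift: a concept that is present only in part of the video is not diluted by the frames where it is absent.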
