z-logo
open-access-imgOpen Access
Learning frame relevance for video classification
Author(s) -
Hua Wang,
Feiping Nie,
Heng Huang,
Yi Yang
Publication year - 2011
Publication title -
proceedings of the 30th acm international conference on multimedia
Language(s) - English
Resource type - Conference proceedings
DOI - 10.1145/2072298.2072011
Subject(s) - computer science , artificial intelligence , frame (networking) , class (philosophy) , clips , ambiguity , relevance (law) , machine learning , online video , computer vision , multimedia , telecommunications , political science , law , programming language
Traditional video classification methods typically require a large number of labeled training video frames to achieve satisfactory performance. However, in the real world, we usually only have sufficient labeled video clips (such as tagged online videos) but lack labeled video frames. In this paper, we formalize the video classification problem as a Multi-Instance Learning (MIL) problem, an emerging topic in machine learning in recent years, which only needs bag (video clip) labels. To solve the problem, we propose a novel Parameterized Class-to-Bag (P-C2B) Distance method to learn the relative importance of a training instance with respect to its labeled classes, such that the instance level labeling ambiguity in MIL is tackled and the frame relevances of training video data with respect to the semantic concepts of interest are given. Promising experimental results have demonstrated the effectiveness of the proposed method.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom