A Hamming Embedding Kernel with Informative Bag-of-Visual Words for Video Semantic Indexing
Author(s) -
Feng Wang,
WanLei Zhao,
ChongWah Ngo,
Bernard Mérialdo
Publication year - 2014
Publication title -
acm transactions on multimedia computing communications and applications
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.558
H-Index - 49
eISSN - 1551-6865
pISSN - 1551-6857
DOI - 10.1145/2535938
Subject(s) - embedding , computer science , artificial intelligence , kernel (algebra) , search engine indexing , hamming distance , scale invariant feature transform , pattern recognition (psychology) , hamming code , image (mathematics) , mathematics , algorithm , combinatorics , decoding methods , block code
In this article, we propose a novel Hamming embedding kernel with informative bag-of-visual words to address two main problems existing in traditional BoW approaches for video semantic indexing. First, Hamming embedding is employed to alleviate the information loss caused by SIFT quantization. The Hamming distances between keypoints in the same cell are calculated and integrated into the SVM kernel to better discriminate different image samples. Second, to highlight the concept-specific visual information, we propose to weight the visual words according to their informativeness for detecting specific concepts. We show that our proposed kernels can significantly improve the performance of concept detection.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom