z-logo
open-access-imgOpen Access
Embedding Visual Words into Concept Space for Action and Scene Recognition
Author(s) -
Behrouz Saghafi Khadem,
Elahe Farahzadeh,
Deepu Rajan,
Andrzej Śluzek
Publication year - 2010
Language(s) - English
Resource type - Conference proceedings
DOI - 10.5244/c.24.15
Subject(s) - probabilistic latent semantic analysis , computer science , artificial intelligence , discriminative model , natural language processing , embedding , space (punctuation) , rank (graph theory) , vocabulary , word embedding , word (group theory) , latent semantic analysis , semantics (computer science) , information retrieval , pattern recognition (psychology) , mathematics , linguistics , operating system , philosophy , geometry , combinatorics , programming language
In this paper we propose a novel approach to introducing semantic relations into the bag-of-words framework. We use the latent semantic models, such as LSA and pLSA, in order to define semantically-rich features and embed the visual features into a semantic space. The semantic features used in LSA technique are derived from the low-rank approximation of word-document occurrence matrix by SVD. Similarly, by using the pLSA approach, the topic-specific distributions of words can be considered dimensions of a concept space. In the proposed space, the distances between words represent the semantic distances which are used for constructing a discriminative and semantically meaningful vocabulary. We have tested our approach on the KTH action database and on the Fifteen Scene database and have achieved very promising results on both.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom