Semantic context detection based on hierarchical audio models
Author(s) -
HaoWen Cheng,
Wei-Ta Chu,
JaLing Wu
Publication year - 2003
Publication title -
ntur (臺灣機構典藏)
Language(s) - English
Resource type - Conference proceedings
ISBN - 1-58113-778-8
DOI - 10.1145/973264.973282
Subject(s) - computer science , semantics (computer science) , event (particle physics) , context (archaeology) , context model , artificial intelligence , natural language processing , audio analyzer , key (lock) , semantic computing , hierarchical database model , hidden markov model , audio signal processing , speech recognition , audio signal , data mining , semantic web , paleontology , physics , speech coding , computer security , quantum mechanics , object (grammar) , biology , programming language
Semantic context detection is one of the key techniques to facilitate efficient multimedia retrieval. Semantic context is a scene that completely represents a meaningful information segment to human beings. In this paper, we propose a novel hierarchical approach that models the statistical characteristics of several audio events, over a time series, to accomplish semantic context detection. The approach consists of two stages: audio event and semantic context detections. HMMs are used to model basic audio events, and event detection is performed in the first stage. Then semantic context detection is achieved based on Gaussian mixture models, which model the correlations among several audio events temporally. With this framework, we bridge the gaps between low-level features and the semantic contexts that last in a time series. The experimental evaluations indicate that the approach is effective in detecting high-level semantics.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom