MultiView: Multilevel video content representation and retrieval
Author(s) -
Jianping Fan,
Walid G. Aref,
Ahmed K. Elmagarmid,
Mohand-Saïd Hacid,
Mirette Marzouk,
Xingquan Zhu
Publication year - 2001
Publication title -
journal of electronic imaging
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.238
H-Index - 66
eISSN - 1560-229X
pISSN - 1017-9909
DOI - 10.1117/1.1406944
Subject(s) - computer science , imaging science , cover (algebra) , multimedia , information retrieval , medical imaging , data science , computer vision , artificial intelligence , mechanical engineering , engineering
In this article, several practical algorithms are proposed to support content-based video analysis, modeling, representation, summarization, indexing, and access. First, a multilevel video database model is given. One advantage of this model is that it provides a reasonable approach to bridging the gap between low-level representative features and high-level semantic concepts from a human point of view. Second, several model-based video analysis techniques are proposed. In order to detect the video shots, we present a novel technique, which can adapt the threshold for scene cut detection to the activities of variant videos or even different video shots. A seeded region aggregation and temporal tracking technique is proposed for generating the semantic video objects. The semantic video scenes can then be generated from these extracted video access units (e.g., shots and objects) according to some domain knowledge. Third, in order to categorize video contents into a set of semantic clusters, an integrated video classification technique is developed to support more efficient multilevel video representation, summarization, indexing, and access techniques. 2001 SPIE and IS&T. [DOI: 10.1117/1.1406944]
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom