Premium
A robust video scene extraction approach to movie content abstraction
Author(s) -
Li Ying,
Kuo C.C. Jay
Publication year - 2003
Publication title -
international journal of imaging systems and technology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.359
H-Index - 47
eISSN - 1098-1098
pISSN - 0899-9457
DOI - 10.1002/ima.10063
Subject(s) - computer science , shot (pellet) , abstraction , process (computing) , artificial intelligence , computer vision , scheme (mathematics) , feature extraction , video browsing , feature (linguistics) , information retrieval , multimedia , object (grammar) , video tracking , mathematical analysis , philosophy , chemistry , linguistics , mathematics , organic chemistry , epistemology , operating system
This research addresses the problem of automatically extracting semantic video scenes from feature films based on multimodal information. A three‐stage scene detection scheme is proposed. First, we use pure visual information to extract a coarse‐level scene structure based on generated shot sinks. Second, audio cue is integrated to refine the scene detection results by considering various kinds of audiovisual scenarios. Finally, we introduce users into this process by allowing them to interactively tune the final results to their own satisfaction. The generated scene structure forms a compact yet meaningful abstraction of the video data, which can help facilitate the content access. Preliminary experiments on integrating multiple media cues for movie scene extraction have yielded encouraging results. © 2004 Wiley Periodicals, Inc. Int J Imaging Syst Technol 13, 236–244, 2003; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/ima.10063