Video Scenes Segmentation Based on Multimodal Genre Prediction | Zendy

Mohamed Bouyahi | Zendy; Yassine Ben Ayed | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Video Scenes Segmentation Based on Multimodal Genre Prediction

Author(s) -

Mohamed Bouyahi,

Yassine Ben Ayed

Publication year - 2020

Publication title -

procedia computer science

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.334

H-Index - 76

ISSN - 1877-0509

DOI - 10.1016/j.procs.2020.08.002

Subject(s) - computer science , segmentation , artificial intelligence , search engine indexing , task (project management) , constraint (computer aided design) , similarity (geometry) , pattern recognition (psychology) , natural language processing , image (mathematics) , mechanical engineering , management , engineering , economics

Recent technologies’ understanding videos content remain limited due to its complexity and length. However, videos segmentation into small coherent units facilitates indexing and searching task. The subjectivity remains the essential constraint of videos, but the genre (drama, action...) does not present any conflict. In this paper, we present a new approach to video segmentation into scenes based on genre prediction. Initially, the video is divided into shots of equal duration. We used architecture, based on audio-visuals deep features extracted from trained neural networks for genre prediction, and we introduced a transition detection method based on the similarity calculation between shots genre. The originality of this method consists in using the highly level semantic relationship between successive shots for transition detection. We reached good performances on videos of the multi varied genre. We used the RAI dataset and BBC dataset to evaluate our method through a comparison with other state-of-the-art approaches.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research