TennisVid2Text: Fine-grained Descriptions for Domain Specific Videos

Mohak Sukhwani, C. V. Jawahar


Open Access


Author(s) - Mohak Sukhwani, C. V. Jawahar

Publication year - 2015

Language(s) - English

Resource type - Conference proceedings

DOI - 10.5244/c.29.117

Subject(s) - computer science, readability, domain (mathematical analysis), correctness, focus (optics), set (abstract data type), task (project management), multimedia, the internet, information retrieval, world wide web, shot (pellet), human–computer interaction, artificial intelligence, natural language processing, mathematical analysis, mathematics, chemistry, physics, management, organic chemistry, optics, economics, programming language

Automatically describing videos has long been a fascinating problem. In this work, we attempt to describe videos from a specific domain: broadcast videos of lawn tennis matches. Given a video shot from a tennis match, we aim to generate a textual commentary similar to what a human expert would write on a sports website. Unlike many recent works that focus on generating short captions, we are interested in generating semantically richer descriptions. This demands a detailed low-level analysis of the video content, especially the actions and interactions among subjects. We address this by limiting our domain to the game of lawn tennis. Rich descriptions are generated by leveraging a large corpus of human-created descriptions harvested from the Internet. We evaluate our method on a newly created tennis video data set. Extensive analysis demonstrates that our approach addresses both the semantic correctness and the readability aspects of the task.
