z-logo
open-access-imgOpen Access
Graph‐based approach to scene text localisation and tracking in videos
Author(s) -
Kim Y.G.,
Koo H.I.
Publication year - 2016
Publication title -
electronics letters
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.375
H-Index - 146
eISSN - 1350-911X
pISSN - 0013-5194
DOI - 10.1049/el.2016.1008
Subject(s) - false positive paradox , computer science , graph , artificial intelligence , true positive rate , pattern recognition (psychology) , tracking (education) , image (mathematics) , computer vision , theoretical computer science , psychology , pedagogy
A text localisation and tracking method is presented that finds text‐regions in videos and assigns unique IDs to their trajectories. For the goal, a graph‐based framework that can work with existing text detection methods is developed. To be precise, graphs are built where vertices are image‐level text detection results and edges represent the correspondence scores of the vertices. From these graphs, text‐region trajectories by using the graph‐cut algorithm are extracted. This approach allows considering false positives and misses, as well as their patch‐based tracking results at the same time, and text trajectories are reliably extracted. Finally, the results are refined by interpolating misses and filtering out false positives. The proposed method is submitted to the International Conference on Document Analysis and Recognition 2015 robust reading competition (video text localisation) and the method showed the best performance in terms of CLEAR MOT metrics and was ranked third place according to VACE metrics among the seven participating methods.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here