
Fusing target information from multiple views for robust visual tracking
Author(s) - Hu Keli, Zhang Xing, Gu Yuzhang, Wang Yingguan
Publication year - 2014
Publication title - IET Computer Vision
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.38
H-Index - 37
eISSN - 1751-9640
pISSN - 1751-9632
DOI - 10.1049/iet-cvi.2013.0026
Subject(s) - artificial intelligence, computer vision, computer science, particle filter, classifier, video tracking, tracking system, active appearance model, pattern recognition, filter (signal processing)
In this study, the authors address the problem of tracking a single target in a calibrated multi-camera surveillance system, given the target's location in the first frame of each view. Recently, tracking with online multiple instance learning (OMIL) has been shown to give promising results. However, it may fail in a real surveillance system because of target orientation, scale or illumination changes. In this study, the authors show that fusing target information from multiple views can avoid these problems and lead to a more robust tracker. At each camera node, an efficient OMIL algorithm is used to model target appearance. To update the OMIL-based classifier in one view, a co-training strategy is applied to generate a representative set of training bags from all views. Bags extracted from each view carry a unique weight determined by the similarity of target appearance between the current view and the view whose classifier is being updated. In addition, target motion on each camera's image plane is modelled by a modified particle filter guided by the corresponding object's two-dimensional (2D) location and the fused three-dimensional (3D) location. Experimental results demonstrate that the proposed algorithm is robust for human tracking in challenging scenes.
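
To make the fusion idea concrete, the sketch below (Python, with NumPy as the only dependency) illustrates how per-view bag weights and a 2D/3D-guided particle proposal could be computed. It is a minimal illustration of the ideas summarised in the abstract, not the authors' implementation: the function names, the normalised-correlation similarity, and the convex-combination proposal are assumptions made for exposition.

import numpy as np

def bag_weights(target_patches, ref_view):
    # Weight each view's training bag by how similar its target patch is to the
    # patch in ref_view (the view whose OMIL classifier is being updated).
    # Patches are assumed grayscale and resampled to a common size; normalised
    # cross-correlation stands in for the paper's actual similarity measure.
    ref = target_patches[ref_view].astype(float).ravel()
    ref = (ref - ref.mean()) / (ref.std() + 1e-8)
    sims = []
    for patch in target_patches:
        p = patch.astype(float).ravel()
        p = (p - p.mean()) / (p.std() + 1e-8)
        sims.append(max(float(np.dot(ref, p)) / ref.size, 0.0))
    sims = np.asarray(sims)
    return sims / (sims.sum() + 1e-8)  # one weight per view, summing to 1

def propose_particles(loc_2d, loc_3d_proj, n_particles=200, alpha=0.5, sigma=4.0):
    # Modified particle proposal: scatter particles around a convex mix of the
    # view's own 2D estimate and the fused 3D location projected into this
    # camera's image plane (both given as pixel coordinates [x, y]).
    centre = alpha * np.asarray(loc_2d, float) + (1.0 - alpha) * np.asarray(loc_3d_proj, float)
    return centre + sigma * np.random.randn(n_particles, 2)

# Example: three views, updating the classifier of view 0.
patches = [np.random.rand(32, 32) for _ in range(3)]
print(bag_weights(patches, ref_view=0))
print(propose_particles([120.0, 80.0], [126.0, 77.0]).shape)  # (200, 2)

In the full method, the weights would modulate the co-trained OMIL classifier update and the proposed particles would be reweighted by an appearance likelihood; those steps are specific to the paper and not reproduced here.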