Generic Object Crowd Tracking by Multi-Task Learning
Author(s) -
Wenhan Luo,
TaeKyun Kim
Publication year - 2013
Language(s) - English
Resource type - Conference proceedings
DOI - 10.5244/c.27.73
Subject(s) - bittorrent tracker , computer science , artificial intelligence , crowds , tracking (education) , task (project management) , object detection , computer vision , video tracking , detector , object (grammar) , trace (psycholinguistics) , machine learning , pattern recognition (psychology) , eye tracking , engineering , psychology , pedagogy , systems engineering , telecommunications , linguistics , philosophy , computer security
We address Multiple Object Tracking (MOT) in crowds, where the type of target objects is generic and not limited to pedestrians as in most previous work. Following the popular tracking-by-detection strategy, we decompose this problem into two main tasks, detection and tracking, and formulate them under the Multiple Task Learning (MTL) framework. A binary detector is learnt to detect objects in images, whilst multiple trackers are learnt on top of the detector by MTL to trace detected objects in subsequent frames. The detector is utilised to anchor the trackers, helping them not drift away from targets. The trackers are jointly learnt by sharing common features. To further improve the performance, we use a smoothness term which considers all labelled and unlabelled data globally. Experiments on challenging new generic object sequences as well as a publicly available sequence show that the proposed method significantly outperforms the state-of-the-art methods.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom