Tracking-Learning-Detection.

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

This paper investigates long-term tracking of unknown objects in a video stream. The object is defined by its location and extent in a single frame. In every frame that follows, the task is to determine the object's location and extent or indicate that the object is not present. We propose a novel tracking framework (TLD) that explicitly decomposes the long-term tracking task into tracking, learning, and detection. The tracker follows the object from frame to frame. The detector localizes all appearances that have been observed so far and corrects the tracker if necessary. The learning estimates the detector's errors and updates it to avoid these errors in the future. We study how to identify the detector's errors and learn from them. We develop a novel learning method (P-N learning) which estimates the errors by a pair of "experts": (1) P-expert estimates missed detections, and (2) N-expert estimates false alarms. The learning process is modeled as a discrete dynamical system and the conditions under which the learning guarantees improvement are found. We describe our real-time implementation of the TLD framework and the P-N learning. We carry out an extensive quantitative evaluation which shows a significant improvement over state-of-the-art approaches.

Related collections

Author and article information

Journal

Journal ID (iso-abbrev): IEEE Trans Pattern Anal Mach Intell

Title: IEEE transactions on pattern analysis and machine intelligence

ISSN (Electronic): 1939-3539

ISSN (Print): 0098-5589

Publication date (Electronic): Jul 2012

Volume: 34

Issue: 7

Affiliations

[1 ] Centre for Vision, Speech, and Signal Processing, Faculty of Engineering and Physical Sciences, University of Surrey, Guildford, Surrey GU2 7XH, United Kingdom. zdenek.kalal@gmail.com

Article

DOI: 10.1109/TPAMI.2011.239

PubMed ID: 22156098

SO-VID: e1aac953-15aa-4ffb-8b28-590de474d710

History

Data availability:

Comments

Comment on this article

scite_

Cited by 297

See all cited by