Advanced Learning for Tracking and Detection
Advanced Learning for Tracking and Detection
DACH: Österreich - Deutschland - Schweiz
Disciplines
Computer Sciences (100%)
Keywords
-
Computer Vision,
Person Detection,
Visual Tracking,
On-line Learning,
Medical Workflow Analysis
The goal of this project is to significantly advance the state of the art in detection and tracking. This is possible by joining the complementary expertise of three leading computer vision labs in Europe. Medical work-flow analysis, recorded by multiple cameras, will serve as a complex test-bed to (i) pose new challenges for visual detection and tracking and (ii) to benchmark our novel algorithms on a complex real world scenario. Abstract knowledge about actions which are being performed is valuable in the operating room and can be used for many applications such as task planning. Our objective is to analyze the (large scale) events happening in the medical operating room, equipped with multiple cameras, in order to prepare the necessary input for a fully automatic work-flow analysis. From an algorithmic point of view this project will focus on the development of novel tracking methods for textured (and more importantly) un-textured objects (equipment in the operation room); person detection using a multi-camera set-up. The robustness and adaptivity of these tasks will be significantly enhanced by novel unsupervised/semi-supervised on-line learning methods which are a further focus of research within this project. Learning will be an integral part of tracking and detection. By tightly integrating these modules we expect increased learning performance because of the increased label quality and significantly increased tracking and detection performance by scene specific adapted models. The detection and tracking results will be used in analyzing behavioral patterns in the operation room. All developed components will be used in the medical workflow analysis task.
In this project part of the DACH project we investigated statistical learning algorithms to advance object detection and tracking by considering a challenging real-world scenario. In particular, we focused on medical workflow analysis, as operation rooms can serve as a complex test-bed to benchmark our algorithms and to pose new challenges for visual detection and tracking. Our objective was to analyze the events and actions happening in medical operation rooms, recorded from multiple cameras, to prepare the necessary inputs for a fully automatic workflow analysis. To this end, we did research on novel tracking and detection methods as these are the major preprocessing steps for any automated analysis system. Our research resulted in several improvements to object detection frameworks. On the one hand, we were able to improve the detection performance of the popular Hough Forest framework by incorporating an efficient re-weighting step. This approach achieved excellent results, especially in crowded situations, i.e. whenever persons stand close to each other. We increased the flexibility of such classifiers by adapting it to the online learning case. Furthermore, by analyzing the internals of these detection frameworks, we were able to significantly speed up multi-object detection, which is a crucial step towards real-time performance. On the other hand, we introduced a novel training scheme for the Random Forest framework which enables the use of well-defined loss functions, while still being able to parallelize the computations. These improved detection results also supported our research on tracking-by-detection algorithms. In particular, we introduced novel tracking approaches to cope with difficult situations, such as crowded scenes or similar visual appearance of persons. To this end, we investigated both geometric and appearance cues to reliably and robustly link object detections into target trajectories. Our research resulted in novel cost functions for data association schemes and new appearance models which significantly improved the tracking performance. In conclusion, we can say that we clearly achieved and even outreached our initial project goals. The most important results have already been published at relevant conferences and journals.
- Technische Universität Graz - 100%
- Nassir Navab, TU München - Germany
- Pascal Fua, University of Lausanne - Switzerland
Research Output
- 2628 Citations
- 26 Publications
-
2012
Title Relaxed Pairwise Learned Metric for Person Re-identification DOI 10.1007/978-3-642-33783-3_56 Type Book Chapter Author Hirzer M Publisher Springer Nature Pages 780-793 -
2012
Title Discriminative Hough Forests for Object Detection DOI 10.5244/c.26.40 Type Conference Proceeding Abstract Author Wohlhart P Pages 40.1-40.11 Link Publication -
2011
Title Multicamera Multi-object Tracking by Robust Hough-based Homography Projections. Type Conference Proceeding Abstract Author Bischof H Et Al Conference IEEE Workshop on Visual Surveillance (in conjunction with the International Conference on Computer Vision, ICCV). -
2011
Title On-line Hough Forests DOI 10.5244/c.25.128 Type Conference Proceeding Abstract Author Schulter S Pages 128.1-128.11 -
2011
Title Multi-camera Multi-object Tracking by Robust Hough-based Homography Projections DOI 10.1109/iccvw.2011.6130453 Type Conference Proceeding Abstract Author Sternig S Pages 1689-1696 -
2014
Title Occlusion Geodesics for Online Multi-Object Tracking. Type Conference Proceeding Abstract Author Bischof H Et Al Conference IEEE Conference on Computer Vision and Pattern Recognition (CVPR). -
2013
Title Hough-based tracking of non-rigid objects DOI 10.1016/j.cviu.2012.11.005 Type Journal Article Author Godec M Journal Computer Vision and Image Understanding Pages 1245-1256 -
2013
Title Optimizing 1-Nearest Prototype Classifiers. Type Conference Proceeding Abstract Author Bischof H Et Al Conference IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2013. -
2013
Title Robust Real-Time Tracking of Multiple Objects by Volumetric Mass Densities DOI 10.1109/cvpr.2013.310 Type Conference Proceeding Abstract Author Possegger H Pages 2395-2402 -
2013
Title Optimizing 1-Nearest Prototype Classifiers DOI 10.1109/cvpr.2013.66 Type Conference Proceeding Abstract Author Wohlhart P Pages 460-467 -
2013
Title Alternating Decision Forests DOI 10.1109/cvpr.2013.72 Type Conference Proceeding Abstract Author Schulter S Pages 508-515 -
2013
Title Detecting Partially Occluded Objects with an Implicit Shape Model Random Field DOI 10.1007/978-3-642-37331-2_23 Type Book Chapter Author Wohlhart P Publisher Springer Nature Pages 302-315 -
2013
Title Robust Real-Time Tracking of Multiple Objects by Volumetric Mass Densities. Type Conference Proceeding Abstract Author Bischof H Et Al Conference IEEE Conference on Computer Vision and Pattern Recognition (CVPR). -
2013
Title Alternating Decision Forests. Type Conference Proceeding Abstract Author Bischof H Et Al Conference IEEE Conference on Computer Vision and Pattern Recognition (CVPR). -
2015
Title In Defense of Color-Based Model-Free Tracking DOI 10.1109/cvpr.2015.7298823 Type Conference Proceeding Abstract Author Possegger H Pages 2113-2120 -
2015
Title In Defense of Color-based Modelfree Tracking. Type Conference Proceeding Abstract Author Bischof H Et Al Conference IEEE Conference on Computer Vision and Pattern Recognition (CVPR). -
2012
Title Synergy-Based Learning of Facial Identity DOI 10.1007/978-3-642-32717-9_20 Type Book Chapter Author Köstinger M Publisher Springer Nature Pages 195-204 -
2012
Title Large Scale Metric Learning From Equivalence Constraints. Type Conference Proceeding Abstract Author Bischof H Et Al Conference IEEE Conference on Computer Vision and Pattern Recognition (CVPR). -
2012
Title Hough Regions for Joining Instance Localization and Segmentation DOI 10.1007/978-3-642-33712-3_19 Type Book Chapter Author Riemenschneider H Publisher Springer Nature Pages 258-271 -
2012
Title Large Scale Metric Learning from Equivalence Constraints*The work was supported by the Austrian Science Foundation (FWF) project Advanced Learning for Tracking and Detection in Medical Workflow Analysis (I535-N23) and by the Austrian Research Promoti DOI 10.1109/cvpr.2012.6247939 Type Conference Proceeding Abstract Author Köstinger M Pages 2288-2295 -
2015
Title Encoding Based Saliency Detection for Videos and Images DOI 10.1109/cvpr.2015.7298864 Type Conference Proceeding Abstract Author Mauthner T Pages 2494-2502 -
2014
Title Accurate Object Detection with Joint Classification-Regression Random Forests DOI 10.1109/cvpr.2014.123 Type Conference Proceeding Abstract Author Schulter S Pages 923-930 -
2014
Title Accurate Object Detection with Joint Classification-Regression Random Forests. Type Conference Proceeding Abstract Author Bischof H Et Al Conference IEEE Conference on Computer Vision and Pattern Recognition (CVPR). -
2014
Title Occlusion Geodesics for Online Multi-Object Tracking DOI 10.1109/cvpr.2014.170 Type Conference Proceeding Abstract Author Possegger H Pages 1306-1313 -
2014
Title Hough Forests Revisited: An Approach to Multiple Instance Tracking from Multiple Cameras DOI 10.1007/978-3-319-11752-2_41 Type Book Chapter Author Poier G Publisher Springer Nature Pages 499-510 -
2003
Title On Robust Regression in Photogrammetric Point Clouds DOI 10.1007/978-3-540-45243-0_23 Type Book Chapter Author Schindler K Publisher Springer Nature Pages 172-178