Bilevel Learning for Computer Vision
Bilevel Learning for Computer Vision
Disciplines
Computer Sciences (50%); Mathematics (50%)
Keywords
-
Variational Methods,
Learning,
Computer Vision,
Image Processing,
Bilevel Optimization,
Convex Optimization
Variational methods are among the most successful methods to solve inverse problems in computer vision and image processing. Typical problems are tasks such as image restoration, motion estimation, stereo and 3D reconstruction. Existing variational models in computer vision are mainly hand-designed based on simple principles derived from the intrinsic properties of images. Clearly, these models are often too simple to model the complex physical properties of the visual world. In this project, we make a significant step ahead by considering more complex variational models based on higher-order, non-local and data-adaptive regularization and propose to learn the involved model parameters using optimization methods. The main idea is to learn the parameters of the variational models, such that the solution of the variational model minimizes a certain loss function that measures the error between the ground truth solutions and the solutions predicted by the model. This problem naturally leads to a bilevel optimization problem, where the lower level problem is given by the variational model and the higher level problem is given by the loss function. It turns out that these bilevel optimization problems have many interesting properties, which are still too less investigated in order to make the method accessible for a larger community. Therefore, it is the main goal in this project to develop a unified framework that can be applied to a number of variational problems in computer vision. The unified learning framework will allow us to systematically investigate existing models as well as new models, leading to a better understanding of their advantages and limitations. We expect that the results of this project will lead to new models that can be optimized towards specific applications and image data and hence will perform significantly better than existing models.
The main objective of the BIVISION project was to combine classical variational approaches in image processing with modern methods of machine learning in order to learn richer and more accurate models for image processing from a large number of natural images. The combination of machine learning and variational models naturally leads to so-called bilevel optimization problems, which represent the nested combination of two optimization problems. The parent optimization problem is given by the learning problem while the child optimization problem is being defined by the variational model. In the course of the project, numerous novel numerical algorithms and methods were developed to learn better models from the image data. One of the most important developments of the project were the so-called variational networks, which represent a hybrid between classical variational models and modern deep convolutional neural networks. Variational networks are comparable in performance to modern deep convolutional neural networks, but allow a better understanding of how their individual layers work. As part of the project, the developed models were used in a wide variety of applications in computer vision, image processing and inverse problems. As an example, we highlight the calculation of high-quality magnetic resonance images from noisy and subsampled measurement data. This work was carried out in cooperation with New York University and published in a renowned journal. Future work will be devoted to the theoretical analysis of variation networks. The goal will be to develop a theoretical backbone that allows to give regularity statements and reconstruction guarantees that correspond to those of the classical calculus of variations.
- Technische Universität Graz - 100%
- Antonin Chambolle, Universite de Paris - Dauphine - France
- Daniel Cremers, TU München - Germany
- Michael Hintermüller, Weierstraß-Institut für Angewandte Analysis und Stochastik - Germany
Research Output
- 4035 Citations
- 22 Publications
-
2016
Title Learning Joint Demosaicing and Denoising Based on Sequential Energy Minimization DOI 10.1109/iccphot.2016.7492871 Type Conference Proceeding Abstract Author Klatzer T Pages 1-11 -
2016
Title Techniques for Gradient-Based Bilevel Optimization with Non-smooth Lower Level Problems DOI 10.1007/s10851-016-0663-7 Type Journal Article Author Ochs P Journal Journal of Mathematical Imaging and Vision Pages 175-194 -
2015
Title Bilevel Optimization with Nonsmooth Lower Level Problems DOI 10.1007/978-3-319-18461-6_52 Type Book Chapter Author Ochs P Publisher Springer Nature Pages 654-665 -
2015
Title Vertebrae Segmentation in 3D CT Images Based on a Variational Framework DOI 10.1007/978-3-319-14148-0_20 Type Book Chapter Author Hammernik K Publisher Springer Nature Pages 227-233 -
2015
Title On Iteratively Reweighted Algorithms for Nonsmooth Nonconvex Optimization in Computer Vision DOI 10.1137/140971518 Type Journal Article Author Ochs P Journal SIAM Journal on Imaging Sciences Pages 331-372 -
2017
Title Learning a variational network for reconstruction of accelerated MRI data DOI 10.1002/mrm.26977 Type Journal Article Author Hammernik K Journal Magnetic Resonance in Medicine Pages 3055-3071 Link Publication -
2017
Title Variational Networks: Connecting Variational Methods and Deep Learning DOI 10.1007/978-3-319-66709-6_23 Type Book Chapter Author Kobler E Publisher Springer Nature Pages 281-293 -
2017
Title Trainable Regularization for Multi-frame Superresolution DOI 10.1007/978-3-319-66709-6_8 Type Book Chapter Author Klatzer T Publisher Springer Nature Pages 90-100 -
2017
Title A Deep Learning Architecture for Limited-Angle Computed Tomography Reconstruction DOI 10.1007/978-3-662-54345-0_25 Type Book Chapter Author Hammernik K Publisher Springer Nature Pages 92-97 -
2016
Title Trainable Nonlinear Reaction Diffusion: A Flexible Framework for Fast and Effective Image Restoration DOI 10.1109/tpami.2016.2596743 Type Journal Article Author Chen Y Journal IEEE Transactions on Pattern Analysis and Machine Intelligence Pages 1256-1272 Link Publication -
2014
Title A Deep Variational Model for Image Segmentation DOI 10.1007/978-3-319-11752-2_9 Type Book Chapter Author Ranftl R Publisher Springer Nature Pages 107-118 -
2016
Title Higher order maximum persistency and comparison theorems DOI 10.1016/j.cviu.2015.05.002 Type Journal Article Author Shekhovtsov A Journal Computer Vision and Image Understanding Pages 54-79 Link Publication -
2016
Title U-shaped Networks for Shape from Light Field DOI 10.5244/c.30.37 Type Conference Proceeding Abstract Author Heber S Pages 37.1-37.12 Link Publication -
2015
Title Learning Reaction-Diffusion Models for Image Inpainting DOI 10.1007/978-3-319-24947-6_29 Type Book Chapter Author Yu W Publisher Springer Nature Pages 356-367 -
2015
Title Maximum persistency via iterative relaxed inference with graphical models DOI 10.1109/cvpr.2015.7298650 Type Conference Proceeding Abstract Author Shekhovtsov A Pages 521-529 Link Publication -
2015
Title On Learning Optimized Reaction Diffusion Processes for Effective Image Restoration DOI 10.1109/cvpr.2015.7299163 Type Conference Proceeding Abstract Author Chen Y Pages 5261-5269 Link Publication -
2014
Title A Higher-Order MRF Based Variational Model for Multiplicative Noise Reduction DOI 10.1109/lsp.2014.2337274 Type Journal Article Author Chen Y Journal IEEE Signal Processing Letters Pages 1370-1374 Link Publication -
2014
Title Partial Optimality by Pruning for MAP-inference with General Graphical Models DOI 10.1109/cvpr.2014.153 Type Conference Proceeding Abstract Author Swoboda P Pages 1170-1177 Link Publication -
2014
Title iPiano: Inertial Proximal Algorithm for Nonconvex Optimization DOI 10.1137/130942954 Type Journal Article Author Ochs P Journal SIAM Journal on Imaging Sciences Pages 1388-1419 Link Publication -
2014
Title Insights Into Analysis Operator Learning: From Patch-Based Sparse Models to Higher Order MRFs DOI 10.1109/tip.2014.2299065 Type Journal Article Author Chen Y Journal IEEE Transactions on Image Processing Pages 1060-1072 Link Publication -
2018
Title Assessment of the generalization of learned image reconstruction and the potential for transfer learning DOI 10.1002/mrm.27355 Type Journal Article Author Knoll F Journal Magnetic Resonance in Medicine Pages 116-128 Link Publication -
2018
Title Variational Networks for Joint Image Reconstruction and Classification of Tumor Immune Cell Interactions in Melanoma Tissue Sections DOI 10.1007/978-3-662-56537-7_86 Type Book Chapter Author Effland A Publisher Springer Nature Pages 334-340