Regularization Techniques in Learning with Big Data
Regularization Techniques in Learning with Big Data
Disciplines
Mathematics (100%)
Keywords
-
Regularized Learning Algorithm,
Big Data,
Nyström subsampling,
Distributed Learning,
Regularized Aggregation,
Semi-Supervised Learning
Methods of the Regularization Theory had been introduced in the context of learning in the beginning of 90s by the seminal works of Poggio and Girosi. Due to the nature of the learning tasks, the regularization techniques, such as Lavrentiev regularization, which were originally developed for dealing with continuous ill-posed problems, have to be applied to discrete data. But because of the intrinsic ill-posedness of the learning problems and the risk of the so-called overfitting, a discrete form of the data does not make the application of the regularization methods easier for the analysis and interpretation. Therefore, there is a substantial literature on the regularization based learning, and research in this area is still very active. At the same time, Big Data Learning becomes a hot research area because of its great potential and importance in many scientific and industrial contexts. However, a straightforward application of the known regularized learning algorithms may become intractable for learning from big data. In this project, we argue that the intractability issue can be addressed by using new ideas emerged in the regularization theory.
An important class of machine learning algorithms is formed by the so-called kernel methods operating in the Reproducing Kernel Hilbert Spaces, where the process of determining output values of a function is well-defined for any input. However, the standard implementation of these algorithms is infeasible when dealing with big training data because it exceeds the computational capacity of the conventional computers. One of the achievements of the accomplished project is the development of a fully data-driven strategy for the simultaneous tuning of the hyperparameters of the kernel learning algorithms that allows their feasible implementation with no loss of accuracy order compared to the ideal use of the available big training data sets. To facilitate the testing of the theoretical results against real world data sets, the project was transferred to the Medical University of Innsbruck, where the use of the machine learning algorithms in the neuroradiological data processing was also promoted. The main results of the project have been published in the leading scientific journals in the fields of Computational and Applied Mathematics and Radiology. The project was performed in close cooperation with other project teams supported by different granting agencies. Such a synergy of research activities can be seen as an example of attempts to increase the value of the public money invested in research.
Research Output
- 170 Citations
- 21 Publications
- 2 Scientific Awards
-
2022
Title On a regularization of unsupervised domain adaptation in RKHS DOI 10.1016/j.acha.2021.12.002 Type Journal Article Author Gizewski E Journal Applied and Computational Harmonic Analysis Pages 201-227 -
2024
Title Cervical Artery Tortuosity Is Associated With Dissection Occurrence and Late Recurrence: A Nested Case-Control Study DOI 10.1161/strokeaha.124.049046 Type Journal Article Author Mayer-Suess L Journal Stroke Pages 413-419 -
2024
Title Galaxy and Mass Assembly: Automatic morphological classification of galaxies using statistical learning DOI 10.25916/sut.26247044 Type Other Author Pereverzyev S Link Publication -
2024
Title Galaxy and Mass Assembly: Automatic morphological classification of galaxies using statistical learning DOI 10.25916/sut.26247044.v1 Type Other Author Pereverzyev S Link Publication -
2019
Title Regularized Nyström subsampling in regression and ranking problems under general smoothness assumptions DOI 10.1142/s021953051850029x Type Journal Article Author Myleiko G Journal Analysis and Applications Pages 453-475 -
2024
Title Peripheral inflammatory response in people after acute ischaemic stroke and isolated spontaneous cervical artery dissection DOI 10.1038/s41598-024-62557-3 Type Journal Article Author Bauer A Journal Scientific Reports Pages 12063 Link Publication -
2024
Title Cervical artery tortuosity—a reliable semi-automated magnetic resonance-based method DOI 10.21037/qims-23-1057 Type Journal Article Author Mayer-Suess L Journal Quantitative Imaging in Medicine and Surgery Pages 1383391-1381391 Link Publication -
2020
Title Novel decision algorithm to discriminate parkinsonism with combined blood and imaging biomarkers DOI 10.1016/j.parkreldis.2020.05.033 Type Journal Article Author Mangesius S Journal Parkinsonism & Related Disorders Pages 57-63 Link Publication -
2019
Title Analysis of regularized Nyström subsampling for regression functions of low smoothness DOI 10.1142/s0219530519500039 Type Journal Article Author Lu S Journal Analysis and Applications Pages 931-946 Link Publication -
2017
Title Galaxy And Mass Assembly: Automatic Morphological Classification of Galaxies Using Statistical Learning DOI 10.48550/arxiv.1711.06125 Type Preprint Author Sreejith S -
2017
Title Operator learning approach for the limited view problem in photoacoustic tomography DOI 10.48550/arxiv.1705.02698 Type Preprint Author Dreier F -
2017
Title The quasi-optimality criterion in the linear functional strategy DOI 10.48550/arxiv.1709.09444 Type Preprint Author Kindermann S -
2018
Title The quasi-optimality criterion in the linear functional strategy DOI 10.1088/1361-6420/aabe4f Type Journal Article Author Kindermann S Journal Inverse Problems Pages 075001 Link Publication -
2018
Title Operator Learning Approach for the Limited View Problem in Photoacoustic Tomography DOI 10.1515/cmam-2018-0008 Type Journal Article Author Dreier F Journal Computational Methods in Applied Mathematics Pages 749-764 Link Publication -
2017
Title Multi-Penalty Regularization for Detecting Relevant Variables DOI 10.1007/978-3-319-55556-0_15 Type Book Chapter Author Hlavácková-Schindler K Publisher Springer Nature Pages 889-916 -
2017
Title Galaxy And Mass Assembly: automatic morphological classification of galaxies using statistical learning DOI 10.1093/mnras/stx2976 Type Journal Article Author Sreejith S Journal Monthly Notices of the Royal Astronomical Society Pages 5232-5258 Link Publication -
2019
Title Adaptive multi-parameter regularization approach to construct the distribution function of relaxation times DOI 10.1007/s13137-019-0138-2 Type Journal Article Author Žic M Journal GEM - International Journal on Geomathematics Pages 2 Link Publication -
2018
Title Joint Inversion of Multiple Observations DOI 10.1007/978-3-319-57181-2_14 Type Book Chapter Author Gerhards C Publisher Springer Nature Pages 855-882 -
2018
Title Analysis of regularized Nyström subsampling for regression functions of low smoothness DOI 10.48550/arxiv.1806.00826 Type Preprint Author Lu S -
2020
Title Feed-forward neural networks using cerebral MR spectroscopy and DTI might predict neurodevelopmental outcome in preterm neonates DOI 10.1007/s00330-020-07053-8 Type Journal Article Author Janjic T Journal European Radiology Pages 6441-6451 Link Publication -
2020
Title Stenosis Detection in Internal Carotid and Vertebral Arteries With the Use of Diameters Estimated from MRI Data DOI 10.20535/ibb.2020.4.3.207624 Type Journal Article Author Nesteruk I Journal Innovative Biosystems and Bioengineering Pages 131-142 Link Publication
-
2020
Title Associate Editor of the journal Mathematical Foundations of Computing Type Appointed as the editor/advisor to a journal or book series Level of Recognition Continental/International -
2018
Title Best Poster Award Type Poster/abstract prize Level of Recognition Continental/International