Inference methods for multivariate and high-dimensional data
Inference methods for multivariate and high-dimensional data
DACH: Österreich - Deutschland - Schweiz
Disciplines
Mathematics (100%)
Keywords
-
Bootstrap,
Factorial designs,
Multiple testing procedure,
Multivariate data,
Nonparametric method,
Semiparametric model
In light of greatly advanced computational resources, the scope of statistical data analysis now accommodates pressing new areas of application. This is particularly the case for the development of analysis tools for data sets containing several measured quantities. Those data are often complex due to their dimensionality, or structure, and it is especially challenging to derive appropriate analysis tools for those situations where classical and strongly simplifying model assumptions are not tenable. These, in practice rather common situations, are the target of the project, which proposes to develop methods for the analysis of complex, high-dimensional data. The methods shall be applicable in general situations, and also yield valid results when the application of currently existing procedures is not appropriate. Specifically, we are dealing with situations in which 1. data cannot be described by a normal distribution, 2. the response variables cannot be measured on a metric scale, but only by greater-smaller relations (ordinal data), 3. the number of measurements collected per person is larger than the number of persons (high dimensional data), or 4. not all persons under study could be observed long enough to obtain an accurate measurement, for example of the length of time leading up to an event (censored data). Using methods from mathematical statistics, we propose to develop procedures that meet the mentioned criteria. To this end, rank based methods as well as resampling techniques will be taken into account. The validity of each procedure will be shown by means of theoretical considerations concerning the performance of the tests for increasing sample sizes. These examinations will be complemented by extensive simulation studies in different designs of interest. Finally, all positively evaluated procedures will be made available in the form of statistical software packages for the free software environment R. The uniqueness and innovation of the project are that currently there exists no generally valid method for the complete analysis of complex data sets with many measured quantities. In practice, methods are used that are too simplistic, or procedures that are not appropriate due to strong underlying requirements, thus leading to unreasonable research results. The project aims to remedy this situation and to provide an effective methodology for the analysis of complex data. The expected results have wide potential applications and expand the role of statistical data analysis by allowing the treatment of new problems with meaningful and effective methods. Overall, we expect the results from the project to have a significant and lasting impact on modern "Data Science".
Nowadays, it is possible to generate bigger and bigger data sets with continuously decreasing technological effort. How does one make sense of these data? Classical statistical procedures are usually only applicable if certain restrictive assumptions are met. Many of the standard procedures are not valid if, for example, the data are high-dimensional, if the measurements cannot be represented as numbers, if observations in different groups exhibit differing variability, or if they are incompletely observed. We have developed and validated statistical methods that can still be applied sensibly in numerous challenging situations of big data or messy data. In order to make it easier for other researchers to actually apply these new methods, we have also devised corresponding free open source software packages and made them publicly available.
- Universität Salzburg - 100%
- Edgar Brunner, Georg-August-Universität Göttingen - Germany
- Jan Beyersmann, Universität Ulm - Germany
- Mark Pauly, École polytechnique fédérale de Lausanne - Switzerland
Research Output
- 257 Citations
- 18 Publications
- 2 Datasets & models
- 1 Disseminations
- 2 Scientific Awards
- 2 Fundings
-
2021
Title Testing for equality of distributions using the concept of (niche) overlap DOI 10.1007/s00362-021-01239-y Type Journal Article Author Parkinson-Schwarz J Journal Statistical Papers Pages 225-242 Link Publication -
2022
Title Testing hypotheses about covariance matrices in general MANOVA designs DOI 10.1016/j.jspi.2021.12.001 Type Journal Article Author Sattler P Journal Journal of Statistical Planning and Inference Pages 134-146 Link Publication -
2019
Title Testing Hypotheses about Covariance Matrices in General MANOVA Designs DOI 10.48550/arxiv.1909.06205 Type Preprint Author Sattler P -
2019
Title Photon-number parity of heralded single photons from a Bragg-reflection waveguide reconstructed loss-tolerantly via moment generating function DOI 10.1088/1367-2630/ab42ae Type Journal Article Author Laiho K Journal New Journal of Physics Pages 103025 Link Publication -
2019
Title Sample sizes and statistical methods in interventional studies on individuals with spinal cord injury: A systematic review DOI 10.1111/jebm.12356 Type Journal Article Author Zimmermann G Journal Journal of Evidence-Based Medicine Pages 200-208 Link Publication -
2019
Title Sample size calculation and blinded recalculation for analysis of covariance models with multiple random covariates DOI 10.1080/10543406.2019.1632871 Type Journal Article Author Zimmermann G Journal Journal of Biopharmaceutical Statistics Pages 143-159 Link Publication -
2019
Title Combined multiple testing of multivariate survival times by censored empirical likelihood DOI 10.1111/sjos.12423 Type Journal Article Author Parkinson J Journal Scandinavian Journal of Statistics Pages 757-786 Link Publication -
2019
Title Small-sample performance and underlying assumptions of a bootstrap-based inference method for a general analysis of covariance model with possibly heteroskedastic and nonnormal errors DOI 10.1177/0962280218817796 Type Journal Article Author Zimmermann G Journal Statistical Methods in Medical Research Pages 3808-3821 Link Publication -
2018
Title HRM: An R Package for Analysing High-dimensional Multi-factor Repeated Measures DOI 10.32614/rj-2018-032 Type Journal Article Author Happ M Journal The R Journal Pages 534 Link Publication -
2018
Title Optimal sample size planning for the Wilcoxon-Mann-Whitney test DOI 10.1002/sim.7983 Type Journal Article Author Happ M Journal Statistics in Medicine Pages 363-375 Link Publication -
2020
Title Pseudo-Ranks: How to Calculate Them Efficiently in R DOI 10.18637/jss.v095.c01 Type Journal Article Author Happ M Journal Journal of Statistical Software Link Publication -
2020
Title Multivariate analysis of covariance with potentially singular covariance matrices and non-normal responses DOI 10.1016/j.jmva.2020.104594 Type Journal Article Author Zimmermann G Journal Journal of Multivariate Analysis Pages 104594 Link Publication -
2018
Title A Fast and Robust Way to Estimate Overlap of Niches, and Draw Inference DOI 10.1515/ijb-2017-0028 Type Journal Article Author Parkinson J Journal The International Journal of Biostatistics Pages 20170028 Link Publication -
2018
Title Testing Mean Differences among Groups: Multivariate and Repeated Measures Analysis with Minimal Assumptions DOI 10.1080/00273171.2018.1446320 Type Journal Article Author Bathke A Journal Multivariate Behavioral Research Pages 348-359 Link Publication -
2018
Title Optimal Sample Size Planning for the Wilcoxon-Mann-Whitney-Test DOI 10.48550/arxiv.1805.12249 Type Preprint Author Happ M -
2017
Title High-dimensional repeated measures DOI 10.1080/15598608.2017.1307792 Type Journal Article Author Happ M Journal Journal of Statistical Theory and Practice Pages 468-477 Link Publication -
2017
Title Combining SPECT and Quantitative EEG Analysis for the Automated Differential Diagnosis of Disorders with Amnestic Symptoms DOI 10.3389/fnagi.2017.00290 Type Journal Article Author Höller Y Journal Frontiers in Aging Neuroscience Pages 290 Link Publication -
2017
Title Reliability of EEG Measures of Interaction: A Paradigm Shift Is Needed to Fight the Reproducibility Crisis DOI 10.3389/fnhum.2017.00441 Type Journal Article Author Höller Y Journal Frontiers in Human Neuroscience Pages 441 Link Publication
-
2017
Title Taught at Summer School Strobl 2017 Type Participation in an activity, workshop or similar
-
2019
Title President of the Austrian-Swiss Region of the International Biometric Society Type Prestigious/honorary/advisory position to an external body Level of Recognition Continental/International -
2019
Title Appointed Editor-in-Chief of Biometrical Journal (jointly with Matthias Schmid from Bonn) Type Appointed as the editor/advisor to a journal or book series Level of Recognition Continental/International
-
2018
Title Marshall Plan Scholarship Type Fellowship Start of Funding 2018 Funder Austrian Marshall Plan Foundation -
2019
Title Ecology and Statistics Type Research grant (including intramural programme) Start of Funding 2019